Cask Data Application Platform Administration

Cask Data Application Platform Administration
The Cask Data Application Platform Administration course will cover concepts and
background that is required to run and manage CDAP and CDAP applications in a
production environment. At the end of this course participants will be able to install,
configure and operationalize CDAP.
Course Duration
8 Hours
Audience & Prerequisites
This course is designed for administrators who are looking to manage CDAP. Basic
knowledge of scripting, HDFS and YARN is required.
Materials Required
This course will not have any labs, so no other training materials are required
Course Overview
CDAP Concepts & Capabilities
Architecture deep-dive
Installing and configuring CDAP
CDAP Application overview
Operational aspects of a CDAP application
Course Outline
1. The Motivation to use CDAP
Context
The Hadoop Ecosystem Today
Challenges with Hadoop
The need for CDAP
CDAP benefits for developers
●
●
●
●
8. Metrics and logging
Metrics and logging
programming APIs
Accessing metrics and logs
using REST
●
●
●
2. CDAP Overview
Functional view of CDAP
CDAP architecture
Building blocks
●
●
●
3. Understanding CDAP architecture a deep dive
CDAP system components
Understanding data flow
●
●
4. CDAP Deployment architecture
CDAP components and
requirements
Simplified deployment
architecture
High-availability deployment
●
●
●
5. Installing CDAP
Preparing a cluster for CDA
P
Installation using rpm/deb
packages
Installation via Cloudera
manager
●
●
●
6. Configuring CDAP
Understanding CDAP
configuration
The run-down on importan
t
config settings
●
●
7. Introduction to Coopr
Provisioning a CDAP cluster
using Coopr
●
9. CDAP application building blocks
Overview of CDAP application
components
●
16. Transaction service operations
Impact of invalid transactions
Pruning invalid lists
●
●
17. Scaling programs
Scaling instances in services
Scaling Flowlet instances
●
●
10. Introduction to Twill
The need for Twill
Using Twill in CDAP
18. Preferences
Introduction to preferences
Setting multi-level
preferences
Scoped runtime arguments
11. Data ingestion to CDAP
Data ingestion in batch
Data ingestion in real-time
Integration with Kafka and
Flume for data ingestion
19. Namespaces
Data and application isolation
using Namespaces
Guaranteeing application
resources in Namespaces
12. Data serving
Using services in applications
Writing handlers to serve data
20. Guaranteeing compute
resources
Introduction to capacity
scheduler
Managing CDAP application
resources
●
●
●
●
●
●
●
13. Introduction to APIs
CDAP REST APIs
CLI
●
●
●
●
●
●
●
●
●
14. Lifecycle management
Application lifecycle
management
Program lifecycle
management
●
21. Security
Introduction to CDAP
security
Enabling perimeter security
●
●
●
15. Health checks and monitoring
CDAP system service health
checks
Monitoring CDAP components
and applications using metrics
●
●
Cask Data, Inc.,
150 Grant Ave, Palo Alto, CA, 94306
22. Custom Authentication
Plugging in custom
authentication for CDAP
●