Cask Data Application Platform Administration

The Cask Data Application Platform Administration course covers the concepts and background required to run and manage CDAP and CDAP applications in a production environment. At the end of this course, participants will be able to install, configure, and operationalize CDAP.

Course Duration
8 Hours

Audience & Prerequisites
This course is designed for administrators who are looking to manage CDAP. Basic knowledge of scripting, HDFS, and YARN is required.

Materials Required
This course will not have any labs, so no other training materials are required.

Course Overview
● CDAP Concepts & Capabilities
● Architecture deep-dive
● Installing and configuring CDAP
● CDAP Application overview
● Operational aspects of a CDAP application

Course Outline

1. The Motivation to use CDAP
● Context
● The Hadoop Ecosystem Today
● Challenges with Hadoop
● The need for CDAP
● CDAP benefits for developers

2. CDAP Overview
● Functional view of CDAP
● CDAP architecture
● Building blocks

3. Understanding CDAP architecture: a deep dive
● CDAP system components
● Understanding data flow

4. CDAP Deployment architecture
● CDAP components and requirements
● Simplified deployment architecture
● High-availability deployment

5. Installing CDAP
● Preparing a cluster for CDAP
● Installation using rpm/deb packages
● Installation via Cloudera Manager

6. Configuring CDAP
● Understanding CDAP configuration
● The run-down on important config settings

7. Introduction to Coopr
● Provisioning a CDAP cluster using Coopr

8. Metrics and logging
● Metrics and logging programming APIs
● Accessing metrics and logs using REST

9. CDAP application building blocks
● Overview of CDAP application components

10. Introduction to Twill
● The need for Twill
● Using Twill in CDAP

11. Data ingestion to CDAP
● Data ingestion in batch
● Data ingestion in real-time
● Integration with Kafka and Flume for data ingestion

12. Data serving
● Using services in applications
● Writing handlers to serve data

13. Introduction to APIs
● CDAP REST APIs
● CLI

14. Lifecycle management
● Application lifecycle management
● Program lifecycle management

15. Health checks and monitoring
● CDAP system service health checks
● Monitoring CDAP components and applications using metrics

16. Transaction service operations
● Impact of invalid transactions
● Pruning invalid lists

17. Scaling programs
● Scaling instances in services
● Scaling Flowlet instances

18. Preferences
● Introduction to preferences
● Setting multi-level preferences
● Scoped runtime arguments

19. Namespaces
● Data and application isolation using Namespaces
● Guaranteeing application resources in Namespaces

20. Guaranteeing compute resources
● Introduction to capacity scheduler
● Managing CDAP application resources

21. Security
● Introduction to CDAP security
● Enabling perimeter security

22. Custom Authentication
● Plugging in custom authentication for CDAP

Cask Data, Inc., 150 Grant Ave, Palo Alto, CA, 94306
© Copyright 2024