Hadoop Developer Foundations
About This Course
Course Description
Learn about the Hadoop ecosystem and how to process large data streams.
Apache Hadoop is a framework for storing and processing big data across clusters, and Apache Spark is an in-memory processing engine. This course introduces you to both the Hadoop ecosystem and Spark.
This course explores processing large data streams in the Hadoop ecosystem. Working in a hands-on learning environment, you’ll learn techniques and tools for ingesting data into Hadoop, transforming it, and exporting it back out. You’ll also process data using MapReduce and other key tools, including Hive and Pig. Towards the end of the course, we’ll review other useful tools such as Oozie and discuss security in the ecosystem.
Learning Objectives
Introduction to Hadoop
HDFS
YARN
Data Ingestion
HBase
Oozie
Working with Hive
Advanced Hive
Hive in Cloudera/Hortonworks Distribution (or tools of choice)
Working with Spark
Spark Basics
Spark Shell
RDDs
Spark Dataframes and Datasets
Spark SQL
Spark API programming
Spark and Hadoop
Machine Learning (ML/MLlib)
GraphX
Spark Streaming
Inclusions
- Instructor-led training
- Training Seminar Student Handbook
- Collaboration with classmates (not currently available for self-paced course)
- Real-world learning activities and scenarios
- Exam scheduling support*
- Job placement assistance for the first 12 months after course completion
- This course is eligible for CCS Learning Academy’s Learn and Earn Program: get a tuition fee refund of up to 50% if you are placed in a job through CCS Global Tech’s Placement Division*
- Government and private pricing available*
Pre-requisites
- Familiarity with at least one programming language
- Comfort working in a Linux environment (able to navigate the Linux command line and edit files using vi or nano)
Target Audience
- Experienced Developers and Architects seeking to be proficient in Hadoop, Hive, and Spark within an enterprise data environment.
Curriculum
103 Lessons, 32 hours