Facebook

Hadoop Developer Foundations

* Looking for a flexible schedule (after hours or weekends)? Please call 858-208-4141 or email us:  sales@ccslearningacademy.com.

Student financing options are available.

Transitioning military and Veterans, please contact us to sign up for a free consultation on training and hiring options.

Looking for group training? Contact Us

psinghal
Last Update April 15, 2024
0 already enrolled

About This Course

Course Description

New – Learn about the Hadoop ecosystem and how to process large data streams.

Apache Hadoop is a framework for processing Big Data, and Spark is a new in-memory processing engine. This course will introduce you to the Hadoop ecosystem and Spark.

This course explores processing large data streams in the Hadoop ecosystem. Working in a hands-on learning environment, you’ll learn techniques and tools for ingesting, transforming, and exporting data to and from the Hadoop ecosystem for processing. You’ll also process data using Map/Reduce and other critical tools, including Hive and Pig. Towards the end of the course, we’ll review other useful tools such as Oozie and discuss security in the ecosystem.

Learning Objectives

Introduction to Hadoop
HDFS
YARN
Data Ingestion
HBase
Oozie
Working with Hive
Hive advanced
Hive in Cloudera/Hortonworks Distribution (or tools of choice)
Working with Spark
Spark Basics
Spark Shell
RDDs
Spark Dataframes and Datasets
Spark SQL
Spark API programming
Spark and Hadoop
Machine Learning (ML/MLlib)
GraphX
Spark Streaming

Inclusions

  • Instructor-led training
  • Training Seminar Student Handbook
  • Collaboration with classmates (not currently available for self-paced course)
  • Real-world learning activities and scenarios
  • Exam scheduling support*
  • Enjoy job placement assistance for the first 12 months after course completion.
  • This course is eligible for CCS Learning Academy’s Learn and Earn Program: get a tuition fee refund of up to 50% if you are placed in a job through CCS Global Tech’s Placement Division*
  • Government and Private pricing available.*

Pre-requisites

  • Familiar with a programming language
  • Comfortable in Linux environment (be able to navigate Linux command line, edit files using vi or nano)

Target Audience

  • Experienced Developers and Architects seeking to be proficient in Hadoop, Hive, and Spark within an enterprise data environment.

Curriculum

103 Lessons32h

1. Introduction to Hadoop

Hadoop history, concepts
Ecosystem
Distributions
High-level architecture
Hadoop myths
Hadoop challenges
Hardware and software

2. HDFS

3. YARN

4. HBase

5. Oozie

6. Working with Hive

7. Hive advanced

8. Hive in Cloudera or HortonWorks Distribution (or tools of choice)

9. Spark Basics

10. Spark Shell

11. RDDs

12. Spark SQL

13. Spark API programming (Scala and Python)

14. Spark and Hadoop

15. Machine Learning (ML/MLlib)

16. GraphX

17. Spark Streaming

Your Instructors

psinghal

0/5
471 Courses
0 Reviews
0 Students
See more

Write a review

$2,395.00

Level
Intermediate
Duration 32 hours
Lectures
103 lectures
Print Friendly, PDF & Email

Inclusions

  • Instructor-led training
  • Training Seminar Student Handbook
  • Collaboration with classmates (not currently available for self-paced course)
  • Real-world learning activities and scenarios
  • Exam scheduling support*
  • Enjoy job placement assistance for the first 12 months after course completion.
  • This course is eligible for CCS Learning Academy’s Learn and Earn Program: get a tuition fee refund of up to 50% if you are placed in a job through CCS Global Tech’s Placement Division*
  • Government and Private pricing available.*
#edumall-wp-widget-courses-1 { display: none; } #single-course-ratings { display: none; } .tutor-single-course-lead-meta { display: none; } .lead-meta-item meta-course-total-enrolled { display: none; }