DescripciĂłn del curso:
Learn the essentials of using Spark for your big data workloads.
Apache Spark is an important component in the Hadoop Ecosystem as a cluster computing engine used for Big Data. Building on top of the Hadoop YARN and HDFS ecosystem, Spark offers faster in-memory processing for computing tasks when compared to Map/Reduce. It can be programmed in Java, Scala, Python, and R along with SQL-based front-ends.
This course introduces Scala, Python, or R developers to the world of Spark programming. It begins with an overview of the ecosystem and hands-on experience with the platform such as working with the Spark Shell, using RDDs, and DataFrames. You’ll later explore a wider-scoped introduction to NoSQL, Spark Streaming, Spark SQL, Spark MLLib, and how the pieces are put together in a larger application.
Esquema del curso
Overview of Spark
- Hadoop Ecosystem
- Hadoop YARN vs. Mesos
- Spark vs. Map/Reduce
- Spark: Lambda Architecture
- Spark in the Enterprise Data Science Architecture
Spark Component Overview
- Spark Shell
- RDDs: Resilient Distributed Datasets
- Data Frames
- Spark 2 Unified DataFrames
- Spark Sessions
- Functional Programming
- Spark SQL
- MLib
- Structured Streaming
- Spark R
- Spark and Python
RDDs: Resilient Distributed Datasets
- Coding with RDDs
- Transformaciones
- Actions
- Lazy Evaluation and Optimization
- RDDs in Map/Reduce
- Exercise: Working with RDDs
DataFrames
- RDDs vs. DataFrames
- Unified DataFrames (UDF) in Spark 2.x
- Partitioning
- Exercise: Working with Unified DataFrames
Advanced Spark Overview
- NoSQL
- Spark SQL
- Spark Streaming
- Spark ML Lib
PĂşblico objetivo
Data Scientists, Data Engineers, Software Engineers, Architects, and Developers.
Lo que aprenderás
Join an engaging hands-on learning environment, where you’ll learn:
- The essentials of Spark architecture and applications
- How to execute Spark Programs
- How to create and manipulate both RDDs (Resilient Distributed Datasets) and UDFs (Unified Data Frames)
- How Spark core components come together for complete applications
Requisitos previos
Before attending this course, you should have:
- Experience programming in either Java, Python, R, or Scala (only one language needed)
- Basic understanding of SQL
Inclusiones
Con CCS Learning Academy, recibirás:
- Instructor-led training
- Manual del estudiante del seminario de capacitaciĂłn
- Colaboración con compañeros de clase (actualmente no disponible para cursos a su propio ritmo)
- Escenarios y actividades de aprendizaje del mundo real.
- Soporte para la programación de exámenes*
- Disfrute de asistencia para la colocación laboral durante los primeros 12 meses después de finalizar el curso.
- Este curso es elegible para el programa Learn and Earn de CCS Learning Academy: obtenga un reembolso de la matrĂcula de hasta 50% si se le coloca en un trabajo a travĂ©s de CCS Global Tech. DivisiĂłn de colocaciĂłn*
- Precios gubernamentales y privados disponibles.*
*Para más detalles llame al: 858-208-4141 o correo electrónico: formación@ccslearningacademy.com; ventas@ccslearningacademy.com