Introducción a los conceptos básicos de Apache Spark (TTSK7502)

*ÂżBusca un horario flexible (fuera de horario o fines de semana)? Por favor llame o envĂ­enos un correo electrĂłnico: 858-208-4141 o sales@ccslearningacademy.com.

Opciones de financiación para estudiantes están disponibles.
¿Buscas formación en grupo? Contáctenos
CategorĂ­a:

Descargar PDF de detalles del curso

DescripciĂłn del curso:

Learn the essentials of using Spark for your big data workloads.

Apache Spark is an important component in the Hadoop Ecosystem as a cluster computing engine used for Big Data. Building on top of the Hadoop YARN and HDFS ecosystem, Spark offers faster in-memory processing for computing tasks when compared to Map/Reduce. It can be programmed in Java, Scala, Python, and R along with SQL-based front-ends.

This course introduces Scala, Python, or R developers to the world of Spark programming. It begins with an overview of the ecosystem and hands-on experience with the platform such as working with the Spark Shell, using RDDs, and DataFrames. You’ll later explore a wider-scoped introduction to NoSQL, Spark Streaming, Spark SQL, Spark MLLib, and how the pieces are put together in a larger application.

Formato

Dirigido por un instructor

Tema

Longitud

Esquema del curso

Overview of Spark

  • Hadoop Ecosystem
  • Hadoop YARN vs. Mesos
  • Spark vs. Map/Reduce
  • Spark: Lambda Architecture
  • Spark in the Enterprise Data Science Architecture

Spark Component Overview

  • Spark Shell
  • RDDs: Resilient Distributed Datasets
  • Data Frames
  • Spark 2 Unified DataFrames
  • Spark Sessions
  • Functional Programming
  • Spark SQL
  • MLib
  • Structured Streaming
  • Spark R
  • Spark and Python

RDDs: Resilient Distributed Datasets

  • Coding with RDDs
  • Transformaciones
  • Actions
  • Lazy Evaluation and Optimization
  • RDDs in Map/Reduce
  • Exercise: Working with RDDs

DataFrames

  • RDDs vs. DataFrames
  • Unified DataFrames (UDF) in Spark 2.x
  • Partitioning
  • Exercise: Working with Unified DataFrames

Advanced Spark Overview

  • NoSQL
  • Spark SQL
  • Spark Streaming
  • Spark ML Lib

PĂşblico objetivo

Data Scientists, Data Engineers, Software Engineers, Architects, and Developers.

Lo que aprenderás

Join an engaging hands-on learning environment, where you’ll learn:

  • The essentials of Spark architecture and applications
  • How to execute Spark Programs
  • How to create and manipulate both RDDs (Resilient Distributed Datasets) and UDFs (Unified Data Frames)
  • How Spark core components come together for complete applications

Requisitos previos

Before attending this course, you should have:

  • Experience programming in either Java, Python, R, or Scala (only one language needed)
  • Basic understanding of SQL

Inclusiones

Con CCS Learning Academy, recibirás:

  • Instructor-led training
  • Manual del estudiante del seminario de capacitaciĂłn
  • ColaboraciĂłn con compañeros de clase (actualmente no disponible para cursos a su propio ritmo)
  • Escenarios y actividades de aprendizaje del mundo real.
  • Soporte para la programaciĂłn de exámenes*
  • Disfrute de asistencia para la colocaciĂłn laboral durante los primeros 12 meses despuĂ©s de finalizar el curso.
  • Este curso es elegible para el programa Learn and Earn de CCS Learning Academy: obtenga un reembolso de la matrĂ­cula de hasta 50% si se le coloca en un trabajo a travĂ©s de CCS Global Tech. DivisiĂłn de colocaciĂłn*
  • Precios gubernamentales y privados disponibles.*

*Para más detalles llame al: 858-208-4141 o correo electrónico: formación@ccslearningacademy.com; ventas@ccslearningacademy.com

 

Carro de la compra
es_CRSpanish