Acerca de este Curso
10,864 vistas recientes

100 % en línea

Comienza de inmediato y aprende a tu propio ritmo.

Fechas límite flexibles

Restablece las fechas límite en función de tus horarios.

Nivel avanzado

Aprox. 23 horas para completar

Sugerido: 4 weeks of study, 4-8 hours/week...

Inglés (English)

Subtítulos: Inglés (English)

Habilidades que obtendrás

KafkaNoSQLSpark StreamingSpark

100 % en línea

Comienza de inmediato y aprende a tu propio ritmo.

Fechas límite flexibles

Restablece las fechas límite en función de tus horarios.

Nivel avanzado

Aprox. 23 horas para completar

Sugerido: 4 weeks of study, 4-8 hours/week...

Inglés (English)

Subtítulos: Inglés (English)

Los estudiantes que toman este Course son

  • Data Engineers
  • Data Scientists
  • Machine Learning Engineers
  • Data Analysts
  • Technical Solutions Engineers

Programa - Qué aprenderás en este curso

Semana
1
16 minutos para completar

Welcome to the course "Big Data Applications: Real-Time Streaming"

4 videos (Total 6 minutos), 1 lectura
4 videos
Real-Time Streaming Course Structure2m
Meet Artyom56s
Meet Ivan1m
1 lectura
Slack Channel is the quickest way to get answers to your questions10m
7 horas para completar

Basics of real-time data processing

12 videos (Total 57 minutos), 4 lecturas, 5 cuestionarios
12 videos
Real-time processing of Big Data4m
Delivery semantics6m
Approaches to real-time data processing5m
Lambda and Kappa Architectures6m
Introduction1m
Data storages for real-time big data processing6m
Introduction to Kafka6m
Kafka pros and cons4m
Kafka CLI7m
How to submit your first assignment3m
How to Install Docker on Windows 7, 8, 104m
4 lecturas
Assignments. General requirements10m
Docker Installation Guide10m
FAQ How to show your code to teaching staff10m
Grading System: Instructions and Common Problems10m
3 ejercicios de práctica
Introduction to the world of big data real-time processing. Delivery semantics12m
Data storages for real-time big data processing. Kafka16m
Basics of real-time data processing18m
Semana
2
5 horas para completar

Spark Streaming

6 videos (Total 34 minutos), 1 lectura, 5 cuestionarios
6 videos
Spark Streaming concept6m
How to write Spark Streaming application6m
Spark Streaming hints6m
Spark Streaming and delivery semantics8m
Exactly once extended4m
1 lectura
Spark streaming tasks. General guidelines10m
2 ejercicios de práctica
Spark Streaming12m
Spark Streaming: final18m
Semana
3
2 horas para completar

NoSQL. Cassandra

13 videos (Total 69 minutos), 3 cuestionarios
13 videos
CAP Theorem5m
Cassandra Architecture4m
Read-Write Path6m
Cassandra Data Model5m
Virtual Nodes2m
Gossip Protocol5m
Building Applications with Cassandra4m
Getting Started with CQL7m
Accessing Cassandra from Python7m
Static Columns4m
Overcoming Cassandra's Limitation3m
Secondary Indexes4m
3 ejercicios de práctica
Basics of NoSQL18m
Apache Cassandra16m
NoSQL. Cassandra14m
Semana
4
3 horas para completar

NoSQL. Redis

7 videos (Total 50 minutos), 2 cuestionarios
7 videos
Command line interface8m
Python interface7m
Strings, Lists, Hashes5m
Sets, Sorted Sets, HyperLogLogs7m
Transactions7m
Advanced features7m
1 ejercicio de práctica
Redis12m

Instructores

Avatar

Alexey A. Dral

Founder and Chief Executive Officer
BigData Team
Avatar

Artyom Vybornov

Team Lead at Rambler&Co
Avatar

Vladislav Goncharenko

DCAM MIPT, Skoltech
Avatar

Ivan Mushketyk

Software Engineer, ConsenSys

Acerca de Yandex

Yandex is a technology company that builds intelligent products and services powered by machine learning. Our goal is to help consumers and businesses better navigate the online and offline world....

Acerca de Programa especializado Big Data for Data Engineers

This specialization is made for people working with data (either small or big). If you are a Data Analyst, Data Scientist, Data Engineer or Data Architect (or you want to become one) — don’t miss the opportunity to expand your knowledge and skills in the field of data engineering and data analysis on the large scale. In four concise courses you will learn the basics of Hadoop, MapReduce, Spark, methods of offline data processing for warehousing, real-time data processing and large-scale machine learning. And Capstone project for you to build and deploy your own Big Data Service (make your portfolio even more competitive). Over the course of the specialization, you will complete progressively harder programming assignments (mostly in Python). Make sure, you have some experience in it. This course will master your skills in designing solutions for common Big Data tasks: - creating batch and real-time data processing pipelines, - doing machine learning at scale, - deploying machine learning models into a production environment — and much more! Join some of best hands-on big data professionals, who know, their job inside-out, to learn the basics, as well as some tricks of the trade, from them. Special thanks to Prof. Mikhail Roytberg (APT dept., MIPT), Oleg Sukhoroslov (PhD, Senior Researcher, IITP RAS), Oleg Ivchenko (APT dept., MIPT), Pavel Akhtyamov (APT dept., MIPT), Vladimir Kuznetsov, Asya Roitberg, Eugene Baulin, Marina Sudarikova....
Big Data for Data Engineers

Preguntas Frecuentes

  • Una vez que te inscribes para obtener un Certificado, tendrás acceso a todos los videos, cuestionarios y tareas de programación (si corresponde). Las tareas calificadas por compañeros solo pueden enviarse y revisarse una vez que haya comenzado tu sesión. Si eliges explorar el curso sin comprarlo, es posible que no puedas acceder a determinadas tareas.

  • Cuando te inscribes en un curso, obtienes acceso a todos los cursos que forman parte del Programa especializado y te darán un Certificado cuando completes el trabajo. Se añadirá tu Certificado electrónico a la página Logros. Desde allí, puedes imprimir tu Certificado o añadirlo a tu perfil de LinkedIn. Si solo quieres leer y visualizar el contenido del curso, puedes auditar el curso sin costo.

¿Tienes más preguntas? Visita el Centro de Ayuda al Alumno.