This course is for novice programmers or business people who would like to understand the core tools used to wrangle and analyze big data. With no prior experience, you will have the opportunity to walk through hands-on examples with Hadoop and Spark frameworks, two of the most common in the industry. You will be comfortable explaining the specific components and basic processes of the Hadoop architecture, software stack, and execution environment. In the assignments you will be guided in how data scientists apply the important concepts and techniques such as Map-Reduce that are used to solve fundamental problems in big data. You'll feel empowered to have conversations about big data and the data analysis process.
Hadoop Platform and Application Framework
Offered by University of California San Diego
About this Course
Skills you will gain
- Python Programming
- Apache Hadoop
- MapReduce
- Apache Spark
UC San Diego is an academic powerhouse and economic engine, recognized as one of the top 10 public universities by U.S. News and World Report. Innovation is central to who we are and what we do. Here, students learn that knowledge isn't just acquired in the classroom—life is their laboratory.
Syllabus - What you will learn from this course
Hadoop Basics
Welcome to the first module of the Big Data Platform course. This module provides insight into the Big Data hype and the technologies, opportunities, and challenges behind it. We will take a deeper look at the Hadoop stack and the tools and technologies associated with Big Data solutions.
Introduction to the Hadoop Stack
In this module we will take a detailed look at the Hadoop stack, ranging from the basic HDFS components to application execution frameworks, languages, and services.
Introduction to Hadoop Distributed File System (HDFS)
In this module we will take a detailed look at the Hadoop Distributed File System (HDFS). We will cover the main design goals of HDFS, the process of reading from and writing to HDFS, the main configuration parameters that can be tuned to control HDFS performance and robustness, and the different ways you can access data on HDFS.
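As a rough illustration of how two such configuration parameters trade performance against robustness, the sketch below estimates how many blocks and how much raw disk space a file occupies on HDFS. It assumes the stock defaults of a 128 MB block size (`dfs.blocksize`) and a replication factor of 3 (`dfs.replication`); the function name `hdfs_footprint` is hypothetical and not part of any Hadoop API, and a real cluster may be configured differently.

```python
import math

# Assumed values: stock HDFS defaults in Hadoop 2 and later.
BLOCK_SIZE_MB = 128   # dfs.blocksize
REPLICATION = 3       # dfs.replication


def hdfs_footprint(file_size_mb, block_size_mb=BLOCK_SIZE_MB, replication=REPLICATION):
    """Estimate (blocks, block replicas, raw storage in MB) for one file.

    Hypothetical helper for illustration only. The last block of a file
    only occupies as much disk as it holds, so raw storage is simply the
    file size multiplied by the replication factor.
    """
    blocks = math.ceil(file_size_mb / block_size_mb)
    replicas = blocks * replication
    raw_storage_mb = file_size_mb * replication
    return blocks, replicas, raw_storage_mb


if __name__ == "__main__":
    blocks, replicas, raw_mb = hdfs_footprint(1000)
    print(f"{blocks} blocks, {replicas} block replicas, {raw_mb} MB of raw storage")
```

A larger block size means fewer blocks for the NameNode to track, while a higher replication factor improves fault tolerance at the cost of raw storage.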
Introduction to Map/Reduce
This module introduces Map/Reduce concepts and practice. You will learn the big idea behind Map/Reduce and how to design, implement, and execute tasks in the Map/Reduce framework. You will also learn the trade-offs of Map/Reduce and how they motivate other tools.
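The big idea can be sketched in a few lines of plain Python using the classic word-count task. This is a single-machine illustration, not the course's assignment code and not Hadoop's API: the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical, and on a real cluster the framework runs the map and reduce steps in parallel across many nodes and performs the grouping step itself.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all values by key, as the framework does between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Reduce: collapse the list of counts for one word into a total."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(word, counts) for word, counts in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "Big tools for big data"]))
```

Because each map call looks at only one line and each reduce call at only one key, both phases can be distributed freely. The price is that every job must be forced into this two-step shape, which is one of the trade-offs that motivates tools such as Spark.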
Reviews
- 5 stars: 45.44%
- 4 stars: 28.03%
- 3 stars: 12.36%
- 2 stars: 6.71%
- 1 star: 7.44%
Top reviews from HADOOP PLATFORM AND APPLICATION FRAMEWORK
A very nice course covering the basics of the Hadoop ecosystem and Apache Spark. The lectures are high quality and the presenters do a very good job of explaining the concepts. Thanks.
This course gives a nice introduction to Hadoop basics. Unfortunately, I faced many issues working with the Cloudera VM, and some commands in the tutorials are obsolete. Thank you very much for your efforts.
Learned about the Hadoop ecosystem, the limitations of the map-reduce approach, and Spark as a solution to overcome some of those limitations. Thanks for giving me the opportunity to participate in this MOOC.
Very good overview, but not extremely in-depth. It's a great course to get an understanding of the concepts and you can investigate deeper later into the topics that interest you.
Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I purchase the Certificate?
Is financial aid available?
More questions? Visit the Learner Help Center.