Back to Exploratory Data Analysis

4.7 stars

5,605 ratings • 804 reviews

This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data....

CC

Jul 29, 2016

This is the second course I have taken from Roger Peng and both were outstanding. I have a strong math background but not much of a background in stats, yet this course was very approachable for me.

Y

Sep 24, 2017

Very good course! It provided me with the foundation for learning how to plot and interpret data. This will definitely strengthen my "R programming" to generate publication-type figures for my genomics data!


by JM

•Jul 11, 2018

Once it got to the clustering section the lessons were inscrutable. Extremely difficult to understand and not explained well.

by Luca R

•Jun 10, 2017

The videos merely repeated the content from swirl, with absolutely no added value.

by Dilyan D

•Feb 12, 2018

This is the worst of the Data Science courses so far (they've all been pretty good up to this point).

It's called Exploratory Data Analysis, but is actually all about the graphics systems in R. And it does a botched job on those as well.

All quizzes and assignments are about the graphics systems. The only portion of the course that deviates from that is Week 3 (for which there is no quiz or project) where we "learn" about clustering and dimension reduction. However, that material is presented really poorly: not enough depth for someone who is already familiar with the subject matter; and not nearly well enough explained for newbies.

On the graphics side, none of the systems is explored in great depth. The lattice system is essentially just mentioned in passing.

To cap it all off, the brief for the last assignment is really ambiguous, which often causes perfectly valid work to be graded poorly by peers. (Just look at the forums, if you need proof.)

by Daniel H

•May 13, 2019

Provides a solid overview of the base plotting system and a discussion (better elsewhere) of others. Introduces some higher level exploratory methods, without much information on either the theory or application (simply walks through the recipe). Assessments do not match the lecture material, so the credential is essentially meaningless. Read the associated book, watch the video lectures if you'd like. Don't bother with paying for the certificate.

by Roman

•Aug 30, 2018

Cons:

# Too much focus on hopelessly outdated R functions.

# Lectures are mostly PowerPoint karaoke along the lines of "You can do that thing. And you can also do that other thing. And also you do this third thing" without much real-world application.

# ggplot2 is the only modern viz package that gets mentioned

Pros:

# The swirl exercises are great (but very buggy on Mac)

by Beverly A

•Sep 20, 2016

When it comes down to it, there's simply not the support to assist a student who has a really hard problem. "Hacker mentality" seems to equate to "figure it out on your own cuz nobody's going to help you". If things do not work perfectly for you, then you are likely never to be able to finish, because your "peers" don't know any better either. The way this class is set up makes me angry every time I have to deal with it. I would probably be just as well served doing just the swirl() exercises. I would quit if I hadn't paid all the way through in advance. I can't believe Johns Hopkins is the type of school to produce a course of this quality, but I guess I have to.

by Faben W

•Feb 04, 2019

This lesson could have been significantly improved if there were at least one assignment on clustering/dimension reduction. Those are probably the hardest concepts taught thus far, so it would have been extremely useful to have at least one challenge to work through.

by Paul R

•Mar 12, 2019

This course covers plotting (base, lattice, ggplot), then takes a confusing tour into the heavy topics of clustering and dimension reduction, then flips back to coloring in charts. The order of the lectures is confusing, and PCA/SVD need more background and a clearer explanation and treatment (they get covered a bit more later under regression). Assignments are good and the swirl courses helped solidify the lectures.

by Rok B

•May 15, 2019

This course is basically plotting with R and clustering/dimensionality reduction. There is not enough emphasis on the latter, in my opinion. The final assignment focuses only on plotting, which is a shame.

by Pamela M

•Jun 05, 2016

Alas, after only 10 minutes of the first video, I am reminded that this instructor does not gear his lectures to the true Beginners among us. He speaks much more for an audience of grad students. I do want to complete this Specialization, so I will try again perhaps after learning more - about statistics and R and who knows what else. I fought my way through the first three courses, but now I'm going to work smarter by finding other ways to acquire this knowledge. Then return to him; maybe. This course should be labelled Intermediate and Statistics should be listed as a prerequisite. (I think; since I don't know what it is that I don't know, I am making a guess as to the missing piece of the puzzle.)

by Sergey K

•May 10, 2016

This course is mostly about how to use plotting libraries in R.

by NISHANT P

•Oct 05, 2017

Very insightful course!!!

The swirl packages and course projects in the "Exploratory Data Analysis" course have really helped me to understand the power of R in performing introductory graphical analyses towards initial inferences. It has good hands-on exercises that put into action various sophisticated graphs and plots for boardroom conversations on how to go deeper into the data analysis in order to find meaningful business insights or build powerful predictive models. As I advance through the specialization, I am coming to realize how powerful Statistical Learning through R is for quick business action and automation.

by Dale O J

•Oct 16, 2018

This has been a challenging course for me, for whatever reasons. I have devoted a great deal of time in reading Dr. Peng's books as well as reviewing work product of other students to get a better grasp of the logic and methodology. I have enjoyed this course more than any of the preceding courses. And, the struggle I believe will be worth the effort and facilitate my completion of the data science specialization program.

by Chandrakanth C

•Jun 18, 2018

Well organised course for Exploratory Data Analysis. After this course you will be thorough with the basics of Exploratory Data Analysis. I felt that the peer-graded assignment is one level higher than the concepts taught in the lectures. Overall, it is worth taking this course. Thanks a lot, Coursera.

by Rishabh J

•Aug 22, 2017

I found Prof. Roger Peng to be the best instructor in this specialization. This course just proves my point further. He teaches different concepts in a lucid manner. These concepts were presented in a way that could be applied to real world data sets right away. Awesome course!

by Farah N

•Aug 28, 2019

I enjoyed taking this course, especially the projects and swirl practice. If the clustering material were a bit more detailed, it would be more useful. Also, if we could do a project using the 3 different approaches, it would be interesting. Nevertheless, it was fantastic with the amazing professors.

by Diego A Q

•Jun 18, 2019

Great course; it teaches you a lot about how to create plots, charts and other tools using R code. This course is focused on "getting to know your data" by using all these tools during a research process. It is like the preliminary step you have to do before going into any analytics.

by Linwood C I

•Mar 07, 2016

I loved this course!! All of the classes taken in the specialization come together for practical use. Course 2 is where it really kicked in. Students will learn how to use R to explore data sets that send you down interesting paths.

by Clare S

•Mar 21, 2016

Really nice course. Good to put the graphics functions in R to use. I think it would be helpful to have a summary page somewhere that compares the format of how to generate simple plots using each of the 3 packages - just for reference.

by José S C S

•Jul 07, 2019

This course teaches how to use three different plotting systems in R. Given the dominance of the tidyverse/ggplot2 paradigm, I really appreciate the opportunity to learn the base plotting system and the lattice plot system.

by Johann R

•May 28, 2017

Graphs and plotting are at the heart of data analysis and data science. Without them you would have difficulty conveying ideas, and having graphs to explain numerical/statistical data is always handy. Visual representation of a data set, and using visual cues to gain an understanding of data, can save a lot of time and can help you gain additional insights into the data. This course teaches you key techniques for applying some graphing and plotting methods to visually explore data, and it does so really well and in great detail, and also provides some good demos.

by Anthony C C

•Sep 27, 2017

I was able to learn the material presented over the time of the course. It's a lot of material to cover in the time I could commit to it, but I feel confident using the tools and methods presented. The projects were very valuable, both for getting to practice the methods and tooling and for seeing how other students approached the solutions. It really helped put all the options into context and highlighted the value of using the different tools and where to use them. The only knock would be that sometimes the background noises in the videos were distracting.

by TARUN S

•Apr 29, 2017

I really appreciate the course design. Even if somebody doesn't have much background in R, she/he can comfortably learn from the videos and understand the concepts. The exercises and project assignments are challenging and actually help you practice and re-visit the lectures and explore further. Though I had already known and used Clustering, PCA and SVD in my work before, I really liked the way these concepts were explained here. I would strongly recommend this course to anybody who is keen to see R in action!

by Amanuel G

•Jan 06, 2017

It was a wonderful experience to read the structure of data before delving into the advanced statistical levels of data analysis. The need for inclusion or exclusion of dependent variables or dimension reduction in regression analysis can be intuitively understood and visualized using exploratory data analysis techniques, and then we have a clue as to what to do at the next level. It is like putting the whole character of the data under full control.

by Monisha

•Apr 23, 2020

I strongly recommend this course to anyone who needs a clearer understanding of data using visualizations. The course is well structured, and each lecture delivers new concepts in a concise format, with very detailed swirl lessons to understand the workings of each function. At the end of the course, I had a completely different view of handling the data and of how to extract maximum information from the data to gain meaningful insights.
