Acerca de este Curso
4.6
21 ratings
5 reviews
Note: You should complete all the other courses in this Specialization before beginning this course. This six-week long Project course of the Data Mining Specialization will allow you to apply the learned algorithms and techniques for data mining from the previous courses in the Specialization, including Pattern Discovery, Clustering, Text Retrieval, Text Mining, and Visualization, to solve interesting real-world data mining challenges. Specifically, you will work on a restaurant review data set from Yelp and use all the knowledge and skills you’ve learned from the previous courses to mine this data set to discover interesting and useful knowledge. The design of the Project emphasizes: 1) simulating the workflow of a data miner in a real job setting; 2) integrating different mining techniques covered in multiple individual courses; 3) experimenting with different ways to solve a problem to deepen your understanding of techniques; and 4) allowing you to propose and explore your own ideas creatively. The goal of the Project is to analyze and mine a large Yelp review data set to discover useful knowledge to help people make decisions in dining. The project will include the following outputs: 1. Opinion visualization: explore and visualize the review content to understand what people have said in those reviews. 2. Cuisine map construction: mine the data set to understand the landscape of different types of cuisines and their similarities. 3. Discovery of popular dishes for a cuisine: mine the data set to discover the common/popular dishes of a particular cuisine. 4. Recommendation of restaurants to help people decide where to dine: mine the data set to rank restaurants for a specific dish and predict the hygiene condition of a restaurant. From the perspective of users, a cuisine map can help them understand what cuisines are there and see the big picture of all kinds of cuisines and their relations. Once they decide what cuisine to try, they would be interested in knowing what the popular dishes of that cuisine are and decide what dishes to have. Finally, they will need to choose a restaurant. Thus, recommending restaurants based on a particular dish would be useful. Moreover, predicting the hygiene condition of a restaurant would also be helpful. By working on these tasks, you will gain experience with a typical workflow in data mining that includes data preprocessing, data exploration, data analysis, improvement of analysis methods, and presentation of results. You will have an opportunity to combine multiple algorithms from different courses to complete a relatively complicated mining task and experiment with different ways to solve a problem to understand the best way to solve it. We will suggest specific approaches, but you are highly encouraged to explore your own ideas since open exploration is, by design, a goal of the Project. You are required to submit a brief report for each of the tasks for peer grading. A final consolidated report is also required, which will be peer-graded....
Globe

Cursos 100 % en línea

Comienza de inmediato y aprende a tu propio ritmo.
Calendar

Fechas límite flexibles

Restablece las fechas límite en función de tus horarios.
Clock

Sugerido: 3 hours/week

Aprox. 17 horas para completar
Comment Dots

English

Subtítulos: English

Habilidades que obtendrás

Data Clustering AlgorithmsData AnalysisNatural Language ProcessingData Mining
Globe

Cursos 100 % en línea

Comienza de inmediato y aprende a tu propio ritmo.
Calendar

Fechas límite flexibles

Restablece las fechas límite en función de tus horarios.
Clock

Sugerido: 3 hours/week

Aprox. 17 horas para completar
Comment Dots

English

Subtítulos: English

Programa - Qué aprenderás en este curso

1

Sección
Clock
2 horas para completar

Orientation

In this module, you will become familiar with the course, your instructor, your classmates, and our learning environment....
Reading
1 video (Total: 13 min), 6 readings
Reading6 lecturas
Orientation Overview10m
Syllabus10m
About the Discussion Forums10m
Updating Your Profile10m
MeTA Installation and Overview10m
Data Set and Toolkit Acquisition10m
Clock
2 horas para completar

Task 1 - Exploration of a Data Set

...
Reading
2 readings, 1 quiz
Reading2 lecturas
Task 1 Overview10m
Task 1 Rubric10m

2

Sección
Clock
2 horas para completar

Task 2 - Cuisine Clustering and Map Construction

...
Reading
2 readings, 1 quiz
Reading2 lecturas
Task 2 Overview10m
Task 2 Rubric10m

3

Sección
Clock
2 horas para completar

Task 3 - Dish Recognition

...
Reading
2 readings, 1 quiz
Reading2 lecturas
Task 3 Overview10m
Task 3 Rubric10m

4

Sección
Clock
2 horas para completar

Task 4 & 5 - Popular Dishes and Restaurant Recommendation

...
Reading
2 readings, 1 quiz
Reading2 lecturas
Task 4 and 5 Overview10m
Task 4 and 5 Rubric10m
4.6

Principales revisiones

por IANov 16th 2017

The project help me to practice the whole specialization algorithms and techniques.

Instructores

Jiawei Han

Abel Bliss Professor
Department of Computer Science

ChengXiang Zhai

Professor
Department of Computer Science

John C. Hart

Professor of Computer Science
Department of Computer Science

Acerca de University of Illinois at Urbana-Champaign

The University of Illinois at Urbana-Champaign is a world leader in research, teaching and public engagement, distinguished by the breadth of its programs, broad academic excellence, and internationally renowned faculty and alumni. Illinois serves the world by creating knowledge, preparing students for lives of impact, and finding solutions to critical societal needs. ...

Acerca del programa especializado Data Mining

The Data Mining Specialization teaches data mining techniques for both structured data which conform to a clearly defined schema, and unstructured data which exist in the form of natural language text. Specific course topics include pattern discovery, clustering, text retrieval, text mining and analytics, and data visualization. The Capstone project task is to solve real-world data mining challenges using a restaurant review data set from Yelp. Courses 2 - 5 of this Specialization form the lecture component of courses in the online Master of Computer Science Degree in Data Science. You can apply to the degree program either before or after you begin the Specialization....
Data Mining

Preguntas Frecuentes

  • Once you enroll for a Certificate, you’ll have access to all videos, quizzes, and programming assignments (if applicable). Peer review assignments can only be submitted and reviewed once your session has begun. If you choose to explore the course without purchasing, you may not be able to access certain assignments.

  • When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

¿Tienes más preguntas? Visita el Centro de Ayuda al Alumno.