ofrecido por
Qué aprenderás
Use SQL commands to filter, sort, & summarize data; manipulate strings, dates, & numerical data from different sources for analysis
Assess and create datasets to solve your business questions and problems using SQL
Use the collaborative Databricks workspace and create an end-to-end pipeline that reads data, transforms it, and saves the result
Develop a project proposal & select your data, perform statistical analysis & develop metrics, and present your findings & make recommendations
Habilidades que obtendrás
Acerca de este Programa Especializado
No se requiere experiencia previa.
No se requiere experiencia previa.
Hay 4 cursos en este Programa Especializado
SQL for Data Science
As data collection has increased exponentially, so has the need for people skilled at using and interacting with data; to be able to think critically, and provide insights to make better decisions and optimize their businesses. This is a data scientist, “part mathematician, part computer scientist, and part trend spotter” (SAS Institute, Inc.). According to Glassdoor, being a data scientist is the best job in America; with a median base salary of $110,000 and thousands of job openings at a time. The skills necessary to be a good data scientist include being able to retrieve and work with data, and to do that you need to be well versed in SQL, the standard language for communicating with database systems.
Data Wrangling, Analysis and AB Testing with SQL
This course allows you to apply the SQL skills taught in “SQL for Data Science” to four increasingly complex and authentic data science inquiry case studies. We'll learn how to convert timestamps of all types to common formats and perform date/time calculations. We'll select and perform the optimal JOIN for a data science inquiry and clean data within an analysis dataset by deduping, running quality checks, backfilling, and handling nulls. We'll learn how to segment and analyze data per segment using windowing functions and use case statements to execute conditional logic to address a data science inquiry. We'll also describe how to convert a query into a scheduled job and how to insert data into a date partition. Finally, given a predictive analysis need, we'll engineer a feature from raw data using the tools and skills we've built over the course. The real-world application of these skills will give you the framework for performing the analysis of an AB test.
Distributed Computing with Spark SQL
This course is for students with SQL experience and now want to take the next step in gaining familiarity with distributed computing using Spark. Students will gain an understanding of when to use Spark and how Spark as an engine uniquely combines Data and AI technologies at scale. The four modules build on one another and by the end of the course the student will understand: Spark architecture, Spark DataFrame, optimizing reading/writing data, and how to build a machine learning model. The first module will introduce Spark, including how Spark works with distributed computing and what are Spark Dataframes. Module 2 covers the core concepts of Spark such as storage vs. computing, caching, partitions and Spark UI. The third module looks at Engineering Data Pipelines covering connecting to databases, schemas and type, file formats and writing good data. The final module looks at the application of Spark with Machine Learning through the business use case, a short introduction to what machine learning is, building and applying models and a final course conclusion. By understanding when to use Spark, either scaling out when the model or data is too large to process on a single machine, or having a need to simply speed up to get faster results, students will hone their SQL skills and become a more adept Data Scientist.
SQL for Data Science Capstone Project
Data science is a dynamic and growing career field that demands knowledge and skills-based in SQL to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems.
ofrecido por

Universidad de California, Davis
UC Davis, one of the nation’s top-ranked research universities, is a global leader in agriculture, veterinary medicine, sustainability, environmental and biological sciences, and technology. With four colleges and six professional schools, UC Davis and its students and alumni are known for their academic excellence, meaningful public service and profound international impact.
Preguntas Frecuentes
¿Cuál es la política de reembolsos?
¿Puedo inscribirme en un solo curso?
¿Hay ayuda económica disponible?
¿Puedo tomar este curso de manera gratuita?
¿Este curso es 100 % en línea? ¿Necesito asistir a alguna clase en persona?
¿Recibiré crédito universitario por completar el programa especializado?
¿Recibiré crédito universitario por completar el programa especializado?
¿Cuánto tiempo se necesita para completar un programa especializado?
Do I need to take the courses in a specific order?
¿Tienes más preguntas? Visita el Centro de Ayuda al Alumno.