Curriculum - what you’ll learn
The curriculum is split into 9 units, including your capstone projects and career services.
UNIT 1
The Python Data Science Stack
- Learn how to use Python and its standard libraries
- Build visualizations with Matplotlib and Seaborn
- Write clear, elegant, readable Python code that follows the PEP 8 standard
Estimated Time: 16+ Hours
UNIT 2
Data Wrangling
- Learn to use Pandas to wrangle and clean data
- Learn to work with different file types, from plain text to Excel, CSV and JSON
- Get an overview of relational and non-relational databases and gain SQL skills
- Collect data from the internet using Application Programming Interfaces (APIs)
Estimated Time: 44+ Hours
UNIT 3
Data Story Project
You’ll practice the concepts you’ve learned so far by creating a story out of a dataset. Your mentor will help you structure and evaluate your approach as you explore the dataset and present your findings.
Estimated Time: 10+ Hours
UNIT 4
Statistical Inference
- Learn the basics of inferential statistics and parameter estimation
- Use hypothesis testing to determine if a phenomenon is statistically significant
- Learn how correlation and regression can be used to identify useful features
- Learn how to design A/B split tests
- Practice exploratory data analysis
Estimated Time: 16+ Hours
UNIT 5
Machine Learning
- Use scikit-learn to implement supervised and unsupervised algorithms
- Explore advanced topics: time series analysis, social network analysis
- Apply ensemble learning with random forests and gradient boosting
- Validate and evaluate machine learning systems
Estimated Time: 50+ Hours
UNIT 6
Data Science at Scale
- Work with MapReduce, one of the most popular programming models for large-scale data processing
- Understand NoSQL databases and how they differ from SQL databases
- Learn Spark, a state-of-the-art distributed computing framework
- Learn Spark ML and MLlib to implement machine learning at scale on Spark
Estimated Time: 10 Hours
UNIT 7
Choose your Specialization
Halfway through the course, you will have the option to choose a specialization that aligns with your career goals. These are areas of focus that allow you to adapt your learning experience to your interests within Data Science.
The options are the Generalist Track (Advanced Machine Learning), the Natural Language Processing Track, and the Deep Learning Track.
UNIT 8
Capstone Project
In this program, you'll complete two Capstone Projects for your portfolio. They will be evaluated by your mentor, giving you experience with real business problems and datasets.
Your second capstone project will relate directly to the specialization track of your choice (if you have chosen one).
Estimated Time: 50+ Hours
UNIT 9
Career Services
- Get your resume and LinkedIn profile reviewed
- Work with your own personal career coach
- Get a personalized job search strategy
- Learn networking principles that will land you your next job
- Learn how to negotiate your salary
Estimated Time: 35+ Hours