Data Science with Python

A comprehensive BSc course covering Python fundamentals, data manipulation, statistics, machine learning, deep learning, and deployment. Finance-focused examples throughout.

Start Here

48 lessons 904 charts 1517 slides 960 quiz questions

Curriculum

Module 1: Python Fundamentals

6 lessons

Module 2: Data Manipulation

6 lessons

Module 3: Statistics & Visualization

8 lessons
L13Descriptive Statistics
L15Hypothesis Testing
L17Matplotlib Basics
L19Multi Panel Figures

Module 4: ML: Regression

4 lessons

Module 5: ML: Classification

4 lessons

Module 6: ML: Unsupervised

4 lessons

Module 7: Deep Learning

4 lessons
L35Backpropagation
L36Overfitting Prevention

Module 8: NLP & Text

4 lessons

Module 9: Deployment

4 lessons
L41Model Serialization
L43Streamlit Dashboards

Module 10: Capstone & Ethics

4 lessons

Resources

Quick Downloads

Module Overviews

Module 1: Python Fundamentals
Module 2: Data Manipulation
Module 3: Statistics & Visualization
Module 4: ML: Regression
Module 5: ML: Classification
Module 6: Unsupervised Learning
Module 7: Deep Learning
Module 8: NLP & Text
Module 9: Deployment
Module 10: Capstone & Ethics

Colab Notebooks

Module Summaries

Python Fundamentals (L01-L06)
Data Manipulation (L07-L12)
Statistics & Visualization (L13-L20)
ML: Regression (L21-L24)
ML: Classification (L25-L28)
ML: Unsupervised (L29-L32)
Deep Learning (L33-L36)
NLP & Text (L37-L40)
Deployment (L41-L44)
Capstone & Ethics (L45-L48)

Quizzes

Test your knowledge with practice quizzes for each lesson.

EXAM
Exam Practice - Conceptual Questions
Questions with detailed solutions from all 48 lessons
Start Practice

Reading Companions

Pre-class preparation and post-class review materials with vocabulary, examples, and self-tests.

Module 0 Overview Guides View All →
Module 4 Machine Learning: Regression View All →
Module 5 Machine Learning: Classification View All →
Module 6 Machine Learning: Unsupervised View All →

Charts Gallery

Visualizations illustrating key concepts across all modules.

Explore visualizations across all 10 modules covering Python, Statistics, ML, Deep Learning, NLP, and Deployment.

Teaching Materials

Handouts, worksheets, and exercises for deeper study and class activities.

Regression Handout
Classification Handout
Supervised Learning Handout
Unsupervised Learning Handout
UL Pre-Class Worksheet
UL In-Class Exercises
Deep Learning Handout
SL Pre-Class Worksheet
SL In-Class Exercises
DL Pre-Class Worksheet
DL In-Class Exercises
Day 7 Deployment Handout
6-Day Intensive: Day 5 (Unsupervised + Reinforcement)
6-Day Intensive: Day 6 (Embeddings + Frontier)

Additional Lectures

Supplementary lecture materials covering advanced topics beyond the core curriculum.

Foundations and AI in Finance
Introduction to AI
AI in Finance: From Prediction to Production
Supervised Learning: Concepts and Overviews
PILOTSL Mini-Lecture: The Generalization Paradox
Supervised Learning: The Big Idea
Supervised Learning: Complete Overview
Supervised Learning: From Linear Regression to Gradient Boosting
SL: One Dataset, Eight Methods
Supervised Learning: Methods and Algorithms
Comprehensive Regression
Comprehensive Classification
Support Vector Machines
SVM: A Simple Lecture with All Formulas
Naive Bayes: A Simple Lecture with All Formulas
Unsupervised Learning
PILOTUL Mini-Lecture: The Validation Paradox
Unsupervised Learning: The Big Idea
Unsupervised Learning: Complete Overview
Unsupervised Learning: From K-Means to Gaussian Mixtures
Deep Learning
Deep Learning: The Big Idea
Deep Learning: Complete Overview
Comprehensive Deep Learning
Day 6: Neural Networks (Full Lecture)
Frontier and Deployment
NEWML & Innovation 6-Day Intensive: The Frontier (Transformers, LLMs, RAG, Agents)
NEWDeep Reinforcement Learning in Finance: Agents, Q-Learning, Policy Gradients, and Limits
LLM Overview
Day 6: Review

Advanced Topics

Standalone deep-dive decks on advanced machine-learning topics, beyond the core curriculum.

Theory and Generalization
A01: Double Descent
A11: Lottery Ticket Hypothesis
A14: Information Bottleneck
Architectures and Models
A02: Attention Mechanisms
A07: Diffusion Models
A10: Graph Neural Networks
A15: KAN vs MLP
Trust, Safety and Inference
A03: Causal Inference
A04: SHAP Values
A05: Federated Learning
A06: Adversarial Attacks
A09: Conformal Prediction
Training and Optimization
A08: RLHF
A12: Bayesian Optimization
A13: Gradient Pathologies