PSTAT5A
  • Home
  • Class Schedule
  • Syllabus
  • Resources
  • Contact

On this page

  • Week 1: Foundations of Data Science
  • Getting Started with Data
  • 📚 Core Materials
  • 💻 Interactive Tools & Practice
  • 🎯 Learning Objectives & Study Plan
  • 🔍 Additional Resources
  • Week 2: Introduction to Probability
  • Understanding Uncertainty Through Statistics
  • 📚 Core Materials
  • 💻 Interactive Tools & Practice
  • 🎯 Learning Objectives & Study Plan
  • 🔍 Additional Resources
  • Week 3: Conditional Probability, Counting & Discrete Random Variables
  • Advanced Probability & Discrete Distributions
  • 📚 Core Materials
  • 💻 Interactive Tools & Practice
  • 🎯 Learning Objectives & Study Plan
  • 🔍 Additional Resources
  • Week 4: Statistical Methods & Testing

Course Resources

Your comprehensive guide to learning materials and references

🔍

Week 1: Foundations of Data Science

Getting Started with Data

This week introduces fundamental concepts in data science, including data types, basic statistics, and essential Python tools for data manipulation and analysis.

📚 Core Materials

Required Reading

OpenIntro Statistics, Chapters 1 & 2

Essential foundations covering data types, variables, and descriptive statistics. Provides theoretical foundation for understanding how data is structured and analyzed.

📖 PDF • ⏱️ 2-3 hours • 🎯 Beginner

Python for Statistics

Think Stats: Exploratory Data Analysis

Perfect introduction to Python for statistics students. Covers probability, descriptive statistics, and statistical inference using Python with real datasets.

🎓 UCSB Access: Library Database → Search “O’Reilly” → Login with NetID → Search “Think Stats”

💻 Online via UCSB Library • ⏱️ 4-6 hours • 🐍 Python + Statistics

Supplementary Reading

Python for Data Analysis, Ch. 5

Deep dive into pandas operations including describe(), groupby(), and essential aggregation functions for real-world datasets.

🎓 UCSB Access: Library Database → Search “O’Reilly” → Login with NetID → Search “Python for Data Analysis”

💻 Online via UCSB Library • ⏱️ 1-2 hours • 🎯 Intermediate

Library Access

O’Reilly Learning Platform

Access thousands of technology and programming books including Python, statistics, and data science titles. Essential for supplementary reading and advanced topics.

🏛️ UCSB Library • 📖 50,000+ titles • 🔐 NetID Required

💻 Interactive Tools & Practice

Hands-on Workshop

UCSB Data Lab: Data Types & Format

Comprehensive hands-on workshop covering Python data types, pandas DataFrame structures, and input/output operations with downloadable datasets.

🛠️ Interactive • ⏱️ 2-3 hours • 📁 Sample datasets

API Documentation

pandas.DataFrame.describe()

Official documentation for descriptive statistics in pandas. Essential reference for understanding central tendency, dispersion, and shape analysis.

📚 API Docs • 🔗 Quick Reference • 🎯 All levels

Statistical Functions

NumPy Statistical Functions

Complete reference for NumPy’s statistical toolkit including mean(), median(), std(), percentile(), and advanced statistical measures.

📚 API Docs • 🔧 25+ methods • 🎯 Beginner-Advanced

Advanced Reference

SciPy Stats Module

Comprehensive statistical analysis toolkit covering probability distributions, hypothesis testing, and advanced descriptive statistics for research-grade analysis.

📚 API Docs • 🎯 Advanced • 🔬 Research-grade

🎯 Learning Objectives & Study Plan

Mastery Checklist

Week 1 Learning Goals

After completing this week, you should be able to:

✅ Self-Assessment • 🎯 Beginner • 🐍 Python Focus

Study Schedule

Recommended Learning Path

Days 1-2: Core Reading (OpenIntro Chapters 1-2)

Day 3: Python for Statistics (Think Stats)

Day 4: Supplementary Reading (Python for Data Analysis)

Day 5: Interactive tutorials and hands-on practice

Day 6: Get familiar with python documentation

Day 7: Review and concept integration

📅 7-day plan • ⏱️ 2-3 hours daily • 🎯 Beginner-friendly

🔍 Additional Resources

Extended Learning

Supplementary Materials by Topic

Python Basics: Python.org Tutorial • Codecademy Python • Python for Everybody

Data Science Foundations: Kaggle Learn • DataCamp Intro to Python • Coursera Python for Data Science

Statistical Computing: Think Python • Automate the Boring Stuff • Real Python Tutorials

🎯 All skill levels • 📈 Beginner to Advanced • 🆓 Many free options

Week 2: Introduction to Probability

Understanding Uncertainty Through Statistics

This week introduces fundamental concepts in probability theory, including sample spaces, events, conditional probability, independence, and Bayes’ theorem. You’ll learn to quantify uncertainty and make informed decisions with incomplete information.

📚 Core Materials

Required Reading

OpenIntro Statistics, Chapter 3

Essential foundations covering probability definitions, sample spaces, events, conditional probability, and independence.

📖 PDF • ⏱️ 3-4 hours • 🎯 Beginner-Intermediate

Supplementary Reading

Elements of Set Theory for Probability

Interactive guide to set theory operations with Venn diagrams and visual explanations of probability concepts.

🌐 Online Book • ⏱️ 1-2 hours • 📊 Interactive diagrams

Video Lectures

MIT: Introduction to Probability

High-quality video lectures covering probability axioms, sample spaces, and basic probability rules with problem sets.

🎬 Video + Notes • ⏱️ 1 hour • 🎓 University Level

Quick Reference

Probability Formulas Cheat Sheet

Comprehensive reference sheet covering all major probability formulas including conditional probability and independence.

📄 PDF • ⚡ Quick reference • 🎯 All levels

💻 Interactive Tools & Practice

Visualizations

Seeing Theory: Probability

Interactive visual introduction with animations for conditional probability, Bayes’ theorem, and independence.

🎮 Interactive Web App • ⏱️ 1-2 hours

Simulations

StatKey: Probability Simulations

Interactive probability simulation tool with tree diagrams, conditional probability, and Bayes’ theorem calculators.

🌳 Interactive Tool • ⏱️ 1 hour

Practice Problems

Khan Academy: Probability

Comprehensive practice problems covering basic probability, conditional probability, and independence with instant feedback.

📝 Interactive Problems • ⏱️ 3-4 hours

Python Code

Matplotlib Statistical Gallery

Gallery of statistical visualizations including probability distributions, Venn diagrams, and tree diagrams.

📊 Code Examples • 🛠️ Python, Matplotlib

🎯 Learning Objectives & Study Plan

Mastery Checklist

Week 2 Learning Goals

After completing this week, you should be able to:

✅ Self-Assessment • 🎯 Beginner-Intermediate

Study Schedule

Recommended Learning Path

Days 1-2: Core Reading (OpenIntro Chapter 3)

Day 3: Interactive tutorials and visualizations

Day 4: Video lectures and supplementary readings

Day 5: Practice problems and exercises

Day 6: Review and concept integration

Day 7: Assessment preparation

📅 7-day plan • ⏱️ 2-3 hours daily

🔍 Additional Resources

Extended Learning

Supplementary Materials by Learning Style

Visual Learners: Khan Academy Videos • Treena Notes • Math is Fun Stats

Practical Applications: Medical Diagnosis Examples • Real-world Problems

Advanced Study: MIT 6.041 Full Course • Probability Fallacies Guide

🎯 All learning styles • 📈 Beginner to Advanced

Week 3: Conditional Probability, Counting & Discrete Random Variables

Advanced Probability & Discrete Distributions

This week deepens your understanding of conditional probability and Bayes’ theorem, introduces counting principles (permutations and combinations), and explores discrete random variables including their probability mass functions, expected values, and common distributions.

📚 Core Materials

Required Reading

OpenIntro Statistics, Chapter 3 sections 3.3-3.4

Essential coverage of conditional probability, Bayes’ theorem, counting principles, and discrete random variables. Includes probability mass functions and expected values.

📖 PDF • ⏱️ 3-4 hours • 🎯 Intermediate

Bayes’ Theorem

Seeing Theory: Bayes’ Theorem

Interactive visualization of Bayes’ theorem with medical testing examples, false positives/negatives, and real-world applications. Essential for understanding conditional probability.

🎮 Interactive • ⏱️ 1 hour • 🔍 Visual Learning

Combinatorics

Khan Academy: Counting & Probability

Comprehensive coverage of counting principles, permutations, combinations, and their applications to probability with step-by-step examples and practice problems.

🎬 Video + Practice • ⏱️ 2-3 hours • 🎯 Beginner-Friendly

Discrete Distributions

Seeing Theory: Random Variables

Interactive exploration of discrete random variables, probability mass functions, and common distributions (Bernoulli, Binomial, Geometric, Poisson) with parameter adjustments.

🎮 Interactive • ⏱️ 1-2 hours • 📊 Distribution Explorer

💻 Interactive Tools & Practice

Bayes Calculator

Interactive Bayes’ Theorem Calculator

Step-by-step Bayes’ theorem calculator with medical testing examples, tree diagrams, and visual representations of prior and posterior probabilities.

🧮 Calculator • ⏱️ 30 minutes • 🎯 All Levels

Combinatorics Tool

Permutation & Combination Calculator

Online calculator for permutations, combinations, and factorial calculations with explanations and step-by-step solutions for complex counting problems.

🧮 Calculator • ⚡ Quick Results • 📊 Problem Solving

Python Documentation

SciPy Stats: Discrete Distributions

Complete reference for discrete probability distributions in Python including Bernoulli, Binomial, Geometric, and Poisson with PMF, CDF, and random generation functions.

📚 API Docs • 🐍 Python • 🎯 Intermediate-Advanced

Interactive Simulations

Treena: Probability Distributions

Interactive probability distribution notes and overview for discrete random variables.

🎮 Interactive Tool • ⏱️ 1 hour • 📈 Visual Distributions

🎯 Learning Objectives & Study Plan

Mastery Checklist

Week 3 Learning Goals

After completing this week, you should be able to:

✅ Self-Assessment • 🎯 Intermediate • 🎲 Probability & Counting

Study Schedule

Recommended Learning Path

Days 1-2: Conditional probability and Bayes’ theorem review

Day 3: Counting principles: permutations and combinations

Day 4: Introduction to discrete random variables and PMFs

Day 5: Expected values, variance, and common distributions

Day 6: Python implementation and interactive practice

Day 7: Real-world applications and problem integration

📅 7-day plan • ⏱️ 3-4 hours daily • 🔄 Building on Week 2

🔍 Additional Resources

Extended Learning

Supplementary Materials by Topic

Bayes’ Theorem Applications: Medical Diagnosis Examples • Spam Filtering Kaggle Python Example • Spam Filtering • Legal Evidence

Combinatorics: Art of Problem Solving • Brilliant Combinatorics • Pascal’s Triangle

Discrete Distributions: Wolfram MathWorld • NIST Engineering Statistics • Real-world Examples

Python Programming: NumPy Random Documentation • Matplotlib Statistical Plots • Jupyter Notebooks Examples

Advanced Topics: Information Theory • Markov Chains • Maximum Likelihood Estimation

🎯 All skill levels • 📈 Beginner to Advanced • 🔢 Mathematical Applications

Week 4: Statistical Methods & Testing

🔬

Hypothesis Testing Fundamentals

Week 4 will cover hypothesis testing, and two sample t-Tests. Materials will be posted by week 4.