CS 145

Fall 2025 • Intro to Big Data Systems

Course Schedule

Weeks 1 to 10 • Tue and Thu • Nov 4 holiday • Test 1 on Thu Oct 30

SQL Systems nanoDB Transactions Distributed Systems
Week Dates Topics Deliverables Resources
1 Sep 23
Sep 25
SQL Section 1: SQL Basics
Kickoff. SQLite and BigQuery intro. Mission briefing: gear up with SQL.
Setup: Canvas • Gradescope • Ed Setup: BigQuery • Get Credits • Colab
2 Sep 30
Oct 2
SQL Section 1: SQL Basics continued
CTEs, subqueries, window functions. Debugging Text to SQL.
Oct 1 • PSET0
Oct 3 • PSET1 (Section 1)
3 Oct 7
Oct 9
Systems Section 2: Systems Primer
nanoDB Section 3: nanoDb
BigSort. Hash partition. Foundations for nanoDB.
Oct 8 • PSET2 (Section 1)
4 Oct 14
Oct 16
nanoDB Section 3: nanoDB
Indexes. Joins. Query planning.
Oct 12 • PSET3 (Section 2)
Oct 15 • Project 1
5 Oct 21
Oct 23
nanoDb Section 5: Scaling Analytics
Part 1: Parallelism. Distributed systems.
Oct 23: Guest speaker
Fast Analytics • Biswa Chattopadhyay (Google)
Oct 24 • PSET4 (Section 3)
6 Oct 28
Oct 30
TestTest 1
In class on Thu Oct 30
Oct 30 • Test 1
7 Nov 4 (No class)
Nov 6 (Thu only)
Transactions Section 4: Transactions
Concurrency. Locking. Recovery. Ticketing case study.
8 Nov 11
Nov 13
TransactionsSection 4: Transactions continued
Locking vs LSM trees. Real time collaboration.
9 Nov 18
Nov 20
Distributed Systems Section 5: Distributed Systems
Sharding. Replication. Global consistency. Guest Spanner talk.
Nov 20: Guest speaker
Spanner • Sebastian Kanthak (Google)
Nov 21: Project 2 (Data Science)
10 Nov 25 (Tue only)
Nov 27 (No class)
Distributed Systems Distributed Systems wrap up
Lessons from industry.
Nov 24 • PSET5
Nov 27: Thanksgiving no class
10 Dec 2
Dec 4
Lessons from industry
Dec 1: Project 2 (Data Systems)
Dec 2: Guest speaker
Data Security • Neil Daswani
Dec 4: Guest speaker