Daily Schedule Part 2 (Actual — Kept Retrospectively)
Regular meeting schedule is Wednesdays and Saturdays, 11:00-12:00
Back to Course home page
See also Daily Schedule - Part 1
Part 2: Data Science Foundations (using Joel Grus, Data Science from Scratch, 2nd Edition)
Part 2 Uses Grus and lasts for the remaining three-and-a-half weeks of Term 6
Week 4 — Yet Another Review of Python — Some Vector and Matrix Algebra — Statistics and Probability
- June 4 — Chapters 1-3: Another excellent review of Python and Matplotlib which will help systematize your understanding of the language features you were using in Pasha’s book — The assignment is to do the review of the three chapters, but to completely stop using Jupyter or Jupyter lab, and instead get everything working in PyCharm Professional Edition (free for students) or VS Code (but I have zero experience with that) — When Grus says (at the beginning of Chapter 2) that you should not be tampering with your base Python environment, he is completely correct (so learn how to make a venv that you could call grus or dsfs and then switch to it — if you didn’t already do that for working through Pasha)
- June 7 — Chapters 4-6: Linear Algebra (wherein Grus introduces his Vector and Matrix implementations which could have been classes, or could have leveraged numpy, but which he craftily used type aliases, because that was the simplest way to implement from scratch), Statistics, and Probability (due to having taken last fall’s Bayesian Statistics class, the math in Chapters 5 and 6 will be review)
Week 5 — Optimization (aka Minimization and Maximization) — Working with Data
- June 11 — Chapters 7 and 8: Hypotheses & Inference and Gradient Descent — Make a local repo from the magic hexijin.github.io GitHub repo, put an index.md file in it, and then push to origin main — The only remaining step to having your own home page at hexijin.github.io is to enable GitHub pages on the magic repo — In the meantime, you can go straight to this notes page
=== BELOW THIS DIVISION IS GOAL/TENTATIVE PLAN — NOT ACTUAL ===
- June 14 — Chapters 9 and 10: Getting and Working with Data
Week 6 — Machine Learning — Neural Networks — Start Deep Learning
- June 18 — Chapters 11 and 13: Machine Learning and Naive Bayes
- June 21 — Chapters 18 and 19: Neural Networks and Deep Learning — Good prepartion for your coding assessment, would be to do this real-time coding session to see how a real pro codes, including type-hinting, systematic adherence to style choices, and code testing: Joel Grus - Building a Deep Learning Library (build the code in PyCharm as Grus builds it in VS Code, pausing the video whenever you need to catch up with him) — This real-time coding session will also give you a blindingly fast overview of Chapters 18 and 19
Week 7 — Continue Deep Learning — Introduction to Natural Language Processing
- June 25 — Chapter 21: Natural Language Processing — Watch the fifth of the 3Blue1Brown videos, Transformers Explained Visually, by Grant Sanderson numbered DL1 to DL7 — “DL” is short for “Deep Learning,” and the seven videos were published from 2017 to 2024 — The fifth video gives you alook into the 2017 transformers revolution, and at what you would study next if you want to keep getting closer to the state of the art of machine learning and LLMs
See also Looking Beyond