Syllabus for Fa23 - PREDICTIVE MACHINE LEARNING (53235)

Course Syllabus

Instructor:

Professor Chandrajit Bajaj

Lecture Hours – Mon, Wed 3:30 - 5:00 pm. GDC 2.210 or Zoom.
Office hours -- Tue 1:00 p.m. - 3:00 p.m. or by appointment (Zoom or POB 2.324)
Contact: bajaj@oden.utexas.edu

NOTE: All questions related to class should be posted through Piazza. You can register via the Piazza registration link or join via the Piazza Tab on the Canvas course page.

Teaching Assistant

Omatharv Vaidya

Office hours – Tue/Thurs 10:00 a.m. - 11:00 a.m. GDC 1.302 - TA station Desk 2 or Zoom.
Contact: vomatharv@gmail.com

Note: Please attempt to make reservations a day before to avoid conflicts.

Course Motivation and Synopsis

This course, offered in the Spring and Fall, is on the design and performance analysis of optimally controlled, aka reinforcement learned, statistical machine learning algorithms that are trained, verified, and validated on filtered, noisy observation data distributions collected from various multi-scale dynamical systems. The performance metrics will be on online and energy-efficient training, verification, and validation protocols that achieve principled and stable learning for maximal generalizability, interpretability, and safety. The emphasis will be on possibly corrupted data and/or the lack of full information for the learned stochastic decision-making dynamic algorithmic process. Special emphasis will also be given to the underlying mathematical and statistical physics principles of Hamiltonian dynamics. Students shall thus be exposed to the latest stochastic machine learning modeling approaches for optimized decision-making, multi-player games involving stochastic dynamical systems, and optimal stochastic control. These latter topics are foundational to the training of multiple neural networks (agents), both cooperatively and in adversarial scenarios, to optimize the learning process of all the agents.

An initial listing of lecture topics and reference material is given in the syllabus below. This is subject to some modification, given the background and speed at which we cover ground. Homework exercises shall be given almost bi-weekly. Assignment solutions that are turned in late shall suffer a 10% per day reduction in credit and a 100% reduction once solutions are posted. There will be a mid-term exam in class. The exam content will be similar to the homework exercises. A list of topics will also be assigned as take-home final projects to train the best of scientific machine-learned decision-making agents. The projects will involve modern ML programming, an oral presentation, and a written report submitted at the end of the semester.

This project shall be graded and be in lieu of a final exam.

The course is open to graduate students in all disciplines. Students in the 5-year master's program, and those in CS, CSEM, ECE, MATH, STAT, PHYS, CHEM, and BIO, are welcome. You’ll need an undergraduate-level background in the intertwined topics of algorithms, data structures, numerical methods, numerical optimization, functional analysis, algebra, geometry, topology, statistics, and stochastic processes. You will need programming experience (e.g., Python) at a CS undergraduate senior level.

Course Reference Material (+ reference papers cited in lectures)

[B1] Chandrajit Bajaj (frequently updated) A Mathematical Primer for Computational Data Sciences
[M] Kevin Murphy Machine Learning: A Probabilistic Perspective
[MU] Michael Mitzenmacher, Eli Upfal Probability and Computing (Randomized Algorithms and Probabilistic Analysis)
[SB] Richard Sutton, Andrew Barto Reinforcement Learning
[SD] Shai Shalev-Shwartz, Shai Ben-David Understanding Machine Learning, From Theory to Algorithms
[Basar] Tamer Basar Lecture Notes on Non-Cooperative Game Theory.
[BHK] Avrim Blum, John Hopcroft, and Ravindran Kannan. Foundations of Data Science
[PML1] Kevin Murphy Probabilistic Machine Learning: An Introduction.
[PML2] Kevin Murphy Probabilistic Machine Learning: Advanced Topics.
[BV] Stephen Boyd and Lieven Vandenberghe Convex Optimization.
Extra reference materials.

TENTATIVE COURSE OUTLINE (in Flux).

Date

Topic

Reading

Assignments

Mon

08-21-2023

1. Introduction to High-Dimensional Spaces, Belief, and Decision-Making Spaces, [Lec1]

Dynamical Systems and Deep Learning [notes]

Modern Statistical Machine Learning [notes]

[M] Ch 1.1, 1.2, 1.3

[SB] Ch1

Wed

08-23-2023

2. Learning Stochastic Regression Models [Lec2]

Geometry of Norms and Approximations [notes];

Log-Sum-Exponential-Stability [notes]

[SD]

[M] Ch 1

[A1] with [latex template] out today; due by 09-06-2023, 11:59 pm

Mon

08-28-2023

3. Learning Theory and Model Selection [Lec3]

PAC learning, Complexities [notes]

Probability, Information and Probabilistic Inequalities [notes]

[M] Ch 1.4.7, 1.4.8

[MU] Ch 1-3

[B1] Appendix

[PML1] 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 3.2, 4.7.

Wed

08-30-2023

4. Stochastic Machine Learning: Entropy, Distributional Estimates [Lec 4]

[MU] Chap 4, 24.2

[BHK] Chap 12.4,12.6

[PML1] Chap 4.1, 4.2, 4.5, 4.7, 6.1, 6.2.

[PML2] Chap 3.3, 5.1, 5.2, 15.1, 15.2, 15.3

Wed

09-06-2023

5. Sampling in High-Dimensional Space-Time 2: [Lec-part1] Concentration of Measure [notes]

[SD] Chap 12

[A1] due by midnight.

[A2] will be out on 09-10-2023; due by 09-24-2023, 11:59 pm.

Mon

09-11-2023

6. Sampling in High-Dimensional Space-Time 3: Monte Carlo vs Quasi-Monte Carlo, Relationship to Integration Error, H-K Inequality [Lec4-part1][Lec4-part2]

[PML1] 3.6

[PML2] 2.6, 4, 7.4.5, 7.4.6, 12.1, 12.2, 12.3, 12.6.

Wed

09-13-2023

7. Statistical Machine Learning 1: Introduction to Markov Chains, PageRank, MCMC [Lec-notes, notes2]

[BHK] Chap 4

[MU] Ch 7, 10

Mon

09-18-2023

8. Statistical Machine Learning 2: Sampling and Learning with MCMC Variations [Lec7]

[SD] Chap 24

[BV] Chap 1-5

Wed

09-20-2023

9. Statistical Machine Learning 3: Bayesian Inference with MCMC and Variational Inference [notes]

Learning by Random Walks on Graphs [notes-BHK]

[BHK] Chap 2.7

[SD] Chap 23, 24

[A2] due by 09-24-2023, 11:59 pm.

Mon

09-25-2023

10. Learning SVM via Continuous Stochastic Gradient Descent Optimization [notes]

Learning with SGD variations (Adagrad, RMSProp, Adam, ...) [notes]

Continuous Stochastic Gradient Descent (SGD) -- Simulated Annealing, Fokker-Planck [notes]

[M] Ch 9, 13

Wed

09-27-2023

11. Non-convex Optimization: Projected Stochastic Policy Gradient [Notes]

[M] Ch 2, 5

[A3] will be out on 09-28-2023; due by 10-12-2023, 11:59 pm.

Mon

10-02-2023

12. Random Projections, Johnson-Lindenstrauss, Compressive Sensing [notes]

Tensor Sketching in Space-Time [notes2]

[M] Ch 4

Wed

10-04-2023

13. Robust Sparse Recovery; Alternating Minimization [notes2]

[M] Ch 15

[M] Chap 11

[PML1] 3.2, 3.3, 17.2

[PML2] 2.3, 18, 23, 35.4.3

Mon

10-09-2023

14. Statistical Machine Learning 4: Learning Models with Latent Variables / Expectation Maximization [notes]

[M] Ch 15

Wed

10-11-2023

15. Statistical Machine Learning 5: Multivariate Gaussians and Gaussian Processes [notes1] [notes] [notes2]

[BHK] Chap 5

[A3] due by 10-12-2023, 11:59 pm.

Mon

10-16-2023

16. Statistical Machine Learning V: Non-Gaussian Processes, Conjugate Priors [notes]

[M] Chap 14

Wed

10-18-2023

MIDTERM examination

Mon

10-23-2023

17. Learning Dynamics, Lyapunov Stability and connections to Training Deep Networks [notes]

Learning Dynamics: Auto-Regressive Machine Learning [notes]

[A4] will be out today; due by 11-07-2023, 11:59 pm.

Wed

10-25-2023

18. Learning Dynamics with Neural ODEs (NODEs): Adjoint Method for BackProp [notes]

Implicit Euler, Convergence [notes]

Non-convex Projected Gradient Descent [notes-references]

Mon

10-30-2023

19. Introduction to Stochastic Processes [pdf]

Learning Dynamics with Stochastic Processes [notes]

Learning Dynamics with Stochastic Neural ODEs (SNODEs) : Stochastic Adjoint Methods I [notes] [notes2]

See references cited in notes

Wed

11-01-2023

20. Learning Dynamics with Control and Optimality [notes]

See references cited in notes

Mon

11-06-2023

21. The role of Sensors and Optimal Sensor Fusion:

Basics of Kalman Filters [notes]

Illustrated Kalman Filters [notes]

Project details will be out on 11-07-2023; Part (I) of the project due by 11-30-2023, 11:59 pm.

Wed

11-08-2023

22. Reinforcement Learning 2: Learning Dynamics with Optimal Control: Dynamics LQR, iLQR, iLQG [notes]

See references cited in notes and paper

[A4] due today, i.e., 11-08-2023, 11:59 pm.

Mon

11-13-2023

23. Reinforcement Learning 3: Guided Policy Search [notes]

See references cited in [notes]

Wed

11-15-2023

24. RL 3: Bandit Algorithms, Thompson Sampling [notes]

Statistical Machine Learning 6: Bandit Algorithms [notes]

See references cited in [notes]

Part 1 of the project due by 11-30-2023, 11:59 pm.

Project video presentation due by 12-12-2023; 11:59 pm.

Mon

11-27-2023

25. Time Series Analysis [notes] [notes]

Game-Theoretic Learning 1: MARL -Markov Games [notes].

Games & MARL II [notes]

Wed

11-29-2023

26. Reward Reshaping: Inverse Reinforcement Learning with Optimal Control [notes]

Game-Theoretic Learning 1: MARL -Markov Games [notes].

Games & MARL II [notes]

Game Theoretic Learning 2: Stackelberg Equilibrium [notes]

Fri

12-04-2023 onwards

27. Active Learning 2: Dynamic POMDPs

Diffusion Models with Stochastic Langevin Dynamics [notes]

Energy-Based Learning: Hopfield Networks, Boltzmann Machines, Restricted Boltzmann Machines. [notes]

Final Project video presentation due by 12-12-2023; 11:59 pm.

Addtl. Material

NeuralPMP: Reinforcement Learning with Stochastic Hamiltonian Dynamics, Pontryagin Maximum Principle [arxiv]

[Basar] See Lectures 1, 2, 3

Final report due by 12-12-2023; 11:59 pm.

Addtl. Material

Normalizing Flows [notes]

Actionable Learning [notes]

Robust Continuous learning of PDEs using Sparse Gaussian Processes [arxiv]

Markov Decision Process (MDPs) and Markov Games -- [notes]

Energy-Based Learning: Hopfield Networks, Boltzmann Machines, Restricted Boltzmann Machines. [notes]

Spectral Methods for Learning Dimension Reduction: KPCA, Eigenfaces, Fisherfaces [notes] [notes], KSVM [notes], Fisher LDA, KDA [notes]

Statistical Machine Learning: (a) Separating Mixture of Gaussians [notes] (b) Expectation Maximization [notes]

Some important Classical Machine Learning Background.

Addtl. Material

Robustness Guarantees for Bayesian Inference and Gaussian Processes [paper]

Risk Averse No Regret Learning for Convex Games [paper]

RL 4: Markov (Reward, Decision) Processes: MPs, MRPs, MDPs and POMDPs [notes]

Statistical Machine Learning 3: Bayesian Inference and Generative Models (VAEs and GANs) [notes1]

Bayesian Modelling and Inference [notes2]

Connections to Variational AutoEncoders (VAEs) [notes]

Statistical Machine Learning 4: Transform Sampling revisited, Sampling Non-Linear Probability Distributions [notes]. Generative Adversarial Networks [notes]

Learning with Normalizing Flows [notes]

Some Theoretical Bounds on Bayesian Optimization and Reinforcement Learning.

Project FAQ

1. How long should the project report be?

Answer: See directions in the Project section in assignments. For full points, please address each of the evaluation questions as succinctly as possible. You will get feedback on your presentations, which should also be incorporated into your final report.

Assignments, Exam, Final Project, and Presentation

There will be four take-home bi-weekly assignments, one in-class midterm exam, one take-home final project (in lieu of a final exam), and one presentation based on your project progress. The important deadline dates are:

Midterm: Wednesday, October 18, 3:30 pm - 5:00 pm.
Final Project Written Report Part 1: Due November 30th, 11:59 pm.
Final Project Written Report and Presentation Video: Due December 12th, 11:59 pm.

Assignments

There will be four written take-home HW assignments and one take-home final project report. Please refer to the above schedule for the due dates of the assignments and the final project report.

Assignment solutions that are turned in late shall suffer a 10% per day reduction in credit and a 100% reduction once solutions are posted.
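
As an illustration only, the late policy above can be sketched as a small Python function. The function name and parameters are hypothetical, and the sketch assumes details the syllabus does not state: that the 10% is taken from the earned score, that each started day counts as a full day, and that the daily reductions add linearly rather than compound. The instructor's actual accounting may differ.

```python
import math


def late_credit(raw_score, days_late, solutions_posted=False):
    """Return the credit kept after the late penalty (illustrative sketch).

    Assumptions, not stated in the syllabus: the penalty is a percentage of
    the earned score, partial days round up, and reductions add linearly.
    """
    if solutions_posted:
        # 100% reduction once solutions are posted.
        return 0.0
    if days_late <= 0:
        # On-time submissions keep full credit.
        return float(raw_score)
    # 10% per day late, capped at a 100% reduction.
    penalty_pct = min(100, 10 * math.ceil(days_late))
    return raw_score * (100 - penalty_pct) / 100
```

Under these assumptions, a submission that earned 90 points and arrives two days late would keep 72 points, and any submission made after solutions are posted would keep none.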

Course Requirements and Grading

Grades will be based on these factors:

In-class attendance and participation (5%)
HW assignments (50% and with the potential to get extra credit)

4 assignments. Some assignments may have extra questions for extra points you can earn. (They will be specified in the assignment sheet each time.)

In-class midterm exam (15%)
First Presentation & Report (10%)
Final Presentation & Report (20%)
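
For reference, the weighting above can be written as a short Python sketch. The dictionary keys and the function name are hypothetical, and the sketch assumes each component is reported as a percentage (homework may exceed 100 when extra credit is earned); how the final letter grade is derived from the weighted score is not specified here.

```python
# Weights in percent, taken from the grading list above.
WEIGHTS = {
    "participation": 5,
    "homework": 50,
    "midterm": 15,
    "first_presentation_report": 10,
    "final_presentation_report": 20,
}


def course_score(component_scores):
    """Weighted course score in percent (illustrative sketch).

    component_scores maps each key in WEIGHTS to a percentage score.
    """
    # The listed weights should account for the whole grade.
    assert sum(WEIGHTS.values()) == 100
    total = sum(WEIGHTS[name] * component_scores[name] for name in WEIGHTS)
    return total / 100
```

For example, scores of 100 (participation), 90 (homework), 80 (midterm), 70 (first presentation and report), and 85 (final presentation and report) would give a weighted score of 86.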

Students with Disabilities. Students with disabilities may request appropriate academic accommodations from the Division of Diversity and Community Engagement, Services for Students with Disabilities, 471-6259, http://www.utexas.edu/diversity/ddce/ssd .

Accommodations for Religious Holidays. By UT Austin policy, you must notify the instructor of your pending absence at least fourteen days prior to the date of observance of a religious holiday. If you must miss a class or an examination in order to observe a religious holiday, you will be given an opportunity to complete the missed work within a reasonable time before or after the absence, provided proper notification is given.

Statement on Scholastic Dishonesty. Anyone who violates the rules for the HW assignments or who cheats on in-class tests or the final exam is in danger of receiving an F for the course. Additional penalties may be levied by the Computer Science department, CSEM, and the University. See http://www.cs.utexas.edu/academics/conduct/