Probabilistic and Unsupervised Learning
Approximate Inference and Learning in Probabilistic Models
(2011)

Lecture slides and assignments will be posted here in due course.
Dates Mondays & Thursdays
3 October - 9 December 2011
Time 11:00-13:00
Tutorials Wednesdays 9:00-11:00
Lecturers Yee Whye Teh and Maneesh Sahani
TAs Arthur Guez and Marius Pachitariu
Location Basement B10 Seminar Room, Alexandra House, 17 Queen Square, London WC1N 3AR [directions]
About the course

This course (offered as two successive modules to MSc students) provides students with an in-depth introduction to statistical modelling, unsupervised, and some supervised learning techniques. It presents probabilistic approaches to modelling and their relation to coding theory and Bayesian statistics. A variety of latent variable models will be covered including mixture models (used for clustering), dimensionality reduction methods, time series models such as hidden Markov models which are used in speech recognition and bioinformatics, Gaussian process models, independent components analysis, hierarchical models, and nonlinear models. The course will present the foundations of probabilistic graphical models (e.g. Bayesian networks and Markov networks) as an overarching framework for unsupervised modelling. We will cover Markov chain Monte Carlo sampling methods and variational approximations for inference. Time permitting, students will also learn about other topics in probabilistic (or Bayesian) machine learning.

The complete course forms a component of the Gatsby PhD programme, and is mandatory for Gatsby students. The two component modules are also available to students on the UCL MSc in Machine Learning and UCL MSc in Computational Statistics and Machine Learning. The first part (GI18: Probabilistic and Unsupervised Learning) may be used to fill a core requirement in each Masters programme. The second part (GI16: Approximate Inference and Learning in Probabilistic Models) is an optional module, but is only available to students who have completed GI18.

Students, postdocs and faculty from outside the unit and these programmes are welcome to attend, but should contact the unit in advance.

Prerequisites

A good background in statistics, calculus, linear algebra, and computer science. You should thoroughly review the maths in the following cribsheet [pdf] [ps] before the start of the course. This quiz may help you check where you stand. You must either know Matlab or Octave, be taking a class on Matlab/Octave, or be willing to learn it on your own. Any student or researcher at UCL meeting these requirements is welcome to attend the lectures.

Text

There is no required textbook. However, the following are excellent sources for many of the topics covered here.

David J.C. MacKay (2003) Information Theory, Inference, and Learning Algorithms, Cambridge University Press. (good introduction to information theory and Bayesian machine learning; also available online)
Christopher M. Bishop (2006) Pattern Recognition and Machine Learning, Springer-Verlag. (good general introduction to basics of machine learning)
Daphne Koller and Nir Friedman (2009) Probabilistic Graphical Models, MIT Press. (in depth and broad treatment of graphical models, good for reference)
Reading material

Specific recommendations can be found here and will be updated as course progresses.

Lecture schedule and slides
GI18
3/10: Introduction/Foundations 6/10: Foundations
10/10: Latent Variable Models 13/10: The EM algorithm
17/10: EM contd. 20/10: Time Series models
24/10: Time Series models 27/10: Graphical Models
31/10: Graphical Models 3/11: Exact Bayes
7/11: Gaussian Processes
GI16 (subject to change)
10/11: Hierarchical and Nonlinear Models
14/11: Hierarchical and Nonlinear Models 17/11: no lecture
21/11: Variational Approximations 24/11: Belief Propagation
28/11: Convex Approximations 1/12: Monte Carlo Methods
5/12: Monte Carlo contd. 8/12: Expectation Propagation
12/12: no lecture 15/12: no lecture
 
Assignments
50% of the total mark (for each module) is based on coursework assignments (the other 50% being based on the final written examination).

All assignments (coursework) are to be handed in to the Gatsby Unit, not to the CS department. Please hand in all assignments at the beginning of lecture on the due date to the lecturer or to the TAs. Late assignments will be penalised. If you are unable to attend the lecture, you can also hand in assignments to Mr. Barry Fong at the Alexandra House 4th floor reception.

Late Assignment Policy: we will be using the official UCL-wide late policy. Marks will be reduced by 5% for assignments up to one day late, 15% for assignments between one and seven days late, and will receive no credit thereafter.

Assignment 1        [binarydigits.txt, bindigit.m] due 17 Oct 2011
Assignment 2        [geyser.txt] due 31 Oct 2011
Assignment 3        [co2.txt] due 14 Nov 2011
Assignment 4        [images.jpg, genimages.m, Mstep.m] due 5 Dec 2011
Assignment 5        [message.txt, symbols.txt, code.tgz] due 21 Dec 2011
Assignment 6        due 9 Jan 2012

 
To attend Please contact the unit.