Machine Learning Essentials

Prof. Dr. Ullrich Köthe, SS 2024

Machine learning is one of the most promising approaches to address difficult decision and regression problems under uncertainty. The general idea is very simple: instead of modeling a solution explicitly, a domain expert provides example data that demonstrate the desired behavior on representative problem instances. A suitable machine learning algorithm is then trained on these examples to reproduce the expert's solutions as closely as possible and to generalize them to new, unseen data. The last two decades have seen tremendous progress towards ever more powerful algorithms. This course attempts to cover all the essential methods, from linear classifiers and robust regression to neural networks and reinforcement learning. In short, it will be a one-semester "best of" version of the popular two-semester course sequence "Fundamentals of ML" and "Advanced ML".

The lecture belongs to the Master of Data and Computer Science program, but is also recommended for students in the Master of Physics (specialization Computational Physics) and the Master of Scientific Computing programs, as well as anyone else interested.

Solid knowledge of linear algebra, analysis (multi-dimensional differentiation and integration), and probability theory is required.

Dates:

The lecture starts on April 17 and runs until July 26, 2024.
Lecture Wednesdays 11:15-12:45 INF 230 Großer Hörsaal (Center for Organismal Studies, COS)
Lecture Fridays 11:15-12:45 INF 230 Großer Hörsaal (Center for Organismal Studies, COS)
Please sign up for the lecture via Muesli.
Homework assignments and other course material will be published on MaMPF.

Contents

  • Intro (learning from data, features and response, one-hot encoding, supervised/unsupervised/weakly supervised learning, notation, centered data)
  • Simple classifiers (threshold, perceptron & linear decision boundary, nearest neighbor and its Voronoi decision boundary); a nearest-neighbor sketch follows this list
  • Evaluation (training vs. test set, cross validation, confusion matrix, error rate, false positive/negative rate, precision/recall, ROC/AUC); a cross-validation sketch follows this list
  • Bayes' theorem (prior, likelihood, posterior), generative and discriminative classifiers
  • QDA, LDA (clustered data, multivariate Gaussian distribution, covariance matrix, precision matrix, generative model, maximum-likelihood estimation, i.i.d.)
  • SVM, LR, unification via loss functions
  • Non-linear classification: nearest neighbor and QDA recap, strategies: non-linear boundaries vs. augmented features, decision trees and forests, hand-crafted mappings
  • Neural networks: hand-crafted example, neurons, layers, architecture, activation functions, loss functions
  • Backprop, training tricks
  • Convolution, ConvNets
  • Famous CNNs and ResNets
  • U-nets and semantic segmentation
  • Ordinary least squares (normal equations, pseudo-inverse, Cholesky, QR, singular value decomposition, LSQR); an OLS sketch follows this list
  • Weighted LSQ, heteroscedastic loss, alternating optimization, IRLS
  • Bias-variance trade-off, ridge regression, LASSO, orthogonal matching pursuit
  • Non-linear regression: non-linear LSQ (short), regression trees/forests, regression neural networks
  • Gaussian processes
  • GP kernels, Bayesian hyper-parameter optimization
  • Robust regression: robust loss functions (median, Huber, logcosh), RANSAC algorithm
  • Linear dimension reduction: PCA, ICA, NMF
  • Non-linear dimension reduction: LLE, t-SNE, UMAP
  • Non-linear dimension reduction: (variational) auto-encoders
  • Generative modelling: GANs, normalizing flows
  • Clustering: hierarchical, k-means, k-means++, k-medoids, GMM, EM algorithm; a k-means sketch follows this list
  • Reinforcement learning, model-free RL, Deep Q-Learning
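
To give a flavor of some of the topics above, a few minimal Python sketches follow. They are illustrative simplifications, not official course material. First, a nearest-neighbor classifier as named in the "Simple classifiers" item; the toy data and all variable names are assumptions for the example.

```python
import numpy as np

def nearest_neighbor_predict(X_train, y_train, X_test):
    """Assign each test point the label of its closest training point
    (Euclidean distance); the induced decision regions are Voronoi cells."""
    # Pairwise squared distances, shape (n_test, n_train)
    d2 = ((X_test[:, None, :] - X_train[None, :, :]) ** 2).sum(axis=2)
    return y_train[d2.argmin(axis=1)]

# Toy usage (illustrative): two well-separated 2-D clusters, labels 0 and 1
rng = np.random.default_rng(0)
X_train = np.vstack([rng.normal(0.0, 1.0, (20, 2)), rng.normal(4.0, 1.0, (20, 2))])
y_train = np.array([0] * 20 + [1] * 20)
print(nearest_neighbor_predict(X_train, y_train, np.array([[0.5, 0.5], [4.0, 4.0]])))
```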
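Next, a sketch of k-fold cross-validation from the "Evaluation" item, written against a generic fit/predict classifier interface; `model_factory` and the 5-fold default are assumptions for the example.

```python
import numpy as np

def cross_val_error(model_factory, X, y, k=5, seed=0):
    """Estimate the error rate by training on k-1 folds and testing on the
    held-out fold, averaged over all k choices of the held-out fold."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))          # shuffle once, then split
    folds = np.array_split(idx, k)
    errors = []
    for i in range(k):
        test = folds[i]                    # held-out fold
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        model = model_factory()            # fresh model per fold
        model.fit(X[train], y[train])
        errors.append(np.mean(model.predict(X[test]) != y[test]))
    return np.mean(errors)                 # average error over folds
```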
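For the "Ordinary least squares" item, a sketch that solves the same regression problem via the normal equations and via the SVD-based pseudo-inverse (the latter is more robust for ill-conditioned design matrices); the synthetic data are an assumption.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 3))              # design matrix
beta_true = np.array([2.0, -1.0, 0.5])
y = X @ beta_true + 0.1 * rng.normal(size=100)

# Normal equations: solve (X^T X) beta = X^T y (solve, never invert)
beta_ne = np.linalg.solve(X.T @ X, X.T @ y)

# Pseudo-inverse, computed internally via the singular value decomposition
beta_pinv = np.linalg.pinv(X) @ y

print(beta_ne, beta_pinv)                  # both should be close to beta_true
```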
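Finally, for the "Clustering" item, a sketch of Lloyd's algorithm for k-means, alternating between assigning points to the nearest centroid and recomputing each centroid as the mean of its cluster; initialization by random sampling (rather than k-means++) is a simplification.

```python
import numpy as np

def kmeans(X, k, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]  # random init
    for _ in range(n_iter):
        # Assignment step: label each point with its nearest centroid
        d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        # Update step: move each centroid to the mean of its assigned points
        # (keep the old centroid if a cluster happens to be empty)
        new = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                        else centroids[j] for j in range(k)])
        if np.allclose(new, centroids):    # converged
            break
        centroids = new
    return centroids, labels
```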

Course material

To be published.

Textbooks: