Second Semester
Fri: 11:00 a.m. - 1:00 p.m., Aula 8
Fri: 2:00 p.m. - 4:00 p.m., Aula D8
Introduction [pdf]
AI spring? Artificial Intelligence, Machine Learning, Deep Learning: facts, myths and a few reflections.
Fundamentals: Artificial Neural Networks [pdf]
Foundations of machine learning: dataset, representation, evaluation, optimization. Feed-forward neural networks as universal approximators.
Flow Graphs and Automatic Differentiation [pdf]
Tensorial representation, flow graphs. Automatic differentiation: primal graph, adjoint graph.
Aside 1: Mini-Batches as Tensors [pdf]
Deep Neural Networks [pdf]
Deeper networks: potential advantages and new challenges. Tensorial layerwise representation.
Regression vs. Classification [pdf]
Softmax and cross-entropy. Likelihood and loss functions. Regularization.
Shannon Entropy (Wikipedia)
Cross Entropy (Wikipedia)
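As a rough illustration of the softmax and cross-entropy topics in this lecture, the sketch below turns raw class scores into probabilities and computes the loss for one labelled example. It is a minimal sketch, not taken from the course material: the function names and the example scores are assumptions chosen for illustration.

```python
import math

def softmax(logits):
    """Turn a list of real-valued scores into a probability distribution."""
    m = max(logits)  # subtracting the max avoids overflow in exp()
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, target_index):
    """Negative log-probability assigned to the correct class."""
    return -math.log(probs[target_index])

# Hypothetical scores for three classes; class 0 is taken as the true label.
logits = [2.0, 1.0, 0.1]
probs = softmax(logits)
loss = cross_entropy(probs, target_index=0)
```

With a one-hot target, minimizing this loss is the same as maximizing the log-likelihood of the correct class, which is the link between likelihood and loss functions named above.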
Learning as Optimization [pdf]
Vanishing and exploding gradients. First and second order optimization, approximations, optimizers. Further tricks.
Predictions [pdf]
Optimization and prediction: bias and variance. Overfitting. Evaluating classifiers.
Aside 2: Hardware for Deep Learning [pdf]
Aside 3: Differentiating Algorithms [pdf]
Deep Convolutional Neural Networks [pdf]
Convolutional filter, filter banks, feature maps, pooling, layerwise gradients.
Deep Convolutional Neural Networks and Beyond [pdf]
Some insight into what happens in convolution layers. Different DCNN architectures. Transfer learning.
J Yosinski, J Clune, Y Bengio, H Lipson, "How transferable are features in deep neural networks?" in Advances in Neural Information Processing Systems (NIPS 2014) [link]
Recurrent Neural Networks [pdf]
Recurrent Neural Networks (RNN), temporal unfolding, LSTM cells, GRU cells, encoder/decoder, convolution, time series analysis.
Auto-Encoders [pdf]
A very popular and powerful network architecture pattern, which is also the basis for diffusion models. The relation between Auto-Encoders and Principal Component Analysis.
Aside 4: Word Embedding [pdf]
Skip-grams, probability distributions of context and center words, training and results, continuous bag of words (CBOW) model.
Attention and Transformers [pdf]
Attention as a kernel, queries, keys and values, attention maps, transformer layer.
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, A N Gomez, L Kaiser, I Polosukhin, "Attention Is All You Need" in Advances in Neural Information Processing Systems (NIPS 2017) [link]
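The scaled dot-product attention described in the paper above can be sketched in a few lines of plain Python. This is an illustrative sketch only: the function names and the toy vectors are assumptions, and it leaves out masking, multiple heads and the learned projections.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of vectors.

    Returns (outputs, attention_map). Row i of attention_map holds the
    weights that query i assigns to each key.
    """
    d_k = len(keys[0])
    outputs, attention_map = [], []
    for q in queries:
        # Similarity of the query with every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        weights = softmax(scores)
        attention_map.append(weights)
        # Output is the weighted average of the value vectors.
        outputs.append([sum(w * v[j] for w, v in zip(weights, values))
                        for j in range(len(values[0]))])
    return outputs, attention_map

# Toy example (hypothetical numbers): one query, two key/value pairs.
queries = [[1.0, 0.0]]
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0]]
outputs, attention_map = attention(queries, keys, values)
```

The query is aligned with the first key, so the first value vector gets the larger weight in the output.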
Large Models: LLMs, VLMs [pdf]
LLM architectures: Encoder-Decoder, Encoder-Only, Decoder-Only. LLM training strategies, datasets and loss functions. Attention layer variants. Vision Transformers, multimodality.
Aside 5: Segmentation and Object Detection [pdf]
Aside 6: Kullback-Leibler divergence [pdf]
Shannon's entropy in information theory: intuition and formalism. Cross-entropy, KL divergence and its relation to likelihood.
Kullback-Leibler Divergence (Wikipedia)
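The relation between entropy, cross-entropy and KL divergence can be checked numerically: for discrete distributions p and q, the cross-entropy H(p, q) equals H(p) + KL(p || q). The sketch below is an illustration only: the function names and the two example distributions are assumptions, and it assumes q is non-zero wherever p is non-zero.

```python
import math

def entropy(p):
    """Shannon entropy H(p) in nats."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def cross_entropy(p, q):
    """Cross-entropy H(p, q) in nats; assumes q_i > 0 wherever p_i > 0."""
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q) if pi > 0)

def kl_divergence(p, q):
    """KL(p || q) in nats; assumes q_i > 0 wherever p_i > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical distributions over three outcomes.
p = [0.5, 0.25, 0.25]
q = [0.4, 0.4, 0.2]

# Difference between the two sides of H(p, q) = H(p) + KL(p || q).
gap = cross_entropy(p, q) - (entropy(p) + kl_divergence(p, q))
```

Since H(p) does not depend on q, minimizing the cross-entropy over q is equivalent to minimizing KL(p || q). This is why maximum-likelihood training with a cross-entropy loss can be read as fitting the model distribution to the data distribution.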
Generative Networks: VAE [pdf]
Generative adversarial networks (GAN), Variational Auto-Encoders (VAE): structuring the latent space. Gaussian-Mixture VAE: adapting to multiple classes.
Generative Networks: Diffusion Models [pdf]
Denoising Diffusion Probabilistic Models (DDPM), mathematical foundations, practical implementation, conditioning on multimodal labels.
Aside 7: Reinforcement Learning [pdf]
A short recap of RL foundations: Markov decision process, state value function, policy, optimality, action value function, Q-learning.
Deep Reinforcement Learning [pdf]
Integrating DNNs into the RL paradigm, DQN algorithm, policy gradient, Actor-Critic methods, PPO, GRPO.
Marco Piastra
Contact: marco.piastra@unipv.it
Christopher Bishop, Hugh Bishop
Deep Learning: Foundations and Concepts
Springer, 2024
[Online version]
Simon J.D. Prince
Understanding Deep Learning
The MIT Press, 2023
[Online version]
Aston Zhang, Zachary Lipton, Mu Li, Alexander Smola
Dive into Deep Learning
Cambridge University Press, 2024
[Online version, with exercises]
Kevin P. Murphy
Probabilistic Machine Learning: Advanced Topics
The MIT Press, 2023
[Pre-print]
Ian Goodfellow, Yoshua Bengio, Aaron Courville
Deep Learning
The MIT Press, 2016
[Online version]
Richard S. Sutton, Andrew G. Barto
Reinforcement Learning: An Introduction (second edition)
MIT Press, 2018
[Online version]