Foto 7

Prof. Vishal Monga, Pennsylvania State University, University Park, USA, "Computational Imaging", 17-20 April 2023

16 hours (4 credits)


Aula Riunioni del Dipartimento di Ingegneria dell’Informazione, Via G. Caruso 16, Pisa - Ground Floor

To register to the course, click here

Short Abstract:

The course will focus on modern image processing and computer vision problems with a strong computational flavor. We will start with image representations from a linear algebraic standpoint – from the classical Fourier to 2-D discrete cosine (DCT) and wavelet transforms, and finally sparse signal representations. Based on this foundation, two key areas will be emphasized: a.) the fundamental generative problem of image resolution enhancement, popularly known as image super-resolution. Both model based and machine-learning methods will be covered, culminating in their combination. b.) the discriminative problem of image classification and segmentation (pixel level classification). The goal will be to show how optimization principles help in the design of prior guided (or domain enriched) learning frameworks that can integrate the robustness merits of classical model based techniques with the superior modeling capacity of machine learning and artificial intelligence (AI) techniques such as modern deep learning architectures.

Course Contents in brief:

Day 1 (4 hours): Course Overview and Mathematical Preliminaries

  1. The anatomy of an image: historical context and new challenges
  2. Linear Algebra Review
  3. Convex Optimization Review
  4. 2-D Fourier Transform: Interpretation and Visualization

Day 2 (4 hours): Image Transforms 

  1. 2-D Discrete Cosine Transform
    • Analytical derivation
    • Energy compaction principle
    • Connections to DFT
    • Compression demo
  2. 1-D and 2-D Wavelet Transforms (the evolution from STFT)
    • Wavelets for joint time-frequency localization
    • Mallat pyramid algorithm for fast computation of 1-D/2-D DWT
  3. Applications of image transforms in compression, denoising 

Day 3 (4 hours): Image Super-resolution

  1. Introduction of image super-resolution
  2. Model-based multi-frame image super-resolution methods
  3. Sparsity based super-resolution methods
  4. Deep learning-based image super-resolution

Day 4 (4 hours): Discriminative Problems – Image Classification and Segmentation

  1. Sparsity based methods (use linear algebra and optimization review)
  2. Dictionary Learning (use linear algebra and optimization review)
  3. Deep Learning methods
  4. Applications: face recognition, medical image analysis for diagnosis


  1. Day1 – 9 AM – 1 PM
  2. Day2 – 9 AM – 1 PM
  3. Day3 – 9 AM – 1 PM
  4. Day4 – 9 AM – 1 PM
  5. Day5 (Friday) – Project assigned to be completed and submitted by following Monday.