DOTTORATO DI RICERCA IN INGEGNERIA DELL'INFORMAZIONE

Foto 7

Link Generali

Info per...

Login

Prof. Fabrizio Silvestri, DIAG, Sapienza University of Roma, Dr. Nicola Tonellotto, DII, University of Pisa, "Neural Models and Techniques in Natural Language Processing and Information Retrieval", 7-11 February 2022

Hours:
20 hours (5 credits)

Room:

Aula Riunioni del Dipartimento di Ingegneria dell’Informazione, Via G. Caruso 16, Pisa - Ground Floor

To register to the course, click here

Short Abstract:

Advances from the natural language processing community have recently sparked a renaissance in the task of ad-hoc search. Particularly, large contextualized language modeling techniques, such as BERT, have equipped ranking models with a far deeper understanding of language than the capabilities of previous bag-of-words models. Applying these techniques to a new task is tricky, requiring knowledge of deep learning frameworks, and significant scripting and data munging. In this course, we provide background on classical (e.g., Bag of Words), modern (e.g., Learning to Rank). We introduce students to the Transformer architecture also showing how they are used in foundational aspects of modern large language models (e.g., BERT) and contemporary search ranking and re-ranking techniques. Going further, we detail and demonstrate how these can be easily experimentally applied to new search tasks in a new declarative style of conducting experiments exemplified by the PyTerrier search toolkit.

Course Contents in brief:

PyTorch
Language Models
Self-attention
Transformers
BERT and beyond
HuggingFace Transformers
PyTerrier
Classical IR: bag of words and probabilistic ranking
Modern IR: learning to rank
Contemporary IR: neural models and techniques

Schedule:

Day 1 – 9 – 13. Intro to PyTorch, Language Models, Implementing Word2Vec in PyTorch. Examples in Google Colab.
Day 2 – 9 – 13. Self-attention, Transformers, BERT, and Beyond. HuggingFace Transformers. Examples in Google Colab.
Day 3 – 9 – 13. Intro to Information Retrieval. Classical models and limitations. PyTerrier. Examples in Google Colab.
Day 4 – 9 – 13. Neural Models for IR. Examples in Google Colab.
Day 5 – 9 – 13. Exam

Published in Elenco Attività Formazione, A.A. 2021/2022 - List of accredited courses, 2021/2022

Download attachments:

PhD_Course_Fabrizio_Silvestri_Nicola_Tonellotto.pdf

News

Subscribe to this RSS feed

Tel +39 050 2217511
PEC: Questo indirizzo email è protetto dagli spambots. È necessario abilitare JavaScript per vederlo.

Dipartimento di Ingegneria dell'Informazione
P.I. 00286820501 - C.F. 80003670504

email: Questo indirizzo email è protetto dagli spambots. È necessario abilitare JavaScript per vederlo.
Via G. Caruso - 56122 - Pisa