Web Search and Data Mining

Course Description

Web Search and Data Mining (WiSDoM) is an area that aims at extracting knowledge from the largest source of information created by humans: The Web! Throughout this course we will see how this extracted knowledge can solve complex tasks with advanced Computer Vision, Natural Language and Information Retrieval algorithms. The main topics of this course are:

  • Text and visual data representation
  • Billion scale text and image search
  • Visual question answering
  • Multimodal conversational assistants
  • Recommender systems

This course includes intensive hands-on laboratories where key CV, NLP and IR algorithms are examined.

Objectives

  • Learn what is an information embedding.
  • Learn the semantic associations between visual data, natural language data and user data.
  • Learn how to relate user information needs to actionable data.
  • Learn how to do a critical analysis of experimental results.
  • Develop autonomous and creative problem solving skills.

Grading

  • Exam (40%)
  • Project (45% = 15% phase 1 + 20% phase 2 + 10% phase 3)
  • Project originality (15%)

Schedule

  • 09/mar/21 Introduction
  • 16/mar/22 Web document categorization
  • 23/mar/22 Word embeddings
  • 30/mar/22 Transformer (encoder)
  • 06/abr/22 Billion scale indexing
  • 13/abr/22 Visual search
  • 20/abr/22 Vision-and-language models
  • 27/abr/22 ExpoFCT
  • 04/mai/22 Multimodal conversational assistants
  • 11/mai/22 Pre-trainning and fine-tunning
  • 18/mai/22 Transformer (decoder)
  • 25/mai/22 Recommender systems
  • 01/jun/22 HuggingFace invited lecture
  • 08/jun/22 Project discussion / Test

Tutorials

We suggest you to use the account in the lab cluster. However, if you would like to have your own setup, you can follow this guide:

Lecturers

Joao Magalhaes ([email protected] - remove the ‘x’ character to send an email)