The lecture will cover the following topics:
- Basics: background, documents, terms, vocabulary, inverted index
- Boolean retrieval, positional retrieval, tolerant retrieval
- Efficient index construction, index compression
- Term weighting, relevance scoring, ranked retrieval
- Semantic text analysis, link analysis
- Complete retrieval systems
- Results visualization and exploration
- Evaluation of retrieval systems
In the exercise, students will work on applied research projects (teamwork is possible) that address complex information retrieval and natural language processing tasks. Using the programming language Python and presenting the intermediate and final results of the projects is mandatory.
The video below describes one of the projects you could contribute to. If interested, we also offer paid research assistant student jobs.
After successfully completing the course, students should be able to:
- Summarize major IR and NLP applications
- Explain important IR and NLP algorithms and data structures
- Determine the conceptual requirements of specific IR and NLP problems
- Compare the suitability of algorithms and data structures for specific tasks
- Devise solutions for complex IR and NLP tasks by implementing and adapting suitable algorithms and data structures
- Evaluate IR and NLP methods and systems quantitatively and qualitatively
The course provides a good foundation for a bachelor’s or master’s thesis in our group. Check this page for our current theses proposals.