IR-101: Information Retrieval Fundamentals: BM25 to Modern Search

Course Description

Foundations of information retrieval from first principles. The Boolean retrieval model and its limitations. TF-IDF: term frequency, inverse document frequency, and their variants. BM25: the probabilistic model that powers most search engines today. Inverted indexes: construction, compression, intersection algorithms. Evaluation metrics: precision, recall, NDCG, MRR. Static site search with Pagefind. Elasticsearch and Solr: practical configuration for content search. Students build a complete search system for a document corpus.