immersinn-ds

Thu 10 November 2016

JM Smoothing Language Model For Ranking

Posted by TRII in text-retrieval-and-search-engines   

Introduction / Overview

We continue our work on the Text Retrieval and Search Engines course (see here for the last article). For the various topics covered in the course, the goal is to implement some of the methods and tools in order to gain some hands-on experience.

The previous articles looked at embedding documents and queries into an $n$-dimensional space, calculating the distances between query-document embeddings, and utilizing these distances as a measure of similarity between documents and queries.

Read more...


Mon 24 October 2016

Virginia Disc One Exploration

Posted by TRII in text-retrieval-and-search-engines   

Introduction / Overview

Virginia Disc One was "the first large-scale distribution of test collections" used in Information Retrieval. The goal was to create a large test collection that hundreds of researchers could contribute to and utilize for work in the IR field. While many larger, more comprehensive collections have been created and distributed since VD1 was first distributed in 1990, we thought it would be interesting (and fun!) to take a look at some of the contents and use them for future notebooks / articles.

Read more...