Question 1

How do I start learning information retrieval from scratch?

Accepted Answer

Begin with the 'Books' section, particularly 'Introduction to Information Retrieval' by Manning et al., and complement it with online courses listed under 'Courses' from universities like Stanford or UT Austin for a structured foundation.

Question 2

What's the difference between TREC and CLEF datasets?

Accepted Answer

TREC focuses on English-language web and news collections with various tracks for specific tasks like web search or question answering, while CLEF emphasizes multi-lingual and European language datasets, making it better for cross-lingual IR research.

Question 3

Apache Lucene or Indri: which should I use for my project?

Accepted Answer

Apache Lucene is more widely adopted in industry with a larger community, suitable for production use. Indri, from the Lemur Project, is favored in academia for research due to its toolkit for language modeling and experimental flexibility.

Question 4

How can I add a resource to the awesome information retrieval list?

Accepted Answer

Submit a pull request on GitHub with the new link and a brief description, following the contribution guidelines in the README to ensure it aligns with the project's curation standards for quality and relevance.

Question 5

Where can I find recent talks on deep learning in search?

Accepted Answer

Check the 'Talks' section for technical presentations, but it may not be exhaustive; supplement with conference websites like SIGIR or WSDM for the latest video lectures, as the list focuses on curated rather than real-time updates.

Question 6

How to use TREC datasets for a machine learning model?

Accepted Answer

Download a TREC collection from the NIST website, preprocess documents and queries as per track guidelines, then use IR tools like Lucene for indexing or libraries like scikit-learn for feature extraction, evaluating with standard metrics like MAP or NDCG.

awesome Information Retrieval

What is awesome Information Retrieval?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions