Skip to main navigation Skip to search Skip to main content

Predicting query performance on the web

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

22 Scopus citations

Abstract

Predicting the performance of web queries is useful for several applications such as automatic query reformulation and automatic spell correction. In the web environment, accurate performance prediction is challenging because measures such as clarity that work well on homogeneous TREC-like collections, are not as effective and are often expensive to compute. We present Rank-time Performance Prediction (RAPP), an effective and efficient approach for online performance prediction on the web. RAPP uses retrieval scores, and aggregates of the rank-time features used by the document-ranking algorithm to train regressors for query performance prediction. On a set of over 12,000 queries sampled from the query logs of a major search engine, RAPP achieves a linear correlation of 0.78 with DCG@5, and 0.52 with NDCG@5. Analysis of prediction accuracy shows that hard queries are easier to identify while easy queries are harder to identify.

Original languageEnglish
Title of host publicationSIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Pages785-786
Number of pages2
DOIs
StatePublished - 2010
Event33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010 - Geneva, Switzerland
Duration: Jul 19 2010Jul 23 2010

Publication series

NameSIGIR 2010 Proceedings - 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Conference

Conference33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010
Country/TerritorySwitzerland
CityGeneva
Period07/19/1007/23/10

Keywords

  • Performance prediction
  • Query difficulty
  • Web search

Fingerprint

Dive into the research topics of 'Predicting query performance on the web'. Together they form a unique fingerprint.

Cite this