WCL2R : a benchmark collection for Learning to rank research with clickthrough data.

dc.contributor.authorAlcântara, Otávio D. A.
dc.contributor.authorPereira Junior, Álvaro Rodrigues
dc.contributor.authorAlmeida, Humberto Mossri de
dc.contributor.authorGonçalves, Marcos André
dc.contributor.authorMiddleton, Christian
dc.contributor.authorYates, Ricardo Baeza
dc.date.accessioned2012-10-11T21:51:55Z
dc.date.available2012-10-11T21:51:55Z
dc.date.issued2010
dc.description.abstractWCL2R: A benchmark collection for Learning to rank research with clickthrough data In this paper we present WCL2R, a benchmark collection for supporting research in learning to rank (L2R) algorithms which exploit clickthrough features. Differently from other L2R benchmark collections, such as LETOR and the recently released Yahoo!’s collection for a L2R competition, in WCL2R we focus on defining a significant (and new) set of features over clickthrough data extracted from the logs of a real-world search engine. In this paper, we describe the WCL2R collection by providing details about how the corpora, queries and relevance judgments were obtained, how the learning features were constructed and how the process of splitting the collection in folds for representative learning was performed. We also analyze the discriminative power of the WCL2R collection using traditional feature selection algorithms and show that the most discriminative features are, in fact, those based on clickthrough data. We then compare several L2R algorithms on WCL2R, showing that all of them obtain significant gains by exploiting clickthrough information over using traditional ranking approaches.pt_BR
dc.identifier.citationALCÂNTARA, O. D. A. WCL2R : a benchmark collection for Learning to rank research with clickthrough data. Journal of Information and Data Management, v. 1, n. 3, p. 551-566, 2010. Disponível em: <http://seer.lcc.ufmg.br/index.php/jidm/article/viewFile/83/49>. Acesso em: 11 out. 2012.pt_BR
dc.identifier.issn21666288
dc.identifier.urihttp://www.repositorio.ufop.br/handle/123456789/1630
dc.language.isoen_USpt_BR
dc.rights.licensePermission to copy without fee all or part of the material printed in JIDM is granted provided that the copies are not made or distributed for commercial advantage, and that notice is given that copying is by permission of the Sociedade Brasileira de Computação. Fonte: o próprio artigo.
dc.subjectBenchmarkpt_BR
dc.subjectClicktroughpt_BR
dc.subjectLearning to rankpt_BR
dc.titleWCL2R : a benchmark collection for Learning to rank research with clickthrough data.pt_BR
dc.typeArtigo publicado em periodicopt_BR
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
ARTIGO_BenchmarkCollectionLearning.pdf
Size:
504.02 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: