Scraping SERPs to Model Ad-hoc Ranking Profiles

This archive contains the accompanying runs and scraped web content of our reproducibility study submitted to ECIR 2020. As part of this study, we investigated on the replicability and reproducibilty of submissions by M. Grossman and G. Cormack to TREC Common Core 2018. We reproduce the approach by using two different query formulation techniques and four different test collections from the news domain. In order to avoid re-scraping web content, we provide scraped artifacts in this archive. The corresponding code is hosted in an external repository.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.5281/zenodo.3490268
PID https://www.doi.org/10.5281/zenodo.3490267
URL http://dx.doi.org/10.5281/zenodo.3490267
URL https://figshare.com/articles/Scraping_SERPs_to_Model_Ad-hoc_Ranking_Profiles/11535759
URL http://dx.doi.org/10.5281/zenodo.3490268
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author Breuer, Timo, 0000-0002-1765-2449
Author Schaer, Philipp, 0000-0002-8817-4632
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Zenodo; Datacite; figshare
Hosted By Zenodo; figshare
Publication Date 2019-10-15
Publisher Zenodo
Additional Info
Field Value
Language UNKNOWN
Resource Type Dataset
system:type dataset
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/dataset?datasetId=dedup_wf_001::ce83aace330ddac71d0e33dba232f0ff
Author jsonws_user
Last Updated 10 January 2021, 20:30 (CET)
Created 10 January 2021, 20:30 (CET)