Scraping SERPs to Model Ad-hoc Ranking Profiles - Items

Item
Groups

Scraping SERPs to Model Ad-hoc Ranking Profiles

This archive contains the accompanying runs and scraped web content of our reproducibility study submitted to ECIR 2020. As part of this study, we investigated on the replicability and reproducibilty of submissions by M. Grossman and G. Cormack to TREC Common Core 2018. We reproduce the approach by using two different query formulation techniques and four different test collections from the news domain. In order to avoid re-scraping web content, we provide scraped artifacts in this archive. The corresponding code is hosted in an external repository.

Tags

Data and Resources

To access the resources you must log in

This item has no data

Item URL

http://data.d4science.org/ctlg/RISIS2OpenData/dedup_wf_001--ce83aace330ddac71d0e33dba232f0ff

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field	Value
PID	https://www.doi.org/10.5281/zenodo.3490268
PID	https://www.doi.org/10.5281/zenodo.3490267
URL	http://dx.doi.org/10.5281/zenodo.3490267
URL	https://figshare.com/articles/Scraping_SERPs_to_Model_Ad-hoc_Ranking_Profiles/11535759
URL	http://dx.doi.org/10.5281/zenodo.3490268

Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field	Value
Access Right	Open Access

Attribution

Description: Authorships and contributors

Field	Value
Author	Breuer, Timo, 0000-0002-1765-2449
Author	Schaer, Philipp, 0000-0002-8817-4632

Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field	Value
Collected From	Zenodo; Datacite; figshare
Hosted By	Zenodo; figshare
Publication Date	2019-10-15
Publisher	Zenodo

Additional Info

Field	Value
Language	UNKNOWN
Resource Type	Dataset
system:type	dataset

Management Info

Field	Value
Source	https://science-innovation-policy.openaire.eu/search/dataset?datasetId=dedup_wf_001::ce83aace330ddac71d0e33dba232f0ff
Author	jsonws_user
Last Updated	10 January 2021, 20:30 (CET)
Created	10 January 2021, 20:30 (CET)