elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

A Test Collection for Dataset Retrieval in Biodiversity Research

Löffler, Felicitas and Schuldt, Andreas and König-Ries, Birgitta and Bruelheide, Helge and Klan, Friederike (2021) A Test Collection for Dataset Retrieval in Biodiversity Research. Research Ideas and Outcomes, 7. Pensoft. doi: 10.3897/rio.7.e67887. ISSN 2367-7163.

[img] PDF - Published version
207kB

Official URL: http://dx.doi.org/10.3897/rio.7.e67887

Abstract

Searching for scientific datasets is a prominent task in scholars' daily research practice. A variety of data publishers, archives and data portals offer search applications that allow the discovery of datasets. The evaluation of such dataset retrieval systems requires proper test collections, including questions that reflect real world information needs of scholars, a set of datasets and human judgements assessing the relevance of the datasets to the questions in the benchmark corpus. Unfortunately, only very few test collections exist for a dataset search. In this paper, we introduce the BEF-China test collection, the very first test collection for dataset retrieval in biodiversity research, a research field with an increasing demand in data discovery services. The test collection consists of 14 questions, a corpus of 372 datasets from the BEF-China project and binary relevance judgements provided by a biodiversity expert.

Item URL in elib:https://elib.dlr.de/146507/
Document Type:Article
Title:A Test Collection for Dataset Retrieval in Biodiversity Research
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Löffler, FelicitasFriedrich-Schiller-Universität Jenahttps://orcid.org/0000-0001-6423-7427UNSPECIFIED
Schuldt, AndreasUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
König-Ries, BirgittaFriedrich-Schiller-Universität Jenahttps://orcid.org/0000-0002-2382-9722UNSPECIFIED
Bruelheide, HelgeUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Klan, FriederikeUNSPECIFIEDhttps://orcid.org/0000-0002-1856-7334UNSPECIFIED
Date:26 May 2021
Journal or Publication Title:Research Ideas and Outcomes
Refereed publication:Yes
Open Access:Yes
Gold Open Access:Yes
In SCOPUS:No
In ISI Web of Science:No
Volume:7
DOI:10.3897/rio.7.e67887
Publisher:Pensoft
ISSN:2367-7163
Status:Published
Keywords:dataset search, dataset retrieval, test collection, biodiversity research
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Space System Technology
DLR - Research area:Raumfahrt
DLR - Program:R SY - Space System Technology
DLR - Research theme (Project):R - Development of a method set for effective access to research data in data portals (MEDIAS)
Location: Jena
Institutes and Institutions:Institute of Data Science > Citizen Science
Deposited By: Klan, Dr. Friederike
Deposited On:30 Nov 2021 13:33
Last Modified:03 Dec 2021 13:25

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.