elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Impressum | Datenschutz | Kontakt | English
Schriftgröße: [-] Text [+]

Open Search @ DLR - towards transparent access to web-based information in science

Voigt, Stefan und Hecking, Tobias und Jankoswski, Dennis und Möller, Julius und Schwinger, Maximilian (2021) Open Search @ DLR - towards transparent access to web-based information in science. OSSYM 2021 - 3rd International Open Search Symposium, 11-13 October 2021, Genf, Schweiz.

[img] PDF
23kB

Kurzfassung

Data is the raw material of the 21st century - for research, innovation, economy and society. Digital sovereignty requires free, uninfluenced & traceable access to information - in other words, open Internet search and systematic access to web data. Currently, there is a monopoly in information search: In Europe, more than 90% of all Internet searches are conducted via a single commercial and advertising-optimized search engine. This holds immense potential for intentional or unintentional manipulation in access to data, information, technology and knowledge (cognitive/economic bias). Especially for science, new concepts for a distributed and open Internet search infrastructure are needed. The wealth of data and information on the web must be rendered more accessible through uninfluenced discovery of scientific data and information, since it is the basis for free research and innovation. Against this background, the German Aerospace Center (DLR) is contributing to the European Open Search Initiative, formed by science and computing centres. Within the Open Search @ DLR project, existing in-house capacities and know-how in data access and search are identified and pooled to set-up a cooperative crawling, indexing and search capability to web data repositories – internal and external to DLR. Furthermore, dedicated pilot applications in areas such as information retrieval, knowledge management or information evaluation and transparency, making use of the infrastructure, are developed in the project. A primary focus of the Open Search @ DLR project is networking of in-house expertise as well as connecting with the Europe-wide Open Search Initiative. Within this talk we present the project layout and findings during the first project phase. This includes inventorying of in-house data and heterogeneous information repositories, coordinated crawling, indexing and searching. We present architecture and set-up of a testbed for cooperative crawling, where single crawling nodes communicate URLs to crawl in a peer-to-peer fashion as basis for joint assembly of large corpora of web data. In a second part of the talk scientific pilot applications of an open search infrastructure are discussed, including the use of georeferenced data from web- and database sources, 1 stefan.voigt@dlr.de e.g. for monitoring of news, events, geospatial analysis and early warning. Furthermore, open search approaches for exploring, linking, and indexing of information from heterogeneous scientific data sources and public web content are particularly being addressed. This includes access to (semi-)structured information in databases as well as information extraction from texts, e.g. automatic geo-tagging. In this context, especially the establishment of geographical connections between scientific, structured databases and human-readable content from the Internet play an important role. In the last part of the talk first ideas and concepts for a long-term activity of science and computing centres to set up an open Internet search ecosystem are discussed. Such a shared activity should be based on cooperative computing, open-source software stacks and public moderation and should involve distributed scientific highperformance computing and cloud facilities forming a cooperative open search infrastructure to warrant a longterm, public and open web search environment. As long as the digital sphere – the web – exists, free and unbiased orientation therein has to be ensured to guarantee free and unbiased access to information for science, economy and society as a whole.

elib-URL des Eintrags:https://elib.dlr.de/145817/
Dokumentart:Konferenzbeitrag (Vortrag)
Titel:Open Search @ DLR - towards transparent access to web-based information in science
Autoren:
AutorenInstitution oder E-Mail-AdresseAutoren-ORCID-iDORCID Put Code
Voigt, Stefanstefan.voigt (at) dlr.dehttps://orcid.org/0000-0002-5908-331XNICHT SPEZIFIZIERT
Hecking, TobiasTobias.Hecking (at) dlr.dehttps://orcid.org/0000-0003-0833-7989NICHT SPEZIFIZIERT
Jankoswski, DennisOFFIS Institute for Computer Science, Oldenburg, GermanyNICHT SPEZIFIZIERTNICHT SPEZIFIZIERT
Möller, JuliusUniversity of Oldenburg, GermanyNICHT SPEZIFIZIERTNICHT SPEZIFIZIERT
Schwinger, MaximilianMaximilian.Schwinger (at) dlr.deNICHT SPEZIFIZIERTNICHT SPEZIFIZIERT
Datum:2021
Referierte Publikation:Nein
Open Access:Ja
Gold Open Access:Nein
In SCOPUS:Nein
In ISI Web of Science:Nein
Status:veröffentlicht
Stichwörter:open search open web search web-based information science
Veranstaltungstitel:OSSYM 2021 - 3rd International Open Search Symposium
Veranstaltungsort:Genf, Schweiz
Veranstaltungsart:internationale Konferenz
Veranstaltungsdatum:11-13 October 2021
Veranstalter :CERN / Open Search Foundation
HGF - Forschungsbereich:Luftfahrt, Raumfahrt und Verkehr
HGF - Programm:Raumfahrt
HGF - Programmthema:Erdbeobachtung
DLR - Schwerpunkt:Raumfahrt
DLR - Forschungsgebiet:R EO - Erdbeobachtung
DLR - Teilgebiet (Projekt, Vorhaben):R - Geoprodukte u. - Systeme, Services
Standort: Oberpfaffenhofen
Institute & Einrichtungen:Deutsches Fernerkundungsdatenzentrum
Institut für Softwaretechnologie > Intelligente und verteilte Systeme
Hinterlegt von: Voigt, Dr. Stefan
Hinterlegt am:24 Nov 2021 10:47
Letzte Änderung:14 Jan 2022 11:56

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags

Blättern
Suchen
Hilfe & Kontakt
Informationen
electronic library verwendet EPrints 3.3.12
Gestaltung Webseite und Datenbank: Copyright © Deutsches Zentrum für Luft- und Raumfahrt (DLR). Alle Rechte vorbehalten.