Farzana, Sheikh Mastura und Hecking, Tobias (2024) Towards a Scalable Geoparsing Approach for the Web. In: 46th European Conference on Information Retrieval, ECIR 2024. CEUR Workshop Proceedings. GeoExT 2024: Second International Workshop on Geographic Information Extraction from Texts at ECIR 2024, 2024-03-24, Glasgow, Scotland. ISBN 978-303156068-2. ISSN 0302-9743.
PDF
336kB |
Offizielle URL: https://ceur-ws.org/Vol-3683/paper4.pdf
Kurzfassung
The ongoing surge in web data generation and storage, coupled with embedded geographic information, holds immense potential for enhancing search applications across diverse domains. However, extracting geographic information for further enhancement of web search remains inadequately explored. This paper addresses a critical gap in the realm of geographic information extraction from web data, emphasizing the absence of unified pipelines for processing such information. In response to this void, we present a pipeline specifically tailored for web data. Furthermore, our contribution extends beyond the development of the pipeline itself to include a comparative analysis of various gazetteer-based geotagging methods in terms of accuracy and scalability along with a sizable corpora of location annotated web documents.
elib-URL des Eintrags: | https://elib.dlr.de/210768/ | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Dokumentart: | Konferenzbeitrag (Vorlesung) | ||||||||||||
Titel: | Towards a Scalable Geoparsing Approach for the Web | ||||||||||||
Autoren: |
| ||||||||||||
Datum: | 2024 | ||||||||||||
Erschienen in: | 46th European Conference on Information Retrieval, ECIR 2024 | ||||||||||||
Referierte Publikation: | Ja | ||||||||||||
Open Access: | Ja | ||||||||||||
Gold Open Access: | Nein | ||||||||||||
In SCOPUS: | Ja | ||||||||||||
In ISI Web of Science: | Ja | ||||||||||||
Verlag: | CEUR Workshop Proceedings | ||||||||||||
ISSN: | 0302-9743 | ||||||||||||
ISBN: | 978-303156068-2 | ||||||||||||
Status: | veröffentlicht | ||||||||||||
Stichwörter: | Geoparsing, Geographic Information Extraction, Geotagging, Geocoding | ||||||||||||
Veranstaltungstitel: | GeoExT 2024: Second International Workshop on Geographic Information Extraction from Texts at ECIR 2024 | ||||||||||||
Veranstaltungsort: | Glasgow, Scotland | ||||||||||||
Veranstaltungsart: | Workshop | ||||||||||||
Veranstaltungsdatum: | 24 März 2024 | ||||||||||||
HGF - Forschungsbereich: | keine Zuordnung | ||||||||||||
HGF - Programm: | keine Zuordnung | ||||||||||||
HGF - Programmthema: | keine Zuordnung | ||||||||||||
DLR - Schwerpunkt: | Digitalisierung | ||||||||||||
DLR - Forschungsgebiet: | D DAT - Daten | ||||||||||||
DLR - Teilgebiet (Projekt, Vorhaben): | D - OpenSearch@DLR | ||||||||||||
Standort: | Rhein-Sieg-Kreis | ||||||||||||
Institute & Einrichtungen: | Institut für Softwaretechnologie Institut für Softwaretechnologie > Intelligente und verteilte Systeme | ||||||||||||
Hinterlegt von: | Farzana, Sheikh Mastura | ||||||||||||
Hinterlegt am: | 16 Dez 2024 12:04 | ||||||||||||
Letzte Änderung: | 18 Dez 2024 13:09 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags