Mast, Johannes und Lemoine-Rodriguez, Richard und Rittlinger, Vanessa und Mühlbauer, Martin und Biewer, Carolin und Geiß, Christian und Taubenböck, Hannes (2022) Geospatiality of Topics in English Text Data. 11th International Conference on Computational Social Science, 2025-07-21 - 2025-07-24, Norrköping, Schweden.
![]() |
PDF
2MB |
Kurzfassung
Many scientific disciplines have experienced an increased interest in the role of geographic location in what is often referred to as a “spatial turn”. Web texts that mention locations can be analyzed spatially, making them a promising data source. However, not all texts are equally likely to mention locations. The topic matters – but how much? Across several millions of texts from different web platforms and text types, we algorithmically (A) detected whether they contained an identifiable spatial location and (B) identified the general topic they contained, following a custom taxonomy of 19 topics derived in an inductive coding approach. We modeled the likelihood of geoinformation versus topic in a mixed-modeling approach, controlling for effects of author, time, and text length. A. Topic-specific effects on geospatiality were assessed via the topics’ fixed effects in the model. B. Similarity between geotagging and geoparsing approaches was analyzed using data from Twitter, where we compared the results based on geoparsed location mentions (non-geotagged) to results based on geotags. C. Similarity across datasets was measured using Spearman’s rank correlation coefficient. Results show: A. Observed frequency of mentioned locations varied strongly by topic, being high for the category Travel, Tourism & Migration and low for Private Life, Family & Relationships. The models attribute this largely to the effect of the topics. B. Substantially higher (and sometimes opposed) effects in geoparsingbased than in geotagging-based evaluation. C. Effects of topics largely correlated between datasets, moreso if datasets are similar in type and context. Conclusion: The topic of a text is a crucial factor in determining the presence of geolocation information. Recommendation: Consider and probe relationship between the analyzed content and frequency of georeferences to correctly interpret results. Outlook: Continue evaluation with different data, geoparsing methods, and different definitions of Geospatiality.
elib-URL des Eintrags: | https://elib.dlr.de/216492/ | ||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Dokumentart: | Konferenzbeitrag (Poster) | ||||||||||||||||||||||||||||||||
Titel: | Geospatiality of Topics in English Text Data | ||||||||||||||||||||||||||||||||
Autoren: |
| ||||||||||||||||||||||||||||||||
Datum: | Juli 2022 | ||||||||||||||||||||||||||||||||
Referierte Publikation: | Nein | ||||||||||||||||||||||||||||||||
Open Access: | Ja | ||||||||||||||||||||||||||||||||
Gold Open Access: | Nein | ||||||||||||||||||||||||||||||||
In SCOPUS: | Nein | ||||||||||||||||||||||||||||||||
In ISI Web of Science: | Nein | ||||||||||||||||||||||||||||||||
Status: | veröffentlicht | ||||||||||||||||||||||||||||||||
Stichwörter: | Geography, NLP, Geoparsing, Textdaten, Topics | ||||||||||||||||||||||||||||||||
Veranstaltungstitel: | 11th International Conference on Computational Social Science | ||||||||||||||||||||||||||||||||
Veranstaltungsort: | Norrköping, Schweden | ||||||||||||||||||||||||||||||||
Veranstaltungsart: | internationale Konferenz | ||||||||||||||||||||||||||||||||
Veranstaltungsbeginn: | 21 Juli 2025 | ||||||||||||||||||||||||||||||||
Veranstaltungsende: | 24 Juli 2025 | ||||||||||||||||||||||||||||||||
Veranstalter : | Linköping University | ||||||||||||||||||||||||||||||||
HGF - Forschungsbereich: | Luftfahrt, Raumfahrt und Verkehr | ||||||||||||||||||||||||||||||||
HGF - Programm: | Raumfahrt | ||||||||||||||||||||||||||||||||
HGF - Programmthema: | Technik für Raumfahrtsysteme | ||||||||||||||||||||||||||||||||
DLR - Schwerpunkt: | Raumfahrt | ||||||||||||||||||||||||||||||||
DLR - Forschungsgebiet: | R SY - Technik für Raumfahrtsysteme | ||||||||||||||||||||||||||||||||
DLR - Teilgebiet (Projekt, Vorhaben): | R - Big Data und KI für die Entscheidungsunterstützung | ||||||||||||||||||||||||||||||||
Standort: | Oberpfaffenhofen | ||||||||||||||||||||||||||||||||
Institute & Einrichtungen: | Deutsches Fernerkundungsdatenzentrum > Georisiken und zivile Sicherheit | ||||||||||||||||||||||||||||||||
Hinterlegt von: | Mast, Johannes | ||||||||||||||||||||||||||||||||
Hinterlegt am: | 25 Sep 2025 13:37 | ||||||||||||||||||||||||||||||||
Letzte Änderung: | 25 Sep 2025 13:37 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags