Sun, Yao und Abdelsalam, Ahmed und Xue, Xizhe und Aravena Pelizari, Patrick und Geiß, Christian (2026) Vision-Language Models for Structural Exposure Modeling from Street-Level Imagery. EGU General Assembly 2026, 2026-05-03 - 2026-05-08, Vienna, Austria. (im Druck)
|
PDF
109kB |
Kurzfassung
Detailed information on building attributes, such as construction materials and structural types, is a fundamental prerequisite for accurate natural hazard risk assessment. Recent deep learning approaches based on convolutional neural networks (CNNs) have demonstrated the effectiveness of extracting such exposure-related information from street-level imagery, establishing a solid foundation for data-driven building characterization. This study is motivated by the emerging capabilities of vision language models (VLMs), which leverage large-scale pretraining and generalized visual semantic reasoning to provide a unified framework for interpreting complex urban scenes. To assess their effectiveness in structural exposure modeling, we conducted comparative experiments using zero-shot inference and fine-tuning strategies. The dataset consists of over 29,000 annotated street-level façade images from the earthquake-prone region of Santiago, Chile. The zero-shot results indicate that general-purpose off-the-shelf VLMs (e.g., InternVL2-8B) struggle to accurately infer complex structural engineering attributes due to insufficient domain-specific knowledge. In contrast, fine-tuning based on InternVL3-2B yields a substantial performance improvement: the model achieves high accuracy in building height estimation (90.6%) and roof shape classification (87.0%), and demonstrates strong performance in predicting lateral load-resisting system materials (78.8%) and complex seismic building structural types (SBST, 72.6%). These results suggest that, fine-tuned VLMs can effectively acquire domain expertise, enabling scalable and low-cost exposure modeling. Future work will further investigate the potential of VLMs to infer latent structural characteristics through semantic reasoning.
| elib-URL des Eintrags: | https://elib.dlr.de/223432/ | ||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dokumentart: | Konferenzbeitrag (Poster) | ||||||||||||||||||||||||
| Titel: | Vision-Language Models for Structural Exposure Modeling from Street-Level Imagery | ||||||||||||||||||||||||
| Autoren: |
| ||||||||||||||||||||||||
| Datum: | 2026 | ||||||||||||||||||||||||
| Referierte Publikation: | Nein | ||||||||||||||||||||||||
| Open Access: | Ja | ||||||||||||||||||||||||
| Gold Open Access: | Nein | ||||||||||||||||||||||||
| In SCOPUS: | Nein | ||||||||||||||||||||||||
| In ISI Web of Science: | Nein | ||||||||||||||||||||||||
| Status: | im Druck | ||||||||||||||||||||||||
| Stichwörter: | Vision-Language Models (VLMs), Exposure Modeling, Building Attributes, Street-Level Imagery (SLI) | ||||||||||||||||||||||||
| Veranstaltungstitel: | EGU General Assembly 2026 | ||||||||||||||||||||||||
| Veranstaltungsort: | Vienna, Austria | ||||||||||||||||||||||||
| Veranstaltungsart: | internationale Konferenz | ||||||||||||||||||||||||
| Veranstaltungsbeginn: | 3 Mai 2026 | ||||||||||||||||||||||||
| Veranstaltungsende: | 8 Mai 2026 | ||||||||||||||||||||||||
| HGF - Forschungsbereich: | Luftfahrt, Raumfahrt und Verkehr | ||||||||||||||||||||||||
| HGF - Programm: | Verkehr | ||||||||||||||||||||||||
| HGF - Programmthema: | Straßenverkehr | ||||||||||||||||||||||||
| DLR - Schwerpunkt: | Verkehr | ||||||||||||||||||||||||
| DLR - Forschungsgebiet: | V ST Straßenverkehr | ||||||||||||||||||||||||
| DLR - Teilgebiet (Projekt, Vorhaben): | V - V&V4NGC - Methoden, Prozesse und Werkzeugketten für die Validierung & Verifikation von NGC, R - Optische Fernerkundung | ||||||||||||||||||||||||
| Standort: | Oberpfaffenhofen | ||||||||||||||||||||||||
| Institute & Einrichtungen: | Institut für Methodik der Fernerkundung > Photogrammetrie und Bildanalyse Deutsches Fernerkundungsdatenzentrum > Georisiken und zivile Sicherheit | ||||||||||||||||||||||||
| Hinterlegt von: | Sun, Yao | ||||||||||||||||||||||||
| Hinterlegt am: | 25 Mär 2026 11:32 | ||||||||||||||||||||||||
| Letzte Änderung: | 30 Mär 2026 16:18 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags