Behzadi, Sahar and Valizadeh, Mahyar and Staab, Jeroen and Dallavalle, Marco and Nikolaou, Nikolaos and Peters, Annette and Schneider, Alexandra and Taubenböck, Hannes and Wolf, Kathrin (2022) Comparison of machine learning methods for the prediction of cardiovascular mortality from environmental and socio-economic neighbourhood factors. In: 34th Annual Conference of the International Society for Environmental Epidemiology - Abstract Book, p. 459. 34th Annual Conference of the International Society for Environmental Epidemiology, 2022-09-18 - 2022-09-21, Athen, Griechenland.
PDF
- Only accessible within DLR
200kB |
Official URL: https://isee2022.org/wp-content/uploads/2022/09/ISEE_2022_Abstract_Book.pdf
Abstract
BACKGROUND AND AIM: Several environmental and socio-economic neighbourhood factors have been reported to majorly impact human health, but their interplay is not yet well understood as simultaneous analyses are sparse. We aimed to identify the driving contextual factors and assess their predictive ability for cardiovascular disease (CVD) mortality in a neighbourhood setting by comparing several machine learning methods. METHODS: We obtained the number of CVD deaths per county for entire Germany for 2017. Most socio-economic factors (e.g. proportion of unemployed, foreigners, household income, deprivation index) were also only available on a county level, whereas most environmental factors (e.g. imperviousness, greenness, air pollution, noise, building density) could be gained on a higher spatial resolution (10m-2.4km). All data was aggregated to a 5km*5km grid to compile compliant prediction maps. In addition to traditional linear and additive regression models, we applied one neighbour-based method and several statistical, ensemble and deep learning approaches using a random search strategy for hyper-parameter tuning. Variable importance was assessed by SHapely Additive exPlanations (SHAP) values. RESULTS: CVD mortality for the 5km grid ranged from 2.5 to 8.1 per 10,000 residents with a mean of 4.7 (standard deviation 1.0). The models performed well in the training phase with R² between 0.85-1.00, mean squared error (MSE) between 0.001-0.005, and moderate to well in our test data (R²: 0.27-0.66; MSE: 0.011-0.024). Most models identified the deprivation index as the most important predictor followed by the proportion of foreigners, unemployed, median income and air pollution. Comparison of our predicted and observed CVD mortality rates showed high correlations for all models (Spearman r: 0.69-0.82), though prediction maps indicated heterogeneity in spatial patterns. CONCLUSIONS: The selected approaches differed in their predictive ability but identified similar predictors as the main drivers for CVD mortality.
Item URL in elib: | https://elib.dlr.de/187970/ | ||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Document Type: | Conference or Workshop Item (Poster) | ||||||||||||||||||||||||||||||||||||||||
Title: | Comparison of machine learning methods for the prediction of cardiovascular mortality from environmental and socio-economic neighbourhood factors | ||||||||||||||||||||||||||||||||||||||||
Authors: |
| ||||||||||||||||||||||||||||||||||||||||
Date: | 19 September 2022 | ||||||||||||||||||||||||||||||||||||||||
Journal or Publication Title: | 34th Annual Conference of the International Society for Environmental Epidemiology - Abstract Book | ||||||||||||||||||||||||||||||||||||||||
Refereed publication: | No | ||||||||||||||||||||||||||||||||||||||||
Open Access: | No | ||||||||||||||||||||||||||||||||||||||||
Gold Open Access: | No | ||||||||||||||||||||||||||||||||||||||||
In SCOPUS: | No | ||||||||||||||||||||||||||||||||||||||||
In ISI Web of Science: | No | ||||||||||||||||||||||||||||||||||||||||
Page Range: | p. 459 | ||||||||||||||||||||||||||||||||||||||||
Status: | Published | ||||||||||||||||||||||||||||||||||||||||
Keywords: | Machine learning, prediction, cardiovascular mortality, environment, spatial predictors | ||||||||||||||||||||||||||||||||||||||||
Event Title: | 34th Annual Conference of the International Society for Environmental Epidemiology | ||||||||||||||||||||||||||||||||||||||||
Event Location: | Athen, Griechenland | ||||||||||||||||||||||||||||||||||||||||
Event Type: | international Conference | ||||||||||||||||||||||||||||||||||||||||
Event Start Date: | 18 September 2022 | ||||||||||||||||||||||||||||||||||||||||
Event End Date: | 21 September 2022 | ||||||||||||||||||||||||||||||||||||||||
Organizer: | ISEE | ||||||||||||||||||||||||||||||||||||||||
HGF - Research field: | Aeronautics, Space and Transport | ||||||||||||||||||||||||||||||||||||||||
HGF - Program: | Space | ||||||||||||||||||||||||||||||||||||||||
HGF - Program Themes: | Earth Observation | ||||||||||||||||||||||||||||||||||||||||
DLR - Research area: | Raumfahrt | ||||||||||||||||||||||||||||||||||||||||
DLR - Program: | R EO - Earth Observation | ||||||||||||||||||||||||||||||||||||||||
DLR - Research theme (Project): | R - Remote Sensing and Geo Research, R - Geoscientific remote sensing and GIS methods | ||||||||||||||||||||||||||||||||||||||||
Location: | Oberpfaffenhofen | ||||||||||||||||||||||||||||||||||||||||
Institutes and Institutions: | German Remote Sensing Data Center > Geo Risks and Civil Security | ||||||||||||||||||||||||||||||||||||||||
Deposited By: | Staab, Jeroen | ||||||||||||||||||||||||||||||||||||||||
Deposited On: | 08 Nov 2022 10:33 | ||||||||||||||||||||||||||||||||||||||||
Last Modified: | 24 Apr 2024 20:49 |
Repository Staff Only: item control page