elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Development of a model for analysis of census data using statistical methods and machine learning

Aditya, Megha (2023) Development of a model for analysis of census data using statistical methods and machine learning. Master's, Universität Paderborn.

[img] PDF
17MB

Abstract

In the current energy crisis faced by Germany, building an energy map holds great importance for effective energy management and decision-making. This thesis contributes relevant information for constructing an energy map by classifying the number of residents per building. Building population estimation holds importance to identify the energy trends and patterns of building occupants which plays a significant role in managing energy needs and formulating appropriate energy management strategies. The study utilizes two primary datasets, namely 3D building models for the particular study area and the 2011 German census data. Due to data privacy policies, information about individual building occupants is unavailable and this particular information is important to understand not just the final energy demand per building or household but also understand energy consumption behaviour patterns. This thesis addresses this challenge by developing a specialized and comprehensive databank that integrates the building models with census data on population, housing, and buildings. Additionally, to classify the building’s number of residents, a supervised machine learning approach using the XGBoost algorithm was employed. In the absence of training datasets due to data laws, a synthetic training dataset was prepared by incorporating various building types such as single-family houses, multi-family houses, and other types. Three separate models were constructed, and their results were subsequently combined to enhance accuracy. The evaluation of accuracy per grid cell provides insights into the effectiveness of the classification process. The developed classification model showed an average accuracy of 62%, where the models are closely related to the building form classification. The findings of this research have implications for better energy management and decision-making in the face of the country’s energy crisis.

Item URL in elib:https://elib.dlr.de/204743/
Document Type:Thesis (Master's)
Title:Development of a model for analysis of census data using statistical methods and machine learning
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Aditya, MeghaUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:29 June 2023
Open Access:Yes
Number of Pages:86
Status:Published
Keywords:Machine Learning, Census 2011, QGIS
Institution:Universität Paderborn
Department:Computer Science
HGF - Research field:Energy
HGF - Program:Materials and Technologies for the Energy Transition
HGF - Program Themes:High-Temperature Thermal Technologies
DLR - Research area:Energy
DLR - Program:E SW - Solar and Wind Energy
DLR - Research theme (Project):E - Condition Monitoring
Location: Jülich
Institutes and Institutions:Institute of Solar Research > Qualification
Deposited By: Blanco Bohorquez, Luis Armando
Deposited On:23 Oct 2024 09:15
Last Modified:11 Nov 2024 13:34

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.