elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Uncertainty Quantification in CLIP using Ensemble Methods

Olbrich, Stephan (2024) Uncertainty Quantification in CLIP using Ensemble Methods. Master's, Friedrich Schiller Universität Jena.

Full text not available from this repository.

Abstract

The multimodal image-text model CLIP, trained with a constrastive objective, shows impressive generalization abilities in downstream tasks, such as zero-shot classification. However, the model’s performance is less promising in several complex tasks, such as traffic sign and satellite image classification. In this thesis, we study the quantification of CLIP’s data bias by examining its learned embedding space. In addition, we investigate methods to quantify the uncertainty in CLIP’s zero-shot predictions. To do this, we conduct experiments on five datasets on which CLIP has a wide range of zero-shot classification accuracies, using baseline methods and extending the use of existing data augmentation methods. Furthermore, we pursue and develop a novel approach of transferring ensemble methods to CLIP, and present three computationally efficient variants. The results show that the presented methods are capable of quantifying model uncertainty for given inputs, but also suggest that there is room for improvement. Regarding our novel approach of using ensembles, the results show that we can match, and in some cases exceed by over 6%, the accuracy of the original CLIP model. We also show that the ensembles can detect the datasets with high uncertainty, indicating CLIP’s biases.

Item URL in elib:https://elib.dlr.de/211817/
Document Type:Thesis (Master's)
Title:Uncertainty Quantification in CLIP using Ensemble Methods
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Olbrich, Stephanstephan.olbrich (at) dlr.deUNSPECIFIEDUNSPECIFIED
Date:2024
Open Access:No
Status:Published
Keywords:Contrastive Learning, Multimodal, Pre-Trained, CLIP, Zero-Shot, Data- Bias, Data Augmentation, Ensembles
Institution:Friedrich Schiller Universität Jena
Department:Fakultät für Mathematik und Informatik
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Space System Technology
DLR - Research area:Raumfahrt
DLR - Program:R SY - Space System Technology
DLR - Research theme (Project):R - Machine Learning
Location: Jena
Institutes and Institutions:Institute of Data Science > Data Analysis and Intelligence
Deposited By: Gawlikowski, Jakob
Deposited On:14 Jan 2025 10:03
Last Modified:14 Jan 2025 10:03

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.