elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Impressum | Datenschutz | Kontakt | English
Schriftgröße: [-] Text [+]

3D Mesh Segmentation Using Transformer Based Graph Operations

Katam, Harinandan (2021) 3D Mesh Segmentation Using Transformer Based Graph Operations. DLR-Interner Bericht. DLR-IB-RM-OP-2021-10. Masterarbeit. Technische Universität München.

[img] PDF
15MB

Kurzfassung

Deep Learning and especially convolutions have been a massive success in computer vision tasks such as Segmentation, Object Detection, and others. However, all of these are limited to 2D images, whereas the progress in the 3D domain has been limited. Extending these priors works to the 3D domain is sadly not straightforward. The biggest challenge here is the unstructured representation of 3D data such as meshes or point clouds. While other works use voxel grids, which have structured representation, they usually struggle with computation time and memory. In this thesis, the task of extending a convolution operation to unstructured data and its problems along with potential solutions is explored. In this thesis, two new methods to the task of Mesh Segmentation are proposed. Both these methods are based on Transformer networks and their components. In the first method, a first of its kind application of transformers to the task of Mesh Segmentation is proposed. In the second method, a permutation invariant Graph Convolution layer named TransConv is proposed which acts similar to a convolution operation on images and can be used in any model architecture. In addition to these methods, two extensions are proposed that improve the performance of both our methods. The first extension is to use depth encoding to add more information about the geodesic distance to the model. The second extension is to extend the concept of atrous convolutions in images to meshes. All of our methods and extensions are evaluated on two datasets and compared with other related works. The first dataset is a collection of high-resolution meshes called the Coseg dataset. The second dataset is a collection of point clouds of 3D objects called ShapeNet part annotation. Our proposed graph convolution layer TransConv outperforms other related works in both the datasets. However, our method to use transformers for mesh segmentation produced comparable results.

elib-URL des Eintrags:https://elib.dlr.de/140566/
Dokumentart:Berichtsreihe (DLR-Interner Bericht, Masterarbeit)
Titel:3D Mesh Segmentation Using Transformer Based Graph Operations
Autoren:
AutorenInstitution oder E-Mail-AdresseAutoren-ORCID-iDORCID Put Code
Katam, HarinandanHarinandan.Katam (at) dlr.deNICHT SPEZIFIZIERTNICHT SPEZIFIZIERT
Datum:Januar 2021
Referierte Publikation:Nein
Open Access:Ja
Status:veröffentlicht
Stichwörter:Deep Learning, Mesh Segmentation, Semantic Segmentation
Institution:Technische Universität München
Abteilung:Fakultät für Informatik
HGF - Forschungsbereich:Luftfahrt, Raumfahrt und Verkehr
HGF - Programm:Raumfahrt
HGF - Programmthema:Robotik
DLR - Schwerpunkt:Raumfahrt
DLR - Forschungsgebiet:R RO - Robotik
DLR - Teilgebiet (Projekt, Vorhaben):R - Multisensorielle Weltmodellierung (RM) [RO]
Standort: Oberpfaffenhofen
Institute & Einrichtungen:Institut für Robotik und Mechatronik (ab 2013) > Perzeption und Kognition
Hinterlegt von: Denninger, Maximilian
Hinterlegt am:21 Jan 2021 11:56
Letzte Änderung:15 Nov 2021 13:11

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags

Blättern
Suchen
Hilfe & Kontakt
Informationen
electronic library verwendet EPrints 3.3.12
Gestaltung Webseite und Datenbank: Copyright © Deutsches Zentrum für Luft- und Raumfahrt (DLR). Alle Rechte vorbehalten.