
Problems of Fuzzy c-Means Clustering and Similar Algorithms with High Dimensional Data Sets

Winkler, Roland and Klawonn, Frank and Kruse, Rudolf (2010) Problems of Fuzzy c-Means Clustering and Similar Algorithms with High Dimensional Data Sets. In: Advances in Data Analysis, Data Handling and Business Intelligence. Springer. 34th Annual Conference GfKl 2010, 3rd German-Japanese Workshop, 2010-07-20 - 2010-07-23, Karlsruhe.

Abstract

Fuzzy c-Means (FcM) and its derivatives work very well on most clustering problems. However, FcM and many similar algorithms run into problems with high-dimensional data sets and a large number of prototypes. Similar algorithms in this context are those that generate fuzzy membership values by using a ratio of distances to ensure that the membership values sum to 1. Possibilistic clustering is explicitly excluded because the degrees of possibility are computed for each cluster individually. In this paper, we examine some structural problems that arise from using the ratio of distances as a normalisation method in high-dimensional spaces. We also show that a high number of prototypes influences the clustering procedure in a similar way to a high number of dimensions. The two effects are not entirely independent, since the number of dimensions can be effectively reduced if the number of prototypes is smaller than the number of dimensions.
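
The ratio-of-distances normalisation mentioned in the abstract can be illustrated with a minimal sketch. It assumes the textbook FcM membership update, u_i = 1 / sum_k (d_i / d_k)^(2/(m-1)), with Euclidean distances and fuzzifier m = 2. The function names, the uniform random test data, and all parameter values are hypothetical choices for illustration and are not taken from the paper.

```python
import math
import random


def fcm_memberships(point, prototypes, m=2.0):
    """Textbook FcM membership of one point to each prototype (fuzzifier m > 1)."""
    dists = [math.dist(point, p) for p in prototypes]
    # A point that coincides with one or more prototypes gets all membership there.
    zero = [i for i, d in enumerate(dists) if d == 0.0]
    if zero:
        return [1.0 / len(zero) if i in zero else 0.0 for i in range(len(dists))]
    exponent = 2.0 / (m - 1.0)
    # The ratio form forces the memberships of a point to sum to 1.
    return [1.0 / sum((d_i / d_k) ** exponent for d_k in dists) for d_i in dists]


def mean_membership_spread(dim, n_prototypes=4, n_points=200, seed=0):
    """Average (max - min) membership over uniform random points (assumed data)."""
    rng = random.Random(seed)
    protos = [[rng.random() for _ in range(dim)] for _ in range(n_prototypes)]
    total = 0.0
    for _ in range(n_points):
        x = [rng.random() for _ in range(dim)]
        u = fcm_memberships(x, protos)
        total += max(u) - min(u)
    return total / n_points


if __name__ == "__main__":
    for dim in (2, 20, 500):
        print(dim, round(mean_membership_spread(dim), 3))
```

Under these assumptions, the spread between the largest and smallest membership shrinks as the dimension grows, because the distances from a point to all prototypes become nearly equal and each ratio approaches 1. This is only a toy demonstration of the normalisation effect on unstructured data, not a reproduction of the paper's experiments.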

Item URL in elib:https://elib.dlr.de/64654/
Document Type:Conference or Workshop Item (Speech, Paper)
Title:Problems of Fuzzy c-Means Clustering and Similar Algorithms with High Dimensional Data Sets
Authors:
Authors | Institution or Email of Authors | Author's ORCID iD | ORCID Put Code
Winkler, Roland | UNSPECIFIED | UNSPECIFIED | UNSPECIFIED
Klawonn, Frank | UNSPECIFIED | UNSPECIFIED | UNSPECIFIED
Kruse, Rudolf | UNSPECIFIED | UNSPECIFIED | UNSPECIFIED
Date:July 2010
Journal or Publication Title:Advances in Data Analysis, Data Handling and Business Intelligence
Refereed publication:Yes
Open Access:Yes
Gold Open Access:No
In SCOPUS:No
In ISI Web of Science:No
Publisher:Springer
Series Name:Studies in Classification, Data Analysis, and Knowledge Organization
Status:Published
Keywords:Fuzzy c-means, clustering, high dimensional data sets
Event Title:34th Annual Conference GfKl 2010, 3rd German-Japanese Workshop
Event Location:Karlsruhe
Event Type:International Conference, Workshop
Event Start Date:20 July 2010
Event End Date:23 July 2010
Organizer:German Classification Society
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Aeronautics
HGF - Program Themes:ATM and Operation (old)
DLR - Research area:Aeronautics
DLR - Program:L AO - Air Traffic Management and Operation
DLR - Research theme (Project):L - Effiziente Flugführung und Flugbetrieb (old)
Location: Braunschweig
Institutes and Institutions:Institute of Flight Guidance > Air traffic systems
Deposited By: Winkler, Roland
Deposited On:24 Aug 2010 14:07
Last Modified:24 Apr 2024 19:29
