elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Contact | Deutsch
Fontsize: [-] Text [+]

Problems of Fuzzy c-Means Clustering and Similar Algorithms with High Dimensional Data Sets

Winkler, Roland and Klawonn, Frank and Kruse, Rudolf (2010) Problems of Fuzzy c-Means Clustering and Similar Algorithms with High Dimensional Data Sets. In: Advances in Data Analysis, Data Handling and Business Intelligence. Sringer. 34th Annual Conference GfKl 2010, 3rd German-Japanese Workshop, 20.-23. Jul 2010, Karlsruhe.

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
9MB

Abstract

Fuzzy c-Means and its derivatives work very well on most clustering problems. However, FcM and many similar algorithms have their problems with high dimensional data sets and a large number of prototypes. Similar algorithms in this context are those, which generate fuzzy membership values by using a ratio of distances to ensure a sum of membership values of 1. Possibilistic clustering is explicitly of no concern because the degrees of possibility are computed for each cluster individually. In this paper, we exploit some structural problems using the ratio of distances as normalisation method in high dimensional spaces. We also show that a high number of prototypes influences the clustering procedure in a similar way as a high number of dimensions. Both effects are not entirely independent since the number of dimensions can be effectively reduced if the number of prototypes is smaller than the number of dimensions.

Document Type:Conference or Workshop Item (Speech, Paper)
Title:Problems of Fuzzy c-Means Clustering and Similar Algorithms with High Dimensional Data Sets
Authors:
AuthorsInstitution or Email of Authors
Winkler, Rolandroland.winkler@dlr.de
Klawonn, Frankf.klawonn@fh-wolfenbuettel.de
Kruse, Rudolfkruse@iws.cs.uni-magdeburg.de
Date:July 2010
Journal or Publication Title:Advances in Data Analysis, Data Handling and Business Intelligence
Refereed publication:Yes
In ISI Web of Science:No
Publisher:Sringer
Series Name:Studies in Classification, Data Analysis, and Knowledge Organization
Status:Published
Keywords:Fuzzy c-means, clustering, high dimensionsional data sets
Event Title:34th Annual Conference GfKl 2010, 3rd German-Japanese Workshop
Event Location:Karlsruhe
Event Type:international Conference, Workshop
Event Dates:20.-23. Jul 2010
Organizer:German Classification Society
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Aeronautics
HGF - Program Themes:ATM and Operation
DLR - Research area:Aeronautics
DLR - Program:L AO - Air Traffic Management and Operation
DLR - Research theme (Project):L - Effiziente Flugführung und Flugbetrieb (old)
Location: Braunschweig
Institutes and Institutions:Institute of Flight Control > Air traffic systems
Deposited By: Roland Winkler
Deposited On:24 Aug 2010 14:07
Last Modified:12 Dec 2013 20:59

Repository Staff Only: item control page

Browse
Search
Help & Contact
Informationen
electronic library is running on EPrints 3.3.12
Copyright © 2008-2012 German Aerospace Center (DLR). All rights reserved.