AsiaSim 2014: 14th International Conference on Systems Simulation, Kitakyushu, Japan, October 26-30, 2014. Proceedings
Cluster analysis means the organization of an unlabeled collection of objects or patterns into separate groups based on their similarity. The task of computerized data clustering has been approached from diverse domains of knowledge like graph theory, multivariate analysis, neural networks, fuzzy set theory, and so on. Clustering is often described as an unsupervised learning method but most of the traditional algorithms require a prior specification of the number of clusters in the data for guiding the partitioning process, thus making it not completely unsupervised. Modern data mining tools that predict future trends and behaviors for allowing businesses to make proactive and knowledge-driven decisions, demand fast and fully automatic clustering of very large datasets with minimal or no user intervention.
In this volume, we formulate clustering as an optimization problem, where the best partitioning of a given dataset is achieved by minimizing/maximizing one (single-objective clustering) or more (multi-objective clustering) objective functions. Using several real world applications, we illustrate the performance of several metaheuristics, particularly the Differential Evolution algorithm when applied to both single and multi-objective clustering problems, where the number of clusters is not known beforehand and must be determined on the run. This volume comprises of 7 chapters including an introductory chapter giving the fundamental definitions and the last Chapter provides some important research challenges.
Academics, scientists as well as engineers engaged in research, development and application of optimization techniques and data mining will find the comprehensive coverage of this book invaluable.
Research on the problem of clustering tends to be fragmented across the pattern recognition, database, data mining, and machine learning communities. Addressing this problem in a unified way, Data Clustering: Algorithms and Applications provides complete coverage of the entire area of clustering, from basic methods to more refined and complex data clustering approaches. It pays special attention to recent issues in graphs, social networks, and other domains.
The book focuses on three primary aspects of data clustering:
In this book, top researchers from around the world explore the characteristics of clustering problems in a variety of application areas. They also explain how to glean detailed insight from the clustering process—including how to verify the quality of the underlying clusters—through supervision, human intervention, or the automated generation of alternative clusters.
Statistical Data Mining and Knowledge Discovery brings together a stellar panel of experts to discuss and disseminate recent developments in data analysis techniques for data mining and knowledge extraction. This carefully edited collection provides a practical, multidisciplinary perspective on using statistical techniques in areas such as market segmentation, customer profiling, image and speech analysis, and fraud detection. The chapter authors, who include such luminaries as Arnold Zellner, S. James Press, Stephen Fienberg, and Edward K. Wegman, present novel approaches and innovative models and relate their experiences in using data mining techniques in a wide range of applications.
Researchers have made great strides in the field over the last twenty years, and classification is no longer perceived as being concerned solely with exploratory analyses. The second edition of Classification incorporates many of the new and powerful methodologies developed since its first edition. Like its predecessor, this edition describes both clustering and graphical methods of representing data, and offers advice on how to decide which methods of analysis best apply to a particular data set. It goes even further, however, by providing critical overviews of recent developments not widely known, including efficient clustering algorithms, cluster validation, consensus classifications, and the classification of symbolic data.
The author has taken an approach accessible to researchers in the wide variety of disciplines that can benefit from classification analysis and methods. He illustrates the methodologies by applying them to data sets-smaller sets given in the text, larger ones available through a Web site.
Large multivariate data sets can be difficult to comprehend-the sheer volume and complexity can prove overwhelming. Classification methods provide efficient, accurate ways to make them less unwieldy and extract more information. Classification, Second Edition offers the ideal vehicle for gaining the background and learning the methodologies-and begin putting these techniques to use.
* The latest results on support vector machines including v-SVM's and their geometric interpretation
* Classifier combinations including the Boosting approach
* State-of-the-art material for clustering algorithms tailored for large data sets and/or high dimensional data, as required by applications such as web-mining and bioinformatics
* Coverage of diverse applications such as image analysis, optical character recognition, channel equalization, speech recognition and audio classification