This volume contains a selection of papers presented during the biennial meeting of the CLAssification and Data Analysis Group (CLADAG) of the Societa Italiana di Statistica which was orga nized by the Istituto di Statistica of the Universita degli Studi di Palermo and held in the Palazzo Steri in Palermo on July 5-6, 2001. For this conference, and after checking the submitted 4 page abstracts, 54 papers were admitted for presentation. They covered a large range of topics from multivariate data analysis, with special emphasis on classification and clustering, computa tional statistics, time series analysis, and applications in various classical or recent domains. A two-fold careful reviewing process led to the selection of 22 papers which are presented in this vol ume. They convey either a new idea or methodology, present a new algorithm, or concern an interesting application. We have clustered these papers into five groups as follows: 1. Classification Methods with Applications 2. Time Series Analysis and Related Methods 3. Computer Intensive Techniques and Algorithms 4. Classification and Data Analysis in Economics 5. Multivariate Analysis in Applied Sciences. In each section the papers are arranged in alphabetical order. The editors - two of them the organizers of the CLADAG confer ence - would like to express their gratitude to the authors whose enthusiastic participation made the meeting possible and very successful.
The present volume contains a selection of papers presented at the Eighth Conference of the International Federation of Classification Societies (IFCS) which was held in Cracow, Poland, July 16-19, 2002. All originally submitted papers were subject to a reviewing process by two independent referees, a procedure which resulted in the selection of the 53 articles presented in this volume. These articles relate to theoretical investigations as well as to practical applications and cover a wide range of topics in the broad domain of classifi cation, data analysis and related methods. If we try to classify the wealth of problems, methods and approaches into some representative (partially over lapping) groups, we find in particular the following areas: • Clustering • Cluster validation • Discrimination • Multivariate data analysis • Statistical methods • Symbolic data analysis • Consensus trees and phylogeny • Regression trees • Neural networks and genetic algorithms • Applications in economics, medicine, biology, and psychology. Given the international orientation of IFCS conferences and the leading role of IFCS in the scientific world of classification, clustering and data anal ysis, this volume collects a representative selection of current research and modern applications in this field and serves as an up-to-date information source for statisticians, data analysts, data mining specialists and computer scientists.
In diesem Buch wird die deskriptive Statistik im weitesten Sinne behandelt. Explorative Datenanalyse (EDA), deskriptive Statistik und graphische Darstellungstechnik werden unter einem gemeinsamen Aspekt beschrieben. Das Buch versucht, erstmals in deutscher Sprache, die Datenaufbereitung im Stile Tukey (1977) und Mosteller und Tukey (1977) mit Beispielen aus dem sozial- und wirtschaftswissenschaftlichen Bereich dem Studenten nahezubringen. Dabei wird vollständig auf den Gebrauch von Wahrscheinlichkeiten verzichtet, dafür aber der Begriff der empirischen Verteilung zentral behandelt. Darüberhinaus wird eine Brücke zwischen der traditionellen (deutschsprachigen) deskriptiven Statistik und dem neuen explorativen Zugang des angelsächsischen Raumes geschlagen. Schließlich wird in jedem Kapital die Verfügbarkeit der Methoden in Programmpaketen kurz diskutiert.
In science, industry, public administration and documentation centers large amounts of data and information are collected which must be analyzed, ordered, visualized, classified and stored efficiently in order to be useful for practical applications. This volume contains 50 selected theoretical and applied papers presenting a wealth of new and innovative ideas, methods, models and systems which can be used for this purpose. It combines papers and strategies from two main streams of research in an interdisciplinary, dynamic and exciting way: On the one hand, mathematical and statistical methods are described which allow a quantitative analysis of data, provide strategies for classifying objects or making exploratory searches for interesting structures, and give ways to make comprehensive graphical displays of large arrays of data. On the other hand, papers related to information sciences, informatics and data bank systems provide powerful tools for representing, modelling, storing and retrieving facts, data and knowledge characterized by qualitative descriptors, semantic relations, or linguistic concepts. The integration of both fields and a special part on applied problems from biology, medicine, archeology, industry and administration assure that this volume will be informative and useful for theory and practice.
Data Science and Classification provides new methodological developments in data analysis and classification. The broad and comprehensive coverage includes the measurement of similarity and dissimilarity, methods for classification and clustering, network and graph analyses, analysis of symbolic data, and web mining. Beyond structural and theoretical results, the book offers application advice for a variety of problems, in medicine, microarray analysis, social network structures, and music.
International Federation of Classification Societies The International Federation of Classification Societies (lFCS) is an agency for the dissemination of technical and scientific information concerning classification and multivariate data analysis in the broad sense and in as wide a range of applications as possible; founded in 1985 in Cambridge (UK) by the following Scientific Societies and Groups: - British Classification Society - BCS - Classification Society of North America - CSNA - Gesellschaft fUr Klassification - GfKI - Japanese Classification Society - JCS - Classification Group ofItalian Statistical Society - CGSIS - Societe Francophone de Classification - SFC Now the IFCS includes also the following Societies: - Dutch-Belgian Classification Society - VOC - Polish Classification Section - SKAD - Portuguese Classification Association - CLAD - Group at Large - Korean Classification Society - KCS IFCS-98, the Sixth Conference of the International Federation of Classification Societies, was held in Rome, from July 21 to 24, 1998. Five preceding conferences were held in Aachen (Germany), Charlottesville (USA), Edinburgh (UK), Paris (France), Kobe (Japan).
In diesem Lehrbuch zur schliessenden (induktiven) Statistik werden die grundlegenden Methoden der Schätz- und Testtheorie auf einführendem Niveau für Studenten der Wirtschaftswissenschaften dargestellt. Neu ist in diesem Buch ein "dualer" Zugang, in dem die klassische und die Bayes-Theorie gemeinsam dargestellt werden. Die rasante Entwicklung der Bayes-Methoden in den letzten Jahren macht eine einführende Darstellung dieser Methoden notwendig. So werden HPD-Intervalle (höchste Wahrscheinlichkeitsdichte) und einfache Bayes-Tests als Alternativen zu Konfidenzintervalle und Signifikanztest erklärt. Alle Methoden werden ausführlich an Beispielen erklärt.
Explorative Datenanalyse (EDA), deskriptive Statistik und graphische Darstellungstechnik werden unter einem gemeinsamen Aspekt beschrieben. Dem Studenten im ersten Studienjahr soll damit bereits möglichst früh ein Überblick über die verschiedenen Typen der statistischen Modellierung geboten werden. Das Buch präsentiert resistente statistische Methoden, aber ohne deren wahrscheinlichkeitstheoretische oder induktive Begründung. An mehreren Beispielen aus den Sozial- und Wirtschaftswissenschaften wird gezeigt, wie ein deskriptiver Modellbildungsprozeß mit einfachen Mitteln möglich ist.
Raymond Bisdorff CRP-GL, Luxembourg The development of the SODAS software based on symbolic data analysis was extensively described in the previous chapters of this book. It was accompanied by a series of benchmark activities involving some official statistical institutes throughout Europe. Partners in these benchmark activities were the National Statistical Institute (INE) of Portugal, the Instituto Vasco de Estadistica Euskal (EUSTAT) from Spain, the Office For National Statistics (ONS) from the United Kingdom, the Inspection Generale de la Securite Sociale (IGSS) from Luxembourg 1 and marginally the University of Athens . The principal goal of these benchmark activities was to demonstrate the usefulness of symbolic data analysis for practical statistical exploitation and analysis of official statistical data. This chapter aims to report briefly on these activities by presenting some signifi cant insights into practical results obtained by the benchmark partners in using the SODAS software package as described in chapter 14 below.
The TCGOV 2005 international conference on e-government was held at the Free University of Bozen-Bolzano during March 2–4, 2005. The conference was initiated by the working group “Towards Electronic Democracy” (TED) of the European Science Foundation and was jointly organized by the Free University ofBozen-Bolzano,theMunicipalityofBozen-Bolzano,theTEDWorkingGroup, and the IFIP Working Group 8.5. The conference addressed a large spectrum of issues that are relevant and have to be investigated for a successful transition from the traditional form of government to a new form known as e-government. The main focus was on the following topics: – improving citizen participation and policy making (e-democracy) – government application integration – semantic Web technologies for e-government – security aspects for e-government services Two sessions were dedicated to e-democracy, an emerging area within- government that seeks to enhance democratic processes and provide increased opportunities for individuals and communities to be involved in governmental decisions.Thecontributionsofthesetwosessionscovermorefundamentalresults and insights as well as experiences from di?erent countries. Another focus was on government application integration and the use of - mantic Web technologies, which are important technical aspects on the agenda of e-government research. Di?erent architectures for the integration and orch- tration of distributed services and processes were presented along with two case studies. Three papers about Semantic Web technologies discussed the use of ontologies in e-government.
This volume presents theories, models, algorithms, and applications in clustering, classification, and visualization. It also includes applications of clustering, classification, and visualization in various fields such as marketing, recommendation system, biology, sociology, and social survey. The contributions give insight into new models and concepts and show the variety of research in clustering, classification, and visualization.