Model-based clustering with application of copulas for symbolic data

Pan, Wenhao

2018

Abstract

Contemporary data sets can be too large or complex for traditional statistical methods to handle. One approach is to use symbolic data, first introduced by Diday (1987). Our interest is model-based clustering for symbolic data, especially distribution-valued data (i.e., observations that are distributions rather than single numerical point values). We describe symbolic data and the considerable differences between symbolic data and classical data. For multivariate data, with p > 1 variables, only the marginal distributions are available, so the dependence relationships among the random variables are unknown. One approach to modeling these dependencies is that of Vrac et al. (2012), in which a copula function is used to describe the joint cumulative distribution function of the random variables in a mixture model. We further develop the algorithm from various perspectives. The model-based clustering algorithm is also implemented in R and applied to simulated data.
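
The role of the copula in a mixture model can be illustrated with a minimal sketch. It is not the dissertation's R implementation: the bivariate setting, the Frank copula, the normal marginals, and every function and parameter name below are assumptions chosen only for illustration. Each mixture component joins its two marginal CDFs through a copula, and the component joint CDFs are averaged with the mixing weights.

```python
import math


def normal_cdf(x, mean, sd):
    """Marginal CDF (assumed normal here purely for illustration)."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))


def frank_copula(u, v, theta):
    """Frank copula C(u, v) for theta != 0 (one possible copula family)."""
    if theta == 0:
        raise ValueError("theta must be nonzero for the Frank copula")
    numerator = math.expm1(-theta * u) * math.expm1(-theta * v)
    denominator = math.expm1(-theta)
    return -math.log1p(numerator / denominator) / theta


def mixture_joint_cdf(x1, x2, components):
    """Joint CDF of a two-variable mixture.

    Each component is (weight, (mean1, sd1), (mean2, sd2), theta); its joint
    CDF is the copula applied to the two marginal CDF values.
    """
    total = 0.0
    for weight, marginal1, marginal2, theta in components:
        u = normal_cdf(x1, *marginal1)
        v = normal_cdf(x2, *marginal2)
        total += weight * frank_copula(u, v, theta)
    return total


# Hypothetical two-cluster example; the weights sum to 1.
example_components = [
    (0.4, (0.0, 1.0), (0.0, 1.0), 3.0),   # positively dependent cluster
    (0.6, (4.0, 1.5), (5.0, 2.0), -2.0),  # negatively dependent cluster
]
print(mixture_joint_cdf(1.0, 2.0, example_components))
```

In this sketch the marginals carry the per-variable information, while the copula parameter theta alone controls the dependence within each cluster, which is why a copula is useful when only marginal distributions are observed.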

Details

Record ID

17852

Record Created

2024-12-05

Title

Model-based clustering with application of copulas for symbolic data

Author

Pan, Wenhao

Contributor

Billard, Lynne (Advisor)
Bai, Ray (Committee Member)
Lazar, Nicole (Committee Member)
Park, Cheolwoo (Committee Member)
Reeves, Jaxk (Committee Member)

College or School

Franklin College of Arts and Sciences

Department

Statistics

Date

2018

Publisher

University of Georgia

Content Type

Dissertation

Language

English

Dissertation/Thesis Note

Doctoral

Degree Type

Doctor of Philosophy (PHD)

Name of Granting Institution

University of Georgia, Summer 2018

Year Degree Granted

2018

Keywords

Distributional Data; Model-based Clustering; Copulas

System Control Number

9949333387502959
