Robust estimation in mixture models and small area estimation using cross-sectional time series models

WOO, MI-JA

Robust estimation in mixture models and small area estimation using cross-sectional time series models

WOO, MI-JA

2005

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Files

Abstract

This dissertation considers robust estimation of unknown number of components, also known as the mixture complexity, in nite mixture models and cross-sectional time series modeling of civilian unemployment rate for all the states in the U.S.. We begin with the problem of nding the mixture with fewest possible components that provides a satisfactory t of the data. Finite mixture models provide a natural way of modeling unobserved population heterogeneity, which is often encountered in data sets arising from biological, physical and social sciences. However, in many applications, it is unrealistic to expect that the component densities belong to some exact parametric family. The mixture of interest may even be contaminated, which causes the estimates such as based on KL distances to be unstable. To overcome this problem, we develop a robust estimator of mixture complexity based on the Minimum Hellinger Distance (MHD) when all other associated parameters are unknown. This estimator is considered in two cases, that is, when the random variables are continuous and discrete. For each case, an estimator of mixture complexity of mixture complexity is constructed as a by-product of minimizing a Hellinger Information Criterion, and this estimator is proved to be consistent for parametric family of mixtures. Via extensive simulations, our estimator is shown to be very competitive with several others in the literature when the model is correctly specied and to be robust under symmetric departures from postulated component normality in terms of correctly identifying the true mixture complexity robustness. Next, we consider the problem of modeling civilian unemployment rate for all the states in the U.S. Unemployment rate estimates are published by the U.S. Bureau of the Labor Statistics (BLS) every month for the whole nation, 50 states and DC as well as other areas. In recent years, the demand for small area statistics has greatly increased. At the national level, The overall sample size for the Current Population Survey (CPS) is sucient to produce reliable estimates of UE rate. However, for smaller domains, the eective sample sizes within a given domain are so small that standard design-based estimators are not precise enough. Therefore, there is a need to improve the eciency for small areas. The overlaps in CPS samples over time and the availability of other states' records provide the development of reliable model-based unemployment rate estimators for the states. To improve the eciency for small areas, we turn to explicit small area models that make specic allowance for between area variation, based on a Seasonal Autoregressive Integrated Moving Average (SARIMA) model. To carry out estimation of parameters in this random-eects version of time series model, a Bayesian inference methodology is constructed using Markov chain Monte Carlo methods. Through examining the model adequacy, and forecasting the last four observations for all the states, our model is shown to be reliable and ecient.

Details

Record ID

7636

Record Created

2024-12-05

Title

Robust estimation in mixture models and small area estimation using cross-sectional time series models

Author

WOO, MI-JA

Contributor

Sriram, Tharuvai N. Advisor
McCormick, William P. Committee Member
Reeves, Jaxk Committee Member
Rekaya, Romdhane Committee Member
Yin, Xiangrong Committee Member

College or School

Franklin College of Arts and Sciences

Department

Statistics

Date

2005

Publisher

University of Georgia

Content Type

Dissertation

Language

English

Dissertation/ Thesis Note

Doctoral

Degree Type

Doctor of Philosophy (PHD)

Name of Granting Institution

University of Georgia, Summer 2005

Year Degree Granted

2005

Keywords

Finite mixtures; Hellinger Information Criterion; Threshold; Consistency; Robustness; Adaptive Density Estimate; Symmetric Departures; Seasonal Autoregressive Moving Average Model; Bayesian Analysis; Gibbs Sampling; Metropolis-Hasting sampling; Forecasting; Model Adequacy

Record Appears in

Electronic Theses and Dissertations > Doctoral Dissertation
Franklin College of Arts and Sciences
All Resources
Doctoral

System Control Number

9949334951902959

PDF

Statistics

Download Full History