Estimation of genomic copy frequency with correlated observations

Tang, Qianqian

Estimation of genomic copy frequency with correlated observations

Tang, Qianqian

2012

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Files

Abstract

In this thesis, we compare several methods to handle correlated data related to genome frequency copies. First, we used standard Poisson Regression to analyze the data. From the results, we find that there are several problems related to over-dispersion and under-dispersion. It is easy to handle over-dispersion using the scale-adjustment method. However, remedying problems related to dependence caused by correlated Poisson data are not so easily handled. We first created a statistic to help us test the null hypothesis that data are independent Poisson realizations vs. the alternative that they are positively associated. From this, we found that 225 base-pairs separation is the minimum cut-off distance needed to achieve approximate independence. We also used results from this analysis to devise a formula which yields the approximate correlation coefficient (r) between counts which are separated by b base-pairs. Finally, we use our method to weight observations, and find significant improvement compared to other methods.

Details

Record ID

13315

Record Created

2024-12-05

Title

Estimation of genomic copy frequency with correlated observations

Author

Tang, Qianqian

Contributor

Reeves, Jaxk Advisor
Liu, Liang Committee Member
Wang, Lily Committee Member

College or School

Franklin College of Arts and Sciences

Department

Statistics

Date

2012

Publisher

University of Georgia

Content Type

Thesis

Language

English

Dissertation/ Thesis Note

Graduate

Degree Type

Master of Science (MS)

Name of Granting Institution

University of Georgia, Spring 2012

Year Degree Granted

2012

Keywords

Poisson Regression; Over-dispersion; Under-dispersion; Dependent Weighting Scheme

Record Appears in

Electronic Theses and Dissertations > Graduate Thesis
Franklin College of Arts and Sciences
All Resources

System Control Number

9949334218002959

PDF

Statistics

Download Full History