DEVELOPING NEW INFORMATICS APPROACHES FOR INVESTIGATING SEQUENCE-STRUCTURE-FUNCTION AND EVOLUTIONARY RELATIONSHIPS IN GLYCOSYLTRANSFERASES

Taujale, Rahil

DEVELOPING NEW INFORMATICS APPROACHES FOR INVESTIGATING SEQUENCE-STRUCTURE-FUNCTION AND EVOLUTIONARY RELATIONSHIPS IN GLYCOSYLTRANSFERASES

Taujale, Rahil

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Files

Abstract

Glycosyltransferases (GTs) play fundamental roles in nearly all cellular processes through the biosynthesis of complex carbohydrates and glycosylation of diverse protein and small molecule substrates. Although prevalent across the tree of life, the evolutionary basis for the complex and diverse modes of GT catalytic functions remain enigmatic. This is mainly due to the extensive structural and functional diversification of GTs that presents a major challenge in mapping the relationships connecting sequence, structure, fold and function.In this dissertation, I develop and apply a combination of established and novel tools for large scale sequence based comparisons of glycosyltransferases across the tree of life. Using well curated structure-based sequence alignment profiles, I first align over half a million GT sequences adopting the GT-A fold to identify the conserved GT-A core and define the minimal active site and hydrophobic components required for GT-A function. Based on this conserved core, I build a phylogenetic framework connecting diverse GT-A families and propose a new evolutionary constraint based classification of GT-A sequences into evolutionarily related groups. Next, I use advances in deep learning to develop a GT fold classification and prediction model that extends the analysis from GT-A to other known and novel folds. I build this highly interpretable model to identify the core conserved features of all three major GT folds and predict GT families that are likely to adopt novel folds. Finally, I compile all the diverse datasets generated during these studies into an interactive data analytics platform that can be used to infer novel hypotheses about GT-A fold enzymes.

Details

Record ID

5190

Record Created

2024-12-05

Title

DEVELOPING NEW INFORMATICS APPROACHES FOR INVESTIGATING SEQUENCE-STRUCTURE-FUNCTION AND EVOLUTIONARY RELATIONSHIPS IN GLYCOSYLTRANSFERASES

Author

Taujale, Rahil

Contributor

Kannan, Natarajan Advisor
Edison, Arthur S Advisor
Moremen, Kelley W Committee Member
West, Christopher M Committee Member
Arnold, Jonathan Committee Member

College or School

Franklin College of Arts and Sciences

Department

Genetics

Subjects

Bioinformatics

Content Type

Dissertation

Pagination

185

File Format

pdf

Language

English

Degree Type

Doctor of Philosophy (PHD)

Name of Granting Institution

University of Georgia

Year Degree Granted

2021-05

Keywords

carbohydrates; deep learning; evolution; Glycosyltransferases; phylogeny; sequence alignment

Record Appears in

College, School, or Unit > Franklin College of Arts and Sciences > Genetics
Electronic Theses and Dissertations > Doctoral Dissertation
All Resources
Doctoral

System Control Number

9949375459202959

PDF

Statistics

Download Full History