Efficient graph clustering algorithms using compressive sensing

McKenzie, Daniel

McKenzie, Daniel

2019

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Files

Abstract

Clustering graph-based data is a core problem in contemporary data science. In particular, as data sets grow in size and dimension, efficient algorithms are needed that can go beyond the O(n^2) run time of classical algorithms such as spectral clustering. In this dissertation we propose a new paradigm that rephrases the problem of finding clusters in graphs as a compressive sensing problem. This enables us to use fast algorithms originally developed for sparse recovery (in particular, greedy algorithms such as Orthogonal Matching Pursuit or Subspace Pursuit) for the clustering problem. We propose two new algorithms, and several variations thereof, in this paradigm, which we deem Cluster Pursuit algorithms. In particular, SingleClusterPursuit takes a small set of seed vertices and returns a good cluster containing them in O(dmax*n*log(n)) time, where dmax is the maximum vertex degree of the graph, while DynamicClusterPursuit efficiently updates an existing cluster in an evolving network. We further prove that SingleClusterPursuit is able to recover a large fraction of a given cluster for graphs drawn from a well-known probabilistic model of graphs with communities, namely the stochastic block model.In an additional chapter, we study the related problem of turning Euclidean data into graph data, so that graph-based clustering algorithms such as those discussed above can be used. We analyze the use of power weighted shortest path distances to measure the distance between such data points, and show that this can lead to significant improvements in classification accuracy on both real and synthetic data sets.

Details

Record ID

19566

Record Created

2024-12-05

Title

Efficient graph clustering algorithms using compressive sensing

Author

McKenzie, Daniel

Contributor

Lai, Ming-Jun Advisor
Adams, Malcolm Committee Member
Gutierrez, Juan Committee Member
Petukhov, Alexander Committee Member
Zhang, Qing Committee Member

College or School

Mathematics

Date

2019

Publisher

University of Georgia

Content Type

Dissertation

Language

English

Dissertation/ Thesis Note

Doctoral

Degree Type

Doctor of Philosophy (PHD)

Name of Granting Institution

University of Georgia, Spring 2019

Year Degree Granted

2019

Keywords

spectral graph theory; random graph theory; compressive sensing; semi-supervised learning; unsupervised learning; clustering; path distances; manifold hypothesis

Record Appears in

Electronic Theses and Dissertations > Doctoral Dissertation
All Resources
Doctoral

System Control Number

9949333452002959

PDF

Statistics

Download Full History