In recent years, Transformers have demonstrated success in computer vision (CV) tasks: the Vision Transformer (ViT) competes with convolutional neural networks (CNNs) on image classification when pre-trained models are used. Many of these deep learning models are hand-designed by experts, which requires expertise, time, and labor. Neural Architecture Search (NAS) seeks to automate the process of designing a neural network architecture. In this paper, I
propose NSGA-ViT, a multi-objective evolutionary NAS method for designing Transformer-based networks for computer vision tasks. NSGA-ViT uses the multi-objective genetic algorithm NSGA-II to design a ViT network with two objectives: maximizing classification performance and minimizing network size. NSGA-ViT explores a search space of self-attention and convolution operations and discovers a Transformer architecture that outperforms ViT on CIFAR-10 while containing half as many parameters.
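To illustrate the two-objective selection underlying NSGA-II (this is a minimal sketch, not the method's actual implementation), the snippet below computes the Pareto front of candidate architectures scored by hypothetical (accuracy, parameter-count) pairs; NSGA-II's non-dominated sorting builds its first front in exactly this way.

```python
# Illustrative sketch: Pareto-front selection for two objectives,
# maximizing accuracy and minimizing parameter count.
# The candidate scores below are hypothetical, for demonstration only.

def dominates(a, b):
    """True if candidate a Pareto-dominates candidate b.

    Each candidate is (accuracy, n_params): accuracy is maximized,
    n_params is minimized.
    """
    acc_a, size_a = a
    acc_b, size_b = b
    no_worse = acc_a >= acc_b and size_a <= size_b
    strictly_better = acc_a > acc_b or size_a < size_b
    return no_worse and strictly_better

def pareto_front(population):
    """Return candidates not dominated by any other (NSGA-II front 0)."""
    return [p for p in population
            if not any(dominates(q, p) for q in population if q != p)]

# Hypothetical (accuracy, millions-of-parameters) pairs.
candidates = [(0.91, 86.0), (0.90, 43.0), (0.88, 43.0), (0.92, 120.0)]
print(pareto_front(candidates))
```

Here (0.88, 43.0) is dropped because (0.90, 43.0) matches its size with strictly higher accuracy; the surviving candidates each trade accuracy against size, which is the trade-off curve NSGA-ViT searches along.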