Leveraging Symmetric Structure For Improved Learning In Convolutional Neural Networks

Loading...
Thumbnail Image

Degree type

Doctor of Philosophy (PhD)

Graduate group

Computer and Information Science

Discipline

Subject

Deep Learning
Physical priors
Computer Sciences

Funder

Grant number

License

Copyright date

2022-09-09T20:20:00-07:00

Distributor

Related resources

Contributor

Abstract

The aggressive resurgence of convolutional neural network (CNN) models for prediction has led to new benchmarks in speech recognition, natural language processing and computer vision. In computer vision, the success of these models is often attributed to the combination of a highly nonlinear cascaded processing scheme and the equivariance of planar convolution to translations of the input. This thesis introduces methods that extend the equivariance capability of CNNs to linear Lie groups that describe: the motion of objects, the structure of the Euclidean world and the formation of images. The first approach introduces a framework for joint estimation of image and motion representations. The linear Lie group structure is enforced through a bilinear motion model which transforms an image representation by the linear combination of motion generators. The approach affords extrapolation of image sequences through linear extrapolation of transformation coefficients. In the second approach, 3D rotationally equivariant representations are learned by convolution of spherical functions with respect to the 3D rotation group. Methods are described for the convolution of functions on both the two- and three-spheres. The final approach enforces equivariance of representations to 2D dilated-rotations by preprocessing the input with a change of coordinates.

Date of degree

2020-01-01

Date Range for Data Collection (Start Date)

Date Range for Data Collection (End Date)

Digital Object Identifier

Series name and number

Volume number

Issue number

Publisher

Publisher DOI

Journal Issues

Comments

Recommended citation