Center for Theoretical Physics of the Universe (CTPU-PTC)

Jaeok Yi (KAIST), "Neural Networks from the Perspective of Physics"

Asia/Seoul
CTPU Seminar room (Theory Bldg, 4F)

CTPU Seminar room

Theory Bldg, 4F

Description

Despite the remarkable empirical success of deep learning, a comprehensive theoretical understanding of why and how neural networks learn remains a mystery. In this talk, we discuss physics-inspired approaches to understanding neural networks. We present synaptic field theory, a framework that reformulates the gradient descent dynamics of synaptic weights as classical field dynamics in de Sitter spacetime, constructing an action whose metric naturally matches that of a universe with a positive cosmological constant. This framework faces a challenge related to the non-locality of the cost function. To address this issue, we explore the idea of promoting neurons to dynamical degrees of freedom. Leveraging properties of stochastic gradient descent, the Lagrangian can be decomposed into a data-independent bulk part and a data-dependent boundary part. This decomposition is expected to separate the architectural structure from the stochastic properties of neural networks, enabling independent analysis of each. Through this line of research, we aim to provide physicists with a familiar language to investigate the theoretical foundations of machine learning.