Understanding and Designing Deep Neural Networks Through Theory-Guided Training
DOI:
https://doi.org/10.31224/4790Keywords:
Deep Neural Networks, Theory-Guided Training, Optimization Theory, Neural Tangent Kernel, Implicit Bias, Information Theory, Generalization, Robustness, InterpretabilityAbstract
Deep Neural Networks (DNNs) have revolutionized numerous fields with their remarkable empirical success, yet their theoretical understanding remains incomplete. The emerging paradigm of \emph{theory-trained deep neural networks} aims to bridge this gap by integrating rigorous theoretical principles into the design and training of deep models. This survey provides a comprehensive overview of the foundational theories, training methodologies, and practical applications that define this vibrant area of research. We categorize existing approaches based on the theoretical frameworks they leverage, including optimization landscapes, neural tangent kernels, implicit bias, and information-theoretic principles. Furthermore, we discuss empirical successes across diverse domains such as computer vision, natural language processing, scientific computing, and reinforcement learning. Finally, we outline key challenges and promising future directions, emphasizing the need for scalable, interpretable, and robust theory-guided learning algorithms. This survey serves as a resource for researchers and practitioners interested in the intersection of deep learning theory and practice.
Downloads
Downloads
Posted
License
Copyright (c) 2025 Karthika Nasir, Aradhana Reva, Jai Sekhar

This work is licensed under a Creative Commons Attribution 4.0 International License.