Generalization in Deep Learning

1710.05468

Authors

Yoshua Bengio,Kenji Kawaguchi,Leslie Pack Kaelbling

Abstract

This paper provides theoretical insights into why and how deep learning can generalize well, despite its large capacity, complexity, possible algorithmic instability, nonrobustness, and sharp minima, responding to an open question in the literature. We also discuss approaches to provide non-vacuous generalization guarantees for deep learning.

Based on theoretical observations, we propose new open problems and discuss the limitations of our results.

Resources

View on Hugging Face Read PDF ArXiv

Stay in the loop

Every AI paper that matters, free in your inbox daily.

Tools

MCP
RSS
llms.txt
Sitemap

Details

Content is sourced from third-party publications.