Special Seminar in Computing & Mathematical Sciences

Series: Special Seminars in Computing + Mathematical Sciences

Optimization Challenges in Deep Learning

Professor Benjamin Recht, Dept. of Electrical Engineering and Computer Science, UC Berkeley,

When training large-scale deep neural networks for pattern recognition, hundreds of hours on clusters of GPUs are required to achieve state-of-the-art performance. Improved optimization algorithms could potentially enable faster industrial prototyping and make training contemporary models more accessible. In this talk, I will attempt to distill the key difficulties in optimizing large, deep neural networks for pattern recognition. In particular, I will emphasize that many of the popularized notions of what make these problems "hard" are not true impediments at all. I will show that it is not only easy to globally optimize neural networks, but that such global optimization remains easy when fitting completely random data. I will argue instead that the source of difficulty in deep learning is a lack of understanding of generalization. I will provide empirical evidence of high-dimensional function classes that are able to achieve state-of-the-art performance on several benchmarks without any obvious forms of regularization or capacity control. These experiments reveal that traditional learning theory fails to explain why large neural networks generalize. I will close by proposing some possible paths towards a framework of generalization that explains these experimental findings.

Event Sponsors:

Computing and Mathematical Sciences (CMS) - More events from this Sponsor

For more information, please contact Carmen Nemer-Sirois by phone at 4561 or by email at [email protected].