S&DS Seminar: Oscar Leong (Caltech)

decorative

Mason Lab 211 with remote access option, 9 Hillhouse Avenue, New Haven, CT 06520

Oscar Leong
Speaker: Oscar Leong, Caltech

In-Person seminars will be held at Mason Lab 211 with optional remote access:
(https://yale.hosted.panopto.com/Panopto/Pages/Sessions/List.aspx?folderID=f8b73c34-a27b-42a7-a073-af2d00f90ffa)

The Power and Limitations of Convexity in Data Science

Abstract: Optimization is a fundamental pillar of data science. Traditionally, the art and challenge in optimization lay primarily in problem formulation to ensure desirable properties such as convexity. In the context of contemporary data science, however, optimization is practiced differently, with scalable local search methods applied to nonconvex objectives being the dominant paradigm in high-dimensional problems. This has brought a number of foundational mathematical challenges at the interface between optimization and data science pertaining to the dichotomy between convexity and nonconvexity.

In this talk, I will discuss some of my work addressing these challenges in regularization, a technique to encourage structure in solutions to statistical estimation and inverse problems. Even setting aside computational considerations, we currently lack a systematic understanding from a modeling perspective of what types of geometries should be preferred in a regularizer for a given data source. In particular, given a data distribution, what is the optimal regularizer for such data and what are the properties that govern whether it is amenable to convex regularization? Using ideas from star geometry, Brunn-Minkowski theory, and variational analysis, I show that we can characterize the optimal regularizer for a given distribution and establish conditions under which this optimal regularizer is convex. Moreover, I describe results establishing the robustness of our approach, such as convergence of optimal regularizers with increasing sample size and statistical learning guarantees with applications to several classes of regularizers of interest.

Bio: I am a von Kármán Instructor at Caltech in the Computing + Mathematical Sciences department, hosted by Venkat Chandrasekaran. I also work with Katie Bouman and the Computational Cameras group. I completed my PhD from Rice University in Computational and Applied Mathematics under the supervision of Paul Hand and was an NSF Graduate Research Fellow. I received my undergraduate degree in Mathematics from Swarthmore College.

My research interests lie in the mathematics of data science, inverse problems, machine learning, and optimization. Much of my work concerns solving signal recovery problems with approaches inspired by deep learning and uses tools from high dimensional probability, random matrix theory, and optimization to develop provable recovery guarantees.

Wednesday, March 01, 2023

3:30pm – Pre-talk meet and greet teatime – Dana House, 24 Hillhouse Avenue

4:00pm – 5:00 pm – Talk – Mason Lab 211, 9 Hillhouse Avenue with the option of virtual participation