Smooth Games Optimization and Machine Learning Workshop: Bridging Game Theory and Deep Learning

Overview

Advances in generative modeling and adversarial learning have renewed interest in differentiable two-player games, with much of the attention falling on generative adversarial networks (GANs). Solving these games introduces distinct challenges compared to the standard minimization tasks that the machine learning (ML) community is used to. A symptom of this issue is ML and deep learning (DL) practitioners using optimization tools on game-theoretic problems. Recent work seeks to rectify this situation by bringing game-theoretic tools into ML. At NeurIPS 2018 we held “Smooth games optimization in ML”, a workshop with this scope and goal in mind. Last year’s workshop addressed theoretical aspects of games in machine learning, their special dynamics, and typical challenges. Talks by Costis Daskalakis, Niao He, Jacob Abernethy and Paulina Grnarova emphasized various fundamental topics in a pure, simplified theoretical setting. A number of contributed talks and posters tackled similar questions. The workshop culminated in a panel discussion that identified a number of interesting questions. The aim of this workshop is to provide a platform for both theoretical and applied researchers from the ML, mathematical programming and game theory communities to discuss the status of our understanding of the interplay among smooth games, their applications in ML, and existing tools and methods for dealing with them. We are looking for contributions that identify and discuss open, forward-looking problems of interest to the NeurIPS community.

Invited Speakers

Fei Fang (CMU)

Title:

Integrating Machine Learning with Game Theory for Societal Challenges

Link to video

Abstract:

Real-world problems such as protecting critical infrastructure and cyber networks and protecting wildlife, fisheries, and forests often involve multiple decision-makers. While game theory is an established paradigm for such problems, its applicability in practice is often limited by computational intractability in large games, the unavailability of game parameters, and the lack of rationality of human players. On the other hand, machine learning has led to huge successes in various domains and can be leveraged to overcome the limitations of game-theoretic analysis. In this talk, I will introduce our work on integrating machine learning with computational game theory for addressing societal challenges such as security and sustainability, covering the following directions: data-based game-theoretic reasoning, learning-powered strategy computation in large-scale games, and end-to-end learning of game parameters.

Short Bio:

Fei Fang is an Assistant Professor at the Institute for Software Research in the School of Computer Science at Carnegie Mellon University. Before joining CMU, she was a Postdoctoral Fellow at the Center for Research on Computation and Society (CRCS) at Harvard University. She received her Ph.D. from the Department of Computer Science at the University of Southern California in June 2016.

Eva Tardos (Cornell University)

Title:

Learning in dynamic multi-agent environments

Link to video

Abstract:

In this talk we will consider games where players use a form of learning that helps them adapt to a changing environment. We ask if the quantitative guarantees obtained for Nash equilibria for this class of games extend to such out-of-equilibrium game play, when the game or the population of players is dynamically changing and where participants have to adapt to the dynamic environment.

Short Bio:

Éva Tardos received her Dipl.Math. in 1981 and her Ph.D. in 1984 from Eötvös University, Budapest, Hungary. She joined Cornell in 1989, and was Chair of the Department of Computer Science 2006-2010. She has been elected to the National Academy of Engineering, the National Academy of Sciences, and the American Academy of Arts and Sciences, is an external member of the Hungarian Academy of Sciences, and is the recipient of a number of fellowships and awards including the IEEE John von Neumann Medal, the Packard Fellowship, the Gödel Prize, the Dantzig Prize, and the Fulkerson Prize. She was editor-in-Chief of SIAM Journal on Computing 2004-2009, and is currently editor-in-Chief of the Journal of the ACM and editor of some other journals, including Theory of Computing and Combinatorica.

David Balduzzi (DeepMind)

Title:

Composition, learning, and games

Link to video

Abstract:

Gradient descent and automatic differentiation provide a powerful framework for composing function-approximators and training them to optimise an objective. However, there are many learning algorithms that do not optimise a single, fixed objective — such as self-play and its generalisations in Go and StarCraft, generative adversarial networks, and adversarial training for robustness. In this talk, I will argue for a new subfield of “differentiable mechanism design”. In support, I will describe extant work from the literature under the unifying themes of meta-games and second-order information.

Short Bio:

David Balduzzi is a researcher at Google DeepMind. He did his PhD in representation theory and algebraic geometry at the University of Chicago. After that he worked on computational neuroscience at UW-Madison and machine learning at the MPI for Intelligent Systems, ETH Zürich and Victoria University Wellington. He now works on game theory and machine learning at DeepMind.

Aryan Mokhtari (UT Austin)

Title:

Understanding the Role of Optimism in Minimax Optimization: A Proximal Point Approach

Link to video

Abstract:

In this talk, we consider solving saddle point problems, and, in particular, we discuss the concept of “optimism” or “negative momentum”, a technique which is observed to have superior empirical performance in training GANs. The goal of this talk is to provide a theoretical understanding of why optimism helps, and in particular why the Optimistic Gradient Descent Ascent (OGDA) algorithm performs well in practice. To do so, we first consider the classical Proximal Point algorithm, which is an implicit algorithm to solve this problem. We then show that OGDA inherently tries to approximate the proximal point method, and this is the rationale behind the “negative momentum” term in the update of OGDA. This proximal point approximation viewpoint also enables us to provide a much simpler analysis of another well-studied algorithm, the Extra-Gradient (EG) method.
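
As a rough illustration of the comparison described in this abstract, the sketch below runs plain gradient descent ascent (GDA), the proximal point method, OGDA, and EG on a toy bilinear game. The objective f(x, y) = x * y, the step size, the iteration count, and all function names are assumptions chosen for illustration; none of them come from the talk. On this problem the implicit proximal point step can be computed exactly with a linear solve, and OGDA is written in the common form z_{k+1} = z_k - 2 * eta * F(z_k) + eta * F(z_{k-1}).

```python
import numpy as np

# Toy saddle-point problem (an assumption for illustration):
#   min_x max_y f(x, y) = x * y, with its saddle point at the origin.
# With z = (x, y), the update field is F(z) = (df/dx, -df/dy) = (y, -x) = A @ z.
A = np.array([[0.0, 1.0], [-1.0, 0.0]])


def F(z):
    return A @ z


def gda(z0, eta, steps):
    # Simultaneous gradient descent ascent: z <- z - eta * F(z).
    z = z0.copy()
    for _ in range(steps):
        z = z - eta * F(z)
    return z


def proximal_point(z0, eta, steps):
    # Implicit update z_next = z - eta * F(z_next). Because F is linear here,
    # this amounts to solving (I + eta * A) z_next = z exactly.
    M = np.eye(2) + eta * A
    z = z0.copy()
    for _ in range(steps):
        z = np.linalg.solve(M, z)
    return z


def ogda(z0, eta, steps):
    # Optimistic GDA: a gradient step plus a correction using the previous
    # gradient (the "negative momentum" term).
    z_prev, z = z0.copy(), z0.copy()
    for _ in range(steps):
        z_next = z - 2.0 * eta * F(z) + eta * F(z_prev)
        z_prev, z = z, z_next
    return z


def extragradient(z0, eta, steps):
    # Extra-gradient: take a trial step, then update using the field
    # evaluated at the trial point.
    z = z0.copy()
    for _ in range(steps):
        z_mid = z - eta * F(z)
        z = z - eta * F(z_mid)
    return z


def final_distances(eta=0.1, steps=500):
    # Distance to the saddle point (the origin) after `steps` iterations.
    z0 = np.array([1.0, 1.0])
    methods = {
        "GDA": gda,
        "proximal point": proximal_point,
        "OGDA": ogda,
        "EG": extragradient,
    }
    return {
        name: float(np.linalg.norm(method(z0, eta, steps)))
        for name, method in methods.items()
    }


if __name__ == "__main__":
    for name, dist in final_distances().items():
        print(f"{name:>15s}: distance to saddle = {dist:.4f}")
```

With these settings plain GDA moves away from the saddle point, while the other three methods move toward it, which is consistent with viewing OGDA and EG as approximations of the proximal point step.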

Short Bio:

Aryan Mokhtari is an Assistant Professor in the ECE Department of the University of Texas at Austin (UT Austin). Before joining UT Austin, he was a Postdoctoral Associate in the Laboratory for Information and Decision Systems (LIDS) at MIT. Before that, he was a Research Fellow at the Simons Institute for the Theory of Computing at UC Berkeley. His research interests include the areas of optimization, machine learning, and artificial intelligence. His current research focuses on the theory and applications of convex and non-convex optimization in large-scale machine learning and data science problems.

Morning Schedule

Time	Speaker	Title
8:15	Ioannis Mitliagkas	Opening remarks
8:30	Invited talk, Eva Tardos	Learning in dynamic multi-agent environments [abstract] [video]
9:10	Poster spotlights	[video] starts at 55:33
	David Fridovich-Keil	Stable, Efficient Solutions for Differential Games with Feedback Linearizable Dynamics [PDF]
	Eric Mazumdar	Policy Gradient in Linear Quadratic Dynamic Games Has No Convergence Guarantees [PDF]
	Olya Ohrimenko	Collaborative Machine Learning Markets [PDF]
	Yan Yan	Sharp Analysis of Simple Restarted Stochastic Gradient for Min-Max Optimization [PDF]
	Guojun Zhang	Convergence Behaviour of Some Gradient-Based Methods on Bilinear Zero-Sum Games [PDF]
	Shuang Li	Cubic Regularization for Differentiable Games [PDF]
	Shuang Li	Geometry Correspondence between Empirical and Population Games [PDF]
	Kevin Lai	Last-iterate convergence rates for min-max optimization [PDF]
	Mingrui Liu†	Decentralized Parallel Algorithm for Training Generative Adversarial Nets [PDF]
	Lisa Lee	Efficient Exploration via State Marginal Matching [PDF]
9:30	Poster session + Coffee break
11:00	Invited talk, David Balduzzi	Composition, learning, and games [abstract] [video]
11:40	Contributed talk, Praneeth Netrapalli	What is Local Optimality in Nonconvex-Nonconcave Minimax Optimization? [video] starts at 41:35
12:05	Contributed talk, Tanner Fiez	Characterizing Equilibria in Stackelberg Games [video] starts at 1:06:03
12:30	Lunch break

Afternoon Schedule

Time	Speaker	Title
14:00	Invited talk, Fei Fang	Integrating Machine Learning with Game Theory for Societal Challenges [abstract] [video]
14:40	Contributed talk, Yuanhao Wang	On Solving Local Minimax Optimization: A Follow-the-Ridge Approach [video] starts at 35:05
15:05	Contributed talk, Elizabeth Bondi	Exploiting Uncertain Real-Time Information from Deep Learning in Signaling Games for Security and Sustainability [video] starts at 1:01:44
15:30	Coffee break
16:00	Invited talk, Aryan Mokhtari	Understanding the role of optimism in minimax optimization [abstract] [video]
16:40	Poster spotlights	[video] starts at 36:16
	Andrew Bennett	Deep Generalized Method of Moments for Instrumental Variable Analysis [PDF]
	Moksh Jain	Proximal Policy Optimization for Improved Convergence in IRGAN [PDF]
	Ryan D'Orazio	Bounds for Approximate Regret-Matching Algorithms [PDF]
	Benjamin Chasnov	Opponent Anticipation via Conjectural Variations [PDF]
	Hongkai Zheng	Implicit competitive regularization in GANs [PDF]
	Christos Tsirigotis	Objectives Towards Stable Adversarial Training Without Gradient Penalties [PDF]
	Ian Gemp	The Unreasonable Effectiveness of Adam on Cycles [PDF]
	Konstantin Mishchenko	Revisiting Stochastic Extragradient [PDF]
	Gabriele Farina	Compositional Calculus of Regret Minimizers [PDF]
	Adam Lerer	Search in Cooperative Partially Observable Games [PDF]
	Ioannis Panageas	Last-Iterate Convergence: Zero-Sum Games and Constrained Min-Max Optimization [PDF]
17:00	Discussion panel	David Balduzzi, Elizabeth Bondi, Noam Brown, Praneeth Netrapalli, Eva Tardos, Jakob Foerster [video] starts at 1:00:00
17:30	Organizers	Concluding remarks -- afternoon poster session [video] starts at 1:43:20
18:30	Workshop ends

Call for Contributions

We are soliciting contributions that address one of the questions below or, secondarily, another question at the intersection of modern machine learning and games. This year we are particularly interested in accepting work that uses non-standard formulations and applications for games in ML.

How can we integrate learning with game theory? (e.g. [Schuurmans et al., 2016])
How can we inject deep learning into games and vice versa (e.g., actor-critic formulations can be cast as a game)?
What are the practical implications and applications?
How do we go beyond the standard GAN discussion and model general agents that interact with each other in a learning context?
What can we say about the existence and uniqueness results of equilibria in smooth games?
Can approximate mixed equilibria have better properties than the exact ones? [Arora et al., 2017] [Lipton et al., 2002]
Can we define a weaker notion of solution than Nash Equilibria? [Papadimitriou, Piliouras, 2018]
Can we compare the quality/performance of Nash equilibria/cycles? Are there points that have a better quality/outcome than Nash equilibria? [Kleinberg et al., 2011]
How do we design efficient algorithms that are guaranteed to achieve the desired solutions?
Finally, how do we design better objectives to match a specific ML task at hand?

Submission details

A submission should take the form of an anonymous extended abstract (2-4 pages long excluding references) in PDF format using the following modified NeurIPS style. The submission process will be handled via CMT. Previously published (or under-review) work is acceptable, though it needs to be clearly indicated as such when submitting. Please indicate, in a footnote in the PDF itself, the venue where the work has been submitted. Submissions can be accepted as contributed talks, spotlights, or poster presentations (all accepted submissions can have a poster). Extended abstracts must be submitted by September 16, 2019 (11:59pm AoE). Final versions will be posted on the workshop website (and are archival but do not constitute a proceedings).

A limited number of NeurIPS registration slots will be available for accepted talks and posters to this workshop. We do not have control over the number, so it might happen that not all accepted posters get a slot. We strongly advise you to first try to register through the NeurIPS lottery to increase your chances of getting a registration slot. If you get a registration slot it means that you are guaranteed the ability to register, but you will have to pay for the registration.

Key Dates:

Abstract submission deadline: September 16, 2019 (11:59pm AoE) via CMT
Acceptance notification: October 1, 2019

Organizers

Ioannis Mitliagkas (Mila & University of Montreal)

Ioannis Mitliagkas is an assistant professor in the department of Computer Science and Operations Research (DIRO) at the University of Montréal. Before that, he was a Postdoctoral Scholar with the departments of Statistics and Computer Science at Stanford University. He obtained his Ph.D. from the department of Electrical and Computer Engineering at The University of Texas at Austin. His research includes topics in optimization, statistical learning and inference, and efficient large-scale and distributed algorithms.
He is particularly interested in the dynamics of optimization, like momentum methods, in the presence of system dynamics, adaptivity, and lately, smooth two-player games (ongoing work).

Gauthier Gidel (Mila & University of Montreal)

Gauthier Gidel is a PhD candidate at Mila, Université de Montréal, under the supervision of Simon Lacoste-Julien. Before that, he received the Diplôme de l’École Normale Supérieure in 2017 (ULM MPI2013) and the Master of Science MVA from École Normale Supérieure Paris-Saclay in 2016. Gauthier’s PhD thesis revolves around saddle point optimization (a.k.a. minimax problems) for machine learning and, more generally, around understanding the practical and theoretical challenges of differentiable game optimization for multi-agent learning. He is also a recipient of a Graduate Borealis AI fellowship.

Niao He (UIUC)

Niao He is an assistant professor in the Department of Industrial and Enterprise Systems Engineering and the Coordinated Science Laboratory at the University of Illinois at Urbana-Champaign. Before joining Illinois, she received her Ph.D. degree in Operations Research from Georgia Institute of Technology in 2015 and her B.S. degree in Mathematics from the University of Science and Technology of China in 2010. Her research interests are in large-scale optimization and machine learning, with a primary focus on bridging modern optimization theory and algorithms with core machine learning topics, like Bayesian inference, reinforcement learning, and adversarial learning. She is also a recipient of the NSF CISE Research Initiation Initiative (CRII) Award and the NCSA Faculty Fellowship.

Reyhane Askari (Mila & University of Montreal)

Reyhane Askari is a PhD student at Mila, Université de Montréal. She works under the supervision of Ioannis Mitliagkas (UdeM) and Nicolas Le Roux (Google Brain). Prior to her PhD, she received her Master's in Computer Science from Université de Montréal and then worked for two years as a Machine Learning engineer at Mila. During that time she worked on several open-source software projects for deep learning such as Theano, Orion and Cortex. She also completed her bachelor's degree in Computer Engineering at Amirkabir University of Technology (Tehran Polytechnic).
Her research interests lie in understanding accelerated methods in single-objective and multi-objective settings using tools from dynamical systems.

Nika Haghtalab (Cornell University)

Nika Haghtalab is an Assistant Professor in the Department of Computer Science at Cornell University. She works broadly on the theoretical aspects of machine learning and algorithmic economics. She especially cares about developing a theory for machine learning that accounts for its interactions with people and organizations, and the wide range of social and economic limitations, aspirations, and behaviors they demonstrate. Prior to Cornell, she was a postdoctoral researcher at Microsoft Research, New England, in 2018-2019.
She received her Ph.D. from the Computer Science Department of Carnegie Mellon University, where she was co-advised by Avrim Blum and Ariel Procaccia. Her thesis, titled “Foundation of Machine Learning, by the People, for the People”, received the CMU School of Computer Science Dissertation Award (2018) and a SIGecom Dissertation Honorable Mention Award (2019).

Simon Lacoste-Julien (Mila & University of Montreal)

Simon Lacoste-Julien is a CIFAR fellow and an assistant professor at Mila and DIRO at Université de Montréal. His research interests are machine learning and applied math, with applications to computer vision and natural language processing. He obtained a B.Sc. in math, physics and computer science from McGill and a PhD in computer science from UC Berkeley, and completed a postdoc at the University of Cambridge. He spent a few years as research faculty at INRIA and École normale supérieure in Paris before coming back to his roots in Montreal in 2016.
Simon has published several papers at the intersection of mathematical programming and machine learning, in particular on solving min-max games. He is a frequent participant in the NeurIPS OPT workshop series, and co-organized the NeurIPS 2009 workshop on “The Generative & Discriminative Learning Interface”.

Acknowledgement to TPC

We would like to thank the following members of the technical program committee for participating in the review process for the workshop.

Abhishek Gupta, Aryan Mokhtari, Bert Huang, Chi Jin, Chidubem G Arachie, Damien Scieur, Daniel Hennes, David Balduzzi, Jan Balaguer, Jason Lee, Konstantin Mishchenko, Marc Lanctot, Maxim Raginsky, Nicolas Loizou, Panayotis Mertikopoulos, Pavel Dvurechensky, Sarath Pattathil, Thomas Anthony, Tianbao Yang, Volkan Cevher, Yair Carmon

Relevant References

Abernethy, J.D., Bartlett, P.L., Rakhlin, A., Tewari, A., Optimal strategies and minimax lower bounds for online convex games. In COLT 2009.

Arora, S., Ge, R., Liang, Y., Ma, T., Zhang, Y., Generalization and Equilibrium in Generative Adversarial Nets (GANs). In ICML 2017.

Balduzzi, D., Racaniere, S., Martens, J., Foerster, J., Tuyls, K., Graepel, T., The Mechanics of n-Player Differentiable Games. In ICML 2018.

Daskalakis, C., Goldberg, P., Papadimitriou, C., The Complexity of Computing a Nash Equilibrium. SIAM J. Comput., 2009.

Daskalakis, C., Ilyas, A., Syrgkanis, V., Zeng, H., Training GANs with Optimism. In ICLR 2018.

Ewerhart, C., Ordinal Potentials in Smooth Games (SSRN Scholarly Paper No. ID 3054604). Social Science Research Network, Rochester, NY, 2017.

Fedus, W., Rosca, M., Lakshminarayanan, B., Dai, A.M., Mohamed, S., Goodfellow, I., Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step. In ICLR 2018.

Gidel, G., Jebara, T., Lacoste-Julien, S., Frank-Wolfe Algorithms for Saddle Point Problems. In AISTATS 2017.

Gidel, G., Berard, H., Vincent, P., Lacoste-Julien, S., A Variational Inequality Perspective on Generative Adversarial Networks. arXiv:1802.10551 [cs, math, stat], 2018.

Grnarova, P., Levy, K.Y., Lucchi, A., Hofmann, T., Krause, A., An Online Learning Approach to Generative Adversarial Networks. In ICLR 2018.

Harker, P.T., Pang, J.-S., Finite-dimensional variational inequality and nonlinear complementarity problems: A survey of theory, algorithms and applications. Mathematical Programming, 1990.

Hazan, E., Singh, K., Zhang, C., Efficient Regret Minimization in Non-Convex Games. In ICML 2017.

Karlin, S., Weiss, G., The Theory of Infinite Games, Mathematical Methods and Theory in Games, Programming, and Economics, 1959.

Lipton, R.J., Young, N.E., Simple Strategies for Large Zero-sum Games with Applications to Complexity Theory. In STOC 1994.

Mescheder, L., Nowozin, S., Geiger, A., The Numerics of GANs. In NeurIPS 2017.

Nisan, N., Roughgarden, T., Tardos, E., Vazirani, V., Algorithmic Game Theory. Cambridge University Press, 2007.

Pfau, D., Vinyals, O., Connecting Generative Adversarial Networks and Actor-Critic Methods. arXiv:1610.01945 [cs, stat], 2016.

Roughgarden, T., Intrinsic Robustness of the Price of Anarchy. Communications of the ACM, 2009.

Scutari, G., Palomar, D.P., Facchinei, F., Pang, J.-S., Convex Optimization, Game Theory, and Variational Inequality Theory. IEEE Signal Processing Magazine, 2010.

Syrgkanis, V., Agarwal, A., Luo, H., Schapire, R.E., Fast Convergence of Regularized Learning in Games. In NeurIPS 2015.

Von Neumann, J., Morgenstern, O., Theory of Games and Economic Behavior. Princeton University Press, 1944.

Schuurmans, D., Zinkevich, M.A., Deep Learning Games. In NeurIPS 2016.