Hot Papers 2020-08-19

1. Super-Human Performance in Gran Turismo Sport Using Deep Reinforcement Learning

Florian Fuchs, Yunlong Song, Elia Kaufmann, Davide Scaramuzza, Peter Duerr

Autonomous car racing raises fundamental robotics challenges such as planning minimum-time trajectories under uncertain dynamics and controlling the car at its friction limits. In this project, we consider the task of autonomous car racing in the top-selling car racing game Gran Turismo Sport, which is known for its detailed physics simulation of various cars and tracks. Our approach uses maximum-entropy deep reinforcement learning and a new reward design to train a sensorimotor policy that completes a given race track as fast as possible. We evaluate our approach in three time trial settings with different cars and tracks. Our results show that the obtained controllers not only beat the built-in non-player character of Gran Turismo Sport, but also outperform the fastest lap times in a dataset of personal bests from over 50,000 human drivers.
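
The core recipe is a dense, progress-based reward maximized by a maximum-entropy learner (soft actor-critic is the standard instance of that family). Below is a minimal sketch of such a racing reward; the function name, the wall-contact penalty, and the weight `w_wall` are illustrative assumptions, not the paper's exact reward design.

```python
def progress_reward(prev_s, cur_s, wall_contact, w_wall=1.0):
    """Hypothetical shaped reward for time-trial racing (a sketch, not
    the paper's exact design). prev_s and cur_s are arc-length positions
    along the track centerline in meters; maximizing per-step progress
    is a dense proxy for minimizing lap time."""
    progress = cur_s - prev_s                  # track distance gained
    penalty = w_wall * float(wall_contact)     # discourage wall riding
    return progress - penalty
```

A dense reward like this gives the policy a learning signal at every control step, rather than only at the end of a multi-minute lap.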

2. Motion Capture from Internet Videos

Junting Dong, Qing Shuai, Yuanqing Zhang, Xian Liu, Xiaowei Zhou, Hujun Bao

  • retweets: 49, favorites: 204 (08/20/2020 10:24:23)
  • links: abs | pdf
  • cs.CV

Recent advances in image-based human pose estimation make it possible to capture 3D human motion from a single RGB video. However, the inherent depth ambiguity and self-occlusion in a single view prevent the recovery of motion as high-quality as multi-view reconstruction. While multi-view videos are not common, videos of a celebrity performing a specific action are usually abundant on the Internet. Even though these videos are recorded at different times, they encode the same motion characteristics of the person. We therefore propose to capture human motion by jointly analyzing these Internet videos rather than processing each video separately. This new task poses many challenges that existing methods cannot address: the videos are unsynchronized, the camera viewpoints are unknown, the background scenes differ, and the human motions are not exactly the same across videos. To address these challenges, we propose a novel optimization-based framework and experimentally demonstrate that it recovers much more precise and detailed motion from multiple videos than monocular motion capture methods.
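
Schematically, such a joint optimization couples one shared motion with per-video unknowns. The objective below is a hedged sketch of that idea; the symbols and the regularizer are our assumptions, not the paper's exact formulation:

```latex
\min_{\Theta,\ \{P_i\},\ \{\tau_i\}}\;
\sum_{i=1}^{N}\sum_{f}
\bigl\| \Pi_{P_i}\!\bigl( J\bigl(\Theta(\tau_i(f))\bigr) \bigr) - x_{i,f} \bigr\|^2
\;+\; \lambda\, R(\Theta)
```

Here \(\Theta\) is the shared motion sequence, \(\tau_i\) a per-video temporal alignment (handling unsynchronized footage), \(P_i\) the unknown camera of video \(i\), \(J(\cdot)\) the posed 3D joints, \(\Pi_{P_i}\) the projection into video \(i\), \(x_{i,f}\) the detected 2D keypoints in frame \(f\), and \(R\) a temporal-smoothness regularizer.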

3. Whitening and second order optimization both destroy information about the dataset, and can make generalization impossible

Neha S. Wadia, Daniel Duckworth, Samuel S. Schoenholz, Ethan Dyer, Jascha Sohl-Dickstein

Machine learning is predicated on the concept of generalization: a model achieving low error on a sufficiently large training set should also perform well on novel samples from the same distribution. We show that both data whitening and second order optimization can harm or entirely prevent generalization. In general, model training harnesses information contained in the sample-sample second moment matrix of a dataset. We prove that for models with a fully connected first layer, the information contained in this matrix is the only information which can be used to generalize. Models trained using whitened data, or with certain second order optimization schemes, have less access to this information; in the high dimensional regime they have no access at all, producing models that generalize poorly or not at all. We experimentally verify these predictions for several architectures, and further demonstrate that generalization continues to be harmed even when theoretical requirements are relaxed. However, we also show experimentally that regularized second order optimization can provide a practical tradeoff, where training is still accelerated but less information is lost, and generalization can in some circumstances even improve.
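
A quick way to see the mechanism: fully whitening the data makes the sample-sample second moment matrix trivial whenever the feature dimension is at least the number of samples. A minimal numpy sketch (the `eps` regularizer and the final check are illustrative choices):

```python
import numpy as np

def whiten(X, eps=1e-8):
    """Whiten rows of X (n samples x d features) against the uncentered
    feature second moment matrix, so whitened features are decorrelated
    with unit scale. eps regularizes the inversion."""
    M = X.T @ X / len(X)                       # d x d second moment
    vals, vecs = np.linalg.eigh(M)
    W = vecs @ np.diag(1.0 / np.sqrt(vals + eps)) @ vecs.T
    return X @ W

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 512))                # high dimensional: d >= n
Xw = whiten(X)
G = Xw @ Xw.T                                  # sample-sample second moment
# After whitening, G collapses to a multiple of the identity: the matrix
# the paper identifies as the only usable signal for models with a fully
# connected first layer now carries no dataset-specific information.
print(np.allclose(G, len(X) * np.eye(len(X)), atol=1e-3))
```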

4. Drawing Shortest Paths in Geodetic Graphs

Sabine Cornelsen, Maximilian Pfister, Henry Förster, Martin Gronemann, Michael Hoffmann, Stephen Kobourov, Thomas Schneck

  • retweets: 7, favorites: 48 (08/20/2020 10:24:24)
  • links: abs | pdf
  • cs.DM | cs.CG

Motivated by the fact that in a space where shortest paths are unique, no two shortest paths meet twice, we study a question posed by Greg Bodwin: Given a geodetic graph G, i.e., an unweighted graph in which the shortest path between any pair of vertices is unique, is there a philogeodetic drawing of G, i.e., a drawing of G in which the curves of any two shortest paths meet at most once? We answer this question in the negative by showing the existence of geodetic graphs that require some pair of shortest paths to cross at least four times. The bound on the number of crossings is tight for the class of graphs we construct. Furthermore, we exhibit geodetic graphs of diameter two that do not admit a philogeodetic drawing.
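
For readers who want to experiment, the geodetic property is easy to check by counting shortest paths with BFS; the helper and the cycle examples below are our own illustration, not from the paper.

```python
from collections import deque

def is_geodetic(adj):
    """Return True iff the unweighted graph is geodetic, i.e. the
    shortest path between every pair of vertices is unique.
    adj: dict mapping vertex -> iterable of neighbours."""
    for s in adj:
        dist, count = {s: 0}, {s: 1}
        q = deque([s])
        while q:                               # BFS with path counting
            u = q.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    count[v] = count[u]
                    q.append(v)
                elif dist[v] == dist[u] + 1:   # another shortest path
                    count[v] += count[u]
        if any(c > 1 for c in count.values()):
            return False
    return True

C5 = {i: [(i - 1) % 5, (i + 1) % 5] for i in range(5)}  # odd cycle
C4 = {i: [(i - 1) % 4, (i + 1) % 4] for i in range(4)}  # even cycle
assert is_geodetic(C5) and not is_geodetic(C4)
```

Odd cycles are geodetic, while even cycles are not: opposite vertices of C4 are joined by two distinct shortest paths.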

5. Inductive logic programming at 30: a new introduction

Andrew Cropper, Sebastijan Dumančić

  • retweets: 8, favorites: 47 (08/20/2020 10:24:24)
  • links: abs | pdf
  • cs.AI | cs.LG

Inductive logic programming (ILP) is a form of machine learning. The goal of ILP is to induce a logic program (a set of logical rules) that generalises training examples. As ILP approaches 30, we provide a new introduction to the field. We introduce the necessary logical notation and the main ILP learning settings. We describe the main building blocks of an ILP system. We compare several ILP systems on several dimensions. We detail four systems (Aleph, TILDE, ASPAL, and Metagol). We contrast ILP with other forms of machine learning. Finally, we summarise the current limitations and outline promising directions for future research.
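
To make the setting concrete, here is a toy generate-and-test sketch in Python (our own illustration, not Aleph, TILDE, ASPAL, or Metagol): induce grandparent/2 from parent/2 facts by testing chain-rule bodies against positive and negative examples. All facts and examples are invented for the demo.

```python
# Background knowledge: parent/2 facts as (parent, child) pairs.
parent = {("ann", "bob"), ("bob", "carl"), ("ann", "beth"), ("beth", "dora")}
relations = {"parent": parent}

pos = {("ann", "carl"), ("ann", "dora")}   # grandparent(X, Y) should hold
neg = {("bob", "ann"), ("carl", "beth")}   # grandparent(X, Y) should fail

def derives(r1, r2, x, y):
    """Does head(X, Y) :- r1(X, Z), r2(Z, Y) derive the pair (x, y)?"""
    return any((x, z) in r1 and (z, y) in r2 for _, z in r1)

# Generate candidate chain rules; keep those covering every positive
# example and no negative example.
for n1, r1 in relations.items():
    for n2, r2 in relations.items():
        if all(derives(r1, r2, *e) for e in pos) and \
           not any(derives(r1, r2, *e) for e in neg):
            print(f"grandparent(X,Y) :- {n1}(X,Z), {n2}(Z,Y).")
```

Real ILP systems replace this brute-force search with principled hypothesis-space ordering and pruning, but the induced artifact is the same kind of object: a logical rule that generalises the examples.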

6. Moment Multicalibration for Uncertainty Estimation

Christopher Jung, Changhwa Lee, Mallesh M. Pai, Aaron Roth, Rakesh Vohra

We show how to achieve the notion of “multicalibration” from Hébert-Johnson et al. [2018] not just for means, but also for variances and other higher moments. Informally, this means we can find regression functions which, given a data point, can make point predictions not just for the expectation of its label but for higher moments of its label distribution as well, and those predictions match the true distribution quantities when averaged not just over the population as a whole, but also over an enormous number of finely defined subgroups. This yields a principled way to estimate the uncertainty of predictions on many different subgroups, and to diagnose potential sources of unfairness in the predictive power of features across subgroups. As an application, we show that our moment estimates can be used to derive marginal prediction intervals that are simultaneously valid as averaged over all of the (sufficiently large) subgroups for which moment multicalibration has been obtained.
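
The flavor of the guarantee is easy to audit empirically. The sketch below (the function name and the variance check are our assumptions, not the paper's algorithm) measures per-subgroup gaps between predicted and realized first and second moments:

```python
import numpy as np

def moment_gaps(pred_mean, pred_var, y, groups):
    """Audit first- and second-moment calibration per subgroup (our own
    diagnostic sketch). groups maps a name to a boolean mask selecting
    that subgroup's points; multicalibration asks every gap to be small
    on every sufficiently large subgroup."""
    gaps = {}
    for name, mask in groups.items():
        mean_gap = abs(pred_mean[mask].mean() - y[mask].mean())
        # predicted variance vs. realized squared residuals
        var_gap = abs(pred_var[mask].mean()
                      - ((y[mask] - pred_mean[mask]) ** 2).mean())
        gaps[name] = (mean_gap, var_gap)
    return gaps

rng = np.random.default_rng(1)
y = rng.normal(size=1000)
groups = {"even": np.arange(1000) % 2 == 0, "odd": np.arange(1000) % 2 == 1}
print(moment_gaps(np.zeros(1000), np.ones(1000), y, groups))
```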

7. Mesh Guided One-shot Face Reenactment using Graph Convolutional Networks

Guangming Yao, Yi Yuan, Tianjia Shao, Kun Zhou

  • retweets: 12, favorites: 39 (08/20/2020 10:24:24)
  • links: abs | pdf
  • cs.CV

Face reenactment aims to animate a source face image to a different pose and expression provided by a driving image. Existing approaches are either designed for a specific identity or suffer from identity preservation problems in one-shot or few-shot scenarios. In this paper, we introduce a method for one-shot face reenactment, which uses the reconstructed 3D meshes (i.e., the source mesh and driving mesh) as guidance to learn the optical flow needed for the reenacted face synthesis. Technically, we explicitly exclude the driving face's identity information from the reconstructed driving mesh. In this way, our network can focus on motion estimation for the source face without interference from the driving face's shape. We propose a motion net, an asymmetric autoencoder, to learn the face motion. The encoder is a graph convolutional network (GCN) that learns a latent motion vector from the meshes, and the decoder produces an optical flow image from the latent vector with CNNs. Compared to previous methods that use sparse keypoints to guide optical flow learning, our motion net learns the optical flow directly from dense 3D meshes, which provide detailed shape and pose information, so it achieves more accurate expression and pose on the reenacted face. Extensive experiments show that our method generates high-quality results and outperforms state-of-the-art methods in both qualitative and quantitative comparisons.
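
To ground the architecture description, here is a hedged PyTorch sketch of such an asymmetric autoencoder. All layer sizes, the 64x64 flow resolution, and the normalized-adjacency input are illustrative assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn

class GCNLayer(nn.Module):
    """One graph convolution: aggregate neighbour features via a fixed
    normalized mesh adjacency, then apply a shared linear map."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.lin = nn.Linear(in_dim, out_dim)

    def forward(self, x, adj):                 # x: (B, V, C), adj: (V, V)
        return torch.relu(self.lin(adj @ x))

class MotionNet(nn.Module):
    """Asymmetric autoencoder sketch: a GCN encoder maps source and
    driving mesh vertices to a latent motion vector; a CNN decoder
    upsamples it into a 2-channel optical-flow image."""
    def __init__(self, n_verts, latent=256):
        super().__init__()
        self.enc1 = GCNLayer(6, 32)            # source + driving coords
        self.enc2 = GCNLayer(32, 64)
        self.to_latent = nn.Linear(n_verts * 64, latent)
        self.decode = nn.Sequential(           # latent -> 64x64 flow
            nn.Linear(latent, 4 * 4 * 128), nn.ReLU(),
            nn.Unflatten(1, (128, 4, 4)),
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(16, 2, 4, stride=2, padding=1),  # (dx, dy)
        )

    def forward(self, src_mesh, drv_mesh, adj):
        x = torch.cat([src_mesh, drv_mesh], dim=-1)   # (B, V, 6)
        h = self.enc2(self.enc1(x, adj), adj)         # (B, V, 64)
        z = self.to_latent(h.flatten(1))              # latent motion vector
        return self.decode(z)                         # (B, 2, 64, 64)
```

The asymmetry mirrors the description above: graph convolutions suit mesh-structured input, while transposed convolutions suit the image-structured flow output.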