Hot Papers 2020-09-14

1. Adversarial score matching and improved sampling for image generation

Alexia Jolicoeur-Martineau, Rémi Piché-Taillefer, Rémi Tachet des Combes, Ioannis Mitliagkas

retweets: 72, favorites: 378 (09/15/2020 09:38:55)
links: abs | pdf
cs.LG | cs.CV | stat.ML

Denoising score matching with Annealed Langevin Sampling (DSM-ALS) is a recent approach to generative modeling. Despite the convincing visual quality of samples, this method appears to perform worse than Generative Adversarial Networks (GANs) under the Fr’echet Inception Distance, a popular metric for generative models. We show that this apparent gap vanishes when denoising the final Langevin samples using the score network. In addition, we propose two improvements to DSM-ALS: 1) Consistent Annealed Sampling as a more stable alternative to Annealed Langevin Sampling, and 2) a hybrid training formulation,composed of both denoising score matching and adversarial objectives. By combining both of these techniques and exploring different network architectures, we elevate score matching methods and obtain results competitive with state-of-the-art image generation on CIFAR-10.

New paper on adversarial😠 score matching and an alternative to Langevin Sampling for better generative models! 😸 We show how we can obtain results better than SOTA GANs. 😻

Blog: https://t.co/Lqje2m1jvr
Paper: https://t.co/dkxgDRKcCP
Code: https://t.co/ajc3TkoomS
— Alexia Jolicoeur-Martineau (@jm_alexia) September 14, 2020

Adversarial score matching and improved sampling for image generation
pdf: https://t.co/3wIanQDeZZ
abs: https://t.co/Wautk7zAYh
github: https://t.co/z5TMa2tFIW pic.twitter.com/IvBfTmhooi
— AK (@ak92501) September 14, 2020

2. Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

Mehdi Mirza, Andrew Jaegle, Jonathan J. Hunt, Arthur Guez, Saran Tunyasuvunakool, Alistair Muldal, Théophane Weber, Peter Karkus, Sébastien Racanière, Lars Buesing, Timothy Lillicrap, Nicolas Heess

retweets: 37, favorites: 228 (09/15/2020 09:38:56)
links: abs | pdf
cs.AI | cs.LG

Recent work in deep reinforcement learning (RL) has produced algorithms capable of mastering challenging games such as Go, chess, or shogi. In these works the RL agent directly observes the natural state of the game and controls that state directly with its actions. However, when humans play such games, they do not just reason about the moves but also interact with their physical environment. They understand the state of the game by looking at the physical board in front of them and modify it by manipulating pieces using touch and fine-grained motor control. Mastering complicated physical systems with abstract goals is a central challenge for artificial intelligence, but it remains out of reach for existing RL algorithms. To encourage progress towards this goal we introduce a set of physically embedded planning problems and make them publicly available. We embed challenging symbolic tasks (Sokoban, tic-tac-toe, and Go) in a physics engine to produce a set of tasks that require perception, reasoning, and motor control over long time horizons. Although existing RL algorithms can tackle the symbolic versions of these tasks, we find that they struggle to master even the simplest of their physically embedded counterparts. As a first step towards characterizing the space of solution to these tasks, we introduce a strong baseline that uses a pre-trained expert game player to provide hints in the abstract space to an RL agent’s policy while training it on the full sensorimotor control task. The resulting agent solves many of the tasks, underlining the need for methods that bridge the gap between abstract planning and embodied control.

"Physically Embedded Planning Problems: New Challenges for Reinforcement Learning"https://t.co/GllnwqsbfH

the lack of reference to robotics folks working on this for decades and reinventing the problem as your own!
gotta do better @DeepMind

How "New" is this? pic.twitter.com/H1z4R2bLgq
— Animesh Garg (@animesh_garg) September 14, 2020

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
pdf: https://t.co/hg4OXJYyQ7
abs: https://t.co/XNix3q3ABJ pic.twitter.com/ZzZgSxM2Yv
— AK (@ak92501) September 14, 2020

3. Attribute-conditioned Layout GAN for Automatic Graphic Design

Jianan Li, Jimei Yang, Jianming Zhang, Chang Liu, Christina Wang, Tingfa Xu

retweets: 19, favorites: 60 (09/15/2020 09:38:56)
links: abs | pdf
cs.CV | cs.GR

Modeling layout is an important first step for graphic design. Recently, methods for generating graphic layouts have progressed, particularly with Generative Adversarial Networks (GANs). However, the problem of specifying the locations and sizes of design elements usually involves constraints with respect to element attributes, such as area, aspect ratio and reading-order. Automating attribute conditional graphic layouts remains a complex and unsolved problem. In this paper, we introduce Attribute-conditioned Layout GAN to incorporate the attributes of design elements for graphic layout generation by forcing both the generator and the discriminator to meet attribute conditions. Due to the complexity of graphic designs, we further propose an element dropout method to make the discriminator look at partial lists of elements and learn their local patterns. In addition, we introduce various loss designs following different design principles for layout optimization. We demonstrate that the proposed method can synthesize graphic layouts conditioned on different element attributes. It can also adjust well-designed layouts to new sizes while retaining elements’ original reading-orders. The effectiveness of our method is validated through a user study.

Attribute-conditioned Layout GAN for Automatic Graphic Design
pdf: https://t.co/LQfQHo6F5O
abs: https://t.co/K5FIPeNSNu pic.twitter.com/t2BOQWLELv
— AK (@ak92501) September 14, 2020

4. Forecasting timelines of quantum computing

Jaime Sevilla, C. Jess Riedel

retweets: 11, favorites: 54 (09/15/2020 09:38:56)
links: abs | pdf
quant-ph | cs.ET

We consider how to forecast progress in the domain of quantum computing. For this purpose we collect a dataset of quantum computer systems to date, scored on their physical qubits and gate error rate, and we define an index combining both metrics, the generalized logical qubit. We study the relationship between physical qubits and gate error rate, and tentatively conclude that they are positively correlated (albeit with some room for doubt), indicating a frontier of development that trades-off between them. We also apply a log-linear regression on the metrics to provide a tentative upper bound on how much progress can be expected over time. Within the (generally optimistic) assumptions of our model, including the key assumption that exponential progress in qubit count and gate fidelity will continue, we estimate that that proof-of-concept fault-tolerant computation based onsuperconductor technology is unlikely (<5% confidence) to be exhibited before 2026, and that quantum devices capable of factoring RSA-2048 are unlikely (<5% confidence) to exist before 2039. It is of course possible that these milestones will in fact be reached earlier, but that this would require faster progress than has yet been seen.

Strong work @Jess_Riedel https://t.co/PfXem5Cj59
— Graeme Smith (@quantum_graeme) September 14, 2020

5. Object Recognition for Economic Development from Daytime Satellite Imagery

Klaus Ackermann, Alexey Chernikov, Nandini Anantharama, Miethy Zaman, Paul A Raschky

retweets: 11, favorites: 53 (09/15/2020 09:38:56)
links: abs | pdf
econ.GN | cs.CV | eess.IV

Reliable data about the stock of physical capital and infrastructure in developing countries is typically very scarce. This is particular a problem for data at the subnational level where existing data is often outdated, not consistently measured or coverage is incomplete. Traditional data collection methods are time and labor-intensive costly, which often prohibits developing countries from collecting this type of data. This paper proposes a novel method to extract infrastructure features from high-resolution satellite images. We collected high-resolution satellite images for 5 million 1km $\times$ 1km grid cells covering 21 African countries. We contribute to the growing body of literature in this area by training our machine learning algorithm on ground-truth data. We show that our approach strongly improves the predictive accuracy. Our methodology can build the foundation to then predict subnational indicators of economic development for areas where this data is either missing or unreliable.

After some long tinkering, we've managed to write up a describtion our new method for "Object Recognition for Economic Development from Daytime Satellite Imagery" (Klaus Ackermann, Alexey Chernikov, Nandini Anantharama, Miethy Zaman) 1/Nhttps://t.co/qecQqMcAFu
— Paul Raschky (@PaulRaschky) September 14, 2020

Published 15 Sep 2020

ML Lead at Beatrust. (https://beatrust.com)Tatsuya Shirakawa on Twitter