Hot Papers 2020-07-13

1. Geometric Style Transfer

Xiao-Chang Liu, Xuan-Yi Li, Ming-Ming Cheng, Peter Hall

retweets: 31, favorites: 122 (07/14/2020 09:16:53)
links: abs | pdf
cs.CV

Neural style transfer (NST), where an input image is rendered in the style of another image, has been a topic of considerable progress in recent years. Research over that time has been dominated by transferring aspects of color and texture, yet these factors are only one component of style. Other factors of style include composition, the projection system used, and the way in which artists warp and bend objects. Our contribution is to introduce a neural architecture that supports transfer of geometric style. Unlike recent work in this area, we are unique in being general in that we are not restricted by semantic content. This new architecture runs prior to a network that transfers texture style, enabling us to transfer texture to a warped image. This form of network supports a second novelty: we extend the NST input paradigm. Users can input content/style pair as is common, or they can chose to input a content/texture-style/geometry-style triple. This three image input paradigm divides style into two parts and so provides significantly greater versatility to the output we can produce. We provide user studies that show the quality of our output, and quantify the importance of geometric style transfer to style recognition by humans.

Geometric Style Transfer
pdf: https://t.co/iIxTtl094Y
abs: https://t.co/z70KyJiUOf pic.twitter.com/S09arNjg1U
— AK (@ak92501) July 13, 2020

2. Representations for Stable Off-Policy Reinforcement Learning

Dibya Ghosh, Marc G. Bellemare

retweets: 20, favorites: 106 (07/14/2020 09:16:54)
links: abs | pdf
cs.LG | cs.AI | stat.ML

Reinforcement learning with function approximation can be unstable and even divergent, especially when combined with off-policy learning and Bellman updates. In deep reinforcement learning, these issues have been dealt with empirically by adapting and regularizing the representation, in particular with auxiliary tasks. This suggests that representation learning may provide a means to guarantee stability. In this paper, we formally show that there are indeed nontrivial state representations under which the canonical TD algorithm is stable, even when learning off-policy. We analyze representation learning schemes that are based on the transition matrix of a policy, such as proto-value functions, along three axes: approximation error, stability, and ease of estimation. In the most general case, we show that a Schur basis provides convergence guarantees, but is difficult to estimate from samples. For a fixed reward function, we find that an orthogonal basis of the corresponding Krylov subspace is an even better choice. We conclude by empirically demonstrating that these stable representations can be learned using stochastic gradient descent, opening the door to improved techniques for representation learning with deep networks.

Super happy to release this new paper with @marcgbellemare! We formally study how representation learning can be used to stabilize off-policy RL.

ArXiv: https://t.co/OTbQtHmvBx
At ICML 2020: https://t.co/hmc1TniY6V

1/6 pic.twitter.com/h5cPj3pite
— Dibya Ghosh (@its_dibya) July 13, 2020

3. Optical Flow Distillation: Towards Efficient and Stable Video Style Transfer

Xinghao Chen, Yiman Zhang, Yunhe Wang, Han Shu, Chunjing Xu, Chang Xu

retweets: 13, favorites: 51 (07/14/2020 09:16:54)
links: abs | pdf
cs.CV

Video style transfer techniques inspire many exciting applications on mobile devices. However, their efficiency and stability are still far from satisfactory. To boost the transfer stability across frames, optical flow is widely adopted, despite its high computational complexity, e.g. occupying over 97% inference time. This paper proposes to learn a lightweight video style transfer network via knowledge distillation paradigm. We adopt two teacher networks, one of which takes optical flow during inference while the other does not. The output difference between these two teacher networks highlights the improvements made by optical flow, which is then adopted to distill the target student network. Furthermore, a low-rank distillation loss is employed to stabilize the output of student network by mimicking the rank of input videos. Extensive experiments demonstrate that our student network without an optical flow module is still able to generate stable video and runs much faster than the teacher network.

Optical Flow Distillation: Towards Efficient and Stable Video Style Transfer
pdf: https://t.co/9H68PeAoIH
abs: https://t.co/tf6V3z33aQ pic.twitter.com/QZ6Yyg2uVt
— AK (@ak92501) July 13, 2020

4. Attack of the Tails: Yes, You Really Can Backdoor Federated Learning

Hongyi Wang, Kartik Sreenivasan, Shashank Rajput, Harit Vishwakarma, Saurabh Agarwal, Jy-yong Sohn, Kangwook Lee, Dimitris Papailiopoulos

retweets: 14, favorites: 39 (07/14/2020 09:16:54)
links: abs | pdf
cs.LG | cs.CR | cs.DC | stat.ML

Due to its decentralized nature, Federated Learning (FL) lends itself to adversarial attacks in the form of backdoors during training. The goal of a backdoor is to corrupt the performance of the trained model on specific sub-tasks (e.g., by classifying green cars as frogs). A range of FL backdoor attacks have been introduced in the literature, but also methods to defend against them, and it is currently an open question whether FL systems can be tailored to be robust against backdoors. In this work, we provide evidence to the contrary. We first establish that, in the general case, robustness to backdoors implies model robustness to adversarial examples, a major open problem in itself. Furthermore, detecting the presence of a backdoor in a FL model is unlikely assuming first order oracles or polynomial time. We couple our theoretical results with a new family of backdoor attacks, which we refer to as edge-case backdoors. An edge-case backdoor forces a model to misclassify on seemingly easy inputs that are however unlikely to be part of the training, or test data, i.e., they live on the tail of the input distribution. We explain how these edge-case backdoors can lead to unsavory failures and may have serious repercussions on fairness, and exhibit that with careful tuning at the side of the adversary, one can insert them across a range of machine learning tasks (e.g., image classification, OCR, text prediction, sentiment analysis).

5. A message-passing approach to epidemic tracing and mitigation with apps

Ginestra Bianconi, Hanlin Sun, Giacomo Rapisardi, Alex Arenas

retweets: 16, favorites: 36 (07/14/2020 09:16:54)
links: abs | pdf
physics.soc-ph | cond-mat.dis-nn | cond-mat.stat-mech | cs.SI

With the hit of new pandemic threats, scientific frameworks are needed to understand the unfolding of the epidemic. At the mitigation stage of the epidemics in which several countries are now, the use of mobile apps that are able to trace contacts is of utmost importance in order to control new infected cases and contain further propagation. Here we present a theoretical approach using both percolation and message—passing techniques, to the role of contact tracing, in mitigating an epidemic wave. We show how the increase of the app adoption level raises the value of the epidemic threshold, which is eventually maximized when high-degree nodes are preferentially targeted. Analytical results are compared with extensive Monte Carlo simulations showing good agreement for both homogeneous and heterogeneous networks. These results are important to quantify the level of adoption needed for contact-tracing apps to be effective in mitigating an epidemic.

Our new work "A message-passing approach to epidemic tracing and mitigation with apps" is now on the arxiv. Excellent collaboration with Hanlin Sun, Giacomo Rapisardi and @_AlexArenas . Check it out!https://t.co/9g8Di91ISx pic.twitter.com/GAgUxMudzq
— Ginestra Bianconi (@gin_bianconi) July 13, 2020

Published 14 Jul 2020

ML Lead at Beatrust. (https://beatrust.com)Tatsuya Shirakawa on Twitter