Hot Papers 2020-07-30

1. SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation

Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao

retweets: 41, favorites: 153 (07/31/2020 08:44:02)
links: abs | pdf
cs.CV

Single-stage instance segmentation approaches have recently gained popularity due to their speed and simplicity, but are still lagging behind in accuracy, compared to two-stage methods. We propose a fast single-stage instance segmentation method, called SipMask, that preserves instance-specific spatial information by separating mask prediction of an instance to different sub-regions of a detected bounding-box. Our main contribution is a novel light-weight spatial preservation (SP) module that generates a separate set of spatial coefficients for each sub-region within a bounding-box, leading to improved mask predictions. It also enables accurate delineation of spatially adjacent instances. Further, we introduce a mask alignment weighting loss and a feature alignment scheme to better correlate mask prediction with object detection. On COCO test-dev, our SipMask outperforms the existing single-stage methods. Compared to the state-of-the-art single-stage TensorMask, SipMask obtains an absolute gain of 1.0% (mask AP), while providing a four-fold speedup. In terms of real-time capabilities, SipMask outperforms YOLACT with an absolute gain of 3.0% (mask AP) under similar settings, while operating at comparable speed on a Titan Xp. We also evaluate our SipMask for real-time video instance segmentation, achieving promising results on YouTube-VIS dataset. The source code is available at https://github.com/JialeCao001/SipMask.

SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation
pdf: https://t.co/Ec1bLPKFxp
abs: https://t.co/Ry6YGHkKpi
github: https://t.co/ueoD2OuxDX pic.twitter.com/dTZ118PNMb
— AK (@ak92501) July 30, 2020

2. Biomedical and Clinical English Model Packages in the Stanza Python NLP Library

Yuhao Zhang, Yuhui Zhang, Peng Qi, Christopher D. Manning, Curtis P. Langlotz

retweets: 36, favorites: 134 (07/31/2020 08:44:03)
links: abs | pdf
cs.CL

We introduce biomedical and clinical English model packages for the Stanza Python NLP library. These packages offer accurate syntactic analysis and named entity recognition capabilities for biomedical and clinical text, by combining Stanza’s fully neural architecture with a wide variety of open datasets as well as large-scale unsupervised biomedical and clinical text data. We show via extensive experiments that our packages achieve syntactic analysis and named entity recognition performance that is on par with or surpasses state-of-the-art results. We further show that these models do not compromise speed compared to existing toolkits when GPU acceleration is available, and are made easy to download and use with Stanza’s Python interface. A demonstration of our packages is available at: http://stanza.run/bio.

🆕 We’ve extended Stanza with first domain-specific #NLProc models for biomedical and clinical medical English. They range from approaching to significantly improving state of the art results on syntactic and NER tasks. Demo https://t.co/FSduMn1hZp Paper https://t.co/VuK5ZriYEl pic.twitter.com/NmHX8Wzah6
— Stanford NLP Group (@stanfordnlp) July 30, 2020

👋 Introducing biomedical & clinical model packages for the Stanza #NLProc toolkit, including:

- 2 bio UD syntactic analysis pipelines
- 1 clinical UD syntactic pipeline
- 8 bio NER models
- 2 clinical NER models

📖 Paper: https://t.co/NeyvpLiWuW

Highlights in threads... 1/5 pic.twitter.com/n7borB3eb6
— Yuhao Zhang (@yuhaozhangx) July 30, 2020

3. Towards Ecologically Valid Research on Language User Interfaces

Harm de Vries, Dzmitry Bahdanau, Christopher Manning

retweets: 36, favorites: 133 (07/31/2020 08:44:03)
links: abs | pdf
cs.CL

Language User Interfaces (LUIs) could improve human-machine interaction for a wide variety of tasks, such as playing music, getting insights from databases, or instructing domestic robots. In contrast to traditional hand-crafted approaches, recent work attempts to build LUIs in a data-driven way using modern deep learning methods. To satisfy the data needs of such learning algorithms, researchers have constructed benchmarks that emphasize the quantity of collected data at the cost of its naturalness and relevance to real-world LUI use cases. As a consequence, research findings on such benchmarks might not be relevant for developing practical LUIs. The goal of this paper is to bootstrap the discussion around this issue, which we refer to as the benchmarks’ low ecological validity. To this end, we describe what we deem an ideal methodology for machine learning research on LUIs and categorize five common ways in which recent benchmarks deviate from it. We give concrete examples of the five kinds of deviations and their consequences. Lastly, we offer a number of recommendations as to how to increase the ecological validity of machine learning research on LUIs.

The need for open data & benchmarks in modern ML research has led to an outpouring of #NLProc data creation. But @harm_devries, @DBahdanau & I suggest the low ecological validity of most of this data undermines the resulting research. Comments welcome! https://t.co/scSc2c6Flq pic.twitter.com/9Lg8NJLxZT
— Christopher Manning (@chrmanning) July 30, 2020

4. On the Quantum versus Classical Learnability of Discrete Distributions

Ryan Sweke, Jean-Pierre Seifert, Dominik Hangleiter, Jens Eisert

retweets: 14, favorites: 97 (07/31/2020 08:44:03)
links: abs | pdf
quant-ph | cs.LG

Here we study the comparative power of classical and quantum learners for generative modelling within the Probably Approximately Correct (PAC) framework. More specifically we consider the following task: Given samples from some unknown discrete probability distribution, output with high probability an efficient algorithm for generating new samples from a good approximation of the original distribution. Our primary result is the explicit construction of a class of discrete probability distributions which, under the decisional Diffie-Hellman assumption, is provably not efficiently PAC learnable by a classical generative modelling algorithm, but for which we construct an efficient quantum learner. This class of distributions therefore provides a concrete example of a generative modelling problem for which quantum learners exhibit a provable advantage over classical learning algorithms. In addition, we discuss techniques for proving classical generative modelling hardness results, as well as the relationship between the PAC learnability of Boolean functions and the PAC learnability of discrete probability distributions.

Quantum versus classical learnability of distributions, featuring a generative modelling problem for which quantum learners exhibit a provable advantage over classical learning algorithms.https://t.co/52HkUOEMC3 pic.twitter.com/7uEdpSBBHQ
— Jens Eisert (@jenseisert) July 30, 2020

5. Translate the Facial Regions You Like Using Region-Wise Normalization

Wenshuang Liu, Wenting Chen, Linlin Shen

retweets: 12, favorites: 50 (07/31/2020 08:44:03)
links: abs | pdf
cs.CV

Though GAN (Generative Adversarial Networks) based technique has greatly advanced the performance of image synthesis and face translation, only few works available in literature provide region based style encoding and translation. We propose in this paper a region-wise normalization framework, for region level face translation. While per-region style is encoded using available approach, we build a so called RIN (region-wise normalization) block to individually inject the styles into per-region feature maps and then fuse them for following convolution and upsampling. Both shape and texture of different regions can thus be translated to various target styles. A region matching loss has also been proposed to significantly reduce the inference between regions during the translation process. Extensive experiments on three publicly available datasets, i.e. Morph, RaFD and CelebAMask-HQ, suggest that our approach demonstrate a large improvement over state-of-the-art methods like StarGAN, SEAN and FUNIT. Our approach has further advantages in precise control of the regions to be translated. As a result, region level expression changes and step by step make up can be achieved. The video demo is available at https://youtu.be/ceRqsbzXAfk.

Translate the Facial Regions You Like Using Region-Wise
Normalization
pdf: https://t.co/QZVjz9HIMQ
abs: https://t.co/78zFHn3QY4 pic.twitter.com/lWnMgjSIGO
— AK (@ak92501) July 30, 2020

6. Visual Reasoning Strategies and Satisficing: How Uncertainty Visualization Design Impacts Effect Size Judgments and Decisions

Alex Kale, Matthew Kay, Jessica Hullman

retweets: 10, favorites: 47 (07/31/2020 08:44:03)
links: abs | pdf
cs.HC

Uncertainty visualizations often emphasize point estimates to support magnitude estimates or decisions through visual comparison. However, when design choices emphasize means, users may overlook uncertainty information and misinterpret visual distance as a proxy for effect size. We present findings from a mixed design experiment on Mechanical Turk which tests eight uncertainty visualization designs: 95% containment intervals, hypothetical outcome plots, densities, and quantile dotplots, each with and without means added. We find that adding means to uncertainty visualizations has small biasing effects on both magnitude estimation and decision-making, consistent with discounting uncertainty. We also see that visualization designs that support the least biased effect size estimation do not support the best decision-making, suggesting that a chart user’s sense of effect size may not necessarily be identical when they use the same information for different tasks. In a qualitative analysis of users’ strategy descriptions, we find that many users switch strategies and do not employ an optimal strategy when one exists. Uncertainty visualizations which are optimally designed in theory may not be the most effective in practice because of the ways that users satisfice with heuristics, suggesting opportunities to better understand visualization effectiveness by modeling sets of potential strategies.

We’re excited to share our preprint, “Visual Reasoning Strategies and Satisficing: How Uncertainty Visualization Design Impacts Effect Size Judgments and Decisions”, with @JessicaHullman & @mjskay

Details in the thread! (1/n)https://t.co/T2vqM0USQ4
— Alex Kale (@AlexKale17) July 30, 2020

Published 31 Jul 2020

ML Lead at Beatrust. (https://beatrust.com)Tatsuya Shirakawa on Twitter