**GIANT: Globally Improved Approximate Newton Method for Distributed Optimization**

**KnowNER: Incremental Multilingual Knowledge in Named Entity Recognition**

**Learning Graph Topological Features via GAN**

**Anomaly Detection in Hierarchical Data Streams under Unknown Models**

**Simultaneous Causal Inference and Record Linkage**

**Joint Dictionaries for Zero-Shot Learning**

**Amplifying Inter-message Distance: On Information Divergence Measures in Big Data**

**RRA: Recurrent Residual Attention for Sequence Learning**

**Adaptive Graph Signal Processing: Algorithms and Optimal Sampling Strategies**

**OpenNMT: Open-source Toolkit for Neural Machine Translation**

**Dual Discriminator Generative Adversarial Nets**

**A Tutorial on Statistically Sound Pattern Discovery**

• Energy Harvesting Communications under Explicit and Implicit Temperature Constraints

• The sign phase transition in the problem of interfering directed paths

• Multi-Level Spherical Locality Sensitive Hashing For Approximate Near Neighbors

• Recovering Homography from Camera Captured Documents using Convolutional Neural Networks

• Rates of linear codes with low decoding error probability

• Robust period estimation using mutual information for multi-band light curves in the synoptic survey era

• Exploring Geometric Property Thresholds For Filtering Non-Text Regions In A Connected Component Based Text Detection Application

• On infinite multiplicative Sidon sets

• Extracting Traffic Primitives Directly from Naturalistically Logged Data for Self-Driving Applications

• A general class of quasi-independence tests for left-truncated right-censored data

• False arrhythmia alarm reduction in the intensive care unit

• The Importance of Being Clustered: Uncluttering the Trends of Statistics from 1970 to 2015

• Importance Sketching of Influence Dynamics in Billion-scale Networks

• A KL-LUCB Bandit Algorithm for Large-Scale Crowdsourcing

• Real-Time Multiple Object Tracking – A Study on the Importance of Speed

• Art of singular vectors and universal adversarial perturbations

• On the definition of Shape Parts: a Dominant Sets Approach

• A New Perspective on the Average Mixing Matrix

• Lower Bound for Randomized First Order Convex Optimization

• Enumerating kth Roots in the Symmetric Inverse Monoid

• Efficient generation of series expansions for $\pm J$ Ising spin-glasses in a classical or a quantum (transverse) field

• Profile of a self-similar growth-fragmentation

• Holistic, Instance-Level Human Parsing

• Manifold Learning Using Kernel Density Estimation and Local Principal Components Analysis

• A Broad Learning Approach for Context-Aware Mobile Application Recommendation

• Budgeted Experiment Design for Causal Structure Learning

• What were you expecting? Using Expectancy Features to Predict Expressive Performances of Classical Piano Music

• Capturing Long-range Contextual Dependencies with Memory-enhanced Conditional Random Fields

• Multi-Agent Discrete Search with Limited Visibility

• Identifying Genetic Risk Factors via Sparse Group Lasso with Group Graph Structure

• Properties of optimal paths in first passage percolation

• Anti-Makeup: Learning A Bi-Level Adversarial Network for Makeup-Invariant Face Verification

• Learning Gating ConvNet for the two-stream based methods in action recognition

• Joint Adaptive Neighbours and Metric Learning for Multi-view Subspace Clustering

• Uniform Concentration of the Loss Estimator for Neural DUDE

• End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks

• Multi-view Graph Embedding with Hub Detection for Brain Network Analysis

• Enumerating Hassett’s wall and chamber decomposition of the moduli space of weighted stable curves

• Small-footprint Keyword Spotting Using Deep Neural Network and Connectionist Temporal Classifier

• Branch-and-bound for biobjective mixed integer programming

• Community Recovery in Hypergraphs

• Rapid Near-Neighbor Interaction of High-dimensional Data via Hierarchical Clustering

• Adversarial Discriminative Heterogeneous Face Recognition

• Maximal independent sets on a grid graph

• A Practically Competitive and Provably Consistent Algorithm for Uplift Modeling

• Optimal On The Fly Index Selection in Polynomial Time

• Generalized Permutohedra, Scattering Amplitudes, and a Cubic Three-Fold

• Automatic Ground Truths: Projected Image Annotations for Omnidirectional Vision

• Reversible Architectures for Arbitrarily Deep Residual Neural Networks

• Cross-validation improved by aggregation: Agghoo

• Limit laws for the diameter of a set of random points from a distribution supported by a smoothly bounded set

• PQk-means: Billion-scale Clustering for Product-quantized Codes

• OCCAM: a flexible, multi-purpose and extendable HPC cluster

• A low cost non-wearable gaze detection system based on infrared image processing

• The survival probability of the high-dimensional contact process with random vertex weights on the oriented lattice

• Interpreting Shared Deep Learning Models via Explicable Boundary Trees

• Using the Data Agreement Criterion to Rank Experts’ Beliefs

• Construction of Latent Descriptor Space and Inference Model of Hand-Object Interactions

• Learning Graph-Level Representation for Drug Discovery

• Dependencies: Formalising Semantic Catenae for Information Retrieval

• Hybrid High-Order methods for finite deformations of hyperelastic materials

• Deep Mean-Shift Priors for Image Restoration

• AR(1) sequence with random coefficients: Regenerative properties and its application

• Transform Invariant Auto-encoder

• Cross-lingual Word Segmentation and Morpheme Segmentation as Sequence Labelling

• Language Models of Spoken Dutch

• Reliability constrained least-cost generation expansion planning using Dynamic Programming: an isolated mini-grid in KSA

• Efficient Online Surface Correction for Real-time Large-Scale 3D Reconstruction

• Characterizations of o-polynomials by the Walsh transform

• Parallel Work Inflation, Memory Effects, and their Empirical Analysis

• Learning with Bounded Instance- and Label-dependent Label Noise

• A probabilistic proof of the Gauss-Bonnet formula for manifolds with boundary

• Recurrence region of multiuser Aloha

• Forbidden triads and Creative Success in Jazz: The Miles Davis Factor

• Sparse Representation Based Augmented Multinomial Logistic Extreme Learning Machine with Weighted Composite Features for Spectral Spatial Hyperspectral Image Classification

• Opportunistic Self Organizing Migrating Algorithm for Real-Time Dynamic Traveling Salesman Problem

• An estimator of the stable tail dependence function based on the empirical beta copula

• The asymptotic distribution of the isotonic regression estimator over a countable pre-ordered set

• Strichartz and local smoothing estimates for stochastic dispersive equations with linear multiplicative noise

• SYSTRAN Purely Neural MT Engines for WMT2017

• Emotion Recognition in the Wild using Deep Neural Networks and Bayesian Classifiers

• Bethe states of random factor graphs

• On linear ternary Intersection sequences and their properties

• Stanley-Reisner rings for quasi-arithmetic matroids

• Non-Gaussian limit of a tracer motion in an incompressible flow

• ExprGAN: Facial Expression Editing with Controllable Expression Intensity

• Observational Equivalence in System Estimation: Contractions in Complex Networks

• Spatio-temporal Learning with Arrays of Analog Nanosynapses

• A Deep Cascade Network for Unaligned Face Attribute Classification

• Imitation Learning for Vision-based Lane Keeping Assistance

• Meta-QSAR: a large-scale application of meta-learning to drug design and discovery

• Distributed Estimation Recovery under Sensor Failure

• StarSpace: Embed All The Things!

• Translations on graphs with neighborhood preservation

• Personalizing Path-Specific Effects

• S-trees

• Adaptive Modulation and Coding and Cooperative ARQ in a Cognitive Radio System

• A 1.371 Approximation Algorithm for the Steiner Tree Problem

• On Exchangeability in Network Models

• High-Dimensional Dependency Structure Learning for Physical Processes

• Sampling formulas involving differences in shift-invariant subspaces: a unified approach

• On the Benefits of Surrogate Lagrangians in Optimal Control and Planning Algorithms

• Local resilience of an almost spanning $k$-cycle in random graphs

• Weighted Message Passing and Minimum Energy Flow for Heterogeneous Stochastic Block Models with Side Information

• A new family of MRD codes in $\mathbb F_q^{2n\times2n}$ with right and middle nuclei $\mathbb F_{q^n}$

• Specious rules: an efficient and effective unifying method for removing misleading and uninformative patterns in association rule mining

• Image Matching Benchmark

• End-to-End United Video Dehazing and Detection

• Human Associations Help to Detect Conventionalized Multiword Expressions

• Certified Computation in Crowdsourcing

• The 4-girth-thickness of the complete multipartite graph

• Hash Embeddings for Efficient Word Representations

• Determining Generic Point Configurations From Unlabeled Path or Loop Lengths

• On separability of Schur rings over abelian p-groups

• Deep Reinforcement Learning with Surrogate Agent-Environment Interface

• Model-free Envelope Dimension Selection

• Multimodal Content Analysis for Effective Advertisements on YouTube

• Skyline Queries in O(1) time?

• A first-order splitting method for solving large scale composite convex optimization problem

• An Online Optimization Algorithm for Alleviating Contingencies in Meshed Networks

• Unsupervised Deep Homography: A Fast and Robust Homography Estimation Model

• Affective Neural Response Generation

• Explore, Exploit or Listen: Combining Human Feedback and Policy Model to Speed up Deep Reinforcement Learning in 3D Worlds

• Combinatorics of cyclic shifts in plactic, hypoplactic, sylvester, Baxter, and related monoids