**Input Fast-Forwarding for Better Deep Learning**

**Selective Classification for Deep Neural Networks**

**Interpreting Blackbox Models via Model Extraction**

**An effective algorithm for hyperparameter optimization of neural networks**

**Causal inference for social network data**

**Grounded Recurrent Neural Networks**

**Towards Interrogating Discriminative Machine Learning Models**

**MMD GAN: Towards Deeper Understanding of Moment Matching Network**

**Nonparametric Preference Completion**

**Deep Rotation Equivariant Network**

**Fast-Slow Recurrent Neural Networks**

**Learning with Average Top-k Loss**

• Sequential noise-induced escapes for oscillatory network dynamics

• Weakly-normal basis vector fields in RKHS with an application to shape Newton methods

• The Benefit of Being Flexible in Distributed Computation

• Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation

• Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit

• Bayesian Pool-based Active Learning with Abstention Feedbacks

• Second-Order Word Embeddings from Nearest Neighbor Topological Features

• Uplift Modeling with Multiple Treatments and General Response Types

• A study on exponential-size neighborhoods for the bin packing problem with conflicts

• Clinical Intervention Prediction and Understanding using Deep Networks

• Predictive Analytics for Enhancing Travel Time Estimation in Navigation Apps of Apple, Google, and Microsoft

• Discontinuous Hamiltonian Monte Carlo for sampling discrete parameters

• Designs for estimating the treatment effect in networks with interference

• Data-driven Random Fourier Features using Stein Effect

• Model-free causal inference of binary experimental data

• Convolution estimates and number of disjoint partitions

• Statistical Convergence Analysis of Gradient EM on General Gaussian Mixture Models

• Conscious and controlling elements in combinatorial group testing problems with more defectives

• Critical two-point function for long-range $O(n)$ models below the upper critical dimension

• Self-Organized Supercriticality and Oscillations in Networks of Stochastic Spiking Neurons

• Deep Multi-instance Networks with Sparse Label Assignment for Whole Mammogram Classification

• Safe Model-based Reinforcement Learning with Stability Guarantees

• Random ordering formula for sofic and Rokhlin entropy of Gibbs measures

• Hashing as Tie-Aware Learning to Rank

• Simple Pricing Schemes for the Cloud

• Joint Rate Control and Power Allocation for Non-Orthogonal Multiple Access Systems

• Flexible Cache-Aided Networks with Backhauling

• Exact Recovery of Number of Blocks in Blockmodels

• On the multiply robust estimation of the mean of the g-functional

• Sequence Summarization Using Order-constrained Kernelized Feature Subspaces

• Generative Model with Coordinate Metric Learning for Object Recognition Based on 3D Models

• Sufficient conditions for the existence of a path-factor which are related to odd components

• Deep Learning Improves Template Matching by Normalized Cross Correlation

• Journalists’ information needs, seeking behavior, and its determinants on social media

• Substitution invariant Sturmian words and binary trees

• Fully reliable error control for evolutionary problems

• Which bridge estimator is optimal for variable selection?

• Multi-Task Learning for Contextual Bandits

• Dictionary-based Monitoring of Premature Ventricular Contractions: An Ultra-Low-Cost Point-of-Care Service

• Robust Data Geometric Structure Aligned Close yet Discriminative Domain Adaptation

• VANETs Meet Autonomous Vehicles: A Multimodal 3D Environment Learning Approach

• On Using Time Without Clocks via Zigzag Causality

• Self-supervised learning of visual features through embedding images into text topic spaces

• Representing the suffix tree with the CDAWG

• Higher order Cheeger inequalities for Steklov eigenvalues

• On the Success Probability of Decoding (Partial) Unit Memory Codes

• Restriction of odd degree characters of $\mathfrak{S}_n$

• Efficient Covariance Approximations for Large Sparse Precision Matrices

• Combinatorial n-fold Integer Programming and Applications

• Towards Understanding the Invertibility of Convolutional Neural Networks

• Bayesian Compression for Deep Learning

• Packing parameters in graphs: New bounds and a solution to an open problem

• Alliance formation with exclusion in the spatial public goods game

• Stochastic decomposition applied to large-scale hydro valleys management

• Daisy cubes and distance cube polynomial

• On the Möbius Function and Topology of General Pattern Posets

• On The Fixatic Number of Graphs

• Continual Learning with Deep Generative Replay

• Stochastic Sequential Neural Networks with Structured Inference

• Tree-Structured Modelling of Varying Coefficients

• The de Bruijn-Erdös-Hanani theorem

• Inclusive Flavour Tagging Algorithm

• V2X Meets NOMA: Non-Orthogonal Multiple Access for 5G Enabled Vehicular Networks

• Non-orthogonal Multiple Access for High-reliable and Low-latency V2X Communications in 5G Systems

• An experimental study of graph-based semi-supervised classification with additional node information

• Open-Category Classification by Adversarial Sample Generation

• Hajós’ cycle conjecture for small graphs

• Weighted Poisson-Delaunay Mosaics

• Non-Stationary Spectral Kernels

• Efficient algorithm for large spectral partitions

• A counterexample to Comon’s conjecture

• Train longer, generalize better: closing the generalization gap in large batch training of neural networks

• A causal approach to analysis of censored medical costs in the presence of time-varying treatment

• A.Ya. Khintchine’s Work in Probability Theory

• Speeding up Dynamic Programming on DAGs through a Fast Approximation of Path Cover

• Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-in-the-Blank Image Captioning

• Small Sets with Large Difference Sets

• Adaptive Detrending to Accelerate Convolutional Gated Recurrent Unit Training for Contextual Video Recognition

• Three observations on spectra of zero-nonzero patterns

• Threshold functions for small subgraphs: an analytic approach

• A Two-Level Graph Partitioning Problem Arising in Mobile Wireless Communications

• Group divisible (K_4-e)-packings with any minimum leave

• Optimization of the Jaccard index for image segmentation with the Lovász hinge

• STFT with Adaptive Window Width Based on the Chirp Rate

• Continuous testing for Poisson process intensities: A new perspective on scanning statistics

• Characterizing path-like trees from linear configurations

• Beyond Parity: Fairness Objectives for Collaborative Filtering

• A Bayesian Mallows approach to non-transitive pair comparison data: how human are sounds?

• When Will AI Exceed Human Performance? Evidence from AI Experts

• Boundary Crossing Probabilities for General Exponential Families

• Power Systems Data Fusion based on Belief Propagation

• Matrix-product structure of repeated-root constacyclic codes over finite fields

• Causal Effect Inference with Deep Latent-Variable Models

• From source to target and back: symmetric bi-directional adaptive GAN

• Deep Investigation of Cross-Language Plagiarism Detection Methods

• Transition to Shock Fluctuations in TASEP and Last Passage Percolation

• Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations

• Parsing with CYK over Distributed Representations: ‘Classical’ Syntactic Parsing in the Novel Era of Neural Networks

• How a General-Purpose Commonsense Ontology can Improve Performance of Learning-Based Image Retrieval

• Joint Distribution Optimal Transportation for Domain Adaptation

• Improved Semi-supervised Learning with GANs using Manifold Invariances

• Perturbation of Conservation Laws and Averaging on Manifolds

• Audio-replay attack detection countermeasures

• More Circulant Graphs exhibiting Pretty Good State Transfer

• Anti-spoofing Methods for Automatic Speaker Verification System

• Transport and optics at the node in a nodal loop semimetal

• Flow-GAN: Bridging implicit and prescribed learning in generative models

• Quantum Channel Capacities Per Unit Cost

• Sharp threshold for $K_4$-percolation

• Modeling flow in porous media with double porosity/permeability: A stabilized mixed formulation, error analysis, and numerical solutions

• Linearizable Iterators for Concurrent Data Structures