**Learning and Testing Causal Models with Interventions**

**A Practical Algorithm for Distributed Clustering and Outlier Detection**

**A Generalized Active Learning Approach for Unsupervised Anomaly Detection**

**Working Memory Networks: Augmenting Memory Networks with a Relational Reasoning Module**

**Entropy and mutual information in models of deep neural networks**

**Log Gaussian Cox Process Networks**

**Towards Robust Evaluations of Continual Learning**

**Hierarchical Clustering with Structural Constraints**

**Deep Reinforcement Learning For Sequence to Sequence Models**

• Probing entanglement in a many-body-localized system

• Stereo Magnification: Learning View Synthesis using Multiplane Images

• Quantum information measures of the one-dimensional Robin quantum well

• Adversarial Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation

• Implicit Autoencoders

• Thermodynamic properties of the one-dimensional Robin quantum well

• Meta-Gradient Reinforcement Learning

• Prediction of Autism Treatment Response from Baseline fMRI using Random Forests and Tree Bagging

• New Insights into Bootstrapping for Bandits

• Multi-Task Zipping via Layer-wise Neuron Sharing

• How Many Directions Determine a Shape and other Sufficiency Results for Two Topological Transforms

• Mining Procedures from Technical Support Documents

• Tie-Line Characteristics based Partitioning for Distributed Optimization of Power Systems

• Enumeration of border-strip decompositions

• The parallel texts of books translations in the quality evaluation of basic models and algorithms for the similarity of symbol strings

• Local SGD Converges Fast and Communicates Little

• Modular bootstrap agrees with path integral in the large moduli limit

• Automorphism groups of maps, hypermaps and dessins

• Geographical Hidden Markov Tree for Flood Extent Mapping (With Proof Appendix)

• Ultra-Reliable Communication over Arbitrarily Varying Channels under Block-Restricted Jamming

• Random Walks on Dynamical Random Environments with Non-Uniform Mixing

• Mobile Face Tracking: A Survey and Benchmark

• Semi-Random Graphs with Planted Sparse Vertex Cuts: Algorithms for Exact and Approximate Recovery

• Letting Emotions Flow: Success Prediction by Modeling the Flow of Emotions in Books

• A Simple Proof of the DPRZ-Theorem for 2D Cover Times

• Impact of delayed acceleration feedback on the classical car-following model

• The detection of professional fraud in automobile insurance using social network analysis

• On the spectral structure of Jordan-Kronecker products of symmetric and skew-symmetric matrices

• Estimating Population Average Causal Effects in the Presence of Non-Overlap: A Bayesian Approach

• Johnson-Mehl Cell-based Analysis of UL Cellular Network with Coupled User and BS Locations

• Rainbow fractional matchings

• Image-to-image translation for cross-domain disentanglement

• One dimensional critical Kinetic Fokker-Planck equations, Bessel and stable processes

• Phase Diagram of Quantum Hall Breakdown and Non-linear Phenomena for InGaAs/InP Quantum Wells

• Computing the resolvent of the sum of maximally monotone operators with the averaged alternating modified reflections algorithm

• Learning convex polytopes with margin

• Rare slips in fluctuating synchronized oscillator networks

• Learning Classifiers with Fenchel-Young Losses: Generalized Entropies, Margins, and Algorithms

• Minimum Information Exchange in Distributed Systems

• Autonomously and Simultaneously Refining Deep Neural Network Parameters by Generative Adversarial Networks

• Triangle-factors in pseudorandom graphs

• Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation

• No More Differentiator in PID:Development of Nonlinear Lead for Precision Mechatronics

• R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering

• Convex method for selection of fixed effects in high-dimensional linear mixed models

• Been There, Done That: Meta-Learning with Episodic Recall

• Forming IDEAS Interactive Data Exploration & Analysis System

• LF-Net: Learning Local Features from Images

• On the sum of $k$-th largest distance eigenvalues of graphs

• Multivariate Convolutional Sparse Coding for Electromagnetic Brain Signals

• Uncertainty-Aware Attention for Reliable Interpretation and Prediction

• Stochastic integration and differential equations for typical paths

• Nonlinear Acceleration of Deep Neural Networks

• Decentralized MPC based Obstacle Avoidance for Multi-Robot Target Tracking Scenarios

• Reliable Dispatch of Renewable Generation via Charging of Dynamic PEV Populations

• Eternal dominating sets on digraphs and orientations of graphs

• SOSELETO: A Unified Approach to Transfer Learning and Training with Noisy Labels

• Backpropagation with N-D Vector-Valued Neurons Using Arbitrary Bilinear Products

• Optimal pricing for a peer-to-peer sharing platform under network externalities

• A0C: Alpha Zero in Continuous Action Space

• Non-Preemptive Flow-Time Minimization via Rejections

• On interrelations between strongly, weakly and chord separated set-systems (a geometric approach)

• Multi-Scale DenseNet-Based Electricity Theft Detection

• Native Language Cognate Effects on Second Language Lexical Choice

• Computing the Star Chromatic Index of Every Tree in Polynomial Time

• Residual Networks as Geodesic Flows of Diffeomorphisms

• Vehicular Communication Networks in Automated Driving Era

• Model-based inference of conditional extreme value distributions with hydrological applications

• Coarse-to-fine Seam Estimation for Image Stitching

• Primal-Dual Wasserstein GAN

• Hawkes Process Kernel Structure Parametric Search with Renormalization Factors

• A Unified Probabilistic Model for Learning Latent Factors and Their Connectivities from High-Dimensional Data

• WSD-algorithm based on new method of vector-word contexts proximity calculation via epsilon-filtration

• A Hybrid Approach to Music Playlist Continuation Based on Playlist-Song Membership

• Phase Retrieval via Polytope Optimization: Geometry, Phase Transitions, and New Algorithms

• Hierarchical burst model for complex bursty dynamics

• Martin boundaries of the duals of free unitary quantum groups

• Finite Blocklength Communications in Smart Grids for Dynamic Spectrum Access and Locally Licensed Scenarios

• Interpretable and Compositional Relation Learning by Joint Training with an Autoencoder

• On the Global Convergence of Gradient Descent for Over-parameterized Models using Optimal Transport

• An Accurate Data Cleaning Procedure for Electron Cyclotron Emission Imaging on EAST Tokamak Based on Methodology of Machine Learning

• Cameron-Liebler sets of k-spaces in PG(n,q)

• An optimal bound on the solution sets of one-variable word equations and its consequences

• Entropy Productions and Their Mathematical Representations: Clausius’ vs. Kelvin’s Views of the Second Law and Irreversibility

• Stable specification search in structural equation model with latent variables

• AVID: Adversarial Visual Irregularity Detection

• Upper Bounds for Ordered Ramsey Numbers of Graphs on Four Vertices

• Homfly polynomials for periodic knots via state model

• Stable Super-Resolution of Images

• You Only Look Twice: Rapid Multi-Scale Object Detection In Satellite Imagery

• Estimating Carotid Pulse and Breathing Rate from Near-infrared Video of the Neck

• A small-world search for quantum speedup: How small-world interactions can lead to improved quantum annealer designs

• Kernel-estimated Nonparametric Overlap-Based Syncytial Clustering

• AutoAugment: Learning Augmentation Policies from Data

• Effective intervals and regular Dirichlet subspaces

• A network biology-based approach to evaluating the effect of environmental contaminants on human interactome and diseases

• Intelligent Trainer for Model-Based Reinforcement Learning

• A data-independent distance to infeasibility for linear conic systems

• Multi-Level Deep Cascade Trees for Conversion Rate Prediction

• Optimal Algorithms for Continuous Non-monotone Submodular and DR-Submodular Maximization

• VisualBackProp for learning using privileged information with CNNs

• Deploy Large-Scale Deep Neural Networks in Resource Constrained IoT Devices with Local Quantization Region

• Taming Convergence for Asynchronous Stochastic Gradient Descent with Unbounded Delay in Non-Convex Learning

• Local structure of multi-dimensional martingale optimal transport

• Bayesian predictive densities as an interpretation of a class of Skew–Student $t$ distributions with application to medical data

• Log-Sobolev-type inequalities for solutions to stationary Fokker-Planck-Kolmogorov equations

• Energy Efficient Delay Sensitive Optimization in SWIPT-MIMO

• Simple and practical algorithms for $\ell_p$-norm low-rank approximation

• On the SINR Distribution of SWIPT MU-MIMO with Antenna Selection

• Complex Relations in a Deep Structured Prediction Model for Fine Image Segmentation

• Evading the Adversary in Invariant Representation

• Solving Large-Scale Optimization Problems with a Convergence Rate Independent of Grid Size

• Large Data and Zero Noise Limits of Graph-Based Semi-Supervised Learning Algorithms

• Euclidean Embedding of the Poisson Weighted Infinite Tree and Application to Mobility Models

• Incomplete Nested Dissection

• Implicit Language Model in LSTM for OCR

• Modeling Interpersonal Influence of Verbal Behavior in Couples Therapy Dyadic Interactions

• A Two-Stage Subspace Trust Region Approach for Deep Neural Network Training

• Recursive functions on conditional Galton–Watson trees

• Optimal Hashing in External Memory

• Use of symmetric kernels for convolutional neural networks

• Statistical properties of lambda terms

• Diffractive electron-nucleus scattering and ancestry in branching random walks

• Adaptive Stochastic Gradient Langevin Dynamics: Taming Convergence and Saddle Point Escape Time

• Bayesian method for inferring the impact of geographical distance on intensity of communication

• Robust one-bit compressed sensing with non-Gaussian measurements

• Non-convex non-local flows for saliency detection

• Scalable Bayesian Learning for State Space Models using Variational Inference with SMC Samplers

• A Projection Approach to Equality Constrained Iterative Linear Quadratic Optimal Control

• A hybrid approach of interpolations and CNN to obtain super-resolution

• The 2d-directed spanning forest converges to the Brownian web

• Identification in Nonparametric Models for Dynamic Treatment Effects

• Douglas-Rachford splitting for a Lipschitz continuous and a strongly monotone operator

• Coloring general Kneser graphs and hypergraphs via high-discrepancy hypergraphs

• pMSE Mechanism: Differentially Private Synthetic Data with Maximal Distributional Similarity

• Classifying cooking object’s state using a tuned VGG convolutional neural network

• Embedding Syntax and Semantics of Prepositions via Tensor Decomposition

• Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator

• Predictive Local Smoothness for Stochastic Gradient Methods

• Anonymizing k-Facial Attributes via Adversarial Perturbations

• Convolutional Polar Codes on Channels with Memory

• Towards Robust Training of Neural Networks by Regularizing Adversarial Gradients

• Cumulative subtraction games

• Intriguing maximally monotone operators derived from nonsunny nonexpansive retractions

• Semi-supervised classification by reaching consensus among modalities

• Learning Contextual Bandits in a Non-stationary Environment

• On a lower bound for the eccentric connectivity index of graphs

• Deep Reinforcement Learning of Marked Temporal Point Processes

• Network topology near criticality in adaptive epidemics

• Scoring Lexical Entailment with a Supervised Directional Similarity Network

• On the Skitovich-Darmois theorem for some locally compact Abelian groups

• An infinite-server queueing model MMAPkGk in semi-Markov random environment with marked MAP arrival and subject to catastrophes

• Infinite-server queueing model with MAPkGk Markov arrival streams, random volume of customers in random environment subject to catastrophe

• The Thickness of K_1,n,n and K_2,n,n

• Phocas: dimensional Byzantine-resilient stochastic gradient descent

• A New Approach for 4DVar Data Assimilation

• Duadic negacyclic codes over a finite non-chain ring and their Gray images

• GraphChallenge.org: Raising the Bar on Graph Analytic Performance

• A D-vine copula mixed model for joint meta-analysis and comparison of diagnostic tests

• First-Hitting Times Under Additive Drift

• Optimizing state change detection in functional temporal networks through dynamic community detection

• Learning compositionally through attentive guidance

• Global-Locally Self-Attentive Dialogue State Tracker

• Partial Cartesian Graph Product

• DINFRA: A One Stop Shop for Computing Multilingual Semantic Relatedness

• Corpus Conversion Service: A machine learning platform to ingest documents at scale [Poster abstract]