**Identity and Granularity of Events in Text**

**Stochastic Gradient Descent as Approximate Bayesian Inference**

**Deep API Programmer: Learning to Program with APIs**

**Cross-media Similarity Metric Learning with Unified Deep Networks**

**Graphical Models: An Extension to Random Graphs, Trees, and Other Objects**

**The Entropy of Backwards Analysis**

**Task-Oriented Query Reformulation with Reinforcement Learning**

**NEXT: A Neural Network Framework for Next POI Recommendation**

**Machine Learning and the Future of Realism**

• General three and four person two color Hat Game

• Parameterized Complexity and Approximability of Directed Odd Cycle Transversal

• Visual Recognition of Paper Analytical Device Images for Detection of Falsified Pharmaceuticals

• Odd holes in bull-free graphs

• How Much Spectrum is Too Much in Millimeter Wave Wireless Access

• Gbps User Rates Using mmWave Relayed Backhaul with High Gain Antennas

• Moment-based parameter estimation in binomial random intersection graph models

• Applying High-Resolution Visible Imagery to Satellite Melt Pond Fraction Retrieval: A Neural Network Approach

• Projection Free Rank-Drop Steps

• Passing through a stack $k$ times

• Diffusion on graphs is eventually periodic

• FastVentricle: Cardiac Segmentation with ENet

• On the local times of stationary processes with conditional local limit theorems

• Stochastic six-vertex model in a half-quadrant and half-line open ASEP

• CBinfer: Change-Based Inference for Convolutional Neural Networks on Video Data

• Information Criterion for Minimum Cross-Entropy Model Selection

• Dataset Augmentation for Pose and Lighting Invariant Face Recognition

• Point Sweep Coverage on Path

• An entity-driven recursive neural network model for chinese discourse coherence modeling

• Environment-Independent Task Specifications via GLTL

• Learning-based Robust Optimization: Procedures and Statistical Guarantees

• The de Bruijn-Erdös theorem in incidence geometry via Ph. Hall’s marriage theorem

• Exploiting Cross-Sentence Context for Neural Machine Translation

• Inferences on the acquisition of multidrug resistance in \emph{Mycobacterium tuberculosis} using molecular epidemiological data

• Skewing Methods for Variance-Stabilizing Local Linear Regression Estimation

• Camera Calibration by Global Constraints on the Motion of Silhouettes

• Fast Monte Carlo Algorithms for Tensor Operations

• Limited Feedback in Single and Multi-user MIMO Systems with Finite-Bit ADCs

• Runtime Analysis of the $(1+(λ,λ))$ Genetic Algorithm on Random Satisfiable 3-CNF Formulas

• Quantum Biometrics with Retinal Photon Counting

• Get To The Point: Summarization with Pointer-Generator Networks

• Fast Similarity Sketching

• HPTT: A High-Performance Tensor Transposition C++ Library

• Non-parametric Estimation of Stochastic Differential Equations with Sparse Gaussian Processes

• Sparse-Based Estimation Performance for Partially Known Overcomplete Large-Systems

• Records in Fractal Stochastic Processes

• Ultrafast photonic reinforcement learning based on laser chaos

• On the connectivity of the hyperbolicity region of irreducible polynomials

• Encoding Cardinality Constraints using Generalized Selection Networks

• Track selection in Multifunction Radars: Nash and correlated equilibria

• DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

• Global well-posedness of complex Ginzburg-Landau equation with a space-time white noise

• Ollivier-Ricci idleness functions of graphs

• Classical simulation of quantum circuits by dynamical localization: analytic results for Pauli-observable propagation in time-dependent disorder

• Two-time correlation and occupation time for the Brownian bridge and tied-down renewal processes

• Incremental learning of high-level concepts by imitation

• Sample size for comparing negative binomial rates in noninferiority and equivalence trials with unequal follow-up times

• Estimation in the convolution structure density model. Part I: oracle inequalities

• Estimation in the convolution structure density model. Part II: adaptation over the scale of anisotropic classes

• Bismut-Elworthy-Li formulae for Bessel processes

• A common limit in large rank for Markov chains defined from representations of classical Lie algebras

• How Robust Are Character-Based Word Embeddings in Tagging and MT Against Wrod Scramlbing or Randdm Nouse?

• Optimizing Differentiable Relaxations of Coreference Evaluation Metrics

• Bringing Structure into Summaries: Crowdsourcing a Benchmark Corpus of Concept Maps

• Cardinal Virtues: Extracting Relation Cardinalities from Text

• Liquid Splash Modeling with Neural Networks

• Optimal Power Splitting for Simultaneous Information Detection and Energy Harvesting

• On Generalized Bellman Equations and Temporal-Difference Learning

• A Low-Complexity Approach to Distributed Cooperative Caching with Geographic Constraints

• Lean From Thy Neighbor: Stochastic & Adversarial Bandits in a Network

• Maximal Unbordered Factors of Random Strings

• Additive Spanners and Distance Oracles in Quadratic Time

• Deep Structured Learning for Facial Action Unit Intensity Estimation

• TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering

• Mobility Edges in 1D Bichromatic Incommensurate Potentials

• Improving Object Detection With One Line of Code

• Configuration spaces, $\operatorname{FS^{op}}$-modules, and Kazhdan-Lusztig polynomials of braid matroids

• Recovery of damped exponentials using structured low rank matrix completion

• Interpretable 3D Human Action Analysis with Temporal Convolutional Networks

• ShapeWorld – A new test methodology for multimodal language understanding

• Neural Machine Translation Model with a Large Vocabulary Selected by Branching Entropy

• Translation of Patent Sentences with a Large Vocabulary of Technical Terms Using Neural Machine Translation

• Hierarchic Kernel Recursive Least-Squares

• Model Uncertainty, Recalibration, and the Emergence of Delta-Vega Hedging

• Neural Extractive Summarization with Side Information

• Divergence Measures Estimation and Its Asymptotic Normality Theory Using Wavelets Empirical Processes

• Incentivizing reliable demand response with customers’ uncertainties and capacity planning

• A Simple Randomized Algorithm to Compute Harmonic Numbers and Logarithms

• Cross-lingual Abstract Meaning Representation Parsing

• A New Take on Protecting Cyclists in Smart Cities

• SETH-Based Lower Bounds for Subset Sum and Bicriteria Path

• On the Gap Between Strict-Saddles and True Convexity: An Omega(log d) Lower Bound for Eigenvector Approximation

• Distributional model on a diet: One-shot word learning from text only

• A quantum walk on a line and a Weyl equation in a space

• Pseudo-Separation for Assessment of Structural Vulnerability of a Network

• User-transparent Distributed TensorFlow

• On the Existence and Continuity of Equilibria for Two-Person Zero-Sum Games with Uncertain Payoffs

• Neural Paraphrase Identification of Questions with Noisy Pretraining

• Asynchronous Parallel Empirical Variance Guided Algorithms for the Thresholding Bandit Problem

• RaPro: A Novel 5G Rapid Prototyping System Architecture

• On Synchronous, Asynchronous, and Randomized Best-Response schemes for computing equilibria in Stochastic Nash games

• A Quadratic Penalty Method for Hypergraph Matching

• Performance of Energy Harvesting Receivers with Power Optimization

• Distributed demand-side contingency-service provisioning while minimizing consumer disutility through local frequency measurements and inter-load communication

• Deep Learning for Photoacoustic Tomography from Sparse Data

• Data aggregation routing protocols in wireless sensor networks: a taxonomy

• Duality in percolation via outermost boundaries III: Plus connected components

• Long Paths and Hamiltonian paths in Inhomogenous Random Graphs

• Cliques and Chromatic Number in Inhomogenous Random Graphs

• Energy-Efficient Mobile Cooperative Computing

• A novel approach for fast mining frequent itemsets use N-list structure based on MapReduce

• Randomized detection and detection capacity of multidetector networks

• MUSE: Modularizing Unsupervised Sense Embeddings

• Massive MU-MIMO-OFDM Downlink with One-Bit DACs and Linear Precoding

• Approximating Constrained Minimum Input Selection for State Space Structural Controllability

• A learning-based approach for automatic image and video colorization

• Robust Transceiver Design Based on Interference Alignment for Multi-User Multi-Cell MIMO Networks with Channel Uncertainty

• On Monte-Carlo tree search for deterministic games with alternate moves and complete information

• Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification with Convolutional Neural Networks

• Relevant change points in high dimensional time series

• FMtree: A fast locating algorithm of FM-indexes for genomic data

• Advances in Detection and Error Correction for Coherent Optical Communications: Regular, Irregular, and Spatially Coupled LDPC Code Designs

• How to desynchronize quorum-sensing networks

• A fast ILP-based Heuristic for the robust design of Body Wireless Sensor Networks

• Capacity of the Gaussian Two-Pair Two-Way Relay Channel to Within 1/2 Bit

• Liu-Nagel phase diagrams in infinite dimension

• Big Universe, Big Data: Machine Learning and Image Analysis for Astronomy

• The Reactor: A Sample-Efficient Actor-Critic Architecture

• Optimal Output Consensus of High-Order Multi-Agent Systems with Embedded Technique

• Negative Cycle Separation in Wireless Network Design

• Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping

• Temporal Action Localization by Structured Maximal Sums

• Limit Theorems for Monochromatic 2-Stars and Triangles

• Graph Convolutional Encoders for Syntax-aware Neural Machine Translation

• Automaton model of protein: dynamics of conformational and functional states

• RACE: Large-scale ReAding Comprehension Dataset From Examinations

• Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH

• Worst portfolios for dynamic monetary utility processes

• Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions

• Steiner diameter, maximum degree and size of a graph

• Adaptive Network Coding Schemes for Satellite Communications

• Rooted Graph Minors and Reducibility of Graph Polynomials