**Cold-Start Aware User and Product Attention for Sentiment Classification**

**Entity Commonsense Representation for Neural Abstractive Summarization**

**Regression with Functional Errors-in-Predictors: A Generalized Method-of-Moments Approach**

**Low-rank geometric mean metric learning**

**Configurable Markov Decision Processes**

**Hierarchical interpretations for neural network predictions**

**Understanding the Meaning of Understanding**

**Beyond Bags of Words: Inferring Systemic Nets**

**Enabling End-To-End Machine Learning Replicability: A Case Study in Educational Data Mining**

• Constrained existence problem for weak subgame perfect equilibria with omega-regular Boolean objectives

• Status maximization as a source of fairness in a networked dictator game

• Distributed Hypothesis Testing based on Unequal-Error Protection Codes

• Correlation Tracking via Robust Region Proposals

• EL-GAN: Embedding Loss Driven Generative Adversarial Networks for Lane Detection

• Fast Decoding of Low Density Lattice Codes

• Bounds and algorithms for $k$-truss

• Improved Density-Based Spatio–Textual Clustering on Social Media

• SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment

• Translations as Additional Contexts for Sentence Classification

• The Exact Equivalence of Distance and Kernel Methods for Hypothesis Testing

• Humor Detection in English-Hindi Code-Mixed Social Media Content: Corpus and Baseline System

• NetScore: Towards Universal Metrics for Large-scale Performance Analysis of Deep Neural Networks for Practical Usage

• ReConvNet: Video Object Segmentation with Spatio-Temporal Features Modulation

• Dense Light Field Reconstruction From Sparse Sampling Using Residual Network

• Neural Stethoscopes: Unifying Analytic, Auxiliary and Adversarial Network Probing

• Statistical Aspects of Wasserstein Distances

• Aspect Sentiment Model for Micro Reviews

• On the ranking of Test match batsmen

• Inference in Deep Gaussian Processes using Stochastic Gradient Hamiltonian Monte Carlo

• Nearly Zero-Shot Learning for Semantic Decoding in Spoken Dialogue Systems

• Hamiltonian cycles in planar cubic graphs with facial 2-factors, and a new partial solution of Barnette’s Conjecture

• Morphological and Language-Agnostic Word Segmentation for NMT

• Simultaneous Sensor and Actuator Selection/Placement through Output Feedback Control

• Automatic Language Identification for Romance Languages using Stop Words and Diacritics

• Copycat CNN: Stealing Knowledge by Persuading Confession with Random Non-Labeled Data

• Efficient Active Learning for Image Classification and Segmentation using a Sample Selection and Conditional Generative Adversarial Network

• Stabilization with a Specified External Gain for Linear MIMO Systems and Its Applications to Control of Networked Systems

• The genus of the Erdős-Rényi random graph and the fragile genus property

• An Input-Delay Event-Triggered Control Design for Nonlinear Systems

• New Look at Finite Single Server Queue with Poisson Input and Semi-Markov Service Times

• Learning Cross-lingual Distributed Logical Representations for Semantic Parsing

• Semi-fractional diffusion equations

• Analysis of the Effect of Unexpected Outliers in the Classification of Spectroscopy Data

• Deep Generative Models in the Real-World: An Open Challenge from Medical Imaging

• The committee machine: Computational to statistical gaps in learning a two-layers neural network

• Asymptotic maximal order statistic for SIR in $\kappa$-$\mu$ shadowed fading

• Scalable load balancing in networked systems: A survey of recent advances

• Stochastic Gradient Descent with Exponential Convergence Rates of Expected Classification Errors

• ServeNet: A Deep Neural Network for Web Service Classification

• Transfer Learning for Context-Aware Question Matching in Information-seeking Conversations in E-commerce

• Approximation and duality problems of refracted processes

• Urdu Word Segmentation using Conditional Random Fields (CRFs)

• Improving precipitation forecast using extreme quantile regression

• Maximum weight spectrum codes with reduced length

• Sequential Bayesian inference for spatio-temporal models of temperature and humidity data

• Ranking Recovery from Limited Comparisons using Low-Rank Matrix Completion

• Learning Dynamics of Linear Denoising Autoencoders

• 1-bit Localization Scheme for Radar using Dithered Quantized Compressed Sensing

• On the Perceptron’s Compression

• Simple model of fractal networks formed by self-organized critical dynamics

• Dynamical Isometry and a Mean Field Theory of RNNs: Gating Enables Signal Propagation in Recurrent Neural Networks

• Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks

• Theory of Estimation-of-Distribution Algorithms

• Parameter Learning and Change Detection Using a Particle Filter With Accelerated Adaptation

• PCAS: Pruning Channels with Attention Statistics

• A bijection between permutation matrices and descending plane partitions without special parts, which respects the quadruplet of statistics considered by Behrend, Di Francesco and Zinn–Justin

• On the heavy-tail behavior of the distributionally robust newsvendor

• Single Image Reflection Separation with Perceptual Losses

• Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition

• Fire SSD: Wide Fire Modules based Single Shot Detector on Edge Device

• Financial Forecasting and Analysis for Low-Wage Workers

• View-volume Network for Semantic Scene Completion from a Single Depth Image

• A Game Theoretic Approach to Learning and Dynamics in Information Retrieval

• Defending Against Saddle Point Attack in Byzantine-Robust Distributed Learning

• Deep Multi-Output Forecasting: Learning to Accurately Predict Blood Glucose Trajectories

• Finding GEMS: Multi-Scale Dictionaries for High-Dimensional Graph Signals

• Scalable Neural Network Compression and Pruning Using Hard Clustering and L1 Regularization

• Connecting descent and peak polynomials

• Assessing the Accuracy of a Wrist Motion Tracking Method for Counting Bites across Demographic and Food Variables

• Elastically Collective Nonlinear Langevin Equation Theory of Dynamics in Glass-Forming Liquids: Transient Localization, Thermodynamic Mapping and Cooperativity

• Cut-edges and regular factors in regular graphs of odd degree

• Convex Class Model on Symmetric Positive Definite Manifolds

• From Trailers to Storylines: An Efficient Way to Learn from Movies

• Normal approximation for sums of discrete $U$-statistics – application to Kolmogorov bounds in random subgraph counting

• On 2-representation infinite algebras arising from dimer models

• Identifying the Fake Base Station: A Location Based Approach

• Base Station Cooperation in Millimeter Wave Cellular Networks: Performance Enhancement of Cell-Edge Users

• Infinite-dimensional bilinear and stochastic balanced truncation

• SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners

• On the convergence of stationary solutions in the Smoluchowski-Kramers approximation of infinite dimensional systems

• Rate-Splitting Robustness in Multi-Pair Massive MIMO Relay Systems

• Exchangeable random partitions from max-infinitely-divisible distributions

• Deep Reinforcement Learning for Dynamic Urban Transportation Problems

• Positive Grassmannian and polyhedral subdivisions

• Bounds on sizes of caps in $AG(n,q)$ via the Croot-Lev-Pach polynomial method

• A Graphical Interactive Debugger for Distributed Systems

• Shape Features Extraction Using a Partial Differential Equation

• Apuntes de Redes Neuronales Artificiales (Notes on Artificial Neural Networks)

• Pattern Dependence Detection using n-TARP Clustering

• Hessian spectrum at the global minimum of high-dimensional random landscapes

• Automatic formation of the structure of abstract machines in hierarchical reinforcement learning with state clustering

• Asymptotic distribution of least square estimators for linear models with dependent errors

• Bounds on the localization number

• A Flexible Convolutional Solver with Application to Photorealistic Style Transfer

• How Predictable is Your State? Leveraging Lexical and Contextual Information for Predicting Legislative Floor Action at the State Level

• An unbiased approach to compressed sensing

• Limiting Behaviors of High Dimensional Stochastic Spin Ensemble

• Full Bayesian Modeling for fMRI Group Analysis

• Augmented Lagrangian-Based Decomposition Methods with Non-Ergodic Optimal Rates

• Kuramoto model for excitation-inhibition-based oscillations

• A theory of maximum likelihood for weighted infection graphs

• Benchmarks for Image Classification and Other High-dimensional Pattern Recognition Problems

• Large monochromatic components in multicolored bipartite graphs

• Online Self-supervised Scene Segmentation for Micro Aerial Vehicles

• Statistical Significance of CP Violation in Long Baseline Neutrino Experiments

• Analysis of Search Stratagem Utilisation

• SMHD: A Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions

• Finding your Lookalike: Measuring Face Similarity Rather than Face Identity

• Cover-Encodings of Fitness Landscapes

• Reduced words for clans

• Quasi-tight Framelets with Directionality or High Vanishing Moments Derived from Arbitrary Refinable Functions

• The $e$-vector of a simplicial complex

• Manifold Mixup: Encouraging Meaningful On-Manifold Interpolation as a Regularizer

• End-to-End Parkinson Disease Diagnosis using Brain MR-Images by 3D-CNN

• A latent spatial factor approach for synthesizing opioid associated deaths and treatment admissions in Ohio counties

• Identifying Recurring Patterns with Deep Neural Networks for Natural Image Denoising

• Shape correspondences from learnt template-based parametrization

• Human Activity Recognition Based on Wearable Sensor Data: A Standardization of the State-of-the-Art

• Leading Coefficients and the Multiplicity of Known Roots

• Decentralized Ergodic Control: Distribution-Driven Sensing and Exploration for Multi-Agent Systems

• Bringing replication and reproduction together with generalisability in NLP: Three reproduction studies for Target Dependent Sentiment Analysis

• Line Search Methods for Convex-Composite Optimization

• Impostor Networks for Fast Fine-Grained Recognition

• Weak Closed-Loop Solvability of Stochastic Linear-Quadratic Optimal Control Problems

• An Evaluation of Neural Machine Translation Models on Historical Spelling Normalization

• On the regularity of join-meet ideals of modular lattices

• Automatic counting of fission tracks in apatite and muscovite using image processing

• Fully Convolutional Network for Automatic Road Extraction from Satellite Imagery

• Cactus Graphs and Graphs Complement Conjecture

• Distributed Constrained Nonconvex Optimization: the Asynchronous Method of Multipliers

• A Retrospective Analysis of the Fake News Challenge Stance Detection Task

• Extracting Parallel Sentences with Bidirectional Recurrent Neural Networks to Improve Machine Translation

• Generating Sentences Using a Dynamic Canvas

• Martingales and Super-martingales Relative to a Convex Set of Equivalent Measures

• fMRI Semantic Category Decoding using Linguistic Encoding of Word Embeddings

• Impact of atmospheric impairments on mmWave based outdoor communication

• Maintenance of Smart Buildings using Fault Trees

• A Unified Framework for Generalizable Style Transfer: Style and Content Separation

• Are My EHRs Private Enough? Event-level Privacy Protection

• Detecting Statistically Significant Communities

• Weighted Tanimoto Coefficient for 3D Molecule Structure Similarity Measurement

• A Profit Optimization Approach Based on the Use of Pumped-Hydro Energy Storage Unit and Dynamic Pricing