**How to estimate time-varying Vector Autoregressive Models? A comparison of two methods**

**Neural Network Gradient Hamiltonian Monte Carlo**

**On Optimal Generalizability in Parametric Learning**

**A Deep Learning Approach for Expert Identification in Question Answering Communities**

**Optimizing Kernel Machines using Deep Learning**

**Sliced Wasserstein Distance for Learning Gaussian Mixture Models**

**Revisiting Simple Neural Networks for Learning Representations of Knowledge Graphs**

**A Fast and Robust TSVM for Pattern Classification**

**Z-Forcing: Training Stochastic Recurrent Networks**

**DNA-GAN: Learning Disentangled Representations from Multi-Attribute Images**

**Can clone detection support quality assessments of requirements specifications?**

**Squeeze-SegNet: A new fast Deep Convolutional Neural Network for Semantic Segmentation**

**Accelerated Alternating Projections for Robust Principal Component Analysis**

**Variational Adaptive-Newton Method for Explorative Learning**

**Advances in Variational Inference**

**Deep Temporal-Recurrent-Replicated-Softmax for Topical Trends over Time**

**Learning to Predict with Big Data**

**Semi-Supervised Approaches to Efficient Evaluation of Model Prediction Performance**

**Markov Decision Processes with Continuous Side Information**

• A learning problem that is independent of the set theory ZFC axioms

• Joint Gaussian Processes for Biophysical Parameter Retrieval

• Unsupervised patient representations from clinical notes with interpretable classification decisions

• Characterizations and Enumerations of Patterns of Signed Shifts

• Tree Projections and Constraint Optimization Problems: Fixed-Parameter Tractability and Parallel Algorithms

• Controllable Abstractive Summarization

• LAA LTE and WiFi based Smart Grid Metering Infrastructure in 3.5 GHz Band

• Towards Dual-functional Radar-Communication Systems: Optimal Waveform Design

• Revisiting Normalized Gradient Descent: Evasion of Saddle Points

• CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning

• Goal-Driven Query Answering for Existential Rules with Equality

• A visual search engine for Bangladeshi laws

• Considering Durations and Replays to Improve Music Recommender Systems

• Weakly-supervised Semantic Parsing with Abstract Examples

• Regularization and Hierarchical Prior Distributions for Adjustment with Health Care Claims Data: Rethinking Comorbidity Scores

• Private Information Retrieval from Storage Constrained Databases — Coded Caching meets PIR

• Loss Functions for Multiset Prediction

• Linear response and moderate deviations: hierarchical approach. III

• Neural Network Dynamics Models for Control of Under-actuated Legged Millirobots

• C-WSL: Count-guided Weakly Supervised Localization

• SI-ADMM: A Stochastic Inexact ADMM Framework for Resolving Structured Stochastic Convex Programs

• Modeling Semantic Relatedness using Global Relation Vectors

• Improved quantum backtracking algorithms through effective resistance estimates

• The KPZ Limit of ASEP with Boundary

• An Accelerated Communication-Efficient Primal-Dual Optimization Framework for Structured Machine Learning

• The $(2,2)$ and $(4,3)$ properties in families of fat sets in the plane

• Making spanning graphs

• Simulating Action Dynamics with Neural Process Networks

• The Value of Communication in Synthesizing Controllers given an Information Structure

• Geometric integrators and the Hamiltonian Monte Carlo method

• Supervised and Unsupervised Transfer Learning for Question Answering

• A bilinear Bogolyubov theorem

• Quotientopes

• On the Numerical Solution of Fourth-Order Linear Two-Point Boundary Value Problems

• Automatic Conflict Detection in Police Body-Worn Video

• Rate-Compatible Punctured Polar (RCPP) Codes Based On Hierarchical Puncturing

• Linear and quadratic uniformity of the Möbius function over $\mathbb{F}_q[t]$

• The Dispersion Bias

• Kernel Conditional Exponential Family

• LIUBoost: Locality Informed Underboosting for Imbalanced Data Classification

• Velocity variations at Columbia Glacier captured by particle filtering of oblique time-lapse images

• A Novel SDASS Descriptor for Fully Encoding the Information of 3D Local Surface

• Effective Filtering on a Random Slow Manifold

• Bridging Source and Target Word Embeddings for Neural Machine Translation

• A New Perspective on Robust $M$-Estimation: Finite Sample Theory and Applications to Dependence-Adjusted Multiple Testing

• Error bounds for Approximations of Markov chains

• Normal Approximation by Stein’s Method under Sublinear Expectations

• FARM-Test: Factor-Adjusted Robust Multiple Testing with False Discovery Control

• Semiblind subgraph reconstruction in Gaussian graphical models

• On the anti-Kekulé problem of cubic graphs

• Sparse Combinatorial Group Testing for Low-Energy Massive Random Access

• Influential Sample Selection: A Graph Signal Processing Approach

• Recurrent Neural Networks as Weighted Language Recognizers

• IKBT: solving closed-form Inverse Kinematics with Behavior Tree

• On the Anti-Jamming Performance of the NR-DCSK System

• Accelerating Cross-Validation in Multinomial Logistic Regression with $\ell_1$-Regularization

• Physical Layer Security Schemes for Full-Duplex Cooperative Systems: State of the Art and Beyond

• The landscape of the spiked tensor model

• The Chromatic Number of the Disjointness Graph of the Double Chain

• Modular Resource Centric Learning for Workflow Performance Prediction

• Deep Inception-Residual Laplacian Pyramid Networks for Accurate Single Image Super-Resolution

• A Sequential Neural Encoder with Latent Structured Description for Modeling Sentences

• TorusE: Knowledge Graph Embedding on a Lie Group

• A characterization of finite abelian groups via sets of lengths in transfer Krull monoids

• On Mubayi’s Conjecture and conditionally intersecting sets

• Human and Machine Speaker Recognition Based on Short Trivial Events

• Robust Real-Time Multi-View Eye Tracking

• Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition

• Hibikino-Musashi@Home 2017 Team Description Paper

• A Public Image Database for Benchmark of Plant Seedling Classification Algorithms

• A Machine Learning Approach to Modeling Human Migration

• Modeling Binary Time Series Using Gaussian Processes with Application to Predicting Sleep States

• Aicyber’s System for NLPCC 2017 Shared Task 2: Voting of Baselines

• Tracking Typological Traits of Uralic Languages in Distributed Language Representations

• Deterministic Distributed Edge-Coloring with Fewer Colors

• On the Utility of Context (or the Lack Thereof) for Object Detection

• Coloring intersection hypergraphs of pseudo-disks

• The best defense is a good offense: Countering black box attacks by predicting slightly wrong labels

• A Convex Parametrization of a New Class of Universal Kernel Functions for use in Kernel Learning

• No Reference Stereoscopic Video Quality Assessment Using Joint Motion and Depth Statistics

• Efficient Estimation of Generalization Error and Bias-Variance Components of Ensembles

• Fisher information matrix of binary time series

• A Lie bracket approximation approach to distributed optimization over directed graphs

• A Descent on Simple Graphs — from Complete to Cycle — and Algebraic Properties of Their Spectra

• Sparse identification of nonlinear dynamics for model predictive control in the low-data limit

• A Generally Applicable, Highly Scalable Measurement Computation and Optimization Approach to Sequential Model-Based Diagnosis

• Note on Representing attribute reduction and concepts in concepts lattice using graphs

• Convolutional Neural Networks and Data Augmentation for Spectral-Spatial Classification of Hyperspectral Images

• Investigating Inner Properties of Multimodal Representation and Semantic Compositionality with Brain-based Componential Semantics

• MAMoC: Multisite Adaptive Offloading Framework for Mobile Cloud Applications

• A Correlation Based Feature Representation for First-Person Activity Recognition

• Two-Sample Test for Sparse High Dimensional Multinomial Distributions

• Trees of self-avoiding walks

• A balanced non-partitionable Cohen-Macaulay complex

• Dual-Path Convolutional Image-Text Embedding

• Detecting and assessing contextual change in diachronic text documents using context volatility

• Good and safe uses of AI Oracles

• Sound Event Detection in Synthetic Audio: Analysis of the DCASE 2016 Task Results

• A Stochastic Resource-Sharing Network for Electric Vehicle Charging

• Fully-dynamic risk-indifference pricing and no-good-deal bounds

• Dialogue Act Recognition via CRF-Attentive Structured Network

• An Extended Sensitivity Analysis for Heterogeneous Unmeasured Confounding

• (2+1)-dimensional interface dynamics: mixing time, hydrodynamic limit and Anisotropic KPZ growth

• Mitigating Clipping Effects on Error Floors under Belief Propagation Decoding of Polar Codes

• PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development

• People, Penguins and Petri Dishes: Adapting Object Counting Models To New Visual Domains And Object Types Without Forgetting

• New support for the value 5/2 for the spin glass lower critical dimension at zero magnetic field

• Words are Malleable: Computing Semantic Shifts in Political and Media Discourse

• A bijective proof of the enumeration of maps in higher genus

• On consistent vertex nomination schemes

• Interpreting Deep Visual Representations via Network Dissection

• Spatial Mapping with Gaussian Processes and Nonstationary Fourier Features

• P-spline smoothing for spatial data collected worldwide

• Sharp non-asymptotic Concentration Inequalities for the Approximation of the Invariant Measure of a Diffusion

• Gaussian width bounds with applications to arithmetic progressions in random settings

• Parsimonious Model-Based Clustering with Covariates

• Quantitative Benchmarks and New Directions for Noise Power Estimation Methods in ISM Radio Environment

• Relating the wave-function collapse with Euler’s formula

• Spatial Joint Species Distribution Modeling using Dirichlet Processes

• A Tractable Product Channel Model for Line-of-Sight Scenarios

• A Friendly Smoothed Analysis of the Simplex Method

• On laws of large numbers in $L^2$ for supercritical branching Markov processes beyond $\lambda$-positivity

• On joint distribution of range and terminal value of a Brownian motion

• Unsupervised Morphological Expansion of Small Datasets for Improving Word Embeddings

• An Unsupervised Approach for Mapping between Vector Spaces

• Hydra: a C++11 framework for data analysis in massively parallel platforms

• Novel decision-theoretic and risk-stratification metrics of predictive performance: Application to deciding who should undergo genetic testing

• Motif-based Convolutional Neural Network on Graphs

• Brain Extraction from Normal and Pathological Images: A Joint PCA/Image-Reconstruction Approach

• Bayesian optimal designs for dose-response curves with common parameters

• Contextual Object Detection with a Few Relevant Neighbors

• Classification of binary self-dual [76, 38, 14] codes with an automorphism of order 9

• CSWA: Aggregation-Free Spatial-Temporal Community Sensing

• Fighting fish and two-stack sortable permutations

• BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems

• Exact Limits of Inference in Coalescent Models

• Extremes of multifractional Brownian motion

• Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations