**Activation Ensembles for Deep Neural Networks**

**On the Origin of Deep Learning**

**Adaptive Neural Networks for Fast Test-Time Prediction**

**An Unsupervised Learning Method Exploiting Sequential Output Statistics**

**Deep Voice: Real-time Neural Text-to-Speech**

**Rationalization: A Neural Machine Translation Approach to Generating Natural Language Explanations**

**Online Learning with Many Experts**

**Generative Adversarial Active Learning**

**Related Pins at Pinterest: The Evolution of a Real-World Recommender System**

**Local Short Term Electricity Load Forecasting: Automatic Approaches**

**Online Multiview Representation Learning: Dropping Convexity for Better Efficiency**

**Reinforcement Learning with Deep Energy-Based Policies**

**Neural Map: Structured Memory for Deep Reinforcement Learning**

**Learning Hierarchical Features from Generative Models**

**Statistical Anomaly Detection via Composite Hypothesis Testing for Markov Models**

• When confidence and competence collide: Effects on online decision-making discussions

• Obtaining highly excited eigenstates of the localized XX chain via DMRG-X

• Coherent Oscillations of Driven rf SQUID Metamaterials

• Decoding Generalized Reed-Solomon Codes and Its Application to RLCE Encryption Schemes

• Key Reconciliation with Low-Density Parity-Check Codes for Long-Distance Quantum Cryptography

• Crowdsourcing Cybersecurity: Cyber Attack Detection using Social Media

• A supervised approach to time scale detection in dynamic networks

• Multi-Competitive Viruses over Static and Time-Varying Networks

• Background rejection method for tens of TeV gamma-ray astronomy applicable to wide angle timing arrays

• Unifying local and non-local signal processing with graph CNNs

• Survival Trees for Interval-Censored Survival data

• Coalescence and Minimal Spanning Trees of Irregular Graphs

• Linearity in minimal resolutions of monomial ideals

• Primary gamma ray selection in a hybrid timing/imaging Cherenkov array

• Video and Accelerometer-Based Motion Analysis for Automated Surgical Skills Assessment

• A Note on Nonlocal Prior Method

• Changing Model Behavior at Test-Time Using Reinforcement Learning

• On Optimal Portfolios of Dynamic Resource Allocations

• Measuring #GamerGate: A Tale of Hate, Sexism, and Bullying

• Convolutional Gated Recurrent Neural Network Incorporating Spatial Features for Audio Tagging

• Residual Convolutional CTC Networks for Automatic Speech Recognition

• A Study of the Allan Variance for Constant-Mean Non-Stationary Processes

• Parametric analysis of Cherenkov light LDF from EAS in the range 30-3000 TeV for primary gamma rays and nuclei

• Rank-to-engage: New Listwise Approaches to Maximize Engagement

• Exact Methods for Recursive Circle Packing

• Consistent structure estimation of exponential-family random graph models with additional structure

• Near Data Scheduling for Data Centers with Multi Levels of Data Locality

• Nonparanormal Information Estimation

• A Constrained Conditional Likelihood Approach for Estimating the Means of Selected Populations

• Revisiting NARX Recurrent Neural Networks for Long-Term Dependencies

• When Does Diversity of User Preferences Improve Outcomes in Selfish Routing?

• A Decomposition of Forecast Error in Prediction Markets

• Visibility graphs of random scalar fields and spatial data

• Subquadratic Algorithms for the Diameter and the Sum of Pairwise Distances in Planar Graphs

• Total positivity of Narayana matrices

• Optimizing the Coherence of Composite Networks

• A Near-Optimal Sampling Strategy for Sparse Recovery of Polynomial Chaos Expansions

• New constructions of MDS codes with complementary duals

• Constructing Adjacency Arrays from Incidence Arrays

• Efficient coordinate-wise leading eigenvector computation

• Critical Survey of the Freely Available Arabic Corpora

• Synthesizing Training Data for Object Detection in Indoor Scenes

• Transfer Learning for Domain Adaptation in MRI: Application in Brain Lesion Segmentation

• Greedy coordinate descent from the view of $\ell_1$-norm gradient descent

• Electronic conduction properties of indium tin oxide: single-particle and many-body transport

• Chi-boundedness of graph classes excluding wheel vertex-minors

• Zero sum partition into sets of the same order and its applications

• Signal Denoising Using the Minimum-Probability-of-Error Criterion

• On the Performance of Wireless Powered Communication With Non-linear Energy Harvesting

• An EM Based Probabilistic Two-Dimensional CCA with Application to Face Recognition

• Contractibility for Open Global Constraints

• Random sorting networks: local statistics via random matrix laws

• Learning Deep NBNN Representations for Robust Place Categorization

• Are there needles in a moving haystack? Adaptive sensing for detection of dynamically evolving signals

• Approval Voting with Intransitive Preferences

• Coarse Grained Exponential Variational Autoencoders

• CHAOS: A Parallelization Scheme for Training Convolutional Neural Networks on Intel Xeon Phi

• Analysis of Urban Vibrancy and Safety in Philadelphia

• Rician MIMO Channel- and Jamming-Aware Decision Fusion

• Random ultrametric trees and applications

• Sparsity constrained split feasibility for dose-volume constraints in inverse planning of intensity-modulated photon or proton therapy

• Upper-Bounding the Regularization Constant for Convex Sparse Signal Reconstruction

• The role of quantum correlations in Cop and Robber game

• Efficient Learning of Graded Membership Models

• A decentralized algorithm for control of autonomous agents coupled by feasibility constraints

• Image Stitching by Line-guided Local Warping with Global Similarity Constraint

• Complexity Classification of the Eight-Vertex Model

• Upper bounds on the smallest size of a saturating set in projective planes and spaces of even dimension

• BARCHAN: Blob Alignment for Robust CHromatographic ANalysis

• Stochastic Variance Reduction Methods for Policy Evaluation

• Global Optimality in Low-rank Matrix Optimization

• Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret

• Supervised Learning of Labeled Pointcloud Differences via Cover-Tree Entropy Reduction

• An Efficient Multiway Mergesort for GPU Architectures

• Spatially Aware Melanoma Segmentation Using Hybrid Deep Learning Techniques

• Globally Optimal Gradient Descent for a ConvNet with Gaussian Inputs

• Seeing What Is Not There: Learning Context to Determine Where Objects Are Missing

• Multi-scale Spectrum Sensing in Small-Cell mm-Wave Cognitive Wireless Networks

• Building Fast and Compact Convolutional Neural Networks for Offline Handwritten Chinese Character Recognition

• Ratio Utility and Cost Analysis for Privacy Preserving Subspace Projection

• BayCount: A Bayesian Decomposition Method for Inferring Tumor Heterogeneity using RNA-Seq Counts

• Maximum-Likelihood Augmented Discrete Generative Adversarial Networks

• Collaborative Optimization for Collective Decision-making in Continuous Spaces

• A multi-task convolutional neural network for mega-city analysis using very high resolution satellite imagery and geospatial data

• Strong rainbow connection numbers of toroidal meshes

• A random regularized approximate solution of the inverse problem for the Burgers’ equation

• Detecting (Un)Important Content for Single-Document News Summarization

• Kiefer Wolfowitz Algorithm is Asymptotically Optimal for a Class of Non-Stationary Bandit Problems

• Bayesian Nonparametric Feature and Policy Learning for Decision-Making

• Exact Random Coding Exponents and Universal Decoders for the Asymmetric Broadcast Channel

• Bayesian Nonparametric Unmixing of Hyperspectral Images

• Analyzing Modular CNN Architectures for Joint Depth Prediction and Semantic Segmentation

• Weak composition quasi-symmetric functions, Rota-Baxter algebras and Hopf algebras

• Adversarial Networks for the Detection of Aggressive Prostate Cancer

• Support vector machine and its bias correction in high-dimension, low-sample-size settings

• Friends and Enemies of Clinton and Trump: Using Context for Detecting Stance in Political Tweets

• SLE Loop Measures

• Euclidean and Hermitian LCD MDS codes

• Cutoff for Ramanujan graphs via degree inflation

• Recursions associated to trapezoid, symmetric and rotation symmetric functions over Galois fields

• Criticality and Deep Learning, Part I: Theory vs. Empirics

• Weak invariance principle in Besov spaces for stationary martingale differences

• Benefits of Cache Assignment on Degraded Broadcast Channels

• General Upper Bounds for Gate Complexity and Depth of Reversible Circuits Consisting of NOT, CNOT and 2-CNOT Gates

• Delay-Optimal Probabilistic Scheduling in Green Communications with Arbitrary Arrival and Adaptive Transmission

• Wireless Network Optimization via Stochastic Subgradient Algorithm: Convergence Rate Analysis

• Row-Centric Lossless Compression of Markov Images

• Observability and Controllability of a non-autonomous Schrödinger equation

• The Ensemble Kalman Filter: A Signal Processing Perspective

• Constructing ergodic diffusion processes on submanifolds

• Using Battery Storage for Peak Shaving and Frequency Regulation: Joint Optimization for Superlinear Gains

• PubTree: A Hierarchical Search Tool for the MEDLINE Database

• Learning Control for Air Hockey Striking using Deep Reinforcement Learning

• Topological Interference Management with Decoded Message Passing

• On Algorithmic Statistics for space-bounded algorithms

• Selection of training populations (and other subset selection problems) with an accelerated genetic algorithm (STPGA: An R-package for selection of training populations with a genetic algorithm)

• On the calculation of Fisher information for quantum parameter estimation based on the stochastic master equation

• Lattice Coding and Decoding for Multiple-Antenna Ergodic Fading Channels

• Constrained Maximum Likelihood Estimators for Densities

• 3D Scanning System for Automatic High-Resolution Plant Phenotyping

• Extended trust region problems over one or two balls: exact (semi-)Lagrangian relaxations

• Bioplausible multiscale filtering in retino-cortical processing as a mechanism in perceptual grouping

• Log-Harnack Inequalities for Markov Semigroups Generated by Non-Local Gruschin Type Operators

• A Unifying Framework for Convergence Analysis of Approximate Newton Methods

• Generating functions for permutations which avoid consecutive patterns with multiple descents

• Multiuser Precoding and Channel Estimation for Hybrid Millimeter Wave MIMO Systems

• A General Framework for Low-Resolution Receivers for MIMO Channels

• Deceiving Google’s Perspective API Built for Detecting Toxic Comments

• Improved Variational Autoencoders for Text Modeling using Dilated Convolutions

• A mixture model approach to infer land-use influence on point referenced water quality

• Tensor Balancing on Statistical Manifold

• Synchronization Problems in Automata without Non-trivial Cycles

• A Copula-based Imputation Model for Missing Data of Mixed Type in Multilevel Data Sets

• Improvement on Brook theorem for (3 Times K1)-free Graphs

• A KZ Reduction Algorithm

• HPDedup: A Hybrid Prioritized Data Deduplication Mechanism for Primary Storage in the Cloud

• Multi-scale Image Fusion Between Pre-operative Clinical CT and X-ray Microtomography of Lung Pathology

• Conjectures related to regularity in the Kolakoski sequence

• F2F: A Library For Fast Kernel Expansions

• HashBox: Hash Hierarchical Segmentation exploiting Bounding Box Object Detection

• Linear Convergence of the Proximal Incremental Aggregated Gradient Method under Quadratic Growth Condition

• Unitarizability of weight modules over noncommutative Kleinian fiber products

• Communication-efficient Algorithms for Distributed Stochastic Principal Component Analysis

• Tverberg type theorems for matroids

• Fixed-point optimization of deep neural networks with adaptive step size retraining

• Tars: Timeliness-aware Adaptive Replica Selection for Key-Value Stores

• Another Look at the Implementation of Read/write Registers in Crash-prone Asynchronous Message-Passing Systems (Extended Version)

• The metric dimension of the circulant graph $C(n,\pm\{1,2,3,4\})$

• A model solution of the generalized Langevin equation: Emergence and Breaking of Time-Scale Invariance in Single-Particle Dynamics of Liquids

• Hausdorff dimension of the boundary of bubbles of additive Brownian motion and of the Brownian sheet

• An update on statistical boosting in biomedicine

• dotCall64: An Efficient Interface to Compiled C/C++ and Fortran Code Supporting Long Vectors

• Three-Particle Correlations in Liquid and Amorphous Aluminium

• DeepNAT: Deep Convolutional Neural Network for Segmenting Neuroanatomy

• Mutual Information based labelling and comparing clusters

• Approximation Strategies for Generalized Binary Search in Weighted Trees

• Online Nonparametric Learning, Chaining, and the Role of Partial Feedback

• Anticipating many futures: Online human motion prediction and synthesis for human-robot collaboration

• A case study on English-Malayalam Machine Translation

• Synergistic Team Composition

• On the second Feng-Rao distance of Algebraic Geometry codes related to Arf semigroups

• Low-Precision Batch-Normalized Activations

• Hajós-like theorem for signed graphs

• Variational Inference using Implicit Distributions

• Consensus Patterns parameterized by input string length is W[1]-hard

• Bayesian inference on random simple graphs with power law degree distributions

• Subspace Sum Graph of a Vector Space

• On the Expected Value of the Determinant of Random Sum of Rank-One Matrices

• Scalable and Distributed Clustering via Lightweight Coresets

• Uniform Deviation Bounds for Unbounded Loss Functions like k-Means

• Hessian corrections to Hybrid Monte Carlo

• Learning with Errors is easy with quantum samples

• Upper and Lower Bounds for the Ergodic Capacity of MIMO Jacobi Fading Channels

• Fast and Accurate Inference with Adaptive Ensemble Prediction in Image Classification with Deep Neural Networks

• Sequential Discrete Kalman Filter for Real-Time State Estimation in Power Distribution Systems: Theory and Implementation

• A Dataset for Developing and Benchmarking Active Vision

• Adaptive Learning to Speed-Up Control of Prosthetic Hands: a Few Things Everybody Should Know

• Balancing Lexicographic Fairness and a Utilitarian Objective with Application to Kidney Exchange

• Invariance principle via orthomartingale approximation

• Asynchronous Incremental Stochastic Dual Descent Algorithm for Network Resource Allocation

• Irreducible convex paving for decomposition of multi-dimensional martingale transport plans

• Independent Set Size Approximation in Graph Streams

• Hybrid method for identifying mass groups of primary cosmic rays in the joint operation of IACTs and wide angle Cherenkov timing arrays

• Identifying beneficial task relations for multi-task learning in deep neural networks

• The Fermi problem in disordered systems

• Efficient Privacy Preserving Viola-Jones Type Object Detection via Random Base Image Representation

• Visual Translation Embedding Network for Visual Relation Detection

• An Efficient Pseudo-likelihood Method for Sparse Binary Pairwise Markov Network Estimation

• Stochastic Stability Analysis of Perturbed Learning Automata with Constant Step-Size in Strategic-Form Games

• Multi-Label Segmentation via Residual-Driven Adaptive Regularization

• On Fienup Methods for Regularized Phase Retrieval

• Approximate Inference with Amortised MCMC

• Wright-Fisher diffusions for evolutionary games with death-birth updating

• Reduction and regular $t$-balanced Cayley maps on split metacyclic 2-groups

• Hopf algebra techniques to handle dynamical systems and numerical integrators

• Dynamic Word Embeddings via Skip-Gram Filtering

• Differentiable Learning of Logical Rules for Knowledge Base Completion

• The Local Limit of Random Sorting Networks

• Divisible sandpile on Sierpinski gasket graphs

• The Robot Crawler Model on Complete k-Partite and Erdős-Rényi Random Graphs

• Asymptotic enumeration of graphs by degree sequence, and the degree sequence of a random graph

• Dense blowup for parabolic SPDEs

• Optimized Secure Position Sharing with Non-trusted Servers

• Revealing Hidden Potentials of q-Space Imaging in Breast Cancer

• Scheduling Post-Disaster Repairs in Electricity Distribution Networks

• On the affine random walk on the torus

• Stance Classification of Social Media Users in Independence Movements

• Equivariance Through Parameter-Sharing

• Parametric Analysis of Cherenkov Light LDF from EAS for High Energy Gamma Rays and Nuclei: Ways of Practical Application

• Forward Event-Chain Monte Carlo: a general rejection-free and irreversible Markov chain simulation method

• McGan: Mean and Covariance Feature Matching GAN

• Dynamic principle for ensemble control tools

• Asymmetric Tri-training for Unsupervised Domain Adaptation

• Latent Correlation Gaussian Processes

• Game-Theoretic Semantics for ATL+ with Applications to Model Checking

• An SDP-Based Algorithm for Linear-Sized Spectral Sparsification

• Embarrassingly parallel inference for Gaussian processes

• Age Progression/Regression by Conditional Adversarial Autoencoder

• Boundary-Seeking Generative Adversarial Networks

• Structure of martingale transports in finite dimensions

• Skin Lesion Classification Using Hybrid Deep Neural Networks