**A description length approach to determining the number of k-means clusters**

**SARAH: A Novel Method for Machine Learning Problems Using Stochastic Recursive Gradient**

**Revisiting Unsupervised Learning for Defect Prediction**

**Easy over Hard: A Case Study on Deep Learning**

**Maximal Solutions of Sparse Analysis Regularization**

**Online Natural Gradient as a Kalman Filter**

**Systematic Generation of Algorithms for Iterative Methods**

**Gradient Boosting on Stochastic Data Streams**

**The Statistical Recurrent Unit**

**Fast k-Nearest Neighbour Search via Prioritized DCI**

**Learning to Optimize Neural Nets**

• Weighted meta-path generation, Multi-relational recommender system, Heterogeneous information network, Weighted random walk sampling

• Context-Sensitive Super-Resolution for Fast Fetal Magnetic Resonance Imaging

• Reproducible experiments on dynamic resource allocation in cloud data centers

• Tree tribes and lower bounds for switching lemmas

• Provable Optimal Algorithms for Generalized Linear Contextual Bandits

• SceneSeer: 3D Scene Design with Natural Language

• Fair prediction with disparate impact: A study of bias in recidivism prediction instruments

• Rook placements and Jordan forms of upper-triangular nilpotent matrices

• Achieving non-discrimination in prediction

• On the Power of Learning from $k$-Wise Queries

• Deep Image Harmonization

• Discrete Wavelet Transform Based Algorithm for Recognition of QRS Complexes

• Multi-Sensor Data Pattern Recognition for Multi-Target Localization: A Machine Learning Approach

• Segmentation of Lesions in Dermoscopy Images Using Saliency Map And Contour Propagation

• Combinatorial models for Schubert polynomials

• A Joint Identification Approach for Argumentative Writing Revisions

• Semi-analytical approximations to statistical moments of sigmoid and softmax mappings of normal variables

• Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling

• Learning Conversational Systems that Interleave Task and Non-Task Content

• Joint Beamforming and Antenna Selection for Sum Rate Maximization in Cognitive Radio Networks

• Dual Iterative Hard Thresholding: From Non-convex Sparse Minimization to Non-smooth Concave Maximization

• Remote Sensing Image Scene Classification: Benchmark and State of the Art

• RGB-D Salient Object Detection Based on Discriminative Cross-modal Transfer Learning

• Theory and Applications of Matrix-Weighted Consensus

• Application of SNiPER framework to BESIII physics analysis

• Explosive oscillation death in coupled Stuart-Landau oscillators

• Spatial asymptotic of the stochastic heat equation with compactly supported initial data

• The weighted poset metrics and directed graph metrics

• Codebook Design for Channel Feedback in Lens-Based Millimeter-Wave Massive MIMO Systems

• Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank

• Robust Beamforming for Secrecy Rate in Cooperative Cognitive Radio Multicast Communications

• A Computationally Efficient Algorithm to Find Time-Optimal Trajectory of Redundantly Actuated Robots Moving on a Specified Path

• Saliency Detection by Forward and Backward Cues in Deep-CNNs

• Inertial Odometry on Handheld Smartphones

• Saliency Fusion in Eigenvector Space with Multi-Channel Pulse Coupled Neural Network

• Adaptive estimation of the sparsity in the Gaussian vector model

• Modular Representation of Layered Neural Networks

• Optical Flow-based 3D Human Motion Estimation from Monocular Video

• 5G Mobile Cellular Networks: Enabling Distributed State Estimation for Smart Grid

• Complex active optical networks as a new laser concept

• A uniformness conjecture of the Kolakoski sequence, graph connectivity, and correlations

• Littlewood-Paley theory for triangle buildings

• Massively parallel lattice-Boltzmann codes on large GPU clusters

• Performance and Portability of Accelerated Lattice Boltzmann Applications with OpenACC

• Lower Bounds on Exponential Moments of the Quadratic Error in Parameter Estimation

• Incorporating Intra-Class Variance to Fine-Grained Visual Recognition

• Frequency patterns of semantic change: Corpus-based evidence of a near-critical dynamics in language change

• Wright-Fisher diffusion bridges, Coalescent processes in Wright-Fisher diffusion bridges

• Congestion-Aware Distributed Network Selection for Integrated Cellular and Wi-Fi Networks

• Quantifying the entropic cost of cellular growth control

• The ordered independent loss model for the evolution of CRISPR spacers

• Improving Object Detection with Region Similarity Learning

• Algorithms and Bounds for Very Strong Rainbow Coloring

• Reordering Method and Hierarchies for Quantum and Classical Ordered Binary Decision Diagrams

• On the total variation Wasserstein gradient flow and the TV-JKO scheme

• Learning A Physical Long-term Predictor

• The Polycluster Theory for the Structure of Glasses: Evidence from Low Temperature Physics

• Human Eye Visual Hyperacuity: A New Paradigm for Sensing?

• Improving phase II oncology trials using best observed RECIST response as an endpoint by modelling continuous tumour measurements

• Extragradient method with variance reduction for stochastic variational inequalities

• Variance-based stochastic extragradient methods with linear search for stochastic variational inequalities

• Convex optimization in Hilbert space with applications to inverse problems

• Improvements on Spectral Bisection

• Incremental constraint projection methods for monotone stochastic variational inequalities

• Smaller subgraphs of minimum degree k

• Almost periodic solution in distribution for stochastic differential equations with Stepanov almost periodic coefficients

• L$^3$-SVMs: Landmarks-based Linear Local Support Vectors Machines

• Phylogenetic Tools in Astrophysics

• A general 2-part Erd\H os-Ko-Rado theorem

• A Multi-Objective Interpretation of Optimal Transport

• Group Sparsity Residual Constraint for Image Denoising

• The coalescent structure of continuous-time Galton-Watson trees

• Second Screen User Profiling and Multi-level Smart Recommendations in the context of Social TVs

• Multi-stage Neural Networks with Single-sided Classifiers for False Positive Reduction and its Evaluation using Lung X-ray CT Images

• Perturb-and-MPM: Quantifying Segmentation Uncertainty in Dense Multi-Label CRFs

• An Arcsine Law for Markov Random Walks

• Tracing Linguistic Relations in Winning and Losing Sides of Explicit Opposing Groups

• Ergodicity analysis of stochastic biomolecular networks involving synthetic antithetic integral controllers

• Investigating the Characteristics of One-Sided Matching Mechanisms Under Various Preferences and Risk Attitudes

• On the self-convolution of generalized Fibonacci numbers

• Convergence rate of a simulated annealing algorithm with noisy observations

• Convolution Semigroups of Probability Measures on Gelfand Pairs, Revisited

• Design and Analysis of Time-Invariant SC-LDPC codes with Small Constraint Length

• Transition Densities and Traces for Invariant Feller Processes on Compact Symmetric Spaces

• Time-Inhomogeneous Branching Processes Conditioned on Non-Extinction

• Non-existence of two types of partial difference sets

• Matrix product moments in normal variables

• Graph-based Isometry Invariant Representation Learning

• Approximate Computational Approaches for Bayesian Sensor Placement in High Dimensions

• ste-GAN-ography: Generating Steganographic Images via Adversarial Training

• Distant total irregularity strength of graphs via random vertex ordering

• Personal Model Training under Privacy Constraints

• Global stability in a nonlocal reaction-diffusion equation

• A Hypercat-enabled Semantic Internet of Things Data Hub: Technical Report

• Lossy Image Compression with Compressive Autoencoders

• Preserving Differential Privacy Between Features in Distributed Estimation

• Stability and performance analysis of linear positive systems with delays using input-output methods

• A note on asymptotically optimal neighbour sum distinguishing colourings

• Sequence of purchases in credit card data reveal life styles in urban populations

• Detecting Adversarial Samples from Artifacts

• Exploiting Negative Curvature in Deterministic and Stochastic Optimization

• A Polynomial Method Approach to Zero-Sum Subsets in $\mathbb{F}_{p}^{2}$

• The projective ensemble and distribution of points in odd-dimensional spheres

• Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation

• HolStep: A Machine Learning Dataset for Higher-order Logic Theorem Proving

• Repair Strategies for Storage on Mobile Clouds

• Doubly Accelerated Stochastic Variance Reduced Dual Averaging Method for Regularized Empirical Risk Minimization

• OptNet: Differentiable Optimization as a Layer in Neural Networks