Detecting Spacecraft Anomalies Using LSTMs and Nonparametric Dynamic Thresholding

As spacecraft send back increasing amounts of telemetry data, improved anomaly detection systems are needed to lessen the monitoring burden placed on operations engineers and reduce operational risk. Current spacecraft monitoring systems only target a subset of anomaly types and often require costly expert knowledge to develop and maintain due to challenges involving scale and complexity. We demonstrate the effectiveness of Long Short-Term Memory (LSTMs) networks, a type of Recurrent Neural Network (RNN), in overcoming these issues using expert-labeled telemetry anomaly data from the Soil Moisture Active Passive (SMAP) satellite and the Mars Science Laboratory (MSL) rover, Curiosity. We also propose a complementary unsupervised and nonparametric anomaly thresholding approach developed during a pilot implementation of an anomaly detection system for SMAP, and offer false positive mitigation strategies along with other key improvements and lessons learned during development.

When and when not to use optimal model averaging

Traditionally model averaging has been viewed as an alternative to model selection with the ultimate goal to incorporate the uncertainty associated with the model selection process in standard errors and confidence intervals by using a weighted combination of candidate models. In recent years, a new class of model averaging estimators has emerged in the literature, suggesting to combine models such that the squared risk, or other risk functions, are minimized. We argue that, contrary to popular belief, these estimators do not necessarily address the challenges induced by model selection uncertainty, but should be regarded as attractive complements for the machine learning and forecasting literature, as well as tools to identify causal parameters. We illustrate our point by means of several targeted simulation studies.

Neural Relational Inference for Interacting Systems

Interacting systems are prevalent in nature, from dynamical systems in physics to complex societal dynamics. The interplay of components can give rise to complex behavior, which can often be explained using a simple model of the system’s constituent parts. In this work, we introduce the neural relational inference (NRI) model: an unsupervised model that learns to infer interactions while simultaneously learning the dynamics purely from observational data. Our model takes the form of a variational auto-encoder, in which the latent code represents the underlying interaction graph and the reconstruction is based on graph neural networks. In experiments on simulated physical systems, we show that our NRI model can accurately recover ground-truth interactions in an unsupervised manner. We further demonstrate that we can find an interpretable structure and predict complex dynamics in real motion capture and sports tracking data.

Hybrid Decision Making: When Interpretable Models Collaborate With Black-Box Models

Interpretable machine learning models have received increasing interest in recent years, especially in domains where humans are involved in the decision-making process. However, the possible loss of the task performance for gaining interpretability is often inevitable. This performance downgrade puts practitioners in a dilemma of choosing between a top-performing black-box model with no explanations and an interpretable model with unsatisfying task performance. In this work, we propose a novel framework for building a Hybrid Decision Model that integrates an interpretable model with any black-box model to introduce explanations in the decision making process while preserving or possibly improving the predictive accuracy. We propose a novel metric, explainability, to measure the percentage of data that are sent to the interpretable model for decision. We also design a principled objective function that considers predictive accuracy, model interpretability, and data explainability. Under this framework, we develop Collaborative Black-box and RUle Set Hybrid (CoBRUSH) model that combines logic rules and any black-box model into a joint decision model. An input instance is first sent to the rules for decision. If a rule is satisfied, a decision will be directly generated. Otherwise, the black-box model is activated to decide on the instance. To train a hybrid model, we design an efficient search algorithm that exploits theoretically grounded strategies to reduce computation. Experiments show that CoBRUSH models are able to achieve same or better accuracy than their black-box collaborator working alone while gaining explainability. They also have smaller model complexity than interpretable baselines.

Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions

Deep learning models with convolutional and recurrent networks are now ubiquitous and analyze massive amounts of audio, image, video, text and graph data, with applications in automatic translation, speech-to-text, scene understanding, ranking user preferences, ad placement, etc. Competing frameworks for building these networks such as TensorFlow, Chainer, CNTK, Torch/PyTorch, Caffe1/2, MXNet and Theano, explore different tradeoffs between usability and expressiveness, research or production orientation and supported hardware. They operate on a DAG of computational operators, wrapping high-performance libraries such as CUDNN (for NVIDIA GPUs) or NNPACK (for various CPUs), and automate memory allocation, synchronization, distribution. Custom operators are needed where the computation does not fit existing high-performance library calls, usually at a high engineering cost. This is frequently required when new operators are invented by researchers: such operators suffer a severe performance penalty, which limits the pace of innovation. Furthermore, even if there is an existing runtime call these frameworks can use, it often doesn’t offer optimal performance for a user’s particular network architecture and dataset, missing optimizations between operators as well as optimizations that can be done knowing the size and shape of data. Our contributions include (1) a language close to the mathematics of deep learning called Tensor Comprehensions offering both imperative and declarative styles, (2) a polyhedral Just-In-Time compiler to convert a mathematical description of a deep learning DAG into a CUDA kernel with delegated memory management and synchronization, also providing optimizations such as operator fusion and specialization for specific sizes, (3) a compilation cache populated by an autotuner. [Abstract cutoff]

Deep Neural Networks for Bot Detection

The problem of detecting bots, automated social media accounts governed by software but disguising as human users, has strong implications. For example, bots have been used to sway political elections by distorting online discourse, to manipulate the stock market, or to push anti-vaccine conspiracy theories that caused health epidemics. Most techniques proposed to date detect bots at the account level, by processing large amount of social media posts, and leveraging information from network structure, temporal dynamics, sentiment analysis, etc. In this paper, we propose a deep neural network based on contextual long short-term memory (LSTM) architecture that exploits both content and metadata to detect bots at the tweet level: contextual features are extracted from user metadata and fed as auxiliary input to LSTM deep nets processing the tweet text. Another contribution that we make is proposing a technique based on synthetic minority oversampling to generate a large labeled dataset, suitable for deep nets training, from a minimal amount of labeled data (roughly 3,000 examples of sophisticated Twitter bots). We demonstrate that, from just one single tweet, our architecture can achieve high classification accuracy (AUC > 96%) in separating bots from humans. We apply the same architecture to account-level bot detection, achieving nearly perfect classification accuracy (AUC > 99%). Our system outperforms previous state of the art while leveraging a small and interpretable set of features yet requiring minimal training data.

Neural Program Search: Solving Programming Tasks from Description and Examples

We present a Neural Program Search, an algorithm to generate programs from natural language description and a small number of input/output examples. The algorithm combines methods from Deep Learning and Program Synthesis fields by designing rich domain-specific language (DSL) and defining efficient search algorithm guided by a Seq2Tree model on it. To evaluate the quality of the approach we also present a semi-synthetic dataset of descriptions with test examples and corresponding programs. We show that our algorithm significantly outperforms a sequence-to-sequence model with attention baseline.

Tempered Adversarial Networks

Generative adversarial networks (GANs) have been shown to produce realistic samples from high-dimensional distributions, but training them is considered hard. A possible explanation for training instabilities is the inherent imbalance between the networks: While the discriminator is trained directly on both real and fake samples, the generator only has control over the fake samples it produces since the real data distribution is fixed by the choice of a given dataset. We propose a simple modification that gives the generator control over the real samples which leads to a tempered learning process for both generator and discriminator. The real data distribution passes through a lens before being revealed to the discriminator, balancing the generator and discriminator by gradually revealing more detailed features necessary to produce high-quality results. The proposed module automatically adjusts the learning process to the current strength of the networks, yet is generic and easy to add to any GAN variant. In a number of experiments, we show that this can improve quality, stability and/or convergence speed across a range of different GAN architectures (DCGAN, LSGAN, WGAN-GP).

Classification from Pairwise Similarity and Unlabeled Data

One of the biggest bottlenecks in supervised learning is its high labeling cost. To overcome this problem, we propose a new weakly-supervised learning setting called SU classification, where only similar (S) data pairs (two examples belong to the same class) and unlabeled (U) data are needed, instead of fully-supervised data. We show that an unbiased estimator of the classification risk can be obtained only from SU data, and its empirical risk minimizer achieves the optimal parametric convergence rate. Finally, we demonstrate the effectiveness of the proposed method through experiments.

ReinforceWalk: Learning to Walk in Graph with Monte Carlo Tree Search

Learning to walk over a graph towards a target node for a given input query and a source node is an important problem in applications such as knowledge graph reasoning. It can be formulated as a reinforcement learning (RL) problem that has a known state transition model, but with partial observability and sparse reward. To overcome these challenges, we develop a graph walking agent called ReinforceWalk, which consists of a deep recurrent neural network (RNN) and a Monte Carlo Tree Search (MCTS). To address partial observability, the RNN encodes the history of observations and map it into the Q-value, the policy and the state value. In order to effectively train the agent from sparse reward, we combine MCTS with the RNN policy to generate trajectories with more positive rewards. From these trajectories, we update the network in an off-policy manner using Q-learning and improves the RNN policy. Our proposed RL algorithm repeatedly applies this policy improvement step to learn the entire model. At testing stage, the MCTS is also combined with the RNN to predict the target node with higher accuracy. Experiment results on several graph-walking benchmarks show that we are able to learn better policies from less number of rollouts compared to other baseline methods, which are mainly based on policy gradient method.

Neural Tensor Factorization

Neural collaborative filtering (NCF) and recurrent recommender systems (RRN) have been successful in modeling user-item relational data. However, they are also limited in their assumption of static or sequential modeling of relational data as they do not account for evolving users’ preference over time as well as changes in the underlying factors that drive the change in user-item relationship over time. We address these limitations by proposing a Neural Tensor Factorization (NTF) model for predictive tasks on dynamic relational data. The NTF model generalizes conventional tensor factorization from two perspectives: First, it leverages the long short-term memory architecture to characterize the multi-dimensional temporal interactions on relational data. Second, it incorporates the multi-layer perceptron structure for learning the non-linearities between different latent factors. Our extensive experiments demonstrate the significant improvement in rating prediction and link prediction on dynamic relational data by our NTF model over both neural network based factorization models and other traditional methods.

A comparative study of fairness-enhancing interventions in machine learning

Computers are increasingly used to make decisions that have significant impact in people’s lives. Often, these predictions can affect different population subgroups disproportionately. As a result, the issue of fairness has received much recent interest, and a number of fairness-enhanced classifiers and predictors have appeared in the literature. This paper seeks to study the following questions: how do these different techniques fundamentally compare to one another, and what accounts for the differences? Specifically, we seek to bring attention to many under-appreciated aspects of such fairness-enhancing interventions. Concretely, we present the results of an open benchmark we have developed that lets us compare a number of different algorithms under a variety of fairness measures, and a large number of existing datasets. We find that although different algorithms tend to prefer specific formulations of fairness preservations, many of these measures strongly correlate with one another. In addition, we find that fairness-preserving algorithms tend to be sensitive to fluctuations in dataset composition (simulated in our benchmark by varying training-test splits), indicating that fairness interventions might be more brittle than previously thought.

Blockchain and Artificial Intelligence

It is undeniable that artificial intelligence (AI) and blockchain concepts are spreading at a phenomenal rate. Both technologies have distinct degree of technological complexity and multi-dimensional business implications. However, a common misunderstanding about blockchain concept, in particular, is that blockchain is decentralized and is not controlled by anyone. But the underlying development of a blockchain system is still attributed to a cluster of core developers. Take smart contract as an example, it is essentially a collection of codes (or functions) and data (or states) that are programmed and deployed on a blockchain (say, Ethereum) by different human programmers. It is thus, unfortunately, less likely to be free of loopholes and flaws. In this article, through a brief overview about how artificial intelligence could be used to deliver bug-free smart contract so as to achieve the goal of blockchain 2.0, we to emphasize that the blockchain implementation can be assisted or enhanced via various AI techniques. The alliance of AI and blockchain is expected to create numerous possibilities.

Information Scaling Law of Deep Neural Networks

With the rapid development of Deep Neural Networks (DNNs), various network models that show strong computing power and impressive expressive power are proposed. However, there is no comprehensive informational interpretation of DNNs from the perspective of information theory. Due to the nonlinear function and the uncertain number of layers and neural units used in the DNNs, the network structure shows nonlinearity and complexity. With the typical DNNs named Convolutional Arithmetic Circuits (ConvACs), the complex DNNs can be converted into mathematical formula. Thus, we can use rigorous mathematical theory especially the information theory to analyse the complicated DNNs. In this paper, we propose a novel information scaling law scheme that can interpret the network’s inner organization by information theory. First, we show the informational interpretation of the activation function. Secondly, we prove that the information entropy increases when the information is transmitted through the ConvACs. Finally, we propose the information scaling law of ConvACs through making a reasonable assumption.

Learning Robust and Adaptive Real-World Continuous Control Using Simulation and Transfer Learning

We use model-free reinforcement learning, extensive simulation, and transfer learning to develop a continuous control algorithm that has good zero-shot performance in a real physical environment. We train a simulated agent to act optimally across a set of similar environments, each with dynamics drawn from a prior distribution. We propose that the agent is able to adjust its actions almost immediately, based on small set of observations. This robust and adaptive behavior is enabled by using a policy gradient algorithm with an Long Short Term Memory (LSTM) function approximation. Finally, we train an agent to navigate a two-dimensional environment with uncertain dynamics and noisy observations. We demonstrate that this agent has good zero-shot performance in a real physical environment. Our preliminary results indicate that the agent is able to infer the environmental dynamics after only a few timesteps, and adjust its actions accordingly.

First Order Generative Adversarial Networks

GANs excel at learning high dimensional distributions, but they can update generator parameters in directions that do not correspond to the steepest descent direction of the objective. Prominent examples of problematic update directions include those used in both Goodfellow’s original GAN and the WGAN-GP. To formally describe an optimal update direction, we introduce a theoretical framework which allows the derivation of requirements on both the divergence and corresponding method for determining an update direction. These requirements guarantee unbiased mini-batch updates in the direction of steepest descent. We propose a novel divergence which approximates the Wasserstein distance while regularizing the critic’s first order information. Together with an accompanying update direction, this divergence fulfills the requirements for unbiased steepest descent updates. We verify our method, the First Order GAN, with CelebA image generation and set a new state of the art on the One Billion Word language generation task.

Position and Distance: Recommendation beyond Matrix Factorization

For the last two decades, matrix factorization has become one of the fundamental methods for tackling recommendation problems. Recent studies demonstrate that the interaction function dot product used in MF can limit its expressiveness and lead to sub-optimal solutions. In this work, we propose modelling user-item interaction patterns in an alternative way by considering positions and distances of users and items. We assume that users and items can be positioned in a low dimensional space and their explicit closeness can be measured using Euclidean distance metric. In addition, we adopted a weighted strategy to adaptively assign different confidence levels for positive and negative samples, which introduces more flexibility for recommendation with implicit interactions. Comprehensive experiments on multiple real-world datasets demonstrate superior performances of our model over state-of-the-art competing methods including conventional matrix factorization based approaches and recent metric learning based approaches.

Barista – a Graphical Tool for Designing and Training Deep Neural Networks

In recent years, the importance of deep learning has significantly increased in pattern recognition, computer vision, and artificial intelligence research, as well as in industry. However, despite the existence of multiple deep learning frameworks, there is a lack of comprehensible and easy-to-use high-level tools for the design, training, and testing of deep neural networks (DNNs). In this paper, we introduce Barista, an open-source graphical high-level interface for the Caffe deep learning framework. While Caffe is one of the most popular frameworks for training DNNs, editing prototext files in order to specify the net architecture and hyper parameters can become a cumbersome and error-prone task. Instead, Barista offers a fully graphical user interface with a graph-based net topology editor and provides an end-to-end training facility for DNNs, which allows researchers to focus on solving their problems without having to write code, edit text files, or manually parse logged data.

Weyl semimetal to metal phase transitions driven by quasiperiodic potentials
Evaluating Compositionality in Sentence Embeddings
A Fast Proximal Point Method for Wasserstein Distance
Dimension-free PAC-Bayesian bounds for the estimation of the mean of a random vector
Distributed Coordinated Transmission with Forward-Backward Training for 5G Radio Access
Stochastic quasi-Newton with adaptive step lengths for large-scale problems
SLE and spidernets
Detecting weak signals by combining small P-values in observational studies with multiple testing
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation
Regularisers in Semidefinite Programming
Multi-Armed Bandits on Unit Interval Graphs
Posets arising as 1-skeleta of simple polytopes, the nonrevisiting path conjecture, and poset topology
An infinite family of congruences for 1-shell totally symmetric plane partitions
On the Sample Complexity of Learning from a Sequence of Experiments
A Data-Driven Approach to Extract Connectivity Structures from Diffusion Tensor Imaging Data
Chance-constrained optimal location of damping control actuators under wind power variability
Analysis of fluid flow models
A Unified Implicit Dialog Framework for Conversational Search
Junction Tree Variational Autoencoder for Molecular Graph Generation
Learning a Neural-network-based Representation for Open Set Recognition
Bouncy Hybrid Sampler as a Unifying Device
Computational Optimal Transport: Complexity by Accelerated Gradient Descent Is Better Than by Sinkhorn’s Algorithm
Few-Shot Learning with Metric-Agnostic Conditional Embeddings
Randomized Empirical Processes and Confidence Bands via Virtual Resampling
Cut-Pursuit Algorithm for Regularizing Nonsmooth Functionals with Graph Total Variation
Certified Roundoff Error Bounds using Bernstein Expansions and Sparse Krivine-Stengle Representations
The Hopf monoid of Megagreedoids
Image Retargetability
Identifiability of Nonparametric Mixture Models and Bayes Optimal Clustering
Data structures for robust multifrequency imaging
Recurrent Slice Networks for 3D Segmentation on Point Clouds
TVAE: Triplet-Based Variational Autoencoder using Metric Learning
Adversarially Regularized Graph Autoencoder
REAS: Combining Numerical Optimization with SAT Solving
Dynamic Bounds on Stochastic Chemical Kinetic Systems Using Semidefinite Programming
Poincaré type and spectral gap inequalities with fractional Laplacians on Hamming cube
Efficient Exploration through Bayesian Deep Q-Networks
Towards Understanding the Generalization Bias of Two Layer Convolutional Linear Classifiers with Gradient Descent
‘How Was Your Weekend?’ A Generative Model of Phatic Conversation
Deep Leaning Models Delineates Multiple Nuclear Phenotypes in H&E Stained Histology Sections
signSGD: compressed optimisation for non-convex problems
Finite-Control-Set Model Predictive Control (FCS-MPC) for Islanded Hybrid Microgrids
Ruelle-Bowen continuous-time random walk
Texture Classification in Extreme Scale Variations using GANet
On Characterizing the Capacity of Neural Networks using Algebraic Topology
Network Summarization with Preserved Spectral Properties
A High Performance Implementation of Spectral Clustering on CPU-GPU Platforms
Bayesian model assessment: Use of conditional vs marginal likelihoods
Predicting Adversarial Examples with High Confidence
Towards Generic Deobfuscation of Windows API Calls
An Optimized Architecture for Unpaired Image-to-Image Translation
Community detection through likelihood optimization: in search of a sound model
Deep Neural Networks Learn Non-Smooth Functions Effectively
Graph-Based Ascent Algorithms for Function Maximization
Vertex connectivity of Deza graphs with parameters of complements to Seidel graphs
A Simple Proximal Stochastic Gradient Method for Nonsmooth Nonconvex Optimization
Solved and unsolved problems about abelian squares
Some Information Inequalities for Statistical Inference
Unbalanced urn model with random addition
Slice as an Evolutionary Service: Genetic Optimization for Inter-Slice Resource Management in 5G Networks
A Dimension-Independent discriminant between distributions
Hardness Results and Approximation Algorithms for the Minimum Dominating Tree Problem
A Novel Scheme to Improve Lossless Image Coders by Explicit Description of Generative Model Classes
Legendre Tensor Decomposition
Learning Inverse Mappings with Adversarial Criterion
Optimal and Robust Power Allocation for Visible Light Positioning Systems under Illumination Constraints
Equations defining probability tree models
Distribution of metastable states of Ising spin glasses
An adaptive Euler-Maruyama scheme for stochastic differential equations with discontinuous drift and its convergence analysis
Computing controlled invariant sets for hybrid systems with applications to model-predictive control
Homotopy type of the neighborhood complexes of $3$-regular graphs and $4$-regular circulant graphs
Adversarial Examples on Discrete Sequences for Beating Whole-Binary Malware Detection
The algebra of balanced dessins
Helicity-protected domain-wall magnetoresistance in ferromagnetic Weyl semimetal
Tighter Variational Bounds are Not Necessarily Better
Ranking State-of-the-art Papers via Incomplete Tournaments Induced by Citations from Performance Tables
Dynamic Greedy Algorithms for the Edwards-Anderson Model
On the Relative Succinctness of Sentential Decision Diagrams
Robust Deformation Estimation in Wood-Composite Materials using Variational Optical Flow
Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model
Scalable Influence Maximization with General Marketing Strategies
Automatic localization and decoding of honeybee markers using deep convolutional neural networks
Sentence Boundary Detection for French with Subword-Level Information Vectors and Convolutional Neural Networks
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
$CI$-property for decomposable Schur rings over an elementary abelian group
Donaldson-Thomas invariants, torus knots, and lattice paths
The effect of noise intensity on stochastic parabolic equations
Pythagorean Triplets, Integral Apollonians and The Hofstadter Butterfly
Hamiltonicity in randomly perturbed hypergraphs
Rebalancing Dockless Bike Sharing Systems
Embedding spanning bounded degree graphs in randomly perturbed graphs
On the nonexistence of linear perfect Lee codes
Network Features Based Co-hyponymy Detection
First-order queries on classes of structures with bounded expansion
Ramanujan-type formulae for $1/π$: $q$-analogues
Fast Global Convergence via Landscape of Empirical Loss
Fast Rates for Online Gradient Descent Without Strong Convexity via Hoffman’s Bound
Trade-offs Between Weak-Noise Estimation Performance and Outage Exponents in Nonlinear Modulation
Numerical modelling of a peripheral arterial stenosis using dimensionally reduced models and machine learning techniques
A probabilistic framework for multi-view feature learning with many-to-many associations via neural networks
Hierarchical Overlap Graph
Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks by Backdooring
Lattice Functions for the Analysis of Analgo-to-Digital Conversion
Modelling of Facial Aging and Kinship: A Survey
Single-Perspective Warps in Natural Image Stitching
Towards faster isomorphism tests for bounded-degree graphs
Weakly supervised collective feature learning from curated media
Optimal sequential contests
Delta-Ramp Encoder: Amplitude Sampling meets Time Encoding
Attention based Sentence Extraction from Scientific Articles using Pseudo-Labeled data
Variable Selection and Task Grouping for Multi-Task Learning
Training and Inference with Integers in Deep Neural Networks
Examining the Tip of the Iceberg: A Data Set for Idiom Translation
Unsupervised Evaluation and Weighted Aggregation of Ranked Predictions
BIRNet: Brain Image Registration Using Dual-Supervised Fully Convolutional Networks
A proof of the Bunkbed conjecture on the complete graph for $p\geqslant1/2$
Elastic Provisioning of Cloud Caches: a Cost-aware TTL Approach
Learning to Search with MCTSnets
On Double Smoothed Volatility Estimation of Potentially Nonstationary Jump-Diffusion Model
Communication Efficient, Sample Optimal, Linear Time Locally Private Discrete Distribution Estimation
Universality for bounded degree spanning trees in randomly perturbed graphs
Lengths of Words Accepted by Nondeterministic Finite Automata
Attention-based Deep Multiple Instance Learning
Online Variance Reduction for Stochastic Optimization
Broadband MIMO Couplers Characterization and Comparison
The Birthday Problem and Zero-Error List Codes
Predict and Constrain: Modeling Cardinality in Deep Structured Prediction
Joint Demosaicing and Denoising with Perceptual Optimization on a Generative Adversarial Network
Achieving the Age-Energy Tradeoff with a Finite-Battery Energy Harvesting Source
Superposition-Assisted Stochastic Optimization for Hawkes Processes
Substation Signal Matching with a Bagged Token Classifier
Semantic Scene Completion Combining Colour and Depth: preliminary experiments
Joint 3D Reconstruction of a Static Scene and Moving Objects
Deep Learning for Decoding of Linear Codes – A Syndrome-Based Approach
Quantifying Uncertainty in Discrete-Continuous and Skewed Data with Bayesian Deep Learning
A Short Survey on Sense-Annotated Corpora for Diverse Languages and Resources
Viscosity Solutions of Systems of PDEs with Interconnected Obstacles and Switching Problem without Monotonicity Condition
Exploring patterns of demand in bike sharing systems via replicated point process models
Deep Predictive Coding Network for Object Recognition
Progressive Reinforcement Learning with Distillation for Multi-Skilled Motion Control
Edge Caching in Delay-Constrained Virtualized Cellular Networks: Analysis and Market
Asset Price Volatility and Price Extrema
Power Control via Stackelberg Game for Small-Cell Networks
The Quotient of Normal Random Variables And Application to Asset Price Fat Tails
DataBright: Towards a Global Exchange for Decentralized Data Ownership and Trusted Computation
An Improved Bayesian Framework for Quadrature of Constrained Integrands
MONK — Outlier-Robust Mean Embedding Estimation by Median-of-Means
Sparse Matrix Multiplication with Bandwidth Restricted All-to-All Communication
Stochastic Variance-Reduced Hamilton Monte Carlo Methods
Stochastic Variance-Reduced Cubic Regularized Newton Method