Learning non-Gaussian Time Series using the Box-Cox Gaussian Process

Gaussian processes (GPs) are Bayesian nonparametric generative models that provide interpretability of hyperparameters, admit closed-form expressions for training and inference, and are able to accurately represent uncertainty. To model general non-Gaussian data with complex correlation structure, GPs can be paired with an expressive covariance kernel and then fed into a nonlinear transformation (or warping). However, overparametrising the kernel and the warping is known to, respectively, hinder gradient-based training and make the predictions computationally expensive. We remedy this issue by (i) training the model using derivative-free global-optimisation techniques so as to find meaningful maxima of the model likelihood, and (ii) proposing a warping function based on the celebrated Box-Cox transformation that requires minimal numerical approximations—unlike existing warped GP models. We validate the proposed approach by first showing that predictions can be computed analytically, and then on a learning, reconstruction and forecasting experiment using real-world datasets.

The Three Pillars of Machine-Based Programming

In this position paper, we describe our vision of the future of machine-based programming through a categorical examination of three pillars of research. Those pillars are: (i) intention, (ii) invention, and(iii) adaptation. Intention emphasizes advancements in the human-to-computer and computer-to-machine-learning interfaces. Invention emphasizes the creation or refinement of algorithms or core hardware and software building blocks through machine learning (ML). Adaptation emphasizes advances in the use of ML-based constructs to autonomously evolve software.

Enslaving the Algorithm: From a ‘Right to an Explanation’ to a ‘Right to Better Decisions’?

As concerns about unfairness and discrimination in ‘black box’ machine learning systems rise, a legal ‘right to an explanation’ has emerged as a compellingly attractive approach for challenge and redress. We outline recent debates on the limited provisions in European data protection law, and introduce and analyze newer explanation rights in French administrative law and the draft modernized Council of Europe Convention 108. While individual rights can be useful, in privacy law they have historically unreasonably burdened the average data subject. ‘Meaningful information’ about algorithmic logics is more technically possible than commonly thought, but this exacerbates a new ‘transparency fallacy’—an illusion of remedy rather than anything substantively helpful. While rights-based approaches deserve a firm place in the toolbox, other forms of governance, such as impact assessments, ‘soft law,’ judicial review, and model repositories deserve more attention, alongside catalyzing agencies acting for users to control algorithmic system design.

Local Binary Pattern Networks

Memory and computation efficient deep learning architec- tures are crucial to continued proliferation of machine learning capabili- ties to new platforms and systems. Binarization of operations in convo- lutional neural networks has shown promising results in reducing model size and computing efficiency. In this paper, we tackle the problem us- ing a strategy different from the existing literature by proposing local binary pattern networks or LBPNet, that is able to learn and perform binary operations in an end-to-end fashion. LBPNet1 uses local binary comparisons and random projection in place of conventional convolu- tion (or approximation of convolution) operations. These operations can be implemented efficiently on different platforms including direct hard- ware implementation. We applied LBPNet and its variants on standard benchmarks. The results are promising across benchmarks while provid- ing an important means to improve memory and speed efficiency that is particularly suited for small footprint devices and hardware accelerators.

DYAN: A Dynamical Atoms Network for Video Prediction

The ability to anticipate the future is essential when making real time critical decisions, provides valuable information to understand dynamic natural scenes, and can help unsupervised video representation learning. State-of-art video prediction is based on LSTM recursive networks and/or generative adversarial network learning. These are complex architectures that need to learn large numbers of parameters, are potentially hard to train, slow to run, and may produce blurry predictions. In this paper, we introduce DYAN, a novel network with very few parameters and easy to train, which produces accurate, high quality frame predictions, significantly faster than previous approaches. DYAN owes its good qualities to its encoder and decoder, which are designed following concepts from systems identification theory and exploit the dynamics-based invariants of the data. Extensive experiments using several standard video datasets show that DYAN is superior generating frames and that it generalizes well across domains.

Closing the AI Knowledge Gap

AI researchers employ not only the scientific method, but also methodology from mathematics and engineering. However, the use of the scientific method – specifically hypothesis testing – in AI is typically conducted in service of engineering objectives. Growing interest in topics such as fairness and algorithmic bias show that engineering-focused questions only comprise a subset of the important questions about AI systems. This results in the AI Knowledge Gap: the number of unique AI systems grows faster than the number of studies that characterize these systems’ behavior. To close this gap, we argue that the study of AI could benefit from the greater inclusion of researchers who are well positioned to formulate and test hypotheses about the behavior of AI systems. We examine the barriers preventing social and behavioral scientists from conducting such studies. Our diagnosis suggests that accelerating the scientific study of AI systems requires new incentives for academia and industry, mediated by new tools and institutions. To address these needs, we propose a two-sided marketplace called TuringBox. On one side, AI contributors upload existing and novel algorithms to be studied scientifically by others. On the other side, AI examiners develop and post machine intelligence tasks designed to evaluate and characterize algorithmic behavior. We discuss this market’s potential to democratize the scientific study of AI behavior, and thus narrow the AI Knowledge Gap.

GaAN: Gated Attention Networks for Learning on Large and Spatiotemporal Graphs

We propose a new network architecture, Gated Attention Networks (GaAN), for learning on graphs. Unlike the traditional multi-head attention mechanism, which equally consumes all attention heads, GaAN uses a convolutional sub-network to control each attention head’s importance. We demonstrate the effectiveness of GaAN on the inductive node classification problem. Moreover, with GaAN as a building block, we construct the Graph Gated Recurrent Unit (GGRU) to address the traffic speed forecasting problem. Extensive experiments on three real-world datasets show that our GaAN framework achieves state-of-the-art results on both tasks.

Natural Gradient Deep Q-learning

This paper presents findings for training a Q-learning reinforcement learning agent using natural gradient techniques. We compare the original deep Q-network (DQN) algorithm to its natural gradient counterpart (NGDQN), measuring NGDQN and DQN performance on classic controls environments without target networks. We find that NGDQN performs favorably relative to DQN, converging to significantly better policies faster and more frequently. These results indicate that natural gradient could be used for value function optimization in reinforcement learning to accelerate and stabilize training.

Data Distillery: Effective Dimension Estimation via Penalized Probabilistic PCA

The paper tackles the unsupervised estimation of the effective dimension of a sample of dependent random vectors. The proposed method uses the principal components (PC) decomposition of sample covariance to establish a low-rank approximation that helps uncover the hidden structure. The number of PCs to be included in the decomposition is determined via a Probabilistic Principal Components Analysis (PPCA) embedded in a penalized profile likelihood criterion. The choice of penalty parameter is guided by a data-driven procedure that is justified via analytical derivations and extensive finite sample simulations. Application of the proposed penalized PPCA is illustrated with three gene expression datasets in which the number of cancer subtypes is estimated from all expression measurements. The analyses point towards hidden structures in the data, e.g. additional subgroups, that could be of scientific interest.

Meta Reinforcement Learning with Latent Variable Gaussian Processes

Data efficiency, i.e., learning from small data sets, is critical in many practical applications where data collection is time consuming or expensive, e.g., robotics, animal experiments or drug design. Meta learning is one way to increase the data efficiency of learning algorithms by generalizing learned concepts from a set of training tasks to unseen, but related, tasks. Often, this relationship between tasks is hard coded or relies in some other way on human expertise. In this paper, we propose to automatically learn the relationship between tasks using a latent variable model. Our approach finds a variational posterior over tasks and averages over all plausible (according to this posterior) tasks when making predictions. We apply this framework within a model-based reinforcement learning setting for learning dynamics models and controllers of many related tasks. We apply our framework in a model-based reinforcement learning setting, and show that our model effectively generalizes to novel tasks, and that it reduces the average interaction time needed to solve tasks by up to 60% compared to strong baselines.

The Leave-one-out Approach for Matrix Completion: Primal and Dual Analysis

In this paper, we introduce a powerful technique, Leave-One-Out, to the analysis of low-rank matrix completion problems. Using this technique, we develop a general approach for obtaining fine-grained, entry-wise bounds on iterative stochastic procedures. We demonstrate the power of this approach in analyzing two of the most important algorithms for matrix completion: the non-convex approach based on Singular Value Projection (SVP), and the convex relaxation approach based on nuclear norm minimization (NNM). In particular, we prove for the first time that the original form of SVP, without re-sampling or sample splitting, converges linearly in the infinity norm. We further apply our leave-one-out approach to an iterative procedure that arises in the analysis of the dual solutions of NNM. Our results show that NNM recovers the true d -by-d rank-r matrix with \mathcal{O}(\mu^2 r^3d \log d ) observed entries, which has optimal dependence on the dimension and is independent of the condition number of the matrix. To the best of our knowledge, this is the first sample complexity result for a tractable matrix completion algorithm that satisfies these two properties simultaneously.

$\tilde{O}(n^{1/3})$-Space Algorithm for the Grid Graph Reachability Problem
VGAN-Based Image Representation Learning for Privacy-Preserving Facial Expression Recognition
Divisors on matroids and their volumes
Computational performance of a projection and rescaling algorithm
Zero-Shot Detection
Slipknotting in Random Diagrams
Continuous Time Multi-stage Stochastic Reserve and Unit Commitment
Learning to Generate Wikipedia Summaries for Underserved Languages from Wikidata
Fundamentals of Wireless Information and Power Transfer: From RF Energy Harvester Models to Signal and System Designs
Impulsive Control for G-AIMD Dynamics with Relaxed and Hard Constraints
Automated Curriculum Learning by Rewarding Temporally Rare Events
Dynamic Natural Language Processing with Recurrence Quantification Analysis
English-Catalan Neural Machine Translation in the Biomedical Domain through the cascade approach
Visual Psychophysics for Making Face Recognition Algorithms More Explainable
Communication reduction in distributed optimization via estimation of the proximal operator
Supercongruences for polynomial analogs of the Apéry numbers
Exploring the predictability of range-based volatility estimators using RNNs
Lines in metric spaces: universal lines counted with multiplicity
Adversarial Generalized Method of Moments
Blaming humans in autonomous vehicle accidents: Shared responsibility across levels of automation
Beyond Homophily: Incorporating Actor Variables in Actor-oriented Network Models
Solving Quadratic Programs to High Precision using Scaled Iterative Refinement
Attention-based Temporal Weighted Convolutional Neural Network for Action Recognition
Probabilistic Occupancy Function and Sets Using Forward Stochastic Reachability for Rigid-Body Dynamic Obstacles
Partially ordering the class of invertible trees
Adaptive Smoothing V-Spline for Trajectory Reconstruction
Unveiling the invisible – mathematical methods for restoring and interpreting illuminated manuscripts
A Minimalist Approach to Type-Agnostic Detection of Quadrics in Point Clouds
Diagnostic Classification Of Lung Nodules Using 3D Neural Networks
Adaptive Polar Active Contour for Segmentation and Tracking in Ultrasound Videos
Eleven Simple Algorithms to Compute Fibonacci Numbers
Training Recurrent Neural Networks as a Constraint Satisfaction Problem
Why not be Versatile? Applications of the SGNMT Decoder for Machine Translation
Real-time Burst Photo Selection Using a Light-Head Adversarial Network
A Temporally-Aware Interpolation Network for Video Frame Inpainting
Monte Carlo Information Geometry: The dually flat case
Learning the Hierarchical Parts of Objects by Deep Non-Smooth Nonnegative Matrix Factorization
Hierarchical Metric Learning and Matching for 2D and 3D Geometric Correspondences
SlideNet: Fast and Accurate Slide Quality Assessment Based on Deep Neural Networks
Energy-Efficient Joint Offloading and Wireless Resource Allocation Strategy in Multi-MEC Server Systems
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines
Sparse Reduced Rank Regression With Nonconvex Regularization
Split graphs: combinatorial species and asymptotics
3D Point Cloud Denoising using Graph Laplacian Regularization of a Low Dimensional Manifold Model
Transferring Rich Deep Features for Facial Beauty Prediction
Learning Dynamic Memory Networks for Object Tracking
Optimal Control and Stabilization Problem for Discrete-time Markov Jump Systems with Indefinite Weight Costs
eSCAPE: a Large-scale Synthetic Corpus for Automatic Post-Editing
Fair Deep Learning Prediction for Healthcare Applications with Confounder Filtering
Text Detection and Recognition in images: A survey
Offset Hypersurfaces and Persistent Homology of Algebraic Varieties
Face Recognition Techniques: A Survey
Flex-Convolution (Deep Learning Beyond Grid-Worlds)
A New State-Space Representation of Lyapunov Stability for Coupled PDEs and Scalable Stability Analysis in the SOS Framework
Unsupervised Cross-dataset Person Re-identification by Transfer Learning of Spatial-Temporal Patterns
Expressivity in TTS from Semantics and Pragmatics
Polarization and Index Modulations: a Theoretical and Practical Perspective
Risk and parameter convergence of logistic regression
Segmentation of histological images and fibrosis identification with a convolutional neural network
Cluster-based Wireless Energy Transfer for Low Complex Energy Receivers
Capacity Analysis of Index Modulations over Spatial, Polarization and Frequency Dimensions
Information content of coevolutionary game landscapes
The CTTC 5G end-to-end experimental platform: Integrating heterogeneous wireless/optical networks, distributed cloud, and IoT devices
Dual Polarized Modulation and Reception for Next Generation Mobile Satellite Communications
Rapid Prototyping of Standard-Compliant Visible Light Communications System
Link Adaptation Algorithms for Dual Polarization Mobile Satellite Systems
Advanced Signal Processing Techniques for Fixed and Mobile Satellite Communications
NOMA Assisted Joint Broadcast and Multicast Transmission in 5G Networks
Pushing for higher rates and efficiency in Satcom: the different perspectives within SatNExIV
End-to-end 5G services via an SDN/NFV-based multi-tenant network and cloud testbed
Zero-sum stochastic differential games of generalized McKean-Vlasov type *
Dual Polarized Modulation and Receivers for Mobile Communications in Urban Areas
Statistical evaluation of the azimuth and elevation angles seen at the output of the receiving antenna
Forward Link Interference Mitigation in Mobile Interactive Satellite Systems
An SDR Implementation of a Visible Light Communication System Based on the IEEE 802.15.7 Standard
Prototyping with SDR: a quick way to play with next-gen communications systems
Efficient Robust Model Predictive Control using Chordality
Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement Learning
Frank-Wolfe with Subsampling Oracle
Progressive Structure from Motion
Discrete Potts Model for Generating Superpixels on Noisy Images
Self-Controlled Jamming Resilient Design Using Physical Layer Secret Keys
Adaptive Co-weighting Deep Convolutional Features For Object Retrieval
Optimal Symbolic Controllers Determinization for BDD storage
Fastest Rates for Stochastic Mirror Descent Methods
Sub-exponential Upper Bound for #XSAT of some CNF Classes
Effective filtering analysis for non-Gaussian dynamic systems
On Low-Resolution ADCs in Practical 5G Millimeter-Wave Massive MIMO Systems
Are you eligible? Predicting adulthood from face images via class specific mean autoencoder
Residual Codean Autoencoder for Facial Attribute Analysis
Max-Min Fairness User Scheduling and Power Allocation in Full-Duplex OFDMA Systems
Asynchronous opinion dynamics on the $k$-nearest-neighbors graph
Decomposability of graphs into subgraphs fulfilling the 1-2-3 Conjecture
Fractal analysis of the large-scale stellar mass distribution in the Sloan Digital Sky Survey
Patch-Based Image Inpainting with Generative Adversarial Networks
A Distance Oriented Kalman Filter Particle Swarm Optimizer Applied to Multi-Modality Image Registration
Ocean Eddy Identification and Tracking using Neural Networks
Ontology-Based Reasoning about the Trustworthiness of Cyber-Physical Systems
Reflected Advanced Backward Stochastic Differential Equations with Default
MLtuner: System Support for Automatic Machine Learning Tuning
Total Equitable List Coloring
Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation By Use Of Convolutional Neural Networks
On the Alon-Tarsi Number and Chromatic-choosability of Cartesian Products of Graphs
Divisibility problems for function fields
Speech-Driven Facial Reenactment Using Conditional Generative Adversarial Networks
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Equiangular tight frames from group divisible designs
MAGSAC: marginalizing sample consensus
An Improved Evaluation Framework for Generative Adversarial Networks
AC/DC: In-Database Learning Thunderstruck
Collective Schedules: Scheduling Meets Computational Social Choice
Actor and Action Video Segmentation from a Sentence
FastDeRain: A Novel Video Rain Streak Removal Method Using Directional Gradient Priors
Linearizing Visual Processes with Convolutional Variational Autoencoders
Mobile Social Services with Network Externality: From Separate Pricing to Bundled Pricing
Discrete Cubical and Path Homologies of Graphs
On a problem of Bermond and Bollobás
Non-Asymptotic Classical Data Compression with Quantum Side Information
Fusion of stereo and still monocular depth estimates in a self-supervised learning context
The Crossing Number of Seq-Shellable Drawings of Complete Graphs
Explanation Methods in Deep Learning: Users, Values, Concerns and Challenges
DeepGauge: Comprehensive and Multi-Granularity Testing Criteria for Gauging the Robustness of Deep Learning Systems
Studies on Generalized Yule Models
Broadcasting on Bounded Degree DAGs
Stacked Neural Networks for end-to-end ciliary motion analysis
An interaction index for multichoice games
C3PO: Database and Benchmark for Early-stage Malicious Activity Detection in 3D Printing
Learning Category-Specific Mesh Reconstruction from Image Collections