Geometry Score: A Method For Comparing Generative Adversarial Networks

One of the biggest challenges in the research of generative adversarial networks (GANs) is assessing the quality of generated samples and detecting various levels of mode collapse. In this work, we construct a novel measure of performance of a GAN by comparing geometrical properties of the underlying data manifold and the generated one, which provides both qualitative and quantitative means for evaluation. Our algorithm can be applied to datasets of an arbitrary nature and is not limited to visual data. We test the obtained metric on various real-life models and datasets and demonstrate that our method provides new insights into properties of GANs.


Learning Inductive Biases with Simple Neural Networks

People use rich prior knowledge about the world in order to efficiently learn new concepts. These priors – also known as ‘inductive biases’ – pertain to the space of internal models considered by a learner, and they help the learner make inferences that go beyond the observed data. A recent study found that deep neural networks optimized for object recognition develop the shape bias (Ritter et al., 2017), an inductive bias possessed by children that plays an important role in early word learning. However, these networks use unrealistically large quantities of training data, and the conditions required for these biases to develop are not well understood. Moreover, it is unclear how the learning dynamics of these networks relate to developmental processes in childhood. We investigate the development and influence of the shape bias in neural networks using controlled datasets of abstract patterns and synthetic images, allowing us to systematically vary the quantity and form of the experience provided to the learning algorithms. We find that simple neural networks develop a shape bias after seeing as few as 3 examples of 4 object categories. The development of these biases predicts the onset of vocabulary acceleration in our networks, consistent with the developmental process in children.


Transductive Adversarial Networks (TAN)

Transductive Adversarial Networks (TAN) is a novel domain-adaptation machine learning framework that is designed for learning a conditional probability distribution on unlabelled input data in a target domain, while also only having access to: (1) easily obtained labelled data from a related source domain, which may have a different conditional probability distribution than the target domain, and (2) a marginalised prior distribution on the labels for the target domain. TAN leverages a fully adversarial training procedure and a unique generator/encoder architecture which approximates the transductive combination of the available source- and target-domain data. A benefit of TAN is that it allows the distance between the source- and target-domain label-vector marginal probability distributions to be greater than 0 (i.e. different tasks across the source and target domains) whereas other domain-adaptation algorithms require this distance to equal 0 (i.e. a single task across the source and target domains). TAN can, however, still handle the latter case and is a more generalised approach to this case. Another benefit of TAN is that due to being a fully adversarial algorithm, it has the potential to accurately approximate highly complex distributions. Theoretical analysis demonstrates the viability of the TAN framework.


A Complexity Theory for Labeling Schemes

In a labeling scheme the vertices of a given graph from a particular class are assigned short labels such that adjacency can be algorithmically determined from these labels. A representation of a graph from that class is given by the set of its vertex labels. Due to the shortness constraint on the labels such schemes provide space-efficient representations for various graph classes, such as planar or interval graphs. We consider what graph classes cannot be represented by labeling schemes when the algorithm which determines adjacency is subjected to computational constraints.


Online Learning: A Comprehensive Survey

Online learning represents an important family of machine learning algorithms, in which a learner attempts to resolve an online prediction (or any type of decision-making) task by learning a model/hypothesis from a sequence of data instances one at a time. The goal of online learning is to ensure that the online learner would make a sequence of accurate predictions (or correct decisions) given the knowledge of correct answers to previous prediction or learning tasks and possibly additional information. This is in contrast to many traditional batch learning or offline machine learning algorithms that are often designed to train a model in batch from a given collection of training data instances. This survey aims to provide a comprehensive survey of the online machine learning literatures through a systematic review of basic ideas and key principles and a proper categorization of different algorithms and techniques. Generally speaking, according to the learning type and the forms of feedback information, the existing online learning works can be classified into three major categories: (i) supervised online learning where full feedback information is always available, (ii) online learning with limited feedback, and (iii) unsupervised online learning where there is no feedback available. Due to space limitation, the survey will be mainly focused on the first category, but also briefly cover some basics of the other two categories. Finally, we also discuss some open issues and attempt to shed light on potential future research directions in this field.


SQL Query Completion for Data Exploration

Within the big data tsunami, relational databases and SQL are still there and remain mandatory in most of cases for accessing data. On the one hand, SQL is easy-to-use by non specialists and allows to identify pertinent initial data at the very beginning of the data exploration process. On the other hand, it is not always so easy to formulate SQL queries: nowadays, it is more and more frequent to have several databases available for one application domain, some of them with hundreds of tables and/or attributes. Identifying the pertinent conditions to select the desired data, or even identifying relevant attributes is far from trivial. To make it easier to write SQL queries, we propose the notion of SQL query completion: given a query, it suggests additional conditions to be added to its WHERE clause. This completion is semantic, as it relies on the data from the database, unlike current completion tools that are mostly syntactic. Since the process can be repeated over and over again — until the data analyst reaches her data of interest –, SQL query completion facilitates the exploration of databases. SQL query completion has been implemented in a SQL editor on top of a database management system. For the evaluation, two questions need to be studied: first, does the completion speed up the writing of SQL queries? Second , is the completion easily adopted by users? A thorough experiment has been conducted on a group of 70 computer science students divided in two groups (one with the completion and the other one without) to answer those questions. The results are positive and very promising.


Praaline: Integrating Tools for Speech Corpus Research

This paper presents Praaline, an open-source software system for managing, annotating, analysing and visualising speech corpora. Researchers working with speech corpora are often faced with multiple tools and formats, and they need to work with ever-increasing amounts of data in a collaborative way. Praaline integrates and extends existing time-proven tools for spoken corpora analysis (Praat, Sonic Visualiser and a bridge to the R statistical package) in a modular system, facilitating automation and reuse. Users are exposed to an integrated, user-friendly interface from which to access multiple tools. Corpus metadata and annotations may be stored in a database, locally or remotely, and users can define the metadata and annotation structure. Users may run a customisable cascade of analysis steps, based on plug-ins and scripts, and update the database with the results. The corpus database may be queried, to produce aggregated data-sets. Praaline is extensible using Python or C++ plug-ins, while Praat and R scripts may be executed against the corpus data. A series of visualisations, editors and plug-ins are provided. Praaline is free software, released under the GPL license.


TSViz: Demystification of Deep Learning Models for Time-Series Analysis

This paper presents a novel framework for demystification of convolutional deep learning models for time series analysis. This is a step towards making informed/explainable decisions in the domain of time series, powered by deep learning. There have been numerous efforts to increase the interpretability of image-centric deep neural network models, where the learned features are more intuitive to visualize. Visualization in time-series is much more complicated as there is no direct interpretation of the filters and inputs as compared to image modality. In addition, little or no concentration has been devoted for the development of such tools in the domain of time-series in the past. The visualization engine of the presented framework provides possibilities to explore and analyze a network from different dimensions at four different levels of abstraction. This enables the user to uncover different aspects of the model which includes important filters, filter clusters, and input saliency maps. These representations allow to understand the network features so that the acceptability of deep networks for time-series data can be enhanced. This is extremely important in domains like finance, industry 4.0, self-driving cars, health-care, counter-terrorism etc., where reasons for reaching a particular prediction are equally important as the prediction itself. The framework \footnote{Framework download link: https://hidden.for.blind.review} can also aid in discovery of the filters which are contributing nothing to the final prediction, hence, can be pruned without any significant loss in performance.


Random matrix products: Universality and least singular values

We establish local universality of the k-point correlation functions associated with products of independent iid random matrices, as the sizes of the matrices tend to infinity, under a moment matching hypothesis. We also prove Gaussian limits for the centered linear spectral statistics of products of independent GinUE matrices, which we then extend to more general product matrices by way of another moment matching argument. In a similar fashion, we establish Gaussian limits for the centered linear spectral statistics of products of independent truncated random unitary matrices. Moreover, we are able to obtain explicit expressions for the limiting variances in each of these cases. The key technical lemma required for these results is a lower bound on the least singular value of the translated linearization matrix associated with the product of M normalized independent random matrices with independent and identically distributed subgaussian entries.


Unsupervised Typography Transfer
Generating Triples with Adversarial Networks for Scene Graph Construction
To Phrase or Not to Phrase – Impact of User versus System Term Dependence Upon Retrieval
An Unsupervised Learning Model for Deformable Medical Image Registration
Unsupervised word sense disambiguation in dynamic semantic spaces
Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling
Deep Versus Wide Convolutional Neural Networks for Object Recognition on Neuromorphic System
Telling apart Felidae and Ursidae from the distribution of nucleotides in mitochondrial DNA
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Enhance word representation for out-of-vocabulary on Ubuntu dialogue corpus
Effective Quantization Approaches for Recurrent Neural Networks
Recognition of Acoustic Events Using Masked Conditional Neural Networks
A Diversity-based Substation Cyber Defense Strategy utilizing Coloring Games
Zorua: Enhancing Programming Ease, Portability, and Performance in GPUs by Decoupling Programming Models from Resource Management
OTFS: A New Generation of Modulation Addressing the Challenges of 5G
Interpolating Distributions for Populations in Nested Geographies using Public-use Data with Application to the American Community Survey
Going Deeper in Spiking Neural Networks: VGG and Residual Architectures
Manifold Optimization Over the Set of Doubly Stochastic Matrices: A Second-Order Geometry
Spatially adaptive image compression using a tiled deep network
Tight Lower Bounds for Locally Differentially Private Selection
Minimizing Latency for Secure Coded Computing Using Secret Sharing via Staircase Codes
Gradient conjugate priors and deep neural networks
Probabilistic Non-asymptotic Analysis of Distributed Algorithms
SCK: A sparse coding based key-point detector
Correlation Estimation System Minimization Compared to Least Squares Minimization in Simple Linear Regression
Partisan: Enabling Cloud-Scale Erlang Applications
Fast methods for nonsmooth nonconvex minimization
Negative Binomial Construction of Random Discrete Distributions on the Infinite Simplex
Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
Minor preserving deletable edges in graphs
On Capacity of Non-Coherent Diamond Networks
Fine-Grained Land Use Classification at the City Scale Using Ground-Level Images
PPFNet: Global Context Aware Local Features for Robust 3D Point Matching
Upper bound for a minimal quantifier depth of a Monadic Second-Order formula without asymptotic probability
Efficient collective swimming by harnessing vortices through deep reinforcement learning
Biological Mechanisms for Learning: A Computational Model of Olfactory Learning in the Manduca sexta Moth, with Applications to Neural Nets
A Semi-Supervised Two-Stage Approach to Learning from Noisy Labels
Towards A Systems Approach To Distributed Programming
A diffusion generated method for computing Dirichlet partitions
Gaussian binomial coefficients with negative arguments
Exact Semidefinite Formulations for a Class of (Random and Non-Random) Nonconvex Quadratic Programs
Driver Gaze Zone Estimation using Convolutional Neural Networks: A General Framework and Ablative Analysis
A Bayesian Approach to Multi-State Hidden Markov Models: Application to Dementia Progression
Monotone Operator Theory in Convex Optimization
Improving the Universality and Learnability of Neural Programmer-Interpreters with Combinator Abstraction
More Efficient Estimation for Logistic Regression with Optimal Subsample
Coded Caching with Heterogeneous Cache Sizes and Link Qualities: The Two-User Case
An interview based study of pioneering experiences in teaching and learning Complex Systems in Higher Education
General Strong Polarization
Deep Image Super Resolution via Natural Image Priors
Primal-dual stochastic gradient method for convex programs with many functional constraints
On the Packet Decoding Delay of Linear Network Coded Wireless Broadcast
Representation and Characterization of Non-Stationary Processes by Dilation Operators and Induced Shape Space Manifolds
Topologically Controlled Lossy Compression
The Higher-Order Prover Leo-III
From Hashing to CNNs: Training BinaryWeight Networks via Hashing
Completely Distributed Power Allocation using Deep Neural Network for Device to Device communication Underlaying LTE
Minimizing Latency in Online Ride and Delivery Services
Serve the shortest queue and Walsh Brownian motion
Monopoly pricing with buyer search
Some application of difference equations in Cryptography and Coding Theory
A local parallel communication algorithm for polydisperse rigid body dynamics
External and mutual synchronization of chimeras in a two layer network of nonlinear oscillators
Learning to score and summarize figure skating sport videos
The Multiphoton Boson Sampling Machine Doesn’t Beat Early Classical Computers for Five-boson Sampling
Saliency-Enhanced Robust Visual Tracking
Peekaboo – Where are the Objects? Structure Adjusting Superpixels
Archetypal Analysis for Sparse Representation-based Hyperspectral Sub-pixel Quantification
Data-adaptive doubly robust instrumental variable methods for treatment effect heterogeneity
Bayesian analysis of predictive Non-Homogeneous hidden Markov models using Polya-Gamma data augmentation
Comment on ‘Phase Control of Directed Diffusion in a Symmetric Optical Lattice’
Developing indicators on Open Access by combining evidence from diverse data sources
Convolutions of sets with bounded VC-dimension are uniformly continuous
Neural Network Renormalization Group
Using a reservoir computer to learn chaotic attractors, with applications to chaos synchronisation and cryptography
Detection Games Under Fully Active Adversaries
On the Algebraic and Arithmetic structure of the monoid of Product-one sequences II
mGPfusion: Predicting protein stability changes with Gaussian process kernel learning and data fusion
Multivariate Study of the Star Formation Rate in Galaxies: Bimodality Revisited
On certain combinatorial expansions of descent polynomials and the change of grammars
Relative perturbation bounds with applications to empirical covariance operators
Biomedical term normalization of EHRs with UMLS
Vanishing ideals of binary Hamming spheres
Multivariate subordination of stable processes
Online Decomposition of Compressive Streaming Data Using $n$-$\ell_1$ Cluster-Weighted Minimization
Adaptive online scheduling of tasks with anytime property on heterogeneous resources
State Compression of Markov Processes via Empirical Low-Rank Estimation
A Deep Unsupervised Learning Approach Toward MTBI Identification Using Diffusion MRI
DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech
Lower Bounds for the Fair Resource Allocation Problem
Solving Linear Programs with Complementarity Constraints using Branch-and-Cut
Parametric inference for multidimensional hypoelliptic diffusion with full observations
Rotate your Networks: Better Weight Consolidation and Less Catastrophic Forgetting
Incentive Mechanisms for Motivating Mobile Data Offloading in Heterogeneous Networks: A Salary-Plus-Bonus Approach
Singular values of large non-central random matrices
Learning Sparse Wavelet Representations
A New Kalman Filter Model for Nonlinear Systems Based on Ellipsoidal Bounding
Practical Issues of Action-conditioned Next Image Prediction
Erratum for Ricci-flat graphs with girth at least five
Ricci-flat cubic graphs with girth five
A Generalization Method of Partitioned Activation Function for Complex Number
Stochastic subgradient method converges at the rate $O(k^{-1/4})$ on weakly convex functions
Existence of two-step replica symmetry breaking for the spherical mixed p-spin glass at zero temperature
Texture Segmentation Based Video Compression Using Convolutional Neural Networks
The Random Fractional Matching Problem
Hadwiger numbers of self-complementary graphs
Statistical Learnability of Generalized Additive Models based on Total Variation Regularization
Learning and Querying Fast Generative Models for Reinforcement Learning
Algorithmic Bidding for Virtual Trading in Electricity Markets

Advertisements