Adversarial Network Coding

A combinatorial framework for adversarial network coding is presented. Channels are described by specifying the possible actions that one or more (possibly coordinated) adversaries may take. Upper bounds on three notions of capacity (the one-shot capacity, the zero-error capacity, and the compound zero-error capacity) are obtained for point-to-point channels, and generalized to corresponding capacity regions appropriate for multi-source networks. A key result of this paper is a general method by which bounds on these capacities in point-to-point channels may be ported to networks. This technique is illustrated in detail for Hamming-type channels with multiple adversaries operating on specific coordinates, which correspond, in the context of networks, to multiple adversaries acting on specific network edges. Capacity-achieving coding schemes are described for some of the considered adversarial models.

An Efficient Probabilistic Approach for Graph Similarity Search

Graph similarity search is a common and fundamental operation in graph databases. One of the most popular graph similarity measures is the Graph Edit Distance (GED), mainly because of its broad applicability and high interpretability. Despite its prevalence, exact GED computation is known to be NP-hard, which can make it impractically slow on large graphs. However, exact search results are often unnecessary in real-world applications, especially when responsiveness matters far more than accuracy. Thus, in this paper, we propose a novel probabilistic approach to efficiently estimate GED, which is further leveraged for graph similarity search. Specifically, we first take branches as elementary structures in graphs, and introduce a novel graph similarity measure by comparing branches between graphs, i.e., Graph Branch Distance (GBD), which can be calculated in polynomial time. Then, we formulate the relationship between GED and GBD by treating branch variations as the result of graph edit operations, and model this process probabilistically. By applying our model, the GED between any two graphs can be efficiently estimated from their GBD, and these estimates are finally used in the graph similarity search. Extensive experiments show that our approach has better accuracy, efficiency and scalability than other comparable methods in graph similarity search over real and synthetic data sets.
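
The abstract does not spell out how branches are compared, so the following Python sketch is only one plausible reading. It assumes that a branch is a vertex label together with the multiset of labels on its incident edges, and that GBD is the larger vertex count minus the size of the multiset intersection of the two branch collections. The function names and the graph encoding are hypothetical, chosen for illustration.

```python
from collections import Counter


def branches(vertex_labels, edges):
    """Return the multiset of branches of an undirected labeled graph.

    vertex_labels: dict mapping vertex -> label
    edges: iterable of (u, v, edge_label)
    Assumption: a branch is (vertex label, sorted incident edge labels).
    """
    incident = {v: [] for v in vertex_labels}
    for u, v, label in edges:
        incident[u].append(label)
        incident[v].append(label)
    return Counter(
        (vertex_labels[v], tuple(sorted(incident[v]))) for v in vertex_labels
    )


def graph_branch_distance(g1, g2):
    """Assumed GBD: max(|V1|, |V2|) minus the number of shared branches."""
    b1 = branches(*g1)
    b2 = branches(*g2)
    shared = sum((b1 & b2).values())  # multiset intersection size
    return max(len(g1[0]), len(g2[0])) - shared


# Two labeled paths that differ only in the label of the last vertex.
path_a = ({1: "A", 2: "B", 3: "C"}, [(1, 2, "x"), (2, 3, "y")])
path_b = ({1: "A", 2: "B", 3: "D"}, [(1, 2, "x"), (2, 3, "y")])
print(graph_branch_distance(path_a, path_b))
```

Building the branch multisets takes time linear in the number of vertices and edges, apart from sorting the edge labels at each vertex, which is consistent with the polynomial-time claim above. The probabilistic model that maps GBD to a GED estimate is not sketched here.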

Bayesian Conditional Generative Adverserial Networks

Traditional GANs use a deterministic generator function (typically a neural network) to transform a random noise input z to a sample \mathbf{x} that the discriminator seeks to distinguish. We propose a new GAN called Bayesian Conditional Generative Adversarial Networks (BC-GANs) that use a random generator function to transform a deterministic input y' to a sample \mathbf{x}. Our BC-GANs extend traditional GANs to a Bayesian framework, and naturally handle unsupervised learning, supervised learning, and semi-supervised learning problems. Experiments show that the proposed BC-GANs outperform the state of the art.

Rotation Invariance Neural Network

Rotation invariance and translation invariance are of great value in image recognition tasks. In this paper, we introduce a new convolutional neural network (CNN) architecture component, the cyclic convolutional layer, to achieve rotation invariance in 2-D symbol recognition. The network can also recover the position and orientation of a 2-D symbol, which allows detection of multiple non-overlapping targets. Last but not least, this architecture can achieve one-shot learning in some cases by exploiting these invariances.

Rgtsvm: Support Vector Machines on a GPU in R

Rgtsvm provides a fast and flexible support vector machine (SVM) implementation for the R language. The distinguishing feature of Rgtsvm is that support vector classification and support vector regression tasks are implemented on a graphics processing unit (GPU), allowing the library to scale to millions of examples with >100-fold improvement in performance over existing implementations. Nevertheless, Rgtsvm retains feature parity and has an interface that is compatible with the popular e1071 SVM package in R. Altogether, Rgtsvm enables large SVM models to be created by both experienced and novice practitioners.

Neural Phrase-based Machine Translation

In this paper, we propose Neural Phrase-based Machine Translation (NPMT). Our method explicitly models the phrase structures in output sequences through Sleep-WAke Networks (SWAN), a recently proposed segmentation-based sequence modeling method. To alleviate the monotonic alignment requirement of SWAN, we introduce a new layer to perform (soft) local reordering of input sequences. Our experiments show that NPMT achieves state-of-the-art results on the IWSLT 2014 German-English translation task without using any attention mechanisms. We also observe that our method produces meaningful phrases in the output language.

Adaptive Bayesian Power Spectrum Analysis of Multivariate Nonstationary Time Series

This article introduces a nonparametric approach to multivariate time-varying power spectrum analysis. The procedure adaptively partitions a time series into an unknown number of approximately stationary segments, where some spectral components may remain unchanged across segments, allowing components to evolve differently over time. Local spectra within segments are fit through Whittle likelihood based penalized spline models of modified Cholesky components, which provide flexible nonparametric estimates that preserve positive definite structures of spectral matrices. The approach is formulated in a Bayesian framework, in which the number and location of partitions are random, and relies on reversible jump Markov chain and Hamiltonian Monte Carlo methods that can adapt to the unknown number of segments and parameters. By averaging over the distribution of partitions, the approach can approximate both abrupt and slow-varying changes in spectral matrices. Empirical performance is evaluated in simulation studies and illustrated through analyses of electroencephalography during sleep and of the El Niño-Southern Oscillation.

Knowledge Transfer for Out-of-Knowledge-Base Entities: A Graph Neural Network Approach

Knowledge base completion (KBC) aims to predict missing information in a knowledge base. In this paper, we address the out-of-knowledge-base (OOKB) entity problem in KBC: how to answer queries concerning test entities not observed at training time. Existing embedding-based KBC models assume that all test entities are available at training time, making it unclear how to obtain embeddings for new entities without costly retraining. To solve the OOKB entity problem without retraining, we use graph neural networks (Graph-NNs) to compute the embeddings of OOKB entities, exploiting the limited auxiliary knowledge provided at test time. The experimental results show the effectiveness of our proposed model in the OOKB setting. Additionally, in the standard KBC setting in which OOKB entities are not involved, our model achieves state-of-the-art performance on the WordNet dataset. The code and dataset are available at https://…/GNN-for-OOKB. This paper has been accepted by IJCAI17.

Gradient Diversity Empowers Distributed Learning

It has been experimentally observed that distributed implementations of mini-batch stochastic gradient descent (SGD) algorithms exhibit speedup saturation and decaying generalization ability beyond a particular batch-size. In this work, we present an analysis hinting that high similarity between concurrently processed gradients may be a cause of this performance degradation. We introduce the notion of gradient diversity that measures the dissimilarity between concurrent gradient updates, and show its key role in the performance of mini-batch SGD. We prove that on problems with high gradient diversity, mini-batch SGD is amenable to better speedups, while maintaining the generalization performance of serial (one sample) SGD. We further establish lower bounds on convergence where mini-batch SGD slows down beyond a particular batch-size, solely due to the lack of gradient diversity. We provide experimental evidence indicating the key role of gradient diversity in distributed learning, and discuss how heuristics like dropout, Langevin dynamics, and quantization can improve it.
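
The abstract describes gradient diversity only as a measure of dissimilarity between concurrent gradient updates. As a hedged illustration, the Python sketch below assumes one natural ratio for it: the sum of squared gradient norms divided by the squared norm of the summed gradient. Both the formula and the function name are assumptions made for this example, not a statement of the paper's exact definition.

```python
def gradient_diversity(gradients):
    """Assumed measure: sum_i ||g_i||^2 / ||sum_i g_i||^2.

    gradients: list of equal-length lists of floats, one per example.
    Returns float('inf') when the gradients cancel out exactly.
    """
    sum_of_squared_norms = sum(sum(x * x for x in g) for g in gradients)
    summed = [sum(column) for column in zip(*gradients)]
    squared_norm_of_sum = sum(x * x for x in summed)
    if squared_norm_of_sum == 0.0:
        return float("inf")
    return sum_of_squared_norms / squared_norm_of_sum


# Identical gradients give the lowest value for a batch of two ...
print(gradient_diversity([[1.0, 0.0], [1.0, 0.0]]))
# ... while orthogonal gradients score higher.
print(gradient_diversity([[1.0, 0.0], [0.0, 1.0]]))
```

Under this assumed ratio, n identical gradients score 1/n and n mutually orthogonal gradients score 1, which matches the intuition that a batch of near-duplicate gradients adds little beyond a single sample.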

Fourier-Based Testing for Families of Distributions

We study the general problem of testing whether an unknown discrete distribution belongs to a given family of distributions. More specifically, given a class of distributions \mathcal{P} and sample access to an unknown distribution \mathbf{P}, we want to distinguish (with high probability) between the case that \mathbf{P} \in \mathcal{P} and the case that \mathbf{P} is \epsilon-far, in total variation distance, from every distribution in \mathcal{P}. This is the prototypical hypothesis testing problem that has received significant attention in statistics and, more recently, in theoretical computer science. The sample complexity of this general problem depends on the underlying family \mathcal{P}. We are interested in designing sample-optimal and computationally efficient algorithms for this task. The main contribution of this work is a new and simple testing technique that is applicable to distribution families whose Fourier spectrum approximately satisfies a certain sparsity property. As the main applications of our Fourier-based testing technique, we obtain the first non-trivial testers for two fundamental families of discrete distributions: Sums of Independent Integer Random Variables (SIIRVs) and Poisson Multinomial Distributions (PMDs). Our testers for these families are nearly sample-optimal and computationally efficient. We also obtain a tester with improved sample complexity for discrete log-concave distributions. To the best of our knowledge, ours is the first use of the Fourier transform in the context of distribution testing.

Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning

This paper introduces Dex, a reinforcement learning environment toolkit specialized for training and evaluation of continual learning methods as well as general reinforcement learning problems. We also present the novel continual learning method of incremental learning, where a challenging environment is solved using optimal weight initialization learned from first solving a similar easier environment. We show that incremental learning can produce vastly better results than standard methods by providing a strong baseline method across ten Dex environments. We finally develop a saliency method for qualitative analysis of reinforcement learning, which shows the impact incremental learning has on network attention.

SVCCA: Singular Vector Canonical Correlation Analysis for Deep Understanding and Improvement

With the continuing empirical successes of deep networks, it becomes increasingly important to develop better methods for understanding training of models and the representations learned within. In this paper we propose Singular Vector Canonical Correlation Analysis (SVCCA), a tool for quickly comparing two representations in a way that is both invariant to affine transformations (allowing comparison between different layers and networks) and fast to compute (allowing more comparisons to be calculated than with previous methods). We deploy this tool to measure the intrinsic dimensionality of layers, showing in some cases needless over-parameterization; to probe learning dynamics throughout training, finding that networks converge to final representations from the bottom up; to show where class-specific information in networks is formed; and to suggest new training regimes that simultaneously save computation and overfit less.

Modified Frank-Wolfe Algorithm for Enhanced Sparsity in Support Vector Machine Classifiers

This work proposes a new algorithm for training a re-weighted L2 Support Vector Machine (SVM), inspired by the re-weighted Lasso algorithm of Candès et al. and by the equivalence between Lasso and SVM shown recently by Jaggi. In particular, the margin required for each training vector is set independently, defining a new weighted SVM model. These weights are selected to be binary, and they are automatically adapted during the training of the model, resulting in a variation of the Frank-Wolfe optimization algorithm with essentially the same computational complexity as the original algorithm. As shown experimentally, this algorithm is computationally cheaper to apply since it requires fewer iterations to converge, and it produces models that have a sparser representation in terms of support vectors and are more stable with respect to the selection of the regularization hyper-parameter.
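
For readers unfamiliar with the base method, the sketch below shows the standard Frank-Wolfe iteration over the probability simplex in Python. It is not the modified, re-weighted algorithm of the abstract; the function name, the step-size rule 2/(k+2), and the toy quadratic objective are all assumptions chosen for illustration.

```python
def frank_wolfe_simplex(gradient, dim, iterations=2000):
    """Standard Frank-Wolfe on the probability simplex (illustrative only).

    gradient: function mapping a point (list of floats) to its gradient.
    """
    x = [1.0 / dim] * dim  # start at the barycenter of the simplex
    for k in range(iterations):
        g = gradient(x)
        # Linear minimization over the simplex is attained at a vertex:
        # the coordinate with the smallest gradient entry.
        best = min(range(dim), key=lambda j: g[j])
        step = 2.0 / (k + 2.0)  # classical diminishing step size
        # Convex combination of the current point and the chosen vertex.
        x = [(1.0 - step) * xj for xj in x]
        x[best] += step
    return x


# Toy objective: squared distance to a target that lies in the simplex.
target = [0.2, 0.5, 0.3]
solution = frank_wolfe_simplex(
    lambda x: [2.0 * (xi - ti) for xi, ti in zip(x, target)], dim=3
)
print(solution)
```

Each iteration moves toward a single vertex, so the iterate is always a convex combination of the vertices visited so far. In the SVM setting of the abstract, this is the property that links Frank-Wolfe iterations to the number of support vectors.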

A framework for Multi-A(rmed)/B(andit) testing with online FDR control
Self-dual quasiperiodic systems with power-law hopping
A Closer Look at Memorization in Deep Networks
Synaptic mechanisms of interference in working memory
Economies-of-scale in resource sharing systems: tutorial and partial review of the QED heavy-traffic regime
Walrasian Economic Equilibrium Problems in Convex Regions
A Conceptual Model for Holistic Classification of Insider
Piecewise Constant Martingales and Lazy Clocks
Block-Matching Optical Flow for Dynamic Vision Sensor- Algorithm and FPGA Implementation
Epidemiology of Objectively Measured Bedtime and Chronotype in the US adolescents and adults: NHANES 2003-2006
Weighted counting of non-negative integer points in a subspace
Modeling Biological Problems in Computer Science: A Case Study in Genome Assembly
Centralized Multi-Node Repair Regenerating Codes
Improving Distributed Gradient Descent Using Reed-Solomon Codes
Control Variates for Stochastic Gradient MCMC
Improved Convergence Rates for Distributed Resource Allocation
Probabilistic Jamming/Eavesdropping Attacks to Confuse a Buffer-Aided Transmitter-Receiver Pair
Character Values of Stanley Sequences
A Stochastic Model for Solar Photo-Voltaic Power for Short-Term Probabilistic Forecast
Variational Inference Methods for Tweedie Compound Poisson Models
Advancements in Continuum Approximation Models for Logistics and Transportation Systems: 1996 – 2016
Strongly regular Cayley graphs from partitions of subdifference sets of the Singer difference sets
Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text
State observation and sensor selection for nonlinear networks
Dynamic scaling in the 2D Ising spin glass with Gaussian couplings
A Data Envelopment Analysis (DEA)-Based Model for Power Interruption Cost Estimation for Industrial Companies
A central limit theorem for the gossip process
Parametric Inference for Discretely Observed Subordinate Diffusions
Random recursive trees and preferential attachment trees are random split trees
Partial Realization Theory and System Identification Redux
On the Linear Extension Complexity of Stable Set Polytopes for Perfect Graphs
Joint Mixability of Elliptical Distributions and Related Families
Phase field approach to optimal packing problems and related Cheeger clusters
Variants of RMSProp and Adagrad with Logarithmic Regret Bounds
Statistical foundations for assessing the difference between the classical and weighted-Gini betas
Energy Efficient Scheduling for Loss Tolerant IoT Applications with Uninformed Transmitter
Evaluating the quality of tourist agendas customized to different travel styles
Adiabatic Quantum Computing for Binary Clustering
Subgeometric Rates of Convergence for Discrete Time Markov Chains under Discrete Time Subordination
Intersecting families, cross-intersecting families, and a proof of a conjecture of Feghali, Johnson and Thomas
Distributionally Robust Chance-Constrained Voltage-Concerned DC-OPF with Wasserstein Metric
On small $n$-uniform hypergraphs with positive discrepancy
A Large-Scale CNN Ensemble for Medication Safety Analysis
The fractional $k$-metric dimension of graphs
Performance Bounds for Finite Moving Average Change Detection: Application to Global Navigation Satellite Systems
Attitude and angular velocity tracking for a rigid body using geometric methods on the two-sphere
Coresets for Vector Summarization with Applications to Network Graphs
Adaptivity is exponentially powerful for testing monotonicity of halfspaces
Fatiguing STDP: Learning from Spike-Timing Codes in the Presence of Rate Codes
An invariance principle for the stochastic heat equation
The Probability of Causation
On the statistical inconsistency of Maximum Parsimony for $k$-tuple-site data
Information Structure Design in Team Decision Problems
The Z-polynomial of a matroid
Inferential results for a new measure of inequality
Resource Optimization and Power Allocation in In-band Full Duplex (IBFD)-Enabled Non-Orthogonal Multiple Access Networks
Accelerating Innovation Through Analogy Mining
Rethinking Atrous Convolution for Semantic Image Segmentation
Snarks with special spanning trees
On the Optimization Landscape of Tensor Decompositions
Sample, computation vs storage tradeoffs for classification using tensor subspace models
Rare-Event Simulation for Distribution Networks
Optimal Hölder Continuity and Dimension Properties for SLE with Minkowski Content Parametrization
Secure and Private Cloud Storage Systems with Random Linear Fountain Codes
Kernel Two-Sample Hypothesis Testing Using Kernel Set Classification
Phase transition in a random soliton cellular automaton
Buildings-to-Grid Integration Framework
An improved kernel for the cycle contraction problem
Joint resource allocation in SWIPT-based multi-antenna decode-and-forward relay networks
The impact of Entropy and Solution Density on selected SAT heuristics
Invariant Measures for Path-Dependent Random Diffusions
$H$-free subgraphs of dense graphs maximizing the number of cliques and their blow-ups
Entropy, neutro-entropy and anti-entropy for neutrosophic information
Balanced words in higher dimensions
Learning Sparse Potential Games in Polynomial Time and Sample Complexity
Lexical representation explains cortical entrainment during speech perception
Ramanujan-type congruences for 2-color partition triples
Recognizing hyperelliptic graphs in polynomial time
Rate of convergence of the Nesterov accelerated gradient method in the subcritical case $\alpha \leq 3$
A large-scale analysis of racial disparities in police stops across the United States
Mirror descent in non-convex stochastic programming
Sparse Neural Networks Topologies
On affine variety codes from the Klein quartic
Diffusivity of a walk on fracture loops of a discrete torus
Dimensionality Reduction using Similarity-induced Embeddings
What is the $p$ for some specific underdetermined matrices such that $l_p$-minimization is equivalent to $l_0$-minimization
A new method for recognising Suzuki groups
SuperMinHash – A New Minwise Hashing Algorithm for Jaccard Similarity Estimation
Modeling credit default swap premiums with stochastic recovery rate
A Polynomial Time Algorithm for Spatio-Temporal Security Games
Limiting measure and stationarity of solutions to stochastic evolution equations with Volterra noise
Bayesian Analysis of Censored Spatial Data Based on a Non-Gaussian Model
Towards the Improvement of Automated Scientific Document Categorization by Deep Learning
Tversky loss function for image segmentation using 3D fully convolutional deep networks
Detecting Large Concept Extensions for Conceptual Analysis
Using Deep Networks for Drone Detection
On transitive designs and strongly regular graphs constructed from Mathieu group $M_{11}$
Multirate Packet Delivery In Heterogeneous Broadcast Networks
Addressing Item-Cold Start Problem in Recommendation Systems using Model Based Approach and Deep Learning
Data set operations to hide decision tree rules
Quantifying the Benefits of Infrastructure Sharing
Fixed-Rank Approximation of a Positive-Semidefinite Matrix from Streaming Data
Beyond Worst-case: A Probabilistic Analysis of Affine Policies in Dynamic Optimization
Coupled 3D Convolutional Neural Networks for Audio-Visual Recognition
The ideal of maximal flags of a poset
Learning Hierarchical Information Flow with Recurrent Neural Modules
Statistical Inference based on Bridge Divergences
Convergence to a Continuous State Branching Process with jumps and Height Process
The Effect of Interference in Vehicular Communications on Safety Factors
Families of Distributed Memory Parallel Graph Algorithms from Self-Stabilizing Kernels-An SSSP Case Study
Approximate Generalized Matching: $f$-Factors and $f$-Edge Covers
Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks
An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation
Induced subdivisions and bounded expansion
On the arithmetic of graphs
Optimal Status Update for Age of Information Minimization with an Energy Harvesting Source
Kapre: On-GPU Audio Preprocessing Layers for a Quick Implementation of Deep Neural Network Models with Keras
Exploring Content-based Artwork Recommendation with Metadata and Visual Features
Strong limit theorems for weighted sums of negatively associated random variables in nonlinear probability
An Entropy-based Pruning Method for CNN Compression
Simplex QP-based methods for minimizing a conic quadratic objective over polyhedra
AGC, t-designs and partition sets
Componentwise different tail solutions for bivariate stochastic recurrence equations — with application to GARCH(1,1) processes —
An a Priori Exponential Tail Bound for k-Folds Cross-Validation
Maximizing the Link Throughput of Spectrum Sharing IoT-based Systems through Retransmissions
Inactivation Decoding of LT and Raptor Codes: Analysis and Code Design
How Hard is it to Find (Honest) Witnesses?
Capacity Releasing Diffusion for Speed and Locality
The geometry of the generalized algebraic Riccati equation and of the singular Hamiltonian system
Code Constructions based on Reed-Solomon Codes
Smoothing technique for nonsmooth composite minimization with linear operator
On Optimal Group Claims at Voting in a Stochastic Environment
Conditional Lower Bounds for Space/Time Tradeoffs
User-driven mobile robot storyboarding: Learning image interest and saliency from pairwise image comparisons
A Non-Convex Relaxation for Fixed-Rank Approximation
Secure Broadcasting Using Independent Secret Keys
Charged particle tracking without magnetic field: optimal measurement of track momentum by a Bayesian analysis of the multiple measurements of deflections due to multiple scattering
Deep learning with spatiotemporal consistency for nerve segmentation in ultrasound images
Weighted likelihood estimation of multivariate location and scatter
Gaussian Intersymbol Interference Channels With Mismatch
Channels with Cooperation Links that May Be Absent
Computing the channel capacity of a communication system affected by uncertain transition probabilities
Signal Machine And Cellular Automaton Time-Optimal Quasi-Solutions Of The Firing Squad/Mob Synchronisation Problem On Connected Graphs
Time Complexity of Constraint Satisfaction via Universal Algebra
Pedestrian Prediction by Planning using Deep Neural Networks
Combinatorial Properties and Recognition of Unit Square Visibility Graphs
Entanglement across extended random defects in the XX spin chain
Asymptotic Expansion of Warlimont Functions on Wright Semigroups
Detection of Block-Exchangeable Structure in High-Dimensional Correlation Matrices
Kernelization of Constraint Satisfaction Problems: A Study through Universal Algebra
Bayesian Joint Modelling for Object Localisation in Weakly Labelled Images
Deep Counterfactual Networks with Propensity-Dropout
Popular differences and generalized Sidon sets
Beta-Beta Bounds: Finite-Blocklength Analog of the Golden Formula
Stochastic Heat Equations with Values in a Riemannian Manifold
Through the Looking Glass: Heckits, LATE, and Numerical Equivalence
Mixture-based Modeling of Correlated Interference in a Poisson Field of Interferers
Leveraging web resources for keyword assignment to short text documents
Numerically Stable Variants of the Communication-hiding Pipelined Conjugate Gradients Algorithm for the Parallel Solution of Large Scale Symmetric Linear Systems
Visual Decoding of Targets During Visual Search From Human Eye Fixations
Massive Semantics to empower Touristic Service Providers
Density symmetries for a class of 2-D diffusions with applications to finance
Combining Information from Multiple Forecasters: General Inefficiency of the Means
Next-order asymptotic expansion for N-marginal optimal transport with Coulomb and Riesz costs
Optimising the topological information of the $A_\infty$-persistence groups
The $\mathcal{E}$-Average Common Submatrix: Approximate Searching in a Restricted Neighborhood
The surprising secret identity of the semidefinite relaxation of K-means: manifold learning
Evaluating 35 Methods to Generate Structural Connectomes Using Pairwise Classification
A note on the Moment of Complex Wiener-Ito Integrals
An Algorithm for Network and Data-aware Placement of Multi-Tier Applications in Cloud Data Centers
Distance-regular graphs without 4-claws
Cylindrical Martingale Problems Associated with Lévy Generators
Iterative algorithms for a non-linear inverse problem in atmospheric lidar
Learning to Schedule Deadline- and Operator-Sensitive Tasks
Rigorous Dynamics and Consistent Estimation in Arbitrarily Conditioned Linear Systems
Consistent feature attribution for tree ensembles
Ergodicity and symmetry breaking in disordered spin chains with on-site non-Abelian symmetry
On Quadratic Convergence of DC Proximal Newton Algorithm for Nonconvex Sparse Learning in High Dimensions
Endoscopic Depth Measurement and Super-Spectral-Resolution Imaging
Towards Deep Learning Models Resistant to Adversarial Attacks
Solving Integer Linear Programs with a Small Number of Global Variables and Constraints
On the accuracy of ancestral sequence reconstruction for ultrametric trees with parsimony
An exponential lower bound for cut sparsifiers in planar graphs
Bayesian multi–dipole localization and uncertainty quantification from simultaneous EEG and MEG recordings