Large Margin Few-Shot Learning

The key issue of few-shot learning is learning to generalize. In this paper, we propose a large margin principle to improve the generalization capacity of metric-based methods for few-shot learning. To realize it, we develop a unified framework to learn a more discriminative metric space by augmenting the softmax classification loss function with a large margin distance loss function for training. Extensive experiments on two state-of-the-art few-shot learning models, graph neural networks and prototypical networks, show that our method can improve the performance of existing models substantially with very little computational overhead, demonstrating the effectiveness of the large margin principle and the potential of our method.
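
As a rough illustration of augmenting a softmax classification loss with a large margin distance loss, the sketch below adds a triplet-style hinge on squared Euclidean distances to a cross-entropy term. The triplet form, the weight `lam`, the `margin` value, and all function names are assumptions made for this example; the abstract does not specify them.

```python
import math

def softmax_cross_entropy(logits, label):
    """Cross-entropy of one sample, computed from raw class scores."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def sq_dist(a, b):
    """Squared Euclidean distance between two embedding vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Hinge loss: the same-class pair should be closer than the
    different-class pair by at least `margin` (assumed form)."""
    return max(0.0, margin + sq_dist(anchor, positive) - sq_dist(anchor, negative))

def combined_loss(logits, label, anchor, positive, negative, lam=0.5, margin=1.0):
    """Classification loss plus a weighted large margin distance term."""
    return (softmax_cross_entropy(logits, label)
            + lam * triplet_margin_loss(anchor, positive, negative, margin))
```

When the negative sample is already farther away than the positive by more than the margin, the distance term is zero and only the classification loss remains.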


Moderated Network Models

Pairwise network models such as the Gaussian Graphical Model (GGM) are a powerful and intuitive way to analyze dependencies in multivariate data. A key assumption of the GGM is that each pairwise interaction is independent of the values of all other variables. However, in psychological research this is often implausible. In this paper, we extend the GGM by allowing each pairwise interaction between two variables to be moderated by (a subset of) all other variables in the model, and thereby introduce a Moderated Network Model (MNM). We show how to construct the MNM and propose an L1-regularized nodewise regression approach to estimate it. We provide performance results in a simulation study and show that MNMs outperform the split-sample based methods Network Comparison Test (NCT) and Fused Graphical Lasso (FGL) in detecting moderation effects. Finally, we provide a fully reproducible tutorial on how to estimate MNMs with the R-package mgm and discuss possible issues with model misspecification.
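
To make the nodewise regression idea concrete, the sketch below builds the predictor set for one target variable: the other variables as main effects, plus their pairwise products as moderation terms. This is only an assumed reading of the abstract; the function name and the term labels are hypothetical, and the L1-regularized fit itself (which the mgm package performs) is not shown.

```python
from itertools import combinations

def moderated_design(row, target):
    """Return (names, values) of predictors for regressing variable `target`
    on all other variables and on their pairwise products.
    `row` is one observation given as a list of numbers."""
    others = [i for i in range(len(row)) if i != target]
    names, values = [], []
    # Main effects: ordinary pairwise interactions with the target.
    for i in others:
        names.append(f"x{i}")
        values.append(row[i])
    # Product terms: the effect of x_i on the target is moderated by x_j.
    for i, j in combinations(others, 2):
        names.append(f"x{i}*x{j}")
        values.append(row[i] * row[j])
    return names, values
```

Repeating this for every target variable and fitting each regression with an L1 penalty would give one set of main-effect and moderation coefficients per node.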


A Neural Network Lattice Decoding Algorithm

Neural network decoding algorithms were recently introduced by Nachmani et al. to decode high-density parity-check (HDPC) codes. In contrast with iterative decoding algorithms such as the sum-product or min-sum algorithms, in which the weight of each edge is set to 1, in the neural network decoding algorithms the weight of every edge depends on its impact on the transmitted codeword. In this paper, we provide a novel feed-forward neural network lattice decoding algorithm suitable for decoding lattices constructed based on Construction A, whose underlying codes have HDPC matrices. We first establish the concept of a feed-forward neural network for HDPC codes and improve their decoding algorithms compared to Nachmani et al. We then apply our proposed decoder to a Construction A lattice with an HDPC underlying code, for which the well-known iterative decoding algorithms perform poorly. The main advantage of our proposed algorithm is that instead of assigning and training weights for all edges, which turns out to be time-consuming especially for high-density parity-check matrices, we concentrate on the edges that are present in most of the 4-cycles and whose removal gives a girth-6 Tanner graph. This approach, by slight modifications using updated LLRs instead of initial ones, simultaneously accelerates the training process and improves the error performance of our proposed decoding algorithm.
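
The notion of edges that are present in many 4-cycles can be illustrated with a small counting sketch on a binary parity-check matrix. It only counts 4-cycles in the Tanner graph; how the paper selects edges or trains their weights is not stated in the abstract, and the function names here are hypothetical.

```python
def row_overlap(H, r1, r2):
    """Number of columns in which rows r1 and r2 of H both contain a 1."""
    return sum(1 for a, b in zip(H[r1], H[r2]) if a and b)

def count_4cycles(H):
    """Total number of 4-cycles in the Tanner graph of H.
    Each pair of rows sharing k columns contributes k*(k-1)/2 cycles."""
    total = 0
    for r1 in range(len(H)):
        for r2 in range(r1 + 1, len(H)):
            k = row_overlap(H, r1, r2)
            total += k * (k - 1) // 2
    return total

def edge_4cycle_counts(H):
    """Map each edge (row, col) with H[row][col] == 1 to the number of
    4-cycles that pass through it."""
    counts = {}
    for r, row in enumerate(H):
        for c, bit in enumerate(row):
            if not bit:
                continue
            n = 0
            for r2 in range(len(H)):
                if r2 != r and H[r2][c]:
                    # Column c is already shared, so the remaining shared
                    # columns each close one 4-cycle through (r, c).
                    n += row_overlap(H, r, r2) - 1
            counts[(r, c)] = n
    return counts
```

Edges with the highest counts are natural candidates to focus on; once no pair of rows shares more than one column, the graph has no 4-cycles left.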


Domain2Vec: Deep Domain Generalization

We address the problem of domain generalization, where a decision function is learned from the data of several related domains and the goal is to apply it to an unseen domain successfully. It is assumed that there is plenty of labeled data available in the source domains (also called training domains), but no labeled data is available for the unseen domain (also called a target domain or test domain). We propose a novel neural network architecture, Domain2Vec (D2V), that learns domain-specific embeddings and then uses these embeddings to generalize the learning across related domains. The proposed algorithm, D2V, extends the ideas of distribution regression and kernelized domain generalization to the neural network setting. The architecture learns a domain-specific embedding and then uses this embedding along with the features of each data point to label it. We show the effectiveness of the architecture by accurately estimating domain-to-domain similarity. We evaluate our algorithm on standard domain generalization datasets for image classification and outperform other state-of-the-art algorithms.


Learning to Index for Nearest Neighbor Search

In this study, we present a novel ranking model based on learning the nearest neighbor relationships embedded in the index space. Given a query point, a conventional nearest neighbor search approach calculates the distances to the cluster centroids, before ranking the clusters from near to far based on the distances. The data indexed in the top-ranked clusters are retrieved and treated as the nearest neighbor candidates for the query. However, the quantization loss between the data and cluster centroids will inevitably harm the search accuracy. To address this problem, the proposed model ranks clusters based on their nearest neighbor probabilities rather than the query-centroid distances. The nearest neighbor probabilities are estimated by employing neural networks to characterize the neighborhood relationships as a nonlinear function, i.e., the density distribution of nearest neighbors with respect to the query. The proposed probability-based ranking model can replace the conventional distance-based ranking model as a coarse filter for candidate clusters, and the nearest neighbor probability can be used to determine the data quantity to be retrieved from the candidate cluster. Our experimental results demonstrated that implementing the proposed ranking model for two state-of-the-art nearest neighbor quantization and search methods could boost the search performance effectively on billion-scale datasets.
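
The difference between the two ranking schemes can be sketched as follows. The per-cluster probabilities are taken as given (in the paper they come from a neural network, which is not reproduced here), and the proportional budget split in `allocate_budget` is an assumption made for illustration, as are all function names.

```python
def rank_by_distance(query, centroids):
    """Conventional coarse filter: order clusters by squared distance
    from the query to each centroid, nearest first."""
    dists = [sum((q - c) ** 2 for q, c in zip(query, cen)) for cen in centroids]
    return sorted(range(len(centroids)), key=lambda i: dists[i])

def rank_by_probability(probs):
    """Probability-based filter: order clusters by the estimated chance
    that they contain the query's nearest neighbors, highest first."""
    return sorted(range(len(probs)), key=lambda i: -probs[i])

def allocate_budget(probs, top_k, budget):
    """Split a retrieval budget over the top_k clusters in proportion to
    their probabilities (assumed allocation rule)."""
    chosen = rank_by_probability(probs)[:top_k]
    total = sum(probs[i] for i in chosen)
    return {i: round(budget * probs[i] / total) for i in chosen}
```

With probabilities `[0.2, 0.5, 0.3]`, the second cluster is visited first even if its centroid is not the closest one to the query.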


Open Markov chains: cumulant dynamics, fluctuations and correlations
Exact tail asymptotics for fluid models driven by an $M/M/c$ queue
Social network aided plagiarism detection
Singular Value Statistics for the Spiked Elliptic Ginibre Ensemble
Polytope volume by descent in the face lattice and applications in social choice
Limit theorems for a class of critical superprocesses with stable branching
Hierarchical Stochastic Graphlet Embedding for Graph-based Pattern Recognition
Auto-Context R-CNN
On the discrete Fuglede and Pompeiu problems
Replicated Siamese LSTM in Ticketing System for Similarity Learning and Retrieval in Asymmetric Texts
Semi-parametric Image Inpainting
Resilient Output Synchronization of Heterogeneous Multi-agent Systems under Cyber-Physical Attacks
Learning The Sequential Temporal Information with Recurrent Neural Networks
Resource Allocation Based on Deep Neural Networks for Cognitive Radio Networks
Multi-kernel unmixing and super-resolution using the Modified Matrix Pencil method
$P$-Partition Generating Function Equivalence of Naturally Labeled Posets
QDDS: A Novel Quantum Swarm Algorithm Inspired by a Double Dirac Delta Potential
Separability is not the best goal for machine learning
Bounds and Constructions for Multi-Symbol Duplication Error Correcting Codes
Machine Learning in High Energy Physics Community White Paper
On Convergence of Heuristics Based on Douglas-Rachford Splitting and ADMM to Minimize Convex Functions over Nonconvex Sets
Reasoning about exceptions in ontologies: from the lexicographic closure to the skeptical closure
A lower bound on the queueing delay in resource constrained load balancing
Stochastic Block Model for Hypergraphs: Statistical limits and a semidefinite programming approach
Exact Combinatorial Inference for Brain Images
Auto Deep Compression by Reinforcement Learning Based Actor-Critic Structure
Automatic Classification of Defective Photovoltaic Module Cells in Electroluminescence Images
Optimising Parameters in Recurrence Quantification Analysis of Smart Energy Systems
Odderon and substructures of protons from a model-independent Levy imaging of elastic proton-proton and proton-antiproton collisions
Bounds for Different Spreads of Line and Total Graphs
A primal-dual interior-point method capable of rapidly detecting infeasibility for nonlinear programs
Quantifying model form uncertainty in Reynolds-averaged turbulence models with Bayesian deep neural networks
Predicting Concreteness and Imageability of Words Within and Across Languages via Word Embeddings
Generic torus orbit closures in Schubert varieties
Vulnerability Analysis of Chest X-Ray Image Classification Against Adversarial Attacks
Partial Policy-based Reinforcement Learning for Anatomical Landmark Localization in 3D Medical Images
A Combined CNN and LSTM Model for Arabic Sentiment Analysis
Attention to Refine through Multi-Scales for Semantic Segmentation
Active Secure Coding Based on Eavesdropper Behavior Learning
Generating objects going well with the surroundings
Zero-shot Domain Adaptation without Domain Semantic Descriptors
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector
Computing the statistical significance of optimized communities in networks
Optimal Trajectory-Planning of UAVs via B-Splines and Disjunctive Programming
PARN: Pyramidal Affine Regression Networks for Dense Semantic Correspondence Estimation
Multi-Scale Coarse-to-Fine Segmentation for Screening Pancreatic Ductal Adenocarcinoma
Human Activity Recognition in RGB-D Videos by Dynamic Images
Flow Network Tracking for Spatiotemporal and Periodic Point Matching: Applied to Cardiac Motion Analysis
Interpretable Machine Learning Study of Many-Body Localization Transition in Disordered Quantum Ising Spin Chains
Scaling-Up Reasoning and Advanced Analytics on BigData
A primal-dual interior-point relaxation method for nonlinear programs
Jointly learning relevant subgraph patterns and nonlinear models of their indicators
Universal Word Segmentation: Implementation and Interpretation
Polarimetric Convolutional Network for PolSAR Image Classification
On the Dimension of Unimodular Discrete Spaces, Part I: Definitions and Basic Properties
Time-time covariance for last passage percolation with generic initial profile
On the number of semi-magic squares of order 6
Fair Task Allocation in Crowdsourced Delivery
Spatio-temporal variations in the urban rhythm: the travelling waves of crime
Finite-State Approximations to Discounted and Average Cost Constrained Markov Decision Processes
Decreasing the size of the Restricted Boltzmann machine
Learning Functions in Large Networks requires Modularity and produces Multi-Agent Dynamics
Towards Enhancing Lexical Resource and Using Sense-annotations of OntoSenseNet for Sentiment Analysis
Action graphs, planar rooted forests, and self-convolutions of the Catalan numbers
A Sequence-to-Sequence Model for Semantic Role Labeling
Domain Recurrence and Probabilistic Analysis of Residence Time of Stochastic Systems and Domain Aiming Control
Constructing a Word Similarity Graph from Vector based Word Representation for Named Entity Recognition
External Patch-Based Image Restoration Using Importance Sampling
Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes
Pioneer Networks: Progressively Growing Generative Autoencoder
Image Restoration Using Conditional Random Fields and Scale Mixtures of Gaussians
Topological Prismatoids and Small Non-Hirsch Spheres
Random band matrices
Analysis of Statistical Properties of Nonlinear Feedforward Generators Over Finite Fields
Transaction costs and institutional change of trade litigations in Bulgaria
Glow: Generative Flow with Invertible 1×1 Convolutions
Cancer Risk Messages: A Light Bulb Model
Convolutional Recurrent Neural Networks for Blood Glucose Prediction
Hopf dreams
Cancer Risk Messages: Public Health and Economic Welfare
Deep Learning for Singing Processing: Achievements, Challenges and Impact on Singers and Listeners
Simulation Modelling of Inequality in Cancer Service Access
Position-aware Self-attention with Relative Positional Encodings for Slot Filling
A deep learning approach for understanding natural language commands for mobile service robots
Deriving Neural Network Architectures using Precision Learning: Parallel-to-fan beam Conversion
ChestNet: A Deep Neural Network for Classification of Thoracic Diseases on Chest Radiography
Temporal Difference Learning with Neural Networks – Study of the Leakage Propagation Problem
On Sparse Reflexive Generalized Inverses
Video Summarisation by Classification with Deep Reinforcement Learning
A partial orthogonalization method for simulating covariance and concentration graph matrices
Computer Assisted Localization of a Heart Arrhythmia
Evolution of Cooperation on Stochastic Block Models
Deep Co-Clustering for Unsupervised Audiovisual Learning
SWIPT-based Real-Time Mobile Computing Systems: A Stochastic Geometry Perspective
Execution-Guided Neural Program Decoding
Discriminating between Indo-Aryan Languages Using SVM Ensembles
Sparse tensor recovery via N-mode FISTA with support augmentation
Sampling and Inference for Beta Neutral-to-the-Left Models of Sparse Networks
On a moment problem related to Bernstein functions
Towards Better UD Parsing: Deep Contextualized Word Embeddings, Ensemble, and Treebank Concatenation
Partial smoothness and constant rank
Cut-off Theorems for the PV-model
Generalized Approximate Message Passing for Unlimited Sampling of Sparse Signals
Prediction regions through Inverse Regression
Approximate k-space models and Deep Learning for fast photoacoustic reconstruction
Delayed Bandit Online Learning with Unknown Delays
Efficient Decentralized Deep Learning by Dynamic Model Averaging
A Regularized and Smoothed Fischer-Burmeister Method for Quadratic Programming with Applications to Model Predictive Control
Entropy Maximization for Markov Decision Processes Under Temporal Logic Constraints
Design and Evaluation of a Tutor Platform for Personalized Vocabulary Learning
Effects of Load-Based Frequency Regulation on Distribution Network Operation
Probability measure-valued polynomial diffusions
Bayesian Sequential Joint Detection and Estimation
Fashion is Taking Shape: Understanding Clothing Preference Based on Body Shape From Online Sources
Automatic multi-objective based feature selection for classification
Exploring Brain-wide Development of Inhibition through Deep Learning
An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution
Strongly Disordered Floquet Topological Systems
On the Ergodic Control of Ensembles
Data-Driven LQR Control Design
Efficient convergence through adaptive learning in sequential Monte Carlo Expectation Maximization
Lattice paths and branched continued fractions: An infinite sequence of generalizations of the Stieltjes–Rogers and Thron–Rogers polynomials, with coefficientwise Hankel-total positivity
The Hopf algebra of integer binary relations
Adversarial Symbolic Execution for Detecting Concurrency-Related Cache Timing Leaks
Ising model and the positive orthogonal Grassmannian
Pooling Pyramid Network for Object Detection
Dynamic Pricing with Finitely Many Unknown Valuations
Bias Correction For Paid Search In Media Mix Modeling
Beamforming Techniques for Non-Orthogonal Multiple Access in 5G Cellular Networks
Crystal structures for symmetric Grothendieck polynomials
