Feature Selection Parallel Technique for Remotely Sensed Imagery Classification

Feature selection has long attracted the attention of the remote sensing community because it is a prerequisite for image processing and various applications. Different feature selection methods have been proposed to improve classification accuracy. They range from basic search techniques to clonal selection, and various optimality criteria have been investigated. Recently, methods using dependence-based measures have attracted much attention due to their ability to deal with very high-dimensional datasets. However, these methods are based on Cramér's V test, which has performance issues with large datasets. In this paper, we propose a parallel approach to improve their performance. We evaluate our approach on hyperspectral and high spatial resolution images and, as preliminary results, compare it to a centralized version of the proposed methods. The results are very promising.
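
As a rough illustration of the dependence measure mentioned above, the following Python sketch computes Cramér's V between a discretized feature and the class labels, then scores several features concurrently. It is not the paper's implementation: the function names `cramers_v` and `score_features` and the use of a thread pool are assumptions made for this example, and a real speedup on large imagery would likely need a process pool or a cluster rather than threads.

```python
import math
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def cramers_v(xs, ys):
    """Cramér's V between two equal-length sequences of categorical values."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed cell counts
    row = Counter(xs)              # marginal counts of the feature
    col = Counter(ys)              # marginal counts of the labels
    k = min(len(row), len(col))
    if k < 2:
        return 0.0  # a constant variable carries no association
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * (k - 1)))


def score_features(columns, labels, max_workers=4):
    """Score each feature column against the labels (hypothetical helper).

    Each feature is scored independently, which is what makes the task
    easy to distribute; threads are used here only to keep the sketch short.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda column: cramers_v(column, labels), columns))
```

Features can then be ranked by their score, for example `score_features([band_a, band_b], labels)`, where `band_a` and `band_b` are hypothetical discretized spectral bands.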

Learning Detection with Diverse Proposals

To predict a set of diverse and informative proposals with enriched representations, this paper introduces a differentiable Determinantal Point Process (DPP) layer that is able to augment object detection architectures. Most modern object detection architectures, such as Faster R-CNN, learn to localize objects by minimizing deviations from the ground truth but ignore correlation between multiple proposals and object categories. Non-Maximum Suppression (NMS), a widely used proposal pruning scheme, ignores label- and instance-level relations between object candidates, resulting in multi-labeled detections. In the multi-class case, NMS selects boxes with the largest prediction scores, ignoring the semantic relation between categories of potential election. In contrast, our trainable DPP layer, allowing for Learning Detection with Diverse Proposals (LDDP), considers both label-level contextual information and spatial layout relationships between proposals without increasing the number of parameters of the network, and thus substantially improves the location and category specifications of the final detected bounding boxes during both training and inference. Furthermore, we show that LDDP keeps its superiority over Faster R-CNN even if the number of proposals generated by LDDP is only ~30% as many as those for Faster R-CNN.
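
For reference, the baseline that the abstract contrasts against can be sketched in a few lines of Python. This is a generic greedy NMS, not the DPP layer proposed in the paper; the box format `(x1, y1, x2, y2)`, the function names, and the default threshold of 0.5 are assumptions for illustration.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Return indices of kept boxes, highest score first.

    A box is dropped when it overlaps an already kept box by more than
    iou_threshold. Only scores and overlaps are used: class labels and
    relations between candidates play no role in the decision.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep
```

Because the pruning rule looks only at scores and pairwise overlap, it has no way to use the label-level context that the abstract argues for.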

Energy Propagation in Deep Convolutional Neural Networks

Many practical machine learning tasks employ very deep convolutional neural networks. Such large depths pose formidable computational challenges in training and operating the network. It is therefore important to understand how many layers are actually needed to have most of the input signal’s features be contained in the feature vector generated by the network. This question can be formalized by asking how quickly the energy contained in the feature maps decays across layers. In addition, it is desirable that none of the input signal’s features be ‘lost’ in the feature extraction network or, more formally, we want energy conservation in the sense of the energy contained in the feature vector being proportional to that of the corresponding input signal. This paper establishes conditions for energy conservation for a wide class of deep convolutional neural networks and characterizes corresponding feature map energy decay rates. Specifically, we consider general scattering networks, and find that under mild analyticity and high-pass conditions on the filters (which encompass, inter alia, various constructions of Weyl-Heisenberg filters, wavelets, ridgelets, $\alpha$-curvelets, and shearlets) the feature map energy decays at least polynomially fast. For broad families of wavelets and Weyl-Heisenberg filters, the guaranteed decay rate is shown to be exponential. Our results yield handy estimates of the number of layers needed to have at least $((1-\varepsilon)\cdot 100)\%$ of the input signal energy be contained in the feature vector.
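
To see how an exponential decay rate turns into a layer count, consider the following simplified calculation. It is only a sketch under stated assumptions: the symbol $R_N(f)$ (the input energy not yet captured by the first $N$ layers) and the decay factor $a > 1$ are hypothetical, and the paper's actual bounds may carry different constants.

```latex
% Hypothetical symbols: R_N(f) is the input energy not yet captured by the
% first N layers, and a > 1 is an assumed exponential decay factor.
\[
  R_N(f) \le a^{-N}\,\|f\|_2^2
  \quad\Longrightarrow\quad
  R_N(f) \le \varepsilon\,\|f\|_2^2
  \ \text{ whenever }\
  N \ge \frac{\log(1/\varepsilon)}{\log a}.
\]
% Example: a = 2, \varepsilon = 0.01 gives N \ge \log_2 100 \approx 6.64, so N = 7.
```

Under these assumptions, capturing 99% of the input energy would take seven layers when the residual energy halves at each layer.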

Deep Extreme Multi-label Learning

Extreme multi-label learning or classification has been a practical and important problem since the boom of big data. The main challenge lies in the exponential label space, which involves $2^L$ possible label sets when the label dimension $L$ is very large, e.g., in the millions for Wikipedia labels. This paper is motivated to better explore the label space by building and modeling an explicit label graph. Meanwhile, deep learning has been widely studied and used in various classification problems, including multi-label classification; however, it has not been sufficiently studied in this extreme but practical case, where the label space can be as large as millions. In this paper, we propose a practical deep embedding method for extreme multi-label classification. Our method harvests the ideas of non-linear embedding and of modeling the label space with graph priors at the same time. Extensive experiments on public datasets for XML show that our method performs competitively against state-of-the-art results.

Beliefs in Markov Trees – From Local Computations to Local Valuation

This paper is devoted to the expressiveness of hypergraphs for which uncertainty propagation by local computations via the Shenoy/Shafer method applies. It is demonstrated that, for this propagation method and a given joint belief distribution, no valuation of the hyperedges of a hypergraph can provide a simpler hypergraph structure than valuation of the hyperedges by conditional distributions. This has the vital implication that methods recovering belief networks from data have no better alternative for finding the simplest hypergraph structure for belief propagation. A method for recovering tree-structured belief networks has been developed and specialized for Dempster-Shafer belief functions.

Determining Song Similarity via Machine Learning Techniques and Tagging Information

The task of determining item similarity is a crucial one in a recommender system. This constitutes the base upon which the recommender system will work to determine which items are more likely to be enjoyed by a user, resulting in more user engagement. In this paper we tackle the problem of determining song similarity based solely on song metadata (such as the performer and song title) and on tags contributed by users. We evaluate our approach under a series of different machine learning algorithms. We conclude that tf-idf achieves better results than Word2Vec for mapping the dataset to feature vectors. We also conclude that k-NN models have better performance than SVMs and Linear Regression for this problem.
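
The combination of tf-idf features and nearest-neighbour lookup can be sketched as follows. This is a minimal Python illustration, not the authors' pipeline: the token lists, the unsmoothed idf formula, the cosine similarity, and the names `tfidf_vectors`, `cosine`, and `nearest_songs` are all assumptions made for the example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Map each token list (tags plus metadata words) to a sparse tf-idf dict."""
    n = len(docs)
    # Document frequency: in how many songs each token appears.
    df = Counter(tok for doc in docs for tok in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def nearest_songs(vectors, query_index, k=1):
    """Indices of the k songs most similar to the query song."""
    query = vectors[query_index]
    scored = [
        (cosine(query, v), i) for i, v in enumerate(vectors) if i != query_index
    ]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # ties broken by index
    return [i for _, i in scored[:k]]
```

With three hypothetical songs tagged `["rock", "guitar", "live"]`, `["rock", "guitar", "classic"]`, and `["jazz", "piano", "live"]`, the nearest neighbour of the first song is the second, since they share two tags rather than one.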

Robustly Learning a Gaussian: Getting Optimal Error, Efficiently

We study the fundamental problem of learning the parameters of a high-dimensional Gaussian in the presence of noise, where an $\varepsilon$-fraction of our samples were chosen by an adversary. We give robust estimators that achieve estimation error $O(\varepsilon)$ in the total variation distance, which is optimal up to a universal constant that is independent of the dimension. In the case where just the mean is unknown, our robustness guarantee is optimal up to a factor of $\sqrt{2}$ and the running time is polynomial in $d$ and $1/\varepsilon$. When both the mean and covariance are unknown, the running time is polynomial in $d$ and quasipolynomial in $1/\varepsilon$. Moreover, all of our algorithms require only a polynomial number of samples. Our work shows that the same sorts of error guarantees that were established over fifty years ago in the one-dimensional setting can also be achieved by efficient algorithms in high-dimensional settings.

Dynamic nested sampling: an improved algorithm for parameter estimation and evidence calculation

Learning Two-Branch Neural Networks for Image-Text Matching Tasks

What do Neural Machine Translation Models Learn about Morphology?

Marginal Likelihoods from Monte Carlo Markov Chains

A Neural Representation of Sketch Drawings

Tower-type bounds for unavoidable patterns in words

The rigidity of the graphs of homology spheres minus one edge

Simply Exponential Approximation of the Permanent of Positive Semidefinite Matrices

Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems

CNN-SLAM: Real-time dense monocular SLAM with learned depth prediction

Creativity: Generating Diverse Questions using Variational Autoencoders

A Gibbsian model for message routing in highly dense multi-hop networks

UC Merced Submission to the ActivityNet Challenge 2016

Unsupervised Spatio-Temporal Embeddings for User and Location Modelling

On Codes over $\mathbb{F}_{q}+v\mathbb{F}_{q}+v^{2}\mathbb{F}_{q}$

Unsupervised Event Abstraction using Pattern Abstraction and Local Process Models

Improving Fitness Functions in Genetic Programming for Classification on Unbalanced Credit Card Datasets

Rich-clubness test: how to determine whether a complex network has or doesn’t have a rich-club?

Toward a new approach for massive LiDAR data processing

On the Pervasiveness of Difference-Convexity in Optimization and Statistics

Toward a Distributed Knowledge Discovery system for Grid systems

Orthogonal polynomials and Smith normal form

Distributed Proximal Gradient Algorithm for Partially Asynchronous Computer Clusters

Leveraging Term Banks for Answering Complex Questions: A Case for Sparse Vectors

Well-posedness of a Model for the Growth of Tree Stems and Vines

On Simultaneous Two-player Combinatorial Auctions

Attention-based Extraction of Structured Information from Street View Imagery

Clarifying Trust in Social Internet of Things

Underapproximation of Reach-Avoid Sets for Discrete-Time Stochastic Systems via Lagrangian Methods

Cutting the Error by Half: Investigation of Very Deep CNN and Advanced Training Strategies for Document Image Classification

ConceptNet at SemEval-2017 Task 2: Extending Word Embeddings with Multilingual Relational Knowledge

The Fundamental Theorem of Perfect Simulation

Quasinonexpansive Iterations on the Affine Hull of Orbits: From Mann’s Mean Value Algorithm to Inertial Methods

Active classification with comparison queries

Beyond Planar Symmetry: Modeling human perception of reflection and rotation symmetries in the wild

CASP Solutions for Planning in Hybrid Domains

Pólya Urn Latent Dirichlet Allocation: a sparse massively parallel sampler

RLE Plots: Visualising Unwanted Variation in High Dimensional Data

Bayesian Optimal Data Detector for mmWave OFDM System with Low-Resolution ADC

Semidefinite Programming and Ramsey Numbers

Reformulating Level Sets as Deep Recurrent Neural Network Approach to Semantic Segmentation

Deep Contextual Recurrent Residual Networks for Scene Labeling

A Characterization of Oriented Hypergraphic Laplacian and Adjacency Matrix Coefficients

NOMA based Calibration for Large-Scale Spaceborne Antenna Arrays

Instance-Level Salient Object Segmentation

Privacy-Aware Guessing Efficiency

Automatic Discovery, Association Estimation and Learning of Semantic Attributes for a Thousand Categories

Hybrid Beamforming via the Kronecker Decomposition for the Millimeter-Wave Massive MIMO Systems

Finding Modes by Probabilistic Hypergraphs Shifting

Predictive-Corrective Networks for Action Detection

Representation Stability as a Regularizer for Improved Text Analytics Transfer Learning

Inter-Operator Resource Management for Millimeter Wave, Multi-Hop Backhaul Networks

Loklak – A Distributed Crawler and Data Harvester for Overcoming Rate Limits

Sampling-based speech parameter generation using moment-matching networks

Real-time On-Demand Crowd-powered Entity Extraction

Batch Data Processing and Gaussian Two-Armed Bandit

Optimizing stability of mutual synchronization between a pair of limit-cycle oscillators with weak cross coupling

Optimizing mutual synchronization of rhythmic spatiotemporal patterns in reaction-diffusion systems

Degrees of irreducible polynomials over binary field

Joint Semi-supervised RSS Dimensionality Reduction and Fingerprint Based Algorithm for Indoor Localization

Hardness of classically sampling one clean qubit model with constant total variation distance error

Dimensional reduction and its breakdown in the driven random field O(N) model near three-dimensions

A Component-Based Dual Decomposition Method for the OPF Problem

Decoupled Mild solutions for Pseudo Partial Differential Equations versus Martingale driven forward-backward SDEs

Preferential Bayesian Optimization

A new notion of majorization with applications to the comparison of extreme order statistics

On the Expansion Coefficients of KP Tau Function

Feature Tracking Cardiac Magnetic Resonance via Deep Learning and Spline Optimization

Approximating Optimization Problems using EAs on Scale-Free Networks

Stigmergy-based modeling to discover urban activity patterns from positioning data

Dilated Convolutional Neural Networks for Cardiovascular MR Segmentation in Congenital Heart Disease

Eigenvalues of symmetric tridiagonal interval matrices revisited

A Note on the Birkhoff Ergodic Theorem

Flows for Singular Stochastic Differential Equations with Unbounded Drifts

Automated Synthesis of Infinite Dimensional Stochastic Hybrid Systems

Trainable Referring Expression Generation using Overspecification Preferences

Optimal Repair Layering for Erasure-Coded Data Centers: From Theory to Practice

On topological obstructions to global stabilization of an inverted pendulum

Heat kernel of anisotropic nonlocal operators

Object proposal generation applying the distance dependent Chinese restaurant process

Investigation on the use of Hidden-Markov Models in automatic transcription of music

Tight embedding of modular lattices into partition lattices: progress and program

A Stream-Suitable Kolmogorov-Smirnov-Type Test for Big Data Analysis

Unsupervised Construction of Human Body Models Using Principles of Organic Computing

On computational complexity of Set Automata

MATS: Inference for potentially Singular and Heteroscedastic MANOVA

Learning from Demonstrations for Real World Reinforcement Learning

Growing and Destroying Catalan-Stanley Trees

Time crystals: a review

Deep-FExt: Deep Feature Extraction for Vessel Segmentation and Centerline Prediction

The gradient condition and the contribution of the dynamical part of Green-Kubo formula to the diffusion coefficient

Enabling Embedded Inference Engine with ARM Compute Library

A Proof of Orthogonal Double Machine Learning with $Z$-Estimators

Unsupervised part learning for visual recognition

On the complexity of finding and counting solution-free sets of integers

From ds-bounds for cyclic codes to true distance for abelian codes

NG2C: Pretenuring N-Generational GC for HotSpot Big Data Applications

Parallelized Kendall’s Tau Coefficient Computation via SIMD Vectorized Sorting On Many-Integrated-Core Processors

The highest root coefficients and the second smallest exponent

On large deviation probabilities for empirical distribution of branching random walks: Schröder case and Böttcher case

Optimal strategies for weighted ray search

Critical groups for Hopf algebra modules

Automatic differentiation of non-holonomic fast marching for computing most threatening trajectories under sensors surveillance

Automorphisms of the subspace sum graphs on a vector space

Attention-Set based Metric Learning for Video Face Recognition

A Neural Parametric Singing Synthesizer

Dynamic Quadratic Cheap Talk and Signaling Games

MAGAN: Margin Adaptation for Generative Adversarial Networks

Connecting Look and Feel: Associating the visual and tactile properties of physical materials

Hydrodynamic stability in the presence of a stochastic forcing: a case study in convection

Semantic3D.net: A new Large-scale Point Cloud Classification Benchmark

Numerical solution of time-dependent problems with fractional power elliptic operator

Matrix Concentration for Expander Walks