Local Shrunk Discriminant Analysis (LSDA)

Dimensionality reduction is a crucial step for pattern recognition and data mining tasks to overcome the curse of dimensionality. Principal component analysis (PCA) is a traditional technique for unsupervised dimensionality reduction, which is often employed to seek a projection to best represent the data in a least-squares sense, but if the original data is nonlinear structure, the performance of PCA will quickly drop. An supervised dimensionality reduction algorithm called Linear discriminant analysis (LDA) seeks for an embedding transformation, which can work well with Gaussian distribution data or single-modal data, but for non-Gaussian distribution data or multimodal data, it gives undesired results. What is worse, the dimension of LDA cannot be more than the number of classes. In order to solve these issues, Local shrunk discriminant analysis (LSDA) is proposed in this work to process the non-Gaussian distribution data or multimodal data, which not only incorporate both the linear and nonlinear structures of original data, but also learn the pattern shrinking to make the data more flexible to fit the manifold structure. Further, LSDA has more strong generalization performance, whose objective function will become local LDA and traditional LDA when different extreme parameters are utilized respectively. What is more, a new efficient optimization algorithm is introduced to solve the non-convex objective function with low computational cost. Compared with other related approaches, such as PCA, LDA and local LDA, the proposed method can derive a subspace which is more suitable for non-Gaussian distribution and real data. Promising experimental results on different kinds of data sets demonstrate the effectiveness of the proposed approach.


Lifelong Metric Learning

The state-of-the-art online learning approaches is only capable of learning the metric for predefined tasks. In this paper, we consider lifelong learning problem to mimic ‘human learning’, i.e., endow a new capability to the learned metric for a new task from new online samples and incorporating previous experiences and knowledge. Therefore, we propose a new framework: lifelong metric learning (LML), which only utilizes the data of the new task to train the metric model while preserving the original capabilities. More specifically, the proposed LML maintains a common subspace for all learned metrics, named lifelong dictionary, transfers knowledge from the common subspace to each new metric task with task-specific idiosyncrasy, and redefines the common subspace over time to maximize performance across all metric tasks. We apply online Passive Aggressive optimization to solve the proposed LML framework. Finally, we evaluate our approach by analyzing several multi-task metric learning datasets. Extensive experimental results demonstrate effectiveness and efficiency of the proposed framework.


Answer Set Programming for Non-Stationary Markov Decision Processes

Non-stationary domains, where unforeseen changes happen, present a challenge for agents to find an optimal policy for a sequential decision making problem. This work investigates a solution to this problem that combines Markov Decision Processes (MDP) and Reinforcement Learning (RL) with Answer Set Programming (ASP) in a method we call ASP(RL). In this method, Answer Set Programming is used to find the possible trajectories of an MDP, from where Reinforcement Learning is applied to learn the optimal policy of the problem. Results show that ASP(RL) is capable of efficiently finding the optimal solution of an MDP representing non-stationary domains.


Gabor Convolutional Networks

Steerable properties dominate the design of traditional filters, e.g., Gabor filters, and endow features the capability of dealing with spatial transformations. However, such excellent properties have not been well explored in the popular deep convolutional neural networks (DCNNs). In this paper, we propose a new deep model, termed Gabor Convolutional Networks (GCNs or Gabor CNNs), which incorporates Gabor filters into DCNNs to enhance the resistance of deep learned features to the orientation and scale changes. By only manipulating the basic element of DCNNs based on Gabor filters, i.e., the convolution operator, GCNs can be easily implemented and are compatible with any popular deep learning architecture. Experimental results demonstrate the super capability of our algorithm in recognizing objects, where the scale and rotation changes occur frequently. The proposed GCNs have much fewer learnable network parameters, and thus is easier to train with an end-to-end pipeline. To encourage further developments, the source code is released at Github.


XES Tensorflow – Process Prediction using the Tensorflow Deep-Learning Framework

Predicting the next activity of a running process is an important aspect of process management. Recently, artificial neural networks, so called deep-learning approaches, have been proposed to address this challenge. This demo paper describes a software application that applies the Tensorflow deep-learning framework to process prediction. The software application reads industry-standard XES files for training and presents the user with an easy-to-use graphical user interface for both training and prediction. The system provides several improvements over earlier work. This demo paper focuses on the software implementation and describes the architecture and user interface.


Neural Models for Information Retrieval

Neural ranking models for information retrieval (IR) use shallow or deep neural networks to rank search results in response to a query. Traditional learning to rank models employ machine learning techniques over hand-crafted IR features. By contrast, neural models learn representations of language from raw text that can bridge the gap between query and document vocabulary. Unlike classical IR models, these new machine learning based approaches are data-hungry, requiring large scale training data before they can be deployed. This tutorial introduces basic concepts and intuitions behind neural IR models, and places them in the context of traditional retrieval models. We begin by introducing fundamental concepts of IR and different neural and non-neural approaches to learning vector representations of text. We then review shallow neural IR methods that employ pre-trained neural term embeddings without learning the IR task end-to-end. We introduce deep neural networks next, discussing popular deep architectures. Finally, we review the current DNN models for information retrieval. We conclude with a discussion on potential future directions for neural IR.


An unbiased estimator for the ellipticity from image moments

Stochastic models for fully coupled systems of nonlinear parabolic equations

Cyclically Symmetric Lozenge Tilings of a Hexagon with Four Holes

Generalized Multiplicative Indices of Polycyclic Aromatic Hydrocarbons and Benzeniod Systems

Summarized Network Behavior Prediction

Population protocols for leader election and exact majority with O(log^2 n) states and O(log^2 n) convergence time

Recovery of structure of looped jointed objects from multiframes

Out-of-focus: Learning Depth from Image Bokeh for Robotic Perception

The efficient, the intensive, and the productive: insights from the urban Kaya relation

Shading Annotations in the Wild

On the Almeida-Thouless instability in short-range Ising spin-glasses

Rational ignorance: simpler models learn more from finite data

CDDT: Fast Approximate 2D Ray Casting for Accelerated Localization

Imagining Probabilistic Belief Change as Imaging (Technical Report)

How does Docker affect energy consumption? Evaluating workloads in and out of Docker containers

Cascaded Boundary Regression for Temporal Action Detection

Towards Full Automated Drive in Urban Environments: A Demonstration in GoMentum Station, California

Dynamic Polytopic Template Approach to Robust Transient Stability Assessment

Resource Allocation for Elastic Optical Networks using Geometric Optimization

The Lovász Theta Function for Random Regular Graphs and Community Detection in the Hard Regime

Navigating Intersections with Autonomous Vehicles using Deep Reinforcement Learning

Analyzing Knowledge Transfer in Deep Q-Networks for Autonomously Handling Multiple Intersections

Four Edge-Independent Spanning Trees

Error analysis for global minima of semilinear optimal control problems

Spectral clustering in the dynamic stochastic block model

The 5G Cellular Backhaul Management Dilemma: To Cache or to Serve

A Rule-Based Computational Model of Cognitive Arithmetic

Informative and misinformative interactions in a school of fish

A Hybrid Architecture for Multi-Party Conversational Systems

Inference for three-parameter M-Wright distributions with applications

Marine Animal Classification with Correntropy Loss Based Multi-view Learning

Topological containment of the 5-clique minus an edge in 4-connected graphs

A Versatile, Sound Tool for Simplifying Definitions

Deterministic Distributed Construction of $T$-Dominating Sets in Time $T$

Non-Orthogonal Random Access (NORA) for 5G Networks

On Minkowski type question mark functions associated with even or odd continued fractions

Consistency of orthology and paralogy constraints in the presence of gene transfers

Part-based Weighting Aggregation of Deep Convolutional Features for Image

The Forgettable-Watcher Model for Video Question Answering

Failure localization in time critical market applications

Super-Resolution of Wavelet-Encoded Images

Amortized Inference and Learning in Latent Conditional Random Fields for Weakly-Supervised Semantic Image Segmentation

On the effectiveness of feature set augmentation using clusters of word embeddings

Objective Bayesian analysis for the multivariate skew-t model

On a representation of fractional Brownian motion and the limit distributions of statistics arising in cusp statistical models

Local times for spectrally negative Lévy processes

On the Laplacian spectra of some double join operations of graphs

Central Limit Theorems for empirical transportation cost in general dimension

The coordination of centralised and distributed generation

Mass Volume Curves and Anomaly Ranking

Amobee at SemEval-2017 Task 4: Deep Learning System for Sentiment Detection on Twitter

Fixed effects selection in the linear mixed-effects model using adaptive ridge procedure for L0 penalty performance

Learning Cross-Domain Disentangled Deep Representation with Supervision from A Single Domain

Formal Verification of Piece-Wise Linear Feed-Forward Neural Networks

On-The-Fly Secure Key Generation with Deterministic Models

The geometrical origins of some distributions and the complete concentration of measure phenomenon for mean-values of functionals

From collective oscillation to chimera state in a nonlocally excitable system

Algebraic characterization of regular fractions under level permutations

Linear Regression with Shuffled Labels

Going Wider: Recurrent Neural Network With Parallel Cells

Bowtie-free graphs and generic automorphisms

Optical Flow in Mostly Rigid Scenes

Construction of Four Completely Independent Spanning Trees on Augmented Cubes

FOIL it! Find One mismatch between Image and Language caption

Why Rotation Averaging is Easy

Quantified advantage of discontinuous weight selection in approximations with deep neural networks

Experimental Comparison of Probabilistic Shaping Methods for Unrepeated Fiber Transmission

Weakly-supervised Visual Grounding of Phrases with Linguistic Structures

Brownian forgery of statistical dependences

Fast Real-Time DC State Estimation in Electric Power Systems Using Belief Propagation

Internal control of systems of semilinear coupled 1-D wave equations

Learning to Estimate 3D Hand Pose from Single RGB Images

A Characterization of the Shannon Ordering of Communication Channels

Gradient Methods with Regularization for Constrained Optimization Problems and Their Complexity Estimates

Covering Small Independent Sets and Separators with Applications to Parameterized Algorithms

Data-Driven Synthesis of Smoke Flows with CNN-based Feature Descriptors

Tikhonov regularization of optimal control problems governed by semi-linear partial differential equations

Infinite-Duration Bidding Games

Polynomial expansion and sublinear separators

Comparison of Polynomial Chaos and Gaussian Process surrogates for uncertainty quantification and correlation estimation of spatially distributed open-channel steady flows

A level set-based structural optimization code using FEniCS

Algorithmic trading in a microstructural limit order book model

Robust Inference under the Beta Regression Model with Application to Health Care Studies

Chunk-Based Bi-Scale Decoder for Neural Machine Translation

Distributed Proportional-Fairness Control in MicroGrids via Blockchain Smart Contracts

The Payoff Region of a Strategic Game and Its Extreme Points

The Homogeneous Broadcast Problem in Narrow and Wide Strips

Randomness cost of symmetric twirling

An Incentive-based Online Optimization Framework for Distribution Grids

Efficient Spatio-Temporal Gaussian Regression via Kalman Filtering

Classical Discrete-Time Adaptive Control Revisited: Exponential Stabilization

Sustaining Moore’s Law Through Inexactness

Balanced Excitation and Inhibition are Required for High-Capacity, Noise-Robust Neuronal Selectivity

Introduction to finite mixtures

A Fast Causal Profiler for Task Parallel Programs

Advertisements