Network Classification and Categorization

To the best of our knowledge, this paper presents the first large-scale study that tests whether network categories (e.g., social networks vs. web graphs) are distinguishable from one another, using both categories of real-world networks and synthetic graphs. A random forest classifier achieved a classification accuracy of 94.2% with both real and synthetic networks. This work makes two important findings. First, real-world networks from various domains have distinct structural properties that allow us to predict the category of an arbitrary network with high accuracy. Second, classifying synthetic networks is trivial, as our models can easily distinguish between synthetic graphs and the real-world networks they are supposed to model.

MOLTE: a Modular Optimal Learning Testing Environment

We address the relative paucity of empirical testing of learning algorithms (of any type) by introducing a new public-domain Modular Optimal Learning Testing Environment (MOLTE) for Bayesian ranking and selection problems, stochastic bandits, and sequential experimental design problems. The Matlab-based simulator allows the comparison of a number of learning policies (represented as a series of .m modules) in the context of a wide range of problems (each represented in its own .m module), which makes it easy to add new algorithms and new test problems. State-of-the-art policies and various problem classes are provided in the package. The choice of problems and policies is guided through a spreadsheet-based interface. Different graphical metrics are included. MOLTE is designed to be compatible with parallel computing to scale up from local desktops to clusters and clouds. We offer MOLTE as an easy-to-use tool for the research community that will make it possible to perform much more comprehensive testing, spanning a broader selection of algorithms and test problems. We demonstrate the capabilities of MOLTE through a series of comparisons of policies on a starter library of test problems. We also address the problem of tuning and constructing priors, which has been largely overlooked in the optimal learning literature. We envision MOLTE as a modest spur that provides researchers with an easy environment in which to study interesting questions in optimal learning.

Learning Unknown Markov Decision Processes: A Thompson Sampling Approach

We consider the problem of learning an unknown Markov Decision Process (MDP) that is weakly communicating in the infinite horizon setting. We propose a Thompson Sampling-based reinforcement learning algorithm with dynamic episodes (TSDE). At the beginning of each episode, the algorithm generates a sample from the posterior distribution over the unknown model parameters. It then follows the optimal stationary policy for the sampled model for the rest of the episode. The duration of each episode is dynamically determined by two stopping criteria. The first stopping criterion controls the growth rate of episode length. The second stopping criterion is triggered when the number of visits to any state-action pair is doubled. We establish \tilde O(HS\sqrt{AT}) bounds on expected regret under a Bayesian setting, where S and A are the sizes of the state and action spaces, T is time, and H is a bound on the span. This regret bound matches the best available bound for weakly communicating MDPs. Numerical results show that TSDE performs better than existing algorithms for infinite horizon MDPs.
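
The two stopping criteria described above can be sketched as a small predicate. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not give the exact form of the first criterion, so the sketch assumes an episode may run at most one step longer than the previous one, and it assumes a strict "more than doubled" comparison for the second. The function name and its arguments are hypothetical.

```python
from typing import Dict, Tuple

StateAction = Tuple[int, int]


def should_end_episode(
    t: int,
    episode_start: int,
    prev_episode_len: int,
    visits_now: Dict[StateAction, int],
    visits_at_start: Dict[StateAction, int],
) -> bool:
    """Return True if the episode that began at episode_start should stop at time t."""
    # Criterion 1 (assumed form): the current episode has already run longer
    # than the previous episode, so episode lengths grow by at most one step.
    if t - episode_start > prev_episode_len:
        return True
    # Criterion 2 (assumed strict inequality): some state-action pair has been
    # visited more than twice as often as at the start of the episode.
    for pair, count in visits_now.items():
        if count > 2 * visits_at_start.get(pair, 0):
            return True
    return False
```

In a full agent, a new model would be sampled from the posterior whenever this predicate returns True, and the optimal policy for that sample would be followed until the next stop.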

DeepVoting: An Explainable Framework for Semantic Part Detection under Partial Occlusion

In this paper, we study the task of detecting semantic parts of an object. This is very important in computer vision, as it makes it possible to parse an object as humans do and helps us better understand object detection algorithms. Detecting semantic parts is also very challenging, especially when the parts are partially or fully occluded. In this scenario, popular proposal-based methods like Faster-RCNN often produce unsatisfactory results, because both the proposal extraction and classification stages may be confused by the irrelevant occluders. To this end, we propose a novel detection framework, named DeepVoting, which accumulates local visual cues, called visual concepts (VCs), to locate the semantic parts. Our approach involves adding two layers after the intermediate outputs of a deep neural network. The first layer is used to extract VC responses, and the second layer performs a voting mechanism to capture the spatial relationship between VCs and semantic parts. The benefit is that each semantic part is supported by multiple VCs. Even if some of the supporting VCs are missing due to occlusion, we can still infer the presence of the target semantic part using the remaining ones. To avoid generating an exponentially large training set to cover all occlusion cases, we train our model without seeing occlusion and transfer the learned knowledge to deal with occlusions. This setting favors learning models that are naturally robust and adaptive to occlusions instead of over-fitting the occlusion patterns in the training data. In experiments, DeepVoting shows significantly better performance than Faster-RCNN on semantic part detection in occlusion scenarios, with one order of magnitude fewer parameters and 2.5x testing speed. In addition, DeepVoting is explainable, as the detection result can be diagnosed by looking up the voted VCs.

A Novel Algorithm for Clustering of Data on the Unit Sphere via Mixture Models

A new maximum approximate likelihood (ML) estimation algorithm for mixtures of Kent distributions is proposed. The new algorithm is constructed via the BSLM (block successive lower-bound maximization) framework and incorporates manifold optimization procedures within it. The BSLM algorithm is iterative and monotonically increases the approximate log-likelihood function in each step. Under mild regularity conditions, the BSLM algorithm is proved to be convergent and the approximate ML estimator is proved to be consistent. A Bayesian information criterion-like (BIC-like) model selection criterion is also derived for the task of choosing the number of components in the mixture distribution. The approximate ML estimator and the BIC-like criterion are both demonstrated to be successful via simulation studies. A model-based clustering rule is proposed and also assessed favorably via simulations. Example applications of the developed methodology are provided via an image segmentation task and a neural imaging clustering problem.

Embedded-Graph Theory

In this paper, we propose a new type of graph, denoted as 'embedded-graph', and its theory, which employs a distributed representation to describe the relations on the graph edges. Embedded-graphs can express linguistic and complicated relations that cannot be expressed by existing edge-graphs or weighted-graphs. We introduce the mathematical definitions of embedded-graph, translation, edge distance, and graph similarity. An embedded-graph can be transformed into a weighted-graph by the translation method, and a weighted-graph into an edge-graph by threshold calculation. The edge distance of an embedded-graph is a distance based on the components of a target vector, and it is calculated through cosine similarity with the target vector. The graph similarity is obtained by considering relations with linguistic complexity. In addition, we provide some examples and data structures for embedded-graphs in this paper.
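
To make the chain from embedded-graph to weighted-graph to edge-graph concrete, here is a minimal sketch. It assumes that each edge carries a plain list of floats, that the translation step scores an edge by its cosine similarity with a target vector, and that thresholding keeps edges whose weight is at least the threshold. The abstract does not spell out these details, so the function names and the data layout are hypothetical.

```python
import math
from typing import Dict, List, Set, Tuple

Edge = Tuple[str, str]


def cosine_similarity(u: List[float], v: List[float]) -> float:
    """Cosine of the angle between u and v; 0.0 if either vector is zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def to_weighted_graph(
    embedded: Dict[Edge, List[float]], target: List[float]
) -> Dict[Edge, float]:
    """Assumed translation step: weight each edge by similarity to the target."""
    return {edge: cosine_similarity(vec, target) for edge, vec in embedded.items()}


def to_edge_graph(weighted: Dict[Edge, float], threshold: float) -> Set[Edge]:
    """Threshold step: keep only edges whose weight reaches the threshold."""
    return {edge for edge, weight in weighted.items() if weight >= threshold}


# Toy embedded-graph with 2-dimensional edge vectors (illustrative values only).
embedded_graph = {
    ("a", "b"): [1.0, 0.0],
    ("b", "c"): [0.0, 1.0],
    ("a", "c"): [1.0, 1.0],
}
weighted_graph = to_weighted_graph(embedded_graph, target=[1.0, 0.0])
edge_graph = to_edge_graph(weighted_graph, threshold=0.5)
```

With the target vector [1.0, 0.0], the edge ("b", "c") is orthogonal to the target and drops out at the 0.5 threshold, while the other two edges remain.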

Shannon entropy for imprecise and under-defined or over-defined information

Shannon entropy was defined for probability distributions, and its use was later extended to measure the uncertainty of knowledge for systems with complete information. In this article, we propose extending the use of Shannon entropy to under-defined or over-defined information systems. To be able to use Shannon entropy, the information is normalized by an affine transformation. The affine transformation is constructed in two stages: one for homothety and another for translation. Moreover, the case of information with a certain degree of imprecision is included in this approach. In addition, the article shows the use of Shannon entropy for some particular cases, such as neutrosophic information in both the trivalent and bivalent case, bifuzzy information, intuitionistic fuzzy information, imprecise fuzzy information, and fuzzy partitions.

Perspectives for Evaluating Conversational AI

Conversational AI systems are becoming commonplace in day-to-day life. In this paper, we address the following key question: how to identify whether design and development efforts for search-oriented conversational AI are successful. It is tricky to define 'success' in the case of conversational AI, and equally tricky to choose appropriate metrics for its evaluation. We propose four different perspectives for the evaluation of conversational AI systems: user experience, information retrieval, linguistics, and artificial intelligence. Additionally, background details of conversational AI systems are provided, including desirable characteristics of personal assistants and the differences between a chatbot and an AI-based personal assistant. The importance of personalization and how it can be achieved are explained in detail. Current challenges in the development of an ideal conversational AI (personal assistant) are also highlighted, along with guidelines for achieving a personalized experience for users.

Evaluating probabilistic forecasts with the R package scoringRules

Probabilistic forecasts in the form of probability distributions over future events have become popular in several fields including meteorology, hydrology, economics, and demography. In typical applications, many alternative statistical models and data sources can be used to produce probabilistic forecasts. Hence, evaluating and selecting among competing methods is an important task. The scoringRules package for R provides functionality for comparative evaluation of probabilistic models based on proper scoring rules, covering a wide range of situations in applied work. This paper discusses implementation and usage details, presents case studies from meteorology and economics, and points to the relevant background literature.
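
As a concrete instance of a proper scoring rule, the continuous ranked probability score (CRPS) of a Gaussian forecast has a well-known closed form. The sketch below is written in plain Python for illustration and is not the scoringRules API; the function name crps_normal is hypothetical. Lower scores indicate better forecasts.

```python
import math


def crps_normal(y: float, mu: float, sigma: float) -> float:
    """CRPS of the forecast N(mu, sigma^2) for the observation y.

    Closed form: sigma * (z * (2 * Phi(z) - 1) + 2 * phi(z) - 1 / sqrt(pi)),
    where z = (y - mu) / sigma, Phi is the standard normal CDF, and phi is
    the standard normal density.
    """
    if sigma <= 0.0:
        raise ValueError("sigma must be positive")
    z = (y - mu) / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    return sigma * (z * (2.0 * cdf - 1.0) + 2.0 * pdf - 1.0 / math.sqrt(math.pi))


# Two competing forecasts for the same observation: the sharper forecast
# centred on the observed value receives the lower (better) score.
score_wide = crps_normal(y=0.0, mu=0.0, sigma=1.0)
score_sharp = crps_normal(y=0.0, mu=0.0, sigma=0.5)
```

Averaging such scores over many forecast cases gives one way to compare competing models, in the spirit of the comparative evaluation described above.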

Motif-based Rule Discovery for Predicting Real-valued Time Series

Time series prediction is of great significance in many applications and has attracted extensive attention from the data mining community. Existing work suggests that, for many problems, a shape in the current time series may correlate with an upcoming shape in the same or another series. Therefore, it is a promising strategy to associate two recurring patterns as a rule's antecedent and consequent: the occurrence of the antecedent can foretell the occurrence of the consequent, and the learned shape of the consequent will give accurate predictions. Earlier work employs symbolization methods, but the symbolized representation retains too little information from the original series to mine valid rules. The state-of-the-art work, though it manipulates the series directly, fails to segment the series precisely when seeking antecedents and consequents, resulting in inaccurate rules in common scenarios. In this paper, we propose a novel motif-based rule discovery method, which uses motif discovery to accurately extract frequently occurring consecutive subsequences, i.e., motifs, as antecedents and consequents. It then investigates the underlying relationships between motifs by matching motifs as rule candidates and ranking them based on their similarities. Experimental results on real open datasets show that the proposed approach outperforms the baseline method by 23.9%. Furthermore, it extends the applicability from single time series to multiple ones.

Factor Analysis of Interval Data

This paper presents a factor analysis model for symbolic data, focusing on the particular case of interval-valued variables. The proposed method describes the correlation structure among the measured interval-valued variables in terms of a few underlying, but unobservable, uncorrelated interval-valued variables, called common factors. Uniform and Triangular distributions are considered within each observed interval. We obtain the corresponding sample mean, variance and covariance assuming a general Triangular distribution. In our proposal, factors are extracted either by Principal Component or by Principal Axis Factoring, performed on the correlation matrix of the interval-valued variables. To estimate the values of the common factors, usually called factor scores, two approaches are considered, which are inspired by methods for real-valued data: the Bartlett and the Anderson-Rubin methods. In both cases, the estimated values are obtained by solving an optimization problem that minimizes a function of the weighted squared Mallows distance between quantile functions. Explicit expressions for the quantile function and the squared Mallows distance are derived assuming a general Triangular distribution. The applicability of the method is illustrated using two sets of data: temperature and precipitation in cities of the United States of America between the years 1971 and 2000, and measures of car characteristics of different makes and models. Moreover, the method is evaluated on synthetic data with predefined correlation structures.
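
For intuition about the squared Mallows distance between quantile functions, the sketch below covers only the simpler Uniform case, not the general Triangular expressions derived in the paper, and it omits any weighting. A Uniform distribution on [a, b] with centre c and half-range r has quantile function q(t) = c + r(2t - 1), so integrating the squared difference of two such functions over [0, 1] gives (c1 - c2)^2 + (r1 - r2)^2 / 3. The function names are hypothetical, and the numerical version is included only as a cross-check.

```python
def squared_mallows_uniform(a1: float, b1: float, a2: float, b2: float) -> float:
    """Closed-form squared Mallows distance between Uniform[a1, b1] and Uniform[a2, b2]."""
    c1, r1 = (a1 + b1) / 2.0, (b1 - a1) / 2.0  # centre and half-range of interval 1
    c2, r2 = (a2 + b2) / 2.0, (b2 - a2) / 2.0  # centre and half-range of interval 2
    return (c1 - c2) ** 2 + (r1 - r2) ** 2 / 3.0


def squared_mallows_numeric(
    a1: float, b1: float, a2: float, b2: float, steps: int = 1000
) -> float:
    """Midpoint-rule integral of the squared difference of the two quantile functions."""
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) / steps
        q1 = a1 + (b1 - a1) * t  # quantile function of Uniform[a1, b1]
        q2 = a2 + (b2 - a2) * t  # quantile function of Uniform[a2, b2]
        total += (q1 - q2) ** 2
    return total / steps


closed_form = squared_mallows_uniform(0.0, 2.0, 1.0, 5.0)
numeric = squared_mallows_numeric(0.0, 2.0, 1.0, 5.0)
```

The decomposition into a centre term and a range term shows that two intervals can differ in location, in spread, or in both.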

One-Shot Visual Imitation Learning via Meta-Learning

In order for a robot to be a generalist that can perform a wide range of jobs, it must be able to acquire a wide variety of skills quickly and efficiently in complex unstructured environments. High-capacity models such as deep neural networks can enable a robot to represent complex skills, but learning each skill from scratch then becomes infeasible. In this work, we present a meta-imitation learning method that enables a robot to learn how to learn more efficiently, allowing it to acquire new skills from just a single demonstration. Unlike prior methods for one-shot imitation, our method can scale to raw pixel inputs and requires data from significantly fewer prior tasks for effective learning of new skills. Our experiments on both simulated and real robot platforms demonstrate the ability to learn new tasks, end-to-end, from a single visual demonstration.

Shared Learning : Enhancing Reinforcement in $Q$-Ensembles

Deep Reinforcement Learning has achieved remarkable successes in a variety of domains, from video games to continuous control, by trying to maximize the cumulative reward. However, most of these successes rely on algorithms that require a large amount of training data to obtain results on par with human-level performance. This is not feasible if we are to deploy these systems on real-world tasks, and hence there has been an increased thrust in exploring data-efficient algorithms. To this end, we propose the Shared Learning framework, aimed at making Q-ensemble algorithms data-efficient. To achieve this, we look into some principles of transfer learning, which studies the benefits of information exchange across tasks in reinforcement learning, and adapt transfer to the learning of our value function estimates in a novel manner. In this paper, we consider the special case of transfer between the value function estimates in the Q-ensemble architecture of BootstrappedDQN. We further demonstrate empirically, on a suite of Atari 2600 games, how our proposed framework can help speed up the learning process in Q-ensembles with minimal computational overhead.

Gene tree reconciliation including transfers with replacement is hard and FPT
Asymptotic error distribution for the Euler scheme with locally Lipschitz coefficients
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems
Method for Aspect-Based Sentiment Annotation Using Rhetorical Analysis
A Rewriting System for Convex Optimization Problems
An Exploration of 2D and 3D Deep Learning Techniques for Cardiac MR Image Segmentation
Sampling from a mixture of different groups of coupons
On sets with small sumset in the circle
An Empirical Study of AI Population Dynamics with Million-agent Reinforcement Learning
Differentially Private Mixture of Generative Neural Networks
Mr. Jones — Towards a Proactive Smart Room Orchestrator
Saliency Transformation Network: Incorporating Multi-stage Visual Cues for Pancreas Segmentation
Workflow Complexity for Collaborative Interactions: Where are the Metrics? — A Challenge
State-Secrecy Codes for Networked Linear Systems
Efficient Computation of Multiple Density-Based Clustering Hierarchies
Normalized Direction-preserving Adam
Distinguishing between convergent evolution and violation of the molecular clock
Ignoring Distractors in the Absence of Labels: Optimal Linear Projection to Remove False Positives During Anomaly Detection
A Computational Model of Afterimages based on Simultaneous and Successive Contrasts
An Operational Semantic Basis for OpenMP Race Analysis
Predicting Organic Reaction Outcomes with Weisfeiler-Lehman Network
AMORPH: A statistical program for characterizing amorphous materials by X-ray diffraction
Using NLU in Context for Question Answering: Improving on Facebook’s bAbI Tasks
Structural properties of edge-chromatic critical multigraphs
REMOTEGATE: Incentive-Compatible Remote Configuration of Security Gateways
When Waiting is not an Option : Learning Options with a Deliberation Cost
Towards personalized human AI interaction – adapting the behavior of AI agents using neural signatures of subjective interest
Catalyst design using actively learned machine with non-ab initio input features towards CO2 reduction reactions
Autonomous Extracting a Hierarchical Structure of Tasks in Reinforcement Learning and Multi-task Reinforcement Learning
Pisano period codes
Accelerate Histogram-Based Contrast Enhancement by Selective Downsampling
Two-weight codes and second order recurrences
Extending inferences from randomized participants to all eligible individuals using trials nested within cohort studies
Revisiting Spectral Graph Clustering with Generative Community Models
A2-RL: Aesthetics Aware Reinforcement Learning for Automatic Image Cropping
A Framework for Generalizing Graph-based Representation Learning Methods
Simple Round Compression for Parallel Vertex Cover
Goodness-of-Fit Tests for Random Partitions via Symmetric Polynomials
Sufficient conditions on cycles that make planar graphs 4-choosable
Learning to Segment Instances in Videos with Spatial Propagation Network
Total Edge Irregularity Strength for Graphs
Heat kernels for time-dependent non-symmetric stable-like operators
Random matrix approach for primal-dual portfolio optimization problems
Agent-based Learning for Driving Policy Learning in Connected and Autonomous Vehicles
Robustness Analysis of Visual QA Models by Basic Questions
Well-posedness of supercritical SDE driven by Lévy processes with irregular drifts
Warmstarting of Model-based Algorithm Configuration
Scalability Evaluation of NSLP Algorithm for Solving Non-Stationary Linear Programming Problems on Cluster Computing Systems
Embedding simply connected 2-complexes in 3-space — I. A Kuratowski-type characterisation
Embedding simply connected 2-complexes in 3-space — II. Rotation systems
Embedding simply connected 2-complexes in 3-space — III. Constraint minors
Detection of Unauthorized IoT Devices Using Machine Learning Techniques
Embedding simply connected 2-complexes in 3-space — IV. Dual matroids
Network Slicing in Fog Radio Access Networks: Issues and Challenges
Embedding simply connected 2-complexes in 3-space — V. A refined Kuratowski-type characterisation
Learning Multi-frame Visual Representation for Joint Detection and Tracking of Small Objects
Conditions for Stability and Convergence of Set-Valued Stochastic Approximations: Applications to Approximate Value and Fixed point Iterations with Noise
KBLRN : End-to-End Learning of Knowledge Base Representations with Latent, Relational, and Numerical Features
Enumeration of labelled 4-regular planar graphs
Towards an Arabic-English Machine-Translation Based on Semantic Web
Machine-Translation History and Evolution: Survey for Arabic-English Translations
Heuristics with Performance Guarantees for the Minimum Number of Matches Problem in Heat Recovery Network Design
Uplink NOMA in Large-Scale Systems: Coverage and Physical Layer Security
The Conditional Analogy GAN: Swapping Fashion Articles on People Images
DiSAN: Directional Self-Attention Network for RNN/CNN-free Language Understanding
On proximal mappings with Young functions in uniformly convex Banach spaces
Codes for Erasures over Directed Graphs
Trait evolution with jumps: illusionary normality
A Randomly Expandable Method for Data Layout of RAID Storage Systems
EMOMA: Exact Match in One Memory Access
Trace and Stable Failures Semantics for CSP-Agda
The Impact of Local Geometry and Batch Size on the Convergence and Divergence of Stochastic Gradient Descent
Szemerédi’s theorem in the primes
An improvement on the maximum number of $k$-Dominating Independent Sets
Unsupervised deep object discovery for instance recognition
On Majorization in Dependence Modeling
Binary-decomposed DCNN for accelerating computation and compressing model without retraining
Subspace Clustering using Ensembles of $K$-Subspaces
Parallel Enumeration of Triangulations
T${}^2$K${}^2$: The Twitter Top-K Keywords Benchmark
Edge-Transitive Graphs
From Plants to Landmarks: Time-invariant Plant Localization that uses Deep Pose Regression in Agricultural Fields
Denoising Autoencoders for Overgeneralization in Neural Networks
Interpretable Graph-Based Semi-Supervised Learning via Flows
Uniform Edge Betweenness Centrality
A random walk with catastrophes
Homotopy type of the independence complexes of a family of regular bipartite graphs
Resource Allocation in Cloud Radio Access Networks with Device-to-Device Communications
Fast semi-supervised discriminant analysis for binary classification of large data-sets
Exploring Food Detection using CNNs
Diffusion Based Molecular Communication: Principle, Key Technologies, and Challenges
TraceTracker: Hardware/Software Co-Evaluation for Large-Scale I/O Workload Reconstruction
On Multi-Relational Link Prediction with Bilinear Models
A self-stabilizing algorithm for maximal matching in link-register model in $O(nΔ^3)$ moves
Synapse at CAp 2017 NER challenge: Fasttext CRF
MODNet: Moving Object Detection Network with Motion and Appearance for Autonomous Driving
General problem solving with category theory
Hybrid Beamforming with Spatial Modulation in Multi-user Massive MIMO mmWave Networks
p-Divisibility of the number of linear representations of an Abelian p-group
Generalized Biplots for Multidimensional Scaled Projections
Informed Non-convex Robust Principal Component Analysis with Features
Variable Selection for Highly Correlated Predictors
Multivariate reciprocal inverse Gaussian distributions from the Sabot-Tarrès-Zeng integral
Linear Precoding with Low-Resolution DACs for Massive MU-MIMO-OFDM Downlink
Hitting time and mixing time bounds of Stein’s factors
Global-Context Neural Machine Translation through Target-Side Attentive Residual Connections
Synthesizing Optimally Resilient Controllers
Power Regulation in High Performance Multicore Processors
Random Forests of Interaction Trees for Estimating Individualized Treatment Effects in Randomized Trials
Food Recognition using Fusion of Classifiers based on CNNs
Convolution semigroups on locally compact quantum groups and noncommutative Dirichlet forms
Spatio-temporal Graph Convolutional Neural Network: A Deep Learning Framework for Traffic Forecasting
Optimal broadcasting in networks with faulty nodes
μ- Integrable Functions and Weak Convergence of Finite Measures
Control-Oriented Learning on the Fly
Saddle-point theorems for the generalized cone-convex optimizations of set-valued functions
Convolutional Networks for Spherical Signals
Long cycles in Hamiltonian graphs
Amortization does not enhance the max-Rains information of a quantum channel
Total Variation Superiorized Conjugate Gradient Method for Image Reconstruction