ONCE and ONCE+: Counting the Frequency of Time-constrained Serial Episodes in a Streaming Sequence

As a representative sequential pattern mining problem, counting the frequency of serial episodes from a streaming sequence has drawn continuous attention in academia due to its wide application in practice, e.g., telecommunication alarms, stock market, transaction logs, bioinformatics, etc. Although a number of serial episodes mining algorithms have been developed recently, most of them are neither stream-oriented, as they require multi-pass of dataset, nor time-aware, as they fail to take into account the time constraint of serial episodes. In this paper, we propose two novel one-pass algorithms, ONCE and ONCE+, each of which can respectively compute two popular frequencies of given episodes satisfying predefined time-constraint as signals in a stream arrives one-after-another. ONCE is only used for non-overlapped frequency where the occurrences of a serial episode in sequence are not intersected. ONCE+ is designed for the distinct frequency where the occurrences of a serial episode do not share any event. Theoretical study proves that our algorithm can correctly mine the frequency of target time constraint serial episodes in a given stream. Experimental study over both real-world and synthetic datasets demonstrates that the proposed algorithm can work, with little time and space, in signal-intensive streams where millions of signals arrive within a single second. Moreover, the algorithm has been applied in a real stream processing system, where the efficacy and efficiency of this work is tested in practical applications.


Improving Word Vector with Prior Knowledge in Semantic Dictionary

Using low dimensional vector space to represent words has been very effective in many NLP tasks. However, it doesn’t work well when faced with the problem of rare and unseen words. In this paper, we propose to leverage the knowledge in semantic dictionary in combination with some morphological information to build an enhanced vector space. We get an improvement of 2.3% over the state-of-the-art Heidel Time system in temporal expression recognition, and obtain a large gain in other name entity recognition (NER) tasks. The semantic dictionary Hownet alone also shows promising results in computing lexical similarity.


A Sheaf Model of Contradictions and Disagreements. Preliminary Report and Discussion

We introduce a new formal model — based on the mathematical construct of sheaves — for representing contradictory information in textual sources. This model has the advantage of letting us (a) identify the causes of the inconsistency; (b) measure how strong it is; (c) and do something about it, e.g. suggest ways to reconcile inconsistent advice. This model naturally represents the distinction between contradictions and disagreements. It is based on the idea of representing natural language sentences as formulas with parameters sitting on lattices, creating partial orders based on predicates shared by theories, and building sheaves on these partial orders with products of lattices as stalks. Degrees of disagreement are measured by the existence of global and local sections. Limitations of the sheaf approach and connections to recent work in natural language processing, as well as the topics of contextuality in physics, data fusion, topological data analysis and epistemology are also discussed.


Combining Convolution and Recursive Neural Networks for Sentiment Analysis

This paper addresses the problem of sentence-level sentiment analysis. In recent years, Convolution and Recursive Neural Networks have been proven to be effective network architecture for sentence-level sentiment analysis. Nevertheless, each of them has their own potential drawbacks. For alleviating their weaknesses, we combined Convolution and Recursive Neural Networks into a new network architecture. In addition, we employed transfer learning from a large document-level labeled sentiment dataset to improve the word embedding in our models. The resulting models outperform all recent Convolution and Recursive Neural Networks. Beyond that, our models achieve comparable performance with state-of-the-art systems on Stanford Sentiment Treebank.


Air Markov Chain Monte Carlo

We introduce a class of Adapted Increasingly Rarely Markov Chain Monte Carlo (AirMCMC) algorithms where the underlying Markov kernel is allowed to be changed based on the whole available chain output but only at specific time points separated by an increasing number of iterations. The main motivation is the ease of analysis of such algorithms. Under the assumption of either simultaneous or (weaker) local simultaneous geometric drift condition, or simultaneous polynomial drift we prove the L_2-convergence, Weak and Strong Laws of Large Numbers (WLLN, SLLN), Central Limit Theorem (CLT), and discuss how our approach extends the existing results. We argue that many of the known Adaptive MCMC algorithms may be transformed into the corresponding Air versions, and provide an empirical evidence that performance of the Air version stays virtually the same.


Ensemble Neural Relation Extraction with Adaptive Boosting

Relation extraction has been widely studied to extract new relational facts from open corpus. Previous relation extraction methods are faced with the problem of wrong labels and noisy data, which substantially decrease the performance of the model. In this paper, we propose an ensemble neural network model – Adaptive Boosting LSTMs with Attention, to more effectively perform relation extraction. Specifically, our model first employs the recursive neural network LSTMs to embed each sentence. Then we import attention into LSTMs by considering that the words in a sentence do not contribute equally to the semantic meaning of the sentence. Next via adaptive boosting, we build strategically several such neural classifiers. By ensembling multiple such LSTM classifiers with adaptive boosting, we could build a more effective and robust joint ensemble neural networks based relation extractor. Experiment results on real dataset demonstrate the superior performance of the proposed model, improving F1-score by about 8% compared to the state-of-the-art models. The code of this work is publicly available on https://…/re.


Nonlinear Dimensionality Reduction on Graphs

In this era of data deluge, many signal processing and machine learning tasks are faced with high-dimensional datasets, including images, videos, as well as time series generated from social, commercial and brain network interactions. Their efficient processing calls for dimensionality reduction techniques capable of properly compressing the data while preserving task-related characteristics, going beyond pairwise data correlations. The present paper puts forth a nonlinear dimensionality reduction framework that accounts for data lying on known graphs. The novel framework turns out to encompass most of the existing dimensionality reduction methods as special cases, and it is capable of capturing and preserving possibly nonlinear correlations that are ignored by linear methods, as well as taking into account information from multiple graphs. An efficient algorithm admitting closed-form solution is developed and tested on synthetic datasets to corroborate its effectiveness.


A notion of stability for k-means clustering

In this paper, we define and study a new notion of stability for the k-means clustering scheme building upon the notion of quantization of a probability measure. We connect this notion of stability to a geometric feature of the underlying distribution of the data, named absolute margin condition, inspired by recent works on the subject.


The Lazy Bootstrap. A Fast Resampling Method for Evaluating Latent Class Model Fit

The latent class model is a powerful unsupervised clustering algorithm for categorical data. Many statistics exist to test the fit of the latent class model. However, traditional methods to evaluate those fit statistics are not always useful. Asymptotic distributions are not always known, and empirical reference distributions can be very time consuming to obtain. In this paper we propose a fast resampling scheme with which any type of model fit can be assessed. We illustrate it here on the latent class model, but the methodology can be applied in any situation. The principle behind the lazy bootstrap method is to specify a statistic which captures the characteristics of the data that a model should capture correctly. If those characteristics in the observed data and in model-generated data are very different we can assume that the model could not have produced the observed data. With this method we achieve the flexibility of tests from the Bayesian framework, while only needing maximum likelihood estimates. We provide a step-wise algorithm with which the fit of a model can be assessed based on the characteristics we as researcher find important. In a Monte Carlo study we show that the method has very low type I errors, for all illustrated statistics. Power to reject a model depended largely on the type of statistic that was used and on sample size. We applied the method to an empirical data set on clinical subgroups with risk of Myocardial infarction and compared the results directly to the parametric bootstrap. The results of our method were highly similar to those obtained by the parametric bootstrap, while the required computations differed three orders of magnitude in favour of our method.


Human-Machine Inference Networks For Smart Decision Making: Opportunities and Challenges

The emerging paradigm of Human-Machine Inference Networks (HuMaINs) combines complementary cognitive strengths of humans and machines in an intelligent manner to tackle various inference tasks and achieves higher performance than either humans or machines by themselves. While inference performance optimization techniques for human-only or sensor-only networks are quite mature, HuMaINs require novel signal processing and machine learning solutions. In this paper, we present an overview of the HuMaINs architecture with a focus on three main issues that include architecture design, inference algorithms including security/privacy challenges, and application areas/use cases.


Expected Precision of Europa Clipper Gravity Measurements
Out-of-time-ordered measurements as a probe of quantum dynamics
Graph-Theoretic Framework for Unified Analysis of Observability and Data Injection Attacks in the Smart Grid
The totally nonnegative part of G/P is a ball
Canonical diffusions on the pattern spaces of aperiodic Delone sets
Nonseparable Sample Selection Models with Censored Selection Rules
Oracle Separations for Quantum Statistical Zero-Knowledge
Multiplicity of eigenvalues of cographs
Zeros of random polynomials and its higher derivatives
Efficient Hierarchical Graph-Based Segmentation of RGBD Videos
Object category learning and retrieval with weak supervision
The maximum deviation of the $\text{Sine}_β$ counting process
A Formal Definition of Importance for Summarization
Median bias reduction in random-effects meta-analysis and meta-regression
A Two-point Method for PTZ Camera Calibration in Sports
Pointwise Information Decomposition Using the Specificity and Ambiguity Lattices
Intersections of $ψ$ classes on Hassett Spaces for genus $0$ with all weights $\frac{1}{2}$
Poincaré-Bendixson Theorem for Hybrid Systems
Random Access Channel Coding in the Finite Blocklength Regime
A Characterization of Guesswork on Swiftly Tilting Curves
Approximate Inference via Weighted Rademacher Complexity
Adaptive Hybrid Beamforming with Massive Phased Arrays in Macro-Cellular Networks
Exploration on Generating Traditional Chinese Medicine Prescription from Symptoms with an End-to-End method
Affine Schubert calculus and double coinvariants
More powerful post-selection inference, with application to the Lasso
Variance-Optimal Offline and Streaming Stratified Random Sampling
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
Image2GIF: Generating Cinemagraphs using Recurrent Deep Q-Networks
Parametric Modeling of Non-Stationary Signals
Greedy Algorithms for Maximizing Nash Social Welfare
Stationary distribution of the stochastic theta method for nonlinear stochastic differential equations
Covariance-based Dissimilarity Measures Applied to Clustering Wide-sense Stationary Ergodic Processes
Dimensional Reduction by Conformal Bootstrap
Ear Recognition With Score-Level Fusion Based On CMC In Long-Wave Infrared Spectrum
Solving for multi-class using orthogonal coding matrices
A Multi-Biometrics for Twins Identification Based Speech and Ear
Fine-grained Visual Categorization using PAIRS: Pose and Appearance Integration for Recognizing Subcategories
Uniqueness and Stability of Optimizers for a Membrane Problem
IRSA Transmission Optimization via Online Learning
SWRL2SPIN: A tool for transforming SWRL rule bases in OWL ontologies to object-oriented SPIN rules
Pinpointing astrophysical bursts of low-energy neutrinos embedded into the noise
Capacity Theorems for Distributed Index Coding
Bayesian inference in Y-linked two-sex branching processes with mutations: ABC approach
A Review of Multiple Try MCMC algorithms for Signal Processing
Fast Cosmic Web Simulations with Generative Adversarial Networks
Using Additional Indexes for Fast Full-Text Search of Phrases That Contains Frequently Used Words
Interactive Deep Colorization With Simultaneous Global and Local Inputs
A Generative Approach to Zero-Shot and Few-Shot Action Recognition
On Scheduling Two-Stage Jobs on Multiple Two-Stage Flowshops
InteractiveGenerativeAdversarialNetworksforFacialExpressionGeneration in Dyadic Interactions
Towards an Understanding of Neural Networks in Natural-Image Spaces
Generalized Estimating Equation for the Student-t Distributions
Understanding Deep Architectures by Interpretable Visual Summaries
Deep Neural Networks In Fully Connected CRF For Image Labeling With Social Network Metadata
Spectral and Energy Efficient Wireless Powered IoT Networks: NOMA or TDMA?
Robust Multi-subspace Analysis Using Novel Column L0-norm Constrained Matrix Factorization
Ascent with Quadratic Assistance for the Construction of Exact Experimental Designs
Scalable Mutual Information Estimation using Dependence Graphs
Graphic displays of MLB pitching mechanics and its evolutions in PITCHf/x data
Meshed Up: Learnt Error Correction in 3D Reconstructions
Gradient descent revisited via an adaptive online learning rate
Cross-Fitting and Fast Remainder Rates for Semiparametric Estimation
Zonotopes whose cellular strings are all coherent
Identification of multiple hard X-ray sources in solar flares: A Bayesian analysis of the February 20 2002 event
Adaptive Scan Gibbs Sampler for Large Scale Inference Problems
Bayesian Nonparametric Modeling of Driver Behavior using HDP Split-Merge Sampling Algorithm
Modeling and Stabilization of a Rotating Mechanical System with Elastic Plates
A Notion of Total Dual Integrality for Convex, Semidefinite, and Extended Formulations
Faster Approximate(d) Text-to-Pattern L1 Distance
Another look into the Wong-Zakai Theorem for Stochastic Heat Equation
Optimal Energy Management Strategies in Wireless Data and Energy Cooperative Communications
Pinning by rare defects and effective mobility for elastic interfaces in high dimensions
Generalized Littlewood-Richardson coefficients for branching rules of GL(n) and extremal weight crystals
Sparse Portfolio Selection via Non-convex Fraction Function
Modified lp-norm regularization minimization for sparse signal recovery
Integrating Ultra-Fast Charging Stations within the Power Grids of Smart Cities: A Review
Mitigating Pilot Contamination in Multi-cell Hybrid Millimeter Wave Systems
A Comparison of SC-FDE and UW DFT-s-OFDM for Millimeter Wave Communications
Hindman-like theorems with uncountably many colours and finite monochromatic sets
Contextual Multi-Scale Region Convolutional 3D Network for Activity Detection
Marketing Analytics: Methods, Practice, Implementation, and Links to Other Fields
A Gale-Berlekamp permutation-switching problem in higher dimensions
Improved Training of Generative Adversarial Networks Using Representative Features
Algorithmic Linearly Constrained Gaussian Processes
Fixed points of Sturmian morphisms and their derivated words
Nash inequality for Diffusion Processes Associated with Dirichlet Distributions
The Zarankiewicz problem in 3-partite graphs
Probability Mass Exclusions and the Directed Components of Pointwise Mutual Information
Monitoring of Wild Pseudomonas Biofilm Strain Conditions Using Statistical Characterisation of Scanning Electron Microscopy Images
Multimodal Functional and Structural Brain Connectivity Analysis in Autism: A Preliminary Integrated Approach with EEG, fMRI and DTI
Random matrix approach to plasmon resonances in the random impedance network model of disordered nanocomposites
Structure and Sensitivity in Differential Privacy: Comparing K-Norm Mechanisms
Performance Analysis of Robust Stable PID Controllers Using Dominant Pole Placement for SOPTD Process Models
Time Constrained Continuous Subgraph Search over Streaming Graphs
Joint Voxel and Coordinate Regression for Accurate 3D Facial Landmark Localization
Wavelet Analysis of the Besov Regularity of Lévy White Noises
Multi-Pointer Co-Attention Networks for Recommendation
End to End Performance Analysis of Relay Cooperative Communication Based on Parked Cars
Study on Energy Consumption and Coverage of Hierarchical Cooperation of Small Cell Base Stations in Heterogeneous Networks
Inverse Uncertainty Quantification using the Modular Bayesian Approach based on Gaussian Process, Part 2: Application to TRACE
The weakly dependent strong law of large numbers revisited
Wasserstein-Riemannian Geometry of Positive-definite Matrices
Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data
Algebraic dependencies and PSPACE algorithms in approximative complexity
Revealing the intrinsic anisotropy of superconducting Sr$_x$Bi$_2$Se$_3$
Surfactant and gravity dependent instability of two-layer channel flows: Linear theory covering all wave lengths
Application of Kriging Models for a Drug Combination Experiment on Lung Cancer
The Gaussian Double-Bubble Conjecture
Adapting The Gibbs Sampler
A model-theoretic generalization of the Elekes-Szabó theorem
HONE: Higher-Order Network Embeddings
Optimal Beam Sweeping and Communication in Mobile Millimeter-Wave Networks
A Cyber Science Based Ontology for Artificial General Intelligence Containment
Less is more: sampling chemical space with active learning
Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks
Benchmarking Clinical Decision Support Search
Strong error analysis for stochastic gradient descent optimization algorithms
Sparse and Low-rank Tensor Estimation via Cubic Sketchings
Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks
Uncertainty Estimation in Functional Linear Models
Liquid State Machine Learning for Resource and Cache Management in LTE-U Unmanned Aerial Vehicle (UAV) Networks
Certified Defenses against Adversarial Examples
BRAINS: Joint Bandwidth-Relay Allocation in Multi-Homing Cooperative D2D Networks
Representing the Insincere: Strategically Robust Proportional Representation
Strong Approximation of Stochastic Allen-Cahn Equation with White Noise
On the Inter-relationships among Drift rate, Forgetting rate, Bias/variance profile and Error
Game of Sketches: Deep Recurrent Models of Pictionary-style Word Guessing
Comparative Study of ECO and CFNet Trackers in Noisy Environment
Approximate Vanishing Ideal via Data Knotting
Search Based Code Generation for Machine Learning Programs
Wireless Powered Asynchronous Backscatter Networks with Sporadic Short Packets: Performance Analysis and Optimization
Tournament Leave-pair-out Cross-validation for Receiver Operating Characteristic (ROC) Analysis
On the Quadratic Convergence of the Cubic Regularization Method under a Local Error Bound Condition
Safeguarding Millimeter Wave Communications Against Randomly Located Eavesdroppers
Shift-Net: Image Inpainting via Deep Feature Rearrangement
Curvature calculations for antitrees
Learning Combinations of Activation Functions
A free boundary problem with non local interaction
Join Query Optimization Techniques for Complex Event Processing Applications
CosFace: Large Margin Cosine Loss for Deep Face Recognition
Test Martingales for bounded random variables
Generalized Leapfrogging Samplesort: A Class of $O(n \log^2 n)$ Worst-Case Complexity and $O(n \log n)$ Average-Case Complexity Sorting Algorithms
Self-duality of Markov processes and intertwining functions
Local Visual Microphones: Improved Sound Extraction from Silent Video
Multiplicative ergodic theorem for a non-irreducible random dynamical system
Using Meta-heuristics and Machine Learning for Software Optimization of Parallel Computing Systems: A Systematic Literature Review
TernaryNet: Faster Deep Model Inference without GPUs for Medical 3D Segmentation using Sparse and Binary Convolutions
Road Damage Detection Using Deep Neural Networks with Images Captured Through a Smartphone
Testing normality using the summary statistics with application to meta-analysis
Using deep Q-learning to understand the tax evasion behavior of risk-averse firms
Hierarchical Spatial Transformer Network
DeepSIC: Deep Semantic Image Compression
Hyper-Hue and EMAP on Hyperspectral Images for Supervised Layer Decomposition of Old Master Drawings
Histogram of Oriented Depth Gradients for Action Recognition
Finite projective planes and the Delsarte LP-bound
Almost Optimal Scaling of Reed-Muller Codes on BEC and BSC Channels
Ultra Reliable Communication via Opportunistic ARQ Transmission in Cognitive Networks
On the Effective Energy Efficiency of Ultra-reliable Networks in the Finite Blocklength Regime
Which NP-Hard SAT and CSP Problems Admit Exponentially Improved Algorithms?
Effects of heterogeneity in power-grid network models
Improving Active Learning in Systematic Reviews
Non-Leaving-Face property for marked surfaces
Using High-Speed WANs and Network Data Caches to Enable Remote and Distributed Visualization
The exit time finite state projection scheme: bounding exit distributions and occupation measures of continuous-time Markov chains
Atomic Cross-Chain Swaps
A Pascal-like Bound for the Number of Necklaces with Fixed Density
Phase space reconstruction for non-uniformly sampled noisy time series
Learning-based Image Reconstruction via Parallel Proximal Algorithm
Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features
The Scalability of Trustless Trust
Bayesian inverse problems with non-commuting operators
A Full Bayesian Model to Handle Structural Ones and Missingness in Economic Evaluations from Individual-Level Data
Quantized Constant Envelope Precoding with PSK and QAM Signaling
The PomXYZ Proteins Self-Organize on the Bacterial Nucleoid to Stimulate Cell Division
Families of Solutions of Algebraic Riccati Equations
End-to-End Fine-Grained Action Segmentation and Recognition Using Conditional Random Field Models and Discriminative Sparse Coding
Deep Learning Approach for Very Similar Objects Recognition Application on Chihuahua and Muffin Problem
A Unifying Framework for Manipulation Problems
A note on expansion in prime fields
Basic stochastic transmission models and their inference
Deep Reinforcement Learning using Capsules in Advanced Game Environments
Second Order Asymptotic Properties for the Tail Probability of the Number of Customers in the M/G/1 Retrial Queue
Extremal Collections of $k$-Uniform Vectors
Comparison of wait time approximations in distribution networks using (R,Q)-order policies
An Optimal Value Iteration Algorithm for Parity Games
Estimating the Cardinality of Conjunctive Queries over RDF Data Using Graph Summarisation
Learning the Reward Function for a Misspecified Model
Safety-aware Adaptive Reinforcement Learning with Applications to Brushbot Navigation
Secure Massive IoT Using Hierarchical Fast Blind Deconvolution
Helping Crisis Responders Find the Informative Needle in the Tweet Haystack
Parameter Estimation, Sensitivity Analysis and Optimal Control of a Periodic Epidemic Model with Application to HRSV in Florida
Geospatial distributions reflect rates of evolution of features of language
Energy Scaling with Control Distance in Complex Networks
Improving Multiple Object Tracking with Optical Flow and Edge Preprocessing
Controllability, matching ratio and graph convergence
A multi-scale limit of a randomly forced rotating $3$-D compressible fluid
Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score
Stable day-to-day dynamics for departure time choice
Quantum Fractional Revival on Graphs
Matrix Completion for Structured Observations
Fast Penalized Regression and Cross Validation for Tall Data with the oem Package
Design and Analysis of 5G Scenarios with ‘simmer’: An R Package for Fast DES Prototyping
Optimal MDS codes for cooperative repair
Information Directed Sampling and Bandits with Heteroscedastic Noise

Advertisements