Classification without labels: Learning from mixed samples in high energy physics

Modern machine learning techniques can be used to construct powerful models for difficult collider physics problems. In many applications, however, these models are trained on imperfect simulations due to a lack of truth-level information in the data, which risks the model learning artifacts of the simulation. In this paper, we introduce the paradigm of classification without labels (CWoLa) in which a classifier is trained to distinguish statistical mixtures of classes, which are common in collider physics. Crucially, neither individual labels nor class proportions are required, yet we prove that the optimal classifier in the CWoLa paradigm is also the optimal classifier in the traditional fully-supervised case where all label information is available. After demonstrating the power of this method in an analytical toy example, we consider a realistic benchmark for collider physics: distinguishing quark- versus gluon-initiated jets using mixed quark/gluon training samples. More generally, CWoLa can be applied to any classification problem where labels or class proportions are unknown or simulations are unreliable, but statistical mixtures of the classes are available.


Anomaly Detection on Graph Time Series

In this paper, we use variational recurrent neural network to investigate the anomaly detection problem on graph time series. The temporal correlation is modeled by the combination of recurrent neural network (RNN) and variational inference (VI), while the spatial information is captured by the graph convolutional network. In order to incorporate external factors, we use feature extractor to augment the transition of latent variables, which can learn the influence of external factors. With the target function as accumulative ELBO, it is easy to extend this model to on-line method. The experimental study on traffic flow data shows the detection capability of the proposed method.


Tikhonov Regularization for Long Short-Term Memory Networks

It is a well-known fact that adding noise to the input data often improves network performance. While the dropout technique may be a cause of memory loss, when it is applied to recurrent connections, Tikhonov regularization, which can be regarded as the training with additive noise, avoids this issue naturally, though it implies regularizer derivation for different architectures. In case of feedforward neural networks this is straightforward, while for networks with recurrent connections and complicated layers it leads to some difficulties. In this paper, a Tikhonov regularizer is derived for Long-Short Term Memory (LSTM) networks. Although it is independent of time for simplicity, it considers interaction between weights of the LSTM unit, which in theory makes it possible to regularize the unit with complicated dependences by using only one parameter that measures the input data perturbation. The regularizer that is proposed in this paper has three parameters: one to control the regularization process, and other two to maintain computation stability while the network is being trained. The theory developed in this paper can be applied to get such regularizers for different recurrent neural networks with Hadamard products and Lipschitz continuous functions.


Using Deep Neural Networks to Automate Large Scale Statistical Analysis for Big Data Applications

Statistical analysis (SA) is a complex process to deduce population properties from analysis of data. It usually takes a well-trained analyst to successfully perform SA, and it becomes extremely challenging to apply SA to big data applications. We propose to use deep neural networks to automate the SA process. In particular, we propose to construct convolutional neural networks (CNNs) to perform automatic model selection and parameter estimation, two most important SA tasks. We refer to the resulting CNNs as the neural model selector and the neural model estimator, respectively, which can be properly trained using labeled data systematically generated from candidate models. Simulation study shows that both the selector and estimator demonstrate excellent performances. The idea and proposed framework can be further extended to automate the entire SA process and have the potential to revolutionize how SA is performed in big data analytics.


A Machine Learning Approach to Routing

Can ideas and techniques from machine learning be leveraged to automatically generate ‘good’ routing configurations? We investigate the power of data-driven routing protocols. Our results suggest that applying ideas and techniques from deep reinforcement learning to this context yields high performance, motivating further research along these lines.


From Random Walks to Random Leaps: Generalizing Classic Markov Chains for Big Data Applications

Simple random walks are a basic staple of the foundation of probability theory and form the building block of many useful and complex stochastic processes. In this paper we study a natural generalization of the random walk to a process in which the allowed step sizes take values in the set \{\pm1,\pm2,\ldots,\pm k\}, a process we call a random leap. The need to analyze such models arises naturally in modern-day data science and so-called ‘big data’ applications. We provide closed-form expressions for quantities associated with first passage times and absorption events of random leaps. These expressions are formulated in terms of the roots of the characteristic polynomial of a certain recurrence relation associated with the transition probabilities. Our analysis shows that the expressions for absorption probabilities for the classical simple random walk are a special case of a universal result that is very elegant. We also consider an important variant of a random leap: the reflecting random leap. We demonstrate that the reflecting random leap exhibits more interesting behavior in regard to the existence of a stationary distribution and properties thereof. Questions relating to recurrence/transience are also addressed, as well as an application of the random leap.


TensorFlow Enabled Genetic Programming

Genetic Programming, a kind of evolutionary computation and machine learning algorithm, is shown to benefit significantly from the application of vectorized data and the TensorFlow numerical computation library on both CPU and GPU architectures. The open source, Python Karoo GP is employed for a series of 190 tests across 6 platforms, with real-world datasets ranging from 18 to 5.5M data points. This body of tests demonstrates that datasets measured in tens and hundreds of data points see 2-15x improvement when moving from the scalar/SymPy configuration to the vector/TensorFlow configuration, with a single core performing on par or better than multiple CPU cores and GPUs. A dataset composed of 90,000 data points demonstrates a single vector/TensorFlow CPU core performing 875x better than 40 scalar/Sympy CPU cores. And a dataset containing 5.5M data points sees GPU configurations out-performing CPU configurations on average by 1.3x.


Simple Analysis of Sparse, Sign-Consistent JL
Structural Damage Identification Using Piezoelectric Impedance Measurement with Sparse Multi-Objective DIRECT
Personalized Cinemagraphs using Semantic Understanding and Collaborative Learning
Learning Policies for Adaptive Tracking with Deep Feature Cascades
Dioid Partitions of Groups
Random Binary Trees for Approximate Nearest Neighbour Search in Binary Space
Hierarchically-Attentive RNN for Album Summarization and Storytelling
ChromaTag: A Colored Marker and Fast Detection Algorithm
Scaling Deep Learning on GPU and Knights Landing clusters
Cleaning the correlation matrix with a denoising autoencoder
Identifying Reference Spans: Topic Modeling and Word Embeddings help IR
Loop-augmented forests and a variant of the Foulkes’ conjecture
Partial Information Near-Optimal Control of Forward-Backward Stochastic Differential System with Observation Noise
Dimensional and statistical foundations for accumulated damage models
Addendum to: Summary Information for Reasoning About Hierarchical Plans
Non-stationary Stochastic Optimization with Local Spatial and Temporal Changes
Left-invariant geometries on $\mathrm{SU}(2)$ are uniformly doubling
Above and Beyond the Landauer Bound: Thermodynamics of Modularity
A Unified Model for Near and Remote Sensing
‘Is there anything else I can help you with?’: Challenges in Deploying an On-Demand Crowd-Powered Conversational Agent
When Does the First Spurious Variable Get Selected by Sequential Regression Procedures?
Communication-Free Parallel Supervised Topic Models
Application Level High Speed Transfer Optimization Based on Historical Analysis and Real-time Tuning
Noise sensitivity and Voronoi percolation
Online Interactive Collaborative Filtering Using Multi-Armed Bandit with Dependent Arms
Opportunistic Scheduling of Machine Type Communications as Underlay to Cellular Networks
Heterogeneous Networks with Power-Domain NOMA: Coverage, Throughput and Power Allocation Analysis
Design and Optimization of VoD schemes with Client Caching in Wireless Multicast Networks
Two-vertex generators of Jacobians of graphs
TandemNet: Distilling Knowledge from Medical Images Using Diagnostic Reports as Optional Semantic References
Conformal Bootstrap Analysis for Single and Branched Polymers
A note on the vertex arboricity of signed graphs
A Simple and Realistic Pedestrian Model for Crowd Simulation and Application
Distance-preserving Subgraphs of Interval Graphs
Stabilization of quasistatic evolution of elastoplastic systems subject to periodic loading
Semantic Video CNNs through Representation Warping
On Approximate Welfare- and Revenue-Maximizing Equilibria for Size-Interchangeable Bidders
Sure profits via flash strategies and the impossibility of predictable jumps
Bounds on the Capacity of Memoryless Simplified Fiber-Optical Channel Models
Location Name Extraction from Targeted Text Streams using Gazetteer-based Statistical Language Models
Modality-bridge Transfer Learning for Medical Image Classification
Spectrum of signless 1-Laplacian on simplicial complexes
Weak universality for a class of 3d stochastic reaction-diffusion models
On sparsity and power-law properties of graphs based on exchangeable point processes
Twins and Vertex- Identification on Graphs
Hypotheses testing on infinite random graphs
Attention-Aware Face Hallucination via Deep Reinforcement Learning
The Pandora multi-algorithm approach to automated pattern recognition of cosmic-ray muon and neutrino events in the MicroBooNE detector
Probability distribution for monthly precipitation data in India
The Static and Stochastic VRPTW with both random Customers and Reveal Times: algorithms and recourse strategies
Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models
Cooperation promotes biodiversity and stability in a model ecosystem
Point process-based modeling of multiple debris flow landslides using INLA: an application to the 2009 Messina disaster
Pseudo-differential operators and related additive geometric stable processes
Many-body localization phase in a spin-driven chiral multiferroic chain
Utilizing Embeddings for Ad-hoc Retrieval by Document-to-document Similarity
Energy-efficient Geo-Distributed Big Data Analytics
Neural and Statistical Methods for Leveraging Meta-information in Machine Translation
The Curious Bounds of Floor Function Sums
Enhancement of large fluctuations to extinction in adaptive networks
Improving the Peña-Prieto ‘KSD’ procedure
Some comments on computational mechanics, complexity measures, and all that
Achieving an Efficient and Fair Equilibrium Through Taxation
Tosca: Operationalizing Commitments Over Information Protocols
DNN and CNN with Weighted and Multi-task Loss Functions for Audio Event Detection
Stability Analysis of Constrained Optimization Problem Using Passivity Approach
Lower bounds for several online variants of bin packing
Automatic Selection of t-SNE Perplexity
Sampling perspectives on sparse exchangeable graphs
The free-fermionic $C^{(1)}_2$ loop model, double dimers and Kashaev’s recurrence
Privacy-Preserving Economic Dispatch in Competitive Electricity Market
SESA: Supervised Explicit Semantic Analysis
The mixed degree of families of lattice polytopes
On random quadratic forms: supports of potential local maxima
Robust polynomial regression up to the information theoretic limit
Contextuality from missing and versioned data
Note on parity and the irreducible characters of the symmetric group
Activated Aging Dynamics and Effective Trap Model Description in the Random Energy Model
Neural Machine Translation Leveraging Phrase-based Models in a Hybrid Search
Fast and accurate Bayesian model criticism and conflict diagnostics using R-INLA
Analysis of Convolutional Neural Networks for Document Image Classification
Simulating a Shared Register in a System that Never Stops Changing
3D Line Segments Extraction from Semi-dense SLAM
Document Image Binarization with Fully Convolutional Neural Networks
Impact of Communication Delays on the Convergence Rate of Distributed Optimization Algorithms
Motion Feature Augmented Recurrent Neural Network for Skeleton-based Dynamic Hand Gesture Recognition
TPC: Temporal Preservation Convolutional Networks for Precise Temporal Action Localization
Perfect quantum state transfer in weighted paths with potentials (loops) using orthogonal polynomials
The sign clusters of the massless Gaussian free field percolate on $\mathbb{Z}^d$, $d \geqslant 3$ (and more)
Subset Selection with Shrinkage: Sparse Linear Modeling when the SNR is low
Learning to Synthesize a 4D RGBD Light Field from a Single Image
Multicarrier Relay Selection for Full-Duplex Relay-Assisted OFDM D2D Systems
Outage Performance of Two-Hop OFDM Systems with Spatially Random Decode-and-Forward Relays
Cell Detection with Deep Convolutional Neural Network and Compressed Sensing
Systematic Testing of Convolutional Neural Networks for Autonomous Driving
Thinking Fast, Thinking Slow! Combining Knowledge Graphs and Vector Spaces
Radical-level Ideograph Encoder for RNN-based Sentiment Analysis of Chinese and Japanese
Limit theorems for non-linear functionals of stationary Gaussian random fields
Noncommutative Catalan numbers

Advertisements