Uncertainty relations with quantum memory for the Wehrl entropy

We prove two new fundamental uncertainty relations with quantum memory for the Wehrl entropy. These are the first entropic uncertainty relations with quantum memory ever proposed for a single measurement. The first relation applies to the bipartite memory scenario and provides a lower bound to the Wehrl entropy of a quantum state conditioned on the memory quantum system in terms of the von Neumann entropy of the same quantum state conditioned on the same memory quantum system. The second relation applies to the tripartite memory scenario and provides a lower bound to the sum of the Wehrl entropy of a quantum state conditioned on the first memory quantum system with the Wehrl entropy of the same state conditioned on the second memory quantum system. The Wehrl entropy of a quantum state is the Shannon differential entropy of the outcome of a heterodyne measurement performed on the state. The heterodyne measurement is one of the main measurements in quantum optics, and lies at the basis of one of the most promising protocols for quantum key distribution. These fundamental entropic uncertainty relations will be a valuable tool in quantum information, and will e.g. find application in security proofs of quantum key distribution protocols in the asymptotic regime and in entanglement witnessing in quantum optics.

Cross-Platform Emoji Interpretation: Analysis, a Solution, and Applications

Most social media platforms are largely based on text, and users often write posts to describe where they are, what they are seeing, and how they are feeling. Because written text lacks the emotional cues of spoken and face-to-face dialogue, ambiguities are common in written language. This problem is exacerbated in the short, informal nature of many social media posts. To bypass this issue, a suite of special characters called ’emojis,’ which are small pictograms, are embedded within the text. Many emojis are small depictions of facial expressions designed to help disambiguate the emotional meaning of the text. However, a new ambiguity arises in the way that emojis are rendered. Every platform (Windows, Mac, and Android, to name a few) renders emojis according to their own style. In fact, it has been shown that some emojis can be rendered so differently that they look ‘happy’ on some platforms, and ‘sad’ on others. In this work, we use real-world data to verify the existence of this problem. We verify that the usage of the same emoji can be significantly different across platforms, with some emojis exhibiting different sentiment polarities on different platforms. We propose a solution to identify the intended emoji based on the platform-specific nature of the emoji used by the author of a social media post. We apply our solution to sentiment analysis, a task that can benefit from the emoji calibration technique we use in this work. We conduct experiments to evaluate the effectiveness of the mapping in this task.

WOAH: Preliminaries to Zero-shot Ontology Learning for Conversational Agents

The present paper presents the Weighted Ontology Approximation Heuristic (WOAH), a novel zero-shot approach to ontology estimation for conversational agents development environments. This methodology extracts verbs and nouns separately from data by distilling the dependencies obtained and applying similarity and sparsity metrics to generate an ontology estimation configurable in terms of the level of generalization.

Technical Report: Accelerating Dynamic Graph Analytics on GPUs

As graph analytics often involves compute-intensive operations, GPUs have been extensively used to accelerate the processing. However, in many applications such as social networks, cyber security, and fraud detection, their representative graphs evolve frequently and one has to perform a rebuild of the graph structure on GPUs to incorporate the updates. Hence, rebuilding the graphs becomes the bottleneck of processing high-speed graph streams. In this paper, we propose a GPU-based dynamic graph storage scheme to support existing graph algorithms easily. Furthermore, we propose parallel update algorithms to support efficient stream updates so that the maintained graph is immediately available for high-speed analytic processing on GPUs. Our extensive experiments with three streaming applications on large-scale real and synthetic datasets demonstrate the superior performance of our proposed approach.

Individualized Multi-directional Variable Selection

In this paper we propose an individualized variable selection approach to select different relevant variables for different individuals. In contrast to conventional model selection approaches, the key component of the new approach is to construct a separation penalty with multi-directional shrinkages including zero, which facilitates individualized modeling to distinguish strong signals from noisy ones. As a byproduct, the proposed model identifies subgroups among which individuals share similar effects, and thus improves estimation efficiency and personalized prediction accuracy. Another advantage of the proposed model is that it can incorporate within-subject correlation for longitudinal data.We provide a general theoretical foundation under a double-divergence modeling framework where the number of subjects and the number of repeated measurements both go to infinity, and therefore involves high-dimensional individual parameters. In addition, we present the oracle property for the proposed estimator to ensure its optimal large sample property. Simulation studies and an application to HIV longitudinal data are illustrated to compare the new approach to existing penalization methods.

Deep Reinforcement Learning for Conversational AI

Deep reinforcement learning is revolutionizing the artificial intelligence field. Currently, it serves as a good starting point for constructing intelligent autonomous systems which offer a better knowledge of the visual world. It is possible to scale deep reinforcement learning with the use of deep learning and do amazing tasks such as use of pixels in playing video games. In this paper, key concepts of deep reinforcement learning including reward function, differences between reinforcement learning and supervised learning and models for implementation of reinforcement are discussed. Key challenges related to the implementation of reinforcement learning in conversational AI domain are identified as well as discussed in detail. Various conversational models which are based on deep reinforcement learning (as well as deep learning) are also discussed. In summary, this paper discusses key aspects of deep reinforcement learning which are crucial for designing an efficient conversational AI.

Improving the Diversity of Top-N Recommendation via Determinantal Point Process

Recommender systems take the key responsibility to help users discover items that they might be interested in. Many recommendation algorithms are built upon similarity measures, which usually result in low intra-list diversity. The deficiency in capturing the whole range of user interest often leads to poor satisfaction. To solve this problem, increasing attention has been paid on improving the diversity of recommendation results in recent years. In this paper, we propose a novel method to improve the diversity of top-N recommendation results based on the determinantal point process (DPP), which is an elegant model for characterizing the repulsion phenomenon. We propose an acceleration algorithm to greatly speed up the process of the result inference, making our algorithm practical for large-scale scenarios. We also incorporate a tunable parameter into the DPP model which allows the users to smoothly control the level of diversity. More diversity metrics are introduced to better evaluate diversification algorithms. We have evaluated our algorithm on several public datasets, and compared it thoroughly with other reference algorithms. Results show that our proposed algorithm provides a much better accuracy-diversity trade-off with comparable efficiency.

Multi-scale Deep Learning Architectures for Person Re-identification

Person Re-identification (re-id) aims to match people across non-overlapping camera views in a public space. It is a challenging problem because many people captured in surveillance videos wear similar clothes. Consequently, the differences in their appearance are often subtle and only detectable at the right location and scales. Existing re-id models, particularly the recently proposed deep learning based ones match people at a single scale. In contrast, in this paper, a novel multi-scale deep learning model is proposed. Our model is able to learn deep discriminative feature representations at different scales and automatically determine the most suitable scales for matching. The importance of different spatial locations for extracting discriminative features is also learned explicitly. Experiments are carried out to demonstrate that the proposed model outperforms the state-of-the art on a number of benchmarks

Unsupervised state representation learning with robotic priors: a robustness benchmark

Our understanding of the world depends highly on our capacity to produce intuitive and simplified representations which can be easily used to solve problems. We reproduce this simplification process using a neural network to build a low dimensional state representation of the world from images acquired by a robot. As in Jonschkowski et al. 2015, we learn in an unsupervised way using prior knowledge about the world as loss functions called robotic priors and extend this approach to high dimension richer images to learn a 3D representation of the hand position of a robot from RGB images. We propose a quantitative evaluation of the learned representation using nearest neighbors in the state space that allows to assess its quality and show both the potential and limitations of robotic priors in realistic environments. We augment image size, add distractors and domain randomization, all crucial components to achieve transfer learning to real robots. Finally, we also contribute a new prior to improve the robustness of the representation. The applications of such low dimensional state representation range from easing reinforcement learning (RL) and knowledge transfer across tasks, to facilitating learning from raw data with more efficient and compact high level representations. The results show that the robotic prior approach is able to extract high level representation as the 3D position of an arm and organize it into a compact and coherent space of states in a challenging dataset.

A Generic Framework for Interesting Subspace Cluster Detection in Multi-attributed Networks

Detection of interesting (e.g., coherent or anomalous) clusters has been studied extensively on plain or univariate networks, with various applications. Recently, algorithms have been extended to networks with multiple attributes for each node in the real-world. In a multi-attributed network, often, a cluster of nodes is only interesting for a subset (subspace) of attributes, and this type of clusters is called subspace clusters. However, in the current literature, few methods are capable of detecting subspace clusters, which involves concurrent feature selection and network cluster detection. These relevant methods are mostly heuristic-driven and customized for specific application scenarios. In this work, we present a generic and theoretical framework for detection of interesting subspace clusters in large multi-attributed networks. Specifically, we propose a subspace graph-structured matching pursuit algorithm, namely, SG-Pursuit, to address a broad class of such problems for different score functions (e.g., coherence or anomalous functions) and topology constraints (e.g., connected subgraphs and dense subgraphs). We prove that our algorithm 1) runs in nearly-linear time on the network size and the total number of attributes and 2) enjoys rigorous guarantees (geometrical convergence rate and tight error bound) analogous to those of the state-of-the-art algorithms for sparse feature selection problems and subgraph detection problems. As a case study, we specialize SG-Pursuit to optimize a number of well-known score functions for two typical tasks, including detection of coherent dense and anomalous connected subspace clusters in real-world networks. Empirical evidence demonstrates that our proposed generic algorithm SG-Pursuit performs superior over state-of-the-art methods that are designed specifically for these two tasks.

Detection of Anomalies in Large Scale Accounting Data using Deep Autoencoder Networks

Learning to detect fraud in large-scale accounting data is one of the long-standing challenges in financial statement audits or forensic investigations. Nowadays, the majority of applied techniques refer to handcrafted rules derived from known fraud scenarios. While fairly successful, these rules exhibit the drawback that fraudsters gradually adapt and find ways to circumvent them. In addition, these rigid rules often fail to generalize beyond known fraud scenarios. To overcome this challenge we propose a novel method of detecting anomalous journal entries using deep autoencoder networks. We demonstrate that the trained networks’ reconstruction error regularized by the individual attribute probabilities of a journal entry can be interpreted as a highly adaptive anomaly assessment. Our empirical study, based on two datasets of real-world journal entries, demonstrates the effectiveness of the approach and outperforms several baseline anomaly detection methods. Resulting in a fraction of less than 0.15% (0.7%) of detected anomalous entries while achieving a high detection precision of 19.71% (9.26%). Initial feedback received by accountants underpinned the quality of our approach capturing highly relevant anomalies in the data. We envision this method as an important supplement to the forensic examiners’ toolbox.

Algorithms and Architecture for Real-time Recommendations at News UK

Recommendation systems are recognised as being hugely important in industry, and the area is now well understood. At News UK, there is a requirement to be able to quickly generate recommendations for users on news items as they are published. However, little has been published about systems that can generate recommendations in response to changes in recommendable items and user behaviour in a very short space of time. In this paper we describe a new algorithm for updating collaborative filtering models incrementally, and demonstrate its effectiveness on clickstream data from The Times. We also describe the architecture that allows recommendations to be generated on the fly, and how we have made each component scalable. The system is currently being used in production at News UK.

Conversational Exploratory Search via Interactive Storytelling

Conversational interfaces are likely to become more efficient, intuitive and engaging way for human-computer interaction than today’s text or touch-based interfaces. Current research efforts concerning conversational interfaces focus primarily on question answering functionality, thereby neglecting support for search activities beyond targeted information lookup. Users engage in exploratory search when they are unfamiliar with the domain of their goal, unsure about the ways to achieve their goals, or unsure about their goals in the first place. Exploratory search is often supported by approaches from information visualization. However, such approaches cannot be directly translated to the setting of conversational search. In this paper we investigate the affordances of interactive storytelling as a tool to enable exploratory search within the framework of a conversational interface. Interactive storytelling provides a way to navigate a document collection in the pace and order a user prefers. In our vision, interactive storytelling is to be coupled with a dialogue-based system that provides verbal explanations and responsive design. We discuss challenges and sketch the research agenda required to put this vision into life.

Learning Functional Causal Models with Generative Neural Networks

We introduce a new approach to functional causal modeling from observational data. The approach, called Causal Generative Neural Networks (CGNN), leverages the power of neural networks to learn a generative model of the joint distribution of the observed variables, by minimizing the Maximum Mean Discrepancy between generated and observed data. An approximate learning criterion is proposed to scale the computational cost of the approach to linear complexity in the number of observations. The performance of CGNN is studied throughout three experiments. First, we apply CGNN to the problem of cause-effect inference, where two CGNNs model P(Y|X,\textrm{noise}) and P(X|Y,\textrm{noise}) identify the best causal hypothesis out of X\rightarrow Y and Y\rightarrow X. Second, CGNN is applied to the problem of identifying v-structures and conditional independences. Third, we apply CGNN to problem of multivariate functional causal modeling: given a skeleton describing the dependences in a set of random variables \{X_1, \ldots, X_d\}, CGNN orients the edges in the skeleton to uncover the directed acyclic causal graph describing the causal structure of the random variables. On all three tasks, CGNN is extensively assessed on both artificial and real-world data, comparing favorably to the state-of-the-art. Finally, we extend CGNN to handle the case of confounders, where latent variables are involved in the overall causal model.

Granger Mediation Analysis of Multiple Time Series with an Application to fMRI

It becomes increasingly popular to perform mediation analysis for complex data from sophisticated experimental studies. In this paper, we present Granger Mediation Analysis (GMA), a new framework for causal mediation analysis of multiple time series. This framework is motivated by a functional magnetic resonance imaging (fMRI) experiment where we are interested in estimating the mediation effects between a randomized stimulus time series and brain activity time series from two brain regions. The stable unit treatment assumption for causal mediation analysis is thus unrealistic for this type of time series data. To address this challenge, our framework integrates two types of models: causal mediation analysis across the variables and vector autoregressive models across the temporal observations. We further extend this framework to handle multilevel data to address individual variability and correlated errors between the mediator and the outcome variables. These models not only provide valid causal mediation for time series data but also model the causal dynamics across time. We show that the modeling parameters in our models are identifiable, and we develop computationally efficient methods to maximize the likelihood-based optimization criteria. Simulation studies show that our method reduces the estimation bias and improve statistical power, compared to existing approaches. On a real fMRI data set, our approach not only infers the causal effects of brain pathways but accurately captures the feedback effect of the outcome region on the mediator region.

Low-temperature anomalies of a vapor deposited glass
Relative growth of the partial sums of certain random Fibonacci-like sequences
Spanning trees with few branch vertices
A note on strong-consistency of componentwise ARH(1) predictors
Pseudo-edge unfoldings of convex polyhedra
Optimal designs for enzyme inhibition kinetic models
A general asymptotic formula for distinct partitions
Minimizing the Age of the Information through Queues
A counterexample to a conjecture on facial unique-maximal colorings
Dynamic Pricing in Competitive Markets
An improved quality control chart to monitor the mean based on ranked sets
On Coordinate Minimization of Convex Piecewise-Affine Functions
Abstractions for AI-Based User Interfaces and Systems
Weighted and locally bounded list-colorings in split graphs, cographs, and partial k-trees
Ramsey equivalence of $K_n$ and $K_n+K_{n-1}$ for multiple colours
Two-sample Statistics Based on Anisotropic Kernels
ImageNet Training in 24 Minutes
Recent Advances in Fog Radio Access Networks: Performance Analysis and Radio Resource Allocation
ClickBAIT: Click-based Accelerated Incremental Training of Convolutional Neural Networks
Learning Intrinsic Sparse Structures within Long Short-term Memory
On Linear Quadratic Optimal Control of Discrete-Time Complex-Valued Linear Systems
Attention-based CNN Matching Net
Secrecy-Optimized Resource Allocation for Device-to-Device Communication Underlaying Heterogeneous Networks
Self-Guiding Multimodal LSTM – when we do not have a perfect training dataset for image captioning
Disentangled Variational Auto-Encoder for Semi-supervised Learning
A Stability-constrained Optimization Framework for Lur’e Systems and Application in Power Grids
Data-driven Job Search Engine Using Skills and Company Attribute Filters
Analytic Combinatorics in Several Variables: Effective Asymptotics and Lattice Path Enumeration
Roman domination: changing, unchanging, $γ_R$-graphs
Feature-Fused SSD: Fast Detection for Small Objects
Learning Compact Geometric Features
Asian Stamps Identification and Classification System
Accelerating SGD for Distributed Deep-Learning Using Approximated Hessian Matrix
Shapechanger: Environments for Transfer Learning
A perturbation analysis of stochastic matrix Riccati diffusions
Joint Hierarchical Category Structure Learning and Large-Scale Image Classification
A Deep Generative Framework for Paraphrase Generation
Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning
Robust Kernelized Multi-View Self-Representations for Clustering by Tensor Multi-Rank Minimization
Viewpoint Invariant Action Recognition using RGB-D Videos
Absolute continuity of non-homogeneous self-similar measures
Unsupervised Aspect Term Extraction with B-LSTM & CRF using Automatically Labelled Datasets
Multi-Label Zero-Shot Human Action Recognition via Joint Latent Embedding
Boundary optimization for rough sets
A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications
Dependence Modeling in Ultra High Dimensions with Vine Copulas and the Graphical Lasso
Distributed Random Process for a Large-Scale Peer-to-Peer Lottery
Generalized Internal Boundaries (GIB)
On the stability of network indices defined by means of matrix functions
Random walk on a perturbation of the infinitely-fast mixing interchange process
Energy efficiency of mmWave massive MIMO precoding with low-resolution DACs
Regular Variation in a Fixpoint Problem for Single- and Multiclass Branching Processes and Queues
Open Multi-Agent Systems: Gossiping with Random Arrivals and Departures
Variable Version Lovász Local Lemma: Beyond Shearer’s Bound
The binary $q$-analogue of the Fano plane has a trivial automorphism group
Modeling correlated bursts by the bursty-get-burstier mechanism
Locating-Dominating Sets of Functigraphs
Operator Fitting for Parameter Estimation of Stochastic Differential Equations
Trend Detection based Regret Minimization for Bandit Problems
Device Activity and Embedded Information Bit Detection Using AMP in Massive MIMO
Interleaved sequences of geometric sequences binarized with Legendre symbol of two types
Crowdsourcing Paper Screening in Systematic Literature Reviews
Optimal Base Station Design with Limited Fronthaul: Massive Bandwidth or Massive MIMO?
On Achieving a Positive Rate in the Source Model Key Agreement Problem
On Finding Equilibrium Stopping Times for Time-Inconsistent Markovian Problems
The Dominating Set Problem in Geometric Intersection Graphs
Fast OLAP Query Execution in Main Memory on Large Data in a Cluster
Masquer Hunter: Adversarial Occlusion-aware Face Detection
Multi-model ensembles for ecosystem prediction
Worst-case convergence analysis of gradient and Newton methods through semidefinite programming performance estimation
MIMO with More Users than RF Chains
Modelled distributions of Triebel-Lizorkin type
A generalization of Arc-Kayles
Transcribing Against Time
A Spectral Method for Activity Shaping in Continuous-Time Information Cascades
3D Placement of an Unmanned Aerial Vehicle Base Station for Maximum Coverage of Users with Different QoS Requirements
Towards a Quantitative Averaging Principle for Stochastic Differential Equations
Differential equations driven by rough paths with jumps
Finite-time scaling at the Anderson transition for vibrations in solids
On converse bounds for classical communication over quantum channels
Dimension 1 sequences are close to randoms
The Inuence of Misspecified Covariance on False Discovery Control when Using Posterior Probabilities
Triviality of the ground-state metastate in long-range Ising spin glasses in one dimension
Cooperative Motion Planning for Non-Holonomic Agents with Value Iteration Networks
Bayesian Nonparametric Inference for Panel Count Data with an Informative Observation Process
Mixtures and products in two graphical models
On the Difference Between Closest, Furthest, and Orthogonal Pairs: Nearly-Linear vs Barely-Subquadratic Complexity in Computational Geometry
Sampling of probability measures in the convex order and approximation of Martingale Optimal Transport problems
Optimal approximation of piecewise smooth functions using deep ReLU neural networks
Atomic theory of viscoelastic response and memory effects in metallic glasses
Commonsense Scene Semantics for Cognitive Robotics: Towards Grounding Embodied Visuo-Locomotive Interactions
The Orthogonal Vectors Conjecture for Branching Programs and Formulas
And That’s A Fact: Distinguishing Factual and Emotional Argumentation in Online Dialogue
A Fair Individual Rate Comparison between MIMO-NOMA and MIMO-OMA
A Two-Phase Power Allocation Scheme for CRNs Employing NOMA
Are you serious?: Rhetorical Questions and Sarcasm in Social Media Dialog
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage Requirement
Top-Down Saliency Detection Driven by VisualClassification
Harvesting Creative Templates for Generating Stylistically Varied Restaurant Reviews
The first gap for total curvatures of planar graphs with nonnegative curvature
Video Synopsis Generation Using Spatio-Temporal Groups
String Attractors
Cystoid macular edema segmentation of Optical Coherence Tomography images using fully convolutional neural networks and fully connected CRFs