Distributed Vector Representation Of Shopping Items, The Customer And Shopping Cart To Build A Three Fold Recommendation System

The main idea of this paper is to represent shopping items through vectors because these vectors act as the base for building em- beddings for customers and shopping carts. Also, these vectors are input to the mathematical models that act as either a recommendation engine or help in targeting potential customers. We have used exponential family embeddings as the tool to construct two basic vectors – product embeddings and context vectors. Using the basic vectors, we build combined embeddings, trip embeddings and customer embeddings. Combined embeddings mix linguistic properties of product names with their shopping patterns. The customer embeddings establish an understand- ing of the buying pattern of customers in a group and help in building customer profile. For example a customer profile can represent customers frequently buying pet-food. Identifying such profiles can help us bring out offers and discounts. Similarly, trip embeddings are used to build trip profiles. People happen to buy similar set of products in a trip and hence their trip embeddings can be used to predict the next product they would like to buy. This is a novel technique and the first of its kind to make recommendation using product, trip and customer embeddings.


Maximum Margin Principal Components

Principal Component Analysis (PCA) is a very successful dimensionality reduction technique, widely used in predictive modeling. A key factor in its widespread use in this domain is the fact that the projection of a dataset onto its first K principal components minimizes the sum of squared errors between the original data and the projected data over all possible rank K projections. Thus, PCA provides optimal low-rank representations of data for least-squares linear regression under standard modeling assumptions. On the other hand, when the loss function for a prediction problem is not the least-squares error, PCA is typically a heuristic choice of dimensionality reduction — in particular for classification problems under the zero-one loss. In this paper we target classification problems by proposing a straightforward alternative to PCA that aims to minimize the difference in margin distribution between the original and the projected data. Extensive experiments show that our simple approach typically outperforms PCA on any particular dataset, in terms of classification error, though this difference is not always statistically significant, and despite being a filter method is frequently competitive with Partial Least Squares (PLS) and Lasso on a wide range of datasets.


ParlAI: A Dialog Research Software Platform

We introduce ParlAI (pronounced ‘par-lay’), an open-source software platform for dialog research implemented in Python, available at http://parl.ai. Its goal is to provide a unified framework for training and testing of dialog models, including multitask training, and integration of Amazon Mechanical Turk for data collection, human evaluation, and online/reinforcement learning. Over 20 tasks are supported in the first release, including popular datasets such as SQuAD, bAbI tasks, MCTest, WikiQA, QACNN, QADailyMail, CBT, bAbI Dialog, Ubuntu, OpenSubtitles and VQA. Included are examples of training neural models with PyTorch and Lua Torch, including both batch and hogwild training of memory networks and attentive LSTMs.


DeepXplore: Automated Whitebox Testing of Deep Learning Systems

Deep learning (DL) systems are increasingly deployed in security-critical domains including self-driving cars and malware detection, where the correctness and predictability of a system’s behavior for corner-case inputs are of great importance. However, systematic testing of large-scale DL systems with thousands of neurons and millions of parameters for all possible corner-cases is a hard problem. Existing DL testing depends heavily on manually labeled data and therefore often fails to expose different erroneous behaviors for rare inputs. We present DeepXplore, the first whitebox framework for systematically testing real-world DL systems. We address two problems: (1) generating inputs that trigger different parts of a DL system’s logic and (2) identifying incorrect behaviors of DL systems without manual effort. First, we introduce neuron coverage for estimating the parts of DL system exercised by a set of test inputs. Next, we leverage multiple DL systems with similar functionality as cross-referencing oracles and thus avoid manual checking for erroneous behaviors. We demonstrate how finding inputs triggering differential behaviors while achieving high neuron coverage for DL algorithms can be represented as a joint optimization problem and solved efficiently using gradient-based optimization techniques. DeepXplore finds thousands of incorrect corner-case behaviors in state-of-the-art DL models trained on five popular datasets. For all tested DL models, on average, DeepXplore generated one test input demonstrating incorrect behavior within one second while running on a commodity laptop. The inputs generated by DeepXplore achieved 33.2% higher neuron coverage on average than existing testing methods. We further show that the test inputs generated by DeepXplore can also be used to retrain the corresponding DL model to improve classification accuracy or identify polluted training data.


Self-Learning Monte Carlo Method: Continuous-Time Algorithm

The recently-introduced self-learning Monte Carlo method is a general-purpose numerical method that speeds up Monte Carlo simulations by training an effective model to propose uncorrelated configurations in the Markov chain. We implement this method in the framework of continuous time Monte Carlo method with auxiliary field in quantum impurity models. We introduce and train a diagram generating function (DGF) to model the probability distribution of auxiliary field configurations in continuous imaginary time, at all orders of diagrammatic expansion. By using DGF to propose global moves in configuration space, we show that the self-learning continuous-time Monte Carlo method can significantly reduce the computational complexity of the simulation.


Many body localization with long range interactions

Supervised Machine Learning for Signals Having RRC Shaped Pulses

Bayer Demosaicking Using Optimized Mean Curvature over RGB channels

Hard and soft excitation of oscillations in memristor-based oscillators with a line of equilibria

Phase transitions in integer linear problems

Deterministic, Strategyproof, and Fair Cake Cutting

Functions on Antipower Prefix Lengths of the Thue-Morse Word

Direct Ensemble Estimation of Density Functionals

Interleaved Algorithms for Constrained Submodular Function Maximization

An exact upper bound on the size of minimal clique covers

CardiacNET: Segmentation of Left Atrium and Proximal Pulmonary Veins from MRI Using Multi-View CNN

Computing minimal generating systems for some special toric ideals

Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering

Wireless Information and Power Transfer over a Flat Fading AWGN channel: Nonlinearity and Asymmetric Gaussian Signaling

Political Footprints: Political Discourse Analysis using Pre-Trained Word Vectors

Toric log del Pezzo surfaces with one singularity

Structure preserving schemes for mean-field equations of collective behavior

Optimizing and Visualizing Deep Learning for Benign/Malignant Classification in Breast Tumors

Learning Gaussian Graphical Models Using Discriminated Hub Graphical Lasso

Automatic Goal Generation for Reinforcement Learning Agents

Re3 : Real-Time Recurrent Regression Networks for Object Tracking

Decoding Sentiment from Distributed Representations of Sentences

Elation KM-arcs

General auction method for real-valued optimal transport

Maximizing weighted Shannon entropy for network inference with little data

Minimax Risk Bounds for Piecewise Constant Models

Scalable Exact Parent Sets Identification in Bayesian Networks Learning with Apache Spark

Asynchronous parallel primal-dual block update methods

Fashion Forward: Forecasting Visual Style in Fashion

Ground state entanglement entropy for discrete-time two coupled harmonic oscillators

Learning a bidirectional mapping between human whole-body motion and natural language using deep recurrent neural networks

Linear Dimensionality Reduction in Linear Time: Johnson-Lindenstrauss-type Guarantees for Random Subspace

Slip-Size Distribution and Self-Organized Criticality in Block-Spring Models with Quenched Randomness

Phase Retrieval Using Structured Sparsity: A Sample Efficient Algorithmic Framework

The Weight Distribution of Quasi-quadratic Residue Codes

Shellable posets arising from even subgraphs of a graph

The maximum independent set problem on layered graphs

Regularity of powers of cover ideals of unimodular hypergraphs

On spectral properties of high-dimensional spatial-sign covariance matrices in elliptical distributions with applications

Vehicle Routing with Drones

Ballot tilings and increasing trees

Shifted tableaux and products of Schur’s symmetric functions

Discrete time pontryagin principles in banach spaces

Delving into adversarial attacks on deep policies

Elastic and Secure Energy Forecasting in Cloud Environments

Information Density as a Factor for Variation in the Embedding of Relative Clauses

Evolving Ensemble Fuzzy Classifier

Universal Dependencies Parsing for Colloquial Singaporean English

On the Achievable Spectral Efficiency of Spatial Modulation Aided Downlink Non-Orthogonal Multiple Access

Expected reliability of communication protocols

Protecting Against Untrusted Relays: An Information Self-encrypted Approach

Graph analysis and modularity of brain functional connectivity networks: searching for the optimal threshold

On permutation trinomials of type $x^{2p^s+r}+x^{p^{s}+r} +λx^r$

Symmetry breaking in two interacting populations of quadratic integrate-and-fire neurons

A Non-monotone Alternating Updating Method for A Class of Matrix Factorization Problems

Energy-efficient 3D UAV-BS Placement Versus Mobile Users’ Density and Circuit Power

Multi-Scale Factor Analysis of High-Dimensional Brain Signals

TableQA: Question Answering on Tabular Data

Accurate approximation of the distributions of the 3D Poisson-Voronoi typical cell geometrical features

Entropic selection of concepts in networks of similarity between documents

Effects of magma-induced stress within a cellular automaton model of volcanism

Probabilistic Combination of Noisy Points and Planes for RGB-D Odometry

A conjectural identity for certain parabolic Kazhdan–Lusztig polynomials

Plane Formation by Synchronous Mobile Robots without Chirality

A fully dense and globally consistent 3D map reconstruction approach for GI tract to enhance therapeutic relevance of the endoscopic capsule robot

Bayesian Inference of the Multi-Period Optimal Portfolio for an Exponential Utility

An analogue of big q-Jacobi polynomials in the algebra of symmetric functions

Exact augmented Lagrangian functions for nonlinear semidefinite programming

Robust Chance-Constrained Optimization for Power-Efficient and Secure SWIPT Systems

Exemplar or Matching: Modeling DCJ Problems with Unequal Content Genome Data

Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization

Stepwise Debugging of Answer-Set Programs

Learning Texture Manifolds with the Periodic Spatial GAN

Controlling the time discretization bias for the supremum of Brownian Motion

Online learnability of Statistical Relational Learning in anomaly detection

Products of Differences over Arbitrary Finite Fields

Does a growing static length scale control the glass transition?

Adaptive Clustering through Semidefinite Programming

Sensor Array Design Through Submodular Optimization

Pricing Identical Items

Robust randomized matchings

Penalized bias reduction in extreme value estimation for censored Pareto-type data, and long-tailed insurance applications

Lower bound for the coarse Ricci curvature of continuous-time pure jump processes

Symmetric Convex Sets with Minimal Gaussian Surface Area

Exact matrix product decay modes of a boundary driven cellular automaton

Energy-Sustainable Traffic Steering for 5G Mobile Networks

Multilayer Codes for Synchronization from Deletions

Relative entropy optimization in quantum information theory via semidefinite programming approximations

MUTAN: Multimodal Tucker Fusion for Visual Question Answering

Fast Inference for Intractable Likelihood Problems using Variational Bayes

Target-Quality Image Compression with Recurrent, Convolutional Neural Networks

Limited-Memory Matrix Adaptation for Large Scale Black-box Optimization

I Probe, Therefore I Am: Designing a Virtual Journalist with Human Emotions

Holomorphic primary fields in free CFT4 and Calabi-Yau orbifolds

Learning Spatiotemporal Features for Infrared Action Recognition with 3D Convolutional Neural Networks

Model-based Catheter Segmentation in MRI-images

Continuous Implicit Authentication for Mobile Devices based on Adaptive Neuro-Fuzzy Inference System

Continuum percolation theory of epimorphic regeneration

Examining collusion and voting biases between countries during the Eurovision song contest since 1957

Algorithms for $\ell_p$ Low Rank Approximation

Advertisements