Human in the Loop: Interactive Passive Automata Learning via Evidence-Driven State-Merging Algorithms

We present an interactive version of an evidence-driven state-merging (EDSM) algorithm for learning variants of finite state automata. Learning these automata often amounts to recovering or reverse engineering the model generating the data despite noisy, incomplete, or imperfectly sampled data sources, rather than optimizing a purely numeric target function. Domain expertise and human knowledge about the target domain can guide this process and are typically captured in parameter settings. Often, domain expertise is subconscious and not expressed explicitly. Directly interacting with the learning algorithm makes it easier to utilize this knowledge effectively.

Bilingual Document Alignment with Latent Semantic Indexing

We apply cross-lingual Latent Semantic Indexing to the Bilingual Document Alignment Task at WMT16. Reduced-rank singular value decomposition of a bilingual term-document matrix derived from known English/French page pairs in the training data allows us to map monolingual documents into a joint semantic space. Two variants of cosine similarity between the vectors that place each document into the joint semantic space are combined with a measure of string similarity between corresponding URLs to produce 1:1 alignments of English/French web pages in a variety of domains. The system achieves a recall of ca. 88% if no in-domain data is used for building the latent semantic model, and 93% if such data is included. Analysing the system’s errors on the training data, we argue that evaluating aligner performance based on exact URL matches under-estimates their true performance and propose an alternative that is able to account for duplicates and near-duplicates in the underlying data.

Weakly-supervised learning of visual relations

This paper introduces a novel approach for modeling visual relations between pairs of objects. We call a relation a triplet of the form (subject, predicate, object) where the predicate is typically a preposition (e.g. ‘under’, ‘in front of’) or a verb (‘hold’, ‘ride’) that links a pair of objects (subject, object). Learning such relations is challenging as the objects have different spatial configurations and appearances depending on the relation in which they occur. Another major challenge comes from the difficulty of obtaining annotations, especially at box-level, for all possible triplets, which makes both learning and evaluation difficult. The contributions of this paper are threefold. First, we design strong yet flexible visual features that encode the appearance and spatial configuration for pairs of objects. Second, we propose a weakly-supervised discriminative clustering model to learn relations from image-level labels only. Third, we introduce a new challenging dataset of unusual relations (UnRel) together with an exhaustive annotation, which enables accurate evaluation of visual relation retrieval. We show experimentally that our model achieves state-of-the-art results on the visual relationship dataset, significantly improving performance on previously unseen relations (zero-shot learning), and confirm this observation on our newly introduced UnRel dataset.

Cloud Computing – Architecture and Applications

In the era of the Internet of Things, with the explosive worldwide growth of electronic data volume and the associated need for processing, analysis, and storage of such data, it has become essential to exploit the power of massively parallel architectures for fast computation. Cloud computing provides a cheap source of such computing frameworks for large volumes of data in real-time applications. It is, therefore, not surprising that cloud computing has become a buzzword in the computing fraternity over the last decade. This book presents some critical applications in cloud frameworks along with some innovative designs of algorithms and architectures for deployment in cloud environments. It is a valuable source of knowledge for researchers, engineers, practitioners, and graduate and doctoral students working in the field of cloud computing. It will also be useful for faculty members of graduate schools and universities.

Orthogonal Recurrent Neural Networks with Scaled Cayley Transform

Recurrent Neural Networks (RNNs) are designed to handle sequential data but suffer from vanishing or exploding gradients. Recent work on Unitary Recurrent Neural Networks (uRNNs) has addressed this issue and, in some cases, exceeded the capabilities of Long Short-Term Memory networks (LSTMs). We propose a simpler and novel update scheme to maintain orthogonal recurrent weight matrices without using complex-valued matrices. This is done by parametrizing with a skew-symmetric matrix using the Cayley transform. Such a parametrization is unable to represent matrices that have negative one as an eigenvalue, but this limitation is overcome by scaling the recurrent weight matrix by a diagonal matrix consisting of ones and negative ones. The proposed training scheme involves a straightforward gradient calculation and update step. In several experiments, the proposed scaled Cayley orthogonal recurrent neural network (scoRNN) achieves superior results with fewer trainable parameters than other unitary RNNs.
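
The parametrization described above can be written out as a short worked derivation. This is a sketch under stated assumptions: the symbols $W$, $A$, $D$ and the sign convention of the Cayley transform are chosen here for illustration, and other conventions exist.

$$
W = (I + A)^{-1}(I - A)\,D, \qquad A^{\top} = -A, \qquad D = \mathrm{diag}(\pm 1, \dots, \pm 1).
$$

Orthogonality of the unscaled factor $Q = (I + A)^{-1}(I - A)$ follows because $(I + A)^{\top} = I - A$ and because $I + A$ and $I - A$ commute:

$$
Q^{\top} Q = (I - A)^{\top} (I + A)^{-\top} (I + A)^{-1} (I - A) = (I + A)(I - A)^{-1}(I + A)^{-1}(I - A) = I .
$$

Such a $Q$ cannot have $-1$ as an eigenvalue: if $Qx = -x$, then $(I - A)x = -(I + A)x$, which forces $x = 0$. Multiplying by $D$, itself orthogonal, keeps $W$ orthogonal while making such matrices reachable.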

Recurrent Scale Approximation for Object Detection in CNN

Since a convolutional neural network (CNN) lacks an inherent mechanism to handle large scale variations, we always need to compute feature maps multiple times for multi-scale object detection, which creates a computational bottleneck in practice. To address this, we devise a recurrent scale approximation (RSA) that computes the feature map only once and approximates the remaining maps on other levels from this map alone. At the core of RSA is the recursive rolling-out mechanism: given an initial map on a particular scale, it generates the prediction on a smaller scale that is half the size of the input. To further increase efficiency and accuracy, we (a) design a scale-forecast network to globally predict potential scales in the image, since there is no need to compute maps on all levels of the pyramid, and (b) propose a landmark retracing network (LRN) to retrace the locations of the regressed landmarks and generate a confidence score for each landmark; LRN can effectively alleviate false positives due to the accumulated error in RSA. The whole system can be trained end-to-end in a unified CNN framework. Experiments demonstrate that our proposed algorithm is superior to state-of-the-art methods on face detection benchmarks and achieves comparable results for generic proposal generation. The source code of RSA is available at

Benchmarking Multimodal Sentiment Analysis

We propose a framework for multimodal sentiment analysis and emotion recognition using convolutional neural network-based feature extraction from text and visual modalities. We obtain a performance improvement of 10% over the state of the art by combining visual, text and audio features. We also discuss some major issues frequently ignored in multimodal sentiment analysis research: the role of speaker-independent models, the importance of the modalities, and generalizability. The paper thus serves as a new benchmark for further research in multimodal sentiment analysis and also demonstrates the different facets of analysis to be considered while performing such tasks.

How Good Are Machine Learning Clouds for Binary Classification with Good Features?

We conduct an empirical study of machine learning functionalities provided by major cloud service providers, which we call machine learning clouds. Machine learning clouds hold the promise of hiding all the sophistication of running large-scale machine learning: instead of specifying how to run a machine learning task, users only specify what machine learning task to run and the cloud figures out the rest. Raising the level of abstraction, however, rarely comes free: a performance penalty is possible. How good, then, are current machine learning clouds on real-world machine learning workloads? We study this question by presenting mlbench, a novel benchmark dataset constructed with the top winning code for all available competitions on Kaggle, as well as the results we obtained by running mlbench on machine learning clouds from both Azure and Amazon. We analyze the strengths and weaknesses of existing machine learning clouds and discuss potential future directions.

Owl: A General-Purpose Numerical Library in OCaml

Owl is a new numerical library developed in the OCaml language. It focuses on providing a comprehensive set of high-level numerical functions so that developers can quickly build up data analytical applications. In this abstract, we will present Owl’s design, core components, and its key functionality.

Visual Explanations for Convolutional Neural Networks via Input Resampling

The predictive power of neural networks often costs model interpretability. Several techniques have been developed for explaining model outputs in terms of input features; however, it is difficult to translate such interpretations into actionable insight. Here, we propose a framework to analyze predictions in terms of the model’s internal features by inspecting information flow through the network. Given a trained network and a test image, we select neurons by two metrics, both measured over a set of images created by perturbations to the input image: (1) magnitude of the correlation between the neuron activation and the network output and (2) precision of the neuron activation. We show that the former metric selects neurons that exert large influence over the network output while the latter metric selects neurons that activate on generalizable features. By comparing the sets of neurons selected by these two metrics, our framework suggests a way to investigate the internal attention mechanisms of convolutional neural networks.

Deep Multi-View Learning with Stochastic Decorrelation Loss

Multi-view learning aims to learn an embedding space where multiple views are either maximally correlated for cross-view recognition, or decorrelated for latent factor disentanglement. A key challenge for deep multi-view representation learning is scalability. To correlate or decorrelate multi-view signals, the covariance of the whole training set should be computed which does not fit well with the mini-batch based training strategy, and moreover (de)correlation should be done in a way that is free of SVD-based computation in order to scale to contemporary layer sizes. In this work, a unified approach is proposed for efficient and scalable deep multi-view learning. Specifically, a mini-batch based Stochastic Decorrelation Loss (SDL) is proposed which can be applied to any network layer to provide soft decorrelation of the layer’s activations. This reveals the connection between deep multi-view learning models such as Deep Canonical Correlation Analysis (DCCA) and Factorisation Autoencoder (FAE), and allows them to be easily implemented. We further show that SDL is superior to other decorrelation losses in terms of efficacy and scalability.

Model-Free Renewable Scenario Generation Using Generative Adversarial Networks

Scenario generation is an important step in the operation and planning of power systems with high renewable penetrations. In this work, we propose a data-driven approach for scenario generation using generative adversarial networks, which is based on two interconnected deep neural networks. Compared with existing methods based on probabilistic models that are often hard to scale or sample from, our method is data-driven and captures renewable energy production patterns in both temporal and spatial dimensions for a large number of correlated resources. For validation, we use wind and solar time-series data from NREL integration data sets. We demonstrate that the proposed method is able to generate realistic wind and photovoltaic power profiles with a full diversity of behaviors. We also illustrate how to generate scenarios based on different conditions of interest by using labeled data during training. For example, scenarios can be conditioned on weather events (e.g. high wind day) or time of the year (e.g. solar generation for a day in July). Because of the feedforward nature of the neural networks, scenarios can be generated extremely efficiently without sophisticated sampling techniques.

Learning to Match

Outsourcing tasks to previously unknown parties is becoming more common. One specific such problem involves matching a set of workers to a set of tasks. Even if the latter have precise requirements, the quality of individual workers is usually unknown. The problem is thus a version of matching under uncertainty. We believe that this type of problem is going to be increasingly important. When the problem involves only a single skill or type of job, it is essentially a type of bandit problem, and can be solved with standard algorithms. However, we develop an algorithm that can perform matching for workers with multiple skills hired for multiple jobs with multiple requirements. We perform an experimental evaluation in both single-task and multi-task problems, comparing with the bounded $\epsilon$-first algorithm, as well as an oracle that knows the true skills of workers. One of the algorithms we developed gives results approaching 85% of the oracle’s performance. We invite the community to take a closer look at this problem and develop real-world benchmarks.

Cost and Actual Causation

I propose that the purpose our concept of actual causation serves is minimizing various costs in intervention practice. Actual causation has three features: nonredundant sufficiency, continuity and abnormality; these features correspond to the minimization of exploitative cost, exploratory cost and risk cost in intervention practice. Incorporating these three features, a definition of actual causation is given. I test the definition on 66 causal cases from the actual causation literature and show that this definition’s application fits intuition better than some other definitions based on causal modelling.

Mini-batch Tempered MCMC

In this paper we propose a general framework for performing MCMC with only a mini-batch of data. We show that by estimating the Metropolis-Hastings ratio with only a mini-batch of data, one is essentially sampling from the true posterior raised to a known temperature. We show by experiments that our method, Mini-batch Tempered MCMC (MINT-MCMC), can efficiently explore multiple modes of a posterior distribution. As an application, we demonstrate MINT-MCMC as an inference tool for Bayesian neural networks. We also show that a cyclic version of our algorithm can be applied to build an ensemble of neural networks with little additional training cost.

Transfer Learning with Label Noise

Transfer learning aims to improve learning in the target domain with limited training data by borrowing knowledge from a related but different source domain with sufficient labeled data. To reduce the distribution shift between source and target domains, recent methods have focused on exploring invariant representations that have similar distributions across domains. However, existing methods assume that the labels in the source domain are uncontaminated, while in reality, we often only have access to a source domain with noisy labels. In this paper, we first analyze the effects of label noise in various transfer learning scenarios in which the data distribution is assumed to change in different ways. We find that although label noise has no effect on the invariant representation learning in the covariate shift scenario, it has adverse effects on the learning process in the more general target/conditional shift scenarios. To solve this problem, we propose a new transfer learning method to learn invariant representations in the presence of label noise, which also simultaneously estimates the label distributions in the target domain. Experimental results on both synthetic and real-world data verify the effectiveness of the proposed method.

Analysis and Optimization of Convolutional Neural Network Architectures

Convolutional Neural Networks (CNNs) have dominated various computer vision tasks since Alex Krizhevsky showed that they can be trained effectively and reduced the top-5 error from 26.2 % to 15.3 % on the ImageNet large scale visual recognition challenge. Many aspects of CNNs are examined in various publications, but literature about the analysis and construction of neural network architectures is rare. This work is one step towards closing this gap. A comprehensive overview of existing techniques for CNN analysis and topology construction is provided. A novel way to visualize classification errors with confusion matrices was developed. Based on this method, hierarchical classifiers are described and evaluated. Additionally, some results are confirmed and quantified for CIFAR-100, for example the positive impact of smaller batch sizes, averaging ensembles, data augmentation and test-time transformations on the accuracy. Other results, such as the positive impact of learned color transformation on the test accuracy, could not be confirmed. A model was developed which has only one million learned parameters for an input size of 32x32x3 and 100 classes and which beats the state of the art on the benchmark datasets Asirra, GTSRB, HASYv2 and STL-10.

Taming Non-stationary Bandits: A Bayesian Approach

We consider the multi-armed bandit problem in non-stationary environments. Based on the Bayesian method, we propose a variant of Thompson Sampling which can be used in both rested and restless bandit scenarios. Applying discounting to the parameters of the prior distribution, we describe a way to systematically reduce the effect of past observations. Further, we derive the exact expression for the probability of picking sub-optimal arms. By increasing the exploitative value of Bayes’ samples, we also provide an optimistic version of the algorithm. Extensive empirical analysis is conducted under various scenarios to validate the utility of the proposed algorithms. A comparison study with various state-of-the-art algorithms is also included.
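
The idea of discounting past observations can be illustrated with a minimal sketch, assuming Bernoulli rewards and a Beta prior per arm. The class name, the `gamma` parameter, and the exact update rule are illustrative assumptions, not necessarily the authors' algorithm.

```python
import random


class DiscountedThompsonSampling:
    """Thompson Sampling for Bernoulli arms with exponentially discounted counts.

    Hypothetical sketch: every update multiplies all stored success and
    failure counts by `gamma`, so old observations fade over time.
    """

    def __init__(self, n_arms, gamma=0.95, prior_alpha=1.0, prior_beta=1.0, seed=0):
        self.gamma = gamma
        self.prior_alpha = prior_alpha
        self.prior_beta = prior_beta
        self.successes = [0.0] * n_arms
        self.failures = [0.0] * n_arms
        self.rng = random.Random(seed)

    def select_arm(self):
        # Draw one sample per arm from its Beta posterior and play the best draw.
        samples = [
            self.rng.betavariate(self.prior_alpha + s, self.prior_beta + f)
            for s, f in zip(self.successes, self.failures)
        ]
        return max(range(len(samples)), key=samples.__getitem__)

    def update(self, arm, reward):
        # Discount the evidence for every arm, then record the new observation.
        self.successes = [self.gamma * s for s in self.successes]
        self.failures = [self.gamma * f for f in self.failures]
        self.successes[arm] += reward
        self.failures[arm] += 1 - reward
```

With this update the total stored evidence never exceeds 1 / (1 - gamma), so the posterior of each arm stays wide enough to react when the reward distribution changes.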

Skill2vec: Machine Learning Approaches for Determining the Relevant Skill from Job Description

Word embeddings learned without supervision have seen tremendous success in numerous Natural Language Processing (NLP) tasks in recent years. The main contribution of this paper is to develop a technique called Skill2vec, which applies machine learning techniques in recruitment to enhance the search strategy for finding candidates who possess the right skills. Skill2vec is a neural network architecture, inspired by Word2vec (developed by Mikolov et al. in 2013), that transforms a skill into a new vector space. This vector space supports calculations on skills and represents the relationships between them. We conducted an experiment using A/B testing in a recruitment company to demonstrate the effectiveness of our approach.

Anomaly Detection by Robust Statistics

Real data often contain anomalous cases, also known as outliers. These may spoil the resulting analysis but they may also contain valuable information. In either case, the ability to detect such anomalies is essential. A useful tool for this purpose is robust statistics, which aims to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. We present an overview of several robust methods and the resulting graphical outlier detection tools. We discuss robust procedures for univariate, low-dimensional, and high-dimensional data, such as estimating location and scatter, linear regression, principal component analysis, classification, clustering, and functional data analysis. Also the challenging new topic of cellwise outliers is introduced.
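
For the univariate case, the principle of fitting the majority of the data first and then flagging deviating points can be sketched with the median and the median absolute deviation (MAD). This is a minimal illustration, not one of the specific procedures surveyed; the function name and the cutoff of 2.5 are assumptions chosen for the example.

```python
from statistics import median


def flag_outliers(values, cutoff=2.5):
    """Flag points whose robust z-score exceeds `cutoff`.

    The median estimates location; the MAD, scaled by 1.4826 so that it is
    consistent with the standard deviation under normality, estimates scale.
    """
    center = median(values)
    mad = median([abs(v - center) for v in values])
    scale = 1.4826 * mad
    if scale == 0:
        raise ValueError("MAD is zero; more than half of the values are identical")
    return [abs(v - center) / scale > cutoff for v in values]


readings = [10.1, 9.9, 10.0, 10.2, 9.8, 25.0]
flags = flag_outliers(readings)
```

On these readings only the last value is flagged. A classical z-score based on the mean and sample standard deviation gives that same point a score of about 2.0, because the outlier itself inflates both estimates, so the same cutoff would miss it.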

Unsupervised Visual Attribute Transfer with Reconfigurable Generative Adversarial Networks

Learning to transfer visual attributes requires a supervised dataset. Corresponding images of the same identity with varying attribute values are required for learning the transfer function. This largely limits the applications of such methods, because capturing these images is often a difficult task. To address the issue, we propose an unsupervised method to learn to transfer visual attributes. The proposed method can learn the transfer function without any corresponding images. Inspecting visualization results from various unsupervised attribute transfer tasks, we verify the effectiveness of the proposed method.

Familia: An Open-Source Toolkit for Industrial Topic Modeling

Familia is an open-source toolkit for pragmatic topic modeling in industry. Familia abstracts the utilities of topic modeling in industry as two paradigms: semantic representation and semantic matching. Efficient implementations of the two paradigms are made publicly available for the first time. Furthermore, we provide off-the-shelf topic models trained on large-scale industrial corpora, including Latent Dirichlet Allocation (LDA), SentenceLDA and Topical Word Embedding (TWE). We further describe typical applications which are successfully powered by topic modeling, in order to ease the confusions and difficulties of software engineers during topic model selection and utilization.

Dimensionality reduction of SDPs through sketching

We show how to sketch semidefinite programs (SDPs) using positive maps in order to reduce their dimension. More precisely, we use Johnson-Lindenstrauss transforms to produce a smaller SDP whose solution preserves feasibility or approximates the value of the original problem with high probability. These techniques allow us to improve both complexity and storage space requirements. They apply to problems in which the Schatten 1-norm of the matrices specifying the SDP and of a solution to the problem is constant in the problem size. Furthermore, we provide some no-go results which clarify the limitations of positive, linear sketches in this setting. Finally, we discuss numerical examples to benchmark our methods.

Temporal Hierarchical Clustering

We study hierarchical clusterings of metric spaces that change over time. This is a natural geometric primitive for the analysis of dynamic data sets. Specifically, we introduce and study the problem of finding a temporally coherent sequence of hierarchical clusterings from a sequence of unlabeled point sets. We encode the clustering objective by embedding each point set into an ultrametric space, which naturally induces a hierarchical clustering of the set of points. We enforce temporal coherence among the embeddings by finding correspondences between successive pairs of ultrametric spaces which exhibit small distortion in the Gromov-Hausdorff sense. We present both upper and lower bounds on the approximability of the resulting optimization problems.

Deep Discrete Supervised Hashing

Hashing has been widely used for large-scale search due to its low storage cost and fast query speed. By using supervised information, supervised hashing can significantly outperform unsupervised hashing. Recently, discrete supervised hashing and deep hashing have been two representative lines of progress in supervised hashing. On one hand, hashing is essentially a discrete optimization problem. Hence, utilizing supervised information to directly guide the discrete (binary) coding procedure can avoid sub-optimal solutions and improve accuracy. On the other hand, deep hashing, which integrates deep feature learning and hash-code learning into an end-to-end architecture, can enhance the feedback between feature learning and hash-code learning. The key in discrete supervised hashing is to adopt supervised information to directly guide the discrete coding procedure in hashing. The key in deep hashing is to adopt the supervised information to directly guide the deep feature learning procedure. However, no existing work can use the supervised information to directly guide both the discrete coding procedure and the deep feature learning procedure in the same framework. In this paper, we propose a novel deep hashing method, called deep discrete supervised hashing (DDSH), to address this problem. DDSH is the first deep hashing method which can utilize supervised information to directly guide both the discrete coding procedure and the deep feature learning procedure, and thus enhance the feedback between these two important procedures. Experiments on three real datasets show that DDSH can outperform other state-of-the-art baselines, including both discrete hashing and deep hashing baselines, for image retrieval.
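
The low storage cost and fast query speed mentioned above come from comparing short binary codes with the Hamming distance. The sketch below illustrates only that retrieval step on codes that are assumed to be already learned; it does not implement DDSH or any learning procedure, and the function names are hypothetical.

```python
def hamming_distance(a, b):
    """Number of differing bits between two binary codes stored as integers."""
    return bin(a ^ b).count("1")


def rank_by_hamming(query, database):
    """Return database indices ordered from nearest to farthest code.

    Ties keep their original order because Python's sort is stable.
    """
    return sorted(range(len(database)), key=lambda i: hamming_distance(query, database[i]))


# Three 4-bit codes; the query matches the last one exactly.
codes = [0b1010, 0b0101, 0b1011]
ranking = rank_by_hamming(0b1011, codes)
```

Here `ranking` is `[2, 0, 1]`: the exact match comes first, followed by the code at distance 1 and then the code at distance 3.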

Regularization techniques for fine-tuning in neural machine translation

We investigate techniques for supervised domain adaptation for neural machine translation where an existing model trained on a large out-of-domain dataset is adapted to a small in-domain dataset. In this scenario, overfitting is a major challenge. We investigate a number of techniques to reduce overfitting and improve transfer learning, including regularization techniques such as dropout and L2-regularization towards an out-of-domain prior. In addition, we introduce tuneout, a novel regularization technique inspired by dropout. We apply these techniques, alone and in combination, to neural machine translation, obtaining improvements on IWSLT datasets for English->German and English->Russian. We also investigate the amounts of in-domain training data needed for domain adaptation in NMT, and find a logarithmic relationship between the amount of training data and gain in BLEU score.
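
L2-regularization towards an out-of-domain prior can be written as a penalized fine-tuning objective. The notation below is an assumption made for illustration: $\theta$ denotes the parameters being fine-tuned, $\hat{\theta}_{\text{out}}$ the fixed parameters of the out-of-domain model, $\mathcal{L}_{\text{in}}$ the in-domain training loss, and $\lambda$ a regularization weight.

$$
\mathcal{L}(\theta) = \mathcal{L}_{\text{in}}(\theta) + \lambda \, \lVert \theta - \hat{\theta}_{\text{out}} \rVert_2^2
$$

Unlike ordinary weight decay, which pulls parameters towards zero, the penalty here pulls them towards the out-of-domain solution, so the adapted model departs from it only where the in-domain data supports the change.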

Learning Neural Network Classifiers with Low Model Complexity

Modern neural network architectures for large-scale learning tasks have substantially higher model complexities, which makes understanding, visualizing and training these architectures difficult. Recent contributions to deep learning techniques have focused on architectural modifications to improve parameter efficiency and performance. In this paper, we derive a continuous and differentiable error functional for a neural network that minimizes its empirical error as well as a measure of the model complexity. The latter measure is obtained by deriving a differentiable upper bound on the Vapnik-Chervonenkis (VC) dimension of the classifier layer of a class of deep networks. Using standard backpropagation, we realize a training rule that tries to minimize the error on training samples, while improving generalization by keeping the model complexity low. We demonstrate the effectiveness of our formulation (the Low Complexity Neural Network – LCNN) across several deep learning algorithms, and a variety of large benchmark datasets. We show that hidden layer neurons in the resultant networks learn features that are crisp, and in the case of image datasets, quantitatively sharper. Our proposed approach yields benefits across a wide range of architectures, in comparison to and in conjunction with methods such as Dropout and Batch Normalization, and our results strongly suggest that deep learning techniques can benefit from model complexity control methods such as the LCNN learning rule.

Time Quasicrystals in Dissipative Dynamical Systems
Finitely dependent cycle coloring
Face Deidentification with Generative Deep Neural Networks
Dynamics of homelessness in urban America
Planar Tropical Cubic Curves of Any Genus, and Higher Dimensional Generalisations
Remarks on the $S$-topology
Independent Feedback Vertex Sets for Graphs of Bounded Diameter
Large-Scale Inverse Reinforcement Learning via Function Approximation for Clinical Motion Analysis
Bipartite spanning sub(di)graphs induced by 2-partitions
Independent Feedback Vertex Set for $P_5$-free Graphs
Photographic Image Synthesis with Cascaded Refinement Networks
Online Deception Detection Refueled by Real World Data Collection
A Weakly Supervised Approach to Train Temporal Relation Classifiers and Acquire Regular Event Pairs Simultaneously
Optimized Broadcast for Deep Learning Workloads on Dense-GPU InfiniBand Clusters: MPI or NCCL?
FontCode: Embedding Information in Text Documents using Glyph Perturbation
Hyperprofile-based Computation Offloading for Mobile Edge Networks
Visual Relationship Detection with Internal and External Linguistic Knowledge Distillation
Proceedings of the IJCAI 2017 Workshop on Learning in the Presence of Class Imbalance and Concept Drift (LPCICD’17)
The Split Feasibility Problem with Polynomials
Testing theories of old-age mortality using model selection techniques
Tetravalent Vertex- and Edge-Transitive Graphs Over Doubled Cycles
Refuting Feder, Kinne and Rafiey
A compressive channel estimation technique robust to synchronization impairments
Joint CFO and Channel Estimation in Millimeter Wave Systems with One-Bit ADCs
Sentiment Analysis on Financial News Headlines using Training Dataset Augmentation
Data Transfer Optimization Based on Offline Knowledge Discovery and Adaptive Real-time Sampling
Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints
Bayesian Regression Tree Ensembles that Adapt to Smoothness and Sparsity
Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes
Zero-Shot Activity Recognition with Verb Attribute Induction
FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras
Global Sensitivity Analysis and Quantification of Model Error in Scramjet Computations
Deep Feature Consistent Deep Image Transformations: Downscaling, Decolorization and HDR Tone Mapping
Classes of Nonnegative Quadratic Optimization Problems with Exact Copositive Relaxation
Method and apparatus for automatic text input insertion in digital devices with a restricted number of keys
On modeling weakly stationary processes
Topology Analysis of International Networks Based on Debates in the United Nations
Essential sets for random operators constructed from Arratia flow
On the free boundary of an annuity purchase
Counterfactual Reasoning with Disjunctive Knowledge in a Linear Structural Equation Model
Empirical Bayes approaches to PageRank type algorithms for rating scientific journals
Comparing Different Models for Investigating Cascading Failures in Power Systems
Curriculum Learning and Minibatch Bucketing in Neural Machine Translation
Synthetic Database for Evaluation of General, Fundamental Biometric Principles
Dual Ramsey theorems for relational structures
Balanced Stable Marriage: How Close is Close Enough?
A generalized multivariate Student-t mixture model for Bayesian classification and clustering of radar waveforms
Sensitivity Analysis for matched pair analysis of binary data: From worst case to average case analysis
Extreme Quantum Advantage for Rare-Event Sampling
The diameter of KPKVB random graphs
New bounds on the Ramsey number $r(I_m, L_n)$
Improved Adversarial Systems for 3D Object Generation and Reconstruction
Fine-Gray competing risks model with high-dimensional covariates: estimation and Inference
Identification of Treatment Effects under Conditional Partial Independence
A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks
A Skew-Normal Copula-Driven GLMM
Successive Refinement of Abstract Sources
Learning Language Representations for Typology Prediction
Deep Potential Molecular Dynamics: a scalable model with the accuracy of quantum mechanics
Finitary random interlacements and the Gaboriau-Lyons problem
The SNPR neighbourhood of tree-child networks
Virtual PET Images from CT Data Using Deep Convolutional Networks: Initial Results
Lambda number of the power graph of a finite group
Network modelling of topological domains using Hi-C data
Secure Detection: Efficiency Versus Security
Discover and Learn New Objects from Documentaries
ScanNet: A Fast and Dense Scanning Framework for Metastatic Breast Cancer Detection from Whole-Slide Images
Robust stability conditions for feedback interconnections of distributed-parameter negative imaginary systems
Occlusion Handling using Semantic Segmentation and Visibility-Based Rendering for Mixed Reality
Elicitability and its Application in Risk Management
CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting
Explicit expressions for European option pricing under a generalized skew normal distribution
Soft communities in similarity space
Joint Named Entity Recognition and Stance Detection in Tweets
Sparse Vector Recovery: Bernoulli-Gaussian Message Passing
Life efficiency does not always increase with the dissipation rate
Learning to Infer Graphics Programs from Hand-Drawn Images
A shape theorem for the scaling limit of the IPDSAW at criticality
Some asymptotic results of survival tree and forest models
Tree based weighted learning for estimating individualized treatment rules with censored data
Learned Experts’ Assessment-based Reconstruction Network (‘LEARN’) for Sparse-data CT
Rate of convergence for Hilbert space valued processes
Virtual crystals and Nakajima monomials
Finding a best approximation pair of points for two polyhedra
A Novel Approach for Image Segmentation based on Histograms computed from Hue-data
Ratios of Ordered Points of Point Processes with Regularly Varying Intensity Measures
Invertibility via distance for non-centered random matrices with continuous distributions
A Vision For Continuous Automated Game Design
Adaptive Delivery in Caching Networks
Droplet localization in the random XXZ model and its manifestations
Handling Nested Parallelism and Extreme Load Imbalance in an Orbital Analysis Code
Graph persistence
Ergodicity of Lévy-driven SDEs arising from multiclass many-server queues
On Designing of a Low Leakage Patient-Centric Provider Network
Stable Throughput of Cooperative Cognitive Networks with Energy Harvesting: Finite Relay Buffer and Finite Battery Capacity
Consistent Nonparametric Different-Feature Selection via the Sparsest $k$-Subgraph Problem
Recurrent 3D Pose Sequence Machines
Bitwise Retransmission Schemes for Resources Constrained Uplink Sensor Networks
Asymptotics and Optimal Bandwidth Selection for Nonparametric Estimation of Density Level Sets
Scene Graph Generation from Objects, Phrases and Caption Regions
Developing Knowledge-enhanced Chronic Disease Risk Prediction Models from Regional EHR Repositories
A Geometric Variational Approach to Bayesian Inference
Automatic Crack Detection in Built Infrastructure Using Unmanned Aerial Vehicles
Energy-Efficient Resource Allocation for Ultra-reliable and Low-latency Communications
Solving the Bose-Hubbard model with machine learning
Spectral Compressed Sensing via Projected Gradient Descent
Bilevel Optimization Based Transmission Expansion Planning Considering Phase Shifting Transformer
Camera Relocalization by Predicting Pairwise Relative Poses Using Convolutional Neural Network
Approximate Random Matrix Models for Generalized Fading MIMO Channels
Joint Sum Rate And Error Probability Optimization: Finite Blocklength Analysis
Statistical Modeling of Propagation Channels for Terahertz Band
The Undirected Optical Indices of Complete $m$-ary Trees
Synthesis of Positron Emission Tomography (PET) Images via Multi-channel Generative Adversarial Networks (GANs)
Polar Code Construction for List Decoding
Delay Analysis of Multichannel Parallel Contention Tree Algorithms (MP-CTA)
Coded Load Balancing in Cache Networks
Sweeping processes with prescribed behaviour on jumps
Low-Resource Neural Headline Generation
GPS Multipath Detection in the Frequency Domain
Variance of the volume of random real algebraic submanifolds II
Capacity limitations of visual search in deep convolutional neural network
Arc-transitive Cayley graphs on non-abelian simple groups with soluble vertex stabilizers and valency seven
Evaluating Music Recommender Systems for Groups
Spectrum Access In Cognitive Radio Using A Two Stage Reinforcement Learning Approach
Performance analysis of FSO communications under LOS blockage
Field Theory of Disordered Elastic Interfaces at 3-Loop Order
2D-3D Fully Convolutional Neural Networks for Cardiac MR Segmentation
Combining Thesaurus Knowledge and Probabilistic Topic Models
Recognizing Graphs Close to Bipartite Graphs with an Application to Colouring Reconfiguration
On Approximation for Fractional Stochastic Partial Differential Equations on the Sphere
Bias Estimation for Decentralized Sensor Fusion — Multi-Agent Based Bias Estimation Method
Random gluing of metric spaces
Meta-SGD: Learning to Learn Quickly for Few Shot Learning
CODE-RADE – Community Infrastructure for the Delivery of Physics Applications
Convolution with Logarithmic Filter Groups for Efficient Shallow CNN
Reporting Score Distributions Makes a Difference: Performance Study of LSTM-networks for Sequence Tagging
Linguistically Motivated Vocabulary Reduction for Neural Machine Translation from Turkish to English
Learning Audio – Sheet Music Correspondences for Score Identification and Offline Alignment
Penalty Alternating Direction Methods for Mixed-Integer Optimization: A New View on Feasibility Pumps
Quantum Privacy-Preserving Perceptron
Fashioning with Networks: Neural Style Transfer to Design Clothes
A Relational Event Approach to Modeling Behavioral Dynamics
On the Shapley value of unrooted phylogenetic trees
A matrix Bougerol identity and the Hua-Pickrell measures
Robust Private Information Retrieval on Coded Data
Bounce statistics for rational lattice paths
Random integral operators related to the point processes
A Framework for Super-Resolution of Scalable Video via Sparse Reconstruction of Residual Frames
Wavelet Residual Network for Low-Dose CT via Deep Convolutional Framelets
A Generalized Matched Filter Framework for Massive MIMO Systems
Multiscale Co-Design Analysis of Energy, Latency, Area, and Accuracy of a ReRAM Analog Neural Training Accelerator
A limiting characteristic polynomial of some random matrix ensembles
Tuning MPI Collectives by Verifying Performance Guidelines
Bounds on the burning numbers of spiders and path-forests
Spectral Method and Regularized MLE Are Both Optimal for Top-$K$ Ranking
Some variations of EM algorithms for Marshall-Olkin bivariate Pareto distribution with location and scale