Community Identity and User Engagement in a Multi-Community Landscape

Fisher GAN

Multiple Source Domain Adaptation with Adversarial Training of Neural Networks

Multiplicative component models for replicated point processes

Semi-Supervised Model Training for Unbounded Conversational Speech Recognition

Probabilistic and Geometrical Applications to Graph Theory

Direct Estimation of Regional Wall Thicknesses via Residual Recurrent Neural Network

Multiplex model of mental lexicon reveals explosive learning in humans

Optimal Transport Theory for Cell Association in UAV-Enabled Cellular Networks

Evolution of Social Power in Social Networks with Dynamic Topology

CASENet: Deep Category-Aware Semantic Edge Detection

Stochastic Feedback Control of Systems with Unknown Nonlinear Dynamics

A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks

Deep Matching and Validation Network — An End-to-End Solution to Constrained Image Splicing Localization and Detection

Image splicing is a very common image manipulation technique that is sometimes used for malicious purposes. A splicing detection and localization algorithm usually takes an input image and produces a binary decision indicating whether the input image has been manipulated, and also a segmentation mask that corresponds to the spliced region. Most existing splicing detection and localization pipelines suffer from two main shortcomings: 1) they use handcrafted features that are not robust against subsequent processing (e.g., compression), and 2) each stage of the pipeline is usually optimized independently. In this paper we extend the formulation of the underlying splicing problem to consider two input images, a query image and a potential donor image. Here the task is to estimate the probability that the donor image has been used to splice the query image, and obtain the splicing masks for both the query and donor images. We introduce a novel deep convolutional neural network architecture, called Deep Matching and Validation Network (DMVN), which simultaneously localizes and detects image splicing. The proposed approach does not depend on handcrafted features and uses raw input images to create deep learned representations. Furthermore, the DMVN is end-to-end optimized to produce the probability estimates and the segmentation masks. Our extensive experiments demonstrate that this approach outperforms state-of-the-art splicing detection methods by a large margin in terms of both AUC score and speed.

Efficient 3D Placement of a UAV Using Particle Swarm Optimization

Providing Wireless Coverage to High-rise Buildings Using UAVs

The Indoor Mobile Coverage Problem Using UAVs

Maximizing Indoor Wireless Coverage Using UAVs Equipped with Directional Antennas

Maximum nullity and zero forcing number on cubic graphs

Heteroscedastic Concomitant Lasso for sparse multimodal electromagnetic brain imaging

Linear-size CDAWG: new repetition-aware indexing and grammar compression

Nearest Neighbour Radial Basis Function Solvers for Deep Neural Networks

We present a radial basis function solver for convolutional neural networks that can be directly applied to both image classification and distance metric learning problems. Our method treats all training features from a deep neural network as radial basis function centres and computes loss by summing the influence of a feature’s nearby centres in the embedding space. Having a radial basis function centred on each training feature is made scalable by treating it as an approximate nearest neighbour search problem. End-to-end learning of the network and solver is carried out, mapping high dimensional features into clusters of the same class. This results in a well formed embedding space, where semantically related instances are likely to be located near one another, regardless of whether or not the network was trained on those classes. The same loss function is used for both the metric learning and classification problems. We show that our radial basis function solver sets state-of-the-art embedding results on the Stanford Cars196 and CUB-200-2011 datasets. Additionally, we show that when used as a classifier, our method outperforms a conventional softmax classifier on the Caltech-256 object recognition dataset and the fine-grained recognition dataset CUB-200-2011.

Good Semi-supervised Learning that Requires a Bad GAN

LiDAR-Camera Calibration using 3D-3D Point correspondences

AMPNet: Asynchronous Model-Parallel Training for Dynamic Neural Networks

Half-quadratic transportation problems

PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations

KlusTree: Clustering Answer Trees from Keyword Search on Graphs

Graph structured data on the web is now massive as well as diverse, ranging from social networks, web graphs to knowledge-bases. Effectively querying this graph structured data is non-trivial and has led to research in a variety of directions — structured queries, keyword and natural language queries, automatic translation of these queries to structured queries, etc. We are concerned with a class of queries called relationship queries, which are usually expressed as a set of keywords (each keyword denoting a named entity). The results returned are a set of ranked trees, each of which denotes relationships among the various keywords. The result list could consist of hundreds of answers. The problem of keyword search on graphs has been explored for over a decade now, but an important aspect that is not as extensively studied is that of user experience. We propose KlusTree, which presents clustered results to the users instead of a list of all the results. In our approach, the result trees are represented using language models and are clustered using JS divergence as a distance measure. We compare KlusTree with the well-known approaches based on isomorphism and tree-edit distance based clustering. The user evaluations show that KlusTree outperforms the other two in providing better clustering, thereby enriching user experience, revealing interesting patterns and improving result interpretation by the user.
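As a rough illustration of the clustering distance mentioned above, the following sketch computes the Jensen-Shannon (JS) divergence between language models of answer trees. The abstract does not say how the language models are built, so the add-alpha smoothed unigram model, the `unigram_model` helper, and the toy token lists are all assumptions made for this example, not the authors' method.

```python
import math
from collections import Counter


def unigram_model(tokens, vocab, alpha=1.0):
    """Add-alpha smoothed unigram distribution over a shared vocabulary (assumed model)."""
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def kl(p, q):
    """Kullback-Leibler divergence KL(p || q) in bits; q must be positive wherever p is."""
    return sum(p[w] * math.log2(p[w] / q[w]) for w in p if p[w] > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric, and bounded in [0, 1] with log base 2."""
    m = {w: 0.5 * (p[w] + q[w]) for w in p}
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


# Hypothetical answer trees, flattened into node and edge labels.
tree_a = ["einstein", "born_in", "ulm", "located_in", "germany"]
tree_b = ["einstein", "born_in", "ulm", "located_in", "europe"]
tree_c = ["curie", "won", "nobel_prize", "field", "chemistry"]

vocab = sorted(set(tree_a) | set(tree_b) | set(tree_c))
models = [unigram_model(t, vocab) for t in (tree_a, tree_b, tree_c)]

d_ab = js_divergence(models[0], models[1])  # trees sharing most labels
d_ac = js_divergence(models[0], models[2])  # trees sharing no labels
```

With these toy inputs the two trees that share most labels end up closer than the two that share none, which is the property a clustering step would rely on.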

Mirror version of similar triangles method for constrained optimization problems

Multi-shot ASP solving with clingo

Global hard thresholding algorithms for joint sparse image representation and denoising

Mini-Flash Crashes, Model Risk, and Optimal Execution

A Split-Sample Approach for Estimating the Stability Index of a Stable Distribution

Quadratic Unconstrained Binary Optimization Problem Preprocessing: Theory and Empirical Analysis

Phase Function Density Deconvolution with Heteroscedastic Measurement Error of Unknown Type

Lifelong Generative Modeling

Lifelong learning is the problem of learning multiple consecutive tasks in an online manner and is essential towards the development of intelligent machines that can adapt to their surroundings. In this work we focus on learning a lifelong approach to generative modeling whereby we continuously incorporate newly observed distributions into our model representation. We utilize two models, aptly named the student and the teacher, in order to aggregate information about all past distributions without the preservation of any of the past data or previous models. The teacher is utilized as a form of compressed memory in order to allow for the student model to learn over the past as well as present data. We demonstrate why a naive approach to lifelong generative modeling fails and introduce a regularizer with which we demonstrate learning across a long range of distributions.

Deep Learning for Spatio-Temporal Modeling: Dynamic Traffic Flows and High Frequency Trading

Probabilistic Global Scale Estimation for MonoSLAM Based on Generic Object Detection

Efficient Modeling of Latent Information in Supervised Learning using Gaussian Processes

Often in machine learning, data are collected as a combination of multiple conditions, e.g., the voice recordings of multiple persons, each labeled with an ID. How could we build a model that captures the latent information related to these conditions and generalizes to a new one with little data? We present a new model called Latent Variable Multiple Output Gaussian Processes (LVMOGP) that jointly models multiple conditions for regression and generalizes to a new condition with a few data points at test time. LVMOGP infers the posteriors of Gaussian processes together with a latent space representing the information about different conditions. We derive an efficient variational inference method for LVMOGP, whose computational complexity is as low as that of sparse Gaussian processes. We show that LVMOGP significantly outperforms related Gaussian process methods on various tasks with both synthetic and real data.

Machine learning for graph-based representations of three-dimensional discrete fracture networks

Dimensionality reduction for acoustic vehicle classification with spectral clustering

Targeted Learning with Daily EHR Data

Stopping time convergence for processes associated with Dirichlet forms

Inexpensive Cost-Optimized Measurement Proposal for Sequential Model-Based Diagnosis

Person Depth ReID: Robust Person Re-identification with Commodity Depth Sensors

Vocabulary-informed Extreme Value Learning

Novel unseen classes can be formulated as the extreme values of known classes. This observation inspired recent work on open-set recognition \cite{Scheirer_2013_TPAMI,Scheirer_2014_TPAMIb,EVM}, which, however, has no way of naming the novel unseen classes. To solve this problem, we propose the Extreme Value Learning (EVL) formulation to learn the mapping from visual features to semantic space. To model the margin and coverage distributions of each class, Vocabulary-informed Learning (ViL) is adopted, using a vast open vocabulary in the semantic space. Essentially, by incorporating EVL and ViL, we propose for the first time a novel semantic embedding paradigm, Vocabulary-informed Extreme Value Learning (ViEVL), which embeds the visual features into semantic space in a probabilistic way. The learned embedding can be directly used to solve supervised learning, zero-shot and open-set recognition simultaneously. Experiments on two benchmark datasets demonstrate the effectiveness of the proposed frameworks.

Cross-modal Subspace Learning for Fine-grained Sketch-based Image Retrieval

Care about you: towards large-scale human-centric visual relationship detection

Understanding Abuse: A Typology of Abusive Language Detection Subtasks

A Unified Optimization Approach for Sparse Tensor Operations on GPUs

Listen, Interact and Talk: Learning to Speak via Interaction

One of the long-term goals of artificial intelligence is to build an agent that can communicate intelligently with humans in natural language. Most existing work on natural language learning relies heavily on training over a pre-collected dataset with annotated labels, leading to an agent that essentially captures the statistics of the fixed external training data. As the training data is essentially a static snapshot of the annotator's knowledge, an agent trained this way is limited in the adaptiveness and generalization of its behavior. Moreover, this is very different from the language learning process of humans, where language is acquired during communication by taking speaking actions and learning from their consequences in an interactive manner. This paper presents an interactive setting for grounded natural language learning, where an agent learns natural language by interacting with a teacher and learning from feedback, thus learning and improving language skills while taking part in the conversation. To achieve this goal, we propose a model which incorporates both imitation and reinforcement by jointly leveraging sentence and reward feedback from the teacher. Experiments are conducted to validate the effectiveness of the proposed approach.

Multi-channel Weighted Nuclear Norm Minimization for Real Color Image Denoising

Dilated Residual Networks

Isolated Loops in Quantum Feedback Networks

Direct Mapping Hidden Excited State Interaction Patterns from ab initio Dynamics and Its Implications on Force Field Development

Bayesian Unification of Gradient and Bandit-based Learning for Accelerated Global Optimisation

Learning Data Manifolds with a Cutting Plane Method

Measurement uncertainty relations for position and momentum: Relative entropy formulation

An approximate ensemble averaged solution of the stochastic Helmholtz equation at wavelengths comparable with the size of heterogeneity domains

Optimal dynamic treatment allocation

L1-norm Error Function Robustness and Outlier Regularization

Proof of a local antimagic conjecture

Near-optimal matrix recovery from random linear measurements

Conditional CycleGAN for Attribute Guided Face Image Generation

A Deep Multi-View Learning Framework for City Event Extraction from Twitter Data Streams

Cities have been a thriving place for citizens over the centuries due to their complex infrastructure. The emergence of Cyber-Physical-Social Systems (CPSS) and context-aware technologies has boosted interest in analysing, extracting and eventually understanding city events, which can subsequently be utilised to leverage citizens' observations of their cities. In this paper, we investigate the feasibility of using Twitter textual streams for extracting city events. We propose a hierarchical multi-view deep learning approach to contextualise citizen observations of various city systems and services. Our goal has been to build a flexible architecture that can learn representations useful for tasks, thus avoiding excessive task-specific feature engineering. We apply our approach to a real-world dataset consisting of event reports and tweets covering over four months from the San Francisco Bay Area, and to additional datasets collected from London. The results of our evaluations show that our proposed solution outperforms the existing models and can be used for extracting city-related events with an average accuracy of 81% over all classes. To further evaluate the impact of our Twitter event extraction model, we have used two sources of authorised reports, collecting road traffic disruption data from the Transport for London API and parsing the Time Out London website for sociocultural events. The analysis showed that 49.5% of the Twitter traffic comments are reported approximately five hours prior to the authorities' official records. Moreover, we discovered that, amongst the scheduled sociocultural event topics, tweets reporting transportation, cultural and social events are 31.75% more likely to influence the distribution of the Twitter comments than sport, weather and crime topics.

Two-Armed Bandit Problem, Data Processing, and Parallel Version of the Mirror Descent Algorithm

User Selection and Widely Linear Multiuser Precoding for One-dimensional Signalling

Cores with distinct parts and bigraded Fibonacci numbers

LAP: a Linearize and Project Method for Solving Inverse Problems with Coupled Variables

Bayesian Bootstraps for Massive Data

Recently, two scalable adaptations of the bootstrap have been proposed: the bag of little bootstraps (BLB; Kleiner et al., 2014) and the subsampled double bootstrap (SDB; Sengupta et al., 2016). In this paper, we introduce Bayesian bootstrap analogues to the BLB and SDB that have similar theoretical and computational properties, a strategy to perform lossless inference for a class of functionals of the Bayesian bootstrap, and briefly discuss extensions for Dirichlet Processes.
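For readers unfamiliar with the building block, here is a minimal sketch of the plain Bayesian bootstrap for a weighted mean: each posterior draw reweights the observations with flat Dirichlet weights. It does not implement the BLB or SDB analogues described in the abstract; the data values, the number of draws, and the function names are assumptions chosen for illustration.

```python
import random


def dirichlet_uniform_weights(n, rng):
    """Draw weights from Dirichlet(1, ..., 1) by normalising Gamma(1, 1) variates."""
    gammas = [rng.gammavariate(1.0, 1.0) for _ in range(n)]
    total = sum(gammas)
    return [g / total for g in gammas]


def bayesian_bootstrap_mean(data, draws=2000, seed=0):
    """Posterior draws of the mean under the (plain) Bayesian bootstrap."""
    rng = random.Random(seed)
    n = len(data)
    samples = []
    for _ in range(draws):
        weights = dirichlet_uniform_weights(n, rng)
        samples.append(sum(w * x for w, x in zip(weights, data)))
    return samples


# Hypothetical observations.
data = [2.1, 3.4, 1.9, 5.0, 4.2, 3.3, 2.8, 4.7]
posterior = sorted(bayesian_bootstrap_mean(data))

# Approximate 95% credible interval from the empirical quantiles of the draws.
lower = posterior[int(0.025 * len(posterior))]
upper = posterior[int(0.975 * len(posterior)) - 1]
sample_mean = sum(data) / len(data)
```

Unlike the classical bootstrap, no observation ever receives zero weight here, so every draw is a smooth reweighting of the full sample.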

Robust Online Matrix Factorization for Dynamic Background Subtraction

Small sphere distributions for directional data with application to medical imaging

Data Driven Coded Aperture Design for Depth Recovery

$(q,t)$-characters of Kirillov-Reshetikhin modules of type $A_r$ as quantum cluster variables

Improving the Expected Improvement Algorithm

The expected improvement (EI) algorithm is a popular strategy for information collection in optimization under uncertainty. The algorithm is widely known to be too greedy, but nevertheless enjoys wide use due to its simplicity and ability to handle uncertainty and noise in a coherent decision theoretic framework. To provide rigorous insight into EI, we study its properties in a simple setting of Bayesian optimization where the domain consists of a finite grid of points. This is the so-called best-arm identification problem, where the goal is to allocate measurement effort wisely to confidently identify the best arm using a small number of measurements. In this framework, one can show formally that EI is far from optimal. To overcome this shortcoming, we introduce a simple modification of the expected improvement algorithm. Surprisingly, this simple change results in an algorithm that is asymptotically optimal for Gaussian best-arm identification problems, and provably outperforms standard EI by an order of magnitude.
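The standard EI acquisition value that the abstract takes as its starting point has a closed form when the belief about an arm is Gaussian. The sketch below computes it for a small finite set of arms. It shows only textbook EI, not the modified algorithm the paper proposes; the arm names, posterior means and standard deviations are made up, and using the highest posterior mean as the incumbent is an assumption of this example.

```python
import math


def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def expected_improvement(mu, sigma, best):
    """E[max(X - best, 0)] for X ~ N(mu, sigma^2), in the maximisation setting."""
    if sigma <= 0.0:
        return max(mu - best, 0.0)
    z = (mu - best) / sigma
    return (mu - best) * normal_cdf(z) + sigma * normal_pdf(z)


# Hypothetical posterior (mean, standard deviation) for each arm.
arms = {"a": (1.0, 0.1), "b": (0.8, 0.5), "c": (0.2, 0.3)}

# Assumption: the incumbent is the highest posterior mean.
best = max(mu for mu, _ in arms.values())
scores = {name: expected_improvement(mu, s, best) for name, (mu, s) in arms.items()}
choice = max(scores, key=scores.get)
```

In this toy instance the arm with the second-highest mean but the largest uncertainty gets the highest score, because EI rewards the chance of a large gain over the incumbent.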

Ensemble of Part Detectors for Simultaneous Classification and Localization

Towards Metamerism via Foveated Style Transfer

Cross validation for locally stationary processes

On the Power Spectral Density Applied to the Analysis of Old Canvases

Global sensitivity analysis in the context of imprecise probabilities (p-boxes) using sparse polynomial chaos expansions

Small Area Quantile Estimation

On a generalized crank for $k$-colored partitions

Temporal anomaly detection: calibrating the surprise

We propose a hybrid approach to temporal anomaly detection in user-database access data — or more generally, any kind of subject-object co-occurrence data. Our methodology allows identifying anomalies based on a single stationary model, instead of requiring a full temporal one, which would be prohibitive in our setting. We learn our low-rank stationary model from the high-dimensional training data, and then fit a regression model for predicting the expected likelihood score of normal access patterns in the future. The disparity between the predicted and the observed likelihood scores is used to assess the ‘surprise’. This approach enables calibration of the anomaly score so that time-varying normal behavior patterns are not considered anomalous. We provide a detailed description of the algorithm, including a convergence analysis, and report encouraging empirical results. One of the datasets we tested is new for the public domain. It consists of two months’ worth of database access records from a live system. This dataset will be made publicly available, and is provided in the supplementary material.

Distributed Convolutional Sparse Coding

Role Playing Learning for Socially Concomitant Mobile Robot Navigation

Deterministic Partially Dynamic Single Source Shortest Paths in Weighted Graphs

Coreset Construction via Randomized Matrix Multiplication

Fractional Hedonic Games

Beyond Counting: Comparisons of Density Maps for Crowd Analysis Tasks – Counting, Detection, and Tracking

Implicit Variational Inference with Kernel Density Ratio Fitting

Pose-Aware Person Recognition

Reciprocity-driven Sparse Network Formation

An Automatic Contextual Analysis and Clustering Classifiers Ensemble approach to Sentiment Analysis

Product reviews are one of the major resources for determining public sentiment. The existing literature on review sentiment analysis mainly uses the supervised paradigm, which needs labeled data for training and suffers from domain dependency. This article addresses these issues by describing a completely automatic approach to sentiment analysis based on unsupervised ensemble learning. The method consists of two phases. The first phase is contextual analysis, which has five processes, namely (1) data preparation; (2) spelling correction; (3) intensifier handling; (4) negation handling and (5) contrast handling. The second phase comprises the unsupervised learning approach, which is an ensemble of clustering classifiers using a majority voting mechanism with different weight schemes. The base classifier of the ensemble method is a modified k-means algorithm. The base classifier is modified by extracting initial centroids from the feature set using SentWordNet (SWN). We also introduce new sentiment analysis problems of Australian airlines and home builders, which offer potential benchmark problems in the sentiment analysis field. Our experiments on datasets from different domains show that the contextual analysis and ensemble phases improve the clustering performance in terms of accuracy, stability and generalization ability.

On Residual CNN in text-dependent speaker verification task

Kronecker Recurrent Units

Our work addresses two important issues with recurrent neural networks: (1) they are over-parameterized, and (2) the recurrence matrix is ill-conditioned. The former increases the sample complexity of learning and the training time. The latter causes the vanishing and exploding gradient problem. We present a flexible recurrent neural network model called Kronecker Recurrent Units (KRU). KRU achieves parameter efficiency in RNNs through a Kronecker factored recurrent matrix. It overcomes the ill-conditioning of the recurrent matrix by enforcing soft unitary constraints on the factors. Thanks to the small dimensionality of the factors, maintaining these constraints is computationally efficient. Our experimental results on five standard data-sets reveal that KRU can reduce the number of parameters by three orders of magnitude in the recurrent weight matrix compared to the existing recurrent models, without trading the statistical performance. These results in particular show that while there are advantages in having a high dimensional recurrent space, the capacity of the recurrent part of the model can be dramatically reduced.
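To make the parameter saving concrete, the sketch below builds a recurrent matrix as a Kronecker product of small factors and compares parameter counts. It is only an illustration of the factorisation idea: the paper enforces soft unitary constraints on its factors, whereas this example uses exactly orthogonal real 2x2 rotations as a stand-in, and the angles and the number of factors are arbitrary assumptions.

```python
import math
from functools import reduce


def kron(a, b):
    """Kronecker product of two matrices given as lists of rows."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]


def rotation(theta):
    """2x2 rotation matrix, which is orthogonal by construction."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]


def matmul(a, b):
    """Plain matrix product for lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


# Four hypothetical 2x2 factors give a 16x16 recurrent matrix.
factors = [rotation(0.3 * (k + 1)) for k in range(4)]
W = reduce(kron, factors)

factor_params = sum(len(f) * len(f[0]) for f in factors)  # entries stored in the factors
dense_params = len(W) * len(W[0])                         # entries of an unfactored matrix

# The Kronecker product of orthogonal factors is itself orthogonal.
gram = matmul(W, transpose(W))
max_dev = max(
    abs(gram[i][j] - (1.0 if i == j else 0.0))
    for i in range(len(W))
    for j in range(len(W))
)
```

Here the factors hold 16 numbers in place of 256 for the dense matrix, and the gap widens as the hidden dimension grows, since the factored count grows logarithmically in the dimension for fixed-size factors.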

Graph coarse-graining reveals differences in the module-level structure of functional brain networks

Heuristic Rectangle Splitting: Leveraging Single-Objective Heuristics to Efficiently Solve Multi-Objective Problems

Fast learning rate of deep learning via a kernel perspective

We develop a new theoretical framework to analyze the generalization error of deep learning, and derive a new fast learning rate for two representative algorithms: empirical risk minimization and Bayesian deep learning. The series of theoretical analyses of deep learning has revealed its high expressive power and universal approximation capability. Although these analyses are highly nonparametric, existing generalization error analyses have been developed mainly for fixed-dimensional parametric models. To fill this gap, we develop an infinite dimensional model that is based on an integral form, as used in the analysis of the universal approximation capability. This allows us to define a reproducing kernel Hilbert space corresponding to each layer. Our point of view is to treat the ordinary finite dimensional deep neural network as a finite approximation of the infinite dimensional one. The approximation error is evaluated by the degree of freedom of the reproducing kernel Hilbert space in each layer. To estimate a good finite dimensional model, we consider both empirical risk minimization and Bayesian deep learning. We derive generalization error bounds for both and show that a bias-variance trade-off appears in terms of the number of parameters of the finite dimensional approximation. We show that the optimal width of the internal layers can be determined through the degree of freedom, and that the convergence rate can be faster than the $O(1/\sqrt{n})$ rate shown in existing studies.

Strong solvability of regularized stochastic Landau-Lifshitz-Gilbert equation

Maximum Number of Common Zeros of Homogeneous Polynomials over Finite Fields

Deterministic subgraph detection in broadcast CONGEST

Dependency-Aware Rollback and Checkpoint-Restart for Distributed Task-Based Runtimes

On Multilingual Training of Neural Dependency Parsers

Black-box Testing of First-Order Logic Ontologies Using WordNet

Automatic White-Box Testing of First-Order Logic Ontologies

Permutation-based Causal Inference Algorithms with Interventions

On the regularity of edge ideal of graphs

Latent Intention Dialogue Models

Deep Learning for Patient-Specific Kidney Graft Survival Analysis

Fast Single-Class Classification and the Principle of Logit Separation

General Bounds for Incremental Maximization

Boltzmann Exploration Done Right

Distributed Communication-aware Motion Planning for Multi-agent Systems from STL and SpaTeL Specifications

A New Lower Bound for van der Waerden Numbers

Balanced vertices in labeled rooted trees

Online Auctions and Multi-scale Online Learning

Some remarks on the asymmetric sum–product phenomenon

A Block-Sensitivity Lower Bound for Quantum Testing Hamming Distance

On sampling graphical Markov models

We consider sampling and enumeration problems for Markov equivalence classes. We create and analyze a Markov chain for uniform random sampling on the DAGs inside a Markov equivalence class. Though the worst case is exponentially slow mixing, we find a condition on the Markov equivalence class for polynomial time mixing. We also investigate the ratio of Markov equivalence classes to DAGs and a Markov chain of He, Jia, and Yu for random sampling of sparse Markov equivalence classes.

Characterization of tilt stability via subgradient graphical derivative with applications to nonlinear programming

simmer: Discrete-Event Simulation for R

The simmer package brings discrete-event simulation to R. It is designed as a generic yet powerful process-oriented framework. The architecture encloses a robust and fast simulation core written in C++ with automatic monitoring capabilities. It provides a rich and flexible R API that revolves around the concept of trajectory, a common path in the simulation model for entities of the same type.

Convergence of the Population Dynamics algorithm in the Wasserstein metric

Free monoids and generalized metric spaces

On the Capacity of Fractal Wireless Networks With Direct Social Interactions

word2vec Skip-Gram with Negative Sampling is a Weighted Logistic PCA

We show that the skip-gram formulation of word2vec trained with negative sampling is equivalent to a weighted logistic PCA. This connection allows us to better understand the objective, compare it to other word embedding methods, and extend it to higher dimensional models.

On The Continuous Coverage Problem for a Swarm of UAVs

Maximum nullity of Cayley graph

Deep Complex Networks

At present, the vast majority of building blocks, techniques, and architectures for deep learning are based on real-valued operations and representations. However, recent work on recurrent neural networks and older fundamental theoretical analysis suggests that complex numbers could have a richer representational capacity and could also facilitate noise-robust memory retrieval mechanisms. Despite their attractive properties and potential for opening up entirely new neural architectures, complex-valued deep neural networks have been marginalized due to the absence of the building blocks required to design such models. In this work, we provide the key atomic components for complex-valued deep neural networks and apply them to convolutional feed-forward networks. More precisely, we rely on complex convolutions and present algorithms for complex batch-normalization, complex weight initialization strategies for complex-valued neural nets and we use them in experiments with end-to-end training schemes. We demonstrate that such complex-valued models are able to achieve comparable or better performance than their real-valued counterparts. We test deep complex models on several computer vision tasks and on music transcription using the MusicNet dataset where we achieve state of the art performance.
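A complex convolution can be carried out with real-valued operations only, by expanding the product (A + iB)(x + iy) = (Ax - By) + i(Bx + Ay). The sketch below checks that identity on a one-dimensional "valid" sliding-window product; the 1D setting, the helper names, and the sample values are assumptions made for illustration and are not taken from the paper's implementation.

```python
def conv1d(signal, kernel):
    """'Valid' sliding-window product (cross-correlation, as in most deep learning libraries)."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]


def complex_conv1d(x_re, x_im, w_re, w_im):
    """Complex convolution built from four real convolutions."""
    real = [a - b for a, b in zip(conv1d(x_re, w_re), conv1d(x_im, w_im))]
    imag = [a + b for a, b in zip(conv1d(x_re, w_im), conv1d(x_im, w_re))]
    return real, imag


# Hypothetical complex input and kernel.
x = [1 + 2j, 0 - 1j, 3 + 0.5j, -2 + 1j]
w = [0.5 - 1j, 2 + 0.25j]

out_re, out_im = complex_conv1d(
    [z.real for z in x], [z.imag for z in x],
    [z.real for z in w], [z.imag for z in w],
)

# Reference result using Python's built-in complex arithmetic.
reference = conv1d(x, w)
```

Splitting the computation this way is what lets a complex-valued layer reuse existing real-valued convolution kernels, at the cost of four real convolutions per complex one.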

On the validity of parametric block correlation matrices with constant within and between group correlations

Slimness of graphs

Spreading a Confirmed Rumor: A Case for Oscillatory Dynamics

Growth-Optimal Portfolio Selection under CVaR Constraints

A Viral Timeline Branching Process to study a Social Network

On the relation between dependency distance, crossing dependencies, and parsing. Comment on ‘Dependency distance: a new perspective on syntactic patterns in natural languages’ by Haitao Liu et al

Applying Artificial Intelligence and Internet Techniques in Rural Tourism Domain

Abnormality Detection and Localization in Chest X-Rays using Deep Convolutional Neural Networks

Quadratic BSDEs with mean reflection

On shortened and punctured cyclic codes

BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet

Convergence Analysis of Two-layer Neural Networks with ReLU Activation

Continuous Video to Simple Signals for Swimming Stroke Detection with Convolutional Neural Networks

Projection Theorems of Divergences and Likelihood Maximization Methods

A bootstrap approximation to Lp_statistic of kernel density estimator in length-biased model

Fully distributed PageRank computation with exponential convergence

The placement of the head that maximizes predictability. An information theoretic approach

The finite-time ruin probability of the nonhomogeneous Poisson risk model with conditionally independent subexponential claims

Intrinsic Reduced Attitude Formation with Ring Inter-Agent Graph

Local Large Deviations, McMillian Theorem for multitype Galton-Watson Processes

Probabilistic Program Abstractions

Attitude Quaternion Estimation Using a Spectral Perturbation Approach

Insert ‘Price’ to Coxian Phase-Type Models: An Application to Hospital Charge and Length of Stay Data

A proof of a conjecture by Erdős, Faudree, Rousseau and Schelp about subgraphs of minimum degree $k$

Neural Semantic Parsing by Character-based Translation: Experiments with Abstract Meaning Representations

Symmetry Group of Ordered Hamming Block Space

On The Robustness of Epsilon Skew Extension for Burr III Distribution on Real Line

Should Robots be Obedient?

Deep Learning for User Comment Moderation

Experimenting with a new dataset of 1.6M user comments from a Greek news portal and existing datasets of English Wikipedia comments, we show that an RNN outperforms the previous state of the art in moderation. A deep, classification-specific attention mechanism further improves the overall performance of the RNN. We also compare against a CNN and a word-list baseline, considering both fully automatic and semi-automatic moderation.

Subject Specific Stream Classification Preprocessing Algorithm for Twitter Data Stream

The micro-blogging service Twitter is a lucrative source for data mining applications on global sentiment. However, because of the wide variety of subjects mentioned in each data item, it is inefficient to run a data mining algorithm on the raw data. This paper discusses an algorithm that accurately classifies the entire stream into a given number of mutually exclusive, collectively exhaustive streams, on each of which the data mining algorithm can be run separately, yielding more relevant results with high efficiency.

Learning the Sparse and Low Rank PARAFAC Decomposition via the Elastic Net

The Criticality of a Randomly-Driven Front

Several extreme coefficients of the Tutte polynomial of graphs

On Ryser’s conjecture: $t$-intersecting and degree-bounded hypergraphs, covering by heterogeneous sets

Supervised Complementary Entity Recognition with Augmented Key-value Pairs of Knowledge

Abstract Argumentation / Persuasion / Dynamics

Learning Network Structures from Contagion

Time-Optimal Trajectories of Generic Control-Affine Systems Have at Worst Iterated Fuller Singularities

Counting Subwords Occurrences in Base-b Expansions

Affine maps between graph isomorphism polytopes and Boolean quadratic polytopes

Improving the local scoring algorithm using gradient sampling

Rate $(n-1)/n$ Systematic MDS Convolutional Codes over $GF(2^m)$

Multiple solutions of nonlinear equations involving the square root of the Laplacian

Nonlinear problems on the Sierpiński gasket

Dynamics of core of language vocabulary

A Matched Pairs Analysis of International Protection Outcomes in Ireland

Non-parametric estimation of time varying AR(1)–processes with local stationarity and periodicity

New radiographic image processing tested on the simple and double-flux platform at OMEGA

Tangent Cones to TT Varieties

Robust Fusion Methods for Big Data

We address one of the important problems in Big Data, namely how to combine estimators from different subsamples by robust fusion procedures, when we are unable to deal with the whole sample.

Subdifferential characterization of continuous probability functions under Gaussian distribution

Control and Energy Management System in Microgrids

Machine Learned Learning Machines

There are two common approaches for optimizing the performance of a machine: genetic algorithms and machine learning. A genetic algorithm is applied over many generations, whereas machine learning works by applying feedback until the system meets a performance threshold. Though these methods typically operate separately, we combine evolutionary adaptation and machine learning into one approach. Our focus is on machines that can learn during their lifetime, but instead of equipping them with a machine learning algorithm we aim to let them evolve their ability to learn by themselves. We use evolvable networks of probabilistic and deterministic logic gates, known as Markov Brains, as our computational model organism. The ability of Markov Brains to learn is augmented by a novel adaptive component that can change its computational behavior based on feedback. We show that Markov Brains can indeed evolve to incorporate these feedback gates to improve their adaptability to variable environments. By combining these two methods, we have also implemented a computational model that can be used to study the evolution of learning.

Increasing the Efficiency of Sparse Matrix-Matrix Multiplication with a 2.5D Algorithm and One-Sided MPI

Bayesian stochastic blockmodeling

Optimal control for the stochastic FitzHugh-Nagumo model with recovery variable

More on the total dominator chromatic number of a graph

$L^p$-estimates and regularity for SPDEs with monotone semilinearity

Some Ageing Properties of Dynamic Additive Mean Residual Life Model

Fair Division of a Graph

A note on weak convergence of the $n$-point motions of Harris flows

Dynamic scaling analysis of the long-range RKKY Ising spin glass Dy$_{x}$Y$_{1-x}$Ru$_{2}$Si$_{2}$

Directed random walks on polytopes with few facets

SuperBrownian motion and the spatial Lambda-Fleming-Viot process

An Erdős-Gallai-type theorem for keyrings

A lower bound for the size of Kakeya sets with respect to hyperplanes in $\mathbb{F}_q^n$

Sparse Maximum-Entropy Random Graphs with a Given Power-Law Degree Distribution

Complex Hadamard matrices with noncommutative entries

A Generalized Accelerated Composite Gradient Method: Uniting Nesterov’s Fast Gradient Method and FISTA

Who’s to say what’s funny? A computer using Language Models and Deep Learning, That’s Who!