• Well-posedness of the non-local conservation law by stochastic perturbation
• Hyperspectral Unmixing with Endmember Variability using Semi-supervised Partial Membership Latent Dirichlet Allocation
• An MCMC free approach to post-selective inference
• The Wetting/Layering transition for the low temperature Solid-On-Solid model
• qGaussian: Tools to Explore Applications of Tsallis Statistics
• Sampling from a pseudo selective posterior using a primal-dual approach
• Optimal stopping of one-dimensional diffusions with integral criteria
• Effective Evaluation using Logged Bandit Feedback from Multiple Loggers
• Deep Decentralized Multi-task Multi-Agent RL under Partial Observability
• TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals
• Sequential Monte Carlo Methods in the nimble R Package
• Cutoff for random to random card shuffle
• Preserving Data-Privacy with Added Noises: Optimal Estimation and Privacy Analysis
• A Unified Treatment of Multiple Testing with Prior Knowledge
• Identifying the Support of Rectangular Signals in Gaussian Noise
• Discriminative Distance-Based Network Indices and the Tiny-World Property
• Curriculum Dropout
• Recurrent Models for Situation Recognition
• Multi-fidelity Bayesian Optimisation with Continuous Approximations
• RoomNet: End-to-End Room Layout Estimation
• Towards Context-aware Interaction Recognition
• A Fast HOG Descriptor Using Lookup Table and Integral Image
• Single image super-resolution using self-optimizing mask via fractional-order gradient interpolation and reconstruction
• An Adaptive Framework to Tune the Coordinate Systems in Evolutionary Algorithms
• Non-Associative Learning Representation in the Nervous System of the Nematode Caenorhabditis elegans
• SIM-CE: An Advanced Simulink Platform for Studying the Brain of Caenorhabditis elegans
• An Automated Auto-encoder Correlation-based Health-Monitoring and Prognostic Method for Machine Bearings
• Distributed Stochastic Model Predictive Control Synthesis for Large-Scale Uncertain Linear System
• Evolving Game Skill-Depth using General Video Game AI Agents
• Unsupervised Learning of Mixture Regression Models for Longitudinal Data
• First- and Second-Order Hypothesis Testing for Mixed Memoryless Sources with General Mixture
• Recognition in-the-Tail: Training Detectors for Unusual Pedestrians with Synthetic Imposters
• Multi-talker Speech Separation and Tracing with Permutation Invariant Training of Deep Recurrent Neural Networks
• Symmetric powers of permutation representations of finite groups and primitive colorings on polyhedrons
• A wake-sleep algorithm for recurrent, spiking neural networks
• Hydrodynamic limit for the Ginzburg-Landau $\nablaφ$ interface model with non-convex potential
• A Fast Algorithm for a Weighted Low Rank Approximation
• Constrained Spacecraft Relative Motion Planning Exploiting Natural Motion Trajectories and Invariance
• Representability of Lyndon-Maddux relation algebras
• Hardware-Efficient Schemes of Quaternion Multiplying Units for 2D Discrete Quaternion Fourier Transform Processors
• Solving the Goddard problem by an influence diagram
• Spectrum Estimation from a Few Entries
• Functional Central Limit Theorem For Susceptible-Infected Process On Configuration Model Graphs
• Analysis of error control in large scale two-stage multiple hypothesis testing
• On Piercing Numbers of Families Satisfying the $(p,q)_r$ Property
• Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks
• Triangle-free induced subgraphs of polarity graphs
• Stability for hyperplane complements of type B/C and statistics on squarefree polynomials over finite fields
• Goal Conflict in Designing an Autonomous Artificial System
• Fully symmetric kernel quadrature
• Optimal Learning from Multiple Information Sources
• Weakly-supervised DCNN for RGB-D Object Recognition in Real-World Applications Which Lack Large-scale Annotated Training Data
• Minimal forcing sets for 1D origami
• An Initial Study on Load Forecasting Considering Economic Factors
• Cayley graphs on groups with commutator subgroup of order 2p are hamiltonian
• Probabilistic Models for Daily Peak Loads at Distribution Feeders
• Penalized pairwise pseudo likelihood for variable selection with nonignorable missing data
• Direct Monocular Odometry Using Points and Lines
• Schur positivity and log-concavity related to longest increasing subsequences
• Zero-Shot Learning by Generating Pseudo Feature Representations
• Multirole Logic (Extended Abstract)
• Discrete Invariants of Generically Inconsistent Systems of Laurent Polynomials
• Single Molecule Studies Under Constant Force Using Model Based Robust Control Design
• Power-spectrum of long eigenlevel sequences in quantum chaology
• Multilevel Context Representation for Improving Object Recognition
• TAC-GAN – Text Conditioned Auxiliary Classifier Generative Adversarial Network
• A Passivity-Based Distributed Reference Governor for Constrained Robotic Networks
• Spectral analysis of stationary random bivariate signals
• A Fully-Automated Pipeline for Detection and Segmentation of Liver Lesions and Pathological Lymph Nodes
• The Hardness of Embedding Grids and Walls
• Specht modules for quiver Hecke algebras of type $C$
• Semi-Supervised Learning with Competitive Infection Models
• Geometric tracking control of thrust vectoring UAVs
• Modeling and optimal control of HIV/AIDS prevention through PrEP
• Persistence exponents in Markov chains
• Deep Neural Networks for Semantic Segmentation of Multispectral Remote Sensing Imagery
• Almost Buchsbaumness of some rings arising from complexes with isolated singularities
• Regress-Later Monte Carlo for Optimal Inventory Control with applications in energy
• On $H^2$-gradient Flows for the Willmore Energy
• Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options
• Practical Coreset Constructions for Machine Learning
• Locating a robber with multiple probes
• Enhancing Physical Layer Security in Dual-Hop Multiuser Transmission
• Characterization theorems for $Q$-independent random variables with values in a locally compact Abelian group
• Near Optimal Hamiltonian-Control and Learning via Chattering
• Generating Multi-label Discrete Electronic Health Records using Generative Adversarial Networks
• A thermodynamic analysis of the spider silk and the importance of complexity
• Métodos de Otimização Combinatória Aplicados ao Problema de Compressão MultiFrases
• Bernoulli Rank-$1$ Bandits for Click Feedback
• An overview of the quantization for mixed distributions
• Vision-based Real-Time Aerial Object Localization and Tracking for UAV Sensing System
• The Structure of Extreme Level Sets in Branching Brownian Motion
• Optimally solving the joint order batching and picker routing problem
• The Relationship Between Agnostic Selective Classification Active Learning and the Disagreement Coefficient
• A Controlled Set-Up Experiment to Establish Personalized Baselines for Real-Life Emotion Recognition
• Scalable Content Delivery with Coded Caching in Multi-Antenna Fading Channels
• Nonexistence of Efficient Dominating Sets in the Cayley Graphs Generated by Transposition Trees of Diameter 3
• Truth-Telling Mechanism for Secure Two-Way Relay Communications with Energy-Harvesting Revenue
• On Helmholtz free energy for finite abstract simplicial complexes
• Object category understanding via eye fixations on freehand sketches
• Connected Dominating Sets in Graphs With Stability Number Three
• Using maximum entry-wise deviation to test the goodness-of-fit for stochastic block models
• Adaptive p-values after cross-validation
• Evidence Updating for Stream-Processing in Big-Data: Robust Conditioning in Soft and Hard Fusion Environments
• The Unheralded Value of the Multiway Rendezvous: Illustration with the Production Cell Benchmark
• Disaggregated Benders decomposition and lazy constraints for solving the budget-constrained dynamic uncapacitated facility location and network design problem
• Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
• Paper2vec: Citation-Context Based Document Distributed Representation for Scholar Recommendation
• Proceedings International Workshop on Formal Engineering approaches to Software Components and Architectures
• Fast Sequential Decoding of Polar Codes
• On the normalized Laplacian spectra of some subdivision joins of two graphs
• Full-Duplex Cooperative Cognitive Radio Networks with Wireless Energy Harvesting
• Correction to the paper ‘Some remarks on Davie’s uniqueness theorem’
• A New Class of Discrete-time Stochastic Volatility Model with Correlated Errors
• Algorithm for Optimization and Interpolation based on Hyponormality
• Near-optimal bounds for phase synchronization
• Asymptotic Performance of PCA for High-Dimensional Heteroscedastic Data
• Power Beacon-Assisted Millimeter Wave Ad Hoc Networks
• Cyclohedron and Kantorovich-Rubinstein polytopes
• Random Walk Among Mobile/Immobile Traps: A Short Review
• Shift-Coupling of Random Rooted Networks
• Anyonic self-induced disorder in a stabilizer code: quasi-many body localization in a translational invariant model
• Variational inference for probabilistic Poisson PCA
• Towards a Quantum World Wide Web
• Reoptimization of the Closest Substring Problem under Pattern Length Modification
• A Preferential Attachment Paradox: How does Preferential Attachment Combine with Growth to Produce Networks with Log-normal In-degree Distributions?
• Inference-Based Distributed Channel Allocation in Wireless Sensor Networks
• Automated positive part extraction for lattice path generating functions in the octant
• Moments of random multiplicative functions, I: Low moments, better than squareroot cancellation, and critical multiplicative chaos
• A Flexible Privacy-preserving Framework for Singular Value Decomposition under Internet of Things Environment
• Skill and reliability of seasonal forecasts for the Chinese energy sector
• On the discretization of the Onsager-Machlup functional
• Empirical Analysis of the Necessary and Sufficient Conditions of the Echo State Property
• The Same Analysis Approach: Practical protection against the pitfalls of novel neuroimaging analysis methods
• Parallel Sort-Based Matching for Data Distribution Management on Shared-Memory Multiprocessors
• A Systematic Study of Online Class Imbalance Learning with Concept Drift
• Copula Index for Detecting Dependence and Monotonicity between Stochastic Signals
• BFGS convergence to nonsmooth minimizers of convex functions
• QMDP-Net: Deep Learning for Planning under Partial Observability
• Independence clustering (without a matrix)
• A note on Hindman-type theorems for uncountable cardinals
• Nonlinear Perturbation of a Noisy Hamiltonian Lattice Field Model: Universality Persistence
• Generalized Compute-Compress-and-Forward
• Analysing the sensitivity of pollen based land-cover maps to different auxiliary variables
• Chance Constrained Optimal Power Flow with Primary Frequency Response
• Towards an orbifold generalization of Zvonkine’s $r$-ELSV formula
• On the effect of pooling on the geometry of representations
• On the conversion of multivalued gene regulatory networks to Boolean dynamics
• Flare forecasting at the Met Office Space Weather Operations Centre
• A strongly convergent numerical scheme from EnKF continuum analysis
• Strongly convex stochastic online optimization on a unit simplex with application to the mixing least square regression
• On the Use of Default Parameter Settings in the Empirical Evaluation of Classification Algorithms
• A robust convex optimization framework for autonomous network planning under load uncertainty
• A continuous spatio-temporal approach to estimate climate change
• Variance Reduced Stochastic Gradient Descent with Sufficient Decrease
• Worth Weighting? How to Think About and Use Sample Weights in Survey Experiments
• The geometry of hypothesis testing over convex cones: Generalized likelihood tests and minimax radii
• Geometries arising from trilinear forms on low-dimensional vector spaces
• The Role of Network Analysis in Industrial and Applied Mathematics
• Point-hyperplane frameworks, slider joints, and rigidity preserving transformations
• Boosting Dilated Convolutional Networks with Mixed Tensor Decompositions
• Modulus consensus in discrete-time signed networks and properties of special recurrent inequalities
• Analog Transmit Signal Optimization for Undersampled Delay-Doppler Estimation
• Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
• Mask R-CNN
• The SK model is Full-step Replica Symmetry Breaking at zero temperature
Graph-based semi-supervised learning is one of the most popular methods in machine learning. Some of its theoretical properties such as bounds for the generalization error and the convergence of the graph Laplacian regularizer have been studied in computer science and statistics literatures. However, a fundamental statistical property, the consistency of the estimator from this method has not been proved. In this article, we study the consistency problem under a non-parametric framework. We prove the consistency of graph-based learning in the case that the estimated scores are enforced to be equal to the observed responses for the labeled data. The sample sizes of both labeled and unlabeled data are allowed to grow in this result. When the estimated scores are not required to be equal to the observed responses, a tuning parameter is used to balance the loss function and the graph Laplacian regularizer. We give a counterexample demonstrating that the estimator for this case can be inconsistent. The theoretical findings are supported by numerical studies.
Since Alan Turing envisioned Artificial Intelligence (AI) , a major driving force behind technical progress has been competition with human cognition. Historical milestones have been frequently associated with computers matching or outperforming humans in difficult cognitive tasks (e.g. face recognition , personality classification , driving cars , or playing video games ), or defeating humans in strategic zero-sum encounters (e.g. Chess , Checkers , Jeopardy! , Poker , or Go ). In contrast, less attention has been given to developing autonomous machines that establish mutually cooperative relationships with people who may not share the machine’s preferences. A main challenge has been that human cooperation does not require sheer computational power, but rather relies on intuition , cultural norms , emotions and signals [13, 14, 15, 16], and pre-evolved dispositions toward cooperation , common-sense mechanisms that are difficult to encode in machines for arbitrary contexts. Here, we combine a state-of-the-art machine-learning algorithm with novel mechanisms for generating and acting on signals to produce a new learning algorithm that cooperates with people and other machines at levels that rival human cooperation in a variety of two-player repeated stochastic games. This is the first general-purpose algorithm that is capable, given a description of a previously unseen game environment, of learning to cooperate with people within short timescales in scenarios previously unanticipated by algorithm designers. This is achieved without complex opponent modeling or higher-order theories of mind, thus showing that flexible, fast, and general human-machine cooperation is computationally achievable using a non-trivial, but ultimately simple, set of algorithmic mechanisms.
Convolutional neural networks (CNNs) are inherently limited to model geometric transformations due to the fixed geometric structures in its building modules. In this work, we introduce two new modules to enhance the transformation modeling capacity of CNNs, namely, deformable convolution and deformable RoI pooling. Both are based on the idea of augmenting the spatial sampling locations in the modules with additional offsets and learning the offsets from target tasks, without additional supervision. The new modules can readily replace their plain counterparts in existing CNNs and can be easily trained end-to-end by standard back-propagation, giving rise to deformable convolutional networks. Extensive experiments validate the effectiveness of our approach on sophisticated vision tasks of object detection and semantic segmentation. The code would be released.
We propose and systematically evaluate three strategies for training dynamically-routed artificial neural networks: graphs of learned transformations through which different input signals may take different paths. Though some approaches have advantages over others, the resulting networks are often qualitatively similar. We find that, in dynamically-routed networks trained to classify images, layers and branches become specialized to process distinct categories of images. Additionally, given a fixed computational budget, dynamically-routed networks tend to perform better than comparable statically-routed networks.
Learning an encoding of feature vectors in terms of an over-complete dictionary or a probabilistic information geometric (Fisher vectors) construct is wide-spread in statistical signal processing and computer vision. In content based information retrieval using deep-learning classifiers, such encodings are learnt on the flattened last layer, without adherence to the multi-linear structure of the underlying feature tensor. We illustrate a variety of feature encodings incl. sparse dictionary coding and Fisher vectors along with proposing that a structured tensor factorization scheme enables us to perform retrieval that is at par, in terms of average precision, with Fisher vector encoded image signatures. In short, we illustrate how structural constraints increase retrieval fidelity.
Visual patterns represent the discernible regularity in the visual world. They capture the essential nature of visual objects or scenes. Understanding and modeling visual patterns is a fundamental problem in visual recognition that has wide ranging applications. In this paper, we study the problem of visual pattern mining and propose a novel deep neural network architecture called PatternNet for discovering these patterns that are both discriminative and representative. The proposed PatternNet leverages the filters in the last convolution layer of a convolutional neural network to find locally consistent visual patches, and by combining these filters we can effectively discover unique visual patterns. In addition, PatternNet can discover visual patterns efficiently without performing expensive image patch sampling, and this advantage provides an order of magnitude speedup compared to most other approaches. We evaluate the proposed PatternNet subjectively by showing randomly selected visual patterns which are discovered by our method and quantitatively by performing image classification with the identified visual patterns and comparing our performance with the current state-of-the-art. We also directly evaluate the quality of the discovered visual patterns by leveraging the identified patterns as proposed objects in an image and compare with other relevant methods. Our proposed network and procedure, PatterNet, is able to outperform competing methods for the tasks described.
Computer vision is one of the most active research fields in information technology today. Giving machines and robots the ability to see and comprehend the surrounding world at the speed of sight creates endless potential applications and opportunities. Feature detection and description algorithms can be indeed considered as the retina of the eyes of such machines and robots. However, these algorithms are typically computationally intensive, which prevents them from achieving the speed of sight real-time performance. In addition, they differ in their capabilities and some may favor and work better given a specific type of input compared to others. As such, it is essential to compactly report their pros and cons as well as their performances and recent advances. This paper is dedicated to provide a comprehensive overview on the state-of-the-art and recent advances in feature detection and description algorithms. Specifically, it starts by overviewing fundamental concepts. It then compares, reports and discusses their performance and capabilities. The Maximally Stable Extremal Regions algorithm and the Scale Invariant Feature Transform algorithms, being two of the best of their type, are selected to report their recent algorithmic derivatives.
As technology proceeds and the number of smart devices continues to grow substantially, need for ubiquitous context-aware platforms that support interconnected, heterogeneous, and distributed network of devices has given rise to what is referred today as Internet-of-Things. However, paving the path for achieving aforementioned objectives and making the IoT paradigm more tangible requires integration and convergence of different knowledge and research domains, covering aspects from identification and communication to resource discovery and service integration. Through this chapter, we aim to highlight researches in topics including proposed architectures, security and privacy, network communication means and protocols, and eventually conclude by providing future directions and open challenges facing the IoT development.
This article proposes a new graphical tool, the magnitude-shape (MS) plot, for visualizing both the magnitude and shape outlyingness of multivariate functional data. The proposed tool builds on the recent notion of functional directional outlyingness, which measures the centrality of functional data by simultaneously considering the level and the direction of their deviation from the central region. The MS-plot intuitively presents not only levels but also directions of magnitude outlyingness on the horizontal axis or plane, and demonstrates shape outlyingness on the vertical axis. A dividing curve or surface is provided to separate non-outlying data from the outliers. Both the simulated data and the practical examples confirm that the MS-plot is superior to existing tools for visualizing centrality and detecting outliers for functional data.
Taking image and question as the input of our method, it can output the text-based answer of the query question about the given image, so called Visual Question Answering (VQA). There are two main modules in our algorithm. Given a natural language question about an image, the first module takes the question as input and then outputs the basic questions of the main question, given question. The second module takes the main question, image and these basic questions as input and then outputs the text-based answer of the main question. We formulate the basic questions generation problem as a LASSO optimization problem, and also propose a criterion about how to exploit these basic questions to help answer main question. Our method is evaluated on the challenging VQA dataset, and yields the competitive performance compared to state-of-the-art.
We propose a new method for training iterative collective classifiers for labeling nodes in network data. The iterative classification algorithm (ICA) is a canonical method for incorporating relational information into classification. Yet, existing methods for training ICA models rely on the assumption that relational features reflect the true labels of the nodes. This unrealistic assumption introduces a bias that is inconsistent with the actual prediction algorithm. In this paper, we introduce recurrent collective classification (RCC), a variant of ICA analogous to recurrent neural network prediction. RCC accommodates any differentiable local classifier and relational feature functions. We provide gradient-based strategies for optimizing over model parameters to more directly minimize the loss function. In our experiments, this direct loss minimization translates to improved accuracy and robustness on real network data. We demonstrate the robustness of RCC in settings where local classification is very noisy, settings that are particularly challenging for ICA.
Most state-of-the-art text detection methods are specific to horizontal text in Latin scripts and are not fast enough for real-time applications. We introduce Segment Linking (SegLink), an oriented text detection method. The main idea is to decompose text into two locally detectable elements, namely segments and links. A segment is an oriented bounding box that covers a part of a word or text line; A link connects two adjacent segments, indicating that they belong to the same word or line. Both elements are detected densely at multiple scales by an end-to-end trained, fully-convolutional neural network. Final detections are the combinations of segments that are connected by links. Compared with previous methods, our method improves along the dimensions of accuracy, speed and ease of training. It achieves an f-measure of 75.0% on the standard ICDAR 2015 Incidental (Challenge 4) benchmark, outperforming the previous best by a large margin. It runs at over 20 FPS on 512×512 input images. In addition, our method is able to detect non-Latin text in long lines.
The massive amount of available data potentially used to discover patters in machine learning is a challenge for kernel based algorithms with respect to runtime and storage capacities. Local approaches might help to relieve these issues. From a statistical point of view local approaches allow additionally to deal with different structures in the data in different ways. This paper analyses properties of localized kernel based, non-parametric statistical machine learning methods, in particular of support vector machines (SVMs) and methods close to them. We will show there that locally learnt kernel methods are universal consistent. Furthermore, we give an upper bound for the maxbias in order to show statistical robustness of the proposed method.
Ensemble methods using multiple classifiers have proven to be the most successful approach for the task of Native Language Identification (NLI), achieving the current state of the art. However, a systematic examination of ensemble methods for NLI has yet to be conducted. Additionally, deeper ensemble architectures such as classifier stacking have not been closely evaluated. We present a set of experiments using three ensemble-based models, testing each with multiple configurations and algorithms. This includes a rigorous application of meta-classification models for NLI, achieving state-of-the-art results on three datasets from different languages. We also present the first use of statistical significance testing for comparing NLI systems, showing that our results are significantly better than the previous state of the art. We make available a collection of test set predictions to facilitate future statistical tests.
The advent of artificial intelligence has changed many disciplines such as engineering, social science and economics. Artificial intelligence is a computational technique which is inspired by natural intelligence such as the swarming of birds, the working of the brain and the pathfinding of the ants. These techniques have impact on economic theories. This book studies the impact of artificial intelligence on economic theories, a subject that has not been extensively studied. The theories that are considered are: demand and supply, asymmetrical information, pricing, rational choice, rational expectation, game theory, efficient market hypotheses, mechanism design, prospect, bounded rationality, portfolio theory, rational counterfactual and causality. The benefit of this book is that it evaluates existing theories of economics and update them based on the developments in artificial intelligence field.
We consider the problem of model selection and estimation in sparse high dimensional linear regression models with strongly correlated variables. First, we study the theoretical properties of the dual Lasso solution, and we show that joint consideration of the Lasso primal and its dual solutions are useful for selecting correlated active variables. Second, we argue that correlations among active predictors are not problematic, and we derive a new weaker condition on the design matrix, called Pseudo Irrepresentable Condition (PIC). Third, we present a new variable selection procedure, Dual Lasso Selector, and we prove that the PIC is a necessary and sufficient condition for consistent variable selection for the proposed method. Finally, by combining the dual Lasso selector further with the Ridge estimation even better prediction performance is achieved. We call the combination (DLSelect+Ridge), it can be viewed as a new combined approach for inference in high-dimensional regression models with correlated variables. We illustrate DLSelect+Ridge method and compare it with popular existing methods in terms of variable selection, prediction accuracy, estimation accuracy and computation speed by considering various simulated and real data examples.
This paper contributes a new large-scale dataset for weakly supervised cross-media retrieval, named Twitter100k. Current datasets, such as Wikipedia, NUS Wide and Flickr30k, have two major limitations. First, these datasets are lacking in content diversity, i.e., only some pre-defined classes are covered. Second, texts in these datasets are written in well-organized language, leading to inconsistency with realistic applications. To overcome these drawbacks, the proposed Twitter100k dataset is characterized by two aspects: 1) it has 100,000 image-text pairs randomly crawled from Twitter and thus has no constraint in the image categories; 2) text in Twitter100k is written in informal language by the users. Since strongly supervised methods leverage the class labels that may be missing in practice, this paper focuses on weakly supervised learning for cross-media retrieval, in which only text-image pairs are exploited during training. We extensively benchmark the performance of four subspace learning methods and three variants of the Correspondence AutoEncoder, along with various text features on Wikipedia, Flickr30k and Twitter100k. Novel insights are provided. As a minor contribution, inspired by the characteristic of Twitter100k, we propose an OCR-based cross-media retrieval method. In experiment, we show that the proposed OCR-based method improves the baseline performance.
The number of documents available into Internet moves each day up. For this reason, processing this amount of information effectively and expressibly becomes a major concern for companies and scientists. Methods that represent a textual document by a topic representation are widely used in Information Retrieval (IR) to process big data such as Wikipedia articles. One of the main difficulty in using topic model on huge data collection is related to the material resources (CPU time and memory) required for model estimate. To deal with this issue, we propose to build topic spaces from summarized documents. In this paper, we present a study of topic space representation in the context of big data. The topic space representation behavior is analyzed on different languages. Experiments show that topic spaces estimated from text summaries are as relevant as those estimated from the complete documents. The real advantage of such an approach is the processing time gain: we showed that the processing time can be drastically reduced using summarized documents (more than 60\% in general). This study finally points out the differences between thematic representations of documents depending on the targeted languages such as English or latin languages.
Translating information between text and image is a fundamental problem in artificial intelligence that connects natural language processing and computer vision. In the past few years, performance in image caption generation has seen significant improvement through the adoption of recurrent neural networks (RNN). Meanwhile, text-to-image generation begun to generate plausible images using datasets of specific categories like birds and flowers. We’ve even seen image generation from multi-category datasets such as the Microsoft Common Objects in Context (MSCOCO) through the use of generative adversarial networks (GANs). Synthesizing objects with a complex shape, however, is still challenging. For example, animals and humans have many degrees of freedom, which means that they can take on many complex shapes. We propose a new training method called Image-Text-Image (I2T2I) which integrates text-to-image and image-to-text (image captioning) synthesis to improve the performance of text-to-image synthesis. We demonstrate that %the capability of our method to understand the sentence descriptions, so as to I2T2I can generate better multi-categories images using MSCOCO than the state-of-the-art. We also demonstrate that I2T2I can achieve transfer learning by using a pre-trained image captioning module to generate human images on the MPII Human Pose
In this work we perform outlier detection using ensembles of neural networks obtained by variational approximation of the posterior in a Bayesian neural network setting. The variational parameters are obtained by sampling from the true posterior by gradient descent. We show our outlier detection results are better than those obtained using other efficient ensembling methods.
We present PEC, an Event Calculus (EC) style action language for reasoning about probabilistic causal and narrative information. It has an action language style syntax similar to that of the EC variant Modular-E. Its semantics is given in terms of possible worlds which constitute possible evolutions of the domain, and builds on that of EFEC, an epistemic extension of EC. We also describe an ASP implementation of PEC and show the sense in which this is sound and complete.
Convolutional Neural Networks (CNNs) have been successfully applied to many computer vision tasks, such as image classification. By performing linear combinations and element-wise nonlinear operations, these networks can be thought of as extracting solely first-order information from an input image. In the past, however, second-order statistics computed from handcrafted features, e.g., covariances, have proven highly effective in diverse recognition tasks. In this paper, we introduce a novel class of CNNs that exploit second-order statistics. To this end, we design a series of new layers that (i) extract a covariance matrix from convolutional activations, (ii) compute a parametric, second-order transformation of a matrix, and (iii) perform a parametric vectorization of a matrix. These operations can be assembled to form a Covariance Descriptor Unit (CDU), which replaces the fully-connected layers of standard CNNs. Our experiments demonstrate the benefits of our new architecture, which outperform the first-order CNNs, while relying on up to 90% fewer parameters.
Machine learning has matured to the point to where it is now being considered to automate decisions in loan lending, employee hiring, and predictive policing. In many of these scenarios however, previous decisions have been made that are unfairly biased against certain subpopulations (e.g., those of a particular race, gender, or sexual orientation). Because this past data is often biased, machine learning predictors must account for this to avoid perpetuating discriminatory practices (or incidentally making new ones). In this paper, we develop a framework for modeling fairness in any dataset using tools from counterfactual inference. We propose a definition called counterfactual fairness that captures the intuition that a decision is fair towards an individual if it gives the same predictions in (a) the observed world and (b) a world where the individual had always belonged to a different demographic group, other background causes of the outcome being equal. We demonstrate our framework on two real-world problems: fair prediction of law school success, and fair modeling of an individual’s criminality in policing data.
Deep Neural Networks (DNNs) have achieved remarkable performance on a variety of pattern-recognition tasks, particularly visual classification problems, where new algorithms reported to achieve or even surpass the human performance. In this paper, we test the state-of-the-art DNNs with negative images and show that the accuracy drops to the level of random classification. This leads us to the conjecture that the DNNs, which are merely trained on raw data, do not recognize the semantics of the objects, but rather memorize the inputs. We suggest that negative images can be thought as ‘semantic adversarial examples’, which we define as transformed inputs that semantically represent the same objects, but the model does not classify them correctly.