Sentiment Analysis of Citations Using Word2vec

Citation sentiment analysis is an important task in scientific paper analysis. Existing machine learning techniques for citation sentiment analysis focus on labor-intensive feature engineering, which requires a large annotated corpus. As an automatic feature extraction tool, word2vec has been successfully applied to sentiment analysis of short texts. In this work, I conducted empirical research on the question: how well does word2vec work on the sentiment analysis of citations? The proposed method constructed sentence vectors (sent2vec) by averaging the word embeddings, which were learned from Anthology Collections (ACL-Embeddings). I also investigated polarity-specific word embeddings (PS-Embeddings) for classifying positive and negative citations. The sentence vectors formed a feature space, to which the examined citation sentence was mapped. Those features were input into classifiers (support vector machines) for supervised classification. Using a 10-fold cross-validation scheme, evaluation was conducted on a set of annotated citations. The results showed that word embeddings are effective for classifying positive and negative citations. However, hand-crafted features performed better for the overall classification.
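
The averaging step described in this abstract can be illustrated with a minimal sketch. It is not the author's implementation: the function name `sentence_vector`, the toy two-dimensional embeddings, and the choice to skip out-of-vocabulary tokens are all assumptions made for illustration. In the paper the embeddings are learned with word2vec and the resulting vectors are passed to a support vector machine.

```python
from typing import Dict, List, Sequence


def sentence_vector(
    tokens: Sequence[str],
    embeddings: Dict[str, List[float]],
    dim: int,
) -> List[float]:
    """Average the embeddings of the tokens that have one.

    Tokens missing from `embeddings` are skipped (an assumption of this
    sketch). If no token is known, a zero vector is returned.
    """
    known = [embeddings[t] for t in tokens if t in embeddings]
    if not known:
        return [0.0] * dim
    # Component-wise mean over the known word vectors.
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


# Hypothetical toy embeddings; real ones would come from a trained model.
toy_embeddings = {
    "good": [1.0, 0.0],
    "method": [0.0, 1.0],
    "poor": [-1.0, 0.0],
}

vec = sentence_vector(["a", "good", "method"], toy_embeddings, dim=2)
print(vec)  # [0.5, 0.5]
```

Each citation sentence is reduced to one fixed-length vector in this way, so sentences of different lengths can be fed to the same classifier.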

Understanding Deep Representations through Random Weights

We systematically study the deep representation of a random-weight CNN (convolutional neural network) using the DeCNN (deconvolutional neural network) architecture. We first fix the weights of an untrained CNN, and for each layer of its feature representation, we train a corresponding DeCNN to reconstruct the input image. Compared with a pre-trained CNN, the DeCNN trained on a random-weight CNN can reconstruct images more quickly and accurately, regardless of which random distribution is used for the CNN’s weights. This reveals that every layer of the random CNN can retain photographically accurate information about the image. We then leave the DeCNN untrained as well, i.e., the overall CNN-DeCNN architecture uses only random weights. Strikingly, for low-layer representations we can reconstruct all position information of the image, but the colors change. For high-layer representations, we can still capture the rough contours of the image. We also vary the number and the shape of the feature maps to gain more insight into the random function of the CNN-DeCNN structure. Our work reveals that the purely random CNN-DeCNN architecture substantially contributes to geometric and photometric invariance due to its intrinsic symmetry and invertible structure, but it discards the colorimetric information due to the random projection.

Detection and Resolution of Rumours in Social Media: A Survey

Despite the increasing use of social media platforms for information and news gathering, their unmoderated nature often leads to the emergence and spread of rumours, i.e., pieces of information that are unverified at the time of posting. At the same time, the openness of social media platforms provides opportunities to study how users share and discuss rumours, and to explore how natural language processing and data mining techniques may be used to find ways of determining their veracity. In this survey we introduce and discuss two types of rumours that circulate on social media: long-standing rumours that circulate for long periods of time, and newly emerging rumours spawned during fast-paced events such as breaking news, where reports are released piecemeal and often with an unverified status in their early stages. We provide an overview of research into social media rumours with the ultimate goal of developing a rumour classification system that consists of four components: rumour detection, rumour tracking, rumour stance classification and rumour veracity classification. We delve into the approaches presented in the scientific literature for the development of each of these four components. We summarise the efforts and achievements so far towards the development of rumour classification systems and conclude with suggestions for avenues for future research in social media mining for detection and resolution of rumours.

Big Holes in Big Data: A Monte Carlo Algorithm for Detecting Large Hyper-rectangles in High Dimensional Data

We present the first algorithm for finding holes in high-dimensional data that runs in polynomial time with respect to the number of dimensions. Previous algorithms are exponential. Finding large empty rectangles or boxes in a set of points in 2D and 3D space has been well studied. Efficient algorithms exist to identify the empty regions in these low-dimensional spaces. Unfortunately, such efficiency is lacking in higher dimensions, where the problem has been shown to be NP-complete when the dimensions are included in the input. Applications for algorithms that find large empty spaces include big data analysis, recommender systems, automated knowledge discovery, and query optimization. Our Monte Carlo-based algorithm discovers interesting maximal empty hyper-rectangles in cases where dimensionality and input size would otherwise make analysis impractical. The run-time is polynomial in the size of the input and the number of dimensions. We apply the algorithm to a 39-dimensional data set for protein structures and discover interesting properties that we think could not be inferred otherwise.

It Takes Two to Tango: Towards Theory of AI’s Mind

Theory of Mind is the ability to attribute mental states (beliefs, intents, knowledge, perspectives, etc.) to others and recognize that these mental states may differ from one’s own. Theory of Mind is critical to effective communication and to teams demonstrating higher collective performance. To effectively leverage the progress in Artificial Intelligence (AI) to make our lives more productive, it is important for humans and AI to work well together in a team. Traditionally, there has been much emphasis on research to make AI more accurate, and (to a lesser extent) on having it better understand human intentions, tendencies, beliefs, and contexts. The latter involves making AI more human-like and having it develop a theory of our minds. In this work, we argue that for human-AI teams to be effective, humans must also develop a theory of AI’s mind – get to know its strengths, weaknesses, beliefs, and quirks. We instantiate these ideas within the domain of Visual Question Answering (VQA). We find that using just a few examples (50), lay people can be trained to better predict responses and oncoming failures of a complex VQA model. Surprisingly, we find that having access to the model’s internal states – its confidence in its top-k predictions, explicit or implicit attention maps which highlight regions in the image (and words in the question) the model is looking at (and listening to) while answering a question about an image – does not help people better predict its behavior.

Opinion Mining on Non-English Short Text

On the Reliable Detection of Concept Drift from Streaming Unlabeled Data

Upper Bounds on the Runtime of the Univariate Marginal Distribution Algorithm on OneMax

Improved Training of Wasserstein GANs

Transfer of View-manifold Learning to Similarity Perception of Novel Objects

Efficient Registration of Pathological Images: A Joint PCA/Image-Reconstruction Approach

Robust Student’s t based Stochastic Cubature Filter for Nonlinear Systems with Heavy-tailed Process and Measurement Noises

Polychromatic Colorings on the Integers

Gromov-Hausdorff-Prokhorov convergence of vertex cut-trees of n-leaf Galton-Watson trees

Many edge-disjoint rainbow spanning trees in general graphs

Reading Wikipedia to Answer Open-Domain Questions

One-Shot Neural Cross-Lingual Transfer for Paradigm Completion

Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems

Exploiting gradients and Hessians in Bayesian optimization and Bayesian quadrature

Full and maximal squashed flat antichains of minimum weight

A multivariate variable selection approach for analyzing LC-MS metabolomics data

Geodesic Distance Histogram Feature for Video Segmentation

Algorithms for Routing of Unmanned Aerial Vehicles with Mobile Recharging Stations and for Package Delivery

Moderately Complex Paxos Made Simple: High-Level Specification of Distributed Algorithm

Efficient Asymmetric Co-Tracking using Uncertainty Sampling

Optimal Reconstruction with a Small Number of Views

Noether currents for higher-order variational problems of Herglotz type with time delay

Learning to Predict Indoor Illumination from a Single Image

Speed Trajectory Planning at Signalized Intersections with Sequential Convex Optimization

Customizing First Person Image Through Desired Actions

Gradient Flows in Uncertainty Propagation and Filtering of Linear Gaussian Systems

SafetyNet: Detecting and Rejecting Adversarial Examples Robustly

Online Geographical Load Balancing for Energy-Harvesting Mobile Edge Computing

Assortment Optimization under Unknown MultiNomial Logit Choice Models

Snapshot Ensembles: Train 1, get M for free

The spin-Brauer diagram algebra

Configurable, Photorealistic Image Rendering and Ground Truth Synthesis by Sampling Stochastic Grammars Representing Indoor Scenes

Ontological Multidimensional Data Models and Contextual Data Quality

Stochastic L-BFGS Revisited: Improved Convergence Rates and Practical Acceleration Strategies

A Multi-Index Markov Chain Monte Carlo Method

Psychological and Personality Profiles of Political Extremists

Thin graph classes and polynomial-time approximation schemes

Conic Relaxations for Power System State Estimation with Line Measurements

Topic modeling of public repositories at scale using names in source code

Multiple Instance Detection Network with Online Instance Classifier Refinement

Inverse Fractional Knapsack Problem with Profits and Costs Modification

Binomial edge ideals of bipartite graphs

Computations of volumes and Ehrhart series in four candidates elections

(t,q) Q-systems, DAHA and quantum toroidal algebras via generalized Macdonald operators

Real-World Recommender Systems for Academia: The Pain and Gain in Building, Operating, and Researching them [Long Version]

Clustering-based Source-aware Assessment of True Robustness for Learning Models

Compositional Human Pose Regression

Latency Optimization for Resource Allocation in Mobile-Edge Computation Offloading

Decisive length scale for field dependence of hopping charge transport in organic semiconductors

Iterated stochastic processes: simulation and relationship with high order partial differential equations

Optimal Scheduling of Downlink Communication for a Multi-Agent System with a Central Observation Post

Complexity-Aware Assignment of Latent Values in Discriminative Models for Accurate Gesture Recognition

The random spanning tree on ladder-like graphs

Model selection and model averaging in MACML-estimated MNP models

Stochastic and Chance-Constrained Conic Distribution System Expansion Planning Using Bilinear Benders Decomposition

Modeling trait-dependent evolution on a random species tree

Robust Regulation of MIMO systems: A Reformulation of the Internal Model Principle

Faster Subgradient Methods for Functions with Hölderian Growth

iWinrNFL: A Simple and Well-Calibrated In-Game NFL Win Probability Model

Multimodal Dialogs (MMD): A large-scale dataset for studying multimodal domain-aware conversations

A Brownian Motion Model and Extreme Belief Machine for Modeling Sensor Data Measurements

A categorical approach to the maximum theorem

Nonparametric causal effects based on incremental propensity score interventions

Three-dimensional Catalan numbers and product-coproduct prographs

Adversarial Connective-exploiting Networks for Implicit Discourse Relation Classification

The universal triangle-free graph has finite big Ramsey degrees

Fair Allocation of Indivisible Goods: Improvement and Generalization

PSPO: Parallel Simultaneous Perturbation Optimization

A Time-Frequency Domain Approach of Heart Rate Estimation From Photoplethysmographic (PPG) Signal

Sequential Learning of Analysis Operators

Dense point sets with many halving lines

Clustering in Hilbert space of a quantum optimization problem

Crime Prediction by Data-Driven Green’s Function method

Lossy Asymptotic Equipartition property for Wireless Sensor Networks

Toughness and spanning trees in $K_4$-minor-free graphs

Compressed Covariance Estimation With Automated Dimension Learning

A-Lamp: Adaptive Layout-Aware Multi-Patch Deep Convolutional Neural Network for Photo Aesthetic Assessment

Complexity of short Presburger arithmetic

Private Multi-File Retrieval From Distributed Databases

Building a Neural Machine Translation System Using Only Synthetic Parallel Data

Non-Analytic Solution to the Fokker-Planck Equation of Fractional Brownian Motion via Laplace Transforms

A simple characterization of tightness for convex solid sets of positive random variables

Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks

Potential Functions based Sampling Heuristic For Optimal Path Planning

Bi-universality characterizes a realistic spatial network model

SAR image despeckling through convolutional neural networks

The Stixel world: A medium-level representation of traffic scenes

Optimal Average Satisfaction and Extended Justified Representation in Polynomial Time

Tropical Limits of Probability Spaces, Part I: The Intrinsic Kolmogorov-Sinai Distance and the Asymptotic Equipartition Property for Configurations

Efficient Version-Space Reduction for Visual Tracking

The Optimal Error Bound for the Method of Simultaneous Projections

The level-crossing intensity for the density of the image of the Lebesgue measure under the action of a Brownian stochastic flow

Variational characterization of the regularity of Monge-Brenier maps

Inference for the cross-covariance operator of stationary functional time series

Survey of Game Theory and Future Trends with Application to Emerging Wireless Data Communication Networks

Structured Parallel Programming for Monte Carlo Tree Search

People Counting in Crowded and Outdoor Scenes using an Hybrid Multi-Camera Approach

A Geometric Approach to Rotor Failure Tolerant Trajectory Tracking Control Design for a Quadrotor

Branching diffusion representation of quasi-linear elliptic PDEs and estimation using Monte Carlo method

Restoration of Images with Wavefront Aberrations

Dense Multi-view 3D-reconstruction Without Dense Correspondences

Risk-averse model predictive control

Tomaszewski’s Problem on Randomly Signed Sums: Breaking the 3/8 Barrier

Two-exponential models of gene expression patterns for noisy experimental data

Simple Measures of Individual Cluster-Membership Certainty for Hard Partitional Clustering

Local Guarantees in Graph Cuts and Clustering

Committees providing EJR can be computed efficiently

Understanding Concept Drift

Provable Inductive Robust PCA via Iterative Hard Thresholding

Cash-settled options for wholesale electricity markets

Spectral approximation of fractional PDEs in image processing and phase field modeling

Learning in anonymous nonatomic games with applications to first-order mean field games

Word-Alignment-Based Segment-Level Machine Translation Evaluation using Word Embeddings

Identifying networks with common organizational principles

Hidden Two-Stream Convolutional Networks for Action Recognition

Geometric loss functions for camera pose regression with deep learning

A two-stage working model strategy for network analysis under Hierarchical Exponential Random Graph Models

Exploring Choice Overload in Related-Article Recommendations in Digital Libraries

A growing length-scale in supercooled liquids: Cluster formation induced by local densification

A Message-Passing Algorithm for Graph Isomorphism

A New Capacity Scaling Law in Ultra-Dense Networks

A Class of Temporal Hierarchical Exponential Random Graph Models for Longitudinal Network Data

Quantum multiplication operators for Lagrangian and orthogonal Grassmannians

Syntax Aware LSTM Model for Chinese Semantic Role Labeling

Sparse Autoencoder for Unsupervised Nucleus Detection and Representation in Histopathology Images

Kolmogorov bounds for the normal approximation of the number of triangles in the Erdos-Renyi random graph

A Survey of Distributed Message Broker Queues

Error bounds for monomial convexification in polynomial optimization

Galerkin approximations of nonlinear optimal control problems in Hilbert spaces

Scaling limit of a river delta to a continuum random tree: a Brownian web approach

Multi-frequency sparse Bayesian learning with uncertainty models

A Good Practice Towards Top Performance of Face Recognition: Transferred Deep Feature Fusion

Combining Lexical and Syntactic Features for Detecting Content-dense Texts in News

On Kernelized Multi-armed Bandits

Learning a Variational Network for Reconstruction of Accelerated MRI Data

CRLB Calculations for Joint AoA, AoD and Multipath Gain Estimation in Millimeter Wave Wireless Networks

Clustering in Hilbert simplex geometry

Joint Design of Digital and Analog Processing for Downlink C-RAN with Large-Scale Antenna Arrays

Phase transition in inhomogeneous Erdős-Rényi random graphs via tree counting

Convergence in First Passage Percolation with nonidentical passage times

Duality in percolation via outermost boundaries I: Bond Percolation

Finding and using expanders in locally sparse graphs

Approximately certifying the restricted isometry property is hard

Stop That Join! Discarding Dimension Tables when Learning High Capacity Classifiers

On the independence number of graphs related to a polarity

The relationship between the size of camphor driven rotor and its angular velocity

A Comparison of Directional Distances for Hand Pose Estimation

Matching Connectivity: On the Structure of Graphs with Perfect Matchings

UPGMA and the normalized equidistant minimum evolution problem

Convolutional neural networks for segmentation and object detection of human semen

Truncating Wide Networks using Binary Tree Architectures

Optimizing Communication by Compression for Multi-GPU Scalable Breadth-First Searches

Multi-Task Learning of Keyphrase Boundary Classification

Capturing Hand Motion with an RGB-D Sensor, Fusing a Generative Model with Salient Points

Diffusive systems and weighted Hankel operators

Efficient acquisition rules for model-based approximate Bayesian computation

Power Control in Massive MIMO with Dynamic User Population

Clustered multi-state models with observation-level random effects, mover-stayer effects and dynamic covariates: Modelling transition intensities and sojourn times in a study of psoriatic arthritis

Block-Matching Convolutional Neural Network for Image Denoising

Truthfulness in Repeated Predictions

3D Object Reconstruction from Hand-Object Interactions

Admissibility of invariant tests for means with covariates

On the Aubin property of a class of parameterized variational systems

Dictionary-based Tensor Canonical Polyadic Decomposition

Mixture Hidden Markov Models for Sequence Data: The seqHMM Package in R

On Chordal Graph and Line Graph Squares

AutoSVD++: An Efficient Hybrid Collaborative Filtering Model via Contractive Auto-encoders

A Transition-Based Directed Acyclic Graph Parser for UCCA

Neural Lattice-to-Sequence Models for Uncertain Inputs

Dynamic Planar Embeddings of Dynamic Graphs

A new scope of penalized empirical likelihood with high-dimensional estimating equations

A parametric level-set method for partially discrete tomography

Spatiotemporal Networks for Video Emotion Recognition

Sparse mean localization by information theory

Analysis, detection and correction of misspecified discrete time state space models

Causality and surrogate variable analysis

Large deviation principle for random matrix products

Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection

Massive MIMO Performance – TDD Versus FDD: What Do Measurements Say?

Uncertainty and sensitivity analysis of functional risk curves based on Gaussian processes

Doubly Reflected BSDEs and ${\cal E}^{f}$-Dynkin games: beyond the right-continuous case

Optimal lower bounds for universal relation, and for samplers and finding duplicates in streams

Semi-Supervised Generation with Cluster-aware Generative Models

Symmetric motifs in random geometric graphs

Local nearest neighbour classification with applications to semi-supervised learning

A correlation game for unsupervised learning yields computational interpretations of Hebbian excitation, anti-Hebbian inhibition, and synapse elimination

Distributed FD-MIMO: Cellular Evolution for 5G and Beyond

Soft-to-Hard Vector Quantization for End-to-End Learned Compression of Images and Neural Networks

A Central Limit Theorem for Vincular Permutation Patterns

Fast Encoding and Decoding of Flexible-Rate and Flexible-Length Polar Codes

Transfer-Matrix Methods meet Ehrhart Theory

On quadratic variation in the Skorokhod space

Channel Feedback Based on AoD-Adaptive Subspace Codebook in FDD Massive MIMO Systems

Polar Codes over Fading Channels with Power and Delay Constraints

Investigating consumers’ store-choice behavior via hierarchical variable selection

Causal inference with observational studies trimmed by the estimated propensity scores

The 2017 DAVIS Challenge on Video Object Segmentation

A Consistent Bayesian Formulation for Stochastic Inverse Problems Based on Push-forward Measures

Quasifree stochastic cocycles and quantum random walks

Index Coding: Rank-Invariant Extensions

Quantum advantage with shallow circuits

Loop Tiling in Large-Scale Stencil Codes at Run-time with OPS

Limiting shape of the Depth First Search tree in an Erdős-Rényi graph

Multi-rendezvous Spacecraft Trajectory Optimization with Beam P-ACO

Graph Partitioning with Acyclicity Constraints

Critical classes, Kronecker products of spin characters, and the Saxl conjecture

No Spurious Local Minima in Nonconvex Low Rank Problems: A Unified Geometric Analysis

Hierarchical Surface Prediction for 3D Object Reconstruction

A DG-extension of symmetric functions arising from higher representation theory

Convolutional Polar Codes

An example of a deterministic cellular automaton exhibiting linear-exponential convergence to the steady state