data mining Archives | NEC Labs America

Data Mining is the process of discovering patterns, relationships, and valuable insights from large sets of data. It involves the application of various techniques, such as statistical analysis, machine learning, and pattern recognition, to identify meaningful patterns or knowledge within the data.

Posts

Heterogeneous Graph Matching Networks for Unknown Malware Detection

August 16, 2019/in Publications/by NEC Labs America

Information systems are frequent targets of malware attacks. Traditional signature-based malicious program detection algorithms can only detect known malware and are prone to evasion techniques such as binary obfuscation, while behavior-based approaches rely heavily on malware training samples and incur prohibitively high training costs. To address the limitations of existing techniques, we propose MatchGNet, a heterogeneous Graph Matching Network model that learns the graph representation and the similarity metric simultaneously, based on invariant graph modeling of the program’s execution behaviors. We conduct a systematic evaluation of our model and show that it is accurate in detecting malicious program behavior and can help detect malware attacks with fewer false positives. MatchGNet outperforms state-of-the-art algorithms in malware detection, generating 50% fewer false positives while keeping zero false negatives.
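The similarity-based decision described above can be illustrated with a small sketch. It assumes each program's behavior graph has already been reduced to a fixed-length embedding vector, and it substitutes plain cosine similarity for the learned metric; the function names, the benign reference set, and the 0.8 threshold are all hypothetical and are not taken from the MatchGNet paper.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def is_suspicious(candidate, benign_refs, threshold=0.8):
    """Flag a program whose embedding is not close to any benign reference.

    `candidate` and each entry of `benign_refs` are hypothetical graph
    embeddings; `threshold` is an illustrative cut-off, not a published value.
    """
    best = max(cosine_similarity(candidate, ref) for ref in benign_refs)
    return best < threshold


# Hand-written vectors stand in for learned graph embeddings.
benign = [[1.0, 0.0], [0.9, 0.1]]
print(is_suspicious([1.0, 0.0], benign))  # matches a benign reference
print(is_suspicious([0.0, 1.0], benign))  # unlike every benign reference
```

In a trained model the embedding and the similarity function would be learned jointly, as the abstract states; the fixed cosine metric here only shows the shape of the final comparison.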

Spatio-Temporal Attentive RNN for Node Classification in Temporal Attributed Graphs

August 16, 2019/in Publications/by NEC Labs America

Node classification in graph-structured data aims to classify nodes when labels are available for only a subset of them. This problem has attracted considerable research effort in recent years. In real-world applications, both graph topology and node attributes evolve over time. Existing techniques, however, mainly focus on static graphs and lack the capability to learn temporal and spatial/structural features simultaneously. Node classification in temporal attributed graphs is challenging in two major respects. First, effectively modeling the spatio-temporal contextual information is hard. Second, because the temporal and spatial dimensions are entangled, learning the feature representation of a target node makes it desirable, yet challenging, to differentiate the relative importance of different factors, such as different neighbors and time periods. In this paper, we propose STAR, a spatio-temporal attentive recurrent network model, to deal with the above challenges. STAR extracts the vector representation of a neighborhood by sampling and aggregating local neighbor nodes. It then feeds both the neighborhood representation and the node attributes into a gated recurrent unit network to jointly learn the spatio-temporal contextual information. On top of that, we take advantage of the dual attention mechanism to perform a thorough analysis of the model's interpretability. Extensive experiments on real datasets demonstrate the effectiveness of the STAR model.
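The sample-and-aggregate step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the STAR implementation: uniform sampling with replacement, mean aggregation, the snapshot layout, and every function name are hypothetical choices, and the gated recurrent unit and attention layers are omitted.

```python
import random


def sample_neighbors(adj, node, k, rng):
    """Draw k neighbors uniformly with replacement; an isolated node samples itself."""
    neighbors = adj.get(node, [])
    if not neighbors:
        return [node] * k
    return [rng.choice(neighbors) for _ in range(k)]


def aggregate_mean(features, nodes):
    """Element-wise mean of the attribute vectors of `nodes`."""
    dim = len(features[nodes[0]])
    return [sum(features[n][d] for n in nodes) / len(nodes) for d in range(dim)]


def recurrent_inputs(snapshots, node, k, seed=0):
    """Build one input vector per time step for a recurrent network.

    `snapshots` is an assumed layout: a list of (adjacency dict, feature dict)
    pairs, one per time step. Each output vector concatenates the aggregated
    neighborhood representation with the node's own attributes.
    """
    rng = random.Random(seed)
    sequence = []
    for adj, features in snapshots:
        sampled = sample_neighbors(adj, node, k, rng)
        sequence.append(aggregate_mean(features, sampled) + list(features[node]))
    return sequence


# Two time steps: node 0 has one neighbor at t=0 and none at t=1.
snapshots = [
    ({0: [1], 1: [0]}, {0: [1.0, 0.0], 1: [0.0, 2.0]}),
    ({0: [], 1: []}, {0: [3.0, 3.0], 1: [0.0, 0.0]}),
]
for step in recurrent_inputs(snapshots, node=0, k=3):
    print(step)
```

The resulting per-step vectors are what a recurrent layer would consume in order; weighting neighbors and time steps by learned attention, as the abstract describes, would replace the unweighted mean used here.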

illiad: InteLLigent Invariant and Anomaly Detection in Cyber-Physical Systems

January 16, 2018/in Publications/by NEC Labs America

Cyber-physical systems (CPSs) are now ubiquitous in urban environments. Such systems serve as the backbone of numerous critical infrastructure applications, from smart grids to IoT installations. Scalable and seamless operation of such CPSs requires sophisticated tools for monitoring the time-series progression of the system, dynamically tracking relationships, and issuing alerts about anomalies to operators. We present an online monitoring system (illiad) that models the state of the CPS as a function of the relationships between its constituent components, using a combination of model-based and data-driven strategies. In addition to accurate inference for state estimation and anomaly tracking, illiad also exploits the underlying network structure of the CPS (wired or wireless) for state estimation purposes. We demonstrate the application of illiad to two diverse settings: a wireless sensor motes application and an IEEE 33-bus microgrid.
