Data Science and System Security | Haifeng Chen

HOME

PROJECTS

PEOPLE

PUBLICATIONS

PATENTS

Haifeng Chen

Haifeng Chen

Department Head

Data Science and System Security

Projects

AI for Space

Overview: From planetary image analysis to spacecraft monitoring, AI is becoming an increasingly important tool in space exploration and development. Our AI-powered monitoring solution will perform intricate checks to ensure the spacecraft and satellites operate correctly during the production and operation phases.

AIOPs: Evoking Intelligence in Operations

Overview: IT operation is one of the technological foundations of the increasingly digitalized world. It is responsible for ensuring that digitalized businesses and societies run reliably, efficiently and safely. With the rapid advances in networking, computers, and hardware, we face an explosive growth of complexity in networked applications and information services. These large-scale, often distributed, information systems usually consist of a great variety of components that work together in a highly complex, coordinated, and evolving manner.

Complex System Modeling and Optimization

Overview: With ubiquitous sensing and networking capability, traditional complex physical systems have been undergoing revolutionary changes in their ICT capabilities. They are now equipped with a large number of sensors distributed across different parts of the system, which collect a tremendous amount of data from system operation.

DDA: Deep Document Analysis

Overview: Unstructured data is growing at an unprecedented rate, valuable knowledge, including findings, observations, business demand, opportunities, is widely recorded as texts in documents. We are developing advanced analysis engines for mining text data in documents, aiming to discover valuable knowledge from large-scale documents and provide informed decision-making for users.

Dynamic Graph Analysis

Overview: In many big data applications, data with complex structures are connected for their explicit/implicit interactions and are naturally represented as graphs/networks. The world is full of complex and dynamic interactions between diverse objects. The flood of dynamic graph data poses great computational challenges and entails interdisciplinary collaborations.

Multimodal Data Analysis

Overview: Multimodal data are prevalent in industrial monitoring, finance and healthcare. In particular, time series are often tagged with text comments from experts that provide layman users with the domain knowledge to understand the charts. Texts give the patterns qualitative meaning, while time series makes the words quantitative. Analyzing the relationship between different data types is the key to unraveling the hidden structure of such data.

Safe and Trustworthy AI

Overview: By leveraging big data and deep learning, in recent years, AI technologies have made significant progress. They have been adopted in many applications, including malware detection, image classification, and stock market prediction. As our society becomes more automated, more and more systems will rely on AI techniques. And instead of augmenting human decisions, some AI systems will make their own decisions and execute autonomously.

Skill Acquisition Learning (SAiL)

Overview: This project aims to learn skills by mimicking experts’ behaviors in given tasks. The proposed SAiL engine is trained to perform action prediction tasks from demonstrations by learning a mapping function between observed states and actions. The main challenges in real applications, medical and health care, for example, are that the collection of such experts’ demonstrations is very expensive and It takes a large amount of time and money for expert training.

Time Series Language Model for Explainable AI

Overview: We are developing an advanced multi-modal forecasting system that utilizes both time series data and textual data, such as news articles, to predict future trends and events. This innovative system integrates advanced time series backbone models with large language models (LLMs), combining the strengths of statistical analysis and machine learning techniques.

Time Series Sensor Data Analysis

Overview: With ubiquitous sensing and networking capability, traditional complex physical systems have been undergoing revolutionary changes in their ICT capabilities. They are now equipped with a large number of sensors distributed across different parts of the system, which collect a tremendous amount of data from system operation.

Publications

Evidence-Based Out-of-Distribution Detection on Multi-Label Graphs

May 3, 2025/SIAM International Conference on Data Mining (SDM 2025), Alexandria, VA

The Out-of-Distribution (OOD) problem in graph-structured data is becoming increasingly important in various areas of research and applications, including social network recommendation [36], protein function detection [9, 21], etc. Furthermore, owing to the inherent multi-label properties of nodes, multi-label

Position Really Matters: Towards a Holistic Approach for Prompt Tuning

April 30, 2025/2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025)

Prompt tuning is highly effective in efficiently extracting knowledge from foundation models, encompassing both language, vision, and vision-language models. However, the efficacy of employing fixed soft prompts with a predetermined position for concatenation with inputs for all instances, irrespective

MixLLM: Dynamic Routing in Mixed Large Language Models

April 29, 2025/2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025)

Large Language Models (LLMs) exhibit potential artificial generic intelligence recently, however, their usage is costly with high response latency. Given mixed LLMs with their own strengths and weaknesses, LLM routing aims to identify the most suitable model for each query in the stream to maximize response

DISC: Dynamic Decomposition Improves LLM Inference Scaling (SSI-FM)

April 28, 2025/ICLR Workshop on Scaling Self-Improving Foundation Models without Human Supervision (SSI-FM) at ICLR 2025

Inference scaling methods often rely on decomposing problems into steps, followed by sampling and selecting the best next steps. However, these steps and their sizes are typically fixed or depend on domain knowledge. We propose dynamic decomposition, a method that adaptively and automatically breaks

DISC: Dynamic Decomposition Improves LLM Inference Scaling (DL4C)

April 28, 2025/Third Workshop on Deep Learning for Code (DL4C) at ICLR 2025

Inference scaling methods often rely on decomposing problems into steps, followed by sampling and selecting the best next steps. However, these steps and their sizes are typically fixed or depend on domain knowledge. We propose dynamic decomposition, a method that adaptively and automatically breaks

Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors

April 28, 2025/The Thirteenth International Conference on Learning Representations (ICLR 2025)

The advent of large language models (LLMs) has revolutionized the field of text generation, producing outputs that closely mimic human-like writing. Although academic and industrial institutions have developed detectors to prevent the malicious usage of LLM-generated texts, other research has doubt about

SFS: Smarter Code Space Search improves LLM Inference Scaling

April 28, 2025/The Thirteenth International Conference on Learning Representations (ICLR 2025)

We frame code generation as a black-box optimization problem within the code space and demonstrate how optimization-inspired techniques can enhance inference scaling. Based on this perspective, we propose SCATTERED FOREST SEARCH (SFS), a novel approach that improves solution diversity and better exploits

Chain-of-region: Visual Language Models Need Details for Diagram Analysis

April 25, 2025/The Thirteenth International Conference on Learning Representations

Visual Language Models (VLMs) like GPT-4V have broadened the scope of LLM applications, yet they face significant challenges in accurately processing visual details, particularly in scientific diagrams. This paper explores the necessity of meticulous visual detail collection and region decomposition

TSLA: Unified Time Series and Language Model

April 10, 2025/2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)

Real-world time series data often require analysis or interpretation from domain experts. Some tasks, like time series question answering, involve both time series and natural language questions, posing challenges for single-modality language models to understand their interaction. To this end, we present

Graph Neural Networks, Explained: Our Role in the Future of AI

April 9, 2025

NEC Laboratories America (NECLA) is advancing the frontier of Graph Neural Networks (GNNs), a transformative AI technology that processes complex, interconnected data. Through innovations like PTDNet for robust learning, novel frameworks for explainability, StrGNN for anomaly detection in dynamic graphs,

TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents

March 4, 2025/The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025)

Time series data is essential in various applications, including climate modeling, healthcare monitoring, and financial analytics. Understanding the contextual information associated with real-world time series data is often essential for accurate and reliable event predictions. In this paper, we introduce

Incident Diagnosing and Reporting System based on Retrieval Augmented Large Language Model

March 3, 2025/The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025)

The Internet-of-Things (IoT) is widely used in many applications such as smart city, transportation, healthcare, and environment monitoring. A key task of IoT maintenance is to analyze the abnormal sensor records and generate incident report. Traditionally, domain experts engage in such labor intensive

Improving Logits-based Detector without Logits from Black-box LLMs

December 9, 2024/The Thirty-eighth Annual Conference on Neural Information Processing Systems

The advent of Large Language Models (LLMs) has revolutionized text generation, producing outputs that closely mimic human writing. This blurring of lines between machine- and human-written text presents new challenges in distinguishing one from the other a task further complicated by the frequent

A Survey on Detection of LLMs-Generated Content

November 13, 2024/The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

The burgeoning capabilities of advanced large language models (LLMs) such as ChatGPT have led to an increase in synthetic content generation with implications across a variety of sectors, including media, cybersecurity, public discourse, and education. As such, the ability to detect LLMs-generated content

InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration (EMNLP 2024)

November 13, 2024/The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

Large Language Models (LLMs) have achieved exceptional capabilities in open generation across various domains, yet they encounter difficulties with tasks that require intensive knowledge. To address these challenges, methods for integrating knowledge have been developed, which augment LLMs with domain-specific

Large Language Models Can Be Contextual Privacy Protection Learners

November 13, 2024/The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

The proliferation of Large Language Models (LLMs) has driven considerable interest in fine-tuning them with domain-specific data to create specialized language models. Nevertheless, such domain-specific fine-tuning data often contains contextually sensitive personally identifiable information (PII).

The WizARd and Apprentice: An Augmented Reality Expert Capture System

October 25, 2024/The 23rd IEEE International Symposium on Mixed and Augmented Reality (ISMAR 2024), Bellevue, WA

Learning to perform physical tasks is ubiquitous yet challenging without expert guidance. While Augmented Reality (AR) has been adopted to overlay instructions directly onto the physical context, the natural authoring of such content remains unexplored. To address this, we developed WizARd and Apprentice,

PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization

August 29, 2024/30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024)

Achieving carbon neutrality within industrial operations has become increasingly imperative for sustainable development. It is both a significant challenge and a key opportunity for operational optimization in industry 4.0. In recent years, Deep Reinforcement Learning (DRL) based methods offer promising

InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration (VLDB 2024)

August 28, 2024/International Workshop on LLM+KG: Data Management Opportunities in Unifying Large Language Models+Knowledge Graphs in conjunction with VLDB 2024, Guangzhou, China

Though Large Language Models (LLMs) have shown remarkable open-generation capabilities across diverse domains, they struggle with knowledge-intensive tasks. To alleviate this issue, knowledge integration methods have been proposed to enhance LLMs with domain-specific knowledge graphs using external modules.

POND: Multi-Source Time Series Domain Adaptation with Information-Aware Prompt Tuning

August 27, 2024/30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024)

Time series domain adaptation stands as a pivotal and intricate challenge with diverse applications, including but not limited to human activity recognition, sleep stage classification, and machine fault diagnosis. Despite the numerous domain adaptation techniques proposed to tackle this complex problem,

Distantly-Supervised Joint Extraction with Noise-Robust Learning

August 16, 2024/The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand

Joint entity and relation extraction is a process that identifies entity pairs and their relations using a single model. We focus on the problem of joint extraction in distantly-labeled data,whose labels are generated by aligning entity mentions with the corresponding entity and relation tags using a

Towards Counterfactual Fairness-aware Domain Generalization in Changing Environments

August 9, 2024/IJCAI 2024 - The 33rd International Joint Conference on Artificial Intelligence, Jeju, South Korea

Recognizing domain generalization as a commonplace challenge in machine learning, data distribution might progressively evolve across a continuum of sequential domains in practical scenarios. While current methodologies primarily concentrate on bolstering model effectiveness within these new domains,

DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton

July 27, 2024/The Forty-first International Conference on Machine Learning (ICML 2024), Vienna, Austria

This paper introduces the retrieval-augmented large language model with Definite Finite Automaton (DFA-RAG), a novel framework designed to enhance the capabilities of conversational agents using large language models (LLMs). Traditional LLMs face challenges in generating regulated and compliant responses

RIO-CPD: A Riemannian Geometric Method for Correlation-aware Online Change Point Detection

July 25, 2024/Geometry-grounded Representation Learning and Generative Modeling Workshop (ICML 2024)

The objective of change point detection is to identify abrupt changes at potentially multiple points within a data sequence. This task is particularly challenging in the online setting where various types of changes can occur, including shifts in both the marginal and joint distributions of the data.

Knowledge-enhanced Prompt Learning for Open-domain Commonsense Reasoning

July 3, 2024/NEC Technical Journal, Special Issue on Revolutionizing Business Practices with Generative AI

Neural language models for commonsense reasoning often formulate the problem as a QA task and make predictions based on learned representations of language after fine-tuning. However, without providing any fine-tuning data and pre-defined answer candidates, can neural language models still answer commonsense

Pruning as a Domain-specific LLM Extractor

June 20, 2024/2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024), Mexico City, Mexico

Large Language Models (LLMs) have exhibited remarkable proficiency across a wide array of NLP tasks. However, the escalation in model size also engenders substantial deployment costs. While few efforts have explored model pruning techniques to reduce the size of LLMs, they mainly center on general or

Uncertainty Quantification for In-Context Learning of Large Language Models

June 20, 2024/2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024), Mexico City, Mexico

In-context learning has emerged as a groundbreaking ability of Large Language Models (LLMs) and revolutionized various fields by providing a few task-relevant demonstrations in the prompt. However, trustworthy issues with LLMs response, such as hallucination, have also been actively discussed. Existing

Advancing Sustainability in Global Supply Chains through Agent-based Simulation

May 30, 2024/The Eighteenth International Conference on Digital Society (ICDS 2024)

In today’s world, with its complex global supply chains, the difficulties and uncertainties we face offer both challenges and opportunities for making things better, especially in terms of efficiency and sustainability. These challenges grow due to unpredictable events, such as natural disasters, unexpected

MULAN: Multi-modal Causal Structure Learning and Root Cause Analysis for Microservice Systems

May 17, 2024/The Web Conference 2024 (WWW 2024)

Effective root cause analysis (RCA) is vital for swiftly restoring services, minimizing losses, and ensuring the smooth operation and management of complex systems. Previous data-driven RCA methods, particularly those employing causal discovery techniques, have primarily focused on constructing dependency

DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text

May 11, 2024/12th International Conference on Learning Representations (ICLR 2024)

Large language models (LLMs) have notably enhanced the fluency and diversity of machine-generated text. However, this progress also presents a significant challenge in detecting the origin of a given text, and current research on detection methods lags behind the rapid evolution of LLMs. Conventional

Improving Open Information Extraction with Large Language Models: A Study on Demonstration Uncertainty

May 11, 2024/ICLR 2024 Workshop on Reliable and Responsible Foundation Models

Open Information Extraction (OIE) task aims at extracting structured facts from unstructured text, typically in the form of (subject, relation, object) triples. Despite the potential of large language models (LLMs) like ChatGPT as a general task solver, they lag behind state-of-the-art (supervised) methods

Parametric Augmentation for Time Series Contrastive Learning

May 11, 2024/12th International Conference on Learning Representations (ICLR 2024)

Modern techniques like contrastive learning have been effectively used in many areas, including computer vision, natural language processing, and graph-structured data. Creating positive examples that assist the model in learning robust and discriminative representations is a crucial stage in contrastive

Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks

May 11, 2024/12th International Conference on Learning Representations (ICLR 2024)

Graph Neural Networks (GNNs) are neural models that leverage the dependency structure in graphical data via message passing among the graph nodes. GNNs have emerged as pivotal architectures in analyzing graph-structured data, and their expansive application in sensitive domains requires a comprehensive

Multi-Agent Simulator for Carbon Neutrality: The Technology the World Has Been Waiting For

April 24, 2024

Today, each country, government, and enterprise are urged to take effective action to fight against climate change; however, an efficient method has not been found. Even a way to accurately calculate Scope 3 carbon emissions has yet to be developed. The technology of a multi-agent simulator could be

Dynamic Causal Discovery in Imitation Learning

March 4, 2024/The 17th ACM International Conference on Web Search and Data Mining (WSDM 2024), Merida, Yucatan, Mexico

Imitation learning, which learns agent policy by mimicking expert demonstration, has shown promising results in many applications such as medical treatment regimes and self-driving vehicles. However, it remains a difficult task to interpret control policies learned by the agent. Difficulties mainly come

Prompt-based Domain Discrimination for Multi-source Time Series Domain Adaptation

December 21, 2023/arxiv.org

Time series domain adaptation stands as a pivotal and intricate challenge with diverse applications, including but not limited to human activity recognition, sleep stage classification, and machine fault diagnosis. Despite the numerous domain adaptation techniques proposed to tackle this complex problem,

Hierarchical Gaussian Mixture based Task Generative Model for Robust Meta-Learning

December 16, 2023/Neural Information Processing Systems (NeurIPS 2023), New Orleans, LA

Meta-learning enables quick adaptation of machine learning models to new tasks with limited data. While tasks could come from varying distributions in reality, most of the existing meta-learning methods consider both training and testing tasks as from the same uni-component distribution, overlooking

Open-Ended Commonsense Reasoning with Unrestricted Answer Scope

December 10, 2023/Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore

Open-ended Commonsense Reasoning is defined as solving a commonsense question without providing 1) a short list of answer candidates and 2) a pre-defined answer scope. Conventional ways of formulating the commonsense question into a question-answering form or utilizing external knowledge to learn retrieval-based

NEC Labs America Team Heading to NeurIPS23 in New Orleans

December 7, 2023

NEC Labs America is proud to be a Silver Sponsor for NeurIPS 2023 in New Orleans from December 10-16. Visit our booth to meet our team and learn about our intern opportunities in machine learning, data science, media analytics and integrated systems. Also, our Vijay Kumar.B.G, Samuel Schulter & Manmohan

GLAD: Content-Aware Dynamic Graphs for Log Anomaly Detection

December 2, 2023/IEEE International Conference On Knowledge Graph (ICKG-2023), Shanghai, China

Logs play a crucial role in system monitoring and debugging by recording valuable system information, including events and status. Although various methods have been proposed to detect anomalies in log sequences, they often overlook the significance of considering relationships among system components,

Adaptation Speed Analysis for Fairness-Aware Causal Models

October 25, 2023/32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)

For example, in machine translation tasks, to achieve bidirectional translation between two languages, the source corpus is often used as the target corpus, which involves the training of two models with opposite directions. The question of which one can adapt most quickly to a domain shift is of significant

Calibrate Graph Neural Networks under Out-of-Distribution Nodes via Deep Q-learning

October 25, 2023/32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)

Graph neural networks (GNNs) have achieved great success in dealing with graph-structured data that are prevalent in the real world. The core of graph neural networks is the message passing mechanism that aims to generate the embeddings of nodes by aggregating the neighboring node information. However,

Temporal Graph-Based Incident Analysis System for Internet of Things (ECML)

September 22, 2023/ECML PKDD 2023 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases

Internet-of-things (IoTs) deploy a massive number of sensors to monitor the system and environment. Anomaly detection on sensor data is an important task for IoT maintenance and operation. In real applications, the occurrence of a system-level incident usually involves hundreds of abnormal sensors, making

Temporal Graph based Incident Analysis System for Internet of Things

September 17, 2023/European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2023)

Internet-of-things (IoTs) deploy a massive number of sensors to monitor the system and environment. Anomaly detection on sensor data is an important task for IoT maintenance and operation. In real applications, the occurrence of a system-level incident usually involves hundreds of abnormal sensors, making

AutoTCL: Automated Time Series Contrastive Learning with Adaptive Augmentations

August 20, 2023/The 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)

Read AutoTCL: Automated Time Series Contrastive Learning with Adaptive Augmentations publication. Modern techniques like contrastive learning have been effectively used in many areas, including computer vision, natural language processing, and graph-structured data. Creating positive examples that assist

FedSkill: Privacy Preserved Interpretable Skill Learning via Imitation

August 10, 2023/29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)

Read FedSkill: Privacy Preserved Interpretable Skill Learning via Imitation publication. Imitation learning that replicates experts’ skills via their demonstrations has shown significant success in various decision-making tasks. However, two critical challenges still hinder the deployment of imitation

Incremental Causal Graph Learning for Online Root Cause Localization

August 10, 2023/29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)

The task of root cause analysis (RCA) is to identify the root causes of system faults/failures by analyzing system monitoring data. Efficient RCA can greatly accelerate system failure recovery and mitigate system damages or financial losses. However, previous research has mostly focused on developing

Interdependent Causal Networks for Root Cause Localization

August 10, 2023/29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

The goal of root cause analysis is to identify the underlying causes of system problems by discovering and analyzing the causal structure from system monitoring data. It is indispensable for maintaining the stability and robustness of large-scale complex systems. Existing methods mainly focus on the

Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations

August 10, 2023/29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)

Imitation learning has achieved great success in many sequential decision-making tasks, in which a neural agent is learned by imitating collected human demonstrations. However, existing algorithms typically require a large number of high-quality demonstrations that are difficult and expensive to collect.

State-Aware Anomaly Detection for Massive Sensor Data in Internet of Things

August 7, 2023/The 3rd Workshop on Artificial Intelligence-Enabled Cybersecurity Analytics

With the escalating prevalence of Internet of Things (IoTs) in critical infrastructure, the requirement for efficient and effective anomaly detection solution becomes increasingly important. Unfortunately, most prior research works have largely overlooked to adapt detection criteria for different operational

Personalized Federated Learning under Mixture Distributions

July 29, 2023/The 40th International Conference on Machine Learning (ICML 2023)

The recent trend towards Personalized Federated Learning (PFL) has garnered significant attention as it allows for the training of models that are tailored to each client while maintaining data privacy. However, current PFL techniques primarily focus on modeling the conditional distribution heterogeneity

Unsupervised Anomaly Detection Under A Multiple Modeling Strategy Via Model Set Optimization Through Transfer Learning

June 30, 2023/The 26th International Conference on Information Fusion, Charleston, SC

Unsupervised anomaly detection approaches have been widely accepted in applications for industrial systems. Industrial systems often operate with multiple modes since they work for multiple purposes or under different conditions. In order to deal with the difficulty of anomaly detection due to multiple

Multi-Label Temporal Evidential Neural Networks for Early Event Detection

June 9, 2023/2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

Early event detection aims to detect events even before the event is complete. However, most of the existing methods focus on an event with a single label but fail to be applied to cases with multiple labels. Another non-negligible issue for early event detection is a prediction with overconfidence due

Beyond One Model Fits All: A Survey of Domain Specialization for Large Language Models

June 9, 2023/arXiv

Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task agnostic foundation for a wide range of applications. The great promise of LLMs as general task solvers motivated people to extend their functionality largely beyond

Interpretable Skill Learning for Dynamic Treatment Regimes through Imitation

March 24, 2023/57th Conference on Information Sciences and Systems (CISS 2023)

Imitation learning that mimics experts’ skills from their demonstrations has shown great success in discovering dynamic treatment regimes, i.e., the optimal decision rules to treat an individual patient based on related evolving treatment and covariate history. Existing imitation learning methods,

Dynamic Prompting: A Unified Framework for Prompt Tuning

March 6, 2023/arXiv

It has been demonstrated that prompt tuning is highly effective in efficiently eliciting knowledge from language models (LMs). However, the prompt tuning still lags behind fine tuning, especially when the LMs are small. P tuning v2 (Liu et al., 2021b) makes it comparable with finetuning by adding continuous

Exploring the limits of ChatGPT for Query or Aspect based Text Summarization

February 16, 2023/arXiv

Text summarization has been a crucial problem in natural language processing (NLP) for several decades. It aims to condense lengthy documents into shorter versions while retaining the most critical information. Various methods have been proposed for text summarization, including extractive and abstractive

Time Series Contrastive Learning with Information-Aware Augmentations

February 14, 2023/Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-23)

Various contrastive learning approaches have been proposed in recent years and have achieved significant empirical success. While effective and prevalent, contrastive learning has been less explored for time series data. A key component of contrastive learning is to select appropriate augmentations,

Deep Federated Anomaly Detection for Multivariate Time Series Data

December 20, 2022/IEEE BigData 2022 - Special Session 2: Machine Learning on Big Data (MLBD 2022), Osaka, Japan

Although many anomaly detection approaches have been developed for multivariate time series data, limited effort has been made in federated settings in which multivariate time series data are heterogeneously distributed among different edge devices while data sharing is prohibited. In this paper, we

Towards Robust Graph Neural Networks via Adversarial Contrastive Learning

December 20, 2022/2022 IEEE International Conference on Big Data (IEEE BigData 2022), Osaka, Japan

Graph Neural Network (GNN), as a powerful representation learning model on graph data, attracts much attention across various disciplines. However, recent studies show that GNN is vulnerable to adversarial attacks. How to make GNN more robust? What are the key vulnerabilities in GNN? How to address the

DeepGAR: Deep Graph Learning for Analogical Reasoning

December 3, 2022/IEEE ICDM 2022 - 22nd IEEE International Conference on Data Mining, Orlando, FL

Analogical reasoning is the process of discovering and mapping correspondences from a target subject to a base subject. As the most well-known computational method of analogical reasoning, Structure-Mapping Theory (SMT) abstracts both target and base subjects into relational graphs and forms the cognitive

Personalized Federated Learning via Heterogeneous Modular Networks

December 3, 2022/IEEE ICDM 2022 - 22nd IEEE International Conference on Data Mining, Orlando, FL

Personalized Federated Learning (PFL) which collaboratively trains a federated model while considering local clients under privacy constraints has attracted much attention. Despite its popularity, it has been observed that existing PFL approaches result in sub-optimal solutions when the joint distribution

Using AI To Safely Put The First Woman On The Moon

October 7, 2022

We are helping to safely bring the first woman astronaut to the moon as part of NASA – National Aeronautics and Space Administration’s Artemis Project with our System Invariant Analysis Technology (SIAT). With Lockheed Martin Space’s T-Tauri AI platform, our SIAT analytics engine takes the data from

Multi-Faceted Knowledge-Driven Pre-training for Product Representation Learning

September 28, 2022/IEEE Transactions on Knowledge and Data Engineering

As a key component of e-commerce computing, product representation learning (PRL) provides benefits for a variety of applications, including product matching, search, and categorization. The existing PRL approaches have poor language understanding ability due to their inability to capture contextualized

Explainable Anomaly Detection System for Categorical Sensor Data in Internet of Things

September 23, 2022/ECML-PKDD 2022: The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Grenoble, France

Internet of things (IoT) applications deploy massive number of sensors to monitor the system and environment. Anomaly detection on streaming sensor data is an important task for IoT maintenance and operation. However, there are two major challenges for anomaly detection in real IoT applications: (1)

3D Histogram-Based Anomaly Detection for Categorical Sensor Data in Internet of Things

September 9, 2022/VLIoT 2022 - Very Large Internet of Things 2022 (virtual conference)

The applications of Internet-of-things (IoT) deploy a massive number of sensors to monitor the system and environment. Anomaly detection on streaming sensor data is an important task for IoT maintenance and operation. In real IoT applications, many sensors report categorical values rather than numerical

CAT: Beyond Efficient Transformer for Content-Aware Anomaly Detection in Event Sequences

August 18, 2022/28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

It is critical and important to detect anomalies in event sequences, which becomes widely available in many application domains. Indeed, various efforts have been made to capture abnormal patterns from event sequences through sequential pattern analysis or event representation learning. However, existing

Towards Learning Disentangled Representations for Time Series

August 18, 2022/28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)

Promising progress has been made toward learning efficient time series representations in recent years, but the learned representations often lack interpretability and do not encode semantic meanings by the complex interactions of many latent factors. Learning representations that disentangle these latent

SEED: Sound Event Early Detection via Evidential Uncertainty

May 27, 2022/2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), Singapore (virtual paper presentations)

Sound Event Early Detection (SEED) is an essential task in recognizing the acoustic environments and soundscapes. However, most of the existing methods focus on the offline sound event detection, which suffers from the over-confidence issue of early-stage event detection and usually yield unreliable

Superclass-Conditional Gaussian Mixture Model for Coarse-To-Fine Few-Shot Learning

April 29, 2022/10th International Conference on Learning Representations (ICLR 2022)

Learning fine-grained embeddings is essential for extending the generalizability of models pre-trained on “coarse” labels (e.g., animals). It is crucial to fields for which fine-grained labeling (e.g., breeds of animals) is expensive, but fine-grained prediction is desirable, such as medicine. The dilemma

Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-Sentence Dependency Graph

March 1, 2022/Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-2022)

We target the task of cross-lingual Machine Reading Comprehension (MRC) in the direct zero-shot setting, by incorporating syntactic features from Universal Dependencies (UD), and the key features we use are the syntactic relations within each sentence. While previous work has demonstrated effective syntax-guided

Ordinal Quadruplet: Retrieval of Missing Labels in Ordinal Time Series

January 24, 2022/arXiv

In this paper, we propose an ordered time series classification framework that is robust against missing classes in the training data, i.e., during testing we can prescribe classes that are missing during training. This framework relies on two main components: (1) our newly proposed ordinal quadruplet

Dynamic Causal Discovery in Imitation Learning

December 14, 2021/Causal Inference Challenges in Sequential Decision Making: Bridging Theory and Practice - A NeurIPS 2021 Workshop

Using deep reinforcement learning (DRL) to recover expert policies via imitation has been found to be promising in a wide range of applications. However, it remains a difficult task to interpret the control policy learned by the agent. Difficulties mainly come from two aspects: 1) agents in DRL are usually

InfoGCL: Information-Aware Graph Contrastive Learning

December 14, 2021/Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), Virtual-only Conference

InfoGCL: Information-Aware Graph Contrastive Learning Various graph contrastive learning models have been proposed to improve the performance of tasks on graph datasets in recent years. While effective and prevalent, these models are usually carefully customized. In particular, despite all recent work

You Are What and Where You Are: Graph Enhanced Attention Network for Explainable POI Recommendation

November 15, 2021/30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

Point-of-interest (POI) recommendation is an emerging area of research on location-based social networks to analyze user behaviors and contextual check-in information. For this problem, existing approaches, with shallow or deep architectures, have two major drawbacks. First, for these approaches, the

Boosting Cross-Lingual Transfer via Self-Learning with Uncertainty Estimation

November 11, 2021/The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Recent multilingual pre-trained language models have achieved remarkable zero-shot performance, where the model is only finetuned on one source language and directly evaluated on target languages. In this work, we propose a self-learning framework that further utilizes unlabeled data of target languages,

Recommend for a Reason: Unlocking the Power of Unsupervised Aspect-Sentiment Co-Extraction

November 11, 2021/The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Compliments and concerns in reviews are valuable for understanding users’ shopping interests and their opinions with respect to specific aspects of certain items. Existing review-based recommenders favor large and complex language encoders that can only learn latent and uninterpretable text representations.

Interpreting Convolutional Sequence Model by Learning Local Prototypes with Adaptation Regularization

November 5, 2021/30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

In many high-stakes applications of machine learning models, outputting only predictions or providing statistical confidence is usually insufficient to gain trust from end users, who often prefer a transparent reasoning paradigm. Despite the recent encouraging developments on deep networks for sequential

Structural Temporal Graph Neural Networks for Anomaly Detection in Dynamic Graphs

November 5, 2021/30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

Detecting anomalies in dynamic graphs is a vital task, with numerous practical applications in areas such as security, finance, and social media. Existing network embedding based methods have mostly focused on learning good node representations, whereas largely ignoring the subgraph structural changes

Convolutional Transformer based Dual Discriminator Generative Adversarial Networks for Video Anomaly Detection

October 24, 2021/29th ACM International Conference on Multimedia (ACM Multimedia 2021)

Detecting abnormal activities in real-world surveillance videos is an important yet challenging task as the prior knowledge about video anomalies is usually limited or unavailable. Despite that many approaches have been developed to resolve this problem, few of them can capture the normal spatio-temporal

Domain oriented Language Modeling with Adaptive Hybrid Masking and Optimal Transport Alignment

August 18, 2021/KDD 2021: ACM SIGKDD Conference on Knowledge Discovery and Data Mining, SIGKDD 2021

Motivated by the success of pre-trained language models such as BERT in a broad range of natural language processing (NLP) tasks, recent research efforts have been made for adapting these models for different application domains. Along this line, existing domain-oriented models have primarily followed

Multi-Scale One-Class Recurrent Neural Networks for Discrete Event Sequence Anomaly Detection

August 18, 2021/ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)

Discrete event sequences are ubiquitous, such as an ordered event series of process interactions in Information and Communication Technology systems. Recent years have witnessed increasing efforts in detecting anomalies with discrete event sequences. However, it remains an extremely difficult task due

SIGL: Securing Software Installations Through Deep Graph Learning

August 13, 2021/USENIX Security 2021 - The 30th USENIX Security Symposium

Many users implicitly assume that software can only be exploited after it is installed. However, recent supply-chain attacks demonstrate that application integrity must be ensured during installation itself. We introduce SIGL, a new tool for detecting malicious behavior during software installation.

Hierarchical Imitation Learning with Contextual Bandits for Dynamic Treatment Regimes

July 24, 2021/The Thirty-eighth International Conference on Machine Learning (ICML 2021)

Imitation learning has been proved to be effective in mimicking experts’ behaviors from their demonstrations without access to explicit reward signals. Meanwhile, complex tasks, e.g., dynamic treatment regimes for patients with comorbidities, often suggest significant variability in expert demonstrations

FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems

June 25, 2021/CVPR 2021 - IEEE/CVF Conference on Computer Vision and Pattern Recognition

We present FACESEC, a framework for fine-grained robustness evaluation of face recognition systems. FACESEC evaluation is performed along four dimensions of adversarial modeling: the nature of perturbation (e.g., pixel-level or face accessories), the attacker’s system knowledge (about training data

Automated Anomaly Detection via Curiosity-Guided Search and Self-Imitation Learning

June 15, 2021/The IEEE Transactions on Neural Networks and Learning Systems

Anomaly detection is an important data mining task with numerous applications, such as intrusion detection, credit card fraud detection, and video surveillance. However, given a specific complicated task with complicated data, the process of building an effective deep learning-based system for anomaly

Unsupervised Concept Representation Learning for Length-Varying Text Similarity

June 11, 2021/NAACL 2021 – 2021 Annual Conference of the North American Chapter

Measuring document similarity plays an important role in natural language processing tasks. Most existing document similarity approaches suffer from the information gap caused by context and vocabulary mismatches when comparing varying-length texts. In this paper, we propose an unsupervised concept representation

Deep Multi-Instance Contrastive Learning with Dual Attention for Anomaly Precursor Detection

May 1, 2021/SIAM International Conference on Data Mining, Virtual Conference (SDM21)

Prognostics or early detection of incipient faults by leveraging the monitoring time series data in complex systems is valuable to automatic system management and predictive maintenance. However, this task is challenging. First, learning the multi-dimensional heterogeneous time series data with various

AutoOD: Neural Architecture Search for Outlier Detection

April 23, 2021/ICDE 2021 - The 37th IEEE International Conference on Data Engineering

Outlier detection is an important data mining task with numerous applications such as intrusion detection, credit card fraud detection, and video surveillance. However, given a specific task with complex data, the process of building an effective deep learning based system for outlier detection still

Learning to Drop: Robust Graph Neural Network via Topological Denoising

March 12, 2021/WSDM 2021 - The 14th ACM International WSDM Conference on Web Seach and Data Mining

Graph Neural Networks (GNNs) have shown to be powerful tools for graph analytics. The key idea is to recursively propagate and aggregate information along the edges of the given graph. Despite their success, however, the existing GNNs are usually sensitive to the quality of the input graph. Real-world

Multi-Task Recurrent Modular Networks

March 9, 2021/AAAI 2021 - 35th AAAI Conference on Artificial Intelligence

We consider the models of deep multi-task learning with recurrent architectures that exploit regularities across tasks to improve the performance of multiple sequence processing tasks jointly. Most existing architectures are painstakingly customized to learn task relationships for different problems,

Dynamic Gaussian Mixture based Deep Generative Model For Robust Forecasting on Sparse Multivariate Time Series

February 22, 2021/AAAI 2021 - 35th AAAI Conference on Artificial Intelligence

Forecasting on Sparse Multivariate Time Series Forecasting on sparse multivariate time series (MTS) aims to model the predictors of future values of time series given their incomplete past, which is important for many emerging applications. However, most existing methods process MTS’s individually,

Parameterized Explainer for Graph Neural Network

December 12, 2020/Thirty-Fourth Annual Conference on Neural Information Processing Systems (NeurIPS 2020)

Despite recent progress in Graph Neural Networks (GNNs), explaining predictions made by GNNs remains a challenging open problem. The leading method independently addresses the local explanations (i.e., important subgraph structure and node features) to interpret why a GNN model makes the prediction for

T2-Net: A Semi-supervised Deep Model for Turbulence Forecasting

November 20, 2020/ICDM 2020 - The 20th IEEE International Conference on Data Mining

Accurate air turbulence forecasting can help airlines avoid hazardous turbulence, guide the routes that keep passengers safe, maximize efficiency, and reduce costs. Traditional turbulence forecasting approaches heavily rely on painstakingly customized turbulence indexes, which are less effective in dynamic

Anomaly Detection on Web-User Behaviors through Deep Learning

October 23, 2020/16th EAI International Conference on Security and Privacy in Communication Networks (SecureComm 2020)

The modern Internet has witnessed the proliferation of web applications that play a crucial role in the branding process among enterprises. Web applications provide a communication channel between potential customers and business products. However, web applications are also targeted by attackers due

VESSELS: Efficient and Scalable Deep Learning Prediction on Trusted Processors

October 21, 2020/ACM Symposium on Cloud Computing 2020 (SoCC 2020)

Deep learning systems on the cloud are increasingly targeted by attacks that attempt to steal sensitive data. Intel SGX has been proven effective to protect the confidentiality and integrity of such data during computation. However, state-of-the-art SGX systems still suffer from substantial performance

Anomalous Event Sequence Detection

September 24, 2020/IEEE Intelligent Systems

Anomaly detection has been widely applied in modern data-driven security applications to detect abnormal events/entities that deviate from the majority. However, less work has been done in terms of detecting suspicious event sequences/paths, which are better discriminators than single events/entities

Node Classification in Temporal Graphs through Stochastic Sparsification and Temporal Structural Convolution

September 18, 2020/ECML-PKDD 2020 - The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases

Node classification in temporal graphs aims to predict node labels based on historical observations. In real-world applications, temporal graphs are complex with both graph topology and node attributes evolving rapidly, which poses a high overfitting risk to existing graph learning approaches. In this

Robust Graph Representation Learning via Neural Sparsification

July 18, 2020/The 37th International Conference on Machine Learning (ICML 2020)

Graph representation learning serves as the core of important prediction tasks, ranging from product recommendation to fraud detection. Reallife graphs usually have complex information in the local neighborhood, where each node is described by a rich set of features and connects to dozens or even hundreds

At the Speed of Sound: Efficient Audio Scene Classification

June 11, 2020/The Annual ACM International Conference on Multimedia Retrieval (ICMR 2020)

Efficient audio scene classification is essential for smart sensing platforms such as robots, medical monitoring, surveillance, or autonomous vehicles. We propose a retrieval-based scene classification architecture that combines recurrent neural networks and attention to compute embeddings for short

RULENet: End-to-end Learning with the Dual-estimator for Remaining Useful Life Estimation

June 10, 2020/2020 IEEE International Conference on Prognostics and Health Management, Detroit, MI

Remaining Useful Life (RUL) estimation is a key element in Predictive maintenance. System agnostic approaches which just utilize sensor and operational time series have gained popularity due to its ease of implementation. Due to the nature of measurement or degradation mechanisms, its accurate estimation

Inductive and Unsupervised Representation Learning on Graph Structured Objects

April 30, 2020/8th International Conference on Learning Representations (ICLR 2020)

Inductive and unsupervised graph learning is a critical technique for predictive or information retrieval tasks where label information is difficult to obtain. It is also challenging to make graph learning inductive and unsupervised at the same time, as learning processes guided by reconstruction error

A Generic Edge-Empowered Graph Convolutional Network via Node-Edge Mutual Enhancement

April 24, 2020/The Web Conference 2020 (WWW 2020)

Graph Convolutional Networks (GCNs) have shown to be a powerful tool for analyzing graph-structured data. Most of previous GCN methods focus on learning a good node representation by aggregating the representations of neighboring nodes, whereas largely ignoring the edge information. Although few recent

Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes

April 24, 2020/The Web Conference 2020 (WWW 2020)

Recent developments in discovering dynamic treatment regimes (DTRs) have heightened the importance of deep reinforcement learning (DRL) which are used to recover the doctor’s treatment policies. However, existing DRL-based methods expose the following limitations: 1) supervised methods based on behavior

APTrace: A Responsive System for Agile Enterprise Level Causality Analysis

April 24, 2020/36th IEEE International Conference on Data Engineering (ICDE 2020)

While backtracking analysis has been successful in assisting the investigation of complex security attacks, it faces a critical dependency explosion problem. To address this problem, security analysts currently need to tune backtracking analysis manually with different case-specific heuristics. However,

You Are What You Do: Hunting Stealthy Malware via Data Provenance Analysis

March 9, 2020/NDSS Symposium 2020

To subvert recent advances in perimeter and host security, the attacker community has developed and employed various attack vectors to make malware much more stealthy than before to penetrate the target system and prolong its presence. The advanced malware, or stealthy malware, impersonates or abuses

Asymmetrically Hierarchical Networks with Attentive Interactions for Interpretable Review-based Recommendation

February 12, 2020/The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)

Recently, recommender systems have been able to emit substantially improved recommendations by leveraging user-provided reviews. Existing methods typically merge all reviews of a given user (item) into a long document, and then process user and item documents in the same manner. In practice, however,

Deep Unsupervised Binary Coding Networks for Multivariate Time Series Retrieval

February 12, 2020/The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)

Multivariate time series data are becoming increasingly ubiquitous in varies real-world applications such as smart city, power plant monitoring, wearable devices, etc. Given the current time series segment, how to retrieve similar segments within the historical data in an efficient and effective manner

Tensorized LSTM with Adaptive Shared Memory for Learning Trends in Multivariate Time Series

February 12, 2020/The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)

The problem of learning and forecasting underlying trends in time series data arises in a variety of applications, such as traffic management, energy optimization, etc. In literature, a trend in time series is characterized by the slope and duration, and its prediction is then to forecast the two values

Interpretable Click-Through Rate Prediction through Hierarchical Attention

February 7, 2020/The 13th ACM International Conference on Web Search and Data Mining (WSDM 2020)

Click-through rate (CTR) prediction is a critical task in online advertising and marketing. For this problem, existing approaches, with shallow or deep architectures, have three major drawbacks. First, they typically lack persuasive rationales to explain the outcomes of the models. Unexplainable predictions

Temporal Context-aware Representation Learning for Question Routing

February 7, 2020/The 13th ACM International Conference on Web Search and Data Mining (WSDM 2020)

Question routing (QR) aims at recommending newly posted questions to the potential answerers who are most likely to answer the questions. The existing approaches that learn users’ expertise from their past question-answering activities usually suffer from challenges in two aspects: 1) multi-faceted expertise

Progressive Processing of System-Behavioral Query

December 13, 2019/The 35th Annual Computer Security Applications Conference (ACSAC 2019)

System monitoring has recently emerged as an effective way to analyze and counter advanced cyber attacks. The monitoring data records a series of system events and provides a global view of system behaviors in an organization. Querying such data to identify potential system risks and malicious behaviors

Adaptive Neural Network for Node Classification in Dynamic Networks

November 11, 2019/The 19th IEEE International Conference on Data Mining (ICDM 2019)

Given a network with the labels for a subset of nodes, transductive node classification targets to predict the labels for the remaining nodes in the network. This technique has been used in a variety of applications such as voxel functionality detection in brain network and group label prediction in

Learning Robust Representations with Graph Denoising Policy Network

November 11, 2019/The 19th IEEE International Conference on Data Mining (ICDM 2019)

Existing representation learning methods based on graph neural networks and their variants rely on the aggregation of neighborhood information, which makes it sensitive to noises in the graph, e.g. erroneous links between nodes, incorrect/missing node features. In this paper, we propose Graph Denoising

Self-Attentive Attributed Network Embedding Through Adversarial Learning

November 11, 2019/The 19th IEEE International Conference on Data Mining (ICDM 2019)

Network embedding aims to learn the low-dimensional representations/embeddings of vertices which preserve the structure and inherent properties of the networks. The resultant embeddings are beneficial to downstream tasks such as vertex classification and link prediction. A vast majority of real-world

Heterogeneous Graph Matching Networks for Unknown Malware Detection

August 16, 2019/The 28th International Joint Conference on Artificial Intelligence (IJCAI 2019)

Information systems have widely been the target of malware attacks. Traditional signature-based malicious program detection algorithms can only detect known malware and are prone to evasion techniques such as binary obfuscation, while behavior-based approaches highly rely on the malware training samples

Spatio-Temporal Attentive RNN for Node Classification in Temporal Attributed Graphs

August 16, 2019/The 28th International Joint Conference on Artificial Intelligence (IJCAI 2019)

Node classification in graph-structured data aims to classify the nodes where labels are only available for a subset of nodes. This problem has attracted considerable research efforts in recent years. In real-world applications, both graph topology and node attributes evolve over time. Existing techniques,

Clairvoyant Networks

June 21, 2019/Network Traffic Measurement and Analysis Conference (TMA Conference 2019)

We use the term clairvoyant to refer to networks that provide on-demand visibility for any flow at any time. Traditionally, network visibility is achieved by instrumenting and passively monitoring all flows in a network. SDN networks, by design endowed with full visibility, offer another alternative

Attentional Heterogeneous Graph Neural Network: Application to Program Reidentification

May 4, 2019/SIAM International Conference on Data Mining (SDM 2019)

Program or process is an integral part of almost every IT/OT system. Can we trust the identity/ID (e.g., executable name) of the program? To avoid detection, malware may disguise itself using the ID of a legitimate program, and a system tool (e.g., PowerShell) used by the attackers may have the fake

Deep Co-Clustering

May 4, 2019/SIAM International Conference on Data Mining (SDM 2019)

Co-clustering partitions instances and features simultaneously by leveraging the duality between them, and it often yields impressive performance improvement over traditional clustering algorithms. The recent development in learning deep representations has demonstrated the advantage in extracting effective

A Deep Neural Network for Unsupervised Anomaly Detection and Diagnosis in Multivariate Time Series Data

February 1, 2019/The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019)

Nowadays, multivariate time series data are increasingly collected in various real-world systems, e.g., power plants, wearable devices, etc. Anomaly detection and diagnosis in multivariate time series refer to identifying abnormal status in certain time steps and pinpointing the root causes. Building

Deep Learning IP Network Representations

August 24, 2018/Big-DAMA 2018 - ACM SIGCOMM 2018 Workshop on Big Data Analytics and Machine Learning for Data Communication Networks

We present DIP, a deep learning-based framework to learn structural properties of the Internet, such as node clustering or distance between nodes. Existing embedding-based approaches use linear algorithms on a single source of data, such as latency or hop count information, to approximate the position

Deep r-th Root Rank Supervised Joint Binary Embedding for Multivariate Time Series Retrieval

August 23, 2018/KDD 2018 - 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Multivariate time series data are becoming increasingly common in numerous real-world applications, e.g., power plant monitoring, health care, wearable devices, automobiles, etc. As a result, multivariate time series retrieval, i.e., given the current multivariate time series segment, how to obtain its

Learning Deep Network Representations with Adversarially Regularized Autoencoders

August 23, 2018/KDD 2018 - 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

The problem of network representation learning, also known as network embedding, arises in many machine learning tasks assuming that there exist a small number of variabilities in the vertex representations which can capture the “semantics” of the original network structure. Most existing network embedding

NetWalk: A Flexible Deep Embedding Approach for Anomaly Detection in Dynamic Networks

August 23, 2018/KDD 2018 – 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Massive and dynamic networks arise in many practical applications such as social media, security and public health. Given an evolutionary network, it is crucial to detect structural anomalies, such as vertices and edges whose “behaviors” deviate from underlying majority of the network, in a real-time

TINET: Transferring Knowledge between Invariant Networks

August 23, 2018/KDD 2018 - 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

The latent behavior of an information system that can exhibit extreme events, such as system faults or cyber-attacks, is complex. Recently, the invariant network has shown to be a powerful way of characterizing complex system behaviors. Structures and evolutions of the invariance network, in particular,

Exploiting Graph Regularized Multi-dimensional Hawkes Processes for Modeling Events with Spatio-temporal Characteristics

July 19, 2018/The 27th International Joint Conference on Artificial Intelligence (IJCAI-18)

Multi-dimensional Hawkes processes (MHP) has been widely used for modeling temporal events. However, when MHP was used for modeling events with spatio-temporal characteristics, the spatial information was often ignored despite its importance. In this paper, we introduce a framework to exploit MHP for

Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection

May 3, 2018/Proceedings of the 6th International Conference on Learning Representations, Vancouver Convention Center (ICLR 2018)

Unsupervised anomaly detection on multi- or high-dimensional data is of great importance in both fundamental machine learning research and industrial applications, for which density estimation lies at the core. Although previous approaches based on dimensionality reduction followed by density estimation

Co-Regularized Deep Multi-Network Embedding

April 27, 2018/Proceedings of the 2018 World Wide Web Conference (WWW 2018)

Network embedding aims to learn a low-dimensional vector representation for each node in the social and information networks, with the constraint to preserve network structures. Most existing methods focus on single network embedding, ignoring the relationship between multiple networks. In many real-world