Data Science & System Security | Publications

DATA SCIENCE & SYSTEM SECURITY

PROJECTS

PEOPLE

PATENTS

Publications

NEC Labs America Attends ACL 2026 San Diego July 2-7, 2026

June 23, 2026

NEC Laboratories America heads to ACL 2026 in San Diego, California, July 2–7, to present accepted papers spanning knowledge updating and memory control in large language models, task-aware cultural alignment, uncertainty-aware reasoning, and adaptive chain-of-thought optimization, representing some

How AI Can Transform the Way Companies Buy What They Need

June 22, 2026

Procurement teams lose time and money to inaccurate demand forecasts and manual supplier negotiations. A new framework from NEC Corporation and NEC Laboratories America combines automated negotiation with multimodal AI forecasting to optimize both sides of the procurement process.

Automated Negotiation and Multimodal Time-Series Forecasting for Efficient Procurement

May 29, 2026/The 25th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2026)

Procurement is a key function in supply chain management that involves acquiring goods and services to meet organizational needs. Efficient procurement is crucial for minimizing costs, ensuring timely delivery, and maintaining quality standards. This paper explores the integration of automated negotiation

How Rule-Driven Routing Makes Retrieval-Augmented Generation Smarter

May 13, 2026

Most retrieval-augmented generation systems stop at documents, ignoring the relational databases that power finance, healthcare, and research. Our researchers built a rule-driven framework that learns which source to query for each question, delivering better answers at lower computational cost.

Learning to Route: A Rule-Driven Agent Framework for Hybrid-Source Retrieval-Augmented Generation

April 12, 2026/The 2026 ACM Web Conference (WWW 2026)

Large Language Models (LLMs) have shown remarkable performance on general Question Answering (QA), yet they often struggle in domain-specific scenarios where accurate and up-to-date information is required. Retrieval-Augmented Generation (RAG) addresses this limitation by enriching LLMs with external

Decoding Time Series with LLMs: A Multi-Agent Framework for Cross-Domain Annotation

March 29, 2026/The 19th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026)

Time series data is ubiquitous across various domains, including manufacturing, finance, and healthcare. High-quality annotations are essential for effectively understanding time series and facilitating downstream tasks. However, obtaining such annotations is challenging, particularly in mission-critical

DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router

March 29, 2026/19th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026)

Large Language Models (LLMs) excel at many reasoning tasks but struggle with knowledge-intensive queries due to their inability to dynamically access up-to-date or domain-specific information. Retrieval-Augmented Generation (RAG) has emerged as a promising solution, enabling LLMs to ground their responses

Multi-Agent Procedural Graph Extraction with Structural and Logical Refinement

March 29, 2026/The 19th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026)

Automatically extracting workflows as procedural graphs from natural language is promising yet underexplored, demanding both structural validity and logical alignment. While recent large language models (LLMs) show potential for procedural graph extraction, they often produce ill-formed structures or

MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery

January 27, 2026/40th AAAI Conference on Artificial Intelligence (AAAI-26)

Uncovering causal structures from observational data is crucial for understanding complex systems and making informed decisions. While reinforcement learning (RL) has shown promise in identifying these structures in the form of a directed acyclic graph (DAG), existing methods often lack efficiency, making

Brownian Bridge Augmented Surrogate Simulation and Injection Planning for Geological CO2 Storage

January 22, 2026/The 40th Annual AAAI Conference on Artificial Intelligence (AAAI-26)

Geological CO2 storage (GCS) involves injecting captured CO2 into deep sub-surface formations to support climate goals. The effective management of GCS relies on adaptive injection planning to dynamically control injection rates and well pressures to balance both storage safety and efficiency. Prior

Online Multi-modal Root Cause Identification in Microservice Systems

December 11, 2025/2025 IEEE International Conference on Big Data

Root Cause Analysis (RCA) is essential for pinpointing the root causes of failures in microservice systems. Traditional data-driven RCA methods are typically limited to offline applications due to high computational demands, and existing online RCA methods handle only single-modal data, overlooking complex

Human Texts Are Outliers: Detecting LLM-generated Texts via Out-of-distribution Detection

December 7, 2025/The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)

The rapid advancement of large language models (LLMs) such as ChatGPT, DeepSeek, and Claude has significantly increased the presence of AI-generated text in digital communication. This trend has heightened the need for reliable detection methods to distinguish between human-authored and machine-generated

Multi-Modal View Enhanced Large Vision Models for Long-Term Time Series Forecasting

December 7, 2025/The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)

Time series, typically represented as numerical sequences, can also be transformed into images and texts, offering multi-modal views (MMVs) of the same underlying signal. These MMVs can reveal complementary patterns and enable the use of powerful pre-trained large models, such as large vision models

SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search

December 7, 2025/The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)

Large Language Models (LLMs) offer promising capabilities for tackling complex reasoning tasks, including optimization problems. However, existing methods either rely on prompt engineering, which leads to poor generalization across problem types, or require costly supervised training. We introduce SolverLLM,

TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

December 7, 2025/The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)

Time series analysis provides essential insights for real-world system dynamics and informs downstream decision-making, yet most existing methods often overlook the rich contextual signals present in auxiliary modalities. To bridge this gap, we introduce TimeXL, a multi-modal prediction framework that

DISC: Dynamic Decomposition Improves LLM Inference Scaling

December 1, 2025/The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)

Inference scaling methods for LLMs often rely on decomposing problems into steps (or groups of tokens), followed by sampling and selecting the best next steps. However, these steps and their sizes are often predetermined or manually designed based on domain knowledge. We propose dynamic decomposition,

xTime: Extreme Event Prediction with Hierarchical Knowledge Distillation and Expert Fusion

November 15, 2025/25th IEEE International Conference on Data Mining (IEEE ICDM 2025)

Extreme events frequently occur in real-world time series and often carry significant practical implications. In domains such as climate and healthcare, these events, such as floods, heatwaves, or acute medical episodes, can lead to serious consequences. Accurate forecasting of such events is therefore

Correlation-aware Online Change Point Detection

November 14, 2025/The 34th ACM International Conference on Information and Knowledge Management (CIKM 2025)

Change point detection aims to identify abrupt shifts occurring at multiple points within a data sequence. This task becomes particularly challenging in the online setting, where different types of change can occur, including shifts in both the marginal and joint distributions of the data. In this paper,

Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

September 3, 2025/ACM Computing Surveys

Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused

Harnessing Vision Models for Time Series Analysis: A Survey

August 18, 2025/The 34th International Joint Conference on Artificial Intelligence (IJCAI 2025 Survey Track)

Time series analysis has witnessed the inspiring development from traditional autoregressive models, deep learning models, to recent Transformers and Large Language Models (LLMs). Efforts in leveraging vision models for time series analysis have also been made along the way but are less visible to the

Multi-modal Time Series Analysis: A Tutorial and Survey

August 7, 2025/31st ACM SIGKDD Conference on Knowledge Discover and Data Mining (ACM KDD 2025)

Multi-modal time series analysis has recently emerged as a prominent research area, driven by the increasing availability of diverse data modalities, such as text, images, and structured tabular data from real-world sources. However, effective analysis of multi-modal time series is hindered by data heterogeneity,

ICeTEA: Mixture of Detectors for Metric-Log Anomaly Detection

August 4, 2025/The 11th Mining and Learning from Time Series Workshop: From Classical Methods to LLMs (KDD MILETS Workshop 2025)

Anomaly detection is essential for identifying unusual system behaviors and has wide-ranging applications, from fraud detection to system monitoring. In web servers, anomalies are typically detected using two types of data: metrics (numerical indicators of performance) and logs (records of system events).

Uncertainty Propagation on LLM Agent

July 29, 2025/The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

Large language models (LLMs) integrated into multi-step agent systems enable complex decision-making processes across various applications. However, their outputs often lack reliability, making uncertainty estimation crucial. Existing uncertainty estimation methods primarily focus on final-step outputs,

Exploring Multi-Modal Data with Tool-Augmented LLM Agents for Precise Causal Discovery

July 28, 2025/The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

Causal discovery is an imperative foundation for decision-making across domains, such as smart health, AI for drug discovery and AIOps. Traditional statistical causal discovery methods, while well-established, predominantly rely on observational data and often overlook the semantic cues inherent in cause-and-effect

Beyond the Permutation Symmetry of Transformers: The Role of Rotation for Model Fusion

July 13, 2025/Forty-Second International Conference on Machine Learning (ICML 2025)

Symmetry in the parameter space of deep neural networks (DNNs) has proven beneficial for various deep learning applications. A well-known example is the permutation symmetry in Multi-Layer Perceptrons (MLPs), where permuting the rows of weight matrices in one layer and applying the inverse permutation

Where’s the Liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content

June 14, 2025/The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025 (CVPR 2025), Nashville, TN

The recent proliferation of photorealistic images created by generative models has sparked both excitement and concern, as these images are increasingly indistinguishable from real ones to the human eye. While offering new creative and commercial possibilities, the potential for misuse, such as in misinformation

Evidence-Based Out-of-Distribution Detection on Multi-Label Graphs

May 3, 2025/SIAM International Conference on Data Mining (SDM 2025), Alexandria, VA

The Out-of-Distribution (OOD) problem in graph-structured data is becoming increasingly important in various areas of research and applications, including social network recommendation [36], protein function detection [9, 21], etc. Furthermore, owing to the inherent multi-label properties of nodes, multi-label

Position Really Matters: Towards a Holistic Approach for Prompt Tuning

April 30, 2025/2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025)

Prompt tuning is highly effective in efficiently extracting knowledge from foundation models, encompassing both language, vision, and vision-language models. However, the efficacy of employing fixed soft prompts with a predetermined position for concatenation with inputs for all instances, irrespective

MixLLM: Dynamic Routing in Mixed Large Language Models

April 29, 2025/2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL 2025)

Large Language Models (LLMs) exhibit potential artificial generic intelligence recently, however, their usage is costly with high response latency. Given mixed LLMs with their own strengths and weaknesses, LLM routing aims to identify the most suitable model for each query in the stream to maximize response

DISC: Dynamic Decomposition Improves LLM Inference Scaling (SSI-FM)

April 28, 2025/ICLR Workshop on Scaling Self-Improving Foundation Models without Human Supervision (SSI-FM) at ICLR 2025

Inference scaling methods often rely on decomposing problems into steps, followed by sampling and selecting the best next steps. However, these steps and their sizes are typically fixed or depend on domain knowledge. We propose dynamic decomposition, a method that adaptively and automatically breaks

DISC: Dynamic Decomposition Improves LLM Inference Scaling (DL4C)

April 28, 2025/Third Workshop on Deep Learning for Code (DL4C) at ICLR 2025

Inference scaling methods often rely on decomposing problems into steps, followed by sampling and selecting the best next steps. However, these steps and their sizes are typically fixed or depend on domain knowledge. We propose dynamic decomposition, a method that adaptively and automatically breaks

F-Fidelity: A Robust Framework for Faithful-NESS Evaluation in Explainable AI

April 28, 2025/The Thirteenth International Conference on Learning Representations

Recent research has developed a number of eXplainable AI (XAI) techniques, such as gradient-based approaches, input perturbation-base methods, and black-box explanation methods. While these XAI techniques can extract meaningful insights from deep learning models, how to properly evaluate them remains

Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors

April 28, 2025/The Thirteenth International Conference on Learning Representations (ICLR 2025)

The advent of large language models (LLMs) has revolutionized the field of text generation, producing outputs that closely mimic human-like writing. Although academic and industrial institutions have developed detectors to prevent the malicious usage of LLM-generated texts, other research has doubt about

SFS: Smarter Code Space Search improves LLM Inference Scaling

April 28, 2025/The Thirteenth International Conference on Learning Representations (ICLR 2025)

We frame code generation as a black-box optimization problem within the code space and demonstrate how optimization-inspired techniques can enhance inference scaling. Based on this perspective, we propose SCATTERED FOREST SEARCH (SFS), a novel approach that improves solution diversity and better exploits

Chain-of-region: Visual Language Models Need Details for Diagram Analysis

April 25, 2025/The Thirteenth International Conference on Learning Representations

Visual Language Models (VLMs) like GPT-4V have broadened the scope of LLM applications, yet they face significant challenges in accurately processing visual details, particularly in scientific diagrams. This paper explores the necessity of meticulous visual detail collection and region decomposition

TSLA: Unified Time Series and Language Model

April 10, 2025/2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)

Real-world time series data often require analysis or interpretation from domain experts. Some tasks, like time series question answering, involve both time series and natural language questions, posing challenges for single-modality language models to understand their interaction. To this end, we present

TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents

March 4, 2025/The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025)

Time series data is essential in various applications, including climate modeling, healthcare monitoring, and financial analytics. Understanding the contextual information associated with real-world time series data is often essential for accurate and reliable event predictions. In this paper, we introduce

Incident Diagnosing and Reporting System based on Retrieval Augmented Large Language Model

March 3, 2025/The 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025)

The Internet-of-Things (IoT) is widely used in many applications such as smart city, transportation, healthcare, and environment monitoring. A key task of IoT maintenance is to analyze the abnormal sensor records and generate incident report. Traditionally, domain experts engage in such labor intensive

Improving Logits-based Detector without Logits from Black-box LLMs

December 9, 2024/The Thirty-eighth Annual Conference on Neural Information Processing Systems

The advent of Large Language Models (LLMs) has revolutionized text generation, producing outputs that closely mimic human writing. This blurring of lines between machine- and human-written text presents new challenges in distinguishing one from the other a task further complicated by the frequent

Protecting Your LLMs with Information Bottleneck

December 9, 2024/The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

The advent of large language models (LLMs) has revolutionized the field of natural language processing, yet they might be attacked to produce harmful content. Despite efforts to ethically align LLMs, these are often fragile and can be circumvented by jailbreaking attacks through optimized or manual adversarial

A Survey on Detection of LLMs-Generated Content

November 13, 2024/The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

The burgeoning capabilities of advanced large language models (LLMs) such as ChatGPT have led to an increase in synthetic content generation with implications across a variety of sectors, including media, cybersecurity, public discourse, and education. As such, the ability to detect LLMs-generated content

InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration (EMNLP 2024)

November 13, 2024/The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

Large Language Models (LLMs) have achieved exceptional capabilities in open generation across various domains, yet they encounter difficulties with tasks that require intensive knowledge. To address these challenges, methods for integrating knowledge have been developed, which augment LLMs with domain-specific

Large Language Models Can Be Contextual Privacy Protection Learners

November 13, 2024/The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

The proliferation of Large Language Models (LLMs) has driven considerable interest in fine-tuning them with domain-specific data to create specialized language models. Nevertheless, such domain-specific fine-tuning data often contains contextually sensitive personally identifiable information (PII).

The WizARd and Apprentice: An Augmented Reality Expert Capture System

October 25, 2024/The 23rd IEEE International Symposium on Mixed and Augmented Reality (ISMAR 2024), Bellevue, WA

Learning to perform physical tasks is ubiquitous yet challenging without expert guidance. While Augmented Reality (AR) has been adopted to overlay instructions directly onto the physical context, the natural authoring of such content remains unexplored. To address this, we developed WizARd and Apprentice,

PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization

August 29, 2024/30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024)

Achieving carbon neutrality within industrial operations has become increasingly imperative for sustainable development. It is both a significant challenge and a key opportunity for operational optimization in industry 4.0. In recent years, Deep Reinforcement Learning (DRL) based methods offer promising

InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration (VLDB 2024)

August 28, 2024/International Workshop on LLM+KG: Data Management Opportunities in Unifying Large Language Models+Knowledge Graphs in conjunction with VLDB 2024, Guangzhou, China

Though Large Language Models (LLMs) have shown remarkable open-generation capabilities across diverse domains, they struggle with knowledge-intensive tasks. To alleviate this issue, knowledge integration methods have been proposed to enhance LLMs with domain-specific knowledge graphs using external modules.

Mastering Long-Tail Complexity on Graphs: Characterization, Learning, and Generalization

August 28, 2024/30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024)

In the context of long-tail classification on graphs, the vast majority of existing work primarily revolves around the development of model debiasing strategies, intending to mitigate class imbalances and enhance the overall performance. Despite the notable success, there is very limited literature that

POND: Multi-Source Time Series Domain Adaptation with Information-Aware Prompt Tuning

August 27, 2024/30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024)

Time series domain adaptation stands as a pivotal and intricate challenge with diverse applications, including but not limited to human activity recognition, sleep stage classification, and machine fault diagnosis. Despite the numerous domain adaptation techniques proposed to tackle this complex problem,

Distantly-Supervised Joint Extraction with Noise-Robust Learning

August 16, 2024/The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand

Joint entity and relation extraction is a process that identifies entity pairs and their relations using a single model. We focus on the problem of joint extraction in distantly-labeled data,whose labels are generated by aligning entity mentions with the corresponding entity and relation tags using a

Towards Counterfactual Fairness-aware Domain Generalization in Changing Environments

August 9, 2024/IJCAI 2024 - The 33rd International Joint Conference on Artificial Intelligence, Jeju, South Korea

Recognizing domain generalization as a commonplace challenge in machine learning, data distribution might progressively evolve across a continuum of sequential domains in practical scenarios. While current methodologies primarily concentrate on bolstering model effectiveness within these new domains,

DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton

July 27, 2024/The Forty-first International Conference on Machine Learning (ICML 2024), Vienna, Austria

This paper introduces the retrieval-augmented large language model with Definite Finite Automaton (DFA-RAG), a novel framework designed to enhance the capabilities of conversational agents using large language models (LLMs). Traditional LLMs face challenges in generating regulated and compliant responses

RIO-CPD: A Riemannian Geometric Method for Correlation-aware Online Change Point Detection

July 25, 2024/Geometry-grounded Representation Learning and Generative Modeling Workshop (ICML 2024)

The objective of change point detection is to identify abrupt changes at potentially multiple points within a data sequence. This task is particularly challenging in the online setting where various types of changes can occur, including shifts in both the marginal and joint distributions of the data.

Pruning as a Domain-specific LLM Extractor

June 20, 2024/2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024), Mexico City, Mexico

Large Language Models (LLMs) have exhibited remarkable proficiency across a wide array of NLP tasks. However, the escalation in model size also engenders substantial deployment costs. While few efforts have explored model pruning techniques to reduce the size of LLMs, they mainly center on general or

Uncertainty Quantification for In-Context Learning of Large Language Models

June 20, 2024/2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024), Mexico City, Mexico

In-context learning has emerged as a groundbreaking ability of Large Language Models (LLMs) and revolutionized various fields by providing a few task-relevant demonstrations in the prompt. However, trustworthy issues with LLMs response, such as hallucination, have also been actively discussed. Existing

Advancing Sustainability in Global Supply Chains through Agent-based Simulation

May 30, 2024/The Eighteenth International Conference on Digital Society (ICDS 2024)

In today’s world, with its complex global supply chains, the difficulties and uncertainties we face offer both challenges and opportunities for making things better, especially in terms of efficiency and sustainability. These challenges grow due to unpredictable events, such as natural disasters, unexpected

MULAN: Multi-modal Causal Structure Learning and Root Cause Analysis for Microservice Systems

May 17, 2024/The Web Conference 2024 (WWW 2024)

Effective root cause analysis (RCA) is vital for swiftly restoring services, minimizing losses, and ensuring the smooth operation and management of complex systems. Previous data-driven RCA methods, particularly those employing causal discovery techniques, have primarily focused on constructing dependency

DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text

May 11, 2024/12th International Conference on Learning Representations (ICLR 2024)

Large language models (LLMs) have notably enhanced the fluency and diversity of machine-generated text. However, this progress also presents a significant challenge in detecting the origin of a given text, and current research on detection methods lags behind the rapid evolution of LLMs. Conventional

Improving Open Information Extraction with Large Language Models: A Study on Demonstration Uncertainty

May 11, 2024/ICLR 2024 Workshop on Reliable and Responsible Foundation Models

Open Information Extraction (OIE) task aims at extracting structured facts from unstructured text, typically in the form of (subject, relation, object) triples. Despite the potential of large language models (LLMs) like ChatGPT as a general task solver, they lag behind state-of-the-art (supervised) methods

Parametric Augmentation for Time Series Contrastive Learning

May 11, 2024/12th International Conference on Learning Representations (ICLR 2024)

Modern techniques like contrastive learning have been effectively used in many areas, including computer vision, natural language processing, and graph-structured data. Creating positive examples that assist the model in learning robust and discriminative representations is a crucial stage in contrastive

Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks

May 11, 2024/12th International Conference on Learning Representations (ICLR 2024)

Graph Neural Networks (GNNs) are neural models that leverage the dependency structure in graphical data via message passing among the graph nodes. GNNs have emerged as pivotal architectures in analyzing graph-structured data, and their expansive application in sensitive domains requires a comprehensive

Dynamic Causal Discovery in Imitation Learning

March 4, 2024/The 17th ACM International Conference on Web Search and Data Mining (WSDM 2024), Merida, Yucatan, Mexico

Imitation learning, which learns agent policy by mimicking expert demonstration, has shown promising results in many applications such as medical treatment regimes and self-driving vehicles. However, it remains a difficult task to interpret control policies learned by the agent. Difficulties mainly come

Prompt-based Domain Discrimination for Multi-source Time Series Domain Adaptation

December 21, 2023/arxiv.org

Time series domain adaptation stands as a pivotal and intricate challenge with diverse applications, including but not limited to human activity recognition, sleep stage classification, and machine fault diagnosis. Despite the numerous domain adaptation techniques proposed to tackle this complex problem,

Hierarchical Gaussian Mixture based Task Generative Model for Robust Meta-Learning

December 16, 2023/Neural Information Processing Systems (NeurIPS 2023), New Orleans, LA

Meta-learning enables quick adaptation of machine learning models to new tasks with limited data. While tasks could come from varying distributions in reality, most of the existing meta-learning methods consider both training and testing tasks as from the same uni-component distribution, overlooking

Open-Ended Commonsense Reasoning with Unrestricted Answer Scope

December 10, 2023/Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore

Open-ended Commonsense Reasoning is defined as solving a commonsense question without providing 1) a short list of answer candidates and 2) a pre-defined answer scope. Conventional ways of formulating the commonsense question into a question-answering form or utilizing external knowledge to learn retrieval-based

GLAD: Content-Aware Dynamic Graphs for Log Anomaly Detection

December 2, 2023/IEEE International Conference On Knowledge Graph (ICKG-2023), Shanghai, China

Logs play a crucial role in system monitoring and debugging by recording valuable system information, including events and status. Although various methods have been proposed to detect anomalies in log sequences, they often overlook the significance of considering relationships among system components,

Adaptation Speed Analysis for Fairness-Aware Causal Models

October 25, 2023/32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)

For example, in machine translation tasks, to achieve bidirectional translation between two languages, the source corpus is often used as the target corpus, which involves the training of two models with opposite directions. The question of which one can adapt most quickly to a domain shift is of significant

Calibrate Graph Neural Networks under Out-of-Distribution Nodes via Deep Q-learning

October 25, 2023/32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)

Graph neural networks (GNNs) have achieved great success in dealing with graph-structured data that are prevalent in the real world. The core of graph neural networks is the message passing mechanism that aims to generate the embeddings of nodes by aggregating the neighboring node information. However,

Temporal Graph-Based Incident Analysis System for Internet of Things (ECML)

September 22, 2023/ECML PKDD 2023 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases

Internet-of-things (IoTs) deploy a massive number of sensors to monitor the system and environment. Anomaly detection on sensor data is an important task for IoT maintenance and operation. In real applications, the occurrence of a system-level incident usually involves hundreds of abnormal sensors, making

Temporal Graph based Incident Analysis System for Internet of Things

September 17, 2023/European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2023)

Internet-of-things (IoTs) deploy a massive number of sensors to monitor the system and environment. Anomaly detection on sensor data is an important task for IoT maintenance and operation. In real applications, the occurrence of a system-level incident usually involves hundreds of abnormal sensors, making

AutoTCL: Automated Time Series Contrastive Learning with Adaptive Augmentations

August 20, 2023/The 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)

Read AutoTCL: Automated Time Series Contrastive Learning with Adaptive Augmentations publication. Modern techniques like contrastive learning have been effectively used in many areas, including computer vision, natural language processing, and graph-structured data. Creating positive examples that assist

FedSkill: Privacy Preserved Interpretable Skill Learning via Imitation

August 10, 2023/29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)

Read FedSkill: Privacy Preserved Interpretable Skill Learning via Imitation publication. Imitation learning that replicates experts’ skills via their demonstrations has shown significant success in various decision-making tasks. However, two critical challenges still hinder the deployment of imitation

Incremental Causal Graph Learning for Online Root Cause Localization

August 10, 2023/29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)

The task of root cause analysis (RCA) is to identify the root causes of system faults/failures by analyzing system monitoring data. Efficient RCA can greatly accelerate system failure recovery and mitigate system damages or financial losses. However, previous research has mostly focused on developing

Interdependent Causal Networks for Root Cause Localization

August 10, 2023/29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

The goal of root cause analysis is to identify the underlying causes of system problems by discovering and analyzing the causal structure from system monitoring data. It is indispensable for maintaining the stability and robustness of large-scale complex systems. Existing methods mainly focus on the

Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations

August 10, 2023/29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)

Imitation learning has achieved great success in many sequential decision-making tasks, in which a neural agent is learned by imitating collected human demonstrations. However, existing algorithms typically require a large number of high-quality demonstrations that are difficult and expensive to collect.

State-Aware Anomaly Detection for Massive Sensor Data in Internet of Things

August 7, 2023/The 3rd Workshop on Artificial Intelligence-Enabled Cybersecurity Analytics

With the escalating prevalence of Internet of Things (IoTs) in critical infrastructure, the requirement for efficient and effective anomaly detection solution becomes increasingly important. Unfortunately, most prior research works have largely overlooked to adapt detection criteria for different operational

Personalized Federated Learning under Mixture Distributions

July 29, 2023/The 40th International Conference on Machine Learning (ICML 2023)

The recent trend towards Personalized Federated Learning (PFL) has garnered significant attention as it allows for the training of models that are tailored to each client while maintaining data privacy. However, current PFL techniques primarily focus on modeling the conditional distribution heterogeneity

Unsupervised Anomaly Detection Under A Multiple Modeling Strategy Via Model Set Optimization Through Transfer Learning

June 30, 2023/The 26th International Conference on Information Fusion, Charleston, SC

Unsupervised anomaly detection approaches have been widely accepted in applications for industrial systems. Industrial systems often operate with multiple modes since they work for multiple purposes or under different conditions. In order to deal with the difficulty of anomaly detection due to multiple

Multi-Label Temporal Evidential Neural Networks for Early Event Detection

June 9, 2023/2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

Early event detection aims to detect events even before the event is complete. However, most of the existing methods focus on an event with a single label but fail to be applied to cases with multiple labels. Another non-negligible issue for early event detection is a prediction with overconfidence due

Beyond One Model Fits All: A Survey of Domain Specialization for Large Language Models

June 9, 2023/arXiv

Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task agnostic foundation for a wide range of applications. The great promise of LLMs as general task solvers motivated people to extend their functionality largely beyond

Interpretable Skill Learning for Dynamic Treatment Regimes through Imitation

March 24, 2023/57th Conference on Information Sciences and Systems (CISS 2023)

Imitation learning that mimics experts’ skills from their demonstrations has shown great success in discovering dynamic treatment regimes, i.e., the optimal decision rules to treat an individual patient based on related evolving treatment and covariate history. Existing imitation learning methods,

Dynamic Prompting: A Unified Framework for Prompt Tuning

March 6, 2023/arXiv

It has been demonstrated that prompt tuning is highly effective in efficiently eliciting knowledge from language models (LMs). However, the prompt tuning still lags behind fine tuning, especially when the LMs are small. P tuning v2 (Liu et al., 2021b) makes it comparable with finetuning by adding continuous

Exploring the limits of ChatGPT for Query or Aspect based Text Summarization

February 16, 2023/arXiv

Text summarization has been a crucial problem in natural language processing (NLP) for several decades. It aims to condense lengthy documents into shorter versions while retaining the most critical information. Various methods have been proposed for text summarization, including extractive and abstractive

Time Series Contrastive Learning with Information-Aware Augmentations

February 14, 2023/Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-23)

Various contrastive learning approaches have been proposed in recent years and have achieved significant empirical success. While effective and prevalent, contrastive learning has been less explored for time series data. A key component of contrastive learning is to select appropriate augmentations,

Deep Federated Anomaly Detection for Multivariate Time Series Data

December 20, 2022/IEEE BigData 2022 - Special Session 2: Machine Learning on Big Data (MLBD 2022), Osaka, Japan

Although many anomaly detection approaches have been developed for multivariate time series data, limited effort has been made in federated settings in which multivariate time series data are heterogeneously distributed among different edge devices while data sharing is prohibited. In this paper, we

Towards Robust Graph Neural Networks via Adversarial Contrastive Learning

December 20, 2022/2022 IEEE International Conference on Big Data (IEEE BigData 2022), Osaka, Japan

Graph Neural Network (GNN), as a powerful representation learning model on graph data, attracts much attention across various disciplines. However, recent studies show that GNN is vulnerable to adversarial attacks. How to make GNN more robust? What are the key vulnerabilities in GNN? How to address the

DeepGAR: Deep Graph Learning for Analogical Reasoning

December 3, 2022/IEEE ICDM 2022 - 22nd IEEE International Conference on Data Mining, Orlando, FL

Analogical reasoning is the process of discovering and mapping correspondences from a target subject to a base subject. As the most well-known computational method of analogical reasoning, Structure-Mapping Theory (SMT) abstracts both target and base subjects into relational graphs and forms the cognitive

Personalized Federated Learning via Heterogeneous Modular Networks

December 3, 2022/IEEE ICDM 2022 - 22nd IEEE International Conference on Data Mining, Orlando, FL

Personalized Federated Learning (PFL) which collaboratively trains a federated model while considering local clients under privacy constraints has attracted much attention. Despite its popularity, it has been observed that existing PFL approaches result in sub-optimal solutions when the joint distribution

Multi-Faceted Knowledge-Driven Pre-training for Product Representation Learning

September 28, 2022/IEEE Transactions on Knowledge and Data Engineering

As a key component of e-commerce computing, product representation learning (PRL) provides benefits for a variety of applications, including product matching, search, and categorization. The existing PRL approaches have poor language understanding ability due to their inability to capture contextualized

Explainable Anomaly Detection System for Categorical Sensor Data in Internet of Things

September 23, 2022/ECML-PKDD 2022: The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Grenoble, France

Internet of things (IoT) applications deploy massive number of sensors to monitor the system and environment. Anomaly detection on streaming sensor data is an important task for IoT maintenance and operation. However, there are two major challenges for anomaly detection in real IoT applications: (1)

Multi-source Inductive Knowledge Graph Transfer

September 23, 2022/ECML PKDD 2022 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Grenoble, France

Multi-source Inductive Knowledge Graph Transfer Large-scale information systems, such as knowledge graphs (KGs), enterprise system networks, often exhibit dynamic and complex activities. Recent research has shown that formalizing these information systems as graphs can effectively characterize the entities

3D Histogram-Based Anomaly Detection for Categorical Sensor Data in Internet of Things

September 9, 2022/VLIoT 2022 - Very Large Internet of Things 2022 (virtual conference)

The applications of Internet-of-things (IoT) deploy a massive number of sensors to monitor the system and environment. Anomaly detection on streaming sensor data is an important task for IoT maintenance and operation. In real IoT applications, many sensors report categorical values rather than numerical

CAT: Beyond Efficient Transformer for Content-Aware Anomaly Detection in Event Sequences

August 18, 2022/28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

It is critical and important to detect anomalies in event sequences, which becomes widely available in many application domains. Indeed, various efforts have been made to capture abnormal patterns from event sequences through sequential pattern analysis or event representation learning. However, existing

Towards Learning Disentangled Representations for Time Series

August 18, 2022/28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2022)

Promising progress has been made toward learning efficient time series representations in recent years, but the learned representations often lack interpretability and do not encode semantic meanings by the complex interactions of many latent factors. Learning representations that disentangle these latent

SEED: Sound Event Early Detection via Evidential Uncertainty

May 27, 2022/2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), Singapore (virtual paper presentations)

Sound Event Early Detection (SEED) is an essential task in recognizing the acoustic environments and soundscapes. However, most of the existing methods focus on the offline sound event detection, which suffers from the over-confidence issue of early-stage event detection and usually yield unreliable

Superclass-Conditional Gaussian Mixture Model for Coarse-To-Fine Few-Shot Learning

April 29, 2022/10th International Conference on Learning Representations (ICLR 2022)

Learning fine-grained embeddings is essential for extending the generalizability of models pre-trained on “coarse” labels (e.g., animals). It is crucial to fields for which fine-grained labeling (e.g., breeds of animals) is expensive, but fine-grained prediction is desirable, such as medicine. The dilemma

Zero-Shot Cross-Lingual Machine Reading Comprehension via Inter-Sentence Dependency Graph

March 1, 2022/Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-2022)

We target the task of cross-lingual Machine Reading Comprehension (MRC) in the direct zero-shot setting, by incorporating syntactic features from Universal Dependencies (UD), and the key features we use are the syntactic relations within each sentence. While previous work has demonstrated effective syntax-guided

Ordinal Quadruplet: Retrieval of Missing Labels in Ordinal Time Series

January 24, 2022/arXiv

In this paper, we propose an ordered time series classification framework that is robust against missing classes in the training data, i.e., during testing we can prescribe classes that are missing during training. This framework relies on two main components: (1) our newly proposed ordinal quadruplet

Dynamic Causal Discovery in Imitation Learning

December 14, 2021/Causal Inference Challenges in Sequential Decision Making: Bridging Theory and Practice - A NeurIPS 2021 Workshop

Using deep reinforcement learning (DRL) to recover expert policies via imitation has been found to be promising in a wide range of applications. However, it remains a difficult task to interpret the control policy learned by the agent. Difficulties mainly come from two aspects: 1) agents in DRL are usually

InfoGCL: Information-Aware Graph Contrastive Learning

December 14, 2021/Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), Virtual-only Conference

InfoGCL: Information-Aware Graph Contrastive Learning Various graph contrastive learning models have been proposed to improve the performance of tasks on graph datasets in recent years. While effective and prevalent, these models are usually carefully customized. In particular, despite all recent work

You Are What and Where You Are: Graph Enhanced Attention Network for Explainable POI Recommendation

November 15, 2021/30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

Point-of-interest (POI) recommendation is an emerging area of research on location-based social networks to analyze user behaviors and contextual check-in information. For this problem, existing approaches, with shallow or deep architectures, have two major drawbacks. First, for these approaches, the

Boosting Cross-Lingual Transfer via Self-Learning with Uncertainty Estimation

November 11, 2021/The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Recent multilingual pre-trained language models have achieved remarkable zero-shot performance, where the model is only finetuned on one source language and directly evaluated on target languages. In this work, we propose a self-learning framework that further utilizes unlabeled data of target languages,

Recommend for a Reason: Unlocking the Power of Unsupervised Aspect-Sentiment Co-Extraction

November 11, 2021/The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

Compliments and concerns in reviews are valuable for understanding users’ shopping interests and their opinions with respect to specific aspects of certain items. Existing review-based recommenders favor large and complex language encoders that can only learn latent and uninterpretable text representations.

Interpreting Convolutional Sequence Model by Learning Local Prototypes with Adaptation Regularization

November 5, 2021/30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

In many high-stakes applications of machine learning models, outputting only predictions or providing statistical confidence is usually insufficient to gain trust from end users, who often prefer a transparent reasoning paradigm. Despite the recent encouraging developments on deep networks for sequential

Structural Temporal Graph Neural Networks for Anomaly Detection in Dynamic Graphs

November 5, 2021/30th ACM International Conference on Information and Knowledge Management (CIKM 2021)

Detecting anomalies in dynamic graphs is a vital task, with numerous practical applications in areas such as security, finance, and social media. Existing network embedding based methods have mostly focused on learning good node representations, whereas largely ignoring the subgraph structural changes

Convolutional Transformer based Dual Discriminator Generative Adversarial Networks for Video Anomaly Detection

October 24, 2021/29th ACM International Conference on Multimedia (ACM Multimedia 2021)

Detecting abnormal activities in real-world surveillance videos is an important yet challenging task as the prior knowledge about video anomalies is usually limited or unavailable. Despite that many approaches have been developed to resolve this problem, few of them can capture the normal spatio-temporal

Domain oriented Language Modeling with Adaptive Hybrid Masking and Optimal Transport Alignment

August 18, 2021/KDD 2021: ACM SIGKDD Conference on Knowledge Discovery and Data Mining, SIGKDD 2021

Motivated by the success of pre-trained language models such as BERT in a broad range of natural language processing (NLP) tasks, recent research efforts have been made for adapting these models for different application domains. Along this line, existing domain-oriented models have primarily followed

Multi-Scale One-Class Recurrent Neural Networks for Discrete Event Sequence Anomaly Detection

August 18, 2021/ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)

Discrete event sequences are ubiquitous, such as an ordered event series of process interactions in Information and Communication Technology systems. Recent years have witnessed increasing efforts in detecting anomalies with discrete event sequences. However, it remains an extremely difficult task due

SIGL: Securing Software Installations Through Deep Graph Learning

August 13, 2021/USENIX Security 2021 - The 30th USENIX Security Symposium

Many users implicitly assume that software can only be exploited after it is installed. However, recent supply-chain attacks demonstrate that application integrity must be ensured during installation itself. We introduce SIGL, a new tool for detecting malicious behavior during software installation.

Hierarchical Imitation Learning with Contextual Bandits for Dynamic Treatment Regimes

July 24, 2021/The Thirty-eighth International Conference on Machine Learning (ICML 2021)

Imitation learning has been proved to be effective in mimicking experts’ behaviors from their demonstrations without access to explicit reward signals. Meanwhile, complex tasks, e.g., dynamic treatment regimes for patients with comorbidities, often suggest significant variability in expert demonstrations

FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems

June 25, 2021/CVPR 2021 - IEEE/CVF Conference on Computer Vision and Pattern Recognition

We present FACESEC, a framework for fine-grained robustness evaluation of face recognition systems. FACESEC evaluation is performed along four dimensions of adversarial modeling: the nature of perturbation (e.g., pixel-level or face accessories), the attacker’s system knowledge (about training data

Automated Anomaly Detection via Curiosity-Guided Search and Self-Imitation Learning

June 15, 2021/The IEEE Transactions on Neural Networks and Learning Systems

Anomaly detection is an important data mining task with numerous applications, such as intrusion detection, credit card fraud detection, and video surveillance. However, given a specific complicated task with complicated data, the process of building an effective deep learning-based system for anomaly

Unsupervised Concept Representation Learning for Length-Varying Text Similarity

June 11, 2021/NAACL 2021 – 2021 Annual Conference of the North American Chapter

Measuring document similarity plays an important role in natural language processing tasks. Most existing document similarity approaches suffer from the information gap caused by context and vocabulary mismatches when comparing varying-length texts. In this paper, we propose an unsupervised concept representation

Deep Multi-Instance Contrastive Learning with Dual Attention for Anomaly Precursor Detection

May 1, 2021/SIAM International Conference on Data Mining, Virtual Conference (SDM21)

Prognostics or early detection of incipient faults by leveraging the monitoring time series data in complex systems is valuable to automatic system management and predictive maintenance. However, this task is challenging. First, learning the multi-dimensional heterogeneous time series data with various

AutoOD: Neural Architecture Search for Outlier Detection

April 23, 2021/ICDE 2021 - The 37th IEEE International Conference on Data Engineering

Outlier detection is an important data mining task with numerous applications such as intrusion detection, credit card fraud detection, and video surveillance. However, given a specific task with complex data, the process of building an effective deep learning based system for outlier detection still

Learning to Drop: Robust Graph Neural Network via Topological Denoising

March 12, 2021/WSDM 2021 - The 14th ACM International WSDM Conference on Web Seach and Data Mining

Graph Neural Networks (GNNs) have shown to be powerful tools for graph analytics. The key idea is to recursively propagate and aggregate information along the edges of the given graph. Despite their success, however, the existing GNNs are usually sensitive to the quality of the input graph. Real-world

Multi-Task Recurrent Modular Networks

March 9, 2021/AAAI 2021 - 35th AAAI Conference on Artificial Intelligence

We consider the models of deep multi-task learning with recurrent architectures that exploit regularities across tasks to improve the performance of multiple sequence processing tasks jointly. Most existing architectures are painstakingly customized to learn task relationships for different problems,

Dynamic Gaussian Mixture based Deep Generative Model For Robust Forecasting on Sparse Multivariate Time Series

February 22, 2021/AAAI 2021 - 35th AAAI Conference on Artificial Intelligence

Forecasting on Sparse Multivariate Time Series Forecasting on sparse multivariate time series (MTS) aims to model the predictors of future values of time series given their incomplete past, which is important for many emerging applications. However, most existing methods process MTS’s individually,

Parameterized Explainer for Graph Neural Network

December 12, 2020/Thirty-Fourth Annual Conference on Neural Information Processing Systems (NeurIPS 2020)

Despite recent progress in Graph Neural Networks (GNNs), explaining predictions made by GNNs remains a challenging open problem. The leading method independently addresses the local explanations (i.e., important subgraph structure and node features) to interpret why a GNN model makes the prediction for

This is Why We Can’t Cache Nice Things: Lightning-Fast Threat Hunting using Suspicion-Based Hierarchical Storage

December 11, 2020/2020 Annual Computer Security Applications Conference

Recent advances in causal analysis can accelerate incident response time, but only after a causal graph of the attack has been constructed. Unfortunately, existing causal graph generation techniques are mainly offline and may take hours or days to respond to investigator queries, creating greater opportunity

T2-Net: A Semi-supervised Deep Model for Turbulence Forecasting

November 20, 2020/ICDM 2020 - The 20th IEEE International Conference on Data Mining

Accurate air turbulence forecasting can help airlines avoid hazardous turbulence, guide the routes that keep passengers safe, maximize efficiency, and reduce costs. Traditional turbulence forecasting approaches heavily rely on painstakingly customized turbulence indexes, which are less effective in dynamic

Anomaly Detection on Web-User Behaviors through Deep Learning

October 23, 2020/16th EAI International Conference on Security and Privacy in Communication Networks (SecureComm 2020)

The modern Internet has witnessed the proliferation of web applications that play a crucial role in the branding process among enterprises. Web applications provide a communication channel between potential customers and business products. However, web applications are also targeted by attackers due

VESSELS: Efficient and Scalable Deep Learning Prediction on Trusted Processors

October 21, 2020/ACM Symposium on Cloud Computing 2020 (SoCC 2020)

Deep learning systems on the cloud are increasingly targeted by attacks that attempt to steal sensitive data. Intel SGX has been proven effective to protect the confidentiality and integrity of such data during computation. However, state-of-the-art SGX systems still suffer from substantial performance

Anomalous Event Sequence Detection

September 24, 2020/IEEE Intelligent Systems

Anomaly detection has been widely applied in modern data-driven security applications to detect abnormal events/entities that deviate from the majority. However, less work has been done in terms of detecting suspicious event sequences/paths, which are better discriminators than single events/entities

Node Classification in Temporal Graphs through Stochastic Sparsification and Temporal Structural Convolution

September 18, 2020/ECML-PKDD 2020 - The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases

Node classification in temporal graphs aims to predict node labels based on historical observations. In real-world applications, temporal graphs are complex with both graph topology and node attributes evolving rapidly, which poses a high overfitting risk to existing graph learning approaches. In this

Robust Graph Representation Learning via Neural Sparsification

July 18, 2020/The 37th International Conference on Machine Learning (ICML 2020)

Graph representation learning serves as the core of important prediction tasks, ranging from product recommendation to fraud detection. Reallife graphs usually have complex information in the local neighborhood, where each node is described by a rich set of features and connects to dozens or even hundreds

At the Speed of Sound: Efficient Audio Scene Classification

June 11, 2020/The Annual ACM International Conference on Multimedia Retrieval (ICMR 2020)

Efficient audio scene classification is essential for smart sensing platforms such as robots, medical monitoring, surveillance, or autonomous vehicles. We propose a retrieval-based scene classification architecture that combines recurrent neural networks and attention to compute embeddings for short

RULENet: End-to-end Learning with the Dual-estimator for Remaining Useful Life Estimation

June 10, 2020/2020 IEEE International Conference on Prognostics and Health Management, Detroit, MI

Remaining Useful Life (RUL) estimation is a key element in Predictive maintenance. System agnostic approaches which just utilize sensor and operational time series have gained popularity due to its ease of implementation. Due to the nature of measurement or degradation mechanisms, its accurate estimation

Inductive and Unsupervised Representation Learning on Graph Structured Objects

April 30, 2020/8th International Conference on Learning Representations (ICLR 2020)

Inductive and unsupervised graph learning is a critical technique for predictive or information retrieval tasks where label information is difficult to obtain. It is also challenging to make graph learning inductive and unsupervised at the same time, as learning processes guided by reconstruction error

A Generic Edge-Empowered Graph Convolutional Network via Node-Edge Mutual Enhancement

April 24, 2020/The Web Conference 2020 (WWW 2020)

Graph Convolutional Networks (GCNs) have shown to be a powerful tool for analyzing graph-structured data. Most of previous GCN methods focus on learning a good node representation by aggregating the representations of neighboring nodes, whereas largely ignoring the edge information. Although few recent

Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes

April 24, 2020/The Web Conference 2020 (WWW 2020)

Recent developments in discovering dynamic treatment regimes (DTRs) have heightened the importance of deep reinforcement learning (DRL) which are used to recover the doctor’s treatment policies. However, existing DRL-based methods expose the following limitations: 1) supervised methods based on behavior

APTrace: A Responsive System for Agile Enterprise Level Causality Analysis

April 24, 2020/36th IEEE International Conference on Data Engineering (ICDE 2020)

While backtracking analysis has been successful in assisting the investigation of complex security attacks, it faces a critical dependency explosion problem. To address this problem, security analysts currently need to tune backtracking analysis manually with different case-specific heuristics. However,

You Are What You Do: Hunting Stealthy Malware via Data Provenance Analysis

March 9, 2020/NDSS Symposium 2020

To subvert recent advances in perimeter and host security, the attacker community has developed and employed various attack vectors to make malware much more stealthy than before to penetrate the target system and prolong its presence. The advanced malware, or stealthy malware, impersonates or abuses

Asymmetrically Hierarchical Networks with Attentive Interactions for Interpretable Review-based Recommendation

February 12, 2020/The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)

Recently, recommender systems have been able to emit substantially improved recommendations by leveraging user-provided reviews. Existing methods typically merge all reviews of a given user (item) into a long document, and then process user and item documents in the same manner. In practice, however,

Deep Unsupervised Binary Coding Networks for Multivariate Time Series Retrieval

February 12, 2020/The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)

Multivariate time series data are becoming increasingly ubiquitous in varies real-world applications such as smart city, power plant monitoring, wearable devices, etc. Given the current time series segment, how to retrieve similar segments within the historical data in an efficient and effective manner

Tensorized LSTM with Adaptive Shared Memory for Learning Trends in Multivariate Time Series

February 12, 2020/The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)

The problem of learning and forecasting underlying trends in time series data arises in a variety of applications, such as traffic management, energy optimization, etc. In literature, a trend in time series is characterized by the slope and duration, and its prediction is then to forecast the two values

Interpretable Click-Through Rate Prediction through Hierarchical Attention

February 7, 2020/The 13th ACM International Conference on Web Search and Data Mining (WSDM 2020)

Click-through rate (CTR) prediction is a critical task in online advertising and marketing. For this problem, existing approaches, with shallow or deep architectures, have three major drawbacks. First, they typically lack persuasive rationales to explain the outcomes of the models. Unexplainable predictions

Temporal Context-aware Representation Learning for Question Routing

February 7, 2020/The 13th ACM International Conference on Web Search and Data Mining (WSDM 2020)

Question routing (QR) aims at recommending newly posted questions to the potential answerers who are most likely to answer the questions. The existing approaches that learn users’ expertise from their past question-answering activities usually suffer from challenges in two aspects: 1) multi-faceted expertise

Progressive Processing of System-Behavioral Query

December 13, 2019/The 35th Annual Computer Security Applications Conference (ACSAC 2019)

System monitoring has recently emerged as an effective way to analyze and counter advanced cyber attacks. The monitoring data records a series of system events and provides a global view of system behaviors in an organization. Querying such data to identify potential system risks and malicious behaviors

Adaptive Neural Network for Node Classification in Dynamic Networks

November 11, 2019/The 19th IEEE International Conference on Data Mining (ICDM 2019)

Given a network with the labels for a subset of nodes, transductive node classification targets to predict the labels for the remaining nodes in the network. This technique has been used in a variety of applications such as voxel functionality detection in brain network and group label prediction in

Learning Robust Representations with Graph Denoising Policy Network

November 11, 2019/The 19th IEEE International Conference on Data Mining (ICDM 2019)

Existing representation learning methods based on graph neural networks and their variants rely on the aggregation of neighborhood information, which makes it sensitive to noises in the graph, e.g. erroneous links between nodes, incorrect/missing node features. In this paper, we propose Graph Denoising

Self-Attentive Attributed Network Embedding Through Adversarial Learning

November 11, 2019/The 19th IEEE International Conference on Data Mining (ICDM 2019)

Network embedding aims to learn the low-dimensional representations/embeddings of vertices which preserve the structure and inherent properties of the networks. The resultant embeddings are beneficial to downstream tasks such as vertex classification and link prediction. A vast majority of real-world

A Query System for Efficiently Investigating Complex Attack Behaviors for Enterprise Security

August 30, 2019/45th International Conference on Very Large Data Bases (VLDB 2019)

The need for countering Advanced Persistent Threat (APT) attacks has led to the solutions that ubiquitously monitor system activities in each enterprise host, and perform timely attack investigation over the monitoring data for uncovering the attack sequence. However, existing general-purpose query systems

Heterogeneous Graph Matching Networks for Unknown Malware Detection

August 16, 2019/The 28th International Joint Conference on Artificial Intelligence (IJCAI 2019)

Information systems have widely been the target of malware attacks. Traditional signature-based malicious program detection algorithms can only detect known malware and are prone to evasion techniques such as binary obfuscation, while behavior-based approaches highly rely on the malware training samples

Spatio-Temporal Attentive RNN for Node Classification in Temporal Attributed Graphs

August 16, 2019/The 28th International Joint Conference on Artificial Intelligence (IJCAI 2019)

Node classification in graph-structured data aims to classify the nodes where labels are only available for a subset of nodes. This problem has attracted considerable research efforts in recent years. In real-world applications, both graph topology and node attributes evolve over time. Existing techniques,

Clairvoyant Networks

June 21, 2019/Network Traffic Measurement and Analysis Conference (TMA Conference 2019)

We use the term clairvoyant to refer to networks that provide on-demand visibility for any flow at any time. Traditionally, network visibility is achieved by instrumenting and passively monitoring all flows in a network. SDN networks, by design endowed with full visibility, offer another alternative

Attentional Heterogeneous Graph Neural Network: Application to Program Reidentification

May 4, 2019/SIAM International Conference on Data Mining (SDM 2019)

Program or process is an integral part of almost every IT/OT system. Can we trust the identity/ID (e.g., executable name) of the program? To avoid detection, malware may disguise itself using the ID of a legitimate program, and a system tool (e.g., PowerShell) used by the attackers may have the fake

Deep Co-Clustering

May 4, 2019/SIAM International Conference on Data Mining (SDM 2019)

Co-clustering partitions instances and features simultaneously by leveraging the duality between them, and it often yields impressive performance improvement over traditional clustering algorithms. The recent development in learning deep representations has demonstrated the advantage in extracting effective

PoLPer: Process-Aware Restriction of Over-Privileged Setuid Calls in Legacy Applications

March 27, 2019/9th ACM Conference on Data and Application Security and Privacy (CODASPY 2019)

Setuid system calls enable critical functions such as user authentications and modular privileged components. Such operations must only be executed after careful validation. However, current systems do not perform rigorous checks, allowing exploitation of privileges through memory corruption vulnerabilities

Countering Malicious Processes with Process-DNS Association

February 27, 2019/The 26th Annual Network and Distributed System Security Symposium (NDSS 2019)

Modern malware and cyber attacks depend heavily on DNS services to make their campaigns reliable and difficult to track. Monitoring network DNS activities and blocking suspicious domains have been proven an effective technique in countering such attacks. However, recent successful campaigns reveal that

NODOZE: Combatting Threat Alert Fatigue with Automated Provenance Triage

February 27, 2019/The 26th Annual Network and Distributed System Security Symposium (NDSS 2019)

Large enterprises are increasingly relying on threat detection softwares (e.g., Intrusion Detection Systems) to allow them to spot suspicious activities. These softwares generate alerts which must be investigated by cyber analysts to figure out if they are true attacks. Unfortunately, in practice, there

A Deep Neural Network for Unsupervised Anomaly Detection and Diagnosis in Multivariate Time Series Data

February 1, 2019/The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019)

Nowadays, multivariate time series data are increasingly collected in various real-world systems, e.g., power plants, wearable devices, etc. Anomaly detection and diagnosis in multivariate time series refer to identifying abnormal status in certain time steps and pinpointing the root causes. Building

Behavior-based Community Detection: Application to Host Assessment in Enterprise Information Networks

October 26, 2018/Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM 2018)

Behavior-based Community Detection: Application to Host Assessment in Enterprise Information Networks Community detection in complex networks is a fundamental problem that attracts much attention across various disciplines. Previous studies have been mostly focusing on external connections between nodes

Collaborative Alert Ranking for Anomaly Detection

October 26, 2018/Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM 2018)

Given a large number of low-quality heterogeneous categorical alerts collected from an anomaly detection system, how to characterize the complex relationships between different alerts and deliver trustworthy rankings to end users? While existing techniques focus on either mining alert patterns or filtering

TGNet: Learning to Rank Nodes in Temporal Graphs

October 26, 2018/Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM 2018)

Node ranking in temporal networks are often impacted by heterogeneous context from node content, temporal, and structural dimensions. This paper introduces TGNet , a deep-learning framework for node ranking in heterogeneous temporal graphs. TGNet utilizes a variant of Recurrent Neural Network to adapt

NodeMerge: Template Based Efficient Data Reduction For Big-Data Causality Analysis

October 19, 2018/Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security (ACM CCS 2018)

Today’s enterprises are exposed to sophisticated attacks, such as Advanced Persistent Threats~(APT) attacks, which usually consist of stealthy multiple steps. To counter these attacks, enterprises often rely on causality analysis on the system activity data collected from a ubiquitous system monitoring

Deep Learning IP Network Representations

August 24, 2018/Big-DAMA 2018 - ACM SIGCOMM 2018 Workshop on Big Data Analytics and Machine Learning for Data Communication Networks

We present DIP, a deep learning-based framework to learn structural properties of the Internet, such as node clustering or distance between nodes. Existing embedding-based approaches use linear algorithms on a single source of data, such as latency or hop count information, to approximate the position

Deep r-th Root Rank Supervised Joint Binary Embedding for Multivariate Time Series Retrieval

August 23, 2018/KDD 2018 - 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Multivariate time series data are becoming increasingly common in numerous real-world applications, e.g., power plant monitoring, health care, wearable devices, automobiles, etc. As a result, multivariate time series retrieval, i.e., given the current multivariate time series segment, how to obtain its

Learning Deep Network Representations with Adversarially Regularized Autoencoders

August 23, 2018/KDD 2018 - 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

The problem of network representation learning, also known as network embedding, arises in many machine learning tasks assuming that there exist a small number of variabilities in the vertex representations which can capture the “semantics” of the original network structure. Most existing network embedding

NetWalk: A Flexible Deep Embedding Approach for Anomaly Detection in Dynamic Networks

August 23, 2018/KDD 2018 – 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Massive and dynamic networks arise in many practical applications such as social media, security and public health. Given an evolutionary network, it is crucial to detect structural anomalies, such as vertices and edges whose “behaviors” deviate from underlying majority of the network, in a real-time

TINET: Transferring Knowledge between Invariant Networks

August 23, 2018/KDD 2018 - 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

The latent behavior of an information system that can exhibit extreme events, such as system faults or cyber-attacks, is complex. Recently, the invariant network has shown to be a powerful way of characterizing complex system behaviors. Structures and evolutions of the invariance network, in particular,

SAQL: A Stream-based Query System for Real-Time Abnormal System Behavior Detection

August 17, 2018/The 27th USENIX Security Symposium (USENIX Security 2018)

Recently, advanced cyber attacks, which consist of a sequence of steps that involve many vulnerabilities and hosts, compromise the security of many well-protected businesses. This has led to solutions that ubiquitously monitor system activities in each host (big data) as a series of events and search

Exploiting Graph Regularized Multi-dimensional Hawkes Processes for Modeling Events with Spatio-temporal Characteristics

July 19, 2018/The 27th International Joint Conference on Artificial Intelligence (IJCAI-18)

Multi-dimensional Hawkes processes (MHP) has been widely used for modeling temporal events. However, when MHP was used for modeling events with spatio-temporal characteristics, the spatial information was often ignored despite its importance. In this paper, we introduce a framework to exploit MHP for

AIQL: Enabling Efficient Attack Investigation from System Monitoring Data

July 13, 2018/Proceedings of the 2018 USENIX Annual Technical Conference (ATC 18)

The need for countering Advanced Persistent Threat (APT) attacks has led to solutions that ubiquitously monitor system activities in each host and perform timely attack investigation over the monitoring data for analyzing attack provenance. However, existing query systems based on relational databases

LogLens: A Real-time Log Analysis System

July 2, 2018/38th IEEE International Conference on Distributed Computing Systems (ICDCS 2018)

Administrators of most user-facing systems depend on periodic log data to get an idea of the health and status of production applications. Logs report information, which is crucial to diagnose the root cause of complex problems. In this paper, we present a real-time log analysis system called LogLens

Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection

May 3, 2018/Proceedings of the 6th International Conference on Learning Representations, Vancouver Convention Center (ICLR 2018)

Unsupervised anomaly detection on multi- or high-dimensional data is of great importance in both fundamental machine learning research and industrial applications, for which density estimation lies at the core. Although previous approaches based on dimensionality reduction followed by density estimation

Co-Regularized Deep Multi-Network Embedding

April 27, 2018/Proceedings of the 2018 World Wide Web Conference (WWW 2018)

Network embedding aims to learn a low-dimensional vector representation for each node in the social and information networks, with the constraint to preserve network structures. Most existing methods focus on single network embedding, ignoring the relationship between multiple networks. In many real-world

Towards a Timely Causality Analysis for Enterprise Security

February 21, 2018/Proceedings of Network and Distributed Systems Security (NDSS) Symposium 2018

The increasingly sophisticated Advanced Persistent Threat (APT) attacks have become a serious challenge for enterprise IT security. Attack causality analysis, which tracks multi-hop causal relationships between files and processes to diagnose attack provenances and consequences, is the first step towards