
NEC Laboratories America, Inc.
Yun Chi
Emails:
________________________________________________________
I am a research staff member at NEC Laboratories America, Inc. My current research interests include:
- link, content, and trend analysis on Web, Blogosphere, and other social networks
- data mining, including mining structured data, mining streaming data, and mining probabilistic data
- machine learning
Here are my publication lists collected by DBLP and by ACM.
Here are some of my publications, patents, and source codes.
Journal papers
-
iOLAP: A Framework for Analyzing the Internet, Social Networks, and Other Networked Data.
Yun Chi, Shenghuo Zhu, Koji Hino, Yihong Gong, Yi Zhang.
IEEE Transactions on Multimedia, 2009. (accepted)
-
On Evolutionary Spectral Clustering
Yun Chi, Xiaodan Song, Dengyong Zhou, Koji Hino, Belle L. Tseng.
ACM Transactions on Knowledge Discovery from Data, 2009. (accepted)
-
Analyzing Communities and Their Evolutions in Dynamic Social Networks
Yu-Ru Lin, Yun Chi, Shenghuo Zhu, Hari Sundaram, Belle L. Tseng.
ACM Transactions on Knowledge Discovery from Data, 2009. (accepted)
-
Detecting Splogs via Temporal Dynamics using Self-similarity Analysis.
Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura, Belle L. Tseng.
ACM Transactions on the Web, Vol. 2, No. 1, pp. 4:1-4:35, 2008.
-
Catch the Moment: Maintaining Closed Frequent Itemsets in a Data Stream Sliding Window.
Yun Chi, Haixun Wang, Philip S. Yu, Richard R. Muntz.
Knowledge and Information Systems, vol. 10, no. 3, pp. 265-294, 2006.
-
Frequent Subtree Mining--An Overview.
Yun Chi, Siegfried Nijssen, Richard R. Muntz, Joost N. Kok.
Fundamenta Informaticae, vol. 66, No. 1-2, pp. 161-198, 2005.
-
Mining Closed and Maximal Frequent Subtrees from Databases of Labeled Rooted Trees.
Yun Chi, Yi Xia, Yirong Yang, Richard R. Muntz.
IEEE Transactions on Knowledge and Data Engineering, Vol. 17, No. 2, pp. 190-202, 2005.
-
Canonical Forms for Labeled Trees and Their Applications in Frequent Subtree Mining.
Yun Chi, Yirong Yang, Richard R. Muntz.
Knowledge and Information Systems, vol. 8, no. 2, pp. 203-234, 2005.
Conference papers
-
Probabilistic polyadic factorization and its application to personalized recommendation.
Yun Chi, Shenghuo Zhu, Yihong Gong, Yi Zhang.
ACM 17th Conference on Information and Knowledge Management (CIKM), 2008.
-
Integrating clustering and multi-document summarization to improve document understanding.
Dingding Wang, Shenghuo Zhu, Tao Li, Yun Chi, Yihong Gong.
ACM 17th Conference on Information and Knowledge Management (CIKM), 2008.
-
Efficient Computation of Personal Aggregate Queries on Blogs.
Ka Cheung Sia, Junghoo Cho, Yun Chi, Belle L. Tseng.
The 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2008.
-
FacetNet: A Framework for Analyzing Communities and Their Evolutions in Dynamic Networks.
Yu-Ru Lin, Yun Chi, Shenghuo Zhu, Hari Sundaram, Belle L. Tseng.
The 17th International World Wide Web Conference (WWW), 2008.
-
Evolutionary Spectral Clustering by Incorporating Temporal Smoothness.
Yun Chi, Xiaodan Song, Dengyong Zhou, Koji Hino, Belle L. Tseng.
The 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2007. (Runner-up for the Best Research Paper Award)
-
Structural and Temporal Analysis of the Blogosphere Through Community Factorization.
Yun Chi, Shenghuo Zhu, Xiaodan Song, Junichi Tatemura, Belle L. Tseng.
The 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2007.
-
Information Flow Modeling based on Diffusion Rate for Prediction and Ranking.
Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng.
16th International World Wide Web Conference (WWW), 2007.
-
Combining Content and Link for Classification using Matrix Factorization.
Shenghuo Zhu, Kai Yu, Yun Chi, Yihong Gong.
The 30th Annual International ACM SIGIR Conference (SIGIR), 2007.
-
Incremental Spectral Clustering With Application to Monitoring of Evolving Blog Communities.
Huazhong Ning, Wei Xu, Yun Chi, Yihong Gong, Thomas Huang.
SIAM International Conference on Data Mining (SDM), 2007.
-
Blog Community Discovery and Evolution Based on Mutual Awareness Expansion.
Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura, Belle L. Tseng.
The IEEE/WIC/ACM International Conference on Web Intelligence (WI), 2007.
-
Splog Detection Using Content, Time and Link Structures.
Yu-Ru Lin, Hari Sundaram, Yun Chi, Jun Tatemura, Belle L. Tseng.
IEEE International Conference on Multimedia & Expo (ICME), 2007.
-
Identifying Opinion Leaders in the Blogosphere.
Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng.
ACM Sixteenth Conference on Information and Knowledge Management (CIKM), 2007.
-
Capturing User Interests by Both Exploitation and Exploration.
Ka Cheung Sia, Shenghuo Zhu, Yun Chi, Koji Hino, Belle L. Tseng.
11th International Conference on User Modeling (UM), 2007.
-
RSS Feeds Monitoring Driven by User Browsing Pattern.
Ka Cheung Sia, Junghoo Cho, Koji Hino, Yun Chi, Shenghuo Zhu, Belle L. Tseng.
International Conference on User Modeling (ICWSM), 2007.
-
Eigen-Trend: Trend Analysis in the Blogosphere based on Singular Value Decompositions.
Yun Chi, Belle L. Tseng, Junichi Tatemura.
ACM Fifteenth Conference on Information and Knowledge Management (CIKM), 2006.
-
Loadstar: A Load Shedding Scheme for Classifying Data Streams.
Yun Chi, Philip S. Yu, Haixun Wang, Richard R. Muntz.
The SIAM International Conference on Data Mining (SDM), 2005.
-
Moment: Maintaining Closed Frequent Itemsets over a Stream Sliding Window.
Yun Chi, Haixun Wang, Philip S. Yu, Richard R. Muntz.
The Fourth IEEE International Conference on Data Mining (ICDM), 2004.
-
HybridTreeMiner: An Efficient Algorithm for Mining Frequent Rooted Trees and Free Trees Using Canonical Forms.
Yun Chi, Yirong Yang, Richard R. Muntz.
The Sixteenth International Conference on Scientific and Statistical Database Management (SSDBM), 2004.
-
CMTreeMiner: Mining Both Closed and Maximal Frequent Subtrees.
Yun Chi, Yirong Yang, Yi Xia, Richard R. Muntz.
The Eighth Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2004.
-
Indexing and Mining Free Trees.
Yun Chi, Yirong Yang, Richard R. Muntz.
The Third IEEE International Conference on Data Mining (ICDM), 2003.
Workshop and other papers
-
Splog Detection Using Self-similarity Analysis on Blog Temporal Dynamics.
Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura, Belle L. Tseng.
Third International Workshop on Adversarial Information Retrieval on the Web (AIRWeb), 2007.
-
Summarization System by Identifying Influential Blogs.
Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng.
International Conference on Weblog and Social Media (ICWSM), 2007. (demo paper)
-
The Splog Detection Task and A Solution Based on Temporal and Link Properties.
Yu-Ru Lin, Wen-Yen Chen, Xiaolin Shi, Richard Sia, Xiaodan Song, Yun Chi, Koji Hino, Hari Sundaram, Junichi Tatemura, Belle L. Tseng.
Proceedings of 2006 Text REtrieval Conference (TREC), 2006.
-
Discovery of Blog Communities based on Mutual Awareness.
Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura, Belle L. Tseng.
Third Annual Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics, 2006.
-
Loadstar: Load Shedding in Data Stream Mining.
Yun Chi, Haixun Wang, Philip S. Yu.
31st International Conference on Very Large Data Bases (VLDB), 2005. (demo paper)
-
Frequent Subtree Mining--An Overview.
Yun Chi, Siegfried Nijsseny, Richard R. Muntz, Joost N. Kok.
Lecture Notes for the First International Workshop on Mining Graphs, Trees and Sequences (MGTS'03), 2004.
-
Mining Association Rules with Non-uniform Privacy Concerns.
Yi Xia, Yirong Yang, Yun Chi, Richard R. Muntz.
The 9th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD), 2004.
Patents
-
Tianbao Yang, Yun Chi, Shenghuo Zhu, Yihong Gong
Systems and Methods for Finding Communities and Their Evolutions in Dynamic Social Network
(pending)
-
Yun Chi, Shenghuo Zhu, Yihong Gong
Systems and Methods for Processing High-dimensional Data
(pending)
-
Shenghuo Zhu, Dingding Wang, Yun Chi, Yihong Gong
Multi-Document Summarization Utilizing Document Clustering
(pending)
-
Yun Chi, Xiaodan Song, Dengyong Zhou, Koji Hino, Belle L. Tseng
System and Method for Evolutionary Spectral Clustering
(pending)
-
Shenghuo Zhu, Kai Yu, Yun Chi, Yihong Gong
Combining Content and Link for Classification using Matrix Factorization
(pending)
-
Yun Chi, Belle L. Tseng, Junichi Tatemura
System and Method for Trend Extraction and Analysis of Dynamic Data
(Disclosure No. NECLAB-PAUS0008, pending)
-
Yun Chi, Philip S. Yu, Haixun Wang
System and Method for Load Shedding in Classifying Data Streams
(Disclosure No. YOR820040594, pending)
-
Yun Chi, Philip S. Yu, Haixun Wang
System and Method for Maintaining Closed Frequent Itemsets over a Data Stream Sliding Window
(Disclosure No. YOR820040811, pending)
Services
PC member for ICDM'06, CIKM'08, SIAM SDM'09, WWW'09.
Software
In my student life, I have developed some software to support my research. All codes were implemented in MS VC++ 6.0 and successfuly compiled under Redhat Linux 6.0 using g++ 2.96. Please feel free to use the software for academic purposes.
-
MomentFP:
a system for mining and maintaining closed frequent itemsets over data streams [MomentFP.tar.gz]
-
FreeTreeMiner:
an a priori algorithm for mining frequent free trees [FreeTreeMiner.tgz]
-
RootedTreeMiner:
an algorithm for mining frequent rooted unordered trees [RootedTreeMiner.tgz]
-
HybridTreeMiner Free:
an algorithm for mining frequent free trees [HybridTreeMiner_Free.tgz]
-
HybridTreeMiner Rooted:
an algorithm for mining frequent rooted unordered trees [HybridTreeMiner_Rooted.tgz]
-
CMTreeMiner Unordered:
an algorithm for mining closed and maximal frequent rooted unordered trees [CMUnorderedTreeMiner.tgz]
-
CMTreeMiner Ordered:
an algorithm for mining closed and maximal frequent rooted ordered trees [CMOrderedTreeMiner.tgz]
NEC Laboratories America Home
©2007 NEC Laboratories America, Inc. All rights reserved.