ICDM'07: Regular Papers
DM206 "Data Discretization Unification"
Ruoming Jin, Yuri Breitbart, and Chibuike Muoh
DM207 "Clustering Needles in a Haystack: An Information Theoretic Analysis of Minority and Outlier Detection"
DM222 "Social Network Extraction of Academic Researchers"
Jie Tang, Duo Zhang, and Limin Yao
DM271 "Community Learning by Graph Approximation"
Bo Long, Xiaoyun Xu, Zhongfei Zhang, and Philip S. Yu
DM284 "Temporal analysis of semantic graphs using ASALSAN"
Brett Bader, Richard Harshman, and Tamara Kolda
DM288 "A Generalization of Proximity Functions for K-means"
Junjie Wu, Hui Xiong, and Jian Chen
DM327 "Disk Aware Discord Discovery: Finding Unusual Time Series in Terabyte Sized Datasets"
Dragomir Yankov, Eamonn Keogh, and Umaa Rebbapragada
DM338 "Non-redundant Multi-view Clustering Via Orthogonalization"
Ying Cui, Xiaoli Fern, and Jennifer Dy
DM358 "ORIGAMI: Mining Representative Orthogonal Graph Patterns"
Mohammad Hasan, Vineet Chaoji, Saeed Salem, jeremy Besson, and Mohammed Zaki
DM362 "Dynamic Micro Targeting: Fitness-Based Approach to Predicting Individual Preferences"
Tianyi Jiang and Alexander Tuzhilin
DM364 "Incorporating User provided Constraints into Document Clustering"
YANHUA CHEN, MANJEET REGE, and MING DONG
DM388 "Mining statistical information of frequent fault-tolerant patterns in transactional databases"
Ardian Kristanto Poernomo and Vivekanand Gopalkrishnan
DM406 "GMDA: General Averaged Divergences Analysis"
Dacheng Tao, Xuelong Li, Xindong Wu, and Stephen Maybank
DM408 "Parallel Mining of Frequent Closed Patterns:Harnessing Modern Computer Architectures"
Claudio Lucchese, Salvatore Orlando, and Raffaele Perego
DM417 "Multilevel Belief Propagation for Fast Inference on Markov Random Fields"
Liang Xiong, Fei Wang, and Changshui Zhang
DM442 "A Pairwise Covariance-preserving Projection Method for Dimension Reduction"
Xiaoming Liu, Jianwei Yin, Zhilin Feng, and Jinxiang Dong
DM463 "Succinct Matrix Approximation and Efficient k-NN Classification"
Rong Liu and Yong Shi
DM466 "Improving Text Classification by Using Encyclopedia Knowledge"
Pu Wang and Lijun Chen
DM470 "Maximum Entropy Based Significance of Itemsets"
DM482 "Sample Selection for Maximal Diversity"
Feng Pan, Adam Roberts, Leonard McMillan, Fernando Pardo Manuel de Villena, David Threadgill, and Wei Wang
DM485 "Lightweight Distributed Trust Propagation"
Daniele Quercia, Stephen Hailes, and Licia Capra
DM494 "A Cascaded Approach to Biomedical Named Entity Recognition Using a Unified Model"
Shing-Kit CHAN, Wai LAM, and Xiaofeng YU
DM506 "Improving Knowledge Discovery by Combining Text Mining and Link Analysis Techniques"
Wei Jin, hung-hay ho, and Xin Wu
DM571 "Efficient Data Sampling in Heterogenous Peer-to-Peer Networks"
Benjamin Arai, Song Lin, and Dimitrios Gunopulos
DM576 "Depth-Based Novelty Detection and its Application to Taxonomic Research"
Yixin Chen, Henry Bart, Xin Dang, and Hanxiang Peng
DM581 "On Appropriate Assumptions to Mine Data Streams"
Jing Gao, Wei Fan, and Jiawei Han
DM582 "Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights"
Robert Bell and Yehuda Koren
DM584 "Finding cohesive clusters for analyzing knowledge communities"
Vasileios Kandylas, S. Phineas Upham, and Lyle H. Ungar
DM638 "How Much Noise is too Much: A Study in Automatic Text Classification"
Sumeet Agarwal, Shantanu Godbole, Diwakar Punjani, and Shourya Roy
DM681 "Rule Cubes for Causal Investigations"
Axel Blumenstock, Franz Schweiggert, and Markus Müller
DM689 "A Unified Approach for Sparse Subspace Learning"
DM692 "Efficient Algorithms for Mining Significant Substructures in Graphs with Quality Guarantees"
Huahai He and Ambuj Singh
DM695 "Language-Independent Set Expansion of Named Entities using the Web"
Richard C. Wang and William Cohen
DM703 "Detecting Fractures in Classifier Performance"
David Cieslak and Nitesh Chawla
DM718 "Binary Matrix Factorization with Applications"
Zhongyuan Zhang, Tao Li, Chris Ding, and Xiang-Sun Zhang
DM728 "Structure-based Statistical Features and Multivariate Time Series Clustering"
Xiaozhe Wang, Anthony Wirth, and Liang Wang
DM736 "Supervised Learning by Training on Aggregate Outputs"
David Musicant, Janara Christensen, and Jamie Olson
DM741 "Mining Frequent Itemsets in a Stream"
Toon Calders, Nele Dexters, and Bart Goethals
DM764 "The Chosen Few: On Identifying Valuable Patterns"
Albrecht Zimmermann and Björn Bringmann
DM774 "Local Probabilistic Models for Link Prediction"
Chao Wang, Venu Satuluri, and Srinivasan Parthasarathy
ICDM'07: Short Papers
DM208 "Trend Motif: A Graph Mining Approach for Analysis of Dynamic Complex Networks"
Ruoming Jin, Scott Mccalle, and Eivind Almaas
DM211 "Co-Ranking Authors and Documents in a Heterogeneous Network"
Ding Zhou, Sergey Orshanskiy, Hongyuan Zha, and C. Lee Giles
DM243 "A Novel Criterion for Onset Detection: Differential Information Redundancy with Application to Human Movement Initiation"
Gert Van Dijck, Marc Van Hulle, and Jo Van Vaerenbergh
DM245 "Using Burstiness to Improve Clustering of Topics in News Streams"
QI HE, Kuiyu Chang, and Ee-Peng Lim
DM249 "estMax: Finding maximal frequent itemsets over online data streams"
Ho Jin Woo and Won Suk Lee
DM278 "Failure Prediction in IBM BlueGene/L Event Logs"
Yinglung Liang, Yanyong Zhang, Hui Xiong, Ramendra Sahoo, and Anand Sivasubramaniam
DM281 "Efficient Discovery of Frequent Approximate Sequential Patterns"
Feida Zhu, Xifeng Yan, Jiawei Han, and Philip S. Yu
DM299 "Sampling for Sequential Pattern Mining : From Static Databases to Data Streams"
Chedy Raïssi and Pascal Poncelet
DM324 "Incremental Subspace Clustering over Multiple Data Streams"
Qi Zhang, Jinze Liu, and Wei Wang
DM328 "Locally Constrained Support Vector Clustering"
Dragomir Yankov, Eamonn Keogh, and Kin Kan
DM331 "Optimal Subsequence Bijection"
Longin Jan Latecki, Qiang Wang, Suzan Koknar-Tezel, and Vasileios Megalooikonomou
DM335 "High-Speed Function Approximation"
Biswanath Panda, Mirek Riedewald, Johannes Gehrke, and Stephen Pope
DM360 "PRISM: A Primal Approach for Frequent Sequence Mining"
Karam Gouda, Mosab Hassaan, and Mohammed Zaki
DM374 "A Support Vector Approach to Censored Targets"
Pannagadatta Shivaswamy, Wei Chu, and Martin Jansche
DM383 "Active Learning from Data Streams"
Xingquan Zhu, Peng Zhang, Xiaodong Lin, and Yong Shi
DM391 "Analyzing and Detecting Review Spam"
Nitin Jindal and Bing Liu
DM402 "Document Transformation for Multi-label Feature Selection in Text Categorization"
DM407 "Optimizing Frequency Queries for Data Mining Applications"
Hassan Malik and John Kender
DM430 "A Semantic Kernel for Semi-Structured Documents"
Sujeevan Aseervatham, Emmanuel Viennet, and Younès Bennani
DM433 "Web Site Recommendation Using HTTP Traffic"
Ming Jia, Shaozhi Ye, Xing Li, and Julie Dickerson
DM434 "gApprox: Mining Frequent Approximate Patterns from a Massive Network"
CHEN CHEN, Xifeng Yan, Feida Zhu, and Jiawei Han
DM443 "Statistical Learning Algorithm for Tree Similarity"
Atsuhiro Takasu, Daiji Fukagawa, and Tatsuya Akutsu
DM446 "Finding Predictive Runs with LAPS"
Suhrid Balakrishnan and David Madigan
DM452 "Discovering Temporal Communities from Social Network Documents"
Ding Zhou, Isaac Councill, Hongyuan Zha, and C. Lee Giles
DM461 "Can the Content of Public Information be used to Forecast Abnormal Stock Market Behaviour?"
Calum Robertson, Shlomo Geva, and Rodney Wolff
DM476 "Mining Interpretable Human Strategies: A Case Study"
Xiaoli Fern, Chaitanya Komireddy, and Margaret Burnett
DM477 "Efficient Kernel Discriminant Analysis via Spectral Regression"
DM480 "Confident Identification of Relevant Objects Based on Nonlinear Rescaling Method and Transductive Inference"
Shen-Shyang Ho and Roman Polyak
DM502 "Using Significant, Positively and Relatively Class Correlated Rules For Associative Classification of Imbalanced Datasets"
Florian Verhein and Sanjay Chawla
DM513 "Local Word Bag Model for Text Categorization"
Wen Pu and Ning Liu
DM515 "Bandit-Based Algorithms for Budgeted Learning"
Kun Deng, Chris Bourke, Stephen Scott, and Julie Sunderman
DM524 "Noise Modelling with Associative Corruption Rules"
Yan Zhang and Xindong Wu
DM525 "A Computational Approach to Style in American Poetry"
David Kaplan and David Blei
DM533 "Cocktail Ensemble for Regression"
Yang Yu, Zhi-Hua Zhou, and Kai Ming Ting
DM548 "Mechanism Design for Clustering Aggregation by Selfish Systems"
Pinata Winoto, Yiu-ming Cheung, and Jiming Liu
DM567 "Topical N-grams: Phrase and Topic Discovery, with an Application to Information Retrieval"
Xuerui Wang, Andrew McCallum, and Xing Wei
DM572 "Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery"
David Minnen, Thad Starner, Charles Isbell, and Irfan Essa
DM586 "A Text Classification Framework with a Local Feature Ranking for Learning Social Networks"
Masoud Makrehchi and Mohamed Kamel
DM628 "Extracting Product Comparisons from Discussion Boards"
Ronen Feldman, Moshe Fresko, Jacob Goldenberg, Oded Netzer, Lyle Ungar
DM630 "Connections between mining frequent itemsets and learning generative models"
Srivatsan Laxman, Prasad Naldurg, Raja Sripada, and Ramarathnam Venkatesan
DM631 "Preserving Privacy through Data Generation"
Jilles Vreeken, Matthijs van Leeuwen, and Arno Siebes
DM650 "Cross-mining binary and numerical attributes"
Gemma Garriga, Hannes Heikinheimo, and Jouni Seppänen
DM659 "On Meta-Learning Rule Learning Heuristics"
Frederik Janssen and Johannes Fürnkranz
DM674 "Temporal And Social Network Based Blogging Behavior Prediction In BlogSpace"
BI CHEN, Qiankun Zhao, Bingjun Sun, and Prasenjit Mitra
DM691 "DUSC: Dimensionality Unbiased Subspace Clustering"
Ira Assent, Ralph Krieger, Emmanuel Müller, and Thomas Seidl
DM697 "Weighted Additive Criterion for Linear Dimension Reduction"
Jing Peng and Stefan Robila
DM699 "Bayesian Folding-In with Dirichlet Kernels for PLSI"
Alexander Hinneburg, Andre Gohr, and Hans-Henning Gabriel
DM702 "An Efficient Spectral Algorithm for Network Community Discovery and Its Applications to Biological and Social Networks"
Jianhua Ruan and Weixiong Zhang
DM713 "Recommendation via Query Centered Random Walk on K-partite Graph"
Haibin Cheng, Pang-Ning Tan, Sticklen Jon, and William F. Punch
DM714 "Solving Consensus and Semi-supervised Clustering Problems Using Nonnegative Matrix Factorization"
Tao Li, Chris Ding, and Michael Jordan
DM721 "Latent Dirichlet Conditional Naive-Bayes Models"
Arindam Banerjee and Hanhuai Shan
DM726 "Consensus Clustering"
Nam Nguyen and Rich Caruana
DM735 "Transitional Patterns and Their Significant Milestones"
Qian Wan and Aijun An
DM738 "Exploration of Link Structure and Community-based Node Roles in Network"
Jerry Scripps, Pang-Ning Tan, and Abdol-Hossein Esfahanian
DM739 "Understanding Discrete Classifiers - with a case study in gene prediction"
Muhammad Subianto and Arno Siebes
DM750 "Lazy Bagging for Classifying Imbalanced Data"
DM766 "Computing Correlation Anomaly Scores using Stochastic Nearest Neighbors"
Tsuyoshi Ide, Spiros Papadimitriou, and Michail Vlachos
DM769 "Zonal Co-location Pattern Discovery with Dynamic Parameters"
Mete Celik, James Kang, and Shashi Shekhar
DM773 "Change-Point Detection in Time-Series Data based on Subspace Identification"
Yoshinobu Kawahara, Takehisa Yairi, and Kazuo Machida
DM779 "Semi-supervised Document Clustering via Active Learning with Pairwise Constraints"
Ruizhang Huang and Wai LAM
DM789 "Training Conditional Random Fields by Periodic Step Size Adaptation"
Han-Shen Huang, Yu-Ming Chang, and Chun-Nan Hsu