
June 2005, Volume 7, Issue 1
- Text Mining and Natural Language Processing Introduction for the Special Issue [1]
- Mining Knowledge from Text Using Information Extraction [3]
- Instance Filtering for Entity Recognition [11]
- Chinese Named Entity Recognition using Lexicalized HMMs [19]
- Distributed Higher Order Association Rule Mining Using Information Extracted from Textual Data [26]
- Support Vector Machines Classification with A Very Large-scale Taxonomy [36]
- Mining Logs Files for DataDriven System Management [44]
- A baseline feature set for learning rhetorical zones using full articles in the biomedical domain [52]
- Using Text Mining and Natural Language Processing for Health Care Claims Processing [59]
- Extracting Statistical Data Frames from Text [67]
- Generalized Naive Bayes Classifiers [76]

December 2004, Volume 6, Issue 2
- Editorial: Special Issue on Web Content Mining [1]
- Extracting Relational Data from HTML Repositories [5]
- Learning Important Models for Web Page Blocks based on Layout and Content Analysis [14]
- Learning by Googling [24]
- Correlating Summarization of Multi-source News with K-way Graph Bi-clustering [34]
- Information Diffusion Through Blogspace [43]
- Mining Structures for Semantics [53]
- Learning to Extract Information from Large Domain-specific Websites using Sequential Models [61]
- Mining Semantics for Large Scale Integration on the Web: Evidences, Insights, and Challenges [67]
- A Critical Review of Multi-Objective Optimization in Data Mining: A Position Paper [77]
- Subspace Clustering for High Dimensional Categorical Data [87]
- KDD Cup 2004: Results and Analysis [95]
- Anti-matter detection: Particle Physics Model for KDD Cup 2004 [109]
- KDD Physics Task - Discussion of Modeling Approaches [113]
- Quick and Dirty Quantum Predictions with FIRMplus [115]
- The Weka Solution to the 2004 KDD Cup [117]
- A Block-Based Support Vector Machine Approach to the Protein Homology Prediction Task in KDD Cup 2004 [120]
- Protein Matching With Custom Neural Network Objective Functions [125]
- KDD-Cup 2004 - Protein Homology Task [128]
- Report on KDD Conference 2004 Panel Discussion: Can Natural Language Processing Help Text Mining? [132]
- Overview of the ACM SIGKDD 2004 Workshops [134]
- KDD-2004 Workshop Report: Link Analysis and Group Detection (LinkKDD-2004) [136]
- MultiRelational Data Mining 2004: Workshop Report [140]
- A Workshop Report: Mining for and from the Semantic Web at KDD 2004 [142]
- The 5th International Workshop on Multimedia Data Mining (MDM/KDD2004) [144]
- WebKDD 2004 - Web Mining and Web Usage Analysis Post-Workshop report [147]
- The Third SIGKDD Workshop on Mining Temporal and Sequential Data (KDD/TDM 2004) [152]
- Report on BIOKDD04: Workshop on Data Mining in Bioinformatics [153]
- A Report on the Fourth International Workshop on Environmental Applications of Machine Learning (EAML 2004) [155]
- Data Mining Standards, Services, and Platforms 2004 (DMSSP 2004) [157]
- The 1st Workshop on ROC Analysis in Artificial Intelligence (ROCAI-2004) [159]

June 2004, Volume 6, Issue 1
- Editorial [1]
- Mining with Rarity: A Unifying Framework [7]
- A Study of the Behavior of Several Methods for Balancing Machine Learning Training Data [20]
- Learning from Imbalanced Data Sets with Boosting and Data Generation: The DataBoost-IM Approach [30]
- Class Imbalances versus Small Disjuncts [40]
- Minority Report in Fraud Detection: Classification of Skewed Data [50]
- Extreme Rebalancing for SVMs: a case study [60]
- A Multistrategy Approach for Digital Text Categorization from Imbalanced Documents [70]
- Feature Selection for Text Categorization on Imbalanced Data [80]
- Subspace Clustering for High Dimensional Data: A Review [90]
- MDM/KDD2003: Multimedia Data Mining [106]
- Advances in Frequent Itemset Mining Implementations: Report on FIMI 03 (FIMI Repository) [109]

December 2003, Volume 5, Issue 2
- Editorial []
- Microarray Data Mining: Facing the Challenges [1]
- A Novel Approach to Determine Normal Variation in Gene Expression Data [6]
- Gene Ranking Using Bootstrapped P-values [16]
- Improving Classification of Microarray Data using Prototype-based Feature Selection [23]
- Supervised Analysis When the Number of Candidate Features (p) Greatly Exceeds The Number of Cases (n) [31]
- Differential Expression, Class Discovery and Class Prediction using S-PLUS and S+ArrayAnalyzer [38]
- Machine Learning Methods Applied to DNA Microarray Data Can Improve the Diagnosis of Cancer [48]
- Loss-Based Estimation with Cross-Validation: Applications to Microarray Data Analysis [56]
- Classification of Heterogeneous Gene Expression Data [69]
- Toward Interactive Exploration of Gene Expression Patterns [79]
- Graphical Modeling Based Gene Interaction Analysis for Microarray Data [91]
- Meta-Clustering of Gene Expression Data and Literature-based Information [101]
- Mining Biologically Active Patterns in Metabolic Pathways using Microarray Expression Profiles [113]
- Statistical Methods for Joint Data Mining of Gene Expression and DNA Sequence Data [122]
- Machine Learning in Low-Level Microarray Analysis [130]
- "In vivo" Spam Filtering: A Challenge Problem for KDD [140]
- Overview of the KDD Cup 2003 [149]
- Citation Prediction Using Time Series Approach KDD Cup 2003 (Task 1) [152]
- Predicting citation rates for physics papers: Constructing features for an ordered probit model [154]
- A Data Cleaning Solution by Perl Scripts for the KDD Cup 2003 Task 2 [158]
- The Download Estimation Task on KDD Cup 2003 [160]
- Model Builder for Predictive Analytics and Fair Isaac's Approach to KDD Cup 2003 [163]
- Exploiting Relational Structure to Understand Publication Patterns in High-Energy Physics [165]
- Using Unsupervised Link Discovery Methods to Find Interesting Facts and Connections in Bibliography Dataset [173]
- The Myth of the Double-Blind Review? Author Identification Using Only Citations [179]
- EqRank: A Self-Consistent Equivalence Relation on Graph Vertexes [185]
- Summary from the KDD-03 Panel -- Data Mining: The Next 10 Years [191]
- KDD-2003 Workshop on Data Mining Standards, Services and Platforms (DM-SSP 03) [197]
- Data Mining in Bioinformatics: Report on BIOKDD '03 [198]
- Multirelational Data Mining 2003: Workshop Report [200]
- Advertisements []

July 2003, Volume 5, Issue 1
- Editorial: Multi-Relational Data Mining: The Current Frontiers []
- Multi-Relational Data Mining: An Introduction [1]
- Scalability and Efficiency in Multi-Relational Data Mining [17]
- Probabilistic Logic Learning [31]
- A Survey of Kernels for Structured Data [49]
- State of the Art of Graph-based Data Mining [59]
- Biological Applications of Multi-Relational Data Mining [69]
- Prospects and Challenges for Multi-Relational Data Mining [80]
- Link Mining: A New data Mining Challenge [84]
- Graph-based Relational Learning: Current and Future Directions [90]
- Exploratory Medical Knowledge Discovery: Experiences and Issues [94]
- A Report on the Summer School on Relational Data Mining [100]