June 2005, Volume 7, Issue 1

June 2005, Volume 7, Issue 1

  • Text Mining and Natural Language Processing Introduction for the Special Issue [1]
  • Mining Knowledge from Text Using Information Extraction [3]
  • Instance Filtering for Entity Recognition [11]
  • Chinese Named Entity Recognition using Lexicalized HMMs [19]
  • Distributed Higher Order Association Rule Mining Using Information Extracted from Textual Data [26]
  • Support Vector Machines Classification with A Very Large-scale Taxonomy [36]
  • Mining Logs Files for DataDriven System Management [44]
  • A baseline feature set for learning rhetorical zones using full articles in the biomedical domain [52]
  • Using Text Mining and Natural Language Processing for Health Care Claims Processing [59]
  • Extracting Statistical Data Frames from Text [67]
  • Generalized Naive Bayes Classifiers [76]

Read More

December 2004, Volume 6, Issue 2

December 2004, Volume 6, Issue 2

  • Editorial: Special Issue on Web Content Mining [1]
  • Extracting Relational Data from HTML Repositories [5]
  • Learning Important Models for Web Page Blocks based on Layout and Content Analysis [14]
  • Learning by Googling [24]
  • Correlating Summarization of Multi-source News with K-way Graph Bi-clustering [34]
  • Information Diffusion Through Blogspace [43]
  • Mining Structures for Semantics [53]
  • Learning to Extract Information from Large Domain-specific Websites using Sequential Models [61]
  • Mining Semantics for Large Scale Integration on the Web: Evidences, Insights, and Challenges [67]
  • A Critical Review of Multi-Objective Optimization in Data Mining: A Position Paper [77]
  • Subspace Clustering for High Dimensional Categorical Data [87]
  • KDD Cup 2004: Results and Analysis [95]
  • Anti-matter detection: Particle Physics Model for KDD Cup 2004 [109]
  • KDD Physics Task - Discussion of Modeling Approaches [113]
  • Quick and Dirty Quantum Predictions with FIRMplus [115]
  • The Weka Solution to the 2004 KDD Cup [117]
  • A Block-Based Support Vector Machine Approach to the Protein Homology Prediction Task in KDD Cup 2004 [120]
  • Protein Matching With Custom Neural Network Objective Functions [125]
  • KDD-Cup 2004 - Protein Homology Task [128]
  • Report on KDD Conference 2004 Panel Discussion: Can Natural Language Processing Help Text Mining? [132]
  • Overview of the ACM SIGKDD 2004 Workshops [134]
  • KDD-2004 Workshop Report: Link Analysis and Group Detection (LinkKDD-2004) [136]
  • MultiRelational Data Mining 2004: Workshop Report [140]
  • A Workshop Report: Mining for and from the Semantic Web at KDD 2004 [142]
  • The 5th International Workshop on Multimedia Data Mining (MDM/KDD2004) [144]
  • WebKDD 2004 - Web Mining and Web Usage Analysis Post-Workshop report [147]
  • The Third SIGKDD Workshop on Mining Temporal and Sequential Data (KDD/TDM 2004) [152]
  • Report on BIOKDD04: Workshop on Data Mining in Bioinformatics [153]
  • A Report on the Fourth International Workshop on Environmental Applications of Machine Learning (EAML 2004) [155]
  • Data Mining Standards, Services, and Platforms 2004 (DMSSP 2004) [157]
  • The 1st Workshop on ROC Analysis in Artificial Intelligence (ROCAI-2004) [159]

Read More

June 2004, Volume 6, Issue 1

June 2004, Volume 6, Issue 1

  • Editorial [1]
  • Mining with Rarity: A Unifying Framework [7]
  • A Study of the Behavior of Several Methods for Balancing Machine Learning Training Data [20]
  • Learning from Imbalanced Data Sets with Boosting and Data Generation: The DataBoost-IM Approach [30]
  • Class Imbalances versus Small Disjuncts [40]
  • Minority Report in Fraud Detection: Classification of Skewed Data [50]
  • Extreme Rebalancing for SVMs: a case study [60]
  • A Multistrategy Approach for Digital Text Categorization from Imbalanced Documents [70]
  • Feature Selection for Text Categorization on Imbalanced Data [80]
  • Subspace Clustering for High Dimensional Data: A Review [90]
  • MDM/KDD2003: Multimedia Data Mining [106]
  • Advances in Frequent Itemset Mining Implementations: Report on FIMI 03 (FIMI Repository) [109]

Read More

December 2003, Volume 5, Issue 2

December 2003, Volume 5, Issue 2

  • Editorial []
  • Microarray Data Mining: Facing the Challenges [1]
  • A Novel Approach to Determine Normal Variation in Gene Expression Data [6]
  • Gene Ranking Using Bootstrapped P-values [16]
  • Improving Classification of Microarray Data using Prototype-based Feature Selection [23]
  • Supervised Analysis When the Number of Candidate Features (p) Greatly Exceeds The Number of Cases (n) [31]
  • Differential Expression, Class Discovery and Class Prediction using S-PLUS and S+ArrayAnalyzer [38]
  • Machine Learning Methods Applied to DNA Microarray Data Can Improve the Diagnosis of Cancer [48]
  • Loss-Based Estimation with Cross-Validation: Applications to Microarray Data Analysis [56]
  • Classification of Heterogeneous Gene Expression Data [69]
  • Toward Interactive Exploration of Gene Expression Patterns [79]
  • Graphical Modeling Based Gene Interaction Analysis for Microarray Data [91]
  • Meta-Clustering of Gene Expression Data and Literature-based Information [101]
  • Mining Biologically Active Patterns in Metabolic Pathways using Microarray Expression Profiles [113]
  • Statistical Methods for Joint Data Mining of Gene Expression and DNA Sequence Data [122]
  • Machine Learning in Low-Level Microarray Analysis [130]
  • "In vivo" Spam Filtering: A Challenge Problem for KDD [140]
  • Overview of the KDD Cup 2003 [149]
  • Citation Prediction Using Time Series Approach KDD Cup 2003 (Task 1) [152]
  • Predicting citation rates for physics papers: Constructing features for an ordered probit model [154]
  • A Data Cleaning Solution by Perl Scripts for the KDD Cup 2003 Task 2 [158]
  • The Download Estimation Task on KDD Cup 2003 [160]
  • Model Builder for Predictive Analytics and Fair Isaac's Approach to KDD Cup 2003 [163]
  • Exploiting Relational Structure to Understand Publication Patterns in High-Energy Physics [165]
  • Using Unsupervised Link Discovery Methods to Find Interesting Facts and Connections in Bibliography Dataset [173]
  • The Myth of the Double-Blind Review? Author Identification Using Only Citations [179]
  • EqRank: A Self-Consistent Equivalence Relation on Graph Vertexes [185]
  • Summary from the KDD-03 Panel -- Data Mining: The Next 10 Years [191]
  • KDD-2003 Workshop on Data Mining Standards, Services and Platforms (DM-SSP 03) [197]
  • Data Mining in Bioinformatics: Report on BIOKDD '03 [198]
  • Multirelational Data Mining 2003: Workshop Report [200]
  • Advertisements []

Read More

July 2003, Volume 5, Issue 1

July 2003, Volume 5, Issue 1

  • Editorial: Multi-Relational Data Mining: The Current Frontiers []
  • Multi-Relational Data Mining: An Introduction [1]
  • Scalability and Efficiency in Multi-Relational Data Mining [17]
  • Probabilistic Logic Learning [31]
  • A Survey of Kernels for Structured Data [49]
  • State of the Art of Graph-based Data Mining [59]
  • Biological Applications of Multi-Relational Data Mining [69]
  • Prospects and Challenges for Multi-Relational Data Mining [80]
  • Link Mining: A New data Mining Challenge [84]
  • Graph-based Relational Learning: Current and Future Directions [90]
  • Exploratory Medical Knowledge Discovery: Experiences and Issues [94]
  • A Report on the Summer School on Relational Data Mining [100]

Read More