Contest Information
Data for the competition held at the SIAM Text Mining 2007 Workshop, part of the Seventh SIAM International Conference on Data Mining.
- Contest Description and Rules
- TrainingData.txt (25MB file; 21,519 reports with 1 report per record)
- TrainCategoryMatrix.csv (925KB file; 21,519 reports x 22 categories matrix)
- Contest scoring information/software
- TestData (2.9MB, 7,077 ASRS reports)
- TestTruth (450KB, correct anomaly assignments for each of the 7,077 ASRS test reports)
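The listing above implies that each training report lines up with one row of the 22-column category matrix. A minimal sketch of that pairing follows. It rests on several assumptions that the listing does not confirm: that TrainingData.txt stores one report per line, that TrainCategoryMatrix.csv is comma-separated with no header row, and that its cells are integers. The function names `read_category_matrix` and `pair_reports_with_labels` are hypothetical helpers, not part of the contest software.

```python
import csv
import io

N_CATEGORIES = 22  # from the listing: 21,519 reports x 22 categories


def read_category_matrix(handle, n_categories=N_CATEGORIES):
    """Parse a label matrix with one row per report, one column per category.

    Assumes comma-separated integer cells and no header row (not confirmed
    by the listing).
    """
    rows = []
    for line_no, row in enumerate(csv.reader(handle), start=1):
        if not row:
            continue  # skip blank lines
        if len(row) != n_categories:
            raise ValueError(
                f"row {line_no}: expected {n_categories} columns, got {len(row)}"
            )
        rows.append([int(cell) for cell in row])
    return rows


def pair_reports_with_labels(report_handle, matrix_handle,
                             n_categories=N_CATEGORIES):
    """Return (report_text, label_row) pairs, checking that the counts agree.

    Assumes one report per non-empty line (not confirmed by the listing).
    """
    reports = [line.rstrip("\n") for line in report_handle if line.strip()]
    labels = read_category_matrix(matrix_handle, n_categories)
    if len(reports) != len(labels):
        raise ValueError(
            f"{len(reports)} reports but {len(labels)} label rows"
        )
    return list(zip(reports, labels))


# Tiny in-memory stand-ins for the real files, using 3 categories.
demo_reports = io.StringIO("first report text\nsecond report text\n")
demo_matrix = io.StringIO("1,-1,1\n-1,-1,1\n")
demo_pairs = pair_reports_with_labels(demo_reports, demo_matrix, n_categories=3)
print(len(demo_pairs))
```

With the real files, the two handles would come from `open(...)` on TrainingData.txt and TrainCategoryMatrix.csv, and if the assumptions hold the result should contain 21,519 pairs. The length check is there so that a wrong guess about the record format fails loudly instead of silently misaligning reports and labels.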