These files were downloaded and unzipped from the govdocs1 zip files.
We removed the files that were identified as malware by metascan in 2013.
For background on this corpus, please see: govdocs1.
If you use these files, please cite: Garfinkel, Farrell, Roussev and Dinolt, "Bringing Science to Digital Forensics with Standardized Forensic Corpora," DFRWS 2009, Montreal, Canada.