These files were downloaded and unzipped from: govdocs1 zip files

We removed the files that were identified as malware in 2013: metascan.

For background on this corpus, please see: govdocs1.

If you use these files, please cite: Garfinkel, Farrell, Roussev and Dinolt, Bringing Science to Digital Forensics with Standardized Forensic Corpora, DFRWS 2009, Montreal, Canada