This folder aims at providing test data sets and intermediate results that were not presented in the original paper. *SimMarine_23G.fa.tgz: The simulated marine data set, in FASTA format *SimMarineGroundTruth.tgz: The ground-truth homolog reads used in the benchmark experiment on the simulated marine data set *Saliva.faa.tgz: The FragGeneScan called short peptides for the real human saliva data set (SRS013942), in FASTA format *SalivaAssembledContigs.tgz: The targeted assembly results on the real human saliva data set (SRS013942) *RESFAMResults.tgz: The RPKM of each anti-microbial resistance gene families as predicted by HMM-GRASPx and HMMER3 among 12 human microbiome project data sets (6 supragingival and 6 stool) *OralSearchResults.tgz: The search results generated by HMM-GRASPx and HMMER3 *OralGroundTruth.tgz: The ground-truth definition for the 14 high-abundance genomes in the human oral data sets *OralAbundanceCorrelation.tgz: The abundances (as computed in raw read count) predicted by BWA (ground-truth), HMM-GRASPx, and HMMER3 among 8 metatranscriptomic data sets (SRP049210) *OralDEResults.tgz: The differentially expressed protein families predicted by DESeq2 on the 8 metatranscriptomic data sets (SRP049210) *OralTargetedAssembly.tgz: The contigs generated by targeted assembly on reads recruited by HMM-GRASPx and HMMER3 from 8 metatranscriptomic data sets (SRP049210)