02 Aug 05 -- prot22 data set was changed to remove duplicate prots/gene for ensemble (ens) proteomes, as per inparanoid caluclation (result = same proteome counts). For Query entries in runs prior to this, the dupl. prots can be removed from blast output. Prior Subject db runs that have dups: ensAG, ensAM, ensCF, ensDR, ensFR, ensGG, ensHS. Should be re-run w/o dups, time permitting. This has reduced total prot22 entry count to 382143, from 463377. Note also: 1st runs of modDM, modCE were truncated (some query sets hit TG timeouts), will re-run. -- dgg 03 Aug 05 -- filtered out dupl. protein queries from the ens set, from these: (ensAG ensAM ensCF ensDR ensFR ensGG ensHS modCB modDD modMM modOG modSC ncbAT) Others are runs with dupl. removed from fasta data. .