The directory proteomics/ contains protein FASTA files from several organisms. Because some proteomics software, such as Proteome Discoverer, can produce errors when working with long descriptions or special characters, the functional descriptions in these files have either been removed or trimmed and stripped of special characters. Files named 'species_version_proteins_filtered_description.fasta' contain sequences with a description trimmed to 80 characters. Files named 'species_version_proteins_no_description.fasta' contain only sequence identifiers. Files named 'species_version_proteins_pd.fasta' contain only sequence identifiers, and masked sequences have been removed.
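
As an illustration of the header clean-up described above, here is a minimal Python sketch. It is not the script used to generate the files in this directory. The function names (clean_header, rewrite_fasta) are hypothetical, and the set of characters treated as "special" is an assumption; only the 80-character limit comes from the description above.

```python
import re

MAX_DESCRIPTION_LENGTH = 80  # limit stated for the *_filtered_description.fasta files

# Assumption: anything other than letters, digits, spaces, '_', '.' and '-'
# counts as a special character. Adjust to what your software rejects.
SPECIAL_CHARS = re.compile(r"[^A-Za-z0-9 _.\-]")


def clean_header(header, keep_description=True):
    """Return a FASTA header with the description trimmed or removed."""
    parts = header[1:].strip().split(None, 1)
    identifier = parts[0] if parts else ""
    if not keep_description or len(parts) < 2:
        return ">" + identifier
    # Drop special characters, collapse whitespace, then trim to the limit.
    description = SPECIAL_CHARS.sub("", parts[1])
    description = " ".join(description.split())[:MAX_DESCRIPTION_LENGTH].rstrip()
    return ">" + identifier + (" " + description if description else "")


def rewrite_fasta(lines, keep_description=True):
    """Yield FASTA lines with cleaned headers; sequence lines pass through."""
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            yield clean_header(line, keep_description)
        elif line:
            yield line
```

Calling rewrite_fasta(handle, keep_description=False) on an open FASTA file would yield identifier-only headers, similar in spirit to the '_no_description' files. Removing masked sequences, as in the '_pd' files, is not covered by this sketch.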