There are 3 different datasets of proteins for an unigene build. + Proteins predicted by longest6frame (a SGN script that translate the sequence in the 6 ORF and get the longest) + Proteins predicted by estscan (http://www.ch.embnet.org/software/ESTScan2.html) + Proteins preferred (for each unigene, compare both methods and get the longest protein) For each dataset, exists two files (cds and protein). Also is provided a version of the preferred protein dataset with annotations compatibles with ProteinPilot program. ---------------------------------------- Report: ---------------------------------------- Files: * cds sequences: - cds fasta file: Capsicum_annuum_cds_predicted_by_longest6frame.v1.fasta - number of sequences: 9741 - total bases: 4089408 - average sequences length: 419 - maximum sequence length: 2145 - minimum sequence length: 66 * protein sequences: - protein fasta file: Capsicum_annuum_protein_predicted_by_longest6frame.v1.fasta - number of sequences: 9741 - total aminoacids: 1359090 - average sequences length: 139 - maximum sequence length: 715 - minimum sequence length: 22 * cds sequences: - cds fasta file: Capsicum_annuum_cds_predicted_by_estscan.v1.fasta - number of sequences: 9342 - total bases: 4227447 - average sequences length: 452 - maximum sequence length: 2604 - minimum sequence length: 51 * protein sequences: - protein fasta file: Capsicum_annuum_protein_predicted_by_estscan.v1.fasta - number of sequences: 9342 - total aminoacids: 1401887 - average sequences length: 150 - maximum sequence length: 867 - minimum sequence length: 15 * cds sequences: - cds fasta file: Capsicum_annuum_cds_predicted_by_preferred.v1.fasta - number of sequences: 9554 - total bases: 4442889 - average sequences length: 465 - maximum sequence length: 2604 - minimum sequence length: 81 * protein sequences: - protein fasta file: Capsicum_annuum_protein_predicted_by_preferred.v1.fasta - number of sequences: 9554 - total aminoacids: 1474996 - average sequences length: 154 - maximum sequence length: 867 - minimum sequence length: 27 -------------------------------------------------