GeneSeqer. Version of March 12, 2006. Date run: Thu Jul 27 13:45:38 2006 (Bayesian) Splice site model (species): Arabidopsis thaliana Fast search parameters: MinMatchLen 16, MinQualityHSP 30, MinQualityCHAIN 50. Total number of ESTs: 175665 Total sequence length: 93213537 Minimum sequence length: 89 Maximum sequence length: 1082 Length distribution (number of sequences of specified length): < 100: 1 < 200: 2188 < 300: 8544 < 400: 20465 < 500: 39499 < 600: 49432 < 700: 32872 < 800: 19308 < 900: 3155 < 1000: 193 >=1000: 8 Input file : /tmp/bac-submission-temp-flepq/C12HBa0093P12/C12HBa0093P12.seq.screen ________________________________________________________________________________ Sequence 1: C12HBa0093P12.1-1, from 1 to 9803, both strands analyzed. ... started at: Thu Jul 27 14:05:14 2006 EST library file: /tmp/cxgn-bacpublish-resources-NsUNzK/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 5 EST library file: /tmp/cxgn-bacpublish-resources-NsUNzK/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 3 ... matches indexed, elapsed seconds = 3 HitsTableSize = 2 ******************************************************************************** EST sequence 6 -strand 404 n (File: SGN-E342303-) 1 TTATTAATTT GGAGGAACAT CAACAATATT TCAACAACAT TATGGAAGGC AATAAGATGG 61 AGCATAAGAG ATCATTTACA CAAGGACATG GCAAGAAAAT GTTGTCAATG AATTATTTTA 121 GCTTAGAGTC AATTATTTTG TTACTTGGTC TTACAGCATC TTTGTTACTT TTGCCATTGA 181 TGCTTCCACC ATTGCCACCA CCACCTTTTA TGTTGTTGTT AGTCCCAATT TTCATTCTTG 241 TTGTTCTTAT GATCTTAGCT TTTATGCCTT CTAATGTTAG GAATGTGACT TGCTCATATC 301 TTTAATTGTG TGGTTTAATT TAGAAGAAAA TATCAAAGTA GATTCACTTG TTTTTTACAT 361 AATGATTAAA TACATTTTTT TCAAAAAAAA AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 1 to 1347): Exon 1 209 517 ( 309 n); cDNA 96 381 ( 286 n); score: 0.851 PPA cDNA 383 404 MATCH C12HBa0093P12.1-1+ SGN-E342303- 0.851 309 0.765 C PGS_C12HBa0093P12.1-1+_SGN-E342303- (209 517) Alignment (genomic DNA sequence = upper lines): AAAGTGTTTG CAAAATGAAA TTTTTTTAAG TTAAGAGTTA ATTTTTTTTT GTTAATTGGG 268 ||| |||| || ||||| || ||||| || ||||| | | || |||| |||| || || AAAATGTTGT CA--ATGAA- TTATTTTAGC TT-AGAGTCA A--TTATTTT GTTACTT-GG 148 TCTTAACAGC ATTTTTTTGT TATTTTTTCC AATTGATCTT TCCCCCCATT TCCCCCCCCC 328 |||| ||||| | | ||||| || |||| || |||||| | | || ||| | | || || | TCTT-ACAGC A--TCTTTGT TACTTTTGCC -ATTGATGCT T-CCACCA-T T-GCCACCAC 201 CACCTTTTTA TGTTGTTTGT TAGTCCCCAT TTTTCTTTCT TGTTGTTCTT ATGATCTTAG 388 |||||||| | ||||| |||| |||| |||| ||||| |||| |||||||||| |||||||||| CACCTTTT-A TGTTG-TTGT TAGT-CCCAA TTTTCATTCT TGTTGTTCTT ATGATCTTAG 258 CTTTTATGCT CTTAAAATGT TAGGAAATGG TAACTTGCTC ATTATCTTTA ATTGTGTGGT 448 ||||||||| ||| ||||| |||| ||| | | |||||||| | |||||||| |||||||||| CTTTTATGC- CTTCTAATGT TAGG-AAT-G TGACTTGCTC A-TATCTTTA ATTGTGTGGT 314 TTAATTTAGA AAGAAAATAT CAAAGTAGAT TCACTTGTTT TTTTACATAA TGATTAAATA 508 ||||||||| |||||||||| |||||||||| ||||||| || |||||||||| |||||||||| TTAATTTAG- AAGAAAATAT CAAAGTAGAT TCACTTG-TT TTTTACATAA TGATTAAATA 372 CATTTTTTT 517 ||||||||| CATTTTTTT 381 hqPGS_C12HBa0093P12.1-1+_SGN-E342303- (209 517) ******************************************************************************** EST sequence 7 -strand 694 n (File: SGN-E376658-) 1 TTTGTGATTT CACCAAAAAA AAAATTTGTT AAAGATTGGC ACATTTTCAA GTTCAGTATT 61 CATTCGATTT TTGATATCTA CATAAAAAAA AAGTGTCCTG GTACTACTCA ATATTCCTCA 121 GAACGACTTC ATATTCAGGT CTCGAATTCA AAACCTCACA TCAAGATTCT TAGGAAATTT 181 CAAGATTGGT TGAAAAACTC ATATCCTTCT CTAAGTTTCA AGATTGGTTC CAAATTAAAA 241 CTCGAGACTT CTGAGTAAGA GCGTACGACT AGTAATGAAC ATGGACATGG AATCATCAGA 301 GGCAAAATTG AGATCATCAA AAGGGTTTAT TAATTTGGAG GAACATCAAC AATATTTCAA 361 CAACATTATG GAAGGCAATA AGATGGAGCA TAAGAGATCA TTTACACAAG GACATGGCAA 421 GAAAATGTTG TCAATGAATT ATTTTAGCTT AGAGTCAATT ATTTTGTTAC TTGGTCTTAC 481 AGCATCTTTG TTACTTTTGC CATTGATGCT TCCACCATTG CCACCACCAC CTTTTATGTT 541 GTTGTTAGTC CCAATTTTCA TTCTTGTTGT TCTTATGATC TTAGCTTTTA TGCCTTCTAA 601 TGTTAGGAAT GTGACTTGCT CATATCTTTA ATTGTGTGGT TTAATTTAGA AGAAAATATC 661 AAAGTAGATT CACTTGTTTT TTACATAAAA AAAA Predicted gene structure (within gDNA segment 1 to 1211): Exon 1 209 498 ( 290 n); cDNA 422 688 ( 267 n); score: 0.841 MATCH C12HBa0093P12.1-1+ SGN-E376658- 0.841 290 0.418 C PGS_C12HBa0093P12.1-1+_SGN-E376658- (209 498) Alignment (genomic DNA sequence = upper lines): AAAGTGTTTG CAAAATGAAA TTTTTTTAAG TTAAGAGTTA ATTTTTTTTT GTTAATTGGG 268 ||| |||| || ||||| || ||||| || ||||| | | || |||| |||| || || AAAATGTTGT CA--ATGAA- TTATTTTAGC TT-AGAGTCA A--TTATTTT GTTACTT-GG 474 TCTTAACAGC ATTTTTTTGT TATTTTTTCC AATTGATCTT TCCCCCCATT TCCCCCCCCC 328 |||| ||||| | | ||||| || |||| || |||||| | | || ||| | | || || | TCTT-ACAGC A--TCTTTGT TACTTTTGCC -ATTGATGCT T-CCACCA-T T-GCCACCAC 527 CACCTTTTTA TGTTGTTTGT TAGTCCCCAT TTTTCTTTCT TGTTGTTCTT ATGATCTTAG 388 |||||||| | ||||| |||| |||| |||| ||||| |||| |||||||||| |||||||||| CACCTTTT-A TGTTG-TTGT TAGT-CCCAA TTTTCATTCT TGTTGTTCTT ATGATCTTAG 584 CTTTTATGCT CTTAAAATGT TAGGAAATGG TAACTTGCTC ATTATCTTTA ATTGTGTGGT 448 ||||||||| ||| ||||| |||| ||| | | |||||||| | |||||||| |||||||||| CTTTTATGC- CTTCTAATGT TAGG-AAT-G TGACTTGCTC A-TATCTTTA ATTGTGTGGT 640 TTAATTTAGA AAGAAAATAT CAAAGTAGAT TCACTTGTTT TTTTACATAA 498 ||||||||| |||||||||| |||||||||| ||||||| || |||||||||| TTAATTTAG- AAGAAAATAT CAAAGTAGAT TCACTTG-TT TTTTACATAA 688 hqPGS_C12HBa0093P12.1-1+_SGN-E376658- (209 498) ******************************************************************************** EST sequence 3 +strand 648 n (File: SGN-E341757+) 1 AAAAACTACA AGTGTCTCTA ATGCATCCAA CTCCTTTTTC CTCCCAAAAA TCACTCACAA 61 TAATGGCTTT ACAACAACCA TTTCTATCTG AATTACTACT CCACTCATCT TCAATTAGAA 121 CAACACTCCC CAAAAGTTCA CTTTTTTCTC TACAAATACC TCAAAGATTC AATCTTTATC 181 TTCAAAATAA GCAAATCAAG AAACAAGGAA CAAGTTGTTA TGCAATTGCT GAGAATCTTG 241 AAGCTGAGGA TCAAAGTTTG ATTTTGGAAA ATTCAAGTTT AGATGAGTTG AGGGGACAAA 301 GGGAAATTGT TGGTTATGAT TGGACTGAAG AATGGTATCC TTTGTATTTA ACCAAGAATG 361 TACCTAATGA TGCACCTTTA GGTCTTACTG TCTTTGATAA ACAAGTTGTT TTGTATAAAG 421 ATGGAAGTGG TGAACTTAGA TGCTTTGAAG ATAGATGTCC ACATAGATTG GCCAAACTCT 481 CCGAAGGGCA ACTGTATGAT GGCAAACTGG AATGCTTGTA TCACGGTTGG CAGTTTGACG 541 GAGATGGTAA ATGTGTCAAG ATACCACAGC TACCTTGAAA TGCTAAAATT CCTCGATCAG 601 CTTGTACGAA GACATATGAA ATTCGGGATT CCAAAGGAGT AGTCTGGA Predicted gene structure (within gDNA segment 7774 to 9803): Exon 1 8374 8839 ( 466 n); cDNA 1 466 ( 466 n); score: 1.000 MATCH C12HBa0093P12.1-1+ SGN-E341757+ 1.000 466 0.719 C PGS_C12HBa0093P12.1-1+_SGN-E341757+ (8374 8839) Alignment (genomic DNA sequence = upper lines): AAAAACTACA AGTGTCTCTA ATGCATCCAA CTCCTTTTTC CTCCCAAAAA TCACTCACAA 8433 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAACTACA AGTGTCTCTA ATGCATCCAA CTCCTTTTTC CTCCCAAAAA TCACTCACAA 60 TAATGGCTTT ACAACAACCA TTTCTATCTG AATTACTACT CCACTCATCT TCAATTAGAA 8493 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAATGGCTTT ACAACAACCA TTTCTATCTG AATTACTACT CCACTCATCT TCAATTAGAA 120 CAACACTCCC CAAAAGTTCA CTTTTTTCTC TACAAATACC TCAAAGATTC AATCTTTATC 8553 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAACACTCCC CAAAAGTTCA CTTTTTTCTC TACAAATACC TCAAAGATTC AATCTTTATC 180 TTCAAAATAA GCAAATCAAG AAACAAGGAA CAAGTTGTTA TGCAATTGCT GAGAATCTTG 8613 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAAAATAA GCAAATCAAG AAACAAGGAA CAAGTTGTTA TGCAATTGCT GAGAATCTTG 240 AAGCTGAGGA TCAAAGTTTG ATTTTGGAAA ATTCAAGTTT AGATGAGTTG AGGGGACAAA 8673 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGCTGAGGA TCAAAGTTTG ATTTTGGAAA ATTCAAGTTT AGATGAGTTG AGGGGACAAA 300 GGGAAATTGT TGGTTATGAT TGGACTGAAG AATGGTATCC TTTGTATTTA ACCAAGAATG 8733 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGAAATTGT TGGTTATGAT TGGACTGAAG AATGGTATCC TTTGTATTTA ACCAAGAATG 360 TACCTAATGA TGCACCTTTA GGTCTTACTG TCTTTGATAA ACAAGTTGTT TTGTATAAAG 8793 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TACCTAATGA TGCACCTTTA GGTCTTACTG TCTTTGATAA ACAAGTTGTT TTGTATAAAG 420 ATGGAAGTGG TGAACTTAGA TGCTTTGAAG ATAGATGTCC ACATAG 8839 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ATGGAAGTGG TGAACTTAGA TGCTTTGAAG ATAGATGTCC ACATAG 466 hqPGS_C12HBa0093P12.1-1+_SGN-E341757+ (8374 8839) ******************************************************************************** EST sequence 1 +strand 755 n (File: SGN-E263538+) 1 CTCCTTTTTC CTCCCAAAAA TCACTCACAA TAATGGCTTT ACAACAACCA TTTCTATCTG 61 AATTACTACT CCACTCATCT TCAATTAGAA CAACACTCCC CAAAAGTTCA CTTTTTTCTC 121 TACAAATACC TCAAAGATTC AATCTTTATC TTCAAAATAA GCAAATCAAG AAACAAGGAA 181 CAAGTTGTTA TGCAATTGCT GAGAATCTTG AAGCTGAGGA TCAAAGTTTG ATTTTGGAAA 241 ATTCAAGTTT AGATGAGTTG AGGGGACAAA GGGAAATTGT TGGTTATGAT TGGACTGAAG 301 AATGGTATCC TTTGTATTTA ACCAAGAATG TACCTAATGA TGCACCTTTA GGTCTTACTG 361 TCTTTGATAA ACAAGTTGTT TTGTATAAAG ATGGAAGTGG TGAACTTAGA TGCTTTGAAG 421 ATAGATGTCC ACATAGATTG GCCAAACTCT CCGAAGGGCA ACTGTATGAT GGCAAACTGG 481 AATGCTTGTA TCACGGTTGG CAGTTTGACG GAGATGGTAA ATGTGTCAAG ATACCACAGC 541 TACCTGAAAA TGCTAAAATT CCTCGATCAG CTTGTACGAA GACATATGAA ATTCGGGATT 601 CCCAAGGAGT AGTCTGGATA TGGATGTCTC ATGGAACACC ACCTAATATT AACAAAATCC 661 CCTGGTTTGA AATTTTGAGA GGAAAAGATT TCGAGACATT TCTACTATTC ACGAGCTCCC 721 TTATGATCAC TCTATTCTTC TCGAAAACCT TATGG Predicted gene structure (within gDNA segment 7804 to 9803): Exon 1 8404 8839 ( 436 n); cDNA 1 436 ( 436 n); score: 1.000 MATCH C12HBa0093P12.1-1+ SGN-E263538+ 1.000 436 0.577 C PGS_C12HBa0093P12.1-1+_SGN-E263538+ (8404 8839) Alignment (genomic DNA sequence = upper lines): CTCCTTTTTC CTCCCAAAAA TCACTCACAA TAATGGCTTT ACAACAACCA TTTCTATCTG 8463 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCCTTTTTC CTCCCAAAAA TCACTCACAA TAATGGCTTT ACAACAACCA TTTCTATCTG 60 AATTACTACT CCACTCATCT TCAATTAGAA CAACACTCCC CAAAAGTTCA CTTTTTTCTC 8523 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTACTACT CCACTCATCT TCAATTAGAA CAACACTCCC CAAAAGTTCA CTTTTTTCTC 120 TACAAATACC TCAAAGATTC AATCTTTATC TTCAAAATAA GCAAATCAAG AAACAAGGAA 8583 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TACAAATACC TCAAAGATTC AATCTTTATC TTCAAAATAA GCAAATCAAG AAACAAGGAA 180 CAAGTTGTTA TGCAATTGCT GAGAATCTTG AAGCTGAGGA TCAAAGTTTG ATTTTGGAAA 8643 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAGTTGTTA TGCAATTGCT GAGAATCTTG AAGCTGAGGA TCAAAGTTTG ATTTTGGAAA 240 ATTCAAGTTT AGATGAGTTG AGGGGACAAA GGGAAATTGT TGGTTATGAT TGGACTGAAG 8703 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTCAAGTTT AGATGAGTTG AGGGGACAAA GGGAAATTGT TGGTTATGAT TGGACTGAAG 300 AATGGTATCC TTTGTATTTA ACCAAGAATG TACCTAATGA TGCACCTTTA GGTCTTACTG 8763 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATGGTATCC TTTGTATTTA ACCAAGAATG TACCTAATGA TGCACCTTTA GGTCTTACTG 360 TCTTTGATAA ACAAGTTGTT TTGTATAAAG ATGGAAGTGG TGAACTTAGA TGCTTTGAAG 8823 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTTTGATAA ACAAGTTGTT TTGTATAAAG ATGGAAGTGG TGAACTTAGA TGCTTTGAAG 420 ATAGATGTCC ACATAG 8839 |||||||||| |||||| ATAGATGTCC ACATAG 436 hqPGS_C12HBa0093P12.1-1+_SGN-E263538+ (8404 8839) ******************************************************************************** EST sequence 4 +strand 731 n (File: SGN-E343651+) 1 ATGGCTTTAC AACAACCATT TCTATCTGAA TTACTACTCC ACTCATCTTC AATTAGAACA 61 ACACTCCCCA AAAGTTCACT TTTTTCTCTA CAAATACCTC AAAGATTCAA TCTTTATCTT 121 CAAAATAAGC AAATCAAGAA ACAAGGAACA AGTTGTTATG CAATTGCTGA GAATCTTGAA 181 GCTGAGGATC AAAGTTTGAT TTTGGAAAAT TCAAGTTTAG ATGAGTTGAG GGGACAAAGG 241 GAAATTGTTG GTTATGATTG GACTGAAGAA TGGTATCCTT TGTATTTAAC CAAGAATGTA 301 CCTAATGATG CACCTTTAGG TCTTACTGTC TTTGATAAAC AAGTTGTTTT GTATAAAGAT 361 GGAAGTGGTG AACTTAGATG CTTTGAAGAT AGATGTCCAC ATAGGTAAAA AGAACATACT 421 ATATATGTTT GTGTTTTCAT TGTCTCAGTT CAGTGTACAA AGCATCTCGA TATAAATAGT 481 ACTAATTAAA CGTATAGGTC ACACAGAGAT AACTTTGTTT GTTCAAGAAC TTAGTATTTT 541 AGCATGATCT TGTTATGCAT AGACATTTCT TTTTTACTTG TTCCTTATGT AGTTAAGTGA 601 TAGCTAATAC ATTATGTAGT TGCTTAATTT CTCTTGATTA GCTAAAATCT ATGTTTCGTG 661 TAAGGGACAA GTACAAGAAG GGAAGTAGGA AGAATCGATT TGTAGCAACT ACGAATTGAG 721 TTACTAGTCG T Predicted gene structure (within gDNA segment 7836 to 9776): Exon 1 8436 9166 ( 731 n); cDNA 1 731 ( 731 n); score: 0.999 MATCH C12HBa0093P12.1-1+ SGN-E343651+ 0.999 731 1.000 C PGS_C12HBa0093P12.1-1+_SGN-E343651+ (8436 9166) Alignment (genomic DNA sequence = upper lines): ATGGCTTTAC AACAACCATT TCTATCTGAA TTACTACTCC ACTCATCTTC AATTAGAACA 8495 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGCTTTAC AACAACCATT TCTATCTGAA TTACTACTCC ACTCATCTTC AATTAGAACA 60 ACACTCCCCA AAAGTTCACT TTTTTCTCTA CAAATACCTC AAAGATTCAA TCTTTATCTT 8555 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACACTCCCCA AAAGTTCACT TTTTTCTCTA CAAATACCTC AAAGATTCAA TCTTTATCTT 120 CAAAATAAGC AAATCAAGAA ACAAGGAACA AGTTGTTATG CAATTGCTGA GAATCTTGAA 8615 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAAATAAGC AAATCAAGAA ACAAGGAACA AGTTGTTATG CAATTGCTGA GAATCTTGAA 180 GCTGAGGATC AAAGTTTGAT TTTGGAAAAT TCAAGTTTAG ATGAGTTGAG GGGACAAAGG 8675 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTGAGGATC AAAGTTTGAT TTTGGAAAAT TCAAGTTTAG ATGAGTTGAG GGGACAAAGG 240 GAAATTGTTG GTTATGATTG GACTGAAGAA TGGTATCCTT TGTATTTAAC CAAGAATGTA 8735 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAATTGTTG GTTATGATTG GACTGAAGAA TGGTATCCTT TGTATTTAAC CAAGAATGTA 300 CCTAATGATG CACCTTTAGG TCTTACTGTC TTTGATAAAC AAGTTGTTTT GTATAAAGAT 8795 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTAATGATG CACCTTTAGG TCTTACTGTC TTTGATAAAC AAGTTGTTTT GTATAAAGAT 360 GGAAGTGGTG AACTTAGATG CTTTGAAGAT AGATGTCCAC ATAGGTAAAA AGAACATACT 8855 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGAAGTGGTG AACTTAGATG CTTTGAAGAT AGATGTCCAC ATAGGTAAAA AGAACATACT 420 ATATATGTTT GTGTTTTCAT TGTCTCAGTT CAGTGTACAA AGCATCTCGA TATAAATAGT 8915 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATATATGTTT GTGTTTTCAT TGTCTCAGTT CAGTGTACAA AGCATCTCGA TATAAATAGT 480 ACTAATTAAA CGTATAGGTC ACACAGAGAT AACTTTGTTT GTTCAAGAAC TTAGTATTTT 8975 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTAATTAAA CGTATAGGTC ACACAGAGAT AACTTTGTTT GTTCAAGAAC TTAGTATTTT 540 AGCATGATCT TGTTATGCAT AGACATTTCT TTTTTACTTG TTCCTTATGT AGTTAAGTGA 9035 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGCATGATCT TGTTATGCAT AGACATTTCT TTTTTACTTG TTCCTTATGT AGTTAAGTGA 600 TAGCTAATAC ATTATGTAGT TGCTTAATTT CTCTTGATTA GCTAAAATCT ATGTTTCGTG 9095 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGCTAATAC ATTATGTAGT TGCTTAATTT CTCTTGATTA GCTAAAATCT ATGTTTCGTG 660 TAAGGGACAA GTACAAGAAG GGAAGTAGGA AGAATCGATT TGTAGCAACT ACGATTTGAG 9155 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| ||||| TAAGGGACAA GTACAAGAAG GGAAGTAGGA AGAATCGATT TGTAGCAACT ACGAATTGAG 720 TTACTAGTCG T 9166 |||||||||| | TTACTAGTCG T 731 hqPGS_C12HBa0093P12.1-1+_SGN-E343651+ (8436 9166) ******************************************************************************** EST sequence 5 +strand 567 n (File: SGN-E249417+) 1 ATGGCTTTAC AACAACCATT TCTATCTGAA TTACTACTCC ACTCATCTTC AATTAGAACA 61 ACACTCCCCA AAAGTTCACT TTTTTCTCTA CAAATACCTC AAAGATTCAA TCTTTATCTT 121 CAAAATAAGC AAATCAAGAA ACAAGGAACA AGTTGTTATG CAATTGCTGA GAATCTTGAA 181 GCTGAGGATC AAAGTTTGAT TTTGGAAAAT TCAAGTTTAG ATGAGTTGAG GGGACAAAGG 241 GAAATTGTTG GTTATGATTG GACTGAAGAA TGGTATCCTT TGTATTTAAC CAAGAATGTA 301 CCTAATGATG CACCTTTAGG TCTTACTGTC TTTGATAAAC AAGTTGTTTT GTATAAAGAT 361 GGAAGTGGTG AACTTAGATG CTTTGAAGAT AGATGTCCAC ATAGATTGGC CAAACTCTCC 421 GAAGGGCAAC TGTATGATGG CAAACTGGAA TGCTTGTATC ACGGTTGGCA GTTTGACGGA 481 GATGGTAAAT GTGTCAAGAT ACCACAGCTA CCTGAAAATG CTAAAATTCC TCGATCAGCT 541 TGTACGAAGA CATATGAAAT TCGGGAT Predicted gene structure (within gDNA segment 7836 to 9803): Exon 1 8436 8839 ( 404 n); cDNA 1 404 ( 404 n); score: 1.000 MATCH C12HBa0093P12.1-1+ SGN-E249417+ 1.000 404 0.713 C PGS_C12HBa0093P12.1-1+_SGN-E249417+ (8436 8839) Alignment (genomic DNA sequence = upper lines): ATGGCTTTAC AACAACCATT TCTATCTGAA TTACTACTCC ACTCATCTTC AATTAGAACA 8495 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGGCTTTAC AACAACCATT TCTATCTGAA TTACTACTCC ACTCATCTTC AATTAGAACA 60 ACACTCCCCA AAAGTTCACT TTTTTCTCTA CAAATACCTC AAAGATTCAA TCTTTATCTT 8555 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACACTCCCCA AAAGTTCACT TTTTTCTCTA CAAATACCTC AAAGATTCAA TCTTTATCTT 120 CAAAATAAGC AAATCAAGAA ACAAGGAACA AGTTGTTATG CAATTGCTGA GAATCTTGAA 8615 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAAATAAGC AAATCAAGAA ACAAGGAACA AGTTGTTATG CAATTGCTGA GAATCTTGAA 180 GCTGAGGATC AAAGTTTGAT TTTGGAAAAT TCAAGTTTAG ATGAGTTGAG GGGACAAAGG 8675 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTGAGGATC AAAGTTTGAT TTTGGAAAAT TCAAGTTTAG ATGAGTTGAG GGGACAAAGG 240 GAAATTGTTG GTTATGATTG GACTGAAGAA TGGTATCCTT TGTATTTAAC CAAGAATGTA 8735 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAATTGTTG GTTATGATTG GACTGAAGAA TGGTATCCTT TGTATTTAAC CAAGAATGTA 300 CCTAATGATG CACCTTTAGG TCTTACTGTC TTTGATAAAC AAGTTGTTTT GTATAAAGAT 8795 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTAATGATG CACCTTTAGG TCTTACTGTC TTTGATAAAC AAGTTGTTTT GTATAAAGAT 360 GGAAGTGGTG AACTTAGATG CTTTGAAGAT AGATGTCCAC ATAG 8839 |||||||||| |||||||||| |||||||||| |||||||||| |||| GGAAGTGGTG AACTTAGATG CTTTGAAGAT AGATGTCCAC ATAG 404 hqPGS_C12HBa0093P12.1-1+_SGN-E249417+ (8436 8839) ******************************************************************************** EST sequence 2 +strand 435 n (File: SGN-E214195+) 1 TAATGATGCA CCTTTAGGTC TTACTGTCTT TGATAAACAA GTTGTTTTGT ATAAAGATGG 61 AAGTGGTGAA CTTAGATGCT TTGAAGATAG ATGTCCACAT AGATTGGCCA AACTCTCCGA 121 AGGGCAACTG TATGATGGCA AACTGGAATG CTTGTATCAC GGTTGGCAGT TTGACGGAGA 181 TGGTAAATGT GTCAAGATAC CACAGCTACC TGAAAATGCT AAAATTCCTC GATCAGCTTG 241 TACGAAGACA TATGAAATTC GGGATTCCCA AGGAGTAGTC TGGATATGGA TGTCTCATGG 301 AACACCACCT AATATTAACA AAATCCCCTG GTTTGAAAAT TTTGAGAGGA AAGGATTTCG 361 AGACATTTCT ACTATTCACG AGCTCCCTTA TGATCACTCT ATTCTTCTCC AAAACCTTAT 421 GGATCCTGCT CATGT Predicted gene structure (within gDNA segment 8138 to 9803): Exon 1 8738 8839 ( 102 n); cDNA 1 102 ( 102 n); score: 1.000 MATCH C12HBa0093P12.1-1+ SGN-E214195+ 1.000 102 0.234 C PGS_C12HBa0093P12.1-1+_SGN-E214195+ (8738 8839) Alignment (genomic DNA sequence = upper lines): TAATGATGCA CCTTTAGGTC TTACTGTCTT TGATAAACAA GTTGTTTTGT ATAAAGATGG 8797 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAATGATGCA CCTTTAGGTC TTACTGTCTT TGATAAACAA GTTGTTTTGT ATAAAGATGG 60 AAGTGGTGAA CTTAGATGCT TTGAAGATAG ATGTCCACAT AG 8839 |||||||||| |||||||||| |||||||||| |||||||||| || AAGTGGTGAA CTTAGATGCT TTGAAGATAG ATGTCCACAT AG 102 hqPGS_C12HBa0093P12.1-1+_SGN-E214195+ (8738 8839) Total number of EST alignments reported: 7 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 9803: PGL 1 (+ strand): 209 517 AGS-1 (209 517) SCR (e 0.851) Exon 1 209 517 ( 309 n); score: 0.851 PGS (209 517) SGN-E342303- PGS (209 498) SGN-E376658- 3-phase translation of AGS-1 (+strand): . . . . . . 209 AAAGTGTTTGCAAAATGAAATTTTTTTAAGTTAAGAGTTAATTTTTTTTTGTTAATTGGG K V F A K - N F F K L R V N F F L L I G K C L Q N E I F L S - E L I F F C - L G S V C K M K F F - V K S - F F F V N W . . . . . . 269 TCTTAACAGCATTTTTTTGTTATTTTTTCCAATTGATCTTTCCCCCCATTTCCCCCCCCC S - Q H F F V I F S N - S F P P F P P P L N S I F L L F F P I D L S P H F P P P V L T A F F C Y F F Q L I F P P I S P P . . . . . . 329 CACCTTTTTATGTTGTTTGTTAGTCCCCATTTTTCTTTCTTGTTGTTCTTATGATCTTAG H L F M L F V S P H F S F L L F L - S - T F L C C L L V P I F L S C C S Y D L S P P F Y V V C - S P F F F L V V L M I L . . . . . . 389 CTTTTATGCTCTTAAAATGTTAGGAAATGGTAACTTGCTCATTATCTTTAATTGTGTGGT L L C S - N V R K W - L A H Y L - L C G F Y A L K M L G N G N L L I I F N C V V A F M L L K C - E M V T C S L S L I V W . . . . . . 449 TTAATTTAGAAAGAAAATATCAAAGTAGATTCACTTGTTTTTTTACATAATGATTAAATA L I - K E N I K V D S L V F L H N D - I - F R K K I S K - I H L F F Y I M I K Y F N L E R K Y Q S R F T C F F T - - L N . 509 CATTTTTTT H F F I F T F F Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 517 AAAAAAATGTATTTAATCATTATGTAAAAAAACAAGTGAATCTACTTTGATATTTTCTTT K K M Y L I I M - K N K - I Y F D I F F K K C I - S L C K K T S E S T L I F S F K N V F N H Y V K K Q V N L L - Y F L . . . . . . 457 CTAAATTAAACCACACAATTAAAGATAATGAGCAAGTTACCATTTCCTAACATTTTAAGA L N - T T Q L K I M S K L P F P N I L R - I K P H N - R - - A S Y H F L T F - E S K L N H T I K D N E Q V T I S - H F K . . . . . . 397 GCATAAAAGCTAAGATCATAAGAACAACAAGAAAGAAAAATGGGGACTAACAAACAACAT A - K L R S - E Q Q E R K M G T N K Q H H K S - D H K N N K K E K W G L T N N I S I K A K I I R T T R K K N G D - Q T T . . . . . . 337 AAAAAGGTGGGGGGGGGGAAATGGGGGGAAAGATCAATTGGAAAAAATAACAAAAAAATG K K V G G G K W G E R S I G K N N K K M K R W G G G N G G K D Q L E K I T K K C - K G G G G E M G G K I N W K K - Q K N . . . . . . 277 CTGTTAAGACCCAATTAACAAAAAAAAATTAACTCTTAACTTAAAAAAATTTCATTTTGC L L R P N - Q K K I N S - L K K I S F C C - D P I N K K K L T L N L K K F H F A A V K T Q L T K K N - L L T - K N F I L . 217 AAACACTTT K H F N T Q T L Maximal non-overlapping open reading frames (>= 64 codons): none PGL 2 (+ strand): 8374 9166 AGS-1 (8374 9166) SCR (e 0.999) Exon 1 8374 9166 ( 793 n); score: 0.999 PGS (8374 8839) SGN-E341757+ PGS (8404 8839) SGN-E263538+ PGS (8436 9166) SGN-E343651+ PGS (8436 8839) SGN-E249417+ PGS (8738 8839) SGN-E214195+ 3-phase translation of AGS-1 (+strand): . . . . . . 8374 AAAAACTACAAGTGTCTCTAATGCATCCAACTCCTTTTTCCTCCCAAAAATCACTCACAA K N Y K C L - C I Q L L F P P K N H S Q K T T S V S N A S N S F F L P K I T H N K L Q V S L M H P T P F S S Q K S L T . . . . . . 8434 TAATGGCTTTACAACAACCATTTCTATCTGAATTACTACTCCACTCATCTTCAATTAGAA - W L Y N N H F Y L N Y Y S T H L Q L E N G F T T T I S I - I T T P L I F N - N I M A L Q Q P F L S E L L L H S S S I R . . . . . . 8494 CAACACTCCCCAAAAGTTCACTTTTTTCTCTACAAATACCTCAAAGATTCAATCTTTATC Q H S P K V H F F L Y K Y L K D S I F I N T P Q K F T F F S T N T S K I Q S L S T T L P K S S L F S L Q I P Q R F N L Y . . . . . . 8554 TTCAAAATAAGCAAATCAAGAAACAAGGAACAAGTTGTTATGCAATTGCTGAGAATCTTG F K I S K S R N K E Q V V M Q L L R I L S K - A N Q E T R N K L L C N C - E S - L Q N K Q I K K Q G T S C Y A I A E N L . . . . . . 8614 AAGCTGAGGATCAAAGTTTGATTTTGGAAAATTCAAGTTTAGATGAGTTGAGGGGACAAA K L R I K V - F W K I Q V - M S - G D K S - G S K F D F G K F K F R - V E G T K E A E D Q S L I L E N S S L D E L R G Q . . . . . . 8674 GGGAAATTGTTGGTTATGATTGGACTGAAGAATGGTATCCTTTGTATTTAACCAAGAATG G K L L V M I G L K N G I L C I - P R M G N C W L - L D - R M V S F V F N Q E C R E I V G Y D W T E E W Y P L Y L T K N . . . . . . 8734 TACCTAATGATGCACCTTTAGGTCTTACTGTCTTTGATAAACAAGTTGTTTTGTATAAAG Y L M M H L - V L L S L I N K L F C I K T - - C T F R S Y C L - - T S C F V - R V P N D A P L G L T V F D K Q V V L Y K . . . . . . 8794 ATGGAAGTGGTGAACTTAGATGCTTTGAAGATAGATGTCCACATAGGTAAAAAGAACATA M E V V N L D A L K I D V H I G K K N I W K W - T - M L - R - M S T - V K R T Y D G S G E L R C F E D R C P H R - K E H . . . . . . 8854 CTATATATGTTTGTGTTTTCATTGTCTCAGTTCAGTGTACAAAGCATCTCGATATAAATA L Y M F V F S L S Q F S V Q S I S I - I Y I C L C F H C L S S V Y K A S R Y K - T I Y V C V F I V S V Q C T K H L D I N . . . . . . 8914 GTACTAATTAAACGTATAGGTCACACAGAGATAACTTTGTTTGTTCAAGAACTTAGTATT V L I K R I G H T E I T L F V Q E L S I Y - L N V - V T Q R - L C L F K N L V F S T N - T Y R S H R D N F V C S R T - Y . . . . . . 8974 TTAGCATGATCTTGTTATGCATAGACATTTCTTTTTTACTTGTTCCTTATGTAGTTAAGT L A - S C Y A - T F L F Y L F L M - L S - H D L V M H R H F F F T C S L C S - V F S M I L L C I D I S F L L V P Y V V K . . . . . . 9034 GATAGCTAATACATTATGTAGTTGCTTAATTTCTCTTGATTAGCTAAAATCTATGTTTCG D S - Y I M - L L N F S - L A K I Y V S I A N T L C S C L I S L D - L K S M F R - - L I H Y V V A - F L L I S - N L C F . . . . . . 9094 TGTAAGGGACAAGTACAAGAAGGGAAGTAGGAAGAATCGATTTGTAGCAACTACGATTTG C K G Q V Q E G K - E E S I C S N Y D L V R D K Y K K G S R K N R F V A T T I - V - G T S T R R E V G R I D L - Q L R F . . 9154 AGTTACTAGTCGT S Y - S V T S R E L L V Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-1+_PGL-2_AGS-1_PPS_1 (8376 8843) (frame '0'; 465 bp, 155 residues) 1 KLQVSLMHPT PFSSQKSLTI MALQQPFLSE LLLHSSSIRT TLPKSSLFSL QIPQRFNLYL 61 QNKQIKKQGT SCYAIAENLE AEDQSLILEN SSLDELRGQR EIVGYDWTEE WYPLYLTKNV 121 PNDAPLGLTV FDKQVVLYKD GSGELRCFED RCPHR- 3-phase translation of AGS-1 (-strand): . . . . . . 9166 ACGACTAGTAACTCAAATCGTAGTTGCTACAAATCGATTCTTCCTACTTCCCTTCTTGTA T T S N S N R S C Y K S I L P T S L L V R L V T Q I V V A T N R F F L L P F L Y D - - L K S - L L Q I D S S Y F P S C . . . . . . 9106 CTTGTCCCTTACACGAAACATAGATTTTAGCTAATCAAGAGAAATTAAGCAACTACATAA L V P Y T K H R F - L I K R N - A T T - L S L T R N I D F S - S R E I K Q L H N T C P L H E T - I L A N Q E K L S N Y I . . . . . . 9046 TGTATTAGCTATCACTTAACTACATAAGGAACAAGTAAAAAAGAAATGTCTATGCATAAC C I S Y H L T T - G T S K K E M S M H N V L A I T - L H K E Q V K K K C L C I T M Y - L S L N Y I R N K - K R N V Y A - . . . . . . 8986 AAGATCATGCTAAAATACTAAGTTCTTGAACAAACAAAGTTATCTCTGTGTGACCTATAC K I M L K Y - V L E Q T K L S L C D L Y R S C - N T K F L N K Q S Y L C V T Y T Q D H A K I L S S - T N K V I S V - P I . . . . . . 8926 GTTTAATTAGTACTATTTATATCGAGATGCTTTGTACACTGAACTGAGACAATGAAAACA V - L V L F I S R C F V H - T E T M K T F N - Y Y L Y R D A L Y T E L R Q - K H R L I S T I Y I E M L C T L N - D N E N . . . . . . 8866 CAAACATATATAGTATGTTCTTTTTACCTATGTGGACATCTATCTTCAAAGCATCTAAGT Q T Y I V C S F Y L C G H L S S K H L S K H I - Y V L F T Y V D I Y L Q S I - V T N I Y S M F F L P M W T S I F K A S K . . . . . . 8806 TCACCACTTCCATCTTTATACAAAACAACTTGTTTATCAAAGACAGTAAGACCTAAAGGT S P L P S L Y K T T C L S K T V R P K G H H F H L Y T K Q L V Y Q R Q - D L K V F T T S I F I Q N N L F I K D S K T - R . . . . . . 8746 GCATCATTAGGTACATTCTTGGTTAAATACAAAGGATACCATTCTTCAGTCCAATCATAA A S L G T F L V K Y K G Y H S S V Q S - H H - V H S W L N T K D T I L Q S N H N C I I R Y I L G - I Q R I P F F S P I I . . . . . . 8686 CCAACAATTTCCCTTTGTCCCCTCAACTCATCTAAACTTGAATTTTCCAAAATCAAACTT P T I S L C P L N S S K L E F S K I K L Q Q F P F V P S T H L N L N F P K S N F T N N F P L S P Q L I - T - I F Q N Q T . . . . . . 8626 TGATCCTCAGCTTCAAGATTCTCAGCAATTGCATAACAACTTGTTCCTTGTTTCTTGATT - S S A S R F S A I A - Q L V P C F L I D P Q L Q D S Q Q L H N N L F L V S - F L I L S F K I L S N C I T T C S L F L D . . . . . . 8566 TGCTTATTTTGAAGATAAAGATTGAATCTTTGAGGTATTTGTAGAGAAAAAAGTGAACTT C L F - R - R L N L - G I C R E K S E L A Y F E D K D - I F E V F V E K K V N F L L I L K I K I E S L R Y L - R K K - T . . . . . . 8506 TTGGGGAGTGTTGTTCTAATTGAAGATGAGTGGAGTAGTAATTCAGATAGAAATGGTTGT L G S V V L I E D E W S S N S D R N G C W G V L F - L K M S G V V I Q I E M V V F G E C C S N - R - V E - - F R - K W L . . . . . . 8446 TGTAAAGCCATTATTGTGAGTGATTTTTGGGAGGAAAAAGGAGTTGGATGCATTAGAGAC C K A I I V S D F W E E K G V G C I R D V K P L L - V I F G R K K E L D A L E T L - S H Y C E - F L G G K R S W M H - R . . 8386 ACTTGTAGTTTTT T C S F L V V F H L - F Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-1-_PGL-2_AGS-1_PPS_1 (8884 8687) (frame '1'; 195 bp, 65 residues) 1 TETMKTQTYI VCSFYLCGHL SSKHLSSPLP SLYKTTCLSK TVRPKGASLG TFLVKYKGYH 61 SSVQS- ... finished at: Thu Jul 27 14:05:24 2006 ________________________________________________________________________________ Sequence 2: C12HBa0093P12.1-2, from 1 to 17133, both strands analyzed. ... started at: Thu Jul 27 14:05:24 2006 EST library file: /tmp/cxgn-bacpublish-resources-NsUNzK/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 3 EST library file: /tmp/cxgn-bacpublish-resources-NsUNzK/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 16 ******************************************************************************** EST sequence 12 +strand 469 n (File: SGN-E243027+) 1 AAAACAAGCT TTTTAAGACT TTCTACGCGA AAAGATGAAT GAGAGGGCCC TTTTAGCATT 61 GAAAAAGACG AAAGTGCACG AAGAACTATT AAAGCATGTC GATGTTTGGC TCATAAATGT 121 TGAGCAGCAA TTGGCCGATA TTGAACTGAC AAGCAAGCAA AAGGCTGTCT TTGAGAGTTA 181 GAAAACTGGG AATAATGCAA TCAAACCAAT ACACGGTGAG ATCAATCTCA ACGATGTGCA 241 AAAGTTAATG GATGATACTG CAAAGGCAAA AGCTTATCAA GACGAAGTCA ATGCTATTCT 301 TGGAGAGAAG CTATCTGCTG AAGATGAAGA GGAAGTTCTA GCAGAATTTG AGAATCTGGA 361 ATCTCATTTA ACACTTCAAG ATCTGCCAGA AAATCCTTCT GCAATACCTT CTGGTGAAAA 421 CGATGAAGAG AAACTGGATC TACCTGACGT ACCAACTAAA GCACCAGTC Predicted gene structure (within gDNA segment 1382 to 1): Exon 1 422 293 ( 130 n); cDNA 1 130 ( 130 n); score: 0.915 Intron 1 292 146 ( 147 n); Pd: 0.997 (s: 0.96), Pa: 0.991 (s: 0.96) Exon 2 145 1 ( 145 n); cDNA 131 275 ( 145 n); score: 0.938 MATCH C12HBa0093P12.1-2- SGN-E243027+ 0.927 275 0.586 C PGS_C12HBa0093P12.1-2-_SGN-E243027+ (422 293,145 1) Alignment (genomic DNA sequence = upper lines): AAAACAAGCT GCAAAAGACT TGCTACGCGA AAAGAAGAAA GAGAGGGCCC TTTTAGCATT 363 |||||||||| |||||| | |||||||| ||||| ||| |||||||||| |||||||||| AAAACAAGCT TTTTAAGACT TTCTACGCGA AAAGATGAAT GAGAGGGCCC TTTTAGCATT 60 GAAAAAGAAG AAAGTGCAAG AAGAACTATT AAAGCAAGTT GATGTTTGGC TCATAAATGT 303 |||||||| | |||||||| | |||||||||| |||||| || |||||||||| |||||||||| GAAAAAGACG AAAGTGCACG AAGAACTATT AAAGCATGTC GATGTTTGGC TCATAAATGT 120 TGAGCAGCAA GTAAGTAATT CTTCTTAATT CAGTTCCATA CTTATGCTAC CATGCTAAAT 243 |||||||||| TGAGCAGCAA .......... .......... .......... .......... .......... 130 TTCCTTATTT CTCTTAAGCA TATGCTACCA TGCTAAATTT CCTTATTTCT CTTAAGCATT 183 .......... .......... .......... .......... .......... .......... 130 TAAATTGTTT TCTGATTCTA TTTGCAAATT CTTTTAGTTG GCAGATATTG AACTGACAAG 123 ||| || ||||||| |||||||||| .......... .......... .......... .......TTG GCCGATATTG AACTGACAAG 153 CAAGCAAAAG GCTGTCTTTG AGAGTTTGAA AACTGGGAAT AATGCAATCA AAGCAATACA 63 |||||||||| |||||||||| |||||| ||| |||||||||| |||||||||| || ||||||| CAAGCAAAAG GCTGTCTTTG AGAGTTAGAA AACTGGGAAT AATGCAATCA AACCAATACA 213 AGGTGAGATC AATCTAGAGG ATGTTCAAAA GTTAATGGAT GATACTGCAG AGGCAAAAGC 3 ||||||||| ||||| | | |||| ||||| |||||||||| ||||||||| |||||||||| CGGTGAGATC AATCTCAACG ATGTGCAAAA GTTAATGGAT GATACTGCAA AGGCAAAAGC 273 TT 1 || TT 275 hqPGS_C12HBa0093P12.1-2-_SGN-E243027+ (422 293,145 1) ******************************************************************************** EST sequence 1 -strand 725 n (File: SGN-E395667-) 1 AATGTTGAGC AGCAATTGGC AGATATTGAA CTGACAAGCA AGCAAAAGGC TGTCTTTGAG 61 AGTTTGAAAA CTGGGAATAA TGCAATCAAA GCAATACAAG GTGAGATCAA TCTAGAGGAT 121 GTTCAAAAGT TAATGGATGA TACTGCAGAG GCAAAAGCTT ATCAAGACGA AGTCAATGCT 181 ATTCTTGGAG AGAAGCTATC TGCTGAAGAT GAAGAGGAAG TTCTAGCAGA ATTTGAGAAT 241 CTGGAATCTC AGTTAACACT TCAAGATCTG CCAGAAGTTC CTTCTGCAAT ACCTTCTGGT 301 GAAAACGTTG AAGAGAAACT GGATCTACCT GACGTACCAA CTAAAGCACC AGTCGTTTCA 361 GAAGCTGTTA TCGAGGACAC TCAAGATACT TCAACTGCCG TGTCTGTGCA AAAGAAAGTT 421 TTGGAGGAAC CAATACCTGC TTGATCCCAA CGCAACGTGG TTTATTCATA AAGTTCGAAT 481 GGAATTTATC AAAACGAGTA TCGAGTTTCA GAATGCTATA CAAGTCTTCT CGATGCTAAG 541 AATTTGTTAT CTTTCGACGT TTCTGTCTCA TCTCCTGTAA TTTGTGCAGA TTGTGGTTGT 601 AATTCTGGGG ACCAGATTTG TACAGTGCTT TCATACTCAT ATCCTACTAG TTTCTTCTAA 661 TTCTGCTACT ACTTTTGTTG CATTATGCAA AAAAGAAAGC TTACTGTTCT TGTTAAAAAA 721 AAAAA Predicted gene structure (within gDNA segment 905 to 1): Exon 1 307 293 ( 15 n); cDNA 1 15 ( 15 n); score: 1.000 Intron 1 292 146 ( 147 n); Pd: 0.997 (s: 0), Pa: 0.991 (s: 1.00) Exon 2 145 1 ( 145 n); cDNA 16 160 ( 145 n); score: 1.000 PPA cDNA 715 725 MATCH C12HBa0093P12.1-2- SGN-E395667- 1.000 160 0.221 C PGS_C12HBa0093P12.1-2-_SGN-E395667- (307 293,145 1) Alignment (genomic DNA sequence = upper lines): AATGTTGAGC AGCAAGTAAG TAATTCTTCT TAATTCAGTT CCATACTTAT GCTACCATGC 248 |||||||||| ||||| AATGTTGAGC AGCAA..... .......... .......... .......... .......... 15 TAAATTTCCT TATTTCTCTT AAGCATATGC TACCATGCTA AATTTCCTTA TTTCTCTTAA 188 .......... .......... .......... .......... .......... .......... 15 GCATTTAAAT TGTTTTCTGA TTCTATTTGC AAATTCTTTT AGTTGGCAGA TATTGAACTG 128 |||||||| |||||||||| .......... .......... .......... .......... ..TTGGCAGA TATTGAACTG 33 ACAAGCAAGC AAAAGGCTGT CTTTGAGAGT TTGAAAACTG GGAATAATGC AATCAAAGCA 68 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACAAGCAAGC AAAAGGCTGT CTTTGAGAGT TTGAAAACTG GGAATAATGC AATCAAAGCA 93 ATACAAGGTG AGATCAATCT AGAGGATGTT CAAAAGTTAA TGGATGATAC TGCAGAGGCA 8 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATACAAGGTG AGATCAATCT AGAGGATGTT CAAAAGTTAA TGGATGATAC TGCAGAGGCA 153 AAAGCTT 1 ||||||| AAAGCTT 160 hqPGS_C12HBa0093P12.1-2-_SGN-E395667- (307 293,145 1) ******************************************************************************** EST sequence 5 +strand 582 n (File: SGN-E254519+) 1 TTGAGCAGCA ATTGGCAGAT ATTGAACTGA CAAGCAAGCA AAAGGCTGTC TTTGAGAGTT 61 TGAAAACTGG GAATAATGCA ATCAAAGCAA TACAAGGTGA GATCAATCTA GAGGATGTTC 121 AAAAGTTAAT GGATGATACT GCAGAGGCAA AAGCTTATCA AGACGAAGTC AATGCTATTC 181 TTGGAGAGAA GCTATCTGCT GAAGATGAAG AGGAAGTTCT AGCAGAATTT GAGAATCTGG 241 AATCTCAGTT AACACTTCAA GATCTGCCAG AAGTTCCTTC TGCAATACCT TCTGGTGAAA 301 ACGTTGAAGA GAAACTGGAT CTACCTGACG TACCAACTAA AGCACCAGTC GTTTCAGAAG 361 CTGTTATCGA GGACACTCAA GATACTTCAA CTGCCGTGTC TGTGCAAAAG AAAGTTTTGG 421 AGGAACCAAT ACCTGCTTGA TCCCAACGCA ACGTGGTTTA TTCATAAAGT TCGAATGGAA 481 TTTATCAAAA CGAGTATCGA GTTTCAGAAT GCTATACAAG TCTTCTCGAT GCTAAGAATT 541 TGTTATCTTT CGACGTTTCT GTCTCATCTC CTGTAATTTG TG Predicted gene structure (within gDNA segment 855 to 1): Exon 1 303 293 ( 11 n); cDNA 1 11 ( 11 n); score: 1.000 Intron 1 292 146 ( 147 n); Pd: 0.997 (s: 0), Pa: 0.991 (s: 1.00) Exon 2 145 1 ( 145 n); cDNA 12 156 ( 145 n); score: 1.000 MATCH C12HBa0093P12.1-2- SGN-E254519+ 1.000 156 0.268 C PGS_C12HBa0093P12.1-2-_SGN-E254519+ (303 293,145 1) Alignment (genomic DNA sequence = upper lines): TTGAGCAGCA AGTAAGTAAT TCTTCTTAAT TCAGTTCCAT ACTTATGCTA CCATGCTAAA 244 |||||||||| | TTGAGCAGCA A......... .......... .......... .......... .......... 11 TTTCCTTATT TCTCTTAAGC ATATGCTACC ATGCTAAATT TCCTTATTTC TCTTAAGCAT 184 .......... .......... .......... .......... .......... .......... 11 TTAAATTGTT TTCTGATTCT ATTTGCAAAT TCTTTTAGTT GGCAGATATT GAACTGACAA 124 || |||||||||| |||||||||| .......... .......... .......... ........TT GGCAGATATT GAACTGACAA 33 GCAAGCAAAA GGCTGTCTTT GAGAGTTTGA AAACTGGGAA TAATGCAATC AAAGCAATAC 64 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAAGCAAAA GGCTGTCTTT GAGAGTTTGA AAACTGGGAA TAATGCAATC AAAGCAATAC 93 AAGGTGAGAT CAATCTAGAG GATGTTCAAA AGTTAATGGA TGATACTGCA GAGGCAAAAG 4 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGGTGAGAT CAATCTAGAG GATGTTCAAA AGTTAATGGA TGATACTGCA GAGGCAAAAG 153 CTT 1 ||| CTT 156 hqPGS_C12HBa0093P12.1-2-_SGN-E254519+ (303 293,145 1) ******************************************************************************** EST sequence 10 +strand 619 n (File: SGN-E395668+) 1 AAAAGATCAA AAAAGTTAGA ACTCAAAGAA CAAGTAACAT TGATTTCTAC ACAATAATTT 61 GTCTCTTTTT GTCTATTTGA TCATCGCTGA GGACTCACCG ATCCATTAAT TTCGTATTTT 121 CATATTGTTT TTTTCAGCTT ATAAGGAGAT CAATTACGAA TAATCTAAAG GGTCTTTGCA 181 TATAGAGATG CAAAAGACCT CATTTTTGTT GTAAATTTTT TATTTTTTTG TTGTGGGGTT 241 GTTCAATTTC GTATTTTTTT TTTGTGGGTT TTGAATTTGG GAGATGGGAA ATATATTTGT 301 GAAGAAACCG AAGATCACCG AAGTTGATAG AGCGATTTTG TCTTTGAAGA CTCAAAGGCG 361 TAAGCTTGCT CAATATCAGC AACAGCTGGA TGCTGTTATT GAAGCCGAAA AACAAGCTGC 421 AAAAGACTTG CTACGCGAAA AGAAGAAAGA GAGGGCCCTT TTAGCATTGA AAAAGAAGAA 481 AGTGCAAGAA GAACTATTAA AGCAAGTTGA TGTTTGGCTC ATAAATGTTG AGCAGCAATT 541 GGCAGATATT GAACTGACAA GCAAGCAAAA GGCTGTCTTT GAGAGTTTGA AAACTGGGAA 601 TAATGCAATC AAAGCAATA Predicted gene structure (within gDNA segment 3022 to 1): Exon 1 2422 2038 ( 385 n); cDNA 1 385 ( 385 n); score: 1.000 Intron 1 2037 446 (1592 n); Pd: 0.984 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 2 445 293 ( 153 n); cDNA 386 538 ( 153 n); score: 1.000 Intron 2 292 146 ( 147 n); Pd: 0.997 (s: 1.00), Pa: 0.991 (s: 1.00) Exon 3 145 65 ( 81 n); cDNA 539 619 ( 81 n); score: 1.000 MATCH C12HBa0093P12.1-2- SGN-E395668+ 1.000 619 1.000 C PGS_C12HBa0093P12.1-2-_SGN-E395668+ (2422 2038,445 293,145 65) Alignment (genomic DNA sequence = upper lines): AAAAGATCAA AAAAGTTAGA ACTCAAAGAA CAAGTAACAT TGATTTCTAC ACAATAATTT 2363 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAGATCAA AAAAGTTAGA ACTCAAAGAA CAAGTAACAT TGATTTCTAC ACAATAATTT 60 GTCTCTTTTT GTCTATTTGA TCATCGCTGA GGACTCACCG ATCCATTAAT TTCGTATTTT 2303 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCTCTTTTT GTCTATTTGA TCATCGCTGA GGACTCACCG ATCCATTAAT TTCGTATTTT 120 CATATTGTTT TTTTCAGCTT ATAAGGAGAT CAATTACGAA TAATCTAAAG GGTCTTTGCA 2243 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATATTGTTT TTTTCAGCTT ATAAGGAGAT CAATTACGAA TAATCTAAAG GGTCTTTGCA 180 TATAGAGATG CAAAAGACCT CATTTTTGTT GTAAATTTTT TATTTTTTTG TTGTGGGGTT 2183 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATAGAGATG CAAAAGACCT CATTTTTGTT GTAAATTTTT TATTTTTTTG TTGTGGGGTT 240 GTTCAATTTC GTATTTTTTT TTTGTGGGTT TTGAATTTGG GAGATGGGAA ATATATTTGT 2123 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTCAATTTC GTATTTTTTT TTTGTGGGTT TTGAATTTGG GAGATGGGAA ATATATTTGT 300 GAAGAAACCG AAGATCACCG AAGTTGATAG AGCGATTTTG TCTTTGAAGA CTCAAAGGCG 2063 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGAAACCG AAGATCACCG AAGTTGATAG AGCGATTTTG TCTTTGAAGA CTCAAAGGCG 360 TAAGCTTGCT CAATATCAGC AACAGGTATA AGGAATTTTA TTTTTGCCCC TTTTTGAAAA 2003 |||||||||| |||||||||| ||||| TAAGCTTGCT CAATATCAGC AACAG..... .......... .......... .......... 385 TTTTGACTGC AGTTCTCTTT TTGGTGGGCT AAACTAAAAT TGCTTAGTTT AGTTAAGATG 1943 .......... .......... .......... .......... .......... .......... 385 GATCATGGGT AGTTTTTGGA TTTTTTTTTC TGGGATTGGT TATTGTGGAG GATTATTAGG 1883 .......... .......... .......... .......... .......... .......... 385 TGAAATTATG AAATTTGACA GCAAGATGGC TTTTTGGGTG TTTATTGCAA TGCACAATTA 1823 .......... .......... .......... .......... .......... .......... 385 TAGCTCAACA ATGATGTTAA TACTAGATTT ATAATGTTGA GATTGAGTTG TAGAACTATG 1763 .......... .......... .......... .......... .......... .......... 385 TGTTTTTTCT TGAGTCTTTT AGCAATAACT ATGTTTAGGA AGAATTAGTT CTCTATGTTG 1703 .......... .......... .......... .......... .......... .......... 385 GTATTTGAGC CGAGGGTCAA CCTCCCTACC TCCAAGGTTT CCTACCTCCA AGGTTGCTGG 1643 .......... .......... .......... .......... .......... .......... 385 TAGGGGTAAG GTCTTCGTAG ACTTTACCTT CACAGACTCT ACTTTGTGGG ATTACACGGG 1583 .......... .......... .......... .......... .......... .......... 385 TGTGTTGTTG TTGTTATTAT ATGGCATGCG ATGAGTTTAA GTGGAGTAAG ATGATTAGTG 1523 .......... .......... .......... .......... .......... .......... 385 AGGGTTATAT AGCGGATCCC AACTTGTTTT TGTACTGAGG CATTGTTGTT ATTGGTTCTT 1463 .......... .......... .......... .......... .......... .......... 385 CCTTGACCAT AACGTGATGC TTAGAGGAGC TGTGTATCAA AGTCTCTATA TTTCTGAATG 1403 .......... .......... .......... .......... .......... .......... 385 AATCGACCAG CTCTTCCAAT TTGAGAAACA ATAGCAACTG AACTTGGCAG TACCTAGAAT 1343 .......... .......... .......... .......... .......... .......... 385 ATTTGTAAAT TTTACAATAC ACATTACATA TCCGTCATCT CAGTGGTCAT TATGTAGGGG 1283 .......... .......... .......... .......... .......... .......... 385 AGGAAGAGAT GTCGACAGCT TGTTCTTACT TTCAACACAT AAAAGATCTA CTAGACGGTG 1223 .......... .......... .......... .......... .......... .......... 385 GTTCTGATTT GAAAAAAAAA AATGTACGTT TCATACAAAG GTTTTCATTT TCTGTAGAAG 1163 .......... .......... .......... .......... .......... .......... 385 TTTGAGTAAA TACTGCTTCA TATGATGCTG ATGATAAGAA AAAGTGAGTC TTTGATGGAG 1103 .......... .......... .......... .......... .......... .......... 385 CTTTTGTCTT CTGTAATTGC CATCCAAGGA AACAATAGTC AACTCTATTC CTGGATGAAA 1043 .......... .......... .......... .......... .......... .......... 385 GAAATGACCT AATAGAGAGC TTTCGTTTCC TTTCTCCGAG TCAGAAAAGA GAATCTGTGA 983 .......... .......... .......... .......... .......... .......... 385 ATTGTCTAAA TCTTCAAAAC CATATAGCCT AGATATAAGT TGACCTACCA ATTCTACTTC 923 .......... .......... .......... .......... .......... .......... 385 TGAGCAATTA CCTTCTATAC AAGACCTCAA GTCTCAGCAT CAACCATTAC TTGAAATAAT 863 .......... .......... .......... .......... .......... .......... 385 AGCACTTTGT TGCTTATATC ATATTTTCTG TTAAAAATGA ACCAAAAAAA AAAAGAAGGG 803 .......... .......... .......... .......... .......... .......... 385 GCATTCTCAG TTTCATTGCA ACTGGACACG CAGCTAATTT TAATAGTTCG GGGAAGTTCT 743 .......... .......... .......... .......... .......... .......... 385 ACAGGAGAGT GACCCATACC TTGATTATGC AGTTAGTCAG AATGCTAACA ATCCCAACAA 683 .......... .......... .......... .......... .......... .......... 385 AATTGAAAAG TAATAAAAGA AAGAAAAACC CAAAATTGGG TAGAAGTTGG TTTTACATGA 623 .......... .......... .......... .......... .......... .......... 385 AGTAGGGGAA GTTTTTTGGA ATTCTATTAA ACGGTGCAAC TGAGATTGTC CACAATAAGA 563 .......... .......... .......... .......... .......... .......... 385 ACTTAAAGGT CCTCTCCTCT TTTTTTGGTG CCAAGGGCTC CACGTAGGAC TGATTTGCAA 503 .......... .......... .......... .......... .......... .......... 385 GCTTTCCTCA ATGTTGGTTT CAGACCTGCT AATCCCTTCT GTTATTCTTT GTTTCAGCTG 443 ||| .......... .......... .......... .......... .......... .......CTG 388 GATGCTGTTA TTGAAGCCGA AAAACAAGCT GCAAAAGACT TGCTACGCGA AAAGAAGAAA 383 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATGCTGTTA TTGAAGCCGA AAAACAAGCT GCAAAAGACT TGCTACGCGA AAAGAAGAAA 448 GAGAGGGCCC TTTTAGCATT GAAAAAGAAG AAAGTGCAAG AAGAACTATT AAAGCAAGTT 323 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAGAGGGCCC TTTTAGCATT GAAAAAGAAG AAAGTGCAAG AAGAACTATT AAAGCAAGTT 508 GATGTTTGGC TCATAAATGT TGAGCAGCAA GTAAGTAATT CTTCTTAATT CAGTTCCATA 263 |||||||||| |||||||||| |||||||||| GATGTTTGGC TCATAAATGT TGAGCAGCAA .......... .......... .......... 538 CTTATGCTAC CATGCTAAAT TTCCTTATTT CTCTTAAGCA TATGCTACCA TGCTAAATTT 203 .......... .......... .......... .......... .......... .......... 538 CCTTATTTCT CTTAAGCATT TAAATTGTTT TCTGATTCTA TTTGCAAATT CTTTTAGTTG 143 ||| .......... .......... .......... .......... .......... .......TTG 541 GCAGATATTG AACTGACAAG CAAGCAAAAG GCTGTCTTTG AGAGTTTGAA AACTGGGAAT 83 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCAGATATTG AACTGACAAG CAAGCAAAAG GCTGTCTTTG AGAGTTTGAA AACTGGGAAT 601 AATGCAATCA AAGCAATA 65 |||||||||| |||||||| AATGCAATCA AAGCAATA 619 hqPGS_C12HBa0093P12.1-2-_SGN-E395668+ (2422 2038,445 293,145 65) ******************************************************************************** EST sequence 6 +strand 617 n (File: SGN-E318076+) 1 AAAGATCAAA AAAGTTAGAA CTCAAAGAAC AAGTAACATT GATTTCTACA CAATAATTTG 61 TCTCTTTTTG TCTATTTGAT CATCGCTGAG GACTCACCGA TCCATTAATT TCGTATTTTC 121 ATATTGTTTT TTTCAGCTTA TAAGGAGATC AATTACGAAT AATCTAAAGG GTCTTTGCAT 181 ATAGAGATGC AAAAGACCTC ATTTTTGTTG TAAATTTTTT ATTTTTTTGT TGTGGGGTTG 241 TTCAATTTCG TATTTTTTTT TTGTGGGTTT TGAATTTGGG AGATGGGAAA TATATTTGTG 301 AAGAAACCGA AGATCACCGA AGTTGATAGA GCGATTTTGT CTTTGAAGAC TCAAAGGCGT 361 AAGCTTGCTC AATATCAACA ACAGCTGGAT GCTGTTATTG AAGCCGAAAA ACAAGCTGCA 421 AAAGACTTGC TACGCGAAAA GAAGAAAGAG AGGGCCCTTT TAGCATTGAA AAAGAAGAAA 481 GTGCAAGAAG AACTATTAAA GCAAGTTGAT GTTTGGCTCA TAAATGTTGA GCAGCAATTG 541 GCAGATATTG AACTGACAAG CAAGCAAAAT GCTGTCTTTG AGAGTTTGAA TACTGGGAAT 601 AATGCAATCA AAGCAAT Predicted gene structure (within gDNA segment 3021 to 1): Exon 1 2421 2038 ( 384 n); cDNA 1 384 ( 384 n); score: 0.997 Intron 1 2037 446 (1592 n); Pd: 0.984 (s: 0.98), Pa: 0.997 (s: 1.00) Exon 2 445 293 ( 153 n); cDNA 385 537 ( 153 n); score: 1.000 Intron 2 292 146 ( 147 n); Pd: 0.997 (s: 1.00), Pa: 0.991 (s: 0.98) Exon 3 145 66 ( 80 n); cDNA 538 617 ( 80 n); score: 0.975 MATCH C12HBa0093P12.1-2- SGN-E318076+ 0.995 617 1.000 C PGS_C12HBa0093P12.1-2-_SGN-E318076+ (2421 2038,445 293,145 66) Alignment (genomic DNA sequence = upper lines): AAAGATCAAA AAAGTTAGAA CTCAAAGAAC AAGTAACATT GATTTCTACA CAATAATTTG 2362 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGATCAAA AAAGTTAGAA CTCAAAGAAC AAGTAACATT GATTTCTACA CAATAATTTG 60 TCTCTTTTTG TCTATTTGAT CATCGCTGAG GACTCACCGA TCCATTAATT TCGTATTTTC 2302 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCTTTTTG TCTATTTGAT CATCGCTGAG GACTCACCGA TCCATTAATT TCGTATTTTC 120 ATATTGTTTT TTTCAGCTTA TAAGGAGATC AATTACGAAT AATCTAAAGG GTCTTTGCAT 2242 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATATTGTTTT TTTCAGCTTA TAAGGAGATC AATTACGAAT AATCTAAAGG GTCTTTGCAT 180 ATAGAGATGC AAAAGACCTC ATTTTTGTTG TAAATTTTTT ATTTTTTTGT TGTGGGGTTG 2182 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATAGAGATGC AAAAGACCTC ATTTTTGTTG TAAATTTTTT ATTTTTTTGT TGTGGGGTTG 240 TTCAATTTCG TATTTTTTTT TTGTGGGTTT TGAATTTGGG AGATGGGAAA TATATTTGTG 2122 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAATTTCG TATTTTTTTT TTGTGGGTTT TGAATTTGGG AGATGGGAAA TATATTTGTG 300 AAGAAACCGA AGATCACCGA AGTTGATAGA GCGATTTTGT CTTTGAAGAC TCAAAGGCGT 2062 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAAACCGA AGATCACCGA AGTTGATAGA GCGATTTTGT CTTTGAAGAC TCAAAGGCGT 360 AAGCTTGCTC AATATCAGCA ACAGGTATAA GGAATTTTAT TTTTGCCCCT TTTTGAAAAT 2002 |||||||||| ||||||| || |||| AAGCTTGCTC AATATCAACA ACAG...... .......... .......... .......... 384 TTTGACTGCA GTTCTCTTTT TGGTGGGCTA AACTAAAATT GCTTAGTTTA GTTAAGATGG 1942 .......... .......... .......... .......... .......... .......... 384 ATCATGGGTA GTTTTTGGAT TTTTTTTTCT GGGATTGGTT ATTGTGGAGG ATTATTAGGT 1882 .......... .......... .......... .......... .......... .......... 384 GAAATTATGA AATTTGACAG CAAGATGGCT TTTTGGGTGT TTATTGCAAT GCACAATTAT 1822 .......... .......... .......... .......... .......... .......... 384 AGCTCAACAA TGATGTTAAT ACTAGATTTA TAATGTTGAG ATTGAGTTGT AGAACTATGT 1762 .......... .......... .......... .......... .......... .......... 384 GTTTTTTCTT GAGTCTTTTA GCAATAACTA TGTTTAGGAA GAATTAGTTC TCTATGTTGG 1702 .......... .......... .......... .......... .......... .......... 384 TATTTGAGCC GAGGGTCAAC CTCCCTACCT CCAAGGTTTC CTACCTCCAA GGTTGCTGGT 1642 .......... .......... .......... .......... .......... .......... 384 AGGGGTAAGG TCTTCGTAGA CTTTACCTTC ACAGACTCTA CTTTGTGGGA TTACACGGGT 1582 .......... .......... .......... .......... .......... .......... 384 GTGTTGTTGT TGTTATTATA TGGCATGCGA TGAGTTTAAG TGGAGTAAGA TGATTAGTGA 1522 .......... .......... .......... .......... .......... .......... 384 GGGTTATATA GCGGATCCCA ACTTGTTTTT GTACTGAGGC ATTGTTGTTA TTGGTTCTTC 1462 .......... .......... .......... .......... .......... .......... 384 CTTGACCATA ACGTGATGCT TAGAGGAGCT GTGTATCAAA GTCTCTATAT TTCTGAATGA 1402 .......... .......... .......... .......... .......... .......... 384 ATCGACCAGC TCTTCCAATT TGAGAAACAA TAGCAACTGA ACTTGGCAGT ACCTAGAATA 1342 .......... .......... .......... .......... .......... .......... 384 TTTGTAAATT TTACAATACA CATTACATAT CCGTCATCTC AGTGGTCATT ATGTAGGGGA 1282 .......... .......... .......... .......... .......... .......... 384 GGAAGAGATG TCGACAGCTT GTTCTTACTT TCAACACATA AAAGATCTAC TAGACGGTGG 1222 .......... .......... .......... .......... .......... .......... 384 TTCTGATTTG AAAAAAAAAA ATGTACGTTT CATACAAAGG TTTTCATTTT CTGTAGAAGT 1162 .......... .......... .......... .......... .......... .......... 384 TTGAGTAAAT ACTGCTTCAT ATGATGCTGA TGATAAGAAA AAGTGAGTCT TTGATGGAGC 1102 .......... .......... .......... .......... .......... .......... 384 TTTTGTCTTC TGTAATTGCC ATCCAAGGAA ACAATAGTCA ACTCTATTCC TGGATGAAAG 1042 .......... .......... .......... .......... .......... .......... 384 AAATGACCTA ATAGAGAGCT TTCGTTTCCT TTCTCCGAGT CAGAAAAGAG AATCTGTGAA 982 .......... .......... .......... .......... .......... .......... 384 TTGTCTAAAT CTTCAAAACC ATATAGCCTA GATATAAGTT GACCTACCAA TTCTACTTCT 922 .......... .......... .......... .......... .......... .......... 384 GAGCAATTAC CTTCTATACA AGACCTCAAG TCTCAGCATC AACCATTACT TGAAATAATA 862 .......... .......... .......... .......... .......... .......... 384 GCACTTTGTT GCTTATATCA TATTTTCTGT TAAAAATGAA CCAAAAAAAA AAAGAAGGGG 802 .......... .......... .......... .......... .......... .......... 384 CATTCTCAGT TTCATTGCAA CTGGACACGC AGCTAATTTT AATAGTTCGG GGAAGTTCTA 742 .......... .......... .......... .......... .......... .......... 384 CAGGAGAGTG ACCCATACCT TGATTATGCA GTTAGTCAGA ATGCTAACAA TCCCAACAAA 682 .......... .......... .......... .......... .......... .......... 384 ATTGAAAAGT AATAAAAGAA AGAAAAACCC AAAATTGGGT AGAAGTTGGT TTTACATGAA 622 .......... .......... .......... .......... .......... .......... 384 GTAGGGGAAG TTTTTTGGAA TTCTATTAAA CGGTGCAACT GAGATTGTCC ACAATAAGAA 562 .......... .......... .......... .......... .......... .......... 384 CTTAAAGGTC CTCTCCTCTT TTTTTGGTGC CAAGGGCTCC ACGTAGGACT GATTTGCAAG 502 .......... .......... .......... .......... .......... .......... 384 CTTTCCTCAA TGTTGGTTTC AGACCTGCTA ATCCCTTCTG TTATTCTTTG TTTCAGCTGG 442 |||| .......... .......... .......... .......... .......... ......CTGG 388 ATGCTGTTAT TGAAGCCGAA AAACAAGCTG CAAAAGACTT GCTACGCGAA AAGAAGAAAG 382 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGCTGTTAT TGAAGCCGAA AAACAAGCTG CAAAAGACTT GCTACGCGAA AAGAAGAAAG 448 AGAGGGCCCT TTTAGCATTG AAAAAGAAGA AAGTGCAAGA AGAACTATTA AAGCAAGTTG 322 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAGGGCCCT TTTAGCATTG AAAAAGAAGA AAGTGCAAGA AGAACTATTA AAGCAAGTTG 508 ATGTTTGGCT CATAAATGTT GAGCAGCAAG TAAGTAATTC TTCTTAATTC AGTTCCATAC 262 |||||||||| |||||||||| ||||||||| ATGTTTGGCT CATAAATGTT GAGCAGCAA. .......... .......... .......... 537 TTATGCTACC ATGCTAAATT TCCTTATTTC TCTTAAGCAT ATGCTACCAT GCTAAATTTC 202 .......... .......... .......... .......... .......... .......... 537 CTTATTTCTC TTAAGCATTT AAATTGTTTT CTGATTCTAT TTGCAAATTC TTTTAGTTGG 142 |||| .......... .......... .......... .......... .......... ......TTGG 541 CAGATATTGA ACTGACAAGC AAGCAAAAGG CTGTCTTTGA GAGTTTGAAA ACTGGGAATA 82 |||||||||| |||||||||| |||||||| | |||||||||| ||||||||| |||||||||| CAGATATTGA ACTGACAAGC AAGCAAAATG CTGTCTTTGA GAGTTTGAAT ACTGGGAATA 601 ATGCAATCAA AGCAAT 66 |||||||||| |||||| ATGCAATCAA AGCAAT 617 hqPGS_C12HBa0093P12.1-2-_SGN-E318076+ (2421 2038,445 293,145 66) ******************************************************************************** EST sequence 7 +strand 609 n (File: SGN-E306854+) 1 ATCAAAAAAG TTAGAACTCA AAGAACAAGT AACATTGATT GCTACACAAT AATTTGTCTC 61 TTTTTGTCTA TTTGATCATC GCTGAGGACT CACCGATCCA TTAATTTCGT ATTTTCATAT 121 TGTTTTTTTC AGCTTATAAG GAGATCAATT ACGAATAATC TAAAGGGTCT TTGCATATAG 181 AGATGCAAAA GACCTCATTT TTGTTGTAAA TTTTTTATTT TTTTGTTGTG GGGTTGTTCA 241 ATTTCGTATT TTTTTTTTGT GGGTTTTGAA TTTGGGAGAT GGGAAATATA TTTGTGAAGA 301 AACCGAAGAT CACCGAAGTT GATAGAGCGA TTTTGTCTTT GAAGACTCAA AGGCGTAAGC 361 TTGCTCAATA TCAGCAACAG CTGGATGCTG TTATTGAAGC CGAAAAACAA GCTGCAAAAG 421 ACTTGCTACG CGAAAAGAAG AAAGAGAGGG CCCTTTTAGC ATTGAAAAAG AAGAAAGTGC 481 AAGAAGAACT ATTAAAGCAA GTTGATGTTT GGCTCATAAA TGTTGAGCAG CAATTGGCAG 541 ATATTGAACT GACAAGCAAG CAAAAGGCTG TCTTTGAGAG TTTGAAAACT GGGAATAATG 601 CAATCAAAG Predicted gene structure (within gDNA segment 3017 to 1): Exon 1 2417 2038 ( 380 n); cDNA 1 380 ( 380 n); score: 0.997 Intron 1 2037 446 (1592 n); Pd: 0.984 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 2 445 293 ( 153 n); cDNA 381 533 ( 153 n); score: 1.000 Intron 2 292 146 ( 147 n); Pd: 0.997 (s: 1.00), Pa: 0.991 (s: 1.00) Exon 3 145 70 ( 76 n); cDNA 534 609 ( 76 n); score: 1.000 MATCH C12HBa0093P12.1-2- SGN-E306854+ 0.998 609 1.000 C PGS_C12HBa0093P12.1-2-_SGN-E306854+ (2417 2038,445 293,145 70) Alignment (genomic DNA sequence = upper lines): ATCAAAAAAG TTAGAACTCA AAGAACAAGT AACATTGATT TCTACACAAT AATTTGTCTC 2358 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| |||||||||| ATCAAAAAAG TTAGAACTCA AAGAACAAGT AACATTGATT GCTACACAAT AATTTGTCTC 60 TTTTTGTCTA TTTGATCATC GCTGAGGACT CACCGATCCA TTAATTTCGT ATTTTCATAT 2298 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTGTCTA TTTGATCATC GCTGAGGACT CACCGATCCA TTAATTTCGT ATTTTCATAT 120 TGTTTTTTTC AGCTTATAAG GAGATCAATT ACGAATAATC TAAAGGGTCT TTGCATATAG 2238 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTTTTTTC AGCTTATAAG GAGATCAATT ACGAATAATC TAAAGGGTCT TTGCATATAG 180 AGATGCAAAA GACCTCATTT TTGTTGTAAA TTTTTTATTT TTTTGTTGTG GGGTTGTTCA 2178 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGATGCAAAA GACCTCATTT TTGTTGTAAA TTTTTTATTT TTTTGTTGTG GGGTTGTTCA 240 ATTTCGTATT TTTTTTTTGT GGGTTTTGAA TTTGGGAGAT GGGAAATATA TTTGTGAAGA 2118 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTCGTATT TTTTTTTTGT GGGTTTTGAA TTTGGGAGAT GGGAAATATA TTTGTGAAGA 300 AACCGAAGAT CACCGAAGTT GATAGAGCGA TTTTGTCTTT GAAGACTCAA AGGCGTAAGC 2058 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACCGAAGAT CACCGAAGTT GATAGAGCGA TTTTGTCTTT GAAGACTCAA AGGCGTAAGC 360 TTGCTCAATA TCAGCAACAG GTATAAGGAA TTTTATTTTT GCCCCTTTTT GAAAATTTTG 1998 |||||||||| |||||||||| TTGCTCAATA TCAGCAACAG .......... .......... .......... .......... 380 ACTGCAGTTC TCTTTTTGGT GGGCTAAACT AAAATTGCTT AGTTTAGTTA AGATGGATCA 1938 .......... .......... .......... .......... .......... .......... 380 TGGGTAGTTT TTGGATTTTT TTTTCTGGGA TTGGTTATTG TGGAGGATTA TTAGGTGAAA 1878 .......... .......... .......... .......... .......... .......... 380 TTATGAAATT TGACAGCAAG ATGGCTTTTT GGGTGTTTAT TGCAATGCAC AATTATAGCT 1818 .......... .......... .......... .......... .......... .......... 380 CAACAATGAT GTTAATACTA GATTTATAAT GTTGAGATTG AGTTGTAGAA CTATGTGTTT 1758 .......... .......... .......... .......... .......... .......... 380 TTTCTTGAGT CTTTTAGCAA TAACTATGTT TAGGAAGAAT TAGTTCTCTA TGTTGGTATT 1698 .......... .......... .......... .......... .......... .......... 380 TGAGCCGAGG GTCAACCTCC CTACCTCCAA GGTTTCCTAC CTCCAAGGTT GCTGGTAGGG 1638 .......... .......... .......... .......... .......... .......... 380 GTAAGGTCTT CGTAGACTTT ACCTTCACAG ACTCTACTTT GTGGGATTAC ACGGGTGTGT 1578 .......... .......... .......... .......... .......... .......... 380 TGTTGTTGTT ATTATATGGC ATGCGATGAG TTTAAGTGGA GTAAGATGAT TAGTGAGGGT 1518 .......... .......... .......... .......... .......... .......... 380 TATATAGCGG ATCCCAACTT GTTTTTGTAC TGAGGCATTG TTGTTATTGG TTCTTCCTTG 1458 .......... .......... .......... .......... .......... .......... 380 ACCATAACGT GATGCTTAGA GGAGCTGTGT ATCAAAGTCT CTATATTTCT GAATGAATCG 1398 .......... .......... .......... .......... .......... .......... 380 ACCAGCTCTT CCAATTTGAG AAACAATAGC AACTGAACTT GGCAGTACCT AGAATATTTG 1338 .......... .......... .......... .......... .......... .......... 380 TAAATTTTAC AATACACATT ACATATCCGT CATCTCAGTG GTCATTATGT AGGGGAGGAA 1278 .......... .......... .......... .......... .......... .......... 380 GAGATGTCGA CAGCTTGTTC TTACTTTCAA CACATAAAAG ATCTACTAGA CGGTGGTTCT 1218 .......... .......... .......... .......... .......... .......... 380 GATTTGAAAA AAAAAAATGT ACGTTTCATA CAAAGGTTTT CATTTTCTGT AGAAGTTTGA 1158 .......... .......... .......... .......... .......... .......... 380 GTAAATACTG CTTCATATGA TGCTGATGAT AAGAAAAAGT GAGTCTTTGA TGGAGCTTTT 1098 .......... .......... .......... .......... .......... .......... 380 GTCTTCTGTA ATTGCCATCC AAGGAAACAA TAGTCAACTC TATTCCTGGA TGAAAGAAAT 1038 .......... .......... .......... .......... .......... .......... 380 GACCTAATAG AGAGCTTTCG TTTCCTTTCT CCGAGTCAGA AAAGAGAATC TGTGAATTGT 978 .......... .......... .......... .......... .......... .......... 380 CTAAATCTTC AAAACCATAT AGCCTAGATA TAAGTTGACC TACCAATTCT ACTTCTGAGC 918 .......... .......... .......... .......... .......... .......... 380 AATTACCTTC TATACAAGAC CTCAAGTCTC AGCATCAACC ATTACTTGAA ATAATAGCAC 858 .......... .......... .......... .......... .......... .......... 380 TTTGTTGCTT ATATCATATT TTCTGTTAAA AATGAACCAA AAAAAAAAAG AAGGGGCATT 798 .......... .......... .......... .......... .......... .......... 380 CTCAGTTTCA TTGCAACTGG ACACGCAGCT AATTTTAATA GTTCGGGGAA GTTCTACAGG 738 .......... .......... .......... .......... .......... .......... 380 AGAGTGACCC ATACCTTGAT TATGCAGTTA GTCAGAATGC TAACAATCCC AACAAAATTG 678 .......... .......... .......... .......... .......... .......... 380 AAAAGTAATA AAAGAAAGAA AAACCCAAAA TTGGGTAGAA GTTGGTTTTA CATGAAGTAG 618 .......... .......... .......... .......... .......... .......... 380 GGGAAGTTTT TTGGAATTCT ATTAAACGGT GCAACTGAGA TTGTCCACAA TAAGAACTTA 558 .......... .......... .......... .......... .......... .......... 380 AAGGTCCTCT CCTCTTTTTT TGGTGCCAAG GGCTCCACGT AGGACTGATT TGCAAGCTTT 498 .......... .......... .......... .......... .......... .......... 380 CCTCAATGTT GGTTTCAGAC CTGCTAATCC CTTCTGTTAT TCTTTGTTTC AGCTGGATGC 438 |||||||| .......... .......... .......... .......... .......... ..CTGGATGC 388 TGTTATTGAA GCCGAAAAAC AAGCTGCAAA AGACTTGCTA CGCGAAAAGA AGAAAGAGAG 378 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTATTGAA GCCGAAAAAC AAGCTGCAAA AGACTTGCTA CGCGAAAAGA AGAAAGAGAG 448 GGCCCTTTTA GCATTGAAAA AGAAGAAAGT GCAAGAAGAA CTATTAAAGC AAGTTGATGT 318 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGCCCTTTTA GCATTGAAAA AGAAGAAAGT GCAAGAAGAA CTATTAAAGC AAGTTGATGT 508 TTGGCTCATA AATGTTGAGC AGCAAGTAAG TAATTCTTCT TAATTCAGTT CCATACTTAT 258 |||||||||| |||||||||| ||||| TTGGCTCATA AATGTTGAGC AGCAA..... .......... .......... .......... 533 GCTACCATGC TAAATTTCCT TATTTCTCTT AAGCATATGC TACCATGCTA AATTTCCTTA 198 .......... .......... .......... .......... .......... .......... 533 TTTCTCTTAA GCATTTAAAT TGTTTTCTGA TTCTATTTGC AAATTCTTTT AGTTGGCAGA 138 |||||||| .......... .......... .......... .......... .......... ..TTGGCAGA 541 TATTGAACTG ACAAGCAAGC AAAAGGCTGT CTTTGAGAGT TTGAAAACTG GGAATAATGC 78 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTGAACTG ACAAGCAAGC AAAAGGCTGT CTTTGAGAGT TTGAAAACTG GGAATAATGC 601 AATCAAAG 70 |||||||| AATCAAAG 609 hqPGS_C12HBa0093P12.1-2-_SGN-E306854+ (2417 2038,445 293,145 70) ******************************************************************************** EST sequence 11 +strand 421 n (File: SGN-E321151+) 1 TTTGCATATA GAGATGCAAA AGACCTCATT TTTGTTGTAA ATTTTTTATT TTTTTGTTGT 61 GGGGTTGTTC AATTTCGTAT TTTTTTTTTG TGGGTTTTGA ATTTGGGAGA TGGGAAATAT 121 ATTTGTGAAG AAACCGAAGA TCACCGAAGT TGATAGAGCG ATTTTGTCTT TGAAGACTCA 181 AAGGCGTAAG CTTGCTCAAT ATCAGCAACA GCTGGATGCT GTTATTGAAG CCGAAAAACA 241 AGCTGCAAAA GACTTGCTAC GCGAAAAGAA GAAAGAGAGG GCCCTTTTAG CATTGAAAAA 301 GAAGAAAGTG CAAGAAGAAC TATTAAAGCA AGTTGATGTT TGGCTCATAA ATGTTGAGCA 361 GCAATTGGCA GATATTGAAC TGACAAGCAA GCAAAAGGCT GTCTTTGAGA GTTTGAAAAC 421 T Predicted gene structure (within gDNA segment 2848 to 1): Exon 1 2248 2038 ( 211 n); cDNA 1 211 ( 211 n); score: 1.000 Intron 1 2037 446 (1592 n); Pd: 0.984 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 2 445 293 ( 153 n); cDNA 212 364 ( 153 n); score: 1.000 Intron 2 292 146 ( 147 n); Pd: 0.997 (s: 1.00), Pa: 0.991 (s: 1.00) Exon 3 145 89 ( 57 n); cDNA 365 421 ( 57 n); score: 1.000 MATCH C12HBa0093P12.1-2- SGN-E321151+ 1.000 421 1.000 C PGS_C12HBa0093P12.1-2-_SGN-E321151+ (2248 2038,445 293,145 89) Alignment (genomic DNA sequence = upper lines): TTTGCATATA GAGATGCAAA AGACCTCATT TTTGTTGTAA ATTTTTTATT TTTTTGTTGT 2189 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTGCATATA GAGATGCAAA AGACCTCATT TTTGTTGTAA ATTTTTTATT TTTTTGTTGT 60 GGGGTTGTTC AATTTCGTAT TTTTTTTTTG TGGGTTTTGA ATTTGGGAGA TGGGAAATAT 2129 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGGGTTGTTC AATTTCGTAT TTTTTTTTTG TGGGTTTTGA ATTTGGGAGA TGGGAAATAT 120 ATTTGTGAAG AAACCGAAGA TCACCGAAGT TGATAGAGCG ATTTTGTCTT TGAAGACTCA 2069 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGTGAAG AAACCGAAGA TCACCGAAGT TGATAGAGCG ATTTTGTCTT TGAAGACTCA 180 AAGGCGTAAG CTTGCTCAAT ATCAGCAACA GGTATAAGGA ATTTTATTTT TGCCCCTTTT 2009 |||||||||| |||||||||| |||||||||| | AAGGCGTAAG CTTGCTCAAT ATCAGCAACA G......... .......... .......... 211 TGAAAATTTT GACTGCAGTT CTCTTTTTGG TGGGCTAAAC TAAAATTGCT TAGTTTAGTT 1949 .......... .......... .......... .......... .......... .......... 211 AAGATGGATC ATGGGTAGTT TTTGGATTTT TTTTTCTGGG ATTGGTTATT GTGGAGGATT 1889 .......... .......... .......... .......... .......... .......... 211 ATTAGGTGAA ATTATGAAAT TTGACAGCAA GATGGCTTTT TGGGTGTTTA TTGCAATGCA 1829 .......... .......... .......... .......... .......... .......... 211 CAATTATAGC TCAACAATGA TGTTAATACT AGATTTATAA TGTTGAGATT GAGTTGTAGA 1769 .......... .......... .......... .......... .......... .......... 211 ACTATGTGTT TTTTCTTGAG TCTTTTAGCA ATAACTATGT TTAGGAAGAA TTAGTTCTCT 1709 .......... .......... .......... .......... .......... .......... 211 ATGTTGGTAT TTGAGCCGAG GGTCAACCTC CCTACCTCCA AGGTTTCCTA CCTCCAAGGT 1649 .......... .......... .......... .......... .......... .......... 211 TGCTGGTAGG GGTAAGGTCT TCGTAGACTT TACCTTCACA GACTCTACTT TGTGGGATTA 1589 .......... .......... .......... .......... .......... .......... 211 CACGGGTGTG TTGTTGTTGT TATTATATGG CATGCGATGA GTTTAAGTGG AGTAAGATGA 1529 .......... .......... .......... .......... .......... .......... 211 TTAGTGAGGG TTATATAGCG GATCCCAACT TGTTTTTGTA CTGAGGCATT GTTGTTATTG 1469 .......... .......... .......... .......... .......... .......... 211 GTTCTTCCTT GACCATAACG TGATGCTTAG AGGAGCTGTG TATCAAAGTC TCTATATTTC 1409 .......... .......... .......... .......... .......... .......... 211 TGAATGAATC GACCAGCTCT TCCAATTTGA GAAACAATAG CAACTGAACT TGGCAGTACC 1349 .......... .......... .......... .......... .......... .......... 211 TAGAATATTT GTAAATTTTA CAATACACAT TACATATCCG TCATCTCAGT GGTCATTATG 1289 .......... .......... .......... .......... .......... .......... 211 TAGGGGAGGA AGAGATGTCG ACAGCTTGTT CTTACTTTCA ACACATAAAA GATCTACTAG 1229 .......... .......... .......... .......... .......... .......... 211 ACGGTGGTTC TGATTTGAAA AAAAAAAATG TACGTTTCAT ACAAAGGTTT TCATTTTCTG 1169 .......... .......... .......... .......... .......... .......... 211 TAGAAGTTTG AGTAAATACT GCTTCATATG ATGCTGATGA TAAGAAAAAG TGAGTCTTTG 1109 .......... .......... .......... .......... .......... .......... 211 ATGGAGCTTT TGTCTTCTGT AATTGCCATC CAAGGAAACA ATAGTCAACT CTATTCCTGG 1049 .......... .......... .......... .......... .......... .......... 211 ATGAAAGAAA TGACCTAATA GAGAGCTTTC GTTTCCTTTC TCCGAGTCAG AAAAGAGAAT 989 .......... .......... .......... .......... .......... .......... 211 CTGTGAATTG TCTAAATCTT CAAAACCATA TAGCCTAGAT ATAAGTTGAC CTACCAATTC 929 .......... .......... .......... .......... .......... .......... 211 TACTTCTGAG CAATTACCTT CTATACAAGA CCTCAAGTCT CAGCATCAAC CATTACTTGA 869 .......... .......... .......... .......... .......... .......... 211 AATAATAGCA CTTTGTTGCT TATATCATAT TTTCTGTTAA AAATGAACCA AAAAAAAAAA 809 .......... .......... .......... .......... .......... .......... 211 GAAGGGGCAT TCTCAGTTTC ATTGCAACTG GACACGCAGC TAATTTTAAT AGTTCGGGGA 749 .......... .......... .......... .......... .......... .......... 211 AGTTCTACAG GAGAGTGACC CATACCTTGA TTATGCAGTT AGTCAGAATG CTAACAATCC 689 .......... .......... .......... .......... .......... .......... 211 CAACAAAATT GAAAAGTAAT AAAAGAAAGA AAAACCCAAA ATTGGGTAGA AGTTGGTTTT 629 .......... .......... .......... .......... .......... .......... 211 ACATGAAGTA GGGGAAGTTT TTTGGAATTC TATTAAACGG TGCAACTGAG ATTGTCCACA 569 .......... .......... .......... .......... .......... .......... 211 ATAAGAACTT AAAGGTCCTC TCCTCTTTTT TTGGTGCCAA GGGCTCCACG TAGGACTGAT 509 .......... .......... .......... .......... .......... .......... 211 TTGCAAGCTT TCCTCAATGT TGGTTTCAGA CCTGCTAATC CCTTCTGTTA TTCTTTGTTT 449 .......... .......... .......... .......... .......... .......... 211 CAGCTGGATG CTGTTATTGA AGCCGAAAAA CAAGCTGCAA AAGACTTGCT ACGCGAAAAG 389 ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ...CTGGATG CTGTTATTGA AGCCGAAAAA CAAGCTGCAA AAGACTTGCT ACGCGAAAAG 268 AAGAAAGAGA GGGCCCTTTT AGCATTGAAA AAGAAGAAAG TGCAAGAAGA ACTATTAAAG 329 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAAAGAGA GGGCCCTTTT AGCATTGAAA AAGAAGAAAG TGCAAGAAGA ACTATTAAAG 328 CAAGTTGATG TTTGGCTCAT AAATGTTGAG CAGCAAGTAA GTAATTCTTC TTAATTCAGT 269 |||||||||| |||||||||| |||||||||| |||||| CAAGTTGATG TTTGGCTCAT AAATGTTGAG CAGCAA.... .......... .......... 364 TCCATACTTA TGCTACCATG CTAAATTTCC TTATTTCTCT TAAGCATATG CTACCATGCT 209 .......... .......... .......... .......... .......... .......... 364 AAATTTCCTT ATTTCTCTTA AGCATTTAAA TTGTTTTCTG ATTCTATTTG CAAATTCTTT 149 .......... .......... .......... .......... .......... .......... 364 TAGTTGGCAG ATATTGAACT GACAAGCAAG CAAAAGGCTG TCTTTGAGAG TTTGAAAACT 89 ||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ...TTGGCAG ATATTGAACT GACAAGCAAG CAAAAGGCTG TCTTTGAGAG TTTGAAAACT 421 hqPGS_C12HBa0093P12.1-2-_SGN-E321151+ (2248 2038,445 293,145 89) ******************************************************************************** EST sequence 13 +strand 566 n (File: SGN-E210308+) 1 AAAGATCAAA AAAGTTAGAA CTCAAAGAAC AAGTAACATT GATTTCTACA CAATAATTTG 61 TCTCTTTTTG TCTATTTGAT CATCGCTGAG GACTCACCGA TCCATTAATT TCGTATTTTC 121 ATATTGTTTT TTTCAGCTTA TAAGGAGATC AATTACGAAT AATCTAAAGG GTCTTTGCAT 181 ATAGAGATGC AAAAGACCTC ATTTTTGTTG TAAATTTTTT ATTTTTTTGT TGTGGGGTTG 241 TTCAATTTCG TATTTTTTTT TTGTGGGTTT TGAATTTGGG AGATGGGAAA TATATTTGTG 301 AAGAAACCGA AGATCACCGA AGTTGATAGA GCGATTTTGT CTTTGAAGAC TCAAAGGCGT 361 AAGCTTGCTC AATATCAGCA ACAGCTGGAT GCTGTTATTG AAGCCGAAAA ACAAGCTGCA 421 AAAGACTTGC TACGCGAAAA GAAGAAAGAG AGGGCCCTTT TAGCATTGAA AAAGAAGAAA 481 GTGCAAGAAG AACTATTAAA GCAAGTTGAT GTTTGGCTCA TAAATGTTGA GCAGCAATTG 541 GCAGATATTG AACTGACAAG CAAGCA Predicted gene structure (within gDNA segment 3021 to 1): Exon 1 2421 2038 ( 384 n); cDNA 1 384 ( 384 n); score: 1.000 Intron 1 2037 446 (1592 n); Pd: 0.984 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 2 445 293 ( 153 n); cDNA 385 537 ( 153 n); score: 1.000 Intron 2 292 146 ( 147 n); Pd: 0.997 (s: 1.00), Pa: 0.991 (s: 0) Exon 3 145 117 ( 29 n); cDNA 538 566 ( 29 n); score: 1.000 MATCH C12HBa0093P12.1-2- SGN-E210308+ 1.000 566 1.000 C PGS_C12HBa0093P12.1-2-_SGN-E210308+ (2421 2038,445 293,145 117) Alignment (genomic DNA sequence = upper lines): AAAGATCAAA AAAGTTAGAA CTCAAAGAAC AAGTAACATT GATTTCTACA CAATAATTTG 2362 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGATCAAA AAAGTTAGAA CTCAAAGAAC AAGTAACATT GATTTCTACA CAATAATTTG 60 TCTCTTTTTG TCTATTTGAT CATCGCTGAG GACTCACCGA TCCATTAATT TCGTATTTTC 2302 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCTTTTTG TCTATTTGAT CATCGCTGAG GACTCACCGA TCCATTAATT TCGTATTTTC 120 ATATTGTTTT TTTCAGCTTA TAAGGAGATC AATTACGAAT AATCTAAAGG GTCTTTGCAT 2242 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATATTGTTTT TTTCAGCTTA TAAGGAGATC AATTACGAAT AATCTAAAGG GTCTTTGCAT 180 ATAGAGATGC AAAAGACCTC ATTTTTGTTG TAAATTTTTT ATTTTTTTGT TGTGGGGTTG 2182 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATAGAGATGC AAAAGACCTC ATTTTTGTTG TAAATTTTTT ATTTTTTTGT TGTGGGGTTG 240 TTCAATTTCG TATTTTTTTT TTGTGGGTTT TGAATTTGGG AGATGGGAAA TATATTTGTG 2122 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAATTTCG TATTTTTTTT TTGTGGGTTT TGAATTTGGG AGATGGGAAA TATATTTGTG 300 AAGAAACCGA AGATCACCGA AGTTGATAGA GCGATTTTGT CTTTGAAGAC TCAAAGGCGT 2062 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGAAACCGA AGATCACCGA AGTTGATAGA GCGATTTTGT CTTTGAAGAC TCAAAGGCGT 360 AAGCTTGCTC AATATCAGCA ACAGGTATAA GGAATTTTAT TTTTGCCCCT TTTTGAAAAT 2002 |||||||||| |||||||||| |||| AAGCTTGCTC AATATCAGCA ACAG...... .......... .......... .......... 384 TTTGACTGCA GTTCTCTTTT TGGTGGGCTA AACTAAAATT GCTTAGTTTA GTTAAGATGG 1942 .......... .......... .......... .......... .......... .......... 384 ATCATGGGTA GTTTTTGGAT TTTTTTTTCT GGGATTGGTT ATTGTGGAGG ATTATTAGGT 1882 .......... .......... .......... .......... .......... .......... 384 GAAATTATGA AATTTGACAG CAAGATGGCT TTTTGGGTGT TTATTGCAAT GCACAATTAT 1822 .......... .......... .......... .......... .......... .......... 384 AGCTCAACAA TGATGTTAAT ACTAGATTTA TAATGTTGAG ATTGAGTTGT AGAACTATGT 1762 .......... .......... .......... .......... .......... .......... 384 GTTTTTTCTT GAGTCTTTTA GCAATAACTA TGTTTAGGAA GAATTAGTTC TCTATGTTGG 1702 .......... .......... .......... .......... .......... .......... 384 TATTTGAGCC GAGGGTCAAC CTCCCTACCT CCAAGGTTTC CTACCTCCAA GGTTGCTGGT 1642 .......... .......... .......... .......... .......... .......... 384 AGGGGTAAGG TCTTCGTAGA CTTTACCTTC ACAGACTCTA CTTTGTGGGA TTACACGGGT 1582 .......... .......... .......... .......... .......... .......... 384 GTGTTGTTGT TGTTATTATA TGGCATGCGA TGAGTTTAAG TGGAGTAAGA TGATTAGTGA 1522 .......... .......... .......... .......... .......... .......... 384 GGGTTATATA GCGGATCCCA ACTTGTTTTT GTACTGAGGC ATTGTTGTTA TTGGTTCTTC 1462 .......... .......... .......... .......... .......... .......... 384 CTTGACCATA ACGTGATGCT TAGAGGAGCT GTGTATCAAA GTCTCTATAT TTCTGAATGA 1402 .......... .......... .......... .......... .......... .......... 384 ATCGACCAGC TCTTCCAATT TGAGAAACAA TAGCAACTGA ACTTGGCAGT ACCTAGAATA 1342 .......... .......... .......... .......... .......... .......... 384 TTTGTAAATT TTACAATACA CATTACATAT CCGTCATCTC AGTGGTCATT ATGTAGGGGA 1282 .......... .......... .......... .......... .......... .......... 384 GGAAGAGATG TCGACAGCTT GTTCTTACTT TCAACACATA AAAGATCTAC TAGACGGTGG 1222 .......... .......... .......... .......... .......... .......... 384 TTCTGATTTG AAAAAAAAAA ATGTACGTTT CATACAAAGG TTTTCATTTT CTGTAGAAGT 1162 .......... .......... .......... .......... .......... .......... 384 TTGAGTAAAT ACTGCTTCAT ATGATGCTGA TGATAAGAAA AAGTGAGTCT TTGATGGAGC 1102 .......... .......... .......... .......... .......... .......... 384 TTTTGTCTTC TGTAATTGCC ATCCAAGGAA ACAATAGTCA ACTCTATTCC TGGATGAAAG 1042 .......... .......... .......... .......... .......... .......... 384 AAATGACCTA ATAGAGAGCT TTCGTTTCCT TTCTCCGAGT CAGAAAAGAG AATCTGTGAA 982 .......... .......... .......... .......... .......... .......... 384 TTGTCTAAAT CTTCAAAACC ATATAGCCTA GATATAAGTT GACCTACCAA TTCTACTTCT 922 .......... .......... .......... .......... .......... .......... 384 GAGCAATTAC CTTCTATACA AGACCTCAAG TCTCAGCATC AACCATTACT TGAAATAATA 862 .......... .......... .......... .......... .......... .......... 384 GCACTTTGTT GCTTATATCA TATTTTCTGT TAAAAATGAA CCAAAAAAAA AAAGAAGGGG 802 .......... .......... .......... .......... .......... .......... 384 CATTCTCAGT TTCATTGCAA CTGGACACGC AGCTAATTTT AATAGTTCGG GGAAGTTCTA 742 .......... .......... .......... .......... .......... .......... 384 CAGGAGAGTG ACCCATACCT TGATTATGCA GTTAGTCAGA ATGCTAACAA TCCCAACAAA 682 .......... .......... .......... .......... .......... .......... 384 ATTGAAAAGT AATAAAAGAA AGAAAAACCC AAAATTGGGT AGAAGTTGGT TTTACATGAA 622 .......... .......... .......... .......... .......... .......... 384 GTAGGGGAAG TTTTTTGGAA TTCTATTAAA CGGTGCAACT GAGATTGTCC ACAATAAGAA 562 .......... .......... .......... .......... .......... .......... 384 CTTAAAGGTC CTCTCCTCTT TTTTTGGTGC CAAGGGCTCC ACGTAGGACT GATTTGCAAG 502 .......... .......... .......... .......... .......... .......... 384 CTTTCCTCAA TGTTGGTTTC AGACCTGCTA ATCCCTTCTG TTATTCTTTG TTTCAGCTGG 442 |||| .......... .......... .......... .......... .......... ......CTGG 388 ATGCTGTTAT TGAAGCCGAA AAACAAGCTG CAAAAGACTT GCTACGCGAA AAGAAGAAAG 382 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATGCTGTTAT TGAAGCCGAA AAACAAGCTG CAAAAGACTT GCTACGCGAA AAGAAGAAAG 448 AGAGGGCCCT TTTAGCATTG AAAAAGAAGA AAGTGCAAGA AGAACTATTA AAGCAAGTTG 322 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAGGGCCCT TTTAGCATTG AAAAAGAAGA AAGTGCAAGA AGAACTATTA AAGCAAGTTG 508 ATGTTTGGCT CATAAATGTT GAGCAGCAAG TAAGTAATTC TTCTTAATTC AGTTCCATAC 262 |||||||||| |||||||||| ||||||||| ATGTTTGGCT CATAAATGTT GAGCAGCAA. .......... .......... .......... 537 TTATGCTACC ATGCTAAATT TCCTTATTTC TCTTAAGCAT ATGCTACCAT GCTAAATTTC 202 .......... .......... .......... .......... .......... .......... 537 CTTATTTCTC TTAAGCATTT AAATTGTTTT CTGATTCTAT TTGCAAATTC TTTTAGTTGG 142 |||| .......... .......... .......... .......... .......... ......TTGG 541 CAGATATTGA ACTGACAAGC AAGCA 117 |||||||||| |||||||||| ||||| CAGATATTGA ACTGACAAGC AAGCA 566 hqPGS_C12HBa0093P12.1-2-_SGN-E210308+ (2421 2038,445 293,145 117) ******************************************************************************** EST sequence 15 +strand 517 n (File: SGN-E287725+) 1 AGTTAGAACT CAAAGAACAA GTAACATTGA TTTCTACACA ATAATTTGTC TCTTTTTGTC 61 TATTTGATCA TCGCTGAGGA CTCACCGATC CATTAATTTC GTATTTTCAT ATTGTTTTTT 121 TCAGCTTATA AGGAGATCAA TTACGAATAA TCTAAAGGGT CTTTGCATAT AGAGATGCAA 181 AAGACCTCAT TTTTGTTGTA AATTTTTTAT TTTTTTGTTG TGGGGTTGTT CAATTTCGTT 241 TTTTTTTTGT GGGTTTTGAA TTTGGGAGAT GGGAAATATA TTTGTGAAGA AACCGAAGAT 301 CACCGAAGTT GATAGAGCGA TTTTGTCTTT GAAGACTCAA AGGCGTAAGC TTGCTCAATA 361 TCAGCAACAG CTGGATGCTG TTATTGAAGC CGAAAAACAA GCTGCAAAAG ACTTGCTACG 421 CGAAAAGAAG AAAGAGAGGG CCCTTTTAGC ATTGAAAAAG AAGAAAGTGC AAGAAGAACT 481 ATTAAAGCAA GTTGATGTTT GGCTCATAAA TGTTGAG Predicted gene structure (within gDNA segment 3009 to 1): Exon 1 2409 2038 ( 372 n); cDNA 1 370 ( 370 n); score: 0.995 Intron 1 2037 446 (1592 n); Pd: 0.984 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 2 445 299 ( 147 n); cDNA 371 517 ( 147 n); score: 1.000 MATCH C12HBa0093P12.1-2- SGN-E287725+ 0.996 519 1.004 C PGS_C12HBa0093P12.1-2-_SGN-E287725+ (2409 2038,445 299) Alignment (genomic DNA sequence = upper lines): AGTTAGAACT CAAAGAACAA GTAACATTGA TTTCTACACA ATAATTTGTC TCTTTTTGTC 2350 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGTTAGAACT CAAAGAACAA GTAACATTGA TTTCTACACA ATAATTTGTC TCTTTTTGTC 60 TATTTGATCA TCGCTGAGGA CTCACCGATC CATTAATTTC GTATTTTCAT ATTGTTTTTT 2290 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATTTGATCA TCGCTGAGGA CTCACCGATC CATTAATTTC GTATTTTCAT ATTGTTTTTT 120 TCAGCTTATA AGGAGATCAA TTACGAATAA TCTAAAGGGT CTTTGCATAT AGAGATGCAA 2230 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAGCTTATA AGGAGATCAA TTACGAATAA TCTAAAGGGT CTTTGCATAT AGAGATGCAA 180 AAGACCTCAT TTTTGTTGTA AATTTTTTAT TTTTTTGTTG TGGGGTTGTT CAATTTCGTA 2170 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| AAGACCTCAT TTTTGTTGTA AATTTTTTAT TTTTTTGTTG TGGGGTTGTT CAATTTCG-- 238 TTTTTTTTTT GTGGGTTTTG AATTTGGGAG ATGGGAAATA TATTTGTGAA GAAACCGAAG 2110 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTTTTTT GTGGGTTTTG AATTTGGGAG ATGGGAAATA TATTTGTGAA GAAACCGAAG 298 ATCACCGAAG TTGATAGAGC GATTTTGTCT TTGAAGACTC AAAGGCGTAA GCTTGCTCAA 2050 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCACCGAAG TTGATAGAGC GATTTTGTCT TTGAAGACTC AAAGGCGTAA GCTTGCTCAA 358 TATCAGCAAC AGGTATAAGG AATTTTATTT TTGCCCCTTT TTGAAAATTT TGACTGCAGT 1990 |||||||||| || TATCAGCAAC AG........ .......... .......... .......... .......... 370 TCTCTTTTTG GTGGGCTAAA CTAAAATTGC TTAGTTTAGT TAAGATGGAT CATGGGTAGT 1930 .......... .......... .......... .......... .......... .......... 370 TTTTGGATTT TTTTTTCTGG GATTGGTTAT TGTGGAGGAT TATTAGGTGA AATTATGAAA 1870 .......... .......... .......... .......... .......... .......... 370 TTTGACAGCA AGATGGCTTT TTGGGTGTTT ATTGCAATGC ACAATTATAG CTCAACAATG 1810 .......... .......... .......... .......... .......... .......... 370 ATGTTAATAC TAGATTTATA ATGTTGAGAT TGAGTTGTAG AACTATGTGT TTTTTCTTGA 1750 .......... .......... .......... .......... .......... .......... 370 GTCTTTTAGC AATAACTATG TTTAGGAAGA ATTAGTTCTC TATGTTGGTA TTTGAGCCGA 1690 .......... .......... .......... .......... .......... .......... 370 GGGTCAACCT CCCTACCTCC AAGGTTTCCT ACCTCCAAGG TTGCTGGTAG GGGTAAGGTC 1630 .......... .......... .......... .......... .......... .......... 370 TTCGTAGACT TTACCTTCAC AGACTCTACT TTGTGGGATT ACACGGGTGT GTTGTTGTTG 1570 .......... .......... .......... .......... .......... .......... 370 TTATTATATG GCATGCGATG AGTTTAAGTG GAGTAAGATG ATTAGTGAGG GTTATATAGC 1510 .......... .......... .......... .......... .......... .......... 370 GGATCCCAAC TTGTTTTTGT ACTGAGGCAT TGTTGTTATT GGTTCTTCCT TGACCATAAC 1450 .......... .......... .......... .......... .......... .......... 370 GTGATGCTTA GAGGAGCTGT GTATCAAAGT CTCTATATTT CTGAATGAAT CGACCAGCTC 1390 .......... .......... .......... .......... .......... .......... 370 TTCCAATTTG AGAAACAATA GCAACTGAAC TTGGCAGTAC CTAGAATATT TGTAAATTTT 1330 .......... .......... .......... .......... .......... .......... 370 ACAATACACA TTACATATCC GTCATCTCAG TGGTCATTAT GTAGGGGAGG AAGAGATGTC 1270 .......... .......... .......... .......... .......... .......... 370 GACAGCTTGT TCTTACTTTC AACACATAAA AGATCTACTA GACGGTGGTT CTGATTTGAA 1210 .......... .......... .......... .......... .......... .......... 370 AAAAAAAAAT GTACGTTTCA TACAAAGGTT TTCATTTTCT GTAGAAGTTT GAGTAAATAC 1150 .......... .......... .......... .......... .......... .......... 370 TGCTTCATAT GATGCTGATG ATAAGAAAAA GTGAGTCTTT GATGGAGCTT TTGTCTTCTG 1090 .......... .......... .......... .......... .......... .......... 370 TAATTGCCAT CCAAGGAAAC AATAGTCAAC TCTATTCCTG GATGAAAGAA ATGACCTAAT 1030 .......... .......... .......... .......... .......... .......... 370 AGAGAGCTTT CGTTTCCTTT CTCCGAGTCA GAAAAGAGAA TCTGTGAATT GTCTAAATCT 970 .......... .......... .......... .......... .......... .......... 370 TCAAAACCAT ATAGCCTAGA TATAAGTTGA CCTACCAATT CTACTTCTGA GCAATTACCT 910 .......... .......... .......... .......... .......... .......... 370 TCTATACAAG ACCTCAAGTC TCAGCATCAA CCATTACTTG AAATAATAGC ACTTTGTTGC 850 .......... .......... .......... .......... .......... .......... 370 TTATATCATA TTTTCTGTTA AAAATGAACC AAAAAAAAAA AGAAGGGGCA TTCTCAGTTT 790 .......... .......... .......... .......... .......... .......... 370 CATTGCAACT GGACACGCAG CTAATTTTAA TAGTTCGGGG AAGTTCTACA GGAGAGTGAC 730 .......... .......... .......... .......... .......... .......... 370 CCATACCTTG ATTATGCAGT TAGTCAGAAT GCTAACAATC CCAACAAAAT TGAAAAGTAA 670 .......... .......... .......... .......... .......... .......... 370 TAAAAGAAAG AAAAACCCAA AATTGGGTAG AAGTTGGTTT TACATGAAGT AGGGGAAGTT 610 .......... .......... .......... .......... .......... .......... 370 TTTTGGAATT CTATTAAACG GTGCAACTGA GATTGTCCAC AATAAGAACT TAAAGGTCCT 550 .......... .......... .......... .......... .......... .......... 370 CTCCTCTTTT TTTGGTGCCA AGGGCTCCAC GTAGGACTGA TTTGCAAGCT TTCCTCAATG 490 .......... .......... .......... .......... .......... .......... 370 TTGGTTTCAG ACCTGCTAAT CCCTTCTGTT ATTCTTTGTT TCAGCTGGAT GCTGTTATTG 430 |||||| |||||||||| .......... .......... .......... .......... ....CTGGAT GCTGTTATTG 386 AAGCCGAAAA ACAAGCTGCA AAAGACTTGC TACGCGAAAA GAAGAAAGAG AGGGCCCTTT 370 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGCCGAAAA ACAAGCTGCA AAAGACTTGC TACGCGAAAA GAAGAAAGAG AGGGCCCTTT 446 TAGCATTGAA AAAGAAGAAA GTGCAAGAAG AACTATTAAA GCAAGTTGAT GTTTGGCTCA 310 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGCATTGAA AAAGAAGAAA GTGCAAGAAG AACTATTAAA GCAAGTTGAT GTTTGGCTCA 506 TAAATGTTGA G 299 |||||||||| | TAAATGTTGA G 517 hqPGS_C12HBa0093P12.1-2-_SGN-E287725+ (2409 2038,445 299) ******************************************************************************** EST sequence 9 +strand 516 n (File: SGN-E307480+) 1 AAAAGATCAA AAAAGTTAGA ACTCAAAGAA CAAGTAACAT TGATTTCTAC ACAATAATTT 61 GTCTCTTTTT GTCTATTTGA TCATCGCTGA GGACTCACCG ATCCATTAAT TTCGTATTTT 121 CATATTGTTT TTTTCAGCTT ATAAGGAGAT CAATTACGAA TAATCTAAAG GGTCTTTGCA 181 TATAGAGATG CAAAAGACCT CATTTTTGTT GTAAATTTTT TATTTTTTTG TTGTGGGGTT 241 GTTCAATTTC GTATTTTTTT TTTTGTGGGT TTTGAATTTG GGAGATGGGA AATATATTTG 301 TGAAGAAACC GAAGATCACC GAAGTTGATA GAGCGATTTT GTCTTTGAAG ACTCAAAGGC 361 GTAAGCTTGC TCAATATCAG CAACAGCTGG ATGCTGTTAT TGAAGCCGAA AAACAAGCTG 421 CAAAAGACTT GCTACGCGAA AAGAAGAAAG AGAGGGCCCT TTTAGCATTG AAAAAGAAGA 481 AAGTGCAAGA AGAACTATTA AAGCAAGTTG ATGTTT Predicted gene structure (within gDNA segment 3022 to 1): Exon 1 2422 2038 ( 385 n); cDNA 1 386 ( 386 n); score: 0.994 Intron 1 2037 446 (1592 n); Pd: 0.984 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 2 445 316 ( 130 n); cDNA 387 516 ( 130 n); score: 1.000 MATCH C12HBa0093P12.1-2- SGN-E307480+ 0.995 515 0.998 C PGS_C12HBa0093P12.1-2-_SGN-E307480+ (2422 2038,445 316) Alignment (genomic DNA sequence = upper lines): AAAAGATCAA AAAAGTTAGA ACTCAAAGAA CAAGTAACAT TGATTTCTAC ACAATAATTT 2363 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAGATCAA AAAAGTTAGA ACTCAAAGAA CAAGTAACAT TGATTTCTAC ACAATAATTT 60 GTCTCTTTTT GTCTATTTGA TCATCGCTGA GGACTCACCG ATCCATTAAT TTCGTATTTT 2303 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCTCTTTTT GTCTATTTGA TCATCGCTGA GGACTCACCG ATCCATTAAT TTCGTATTTT 120 CATATTGTTT TTTTCAGCTT ATAAGGAGAT CAATTACGAA TAATCTAAAG GGTCTTTGCA 2243 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATATTGTTT TTTTCAGCTT ATAAGGAGAT CAATTACGAA TAATCTAAAG GGTCTTTGCA 180 TATAGAGATG CAAAAGACCT CATTTTTGTT GTAAATTTTT TATTTTTTTG TTGTGGGGTT 2183 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATAGAGATG CAAAAGACCT CATTTTTGTT GTAAATTTTT TATTTTTTTG TTGTGGGGTT 240 GTTCAATTTC GTA-TTTTTT TTTTGTGGGT TTTGAATTTG GGAGATGGGA AATATATTTG 2124 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTCAATTTC GTATTTTTTT TTTTGTGGGT TTTGAATTTG GGAGATGGGA AATATATTTG 300 TGAAGAAACC GAAGATCACC GAAGTTGATA GAGCGATTTT GTCTTTGAAG ACTCAAAGGC 2064 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAAGAAACC GAAGATCACC GAAGTTGATA GAGCGATTTT GTCTTTGAAG ACTCAAAGGC 360 GTAAGCTTGC TCAATATCAG CAACAGGTAT AAGGAATTTT ATTTTTGCCC CTTTTTGAAA 2004 |||||||||| |||||||||| |||||| GTAAGCTTGC TCAATATCAG CAACAG.... .......... .......... .......... 386 ATTTTGACTG CAGTTCTCTT TTTGGTGGGC TAAACTAAAA TTGCTTAGTT TAGTTAAGAT 1944 .......... .......... .......... .......... .......... .......... 386 GGATCATGGG TAGTTTTTGG ATTTTTTTTT CTGGGATTGG TTATTGTGGA GGATTATTAG 1884 .......... .......... .......... .......... .......... .......... 386 GTGAAATTAT GAAATTTGAC AGCAAGATGG CTTTTTGGGT GTTTATTGCA ATGCACAATT 1824 .......... .......... .......... .......... .......... .......... 386 ATAGCTCAAC AATGATGTTA ATACTAGATT TATAATGTTG AGATTGAGTT GTAGAACTAT 1764 .......... .......... .......... .......... .......... .......... 386 GTGTTTTTTC TTGAGTCTTT TAGCAATAAC TATGTTTAGG AAGAATTAGT TCTCTATGTT 1704 .......... .......... .......... .......... .......... .......... 386 GGTATTTGAG CCGAGGGTCA ACCTCCCTAC CTCCAAGGTT TCCTACCTCC AAGGTTGCTG 1644 .......... .......... .......... .......... .......... .......... 386 GTAGGGGTAA GGTCTTCGTA GACTTTACCT TCACAGACTC TACTTTGTGG GATTACACGG 1584 .......... .......... .......... .......... .......... .......... 386 GTGTGTTGTT GTTGTTATTA TATGGCATGC GATGAGTTTA AGTGGAGTAA GATGATTAGT 1524 .......... .......... .......... .......... .......... .......... 386 GAGGGTTATA TAGCGGATCC CAACTTGTTT TTGTACTGAG GCATTGTTGT TATTGGTTCT 1464 .......... .......... .......... .......... .......... .......... 386 TCCTTGACCA TAACGTGATG CTTAGAGGAG CTGTGTATCA AAGTCTCTAT ATTTCTGAAT 1404 .......... .......... .......... .......... .......... .......... 386 GAATCGACCA GCTCTTCCAA TTTGAGAAAC AATAGCAACT GAACTTGGCA GTACCTAGAA 1344 .......... .......... .......... .......... .......... .......... 386 TATTTGTAAA TTTTACAATA CACATTACAT ATCCGTCATC TCAGTGGTCA TTATGTAGGG 1284 .......... .......... .......... .......... .......... .......... 386 GAGGAAGAGA TGTCGACAGC TTGTTCTTAC TTTCAACACA TAAAAGATCT ACTAGACGGT 1224 .......... .......... .......... .......... .......... .......... 386 GGTTCTGATT TGAAAAAAAA AAATGTACGT TTCATACAAA GGTTTTCATT TTCTGTAGAA 1164 .......... .......... .......... .......... .......... .......... 386 GTTTGAGTAA ATACTGCTTC ATATGATGCT GATGATAAGA AAAAGTGAGT CTTTGATGGA 1104 .......... .......... .......... .......... .......... .......... 386 GCTTTTGTCT TCTGTAATTG CCATCCAAGG AAACAATAGT CAACTCTATT CCTGGATGAA 1044 .......... .......... .......... .......... .......... .......... 386 AGAAATGACC TAATAGAGAG CTTTCGTTTC CTTTCTCCGA GTCAGAAAAG AGAATCTGTG 984 .......... .......... .......... .......... .......... .......... 386 AATTGTCTAA ATCTTCAAAA CCATATAGCC TAGATATAAG TTGACCTACC AATTCTACTT 924 .......... .......... .......... .......... .......... .......... 386 CTGAGCAATT ACCTTCTATA CAAGACCTCA AGTCTCAGCA TCAACCATTA CTTGAAATAA 864 .......... .......... .......... .......... .......... .......... 386 TAGCACTTTG TTGCTTATAT CATATTTTCT GTTAAAAATG AACCAAAAAA AAAAAGAAGG 804 .......... .......... .......... .......... .......... .......... 386 GGCATTCTCA GTTTCATTGC AACTGGACAC GCAGCTAATT TTAATAGTTC GGGGAAGTTC 744 .......... .......... .......... .......... .......... .......... 386 TACAGGAGAG TGACCCATAC CTTGATTATG CAGTTAGTCA GAATGCTAAC AATCCCAACA 684 .......... .......... .......... .......... .......... .......... 386 AAATTGAAAA GTAATAAAAG AAAGAAAAAC CCAAAATTGG GTAGAAGTTG GTTTTACATG 624 .......... .......... .......... .......... .......... .......... 386 AAGTAGGGGA AGTTTTTTGG AATTCTATTA AACGGTGCAA CTGAGATTGT CCACAATAAG 564 .......... .......... .......... .......... .......... .......... 386 AACTTAAAGG TCCTCTCCTC TTTTTTTGGT GCCAAGGGCT CCACGTAGGA CTGATTTGCA 504 .......... .......... .......... .......... .......... .......... 386 AGCTTTCCTC AATGTTGGTT TCAGACCTGC TAATCCCTTC TGTTATTCTT TGTTTCAGCT 444 || .......... .......... .......... .......... .......... ........CT 388 GGATGCTGTT ATTGAAGCCG AAAAACAAGC TGCAAAAGAC TTGCTACGCG AAAAGAAGAA 384 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGATGCTGTT ATTGAAGCCG AAAAACAAGC TGCAAAAGAC TTGCTACGCG AAAAGAAGAA 448 AGAGAGGGCC CTTTTAGCAT TGAAAAAGAA GAAAGTGCAA GAAGAACTAT TAAAGCAAGT 324 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAGAGGGCC CTTTTAGCAT TGAAAAAGAA GAAAGTGCAA GAAGAACTAT TAAAGCAAGT 508 TGATGTTT 316 |||||||| TGATGTTT 516 hqPGS_C12HBa0093P12.1-2-_SGN-E307480+ (2422 2038,445 316) ******************************************************************************** EST sequence 4 +strand 403 n (File: SGN-E303502+) 1 AGATCAAAAA AGTTAGAACT CAAAGAACAA GTAACATTGA TTTCTACACA ATAATTTGTC 61 TCTTTTTGTC TATTTGATCA TCGCTGAGGA CTCACCGATC CATTAATTTC GTATTTTCAT 121 ATTGTTTTTT TCAGCTTATA AGGAGATCAA TTACGAATAA TCTAAAGGGT CTTTGCATAT 181 AGAGATGCAA AAGACCTCAT TTTTGTTGTA AATTTTTTAT TTTTTTGTTG TGGGGTTGTT 241 CAATTTCGTA TTTTTTTTTT TGTGGGTTTT GAATTTGGGA GATGGGAAAT ATATTTGTGA 301 AGAAACCGAA GATCACCGAA GTTGATAGAG CGATTTTGTC TTTGAAGACT CAAAGGCGTA 361 AGCTTGCTCA ATATCAGCAA CAGCTGGATG CTGTTATTGA AGC Predicted gene structure (within gDNA segment 3019 to 1228): Exon 1 2419 2038 ( 382 n); cDNA 1 383 ( 383 n); score: 0.993 MATCH C12HBa0093P12.1-2- SGN-E303502+ 0.993 382 0.948 C PGS_C12HBa0093P12.1-2-_SGN-E303502+ (2419 2038) Alignment (genomic DNA sequence = upper lines): AGATCAAAAA AGTTAGAACT CAAAGAACAA GTAACATTGA TTTCTACACA ATAATTTGTC 2360 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGATCAAAAA AGTTAGAACT CAAAGAACAA GTAACATTGA TTTCTACACA ATAATTTGTC 60 TCTTTTTGTC TATTTGATCA TCGCTGAGGA CTCACCGATC CATTAATTTC GTATTTTCAT 2300 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTTTTTGTC TATTTGATCA TCGCTGAGGA CTCACCGATC CATTAATTTC GTATTTTCAT 120 ATTGTTTTTT TCAGCTTATA AGGAGATCAA TTACGAATAA TCTAAAGGGT CTTTGCATAT 2240 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTGTTTTTT TCAGCTTATA AGGAGATCAA TTACGAATAA TCTAAAGGGT CTTTGCATAT 180 AGAGATGCAA AAGACCTCAT TTTTGTTGTA AATTTTTTAT TTTTTTGTTG TGGGGTTGTT 2180 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAGATGCAA AAGACCTCAT TTTTGTTGTA AATTTTTTAT TTTTTTGTTG TGGGGTTGTT 240 CAATTTCGTA -TTTTTTTTT TGTGGGTTTT GAATTTGGGA GATGGGAAAT ATATTTGTGA 2121 |||||||||| ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAATTTCGTA TTTTTTTTTT TGTGGGTTTT GAATTTGGGA GATGGGAAAT ATATTTGTGA 300 AGAAACCGAA GATCACCGAA GTTGATAGAG CGATTTTGTC TTTGAAGACT CAAAGGCGTA 2061 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAAACCGAA GATCACCGAA GTTGATAGAG CGATTTTGTC TTTGAAGACT CAAAGGCGTA 360 AGCTTGCTCA ATATCAGCAA CAG 2038 |||||||||| |||||||||| ||| AGCTTGCTCA ATATCAGCAA CAG 383 hqPGS_C12HBa0093P12.1-2-_SGN-E303502+ (2419 2038) ******************************************************************************** EST sequence 19 +strand 367 n (File: SGN-E331254+) 1 AAAAAAGTTA GAACTCAAAG AACAAGTAAC ATTGATTTCT ACACAATAAT TTGTCTCTTT 61 TTGTCTATTT GATCATCGCT GAGGACTCAC CGATCCATTA ATTTCGTATT TTCATATTGT 121 TTTTTTCAGC TTATAAGGAG ATCAATTACG AATAATCTAA AGGGTCTTTG CATATAGAGA 181 TGCAAAAGAC CTCATTTTTG TTGTAAATTT TTTATTTTTT TGTTGTGGGG TTGTTCAATT 241 TCGTATTTTT TTTTTGTGGG TTTTGAATTT GGGAGATGGG AAATATATTT GTGAAGAAAC 301 CGAAGATCAC CGAAGTTGAT AGAGCGATTT TGTCTTTGAA GACTCAAAGG CGTAAGCTTG 361 CTCAATA Predicted gene structure (within gDNA segment 3014 to 1438): Exon 1 2414 2048 ( 367 n); cDNA 1 367 ( 367 n); score: 1.000 MATCH C12HBa0093P12.1-2- SGN-E331254+ 1.000 367 1.000 C PGS_C12HBa0093P12.1-2-_SGN-E331254+ (2414 2048) Alignment (genomic DNA sequence = upper lines): AAAAAAGTTA GAACTCAAAG AACAAGTAAC ATTGATTTCT ACACAATAAT TTGTCTCTTT 2355 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAAAGTTA GAACTCAAAG AACAAGTAAC ATTGATTTCT ACACAATAAT TTGTCTCTTT 60 TTGTCTATTT GATCATCGCT GAGGACTCAC CGATCCATTA ATTTCGTATT TTCATATTGT 2295 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGTCTATTT GATCATCGCT GAGGACTCAC CGATCCATTA ATTTCGTATT TTCATATTGT 120 TTTTTTCAGC TTATAAGGAG ATCAATTACG AATAATCTAA AGGGTCTTTG CATATAGAGA 2235 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTTCAGC TTATAAGGAG ATCAATTACG AATAATCTAA AGGGTCTTTG CATATAGAGA 180 TGCAAAAGAC CTCATTTTTG TTGTAAATTT TTTATTTTTT TGTTGTGGGG TTGTTCAATT 2175 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCAAAAGAC CTCATTTTTG TTGTAAATTT TTTATTTTTT TGTTGTGGGG TTGTTCAATT 240 TCGTATTTTT TTTTTGTGGG TTTTGAATTT GGGAGATGGG AAATATATTT GTGAAGAAAC 2115 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGTATTTTT TTTTTGTGGG TTTTGAATTT GGGAGATGGG AAATATATTT GTGAAGAAAC 300 CGAAGATCAC CGAAGTTGAT AGAGCGATTT TGTCTTTGAA GACTCAAAGG CGTAAGCTTG 2055 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGAAGATCAC CGAAGTTGAT AGAGCGATTT TGTCTTTGAA GACTCAAAGG CGTAAGCTTG 360 CTCAATA 2048 ||||||| CTCAATA 367 hqPGS_C12HBa0093P12.1-2-_SGN-E331254+ (2414 2048) ******************************************************************************** EST sequence 8 +strand 245 n (File: SGN-E307348+) 1 ACAATAATTT GTCTCTTTTT GGCTATTTGA TCATCGCTGA GGACTCACCG ATCCATTAAT 61 TGAGTATTTT CATATTGCTT GTTTCAGCTT ATAAGGAGAT CAATTACGAA TAATCTAAAG 121 GGTCTTTGCA TATAGAGATG CAAAAGACCT CATTTGTGGT GTAAATTCTT TATTGTTTTG 181 ATGTGGGGTT GTTCAATTCC GTATTTTTTT TTTTGTGGGT TGTGAATTTG GGAGATGGGA 241 AATAT Predicted gene structure (within gDNA segment 2972 to 708): Exon 1 2372 2129 ( 244 n); cDNA 1 245 ( 245 n); score: 0.941 MATCH C12HBa0093P12.1-2- SGN-E307348+ 0.941 244 0.996 C PGS_C12HBa0093P12.1-2-_SGN-E307348+ (2372 2129) Alignment (genomic DNA sequence = upper lines): ACAATAATTT GTCTCTTTTT GTCTATTTGA TCATCGCTGA GGACTCACCG ATCCATTAAT 2313 |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| |||||||||| ACAATAATTT GTCTCTTTTT GGCTATTTGA TCATCGCTGA GGACTCACCG ATCCATTAAT 60 TTCGTATTTT CATATTGTTT TTTTCAGCTT ATAAGGAGAT CAATTACGAA TAATCTAAAG 2253 | ||||||| ||||||| || ||||||||| |||||||||| |||||||||| |||||||||| TGAGTATTTT CATATTGCTT GTTTCAGCTT ATAAGGAGAT CAATTACGAA TAATCTAAAG 120 GGTCTTTGCA TATAGAGATG CAAAAGACCT CATTTTTGTT GTAAATTTTT TATTTTTTTG 2193 |||||||||| |||||||||| |||||||||| ||||| || | ||||||| || |||| ||||| GGTCTTTGCA TATAGAGATG CAAAAGACCT CATTTGTGGT GTAAATTCTT TATTGTTTTG 180 TTGTGGGGTT GTTCAATTTC GTA-TTTTTT TTTTGTGGGT TTTGAATTTG GGAGATGGGA 2134 ||||||||| |||||||| | ||| |||||| |||||||||| | |||||||| |||||||||| ATGTGGGGTT GTTCAATTCC GTATTTTTTT TTTTGTGGGT TGTGAATTTG GGAGATGGGA 240 AATAT 2129 ||||| AATAT 245 hqPGS_C12HBa0093P12.1-2-_SGN-E307348+ (2372 2129) ******************************************************************************** EST sequence 2 -strand 795 n (File: SGN-E545754-) 1 ATTTTTCAAA AAACGTTGGT CAAGTTGTTG TTCTTGGTGG TGCTTTTGCT GTAAATGGAA 61 ATGTGAGTCC AGCAGCTGAG GCAAATATCT TCAAAGATCC AACTGCTGCT GATATTGTAT 121 TTACAAGTGG TGCTGATGTT CTTGCTGTTG GATTAAATAT TACACATCAA GTTGTCCTTA 181 CTGATTCTCA TCGTGGCGAA TTGGCAAAGT CCAATGGAAA GTTTGCCAAG TACCTCAGCA 241 AGCTTTTGGA TGTCTATTTC GATTATCACA ATACTGCATA CAGCACGAGA GGTGTCTTCC 301 TTCATGATCC AACTGCTTTA CTTGCTGCTG TTAATCCATC ACTCCTCACC TATTCAGAAG 361 GTGTCGTTCG CGTTCAGACA GTTGGCATCA CAAAGGGTCT CACAATCTTT TATAACAAAC 421 AGAAAAGGTT TGTTGAAGTC ACTGAATGGT CTGAAAAACC TATAGTTAAA GTGGCAGCAA 481 CTGTTGATGC TCCTAAAGTT ATCGAATTGG TGATGACACG ACTCGTCAAT TCTTAGAAGT 541 ATTCATTGAA CATTTATTTG AACAGTGTTA CTGTCTTTCT GCACTCAATA TTCAGAAACT 601 TAACTACGCT ATAAGGAACG GAATCTTAAA CTATAGACCA AGTACTACTC TCAGGAACAA 661 GGATGAATGT ATGTAAGCAG TAAGCAGCTC ATATACAGAA TATGAACTTG CAAAAATAAG 721 TTACAGCTAG ACTAGTCGTG TTCTGATAGA TTTAATCTTT GGTAAAGATA TGACATTTGA 781 GCCTAAAAAA AAAAA Predicted gene structure (within gDNA segment 10824 to 7977): Exon 1 10140 10057 ( 84 n); cDNA 3 86 ( 84 n); score: 0.976 Intron 1 10056 9980 ( 77 n); Pd: 1.000 (s: 1.00), Pa: 0.684 (s: 1.00) Exon 2 9979 9883 ( 97 n); cDNA 87 183 ( 97 n); score: 1.000 Intron 2 9882 9456 ( 427 n); Pd: 0.962 (s: 1.00), Pa: 0.666 (s: 1.00) Exon 3 9455 9348 ( 108 n); cDNA 184 291 ( 108 n); score: 1.000 Intron 3 9347 9254 ( 94 n); Pd: 0.481 (s: 1.00), Pa: 0.982 (s: 1.00) Exon 4 9253 9118 ( 136 n); cDNA 292 427 ( 136 n); score: 1.000 Intron 4 9117 9027 ( 91 n); Pd: 0.984 (s: 1.00), Pa: 0.917 (s: 1.00) Exon 5 9026 8662 ( 365 n); cDNA 428 791 ( 364 n); score: 0.989 MATCH C12HBa0093P12.1-2- SGN-E545754- 0.992 790 0.994 C PGS_C12HBa0093P12.1-2-_SGN-E545754- (10140 10057,9979 9883,9455 9348,9253 9118,9026 8662) Alignment (genomic DNA sequence = upper lines): TTTTTCAAAA ACGTTGGTCA AGTTGTTGTT CTTGGTGGTG CTTTTGCTGT AAATGGAAAT 10081 |||| |||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTCAAAAA ACGTTGGTCA AGTTGTTGTT CTTGGTGGTG CTTTTGCTGT AAATGGAAAT 62 GTGAGTCCAG CAGCTGAGGC AAATGTAAGT TGCTAATAGC TACATTCTTT ATATGTGCTT 10021 |||||||||| |||||||||| |||| GTGAGTCCAG CAGCTGAGGC AAAT...... .......... .......... .......... 86 CAACTTCTAA GACTTTTTTT TTTATTCTTC GTTTGCGCAA GATCTTCAAA GATCCAACTG 9961 ||||||||| |||||||||| .......... .......... .......... .......... .ATCTTCAAA GATCCAACTG 105 CTGCTGATAT TGTATTTACA AGTGGTGCTG ATGTTCTTGC TGTTGGATTA AATATTACAC 9901 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGCTGATAT TGTATTTACA AGTGGTGCTG ATGTTCTTGC TGTTGGATTA AATATTACAC 165 ATCAAGTTGT CCTTACTGGT AATCTTTTCT TTGTTTCTTA AGTTTCATTT CAAGCCACTG 9841 |||||||||| |||||||| ATCAAGTTGT CCTTACTG.. .......... .......... .......... .......... 183 TCTCCGTTGT TATTTTAAGA AGAATGTGCT ATCGAATAAT AGAAACAAAT ACTAAGAGAG 9781 .......... .......... .......... .......... .......... .......... 183 TAGTGTAGAG GAGGCGCGAG AGTTTGGCTA TAGCAAGTTT TAGCAGATGT TGAGGTATAG 9721 .......... .......... .......... .......... .......... .......... 183 GCCAAAGAAG AACGGGAGGA TGTGATTAGA CAGGACATGA CTAGTTTTCA ACTCACTGAG 9661 .......... .......... .......... .......... .......... .......... 183 GACATGATCC TAGATAGAAA AGTATGGAGA TCAAGGATTA AGGTAGAAAG GTAGTAATCC 9601 .......... .......... .......... .......... .......... .......... 183 TATCTACACA TAAGTATCTT ATTCTCTATT TTTTACTACT ACCTGTTACT TCATTTGCTT 9541 .......... .......... .......... .......... .......... .......... 183 TGTTTATCTT GTTATTTTGC GGTCTTAGCT CCTAGGGTCC TGTTTTAGTT AACTTCTCTT 9481 .......... .......... .......... .......... .......... .......... 183 AATAGCATCT ATCTACTGTT GTTAGATTCT CATCGTGGCG AATTGGCAAA GTCCAATGGA 9421 ||||| |||||||||| |||||||||| |||||||||| .......... .......... .....ATTCT CATCGTGGCG AATTGGCAAA GTCCAATGGA 218 AAGTTTGCCA AGTACCTCAG CAAGCTTTTG GATGTCTATT TCGATTATCA CAATACTGCA 9361 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAGTTTGCCA AGTACCTCAG CAAGCTTTTG GATGTCTATT TCGATTATCA CAATACTGCA 278 TACAGCACGA GAGGTCTGTC TGTCACGTGC AAAGTTAGCT GTTTCCATAT TTCTCGTTTG 9301 |||||||||| ||| TACAGCACGA GAG....... .......... .......... .......... .......... 291 GTTTCTACTC ACTTTCTTTA TGTAACTGGA ATCGGGGCTC TATGTAGGTG TCTTCCTTCA 9241 ||| |||||||||| .......... .......... .......... .......... .......GTG TCTTCCTTCA 304 TGATCCAACT GCTTTACTTG CTGCTGTTAA TCCATCACTC CTCACCTATT CAGAAGGTGT 9181 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATCCAACT GCTTTACTTG CTGCTGTTAA TCCATCACTC CTCACCTATT CAGAAGGTGT 364 CGTTCGCGTT CAGACAGTTG GCATCACAAA GGGTCTCACA ATCTTTTATA ACAAACAGAA 9121 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGTTCGCGTT CAGACAGTTG GCATCACAAA GGGTCTCACA ATCTTTTATA ACAAACAGAA 424 AAGGTATTTG GTTCATAATG CCTTAATATA AGTGTTATAT TGTTGCTATA TTATCCCTAT 9061 ||| AAG....... .......... .......... .......... .......... .......... 427 AACTACATAC TTCATCTGCT TTTGGTTTTA CAAGGTTTGT TGAAGTCACT GAATGGTCTG 9001 |||||| |||||||||| |||||||||| .......... .......... .......... ....GTTTGT TGAAGTCACT GAATGGTCTG 453 AAAAACCTAT AGTTAAAGTG GCAGCAACTG TTGATGCTCC TAAAGTTATC GAATTGGTGA 8941 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAAACCTAT AGTTAAAGTG GCAGCAACTG TTGATGCTCC TAAAGTTATC GAATTGGTGA 513 TGACACGACT CGTCAATTCT TAGAAGTATT CATTGAACAT TTATTTGAAC AGTGTTACTG 8881 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGACACGACT CGTCAATTCT TAGAAGTATT CATTGAACAT TTATTTGAAC AGTGTTACTG 573 TCTTTCTGCA CTCAATATTC AGAAACTTAA CTACGCTATA AGGAACGGAA TCTTAAACTA 8821 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTTTCTGCA CTCAATATTC AGAAACTTAA CTACGCTATA AGGAACGGAA TCTTAAACTA 633 TAGACCAAGT ACTACTCTCA GGAACAAGGA TGAATGTATG TAAGCAGTAA GCAGCTCATA 8761 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGACCAAGT ACTACTCTCA GGAACAAGGA TGAATGTATG TAAGCAGTAA GCAGCTCATA 693 TACAGAATAT GAACTTGCAA AAATAAGTTA CAGCTAGACT AGTCGTGTTC TGATAGATTT 8701 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TACAGAATAT GAACTTGCAA AAATAAGTTA CAGCTAGACT AGTCGTGTTC TGATAGATTT 753 AATCTTTGTG TAAAGATATG ACATTTGAGC CTAACTCAA 8662 |||||||| | |||||||||| |||||||||| |||| || AATCTTTG-G TAAAGATATG ACATTTGAGC CTAAAAAAA 791 hqPGS_C12HBa0093P12.1-2-_SGN-E545754- (10140 10057,9979 9883,9455 9348,9253 9118,9026 8662) ******************************************************************************** EST sequence 3 -strand 494 n (File: SGN-E545761-) 1 AGCACGAGAG GTGTCTTCCT TCATGATCCA ACTGCTTTAC TTGCTGCTGT TAATCCATCA 61 CTCCTCACCT ATTCAGAAGG TGTCGTTAGC GTTCAGACAG TTGGCATCAC AAAGGGTCTC 121 ACAATCTTTT ATAACAAACA GAAAAGGTTT GTTGAAGTCA CTGAATGGTC TGAAAAACCT 181 ATAGTTAAAG TGGCAGCAAC TGTTGATGCT CCTAAAGTTA TCGAATTGGT GATGACACGA 241 CTCGTCAATT CTTAGAAGTA TTCATTGAAC ATTTATTTGA ACAGTGTTAC TGTCTTTCTG 301 CACTCAATAT TCAGAAACTT AACTACGCTA TAAGGAACGG AATCTTAAAC TATAGACCAA 361 GTACTACTCT CAGGAACAAG GATGAATGTA TGTAAGCAGT AAGCAGCTCA TATACAGAAT 421 ATGAACTTGC AAAAATAAGT TACAGCTAGA CTAGTCGTGT TCTGATAGAT TTAATCTTGG 481 GTAAAGATAT GACA Predicted gene structure (within gDNA segment 9945 to 7935): Exon 1 9357 9348 ( 10 n); cDNA 1 10 ( 10 n); score: 1.000 Intron 1 9347 9254 ( 94 n); Pd: 0.481 (s: 0), Pa: 0.982 (s: 1.00) Exon 2 9253 9118 ( 136 n); cDNA 11 146 ( 136 n); score: 0.993 Intron 2 9117 9027 ( 91 n); Pd: 0.984 (s: 1.00), Pa: 0.917 (s: 1.00) Exon 3 9026 8678 ( 349 n); cDNA 147 494 ( 348 n); score: 0.994 MATCH C12HBa0093P12.1-2- SGN-E545761- 0.994 495 1.002 C PGS_C12HBa0093P12.1-2-_SGN-E545761- (9357 9348,9253 9118,9026 8678) Alignment (genomic DNA sequence = upper lines): AGCACGAGAG GTCTGTCTGT CACGTGCAAA GTTAGCTGTT TCCATATTTC TCGTTTGGTT 9298 |||||||||| AGCACGAGAG .......... .......... .......... .......... .......... 10 TCTACTCACT TTCTTTATGT AACTGGAATC GGGGCTCTAT GTAGGTGTCT TCCTTCATGA 9238 |||||| |||||||||| .......... .......... .......... .......... ....GTGTCT TCCTTCATGA 26 TCCAACTGCT TTACTTGCTG CTGTTAATCC ATCACTCCTC ACCTATTCAG AAGGTGTCGT 9178 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCCAACTGCT TTACTTGCTG CTGTTAATCC ATCACTCCTC ACCTATTCAG AAGGTGTCGT 86 TCGCGTTCAG ACAGTTGGCA TCACAAAGGG TCTCACAATC TTTTATAACA AACAGAAAAG 9118 | |||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGCGTTCAG ACAGTTGGCA TCACAAAGGG TCTCACAATC TTTTATAACA AACAGAAAAG 146 GTATTTGGTT CATAATGCCT TAATATAAGT GTTATATTGT TGCTATATTA TCCCTATAAC 9058 .......... .......... .......... .......... .......... .......... 146 TACATACTTC ATCTGCTTTT GGTTTTACAA GGTTTGTTGA AGTCACTGAA TGGTCTGAAA 8998 ||||||||| |||||||||| |||||||||| .......... .......... .......... .GTTTGTTGA AGTCACTGAA TGGTCTGAAA 175 AACCTATAGT TAAAGTGGCA GCAACTGTTG ATGCTCCTAA AGTTATCGAA TTGGTGATGA 8938 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACCTATAGT TAAAGTGGCA GCAACTGTTG ATGCTCCTAA AGTTATCGAA TTGGTGATGA 235 CACGACTCGT CAATTCTTAG AAGTATTCAT TGAACATTTA TTTGAACAGT GTTACTGTCT 8878 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACGACTCGT CAATTCTTAG AAGTATTCAT TGAACATTTA TTTGAACAGT GTTACTGTCT 295 TTCTGCACTC AATATTCAGA AACTTAACTA CGCTATAAGG AACGGAATCT TAAACTATAG 8818 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCTGCACTC AATATTCAGA AACTTAACTA CGCTATAAGG AACGGAATCT TAAACTATAG 355 ACCAAGTACT ACTCTCAGGA ACAAGGATGA ATGTATGTAA GCAGTAAGCA GCTCATATAC 8758 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACCAAGTACT ACTCTCAGGA ACAAGGATGA ATGTATGTAA GCAGTAAGCA GCTCATATAC 415 AGAATATGAA CTTGCAAAAA TAAGTTACAG CTAGACTAGT CGTGTTCTGA TAGATTTAAT 8698 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAATATGAA CTTGCAAAAA TAAGTTACAG CTAGACTAGT CGTGTTCTGA TAGATTTAAT 475 CTTTGTGTAA AGATATGACA 8678 | ||| |||| |||||||||| C-TTGGGTAA AGATATGACA 494 hqPGS_C12HBa0093P12.1-2-_SGN-E545761- (9253 9118,9026 8678) ******************************************************************************** EST sequence 18 +strand 540 n (File: SGN-E330507+) 1 AGAACTAGTC TCGAGTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 61 TGGATAAAAC TATTGATAAT ATATTCAATT TCTGTTAGTT TACCCAAAAA AACTAAAATG 121 AAGCTTGAGG GAAAATTCTT TGTACATTTG GGTGACTTTA CTTAAATCCC TCAGGTTTTG 181 CTTCATAAAC ACGACGGAAA CCTTGTGAAA AAACATAACA AAAGCCACAG ATTTCCTGAA 241 AGATCTTTGC TTTGACAAAC ATTAACCGGA AAAACAAAAG AGACTCAATC CGCCGATTCA 301 ACTATATCAA CTCCAACTGG ATTGACAGTG TCTTTAAAAC CCTGCCATCA ATACTGCAAA 361 AAAAGAGCAC ATTTCCACTT TTCCATTTGG AAATGCTCAC ATAAGTAGCC ACTCGAGACC 421 ACATACACTC GTTATGCTGT TTTAATTGCA ACTGGCTTAC CTCGACCCAT GGAAAAACCC 481 CTGGTGCCAT CTGGCATGCG AGGCACAAAA GTTTGCTTGA CTGGTGCTGA CTGATCAAAA Predicted gene structure (within gDNA segment 9694 to 7315): Exon 1 8424 7952 ( 473 n); cDNA 68 540 ( 473 n); score: 0.964 PPA cDNA 50 15 MATCH C12HBa0093P12.1-2- SGN-E330507+ 0.964 473 0.876 C PGS_C12HBa0093P12.1-2-_SGN-E330507+ (8424 7952) Alignment (genomic DNA sequence = upper lines): AACTATTGAT AATATATTCA ATTTCTGTTA GTATACACAA AAAAACTAAG ATGAAGCTAG 8365 |||||||||| |||||||||| |||||||||| || ||| ||| ||||||||| |||||||| | AACTATTGAT AATATATTCA ATTTCTGTTA GTTTACCCAA AAAAACTAAA ATGAAGCTTG 127 AGGGAAGATT CTTTGTACAT TTGTGTGACT TTACTTAAAT CCCTCAGGTT TTGCTTCATA 8305 |||||| ||| |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| AGGGAAAATT CTTTGTACAT TTGGGTGACT TTACTTAAAT CCCTCAGGTT TTGCTTCATA 187 AACACGACGG AAACCTAGTG AAGAAACATA ACAAAAGCCA CAGATTTCCT GAAAGATCTT 8245 |||||||||| |||||| ||| || ||||||| |||||||||| |||||||||| |||||||||| AACACGACGG AAACCTTGTG AAAAAACATA ACAAAAGCCA CAGATTTCCT GAAAGATCTT 247 TGCTTTGACA AACATAAACA GGAAAAACAA AAGAGACTCA ATCCGCCGAT TCAACTATAT 8185 |||||||||| ||||| ||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTTTGACA AACATTAACC GGAAAAACAA AAGAGACTCA ATCCGCCGAT TCAACTATAT 307 CAACTCCAAC TGGATTGACA GTGTCTCTAA GACCCTGCCA TCAATACTGC AAAAAAAGAG 8125 |||||||||| |||||||||| |||||| ||| ||||||||| |||||||||| |||||||||| CAACTCCAAC TGGATTGACA GTGTCTTTAA AACCCTGCCA TCAATACTGC AAAAAAAGAG 367 CACATTTCCA CTTCTCCATT TGGAAATGCT CACATAAGTA GCCACTCGAG ACCACATACA 8065 |||||||||| ||| |||||| |||||||||| |||||||||| |||||||||| |||||||||| CACATTTCCA CTTTTCCATT TGGAAATGCT CACATAAGTA GCCACTCGAG ACCACATACA 427 CTCGTTATGC TGTTTTAATT GCAACTGGCT TACCTCGACC CATGGAGAAG CCCCTGGTGC 8005 |||||||||| |||||||||| |||||||||| |||||||||| |||||| || |||||||||| CTCGTTATGC TGTTTTAATT GCAACTGGCT TACCTCGACC CATGGAAAAA CCCCTGGTGC 487 CATCTGGCAT GCGAGGCACA GAAGTTTGCT TGACTGGTGC TGACTGATCA GAA 7952 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| || CATCTGGCAT GCGAGGCACA AAAGTTTGCT TGACTGGTGC TGACTGATCA AAA 540 hqPGS_C12HBa0093P12.1-2-_SGN-E330507+ (8424 7952) ******************************************************************************** EST sequence 16 +strand 667 n (File: SGN-E545755+) 1 CTATAATGAA TCCCTTTTTT TAATTCATTT GCATAATCAA TAAAATGGTA GAAGAAATCA 61 AGAAAATCAT CATTGATACT GACCCTGGAA TTGATGATGC AATCGCGATT TTCGTGGCAC 121 TTCAATCTCC TGAAATTGAA GTAATTGGAC TCACAACAAT ATTTGGTAAT GTTCAGACAA 181 CTCTGTCAAC CAGAAATGCT TTACATCTGT TGGAGATAGC TGGGAGGACA GATATTCCAG 241 TGGCTGAAGG CTCACACGTT ACGATCACTG AAGGCGTAAA ACTTCAAAGT AGTGGATATG 301 TTCATGGCGC GGATGGACTC GGAAACCAAA ACATTTCTCC ACCCAAAGGA AAGGCTATTG 361 AACAGACTGC AGCTGAATTT CTCATTCAGC AAACTAGTCT TTACCCTGGA AAGGTCACTG 421 TTGTGGCCTT AGGCCCCCTG ACAAATATAG CACTTGCTAT TCAGTTGGAT CCTGAATTTT 481 TCAAAAACGT TGGTCAAGTT GTTGTTCTTG GTGGTGCTTT TGCTGTAAAT GGAAATGTGA 541 GTCCAGCAGC TGAGGCAAAT ATCTTCAAAG ATCCAACTGC TGCTGATATT GTATTTACAA 601 GTGGTGCTGA TGTTCTTGCT GTTGGATTAA ATATTACACA TCAAGTTGTC CTTACTGGAT 661 CTCATCG Predicted gene structure (within gDNA segment 12090 to 9182): Exon 1 11490 11398 ( 93 n); cDNA 1 93 ( 93 n); score: 1.000 Intron 1 11397 11054 ( 344 n); Pd: 0.998 (s: 1.00), Pa: 0.963 (s: 1.00) Exon 2 11053 10938 ( 116 n); cDNA 94 209 ( 116 n); score: 1.000 Intron 2 10937 10695 ( 243 n); Pd: 0.922 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 3 10694 10635 ( 60 n); cDNA 210 269 ( 60 n); score: 1.000 Intron 3 10634 10434 ( 201 n); Pd: 0.996 (s: 1.00), Pa: 0.000 (s: 1.00) Exon 4 10433 10248 ( 186 n); cDNA 270 455 ( 186 n); score: 1.000 Intron 4 10247 10162 ( 86 n); Pd: 0.910 (s: 1.00), Pa: 0.994 (s: 1.00) Exon 5 10161 10057 ( 105 n); cDNA 456 560 ( 105 n); score: 1.000 Intron 5 10056 9980 ( 77 n); Pd: 1.000 (s: 1.00), Pa: 0.684 (s: 1.00) Exon 6 9979 9883 ( 97 n); cDNA 561 657 ( 97 n); score: 1.000 Intron 6 9882 9456 ( 427 n); Pd: 0.962 (s: 1.00), Pa: 0.666 (s: 0) Exon 7 9455 9446 ( 10 n); cDNA 658 667 ( 10 n); score: 0.800 MATCH C12HBa0093P12.1-2- SGN-E545755+ 1.000 667 1.000 C PGS_C12HBa0093P12.1-2-_SGN-E545755+ (11490 11398,11053 10938,10694 10635,10433 10248,10161 10057,9979 9883,9455 9446) Alignment (genomic DNA sequence = upper lines): CTATAATGAA TCCCTTTTTT TAATTCATTT GCATAATCAA TAAAATGGTA GAAGAAATCA 11431 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTATAATGAA TCCCTTTTTT TAATTCATTT GCATAATCAA TAAAATGGTA GAAGAAATCA 60 AGAAAATCAT CATTGATACT GACCCTGGAA TTGGTAATTT TGCTTTTATT ACAACTATTT 11371 |||||||||| |||||||||| |||||||||| ||| AGAAAATCAT CATTGATACT GACCCTGGAA TTG....... .......... .......... 93 CTCGTAGTTT TTTGTTCTTC GATTTTGATT ATTACCTGTG TTTTTTGTGC TTTGATTGTC 11311 .......... .......... .......... .......... .......... .......... 93 ACATTATTTC GTTGTCATTG CTATTCTTTC TTTCTTATGT TGTGTACTTG CTCCCATGTC 11251 .......... .......... .......... .......... .......... .......... 93 ACACGTCTAA TCGTGGTTAC CACTCTACGA GTTAGATAAG ATCTGTGTAC ATTCTATCCT 11191 .......... .......... .......... .......... .......... .......... 93 CGTCATATTA CACTATGGTA TCTTCGTGTT TTTGTTTCTT CATAGTAAGA GCATTTCAAG 11131 .......... .......... .......... .......... .......... .......... 93 ATTTCATTTT TTTTTTGTGA TTACCAATTC AACTGTTGAG TTATTTGAGA TTTTATATGA 11071 .......... .......... .......... .......... .......... .......... 93 ATTAATATCA TGTTTAGATG ATGCAATCGC GATTTTCGTG GCACTTCAAT CTCCTGAAAT 11011 ||| |||||||||| |||||||||| |||||||||| |||||||||| .......... .......ATG ATGCAATCGC GATTTTCGTG GCACTTCAAT CTCCTGAAAT 136 TGAAGTAATT GGACTCACAA CAATATTTGG TAATGTTCAG ACAACTCTGT CAACCAGAAA 10951 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAAGTAATT GGACTCACAA CAATATTTGG TAATGTTCAG ACAACTCTGT CAACCAGAAA 196 TGCTTTACAT CTGGTAAAGT TCTATCTTTA TTTCATTATT GGACACGGAG TTTAAGAAAG 10891 |||||||||| ||| TGCTTTACAT CTG....... .......... .......... .......... .......... 209 AACGACTTTC CATACATAGT AAATGACTAG AAATATCACT CATGATAATA TGTGATGTTC 10831 .......... .......... .......... .......... .......... .......... 209 AGTTTTCAAA TATAGAAATG TGTCGTTCTT TTTAGAACAG ACTAAAAAGG AAAGGGTAAC 10771 .......... .......... .......... .......... .......... .......... 209 ATATAAAATG AAAAACGTAC GTTTAGTTCT TTTGCCATGA GCATGAATGT TGATTTTTCG 10711 .......... .......... .......... .......... .......... .......... 209 AGTTTTGATG ATTTAGTTGG AGATAGCTGG GAGGACAGAT ATTCCAGTGG CTGAAGGCTC 10651 |||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ......TTGG AGATAGCTGG GAGGACAGAT ATTCCAGTGG CTGAAGGCTC 253 ACACGTTACG ATCACTGTAT GTTCAGTTCT CTGTCTTATT GCTCATCCAT ATTACAATAC 10591 |||||||||| |||||| ACACGTTACG ATCACT.... .......... .......... .......... .......... 269 AAACAAATTG ATACGGAGGG AGTATAAGAT TTACAGAACG AAATGAACAC TTTGTAATGT 10531 .......... .......... .......... .......... .......... .......... 269 TTACAGTATT TCTGATTGTG TACCTTTGCT CATTCAGCAT ATTTAGACCT AGCTTAGAAA 10471 .......... .......... .......... .......... .......... .......... 269 TTTTTCCCCC GCGGACTCTG TTTATGTTCA TGTAAAGGAA GGCGTAAAAC TTCAAAGTAG 10411 ||| |||||||||| |||||||||| .......... .......... .......... .......GAA GGCGTAAAAC TTCAAAGTAG 292 TGGATATGTT CATGGCGCGG ATGGACTCGG AAACCAAAAC ATTTCTCCAC CCAAAGGAAA 10351 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGATATGTT CATGGCGCGG ATGGACTCGG AAACCAAAAC ATTTCTCCAC CCAAAGGAAA 352 GGCTATTGAA CAGACTGCAG CTGAATTTCT CATTCAGCAA ACTAGTCTTT ACCCTGGAAA 10291 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGCTATTGAA CAGACTGCAG CTGAATTTCT CATTCAGCAA ACTAGTCTTT ACCCTGGAAA 412 GGTCACTGTT GTGGCCTTAG GCCCCCTGAC AAATATAGCA CTTGTAAGTA CGAGATAATG 10231 |||||||||| |||||||||| |||||||||| |||||||||| ||| GGTCACTGTT GTGGCCTTAG GCCCCCTGAC AAATATAGCA CTT....... .......... 455 GTCAATAACG CACTTGAAGT ATTACTTGTT TGCGAGTTTC CTACCTGATA TATGTATATA 10171 .......... .......... .......... .......... .......... .......... 455 TATGAACAGG CTATTCAGTT GGATCCTGAA TTTTTCAAAA ACGTTGGTCA AGTTGTTGTT 10111 | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .........G CTATTCAGTT GGATCCTGAA TTTTTCAAAA ACGTTGGTCA AGTTGTTGTT 506 CTTGGTGGTG CTTTTGCTGT AAATGGAAAT GTGAGTCCAG CAGCTGAGGC AAATGTAAGT 10051 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| CTTGGTGGTG CTTTTGCTGT AAATGGAAAT GTGAGTCCAG CAGCTGAGGC AAAT...... 560 TGCTAATAGC TACATTCTTT ATATGTGCTT CAACTTCTAA GACTTTTTTT TTTATTCTTC 9991 .......... .......... .......... .......... .......... .......... 560 GTTTGCGCAA GATCTTCAAA GATCCAACTG CTGCTGATAT TGTATTTACA AGTGGTGCTG 9931 ||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... .ATCTTCAAA GATCCAACTG CTGCTGATAT TGTATTTACA AGTGGTGCTG 609 ATGTTCTTGC TGTTGGATTA AATATTACAC ATCAAGTTGT CCTTACTGGT AATCTTTTCT 9871 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| ATGTTCTTGC TGTTGGATTA AATATTACAC ATCAAGTTGT CCTTACTG.. .......... 657 TTGTTTCTTA AGTTTCATTT CAAGCCACTG TCTCCGTTGT TATTTTAAGA AGAATGTGCT 9811 .......... .......... .......... .......... .......... .......... 657 ATCGAATAAT AGAAACAAAT ACTAAGAGAG TAGTGTAGAG GAGGCGCGAG AGTTTGGCTA 9751 .......... .......... .......... .......... .......... .......... 657 TAGCAAGTTT TAGCAGATGT TGAGGTATAG GCCAAAGAAG AACGGGAGGA TGTGATTAGA 9691 .......... .......... .......... .......... .......... .......... 657 CAGGACATGA CTAGTTTTCA ACTCACTGAG GACATGATCC TAGATAGAAA AGTATGGAGA 9631 .......... .......... .......... .......... .......... .......... 657 TCAAGGATTA AGGTAGAAAG GTAGTAATCC TATCTACACA TAAGTATCTT ATTCTCTATT 9571 .......... .......... .......... .......... .......... .......... 657 TTTTACTACT ACCTGTTACT TCATTTGCTT TGTTTATCTT GTTATTTTGC GGTCTTAGCT 9511 .......... .......... .......... .......... .......... .......... 657 CCTAGGGTCC TGTTTTAGTT AACTTCTCTT AATAGCATCT ATCTACTGTT GTTAGATTCT 9451 ||| .......... .......... .......... .......... .......... .....GATCT 662 CATCG 9446 ||||| CATCG 667 hqPGS_C12HBa0093P12.1-2-_SGN-E545755+ (11490 11398,11053 10938,10694 10635,10433 10248,10161 10057,9979 9883) ******************************************************************************** EST sequence 17 +strand 543 n (File: SGN-E545762+) 1 CTATAATGAA TCCCTTTTTT TAATTCATTT GCATAATCAA TAAAATGGTA GAAGAAATCA 61 AGAAAATCAT CATTGATACT GACCCTGGAA TTGATGATGC AATCGCGATT TTCGTGGCAC 121 TTCAATCTCC TGAAATTGAA GTAATTGGAC TCACAACAAT ATTTGGTAAT GTTCAGACAA 181 CTCTGTCAAC CAGAAATGCT TTACATCTGT TGGAGATAGC TGGGAGGACA GATATTCCAG 241 TGGCTGAAGG CTCACACGTT ACGATCACTG AAGGCGTAAA ACTTCAAAGT AGTGGATATG 301 TTCATGGCGC GGATGGACTC GGAAACCAAA ACATTTCTCC ACCCAAAGGA AAGGCTATTG 361 AACAGACTGC AGCTGAATTT CTCATTCAGC AAACTAGTCT TTACCCTGGA AAGGTCACTG 421 TTGTGGCCTT AGGCCCCCTG ACAAATATAG CACTTGCTAT TCAGTTGGAT CCTGAATTTT 481 TCAAAAACGT TGGTCAAGTT GTTGTTCTTG GTGGGGCTTT TGCTGTAAAT GGAAATGTGA 541 GTC Predicted gene structure (within gDNA segment 12090 to 9464): Exon 1 11490 11398 ( 93 n); cDNA 1 93 ( 93 n); score: 1.000 Intron 1 11397 11054 ( 344 n); Pd: 0.998 (s: 1.00), Pa: 0.963 (s: 1.00) Exon 2 11053 10938 ( 116 n); cDNA 94 209 ( 116 n); score: 1.000 Intron 2 10937 10695 ( 243 n); Pd: 0.922 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 3 10694 10635 ( 60 n); cDNA 210 269 ( 60 n); score: 1.000 Intron 3 10634 10434 ( 201 n); Pd: 0.996 (s: 1.00), Pa: 0.000 (s: 1.00) Exon 4 10433 10248 ( 186 n); cDNA 270 455 ( 186 n); score: 1.000 Intron 4 10247 10162 ( 86 n); Pd: 0.910 (s: 1.00), Pa: 0.994 (s: 1.00) Exon 5 10161 10074 ( 88 n); cDNA 456 543 ( 88 n); score: 0.989 MATCH C12HBa0093P12.1-2- SGN-E545762+ 0.998 543 1.000 C PGS_C12HBa0093P12.1-2-_SGN-E545762+ (11490 11398,11053 10938,10694 10635,10433 10248,10161 10074) Alignment (genomic DNA sequence = upper lines): CTATAATGAA TCCCTTTTTT TAATTCATTT GCATAATCAA TAAAATGGTA GAAGAAATCA 11431 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTATAATGAA TCCCTTTTTT TAATTCATTT GCATAATCAA TAAAATGGTA GAAGAAATCA 60 AGAAAATCAT CATTGATACT GACCCTGGAA TTGGTAATTT TGCTTTTATT ACAACTATTT 11371 |||||||||| |||||||||| |||||||||| ||| AGAAAATCAT CATTGATACT GACCCTGGAA TTG....... .......... .......... 93 CTCGTAGTTT TTTGTTCTTC GATTTTGATT ATTACCTGTG TTTTTTGTGC TTTGATTGTC 11311 .......... .......... .......... .......... .......... .......... 93 ACATTATTTC GTTGTCATTG CTATTCTTTC TTTCTTATGT TGTGTACTTG CTCCCATGTC 11251 .......... .......... .......... .......... .......... .......... 93 ACACGTCTAA TCGTGGTTAC CACTCTACGA GTTAGATAAG ATCTGTGTAC ATTCTATCCT 11191 .......... .......... .......... .......... .......... .......... 93 CGTCATATTA CACTATGGTA TCTTCGTGTT TTTGTTTCTT CATAGTAAGA GCATTTCAAG 11131 .......... .......... .......... .......... .......... .......... 93 ATTTCATTTT TTTTTTGTGA TTACCAATTC AACTGTTGAG TTATTTGAGA TTTTATATGA 11071 .......... .......... .......... .......... .......... .......... 93 ATTAATATCA TGTTTAGATG ATGCAATCGC GATTTTCGTG GCACTTCAAT CTCCTGAAAT 11011 ||| |||||||||| |||||||||| |||||||||| |||||||||| .......... .......ATG ATGCAATCGC GATTTTCGTG GCACTTCAAT CTCCTGAAAT 136 TGAAGTAATT GGACTCACAA CAATATTTGG TAATGTTCAG ACAACTCTGT CAACCAGAAA 10951 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGAAGTAATT GGACTCACAA CAATATTTGG TAATGTTCAG ACAACTCTGT CAACCAGAAA 196 TGCTTTACAT CTGGTAAAGT TCTATCTTTA TTTCATTATT GGACACGGAG TTTAAGAAAG 10891 |||||||||| ||| TGCTTTACAT CTG....... .......... .......... .......... .......... 209 AACGACTTTC CATACATAGT AAATGACTAG AAATATCACT CATGATAATA TGTGATGTTC 10831 .......... .......... .......... .......... .......... .......... 209 AGTTTTCAAA TATAGAAATG TGTCGTTCTT TTTAGAACAG ACTAAAAAGG AAAGGGTAAC 10771 .......... .......... .......... .......... .......... .......... 209 ATATAAAATG AAAAACGTAC GTTTAGTTCT TTTGCCATGA GCATGAATGT TGATTTTTCG 10711 .......... .......... .......... .......... .......... .......... 209 AGTTTTGATG ATTTAGTTGG AGATAGCTGG GAGGACAGAT ATTCCAGTGG CTGAAGGCTC 10651 |||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ......TTGG AGATAGCTGG GAGGACAGAT ATTCCAGTGG CTGAAGGCTC 253 ACACGTTACG ATCACTGTAT GTTCAGTTCT CTGTCTTATT GCTCATCCAT ATTACAATAC 10591 |||||||||| |||||| ACACGTTACG ATCACT.... .......... .......... .......... .......... 269 AAACAAATTG ATACGGAGGG AGTATAAGAT TTACAGAACG AAATGAACAC TTTGTAATGT 10531 .......... .......... .......... .......... .......... .......... 269 TTACAGTATT TCTGATTGTG TACCTTTGCT CATTCAGCAT ATTTAGACCT AGCTTAGAAA 10471 .......... .......... .......... .......... .......... .......... 269 TTTTTCCCCC GCGGACTCTG TTTATGTTCA TGTAAAGGAA GGCGTAAAAC TTCAAAGTAG 10411 ||| |||||||||| |||||||||| .......... .......... .......... .......GAA GGCGTAAAAC TTCAAAGTAG 292 TGGATATGTT CATGGCGCGG ATGGACTCGG AAACCAAAAC ATTTCTCCAC CCAAAGGAAA 10351 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGATATGTT CATGGCGCGG ATGGACTCGG AAACCAAAAC ATTTCTCCAC CCAAAGGAAA 352 GGCTATTGAA CAGACTGCAG CTGAATTTCT CATTCAGCAA ACTAGTCTTT ACCCTGGAAA 10291 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGCTATTGAA CAGACTGCAG CTGAATTTCT CATTCAGCAA ACTAGTCTTT ACCCTGGAAA 412 GGTCACTGTT GTGGCCTTAG GCCCCCTGAC AAATATAGCA CTTGTAAGTA CGAGATAATG 10231 |||||||||| |||||||||| |||||||||| |||||||||| ||| GGTCACTGTT GTGGCCTTAG GCCCCCTGAC AAATATAGCA CTT....... .......... 455 GTCAATAACG CACTTGAAGT ATTACTTGTT TGCGAGTTTC CTACCTGATA TATGTATATA 10171 .......... .......... .......... .......... .......... .......... 455 TATGAACAGG CTATTCAGTT GGATCCTGAA TTTTTCAAAA ACGTTGGTCA AGTTGTTGTT 10111 | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| .........G CTATTCAGTT GGATCCTGAA TTTTTCAAAA ACGTTGGTCA AGTTGTTGTT 506 CTTGGTGGTG CTTTTGCTGT AAATGGAAAT GTGAGTC 10074 |||||||| | |||||||||| |||||||||| ||||||| CTTGGTGGGG CTTTTGCTGT AAATGGAAAT GTGAGTC 543 hqPGS_C12HBa0093P12.1-2-_SGN-E545762+ (11490 11398,11053 10938,10694 10635,10433 10248,10161 10074) ******************************************************************************** EST sequence 14 +strand 440 n (File: SGN-E312682+) 1 TATAATGAAT CCCTTTTTTT AATTCATTTG CATAATCAAT AAAATGGTAG AAGAAATCAA 61 GAAAATCATC ATTGATACTG ACCCTGGAAT TGATGATGCA ATCGCGATTT TCGTGGCACT 121 TCAATCTCCT GAAATTGAAG TAATTGGACT CACAACAATA TTTGGTAATG TTCAGACAAC 181 TCTGTCAACC AGAAATGCTT TACATCTGTT GGAGATAGCT GGGAGGACAG ATATTCCAGT 241 GGCTGAAGGC TCACACGTTA CGATCACTGA AGGCGTAAAA CTTCAAAGTA GTGGATATGT 301 TCATGGCGCG GATGGACTCG GAAACCAAAA CATTTCTCCA CCCAAAGGAA AGGCTATTGA 361 ACAGACTGCA GCTGAATTTC TCATTCAGCA AACTAGTCTT TACCCTGGAA AGGTCACTGT 421 TGTGGCCTTA AGCCCCCTGA Predicted gene structure (within gDNA segment 12089 to 9562): Exon 1 11489 11398 ( 92 n); cDNA 1 92 ( 92 n); score: 1.000 Intron 1 11397 11054 ( 344 n); Pd: 0.998 (s: 1.00), Pa: 0.963 (s: 1.00) Exon 2 11053 10938 ( 116 n); cDNA 93 208 ( 116 n); score: 1.000 Intron 2 10937 10695 ( 243 n); Pd: 0.922 (s: 1.00), Pa: 0.997 (s: 1.00) Exon 3 10694 10635 ( 60 n); cDNA 209 268 ( 60 n); score: 1.000 Intron 3 10634 10434 ( 201 n); Pd: 0.996 (s: 1.00), Pa: 0.000 (s: 1.00) Exon 4 10433 10262 ( 172 n); cDNA 269 440 ( 172 n); score: 0.994 MATCH C12HBa0093P12.1-2- SGN-E312682+ 0.998 440 1.000 C PGS_C12HBa0093P12.1-2-_SGN-E312682+ (11489 11398,11053 10938,10694 10635,10433 10262) Alignment (genomic DNA sequence = upper lines): TATAATGAAT CCCTTTTTTT AATTCATTTG CATAATCAAT AAAATGGTAG AAGAAATCAA 11430 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATAATGAAT CCCTTTTTTT AATTCATTTG CATAATCAAT AAAATGGTAG AAGAAATCAA 60 GAAAATCATC ATTGATACTG ACCCTGGAAT TGGTAATTTT GCTTTTATTA CAACTATTTC 11370 |||||||||| |||||||||| |||||||||| || GAAAATCATC ATTGATACTG ACCCTGGAAT TG........ .......... .......... 92 TCGTAGTTTT TTGTTCTTCG ATTTTGATTA TTACCTGTGT TTTTTGTGCT TTGATTGTCA 11310 .......... .......... .......... .......... .......... .......... 92 CATTATTTCG TTGTCATTGC TATTCTTTCT TTCTTATGTT GTGTACTTGC TCCCATGTCA 11250 .......... .......... .......... .......... .......... .......... 92 CACGTCTAAT CGTGGTTACC ACTCTACGAG TTAGATAAGA TCTGTGTACA TTCTATCCTC 11190 .......... .......... .......... .......... .......... .......... 92 GTCATATTAC ACTATGGTAT CTTCGTGTTT TTGTTTCTTC ATAGTAAGAG CATTTCAAGA 11130 .......... .......... .......... .......... .......... .......... 92 TTTCATTTTT TTTTTGTGAT TACCAATTCA ACTGTTGAGT TATTTGAGAT TTTATATGAA 11070 .......... .......... .......... .......... .......... .......... 92 TTAATATCAT GTTTAGATGA TGCAATCGCG ATTTTCGTGG CACTTCAATC TCCTGAAATT 11010 |||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ......ATGA TGCAATCGCG ATTTTCGTGG CACTTCAATC TCCTGAAATT 136 GAAGTAATTG GACTCACAAC AATATTTGGT AATGTTCAGA CAACTCTGTC AACCAGAAAT 10950 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGTAATTG GACTCACAAC AATATTTGGT AATGTTCAGA CAACTCTGTC AACCAGAAAT 196 GCTTTACATC TGGTAAAGTT CTATCTTTAT TTCATTATTG GACACGGAGT TTAAGAAAGA 10890 |||||||||| || GCTTTACATC TG........ .......... .......... .......... .......... 208 ACGACTTTCC ATACATAGTA AATGACTAGA AATATCACTC ATGATAATAT GTGATGTTCA 10830 .......... .......... .......... .......... .......... .......... 208 GTTTTCAAAT ATAGAAATGT GTCGTTCTTT TTAGAACAGA CTAAAAAGGA AAGGGTAACA 10770 .......... .......... .......... .......... .......... .......... 208 TATAAAATGA AAAACGTACG TTTAGTTCTT TTGCCATGAG CATGAATGTT GATTTTTCGA 10710 .......... .......... .......... .......... .......... .......... 208 GTTTTGATGA TTTAGTTGGA GATAGCTGGG AGGACAGATA TTCCAGTGGC TGAAGGCTCA 10650 ||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... .....TTGGA GATAGCTGGG AGGACAGATA TTCCAGTGGC TGAAGGCTCA 253 CACGTTACGA TCACTGTATG TTCAGTTCTC TGTCTTATTG CTCATCCATA TTACAATACA 10590 |||||||||| ||||| CACGTTACGA TCACT..... .......... .......... .......... .......... 268 AACAAATTGA TACGGAGGGA GTATAAGATT TACAGAACGA AATGAACACT TTGTAATGTT 10530 .......... .......... .......... .......... .......... .......... 268 TACAGTATTT CTGATTGTGT ACCTTTGCTC ATTCAGCATA TTTAGACCTA GCTTAGAAAT 10470 .......... .......... .......... .......... .......... .......... 268 TTTTCCCCCG CGGACTCTGT TTATGTTCAT GTAAAGGAAG GCGTAAAACT TCAAAGTAGT 10410 |||| |||||||||| |||||||||| .......... .......... .......... ......GAAG GCGTAAAACT TCAAAGTAGT 292 GGATATGTTC ATGGCGCGGA TGGACTCGGA AACCAAAACA TTTCTCCACC CAAAGGAAAG 10350 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGATATGTTC ATGGCGCGGA TGGACTCGGA AACCAAAACA TTTCTCCACC CAAAGGAAAG 352 GCTATTGAAC AGACTGCAGC TGAATTTCTC ATTCAGCAAA CTAGTCTTTA CCCTGGAAAG 10290 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTATTGAAC AGACTGCAGC TGAATTTCTC ATTCAGCAAA CTAGTCTTTA CCCTGGAAAG 412 GTCACTGTTG TGGCCTTAGG CCCCCTGA 10262 |||||||||| |||||||| | |||||||| GTCACTGTTG TGGCCTTAAG CCCCCTGA 440 hqPGS_C12HBa0093P12.1-2-_SGN-E312682+ (11489 11398,11053 10938,10694 10635,10433 10262) Total number of EST alignments reported: 19 ________________________________________________________________________________ Predicted gene locations (2) in segment 1 to 17133: PGL 1 (- strand): 2422 1 AGS-1 (2422 2038,445 293,145 1) SCR (e 1.000 d 0.984 a 0.997,e 1.000 d 0.997 a 0.991,e 1.000) Exon 1 2422 2038 ( 385 n); score: 1.000 Intron 1 2037 446 (1592 n); Pd: 0.984 Pa: 0.997 Exon 2 445 293 ( 153 n); score: 1.000 Intron 2 292 146 ( 147 n); Pd: 0.997 Pa: 0.991 Exon 3 145 1 ( 145 n); score: 1.000 PGS (422 293,145 1) SGN-E243027+ PGS (307 293,145 1) SGN-E395667- PGS (303 293,145 1) SGN-E254519+ PGS (2422 2038,445 293,145 65) SGN-E395668+ PGS (2421 2038,445 293,145 66) SGN-E318076+ PGS (2417 2038,445 293,145 70) SGN-E306854+ PGS (2248 2038,445 293,145 89) SGN-E321151+ PGS (2421 2038,445 293,145 117) SGN-E210308+ PGS (2409 2038,445 299) SGN-E287725+ PGS (2422 2038,445 316) SGN-E307480+ PGS (2419 2038) SGN-E303502+ PGS (2414 2048) SGN-E331254+ PGS (2372 2129) SGN-E307348+ 3-phase translation of AGS-1 (-strand): . . . . . . 2422 AAAAGATCAAAAAAGTTAGAACTCAAAGAACAAGTAACATTGATTTCTACACAATAATTT K R S K K L E L K E Q V T L I S T Q - F K D Q K S - N S K N K - H - F L H N N L K I K K V R T Q R T S N I D F Y T I I . . . . . . 2362 GTCTCTTTTTGTCTATTTGATCATCGCTGAGGACTCACCGATCCATTAATTTCGTATTTT V S F C L F D H R - G L T D P L I S Y F S L F V Y L I I A E D S P I H - F R I F C L F L S I - S S L R T H R S I N F V F . . . . . . 2302 CATATTGTTTTTTTCAGCTTATAAGGAGATCAATTACGAATAATCTAAAGGGTCTTTGCA H I V F F S L - G D Q L R I I - R V F A I L F F S A Y K E I N Y E - S K G S L H S Y C F F Q L I R R S I T N N L K G L C . . . . . . 2242 TATAGAGATGCAAAAGACCTCATTTTTGTTGTAAATTTTTTATTTTTTTGTTGTGGGGTT Y R D A K D L I F V V N F L F F C C G V I E M Q K T S F L L - I F Y F F V V G L I - R C K R P H F C C K F F I F L L W G . . . . . . 2182 GTTCAATTTCGTATTTTTTTTTTGTGGGTTTTGAATTTGGGAGATGGGAAATATATTTGT V Q F R I F F L W V L N L G D G K Y I C F N F V F F F C G F - I W E M G N I F V C S I S Y F F F V G F E F G R W E I Y L . . . . . . 2122 GAAGAAACCGAAGATCACCGAAGTTGATAGAGCGATTTTGTCTTTGAAGACTCAAAGGCG E E T E D H R S - - S D F V F E D S K A K K P K I T E V D R A I L S L K T Q R R - R N R R S P K L I E R F C L - R L K G . . . : . . . 2062 TAAGCTTGCTCAATATCAGCAACAG : CTGGATGCTGTTATTGAAGCCGAAAAACAAGCTGC - A C S I S A T : A G C C Y - S R K T S C K L A Q Y Q Q Q : L D A V I E A E K Q A A V S L L N I S N S : W M L L L K P K N K L . . . . . . 410 AAAAGACTTGCTACGCGAAAAGAAGAAAGAGAGGGCCCTTTTAGCATTGAAAAAGAAGAA K R L A T R K E E R E G P F S I E K E E K D L L R E K K K E R A L L A L K K K K Q K T C Y A K R R K R G P F - H - K R R . . . . . . : 350 AGTGCAAGAAGAACTATTAAAGCAAGTTGATGTTTGGCTCATAAATGTTGAGCAGCAA : TT S A R R T I K A S - C L A H K C - A A : I V Q E E L L K Q V D V W L I N V E Q Q : L K C K K N Y - S K L M F G S - M L S S N : . . . . . . 143 GGCAGATATTGAACTGACAAGCAAGCAAAAGGCTGTCTTTGAGAGTTTGAAAACTGGGAA G R Y - T D K Q A K G C L - E F E N W E A D I E L T S K Q K A V F E S L K T G N W Q I L N - Q A S K R L S L R V - K L G . . . . . . 83 TAATGCAATCAAAGCAATACAAGGTGAGATCAATCTAGAGGATGTTCAAAAGTTAATGGA - C N Q S N T R - D Q S R G C S K V N G N A I K A I Q G E I N L E D V Q K L M D I M Q S K Q Y K V R S I - R M F K S - W . . . 23 TGATACTGCAGAGGCAAAAGCTT - Y C R G K S D T A E A K A M I L Q R Q K L Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-2-_PGL-1_AGS-1_PPS_1 (2148 2038,445 293,145 2) (frame '2'; 408 bp, 136 residues) 1 IWEMGNIFVK KPKITEVDRA ILSLKTQRRK LAQYQQQLDA VIEAEKQAAK DLLREKKKER 61 ALLALKKKKV QEELLKQVDV WLINVEQQLA DIELTSKQKA VFESLKTGNN AIKAIQGEIN 121 LEDVQKLMDD TAEAKA PGL 2 (- strand): 11490 7952 AGS-1 (8424 7952) SCR (e 0.964) Exon 1 8424 7952 ( 473 n); score: 0.964 PGS (8424 7952) SGN-E330507+ 3-phase translation of AGS-1 (-strand): . . . . . . 8424 AACTATTGATAATATATTCAATTTCTGTTAGTATACACAAAAAAACTAAGATGAAGCTAG N Y - - Y I Q F L L V Y T K K L R - S - T I D N I F N F C - Y T Q K N - D E A R L L I I Y S I S V S I H K K T K M K L . . . . . . 8364 AGGGAAGATTCTTTGTACATTTGTGTGACTTTACTTAAATCCCTCAGGTTTTGCTTCATA R E D S L Y I C V T L L K S L R F C F I G K I L C T F V - L Y L N P S G F A S - E G R F F V H L C D F T - I P Q V L L H . . . . . . 8304 AACACGACGGAAACCTAGTGAAGAAACATAACAAAAGCCACAGATTTCCTGAAAGATCTT N T T E T - - R N I T K A T D F L K D L T R R K P S E E T - Q K P Q I S - K I F K H D G N L V K K H N K S H R F P E R S . . . . . . 8244 TGCTTTGACAAACATAAACAGGAAAAACAAAAGAGACTCAATCCGCCGATTCAACTATAT C F D K H K Q E K Q K R L N P P I Q L Y A L T N I N R K N K R D S I R R F N Y I L L - Q T - T G K T K E T Q S A D S T I . . . . . . 8184 CAACTCCAACTGGATTGACAGTGTCTCTAAGACCCTGCCATCAATACTGCAAAAAAAGAG Q L Q L D - Q C L - D P A I N T A K K E N S N W I D S V S K T L P S I L Q K K S S T P T G L T V S L R P C H Q Y C K K R . . . . . . 8124 CACATTTCCACTTCTCCATTTGGAAATGCTCACATAAGTAGCCACTCGAGACCACATACA H I S T S P F G N A H I S S H S R P H T T F P L L H L E M L T - V A T R D H I H A H F H F S I W K C S H K - P L E T T Y . . . . . . 8064 CTCGTTATGCTGTTTTAATTGCAACTGGCTTACCTCGACCCATGGAGAAGCCCCTGGTGC L V M L F - L Q L A Y L D P W R S P W C S L C C F N C N W L T S T H G E A P G A T R Y A V L I A T G L P R P M E K P L V . . . . . . 8004 CATCTGGCATGCGAGGCACAGAAGTTTGCTTGACTGGTGCTGACTGATCAGAA H L A C E A Q K F A - L V L T D Q I W H A R H R S L L D W C - L I R P S G M R G T E V C L T G A D - S E Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 7952 TTCTGATCAGTCAGCACCAGTCAAGCAAACTTCTGTGCCTCGCATGCCAGATGGCACCAG F - S V S T S Q A N F C A S H A R W H Q S D Q S A P V K Q T S V P R M P D G T R L I S Q H Q S S K L L C L A C Q M A P . . . . . . 8012 GGGCTTCTCCATGGGTCGAGGTAAGCCAGTTGCAATTAAAACAGCATAACGAGTGTATGT G L L H G S R - A S C N - N S I T S V C G F S M G R G K P V A I K T A - R V Y V G A S P W V E V S Q L Q L K Q H N E C M . . . . . . 8072 GGTCTCGAGTGGCTACTTATGTGAGCATTTCCAAATGGAGAAGTGGAAATGTGCTCTTTT G L E W L L M - A F P N G E V E M C S F V S S G Y L C E H F Q M E K W K C A L F W S R V A T Y V S I S K W R S G N V L F . . . . . . 8132 TTTGCAGTATTGATGGCAGGGTCTTAGAGACACTGTCAATCCAGTTGGAGTTGATATAGT F A V L M A G S - R H C Q S S W S - Y S L Q Y - W Q G L R D T V N P V G V D I V F C S I D G R V L E T L S I Q L E L I - . . . . . . 8192 TGAATCGGCGGATTGAGTCTCTTTTGTTTTTCCTGTTTATGTTTGTCAAAGCAAAGATCT - I G G L S L F C F S C L C L S K Q R S E S A D - V S F V F P V Y V C Q S K D L L N R R I E S L L F F L F M F V K A K I . . . . . . 8252 TTCAGGAAATCTGTGGCTTTTGTTATGTTTCTTCACTAGGTTTCCGTCGTGTTTATGAAG F R K S V A F V M F L H - V S V V F M K S G N L W L L L C F F T R F P S C L - S F Q E I C G F C Y V S S L G F R R V Y E . . . . . . 8312 CAAAACCTGAGGGATTTAAGTAAAGTCACACAAATGTACAAAGAATCTTCCCTCTAGCTT Q N L R D L S K V T Q M Y K E S S L - L K T - G I - V K S H K C T K N L P S S F A K P E G F K - S H T N V Q R I F P L A . . . . . . 8372 CATCTTAGTTTTTTTGTGTATACTAACAGAAATTGAATATATTATCAATAGTT H L S F F V Y T N R N - I Y Y Q - I L V F L C I L T E I E Y I I N S S S - F F C V Y - Q K L N I L S I V Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-2+_PGL-2_AGS-1_PPS_1 (7954 8190) (frame '0'; 234 bp, 78 residues) 1 LISQHQSSKL LCLACQMAPG ASPWVEVSQL QLKQHNECMW SRVATYVSIS KWRSGNVLFF 61 CSIDGRVLET LSIQLELI- AGS-2 (11490 11398,11053 10938,10694 10635,10433 10248,10161 10057,9979 9883,9455 9348,9253 9118,9026 8662) SCR (e 1.000 d 0.998 a 0.963,e 1.000 d 0.922 a 0.997,e 1.000 d 0.996 a 0.000,e 1.000 d 0.910 a 0.994,e 1.000 d 1.000 a 0.684,e 1.000 d 0.962 a 0.666,e 1.000 d 0.481 a 0.982,e 1.000 d 0.984 a 0.917,e 0.994) Exon 1 11490 11398 ( 93 n); score: 1.000 Intron 1 11397 11054 ( 344 n); Pd: 0.998 Pa: 0.963 Exon 2 11053 10938 ( 116 n); score: 1.000 Intron 2 10937 10695 ( 243 n); Pd: 0.922 Pa: 0.997 Exon 3 10694 10635 ( 60 n); score: 1.000 Intron 3 10634 10434 ( 201 n); Pd: 0.996 Pa: 0.000 Exon 4 10433 10248 ( 186 n); score: 1.000 Intron 4 10247 10162 ( 86 n); Pd: 0.910 Pa: 0.994 Exon 5 10161 10057 ( 105 n); score: 1.000 Intron 5 10056 9980 ( 77 n); Pd: 1.000 Pa: 0.684 Exon 6 9979 9883 ( 97 n); score: 1.000 Intron 6 9882 9456 ( 427 n); Pd: 0.962 Pa: 0.666 Exon 7 9455 9348 ( 108 n); score: 1.000 Intron 7 9347 9254 ( 94 n); Pd: 0.481 Pa: 0.982 Exon 8 9253 9118 ( 136 n); score: 1.000 Intron 8 9117 9027 ( 91 n); Pd: 0.984 Pa: 0.917 Exon 9 9026 8662 ( 365 n); score: 0.994 PGS (10140 10057,9979 9883,9455 9348,9253 9118,9026 8662) SGN-E545754- PGS (9253 9118,9026 8678) SGN-E545761- PGS (11490 11398,11053 10938,10694 10635,10433 10248,10161 10057,9979 9883) SGN-E545755+ PGS (11490 11398,11053 10938,10694 10635,10433 10248,10161 10074) SGN-E545762+ PGS (11489 11398,11053 10938,10694 10635,10433 10262) SGN-E312682+ 3-phase translation of AGS-2 (-strand): . . . . . . 11490 CTATAATGAATCCCTTTTTTTAATTCATTTGCATAATCAATAAAATGGTAGAAGAAATCA L - - I P F F N S F A - S I K W - K K S Y N E S L F L I H L H N Q - N G R R N Q I M N P F F - F I C I I N K M V E E I . . . . : . . 11430 AGAAAATCATCATTGATACTGACCCTGGAATTG : ATGATGCAATCGCGATTTTCGTGGCAC R K S S L I L T L E L : M M Q S R F S W H E N H H - Y - P W N - : - C N R D F R G T K K I I I D T D P G I : D D A I A I F V A . . . . . . 11026 TTCAATCTCCTGAAATTGAAGTAATTGGACTCACAACAATATTTGGTAATGTTCAGACAA F N L L K L K - L D S Q Q Y L V M F R Q S I S - N - S N W T H N N I W - C S D N L Q S P E I E V I G L T T I F G N V Q T . . . : . . . 10966 CTCTGTCAACCAGAAATGCTTTACATCTG : TTGGAGATAGCTGGGAGGACAGATATTCCAG L C Q P E M L Y I C : W R - L G G Q I F Q S V N Q K C F T S : V G D S W E D R Y S S T L S T R N A L H L : L E I A G R T D I P . . . : . . . 10663 TGGCTGAAGGCTCACACGTTACGATCACT : GAAGGCGTAAAACTTCAAAGTAGTGGATATG W L K A H T L R S L : K A - N F K V V D M G - R L T R Y D H : - R R K T S K - W I C V A E G S H V T I T : E G V K L Q S S G Y . . . . . . 10402 TTCATGGCGCGGATGGACTCGGAAACCAAAACATTTCTCCACCCAAAGGAAAGGCTATTG F M A R M D S E T K T F L H P K E R L L S W R G W T R K P K H F S T Q R K G Y - V H G A D G L G N Q N I S P P K G K A I . . . . . . 10342 AACAGACTGCAGCTGAATTTCTCATTCAGCAAACTAGTCTTTACCCTGGAAAGGTCACTG N R L Q L N F S F S K L V F T L E R S L T D C S - I S H S A N - S L P W K G H C E Q T A A E F L I Q Q T S L Y P G K V T . . . . : . . 10282 TTGTGGCCTTAGGCCCCCTGACAAATATAGCACTT : GCTATTCAGTTGGATCCTGAATTTT L W P - A P - Q I - H L : L F S W I L N F C G L R P P D K Y S T : C Y S V G S - I F V V A L G P L T N I A L : A I Q L D P E F . . . . . . 10136 TCAAAAACGTTGGTCAAGTTGTTGTTCTTGGTGGTGCTTTTGCTGTAAATGGAAATGTGA S K T L V K L L F L V V L L L - M E M - Q K R W S S C C S W W C F C C K W K C E F K N V G Q V V V L G G A F A V N G N V . . : . . . . 10076 GTCCAGCAGCTGAGGCAAAT : ATCTTCAAAGATCCAACTGCTGCTGATATTGTATTTACAA V Q Q L R Q I : S S K I Q L L L I L Y L Q S S S - G K : Y L Q R S N C C - Y C I Y K S P A A E A N : I F K D P T A A D I V F T . . . . . . : 9939 GTGGTGCTGATGTTCTTGCTGTTGGATTAAATATTACACATCAAGTTGTCCTTACTG : ATT V V L M F L L L D - I L H I K L S L L : I W C - C S C C W I K Y Y T S S C P Y - : F S G A D V L A V G L N I T H Q V V L T : D . . . . . . 9452 CTCATCGTGGCGAATTGGCAAAGTCCAATGGAAAGTTTGCCAAGTACCTCAGCAAGCTTT L I V A N W Q S P M E S L P S T S A S F S S W R I G K V Q W K V C Q V P Q Q A F S H R G E L A K S N G K F A K Y L S K L . . . . . : . 9392 TGGATGTCTATTTCGATTATCACAATACTGCATACAGCACGAGAG : GTGTCTTCCTTCATG W M S I S I I T I L H T A R E : V S S F M G C L F R L S Q Y C I Q H E R : C L P S - L D V Y F D Y H N T A Y S T R : G V F L H . . . . . . 9238 ATCCAACTGCTTTACTTGCTGCTGTTAATCCATCACTCCTCACCTATTCAGAAGGTGTCG I Q L L Y L L L L I H H S S P I Q K V S S N C F T C C C - S I T P H L F R R C R D P T A L L A A V N P S L L T Y S E G V . . . . . . 9178 TTCGCGTTCAGACAGTTGGCATCACAAAGGGTCTCACAATCTTTTATAACAAACAGAAAA F A F R Q L A S Q R V S Q S F I T N R K S R S D S W H H K G S H N L L - Q T E K V R V Q T V G I T K G L T I F Y N K Q K . : . . . . . 9118 G : GTTTGTTGAAGTCACTGAATGGTCTGAAAAACCTATAGTTAAAGTGGCAGCAACTGTTG : G L L K S L N G L K N L - L K W Q Q L L : V C - S H - M V - K T Y S - S G S N C - R : F V E V T E W S E K P I V K V A A T V . . . . . . 8967 ATGCTCCTAAAGTTATCGAATTGGTGATGACACGACTCGTCAATTCTTAGAAGTATTCAT M L L K L S N W - - H D S S I L R S I H C S - S Y R I G D D T T R Q F L E V F I D A P K V I E L V M T R L V N S - K Y S . . . . . . 8907 TGAACATTTATTTGAACAGTGTTACTGTCTTTCTGCACTCAATATTCAGAAACTTAACTA - T F I - T V L L S F C T Q Y S E T - L E H L F E Q C Y C L S A L N I Q K L N Y L N I Y L N S V T V F L H S I F R N L T . . . . . . 8847 CGCTATAAGGAACGGAATCTTAAACTATAGACCAAGTACTACTCTCAGGAACAAGGATGA R Y K E R N L K L - T K Y Y S Q E Q G - A I R N G I L N Y R P S T T L R N K D E T L - G T E S - T I D Q V L L S G T R M . . . . . . 8787 ATGTATGTAAGCAGTAAGCAGCTCATATACAGAATATGAACTTGCAAAAATAAGTTACAG M Y V S S K Q L I Y R I - T C K N K L Q C M - A V S S S Y T E Y E L A K I S Y S N V C K Q - A A H I Q N M N L Q K - V T . . . . . . 8727 CTAGACTAGTCGTGTTCTGATAGATTTAATCTTTGTGTAAAGATATGACATTTGAGCCTA L D - S C S D R F N L C V K I - H L S L - T S R V L I D L I F V - R Y D I - A - A R L V V F - - I - S L C K D M T F E P . 8667 ACTCAA T Q L N S Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-2-_PGL-2_AGS-2_PPS_1 (11467 11398,11053 10938,10694 10635,10433 10248,10161 10057,9979 9883,9455 9348,9253 9118,9026 8918) (frame '0'; 984 bp, 328 residues) 1 FICIINKMVE EIKKIIIDTD PGIDDAIAIF VALQSPEIEV IGLTTIFGNV QTTLSTRNAL 61 HLLEIAGRTD IPVAEGSHVT ITEGVKLQSS GYVHGADGLG NQNISPPKGK AIEQTAAEFL 121 IQQTSLYPGK VTVVALGPLT NIALAIQLDP EFFKNVGQVV VLGGAFAVNG NVSPAAEANI 181 FKDPTAADIV FTSGADVLAV GLNITHQVVL TDSHRGELAK SNGKFAKYLS KLLDVYFDYH 241 NTAYSTRGVF LHDPTALLAA VNPSLLTYSE GVVRVQTVGI TKGLTIFYNK QKRFVEVTEW 301 SEKPIVKVAA TVDAPKVIEL VMTRLVNS- ... finished at: Thu Jul 27 14:05:40 2006 ________________________________________________________________________________ Sequence 3: C12HBa0093P12.1-3, from 1 to 27618, both strands analyzed. ... started at: Thu Jul 27 14:05:40 2006 EST library file: /tmp/cxgn-bacpublish-resources-NsUNzK/sgn_ests; matching gDNA +strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 10 EST library file: /tmp/cxgn-bacpublish-resources-NsUNzK/sgn_ests; matching gDNA -strand ... ... found all matches, elapsed seconds = 4 ... matches indexed, elapsed seconds = 4 HitsTableSize = 14 ******************************************************************************** EST sequence 1 +strand 641 n (File: SGN-E253402+) 1 GCACCAATCG TGCTCATCTG GTTGATATCA ATTATGATAA TTGGGTTGTA CAACACTATC 61 ATTTGGAACC CCAAAATTGT GTCTGCTTTT TCGCCCTATT ATATCATCAA GTTTTTTAGG 121 GATACAGGAA AAGATGGTTG GATTTCTCTT GGAGGTATTC TCCTCTCAGT TGCAGGTACT 181 GAAGCTATGT ATGCAGATCT TGGTCATTTC TCTGCCTTCT CCATGAGGAT TACATTTGCA 241 TTTGTGGTGT ATCCGTGCTT GGTGATACAG TACATGGGTC AAGCTGCTTT TCTGTCAAAA 301 AATCTAGATT CCATTCCAAA TAGCTTCTAT AGCTCAATAC CTGATGGTGT ATACTGGCCT 361 GTTTTTGTTA TTGCAACCCT TGCAGCCATT GTAAGCAGCC AATCTATCAT CACAGCCACA 421 TTCTCAATCG TCAAGCAATG TAATTCACTA GGTTGCTTCC CGCGGGTCAA GATTGTCCAC 481 ACCTCAAAGC ATAAAGGGCA GATCTATGTA CCAGAAATAA ATTGGATCCT GATGATTCTC 541 ACTCTTGCTG TGGCTATCGG GTTCCAAGAT ACAACTTTGA TTGGAAATGC ATACGGGCTA 601 GCTTGCATGA CAGTTATGTT TATCACAACA TTCCTCATGA C Predicted gene structure (within gDNA segment 6341 to 9650): Exon 1 6941 7115 ( 175 n); cDNA 1 175 ( 175 n); score: 1.000 Intron 1 7116 7598 ( 483 n); Pd: 0.999 (s: 1.00), Pa: 0.992 (s: 1.00) Exon 2 7599 7651 ( 53 n); cDNA 176 228 ( 53 n); score: 1.000 Intron 2 7652 7868 ( 217 n); Pd: 0.992 (s: 1.00), Pa: 0.977 (s: 1.00) Exon 3 7869 7983 ( 115 n); cDNA 229 343 ( 115 n); score: 1.000 Intron 3 7984 8076 ( 93 n); Pd: 0.943 (s: 1.00), Pa: 0.934 (s: 1.00) Exon 4 8077 8328 ( 252 n); cDNA 344 595 ( 252 n); score: 0.996 Intron 4 8329 8994 ( 666 n); Pd: 0.980 (s: 1.00), Pa: 0.973 (s: 1.00) Exon 5 8995 9040 ( 46 n); cDNA 596 641 ( 46 n); score: 1.000 MATCH C12HBa0093P12.1-3+ SGN-E253402+ 0.998 641 1.000 C PGS_C12HBa0093P12.1-3+_SGN-E253402+ (6941 7115,7599 7651,7869 7983,8077 8328,8995 9040) Alignment (genomic DNA sequence = upper lines): GCACCAATCG TGCTCATCTG GTTGATATCA ATTATGATAA TTGGGTTGTA CAACACTATC 7000 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCACCAATCG TGCTCATCTG GTTGATATCA ATTATGATAA TTGGGTTGTA CAACACTATC 60 ATTTGGAACC CCAAAATTGT GTCTGCTTTT TCGCCCTATT ATATCATCAA GTTTTTTAGG 7060 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGGAACC CCAAAATTGT GTCTGCTTTT TCGCCCTATT ATATCATCAA GTTTTTTAGG 120 GATACAGGAA AAGATGGTTG GATTTCTCTT GGAGGTATTC TCCTCTCAGT TGCAGGTATA 7120 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| GATACAGGAA AAGATGGTTG GATTTCTCTT GGAGGTATTC TCCTCTCAGT TGCAG..... 175 TGACGTCTAT TATCTCTTAA CTCAGTAATT ATTTAGGCAT TATATTTAAA CAAACTCTCA 7180 .......... .......... .......... .......... .......... .......... 175 AACATGGCCT CTGACAAGTA ATCGCTCCGA TTTTGAGAGT GAACATCTAG ACACCTCAAC 7240 .......... .......... .......... .......... .......... .......... 175 TCGTCTATAG TGTGTCAGTT GAACACTCCA ACCTACAAAA TGATCATCTA GACACCTCCA 7300 .......... .......... .......... .......... .......... .......... 175 AAATTTATGT GCCACGTCAA TGTAGGGTGT CCATGACCAT GAGACATAAT AGGGCCAAGT 7360 .......... .......... .......... .......... .......... .......... 175 TGGAGTGTTT AGTTGTCACT TGAGACCAAG TTAAGATGTC TAGCTGTGCA TTCTCAAAGT 7420 .......... .......... .......... .......... .......... .......... 175 TGGAGTGCTT ACTTGCCAGC TGAGGCCAAG TTTGAGTGTC TGTTTATGTT ATTTATACAA 7480 .......... .......... .......... .......... .......... .......... 175 GAAAACATAG TAAACTTTGA GAATTCAAAG CAGTTGTTTG TGCAGACACT TTTTGAAACT 7540 .......... .......... .......... .......... .......... .......... 175 ATGAGATAAA CATTTACTAG AATTTTGGAT CTGATTTCCA ATTATTTTCA TTTATCAGGT 7600 || .......... .......... .......... .......... .......... ........GT 177 ACTGAAGCTA TGTATGCAGA TCTTGGTCAT TTCTCTGCCT TCTCCATGAG GGTAAGTCAA 7660 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| | ACTGAAGCTA TGTATGCAGA TCTTGGTCAT TTCTCTGCCT TCTCCATGAG G......... 228 ACTATTTGTT GGAGCGGACT AAATTAAACT GTGGATCGTG AAAACTTTGT GCTCGTACTC 7720 .......... .......... .......... .......... .......... .......... 228 GTCTATAAGC TGAGTGTCCT GGTGTCTGGC AGATTATATT GTCCGTAGTT GTCATTTTTT 7780 .......... .......... .......... .......... .......... .......... 228 CTGCATCATA TTGTTCAACG TCCTAGTTAA CTTGTTGGAT TCCTGGAACT TACACCATGT 7840 .......... .......... .......... .......... .......... .......... 228 CTATCTGAAT TTGTTTTGTC CTATGCAGAT TACATTTGCA TTTGTGGTGT ATCCGTGCTT 7900 || |||||||||| |||||||||| |||||||||| .......... .......... ........AT TACATTTGCA TTTGTGGTGT ATCCGTGCTT 260 GGTGATACAG TACATGGGTC AAGCTGCTTT TCTGTCAAAA AATCTAGATT CCATTCCAAA 7960 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTGATACAG TACATGGGTC AAGCTGCTTT TCTGTCAAAA AATCTAGATT CCATTCCAAA 320 TAGCTTCTAT AGCTCAATAC CTGGTTGGTT TTTCTTTTCC TTTTCTCTTT GTAACAGCTT 8020 |||||||||| |||||||||| ||| TAGCTTCTAT AGCTCAATAC CTG....... .......... .......... .......... 343 AATGTTTTCA CGATAAATGA TTCAATCCTA ACTGCTGGTC TATGCTATCC ATGCAGATGG 8080 |||| .......... .......... .......... .......... .......... ......ATGG 347 TGTATACTGG CCTGTTTTTG TTATTGCAAC CCTTGCAGCC ATTGTAGGCA GCCAATCTAT 8140 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| TGTATACTGG CCTGTTTTTG TTATTGCAAC CCTTGCAGCC ATTGTAAGCA GCCAATCTAT 407 CATCACAGCC ACATTCTCAA TCGTCAAGCA ATGTAATTCA CTAGGTTGCT TCCCGCGGGT 8200 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATCACAGCC ACATTCTCAA TCGTCAAGCA ATGTAATTCA CTAGGTTGCT TCCCGCGGGT 467 CAAGATTGTC CACACCTCAA AGCATAAAGG GCAGATCTAT GTACCAGAAA TAAATTGGAT 8260 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAGATTGTC CACACCTCAA AGCATAAAGG GCAGATCTAT GTACCAGAAA TAAATTGGAT 527 CCTGATGATT CTCACTCTTG CTGTGGCTAT CGGGTTCCAA GATACAACTT TGATTGGAAA 8320 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTGATGATT CTCACTCTTG CTGTGGCTAT CGGGTTCCAA GATACAACTT TGATTGGAAA 587 TGCATACGGT AAGCCTTCTT GTGTGCTAAA GATGTCTTAT TTGGTCTTAA GTCTAAACAA 8380 |||||||| TGCATACG.. .......... .......... .......... .......... .......... 595 CCCTCTACGA GACTTTAAAT ATCAACAACT CTGCTCGACC AATTTTCACT ACAGAATCTG 8440 .......... .......... .......... .......... .......... .......... 595 ATTTGCTTTA CATGATTCTG TTCTTCTAAA TGCTGCTCCT TTTCTTAGTG TCTTCACGAT 8500 .......... .......... .......... .......... .......... .......... 595 TTTCTTTTTT TCCCCAAGCT TGTGCTGGAT GATTTATATA TGTCCAATAA AGAAGATCTC 8560 .......... .......... .......... .......... .......... .......... 595 TTGCAAAAAG AGAATAACAA TTAAACTGAA ACATAAATGG TGAACAAGGA GGATTCTCTA 8620 .......... .......... .......... .......... .......... .......... 595 ATTGGATGAC ATTTTTTTTA TAAAAACAGC CCGGTGCACT AAGCTCCTGC TATGTGCGGG 8680 .......... .......... .......... .......... .......... .......... 595 GTTCGGGGAA GGGACGGACC ACAAGGGTCT ACTATATGCA GTTTTACTCT GCAACTGTCA 8740 .......... .......... .......... .......... .......... .......... 595 AGAGGTTGTT TACAGTTCGA ACCTGTGACC TCCAGGTCAC ATGGCAACAA CTTTACCAGC 8800 .......... .......... .......... .......... .......... .......... 595 TATGTTAAGG CCCCTTCCTC TAATTGGATG ACATGGAGAA GTAAAATATT CCTTGTGTAG 8860 .......... .......... .......... .......... .......... .......... 595 AAGGGATTTT TTATGTACAT TACTGCATTT AGTAAAGCCC AGCTTGCCTT TTATAGTTGG 8920 .......... .......... .......... .......... .......... .......... 595 ATTTTTTTCT TTCTAAATTC GAGTCATTCT ATAACTCTTG AATTAAACCT TCTTATAGAC 8980 .......... .......... .......... .......... .......... .......... 595 GTTTTTTTTT TCAGGGCTAG CTTGCATGAC AGTTATGTTT ATCACAACAT TCCTCATGAC 9040 |||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ....GGCTAG CTTGCATGAC AGTTATGTTT ATCACAACAT TCCTCATGAC 641 hqPGS_C12HBa0093P12.1-3+_SGN-E253402+ (6941 7115,7599 7651,7869 7983,8077 8328,8995 9040) ******************************************************************************** EST sequence 7 +strand 703 n (File: SGN-E332161+) 1 TAGCTTGCAT GACAGTTATG TTTATCACGA CATTTCTCAT GGCGCTTGTC ATAATCTTTG 61 TCTGGCAAAA AAGTGTAGCA CTTGCAATTC CTTTTCTCCT TTTATTCGGA ATCATCGAAG 121 GCGTCTACCT GTCTTCTGCA TGCATCAAGA TCCCACAAGG AGGATGGGTC TCCCTTGTGC 181 TCTCTTTTGC CTTCTTGACT ATCATGTTTG TCTGGCACTA TGGAACTCGC AAGAAGTACA 241 ACTTTGATCT TCACAATAAA GTTCCATTGA AATGGCTCCT TGGCTTGGGC CCCAGCCTCG 301 GTATTGTTCG TGTGCCAGGG ATAGGGCTGA TATACTCTGA ATTGGCAACA GGAATTCCAT 361 CCATCTTCTC TCACTTTGTT ACAAATCTCC CTGCATTTCA CAATGTGATG GTGTTTGTAT 421 GTGTCAAATC TGTTCCAGTA CCATTTGTCC CACCCGAGGA GCGCTTCCTC ATTGGTCGCA 481 TCTGCCCAAG ACCCTATCGC ATGTACCGTT GCATTGCCAG ATATGGTTAC AAGGACATAC 541 AGCGAGACAA TGGGAACTTT GAGGACCTTC TCATCCAGAG TATAGCAGAG TTCATCCAAA 601 TGGAAGCTGT AGAACCACAA CTCTCAAGCT CCGAGAGTCC ATCATTTGAT GGAAGGATGG 661 CAGTCATAAG CACAAGAAGT GTACAGTCAG GCTCAACATT ACT Predicted gene structure (within gDNA segment 5140 to 11147): Exon 1 8998 9672 ( 675 n); cDNA 1 675 ( 675 n); score: 0.858 MATCH C12HBa0093P12.1-3+ SGN-E332161+ 0.858 675 0.960 C PGS_C12HBa0093P12.1-3+_SGN-E332161+ (8998 9672) Alignment (genomic DNA sequence = upper lines): TAGCTTGCAT GACAGTTATG TTTATCACAA CATTCCTCAT GACACTTGTT ATAATCTTTG 9057 |||||||||| |||||||||| |||||||| | |||| ||||| | | ||||| |||||||||| TAGCTTGCAT GACAGTTATG TTTATCACGA CATTTCTCAT GGCGCTTGTC ATAATCTTTG 60 TGTGGCAAAG AAGTTTAGTA TTTGCTGCTG CTTTTCTCCT TTTCTTCTGG TTCATCGAAG 9117 | ||||||| |||| ||| | |||| | |||||||||| ||| ||| | ||||||||| TCTGGCAAAA AAGTGTAGCA CTTGCAATTC CTTTTCTCCT TTTATTCGGA ATCATCGAAG 120 GTCTCTACCT ATCTTCCGCA GCCATTAAGG CTCCACAGGG AGGATGGGTA TCCCTTTTGC 9177 | ||||||| ||||| ||| ||| ||| ||||| || ||||||||| |||||| ||| GCGTCTACCT GTCTTCTGCA TGCATCAAGA TCCCACAAGG AGGATGGGTC TCCCTTGTGC 180 TCTCTTTTAT CCTCTTAGCC ATCATGCTTG TGTGGCACTA TGGAACTTGC AAGAAGTACA 9237 |||||||| | |||| | |||||| ||| | |||||||| ||||||| || |||||||||| TCTCTTTTGC CTTCTTGACT ATCATGTTTG TCTGGCACTA TGGAACTCGC AAGAAGTACA 240 AATATGACCT GCACAACAAA GTTCCATTGA AATGGATCCT TGGCTTGGGT CCAAGCCTTG 9297 | | ||| || ||||| ||| |||||||||| ||||| |||| ||||||||| || ||||| | ACTTTGATCT TCACAATAAA GTTCCATTGA AATGGCTCCT TGGCTTGGGC CCCAGCCTCG 300 GTATTGTCCG CGTCCCAGGG ATAGGGCTAA TATACTCTGA ACTGGTAACA GGAGTTCCAC 9357 ||||||| || || |||||| |||||||| | |||||||||| | ||| |||| ||| ||||| GTATTGTTCG TGTGCCAGGG ATAGGGCTGA TATACTCTGA ATTGGCAACA GGAATTCCAT 360 CTATCTTCTC TCACTTTGTC ACAAATCTCC CTGCATTTCA TAATGTAATG GTGTTTGTAT 9417 | |||||||| ||||||||| |||||||||| |||||||||| ||||| ||| |||||||||| CCATCTTCTC TCACTTTGTT ACAAATCTCC CTGCATTTCA CAATGTGATG GTGTTTGTAT 420 GCGTCAAATC TGTTCCTGTA CCTCATGTCT CATCCGATGA GCGCTTCCTC ATTGGTCGTG 9477 | |||||||| |||||| ||| || |||| || |||| || |||||||||| |||||||| GTGTCAAATC TGTTCCAGTA CCATTTGTCC CACCCGAGGA GCGCTTCCTC ATTGGTCGCA 480 TTGGCCCAAG ATCATATCGC ATGTATCGTT GCATTGTTCG ATATGGTTAC AAGGACGCAC 9537 | ||||||| | | |||||| ||||| |||| |||||| | |||||||||| |||||| || TCTGCCCAAG ACCCTATCGC ATGTACCGTT GCATTGCCAG ATATGGTTAC AAGGACATAC 540 AGCAAGGTAC TGGGAACTTT GAGGACCTTC TCATCCAAAG TCTAGCAGAG TTCATCCAAA 9597 ||| || | |||||||||| |||||||||| ||||||| || | |||||||| |||||||||| AGCGAGACAA TGGGAACTTT GAGGACCTTC TCATCCAGAG TATAGCAGAG TTCATCCAAA 600 TGGAAGCTGT GGAACCACAA TTATCAAGCC CCGATAGTTC ATCACTTGAT GGTAGGATGG 9657 |||||||||| ||||||||| | |||||| |||| ||| | |||| ||||| || ||||||| TGGAAGCTGT AGAACCACAA CTCTCAAGCT CCGAGAGTCC ATCATTTGAT GGAAGGATGG 660 CAGTTATAAG CACAA 9672 |||| ||||| ||||| CAGTCATAAG CACAA 675 hqPGS_C12HBa0093P12.1-3+_SGN-E332161+ (8998 9672) ******************************************************************************** EST sequence 9 +strand 598 n (File: SGN-E309637+) 1 TTGACTATCA TGTTTGTCTG GCACTATGGA ACTCGCAAGA AGTACAACTT TGATCTTCAC 61 AATAAAGTTC CATTGAAATG GCTCCTTGGC TTGGGCCCCA GCCTCGGTAT TGTTCGTGTG 121 CCAGGGATAG GGCTGATATA CTCTGAATTG GCAACAGGAA TTCCATCCAT CTTCTCTCAC 181 TTTGTTACAA ATCTCCCTGC ATTTCACAAT GTGATGGTGT TTGTATGTGT CAAATCTGTT 241 CCAGTACCAT TTGTCCCACC CGAGGAGCGC TTCCTCATTG GTCGCATCTG CCCAAGACCC 301 TATCGCATGT ACCGTTGCAT TGCCAGATAT GGTTACAAGG ACATACAGCG AGACAATGGG 361 AACTTTGAGG ACCTTCTCAT CCAGAGTATA GCAGAGTTCA TCCAAATGGA AGCTGTAGAA 421 CCACAACTCT CAAGCTCCGA GAGTCCATCA TTTGATGGAA GGATGGCAGT CATAAGCACA 481 AGAAGTGTAC AGTCAGGCTC AACATTACTC GTCTCAGAGG AGGATTATGG TATTACTAAC 541 TCCATTCAAA GCAGCAAATC TTTAACGCTC CAAAGTCTAA GATCTGCTGG TGATGATG Predicted gene structure (within gDNA segment 7080 to 12037): Exon 1 9192 9786 ( 595 n); cDNA 1 598 ( 598 n); score: 0.826 MATCH C12HBa0093P12.1-3+ SGN-E309637+ 0.826 595 0.995 C PGS_C12HBa0093P12.1-3+_SGN-E309637+ (9192 9786) Alignment (genomic DNA sequence = upper lines): TTAGCCATCA TGCTTGTGTG GCACTATGGA ACTTGCAAGA AGTACAAATA TGACCTGCAC 9251 || | |||| || |||| || |||||||||| ||| |||||| ||||||| | ||| || ||| TTGACTATCA TGTTTGTCTG GCACTATGGA ACTCGCAAGA AGTACAACTT TGATCTTCAC 60 AACAAAGTTC CATTGAAATG GATCCTTGGC TTGGGTCCAA GCCTTGGTAT TGTCCGCGTC 9311 || ||||||| |||||||||| | |||||||| ||||| || | |||| ||||| ||| || || AATAAAGTTC CATTGAAATG GCTCCTTGGC TTGGGCCCCA GCCTCGGTAT TGTTCGTGTG 120 CCAGGGATAG GGCTAATATA CTCTGAACTG GTAACAGGAG TTCCACCTAT CTTCTCTCAC 9371 |||||||||| |||| ||||| ||||||| || | ||||||| ||||| | || |||||||||| CCAGGGATAG GGCTGATATA CTCTGAATTG GCAACAGGAA TTCCATCCAT CTTCTCTCAC 180 TTTGTCACAA ATCTCCCTGC ATTTCATAAT GTAATGGTGT TTGTATGCGT CAAATCTGTT 9431 ||||| |||| |||||||||| |||||| ||| || ||||||| ||||||| || |||||||||| TTTGTTACAA ATCTCCCTGC ATTTCACAAT GTGATGGTGT TTGTATGTGT CAAATCTGTT 240 CCTGTACCTC ATGTCTCATC CGATGAGCGC TTCCTCATTG GTCGTGTTGG CCCAAGATCA 9491 || ||||| |||| || | ||| |||||| |||||||||| |||| | | ||||||| | CCAGTACCAT TTGTCCCACC CGAGGAGCGC TTCCTCATTG GTCGCATCTG CCCAAGACCC 300 TATCGCATGT ATCGTTGCAT TGTTCGATAT GGTTACAAGG ACGCACAGCA AGGTACTGGG 9551 |||||||||| | |||||||| || ||||| |||||||||| || ||||| || | |||| TATCGCATGT ACCGTTGCAT TGCCAGATAT GGTTACAAGG ACATACAGCG AGACAATGGG 360 AACTTTGAGG ACCTTCTCAT CCAAAGTCTA GCAGAGTTCA TCCAAATGGA AGCTGTGGAA 9611 |||||||||| |||||||||| ||| ||| || |||||||||| |||||||||| |||||| ||| AACTTTGAGG ACCTTCTCAT CCAGAGTATA GCAGAGTTCA TCCAAATGGA AGCTGTAGAA 420 CCACAATTAT CAAGCCCCGA TAGTTCATCA CTTGATGGTA GGATGGCAGT TATAAGCAC- 9670 |||||| | | ||||| |||| ||| ||||| ||||||| | |||||||||| |||||||| CCACAACTCT CAAGCTCCGA GAGTCCATCA TTTGATGGAA GGATGGCAGT CATAAGCACA 480 A-AA-TCTAC AGTCACACTC ACCATTTATC ATAGATGATG ATGATTTTGA AACATGTTCC 9728 | || | ||| ||||| ||| | |||| || | || | | |||| || | | | AGAAGTGTAC AGTCAGGCTC AACATTACTC GTCTCAGAGG AGGATTATGG TATTACTAAC 540 ACCATTCAAA GCAGCAAGTC ACTGACACTT CAAAGTGTAA GATCTTTTTA TGATGATG 9786 ||||||||| ||||||| || | || || |||||| ||| ||||| | |||||||| TCCATTCAAA GCAGCAAATC TTTAACGCTC CAAAGTCTAA GATCTGCTGG TGATGATG 598 hqPGS_C12HBa0093P12.1-3+_SGN-E309637+ (9192 9786) ******************************************************************************** EST sequence 8 +strand 603 n (File: SGN-E309611+) 1 CAAGACCCTA TCGCATGTAC CGTTGCATTG CCAGATATGG TTACAAGGAC ATACAGCGAG 61 ACAATGGGAA CTTTGAGGAC CTTCTCATCC AGAGTATAGC AGAGTTCATC CAAATGGAAG 121 CTGTAGAACC ACAACTCTCA AGCTCCGAGA GTCCATCATT TGATGGAAGG ATGGCAGTCA 181 TAAGCACAAG AAGTGTACAG TCAGGCTCAA CATTACTCGT CTCAGAGGAG GATTATGGTA 241 TTACTAACTC CATTCAAAGC AGCAAATCTT TAACGCTCCA AAGTCTAAGA TCTGCTGGTG 301 ATGATGAGAA CCCACAGATG AGGAGGCGCA GAGTAAGGTT TCGCTTGCCA GAAAACCCTG 361 GCATGGATCC AGCTGTTCGG GATGAGCTTT CAGATCTAAT AGATGCAAAA GATGCAGGTG 421 TTGCATATAT AATGGGACAC TCTTACGTGA AGGCGAGGAG ATCAGCTTCT TTCATGAAGA 481 AGCTAGTTAT CGACATTGGT TATTCATTTC TGCGCAAAAA CTGTAGGGGT CCTGCTGTGG 541 CACTTAATAT TCCTCACATT AGTCTCATTG AAGTTGGCAT GATATACTAT GTCTAAGCTA 601 TAG Predicted gene structure (within gDNA segment 8587 to 10765): Exon 1 9484 10082 ( 599 n); cDNA 1 603 ( 603 n); score: 0.773 MATCH C12HBa0093P12.1-3+ SGN-E309611+ 0.773 599 0.993 C PGS_C12HBa0093P12.1-3+_SGN-E309611+ (9484 10082) Alignment (genomic DNA sequence = upper lines): CAAGATCATA TCGCATGTAT CGTTGCATTG TTCGATATGG TTACAAGGAC GCACAGCAAG 9543 ||||| | || ||||||||| |||||||||| ||||||| |||||||||| ||||| || CAAGACCCTA TCGCATGTAC CGTTGCATTG CCAGATATGG TTACAAGGAC ATACAGCGAG 60 GTACTGGGAA CTTTGAGGAC CTTCTCATCC AAAGTCTAGC AGAGTTCATC CAAATGGAAG 9603 | |||||| |||||||||| |||||||||| | ||| |||| |||||||||| |||||||||| ACAATGGGAA CTTTGAGGAC CTTCTCATCC AGAGTATAGC AGAGTTCATC CAAATGGAAG 120 CTGTGGAACC ACAATTATCA AGCCCCGATA GTTCATCACT TGATGGTAGG ATGGCAGTTA 9663 |||| ||||| |||| | ||| ||| |||| | || ||||| | |||||| ||| |||||||| | CTGTAGAACC ACAACTCTCA AGCTCCGAGA GTCCATCATT TGATGGAAGG ATGGCAGTCA 180 TAAGCAC-A- AA-TCTACAG TCACACTCAC CATTTATCAT AGATGATGAT GATTTTGAAA 9720 ||||||| | || | ||||| ||| |||| |||| || | || || |||| || | TAAGCACAAG AAGTGTACAG TCAGGCTCAA CATTACTCGT CTCAGAGGAG GATTATGGTA 240 CATGTTCCAC CATTCAAAGC AGCAAGTCAC TGACACTTCA AAGTGTAAGA TCTTTTTATG 9780 | | | |||||||||| ||||| || | || || || |||| ||||| ||| | || TTACTAACTC CATTCAAAGC AGCAAATCTT TAACGCTCCA AAGTCTAAGA TCTGCTGGTG 300 ATGATGGGAA CCATGAAAAC AGAAAACGAC GAATCAGGTT CAACTTGCCA GAGAACTCTG 9840 |||||| ||| || | | || | || || | ||||| ||||||| || ||| ||| ATGATGAGAA CCCACAGATG AGGAGGCGCA GAGTAAGGTT TCGCTTGCCA GAAAACCCTG 360 GCATGGATCC TGAAGTTAGG GATGAGCTTA TAGATTTGGT TCAGGCAAAG GAGTCAGGGG 9900 |||||||||| | ||| || ||||||||| |||| | | | ||||| || |||| | GCATGGATCC AGCTGTTCGG GATGAGCTTT CAGATCTAAT AGATGCAAAA GATGCAGGTG 420 TTGCATATAT AATGGGACAC TCATATGTCA AGGCACGTAG ATTGTCCTCT TGCTGGAAGA 9960 |||||||||| |||||||||| || || || | |||| | || || | ||| | | ||||| TTGCATATAT AATGGGACAC TCTTACGTGA AGGCGAGGAG ATCAGCTTCT TTCATGAAGA 480 AATTTGTCAT TGACGTTGCA TATTCATTTC TGCGTAAGAA CTGCAGAGCT TCCGCTGTTG 10020 | | || || ||| ||| |||||||||| |||| || || ||| || | | | ||||| | AGCTAGTTAT CGACATTGGT TATTCATTTC TGCGCAAAAA CTGTAGGGGT CCTGCTGTGG 540 CACTTAACAT TCCTCACATT AGTCTTATTG AAGTTGGCAT GATATACTAT GTCT-AGAGA 10079 ||||||| || |||||||||| ||||| |||| |||||||||| |||||||||| |||| || | CACTTAATAT TCCTCACATT AGTCTCATTG AAGTTGGCAT GATATACTAT GTCTAAGCTA 600 GAG 10082 || TAG 603 hqPGS_C12HBa0093P12.1-3+_SGN-E309611+ (9484 10082) ******************************************************************************** EST sequence 23 -strand 684 n (File: SGN-E249331-) 1 TCACTGACAC TTCAAAGTGT AAGATCTTTT TATGATGATG GGAACCATGA AAACAGAAAA 61 CGACGAATCA GGTTCAACTT GCCAGAGAAC TCTGGCATGG ATCCTGAAGT TAGGGATGAG 121 CTTATAGATT TGGTTCAGGC AAAGGAGTCA GGGGTTGCAT ATATAATGGG ACACTCATAT 181 GTCAAGGCAC GTAGATTGTC CTCTTGCTGG AAGAAATTTG TCATTGACGT TGCATATTCA 241 TTTCTGCGTA AGAACTGCAG AGCTTCCGCT GTTGCACTTA ACATTCCTCA CATTAGTCTT 301 ATTGAAGTTG GCATGATATA CTATGTCTAG AGAGAGGCTT GGAGCCAAGA GAACATTGAA 361 CGCCTTCATC GGAGTTACTT GCAGAATCTT TTCACAGGTA GATGAACTTT TTCTTGAATA 421 TTTTTGCCCC AAATACCAAG TCTTGCTCAT CACTAATCTT TGTATTGTTA GTATATATAT 481 ATTTGTTATA TCATCTCTTT CACATATGAC CTGTATTTAT TTGTGTATTT TATAGTTAGA 541 TAGAGACAGT AGTTATAATT TTAATAGGTG ATTTAAGCTA TGATTGAAAT AGCTATATAG 601 GCTTTTGTAA TCTAACATTT TGATTTCTTT TTAAAAAAAT TGTAAACTAT TTATTAACAT 661 TAAAAAAACC AAAAAAAAAA AAAA Predicted gene structure (within gDNA segment 9137 to 11228): Exon 1 9747 10408 ( 662 n); cDNA 1 662 ( 662 n); score: 1.000 PPA cDNA 671 684 MATCH C12HBa0093P12.1-3+ SGN-E249331- 1.000 662 0.968 C PGS_C12HBa0093P12.1-3+_SGN-E249331- (9747 10408) Alignment (genomic DNA sequence = upper lines): TCACTGACAC TTCAAAGTGT AAGATCTTTT TATGATGATG GGAACCATGA AAACAGAAAA 9806 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCACTGACAC TTCAAAGTGT AAGATCTTTT TATGATGATG GGAACCATGA AAACAGAAAA 60 CGACGAATCA GGTTCAACTT GCCAGAGAAC TCTGGCATGG ATCCTGAAGT TAGGGATGAG 9866 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGACGAATCA GGTTCAACTT GCCAGAGAAC TCTGGCATGG ATCCTGAAGT TAGGGATGAG 120 CTTATAGATT TGGTTCAGGC AAAGGAGTCA GGGGTTGCAT ATATAATGGG ACACTCATAT 9926 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTATAGATT TGGTTCAGGC AAAGGAGTCA GGGGTTGCAT ATATAATGGG ACACTCATAT 180 GTCAAGGCAC GTAGATTGTC CTCTTGCTGG AAGAAATTTG TCATTGACGT TGCATATTCA 9986 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTCAAGGCAC GTAGATTGTC CTCTTGCTGG AAGAAATTTG TCATTGACGT TGCATATTCA 240 TTTCTGCGTA AGAACTGCAG AGCTTCCGCT GTTGCACTTA ACATTCCTCA CATTAGTCTT 10046 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTCTGCGTA AGAACTGCAG AGCTTCCGCT GTTGCACTTA ACATTCCTCA CATTAGTCTT 300 ATTGAAGTTG GCATGATATA CTATGTCTAG AGAGAGGCTT GGAGCCAAGA GAACATTGAA 10106 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTGAAGTTG GCATGATATA CTATGTCTAG AGAGAGGCTT GGAGCCAAGA GAACATTGAA 360 CGCCTTCATC GGAGTTACTT GCAGAATCTT TTCACAGGTA GATGAACTTT TTCTTGAATA 10166 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CGCCTTCATC GGAGTTACTT GCAGAATCTT TTCACAGGTA GATGAACTTT TTCTTGAATA 420 TTTTTGCCCC AAATACCAAG TCTTGCTCAT CACTAATCTT TGTATTGTTA GTATATATAT 10226 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTGCCCC AAATACCAAG TCTTGCTCAT CACTAATCTT TGTATTGTTA GTATATATAT 480 ATTTGTTATA TCATCTCTTT CACATATGAC CTGTATTTAT TTGTGTATTT TATAGTTAGA 10286 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTGTTATA TCATCTCTTT CACATATGAC CTGTATTTAT TTGTGTATTT TATAGTTAGA 540 TAGAGACAGT AGTTATAATT TTAATAGGTG ATTTAAGCTA TGATTGAAAT AGCTATATAG 10346 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGAGACAGT AGTTATAATT TTAATAGGTG ATTTAAGCTA TGATTGAAAT AGCTATATAG 600 GCTTTTGTAA TCTAACATTT TGATTTCTTT TTAAAAAAAT TGTAAACTAT TTATTAACAT 10406 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCTTTTGTAA TCTAACATTT TGATTTCTTT TTAAAAAAAT TGTAAACTAT TTATTAACAT 660 TA 10408 || TA 662 hqPGS_C12HBa0093P12.1-3+_SGN-E249331- (9747 10408) ******************************************************************************** EST sequence 21 +strand 461 n (File: SGN-E356653+) 1 TGGAGTACTT TCAGCAGGTC TAAATTCAGG TACTCCTTGC ATATTTGGTG TTTTGACATG 61 TGATACCTTG GAGCAGGCTT TCAATCGCGT TGGTGGGAAG GCTGGGAATA AAGGTTCCGA 121 AACTGCATTG ACTGCTATTG AGATGGCATC TTTGTTTGAG CACCACCTAA AGCCTTCAGA 181 GTAGACAATC CTTCTTATCG CGACAAGGTT CTGGATTTTC ACCTTTAAAC AGAATCTCAC 241 TCGCATTACC ATCCCAACTA GTTACTCCGA CTAATAAGGT TAGACATCGA GGGGGAAAAA 301 GAGTTCCCTC TACTTCTCGG TTTCCTTCCT CGTCTACTCA TTGATGCTGG ATGTAAACAT 361 TCTTGTAAAA GCTGCACTTG TTTGAGAAAA TGTTGCACTT TGTTTCAAGT TTGAGTTTTG 421 GATAATAGTA TTCTTTCAAG TTTTGAGTTT TGGACTTCAA A Predicted gene structure (within gDNA segment 13473 to 10600): Exon 1 12873 12845 ( 29 n); cDNA 1 29 ( 29 n); score: 1.000 Intron 1 12844 12150 ( 695 n); Pd: 0.993 (s: 0), Pa: 0.928 (s: 1.00) Exon 2 12149 12103 ( 47 n); cDNA 30 76 ( 47 n); score: 1.000 Intron 2 12102 11808 ( 295 n); Pd: 0.988 (s: 1.00), Pa: 0.398 (s: 1.00) Exon 3 11807 11748 ( 60 n); cDNA 77 136 ( 60 n); score: 1.000 Intron 3 11747 11535 ( 213 n); Pd: 0.998 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 4 11534 11210 ( 325 n); cDNA 137 461 ( 325 n); score: 1.000 MATCH C12HBa0093P12.1-3- SGN-E356653+ 1.000 461 1.000 C PGS_C12HBa0093P12.1-3-_SGN-E356653+ (12873 12845,12149 12103,11807 11748,11534 11210) Alignment (genomic DNA sequence = upper lines): TGGAGTACTT TCAGCAGGTC TAAATTCAGG TGAGATAACA TACTCCATTA TAATCCACAT 12814 |||||||||| |||||||||| ||||||||| TGGAGTACTT TCAGCAGGTC TAAATTCAG. .......... .......... .......... 29 GAGCAATAAC AAGTATGCTG ATTTTTAATC TTTTGCAACT CTTCAGTACT TTATTGATAG 12754 .......... .......... .......... .......... .......... .......... 29 ATAAGATTTT GGAACATTGT TAAAACTTCG AATGTAATAT CTGGAATTAA ATATGATGGT 12694 .......... .......... .......... .......... .......... .......... 29 TGAAATAGAG GATTGTTATG GGACCATGAT CCAACAAAGT GATAGGCAAG GTTTAGTAAG 12634 .......... .......... .......... .......... .......... .......... 29 GTTGTGAGAC TAAGCATGTA ATATGGGGTT GTTAGGCTTC TAAAGTCCGA ACATATTCAC 12574 .......... .......... .......... .......... .......... .......... 29 AGGATGAATT TGTTAATCTG GATGTTCGGA CATACAAGAC TGATAAGATT AGAAATGATA 12514 .......... .......... .......... .......... .......... .......... 29 ACTGACAGAA GGTGTAAGTA CCCTAAAAGA TGAAAAGAAG GGTCGTTGAA GACTACTTAT 12454 .......... .......... .......... .......... .......... .......... 29 GTCTTAAACA GTCATCTTAG ATAGTTAGGC TGTGTCGAAC ACTCAGGCAT AGTACAACAA 12394 .......... .......... .......... .......... .......... .......... 29 CTTGAAGATC CTATGTTAGA ATCCTAGTTA AAAACTGTGT GAGTTCATAT GCTCTAGTCC 12334 .......... .......... .......... .......... .......... .......... 29 TGGTGTCATA TTTACTCGAT GGTTGTGCTT CTGAGCAGTA TAATAAGTTC CCCGATGAAT 12274 .......... .......... .......... .......... .......... .......... 29 TTATCAAGGT AAAAGGTAGT TTGATTACCT CTGTTACACA TCTTCTGTCA TTATACGCTA 12214 .......... .......... .......... .......... .......... .......... 29 TTTTCAGTTC ATCTTTTTCA ATTCGCTGGC CAAACTAAAA TCTCCCTTAT GTAATGAGTT 12154 .......... .......... .......... .......... .......... .......... 29 GCAGGTACTC CTTGCATATT TGGTGTTTTG ACATGTGATA CCTTGGAGCA GGTAAGCAAT 12094 |||||| |||||||||| |||||||||| |||||||||| |||||||||| | ....GTACTC CTTGCATATT TGGTGTTTTG ACATGTGATA CCTTGGAGCA G......... 76 TAGCCGTTAT ACACGTGTGT TGGTGTTCTA TCTCCTTTGT CCATTGTATA AATAGTGGTT 12034 .......... .......... .......... .......... .......... .......... 76 GGAATAACAG ATTCACGGAA AGTATAATCT GCATCAGCTG AGAGGTTATT ATACCTTGTA 11974 .......... .......... .......... .......... .......... .......... 76 GCTTTGTACA GCGAGATAGT CATTGTTGTT TCGTGTACTT GAACTGTTAG TTTTTTATGG 11914 .......... .......... .......... .......... .......... .......... 76 ATTTGTGTAA CATATACTCT CCTTGGAACT ACATTTACAA TATTCTTCTT GGCTGTGTGA 11854 .......... .......... .......... .......... .......... .......... 76 GGCTATCTTG CTCATGCATG CTAACCAACA TCGCTATATA ATGCAGGCTT TCAATCGCGT 11794 |||| |||||||||| .......... .......... .......... .......... ......GCTT TCAATCGCGT 90 TGGTGGGAAG GCTGGGAATA AAGGTTCCGA AACTGCATTG ACTGCTGTAA GTTGCCTGTG 11734 |||||||||| |||||||||| |||||||||| |||||||||| |||||| TGGTGGGAAG GCTGGGAATA AAGGTTCCGA AACTGCATTG ACTGCT.... .......... 136 ATTGTGTGAC AATTAGATTT TATCCTCAAA ACAAGTTCAT TTGTGTCTGA TTCTGCTCAA 11674 .......... .......... .......... .......... .......... .......... 136 ATATATCTGC CTTAATATGC GATTAAAAGG GTCTGACTAA TATAATTGTT GAAACTTAAA 11614 .......... .......... .......... .......... .......... .......... 136 CATCAATAAC TTAAGACTCA GTGTTCTGTT AGCAATAAGT GACTGATCTT GAATGCATAT 11554 .......... .......... .......... .......... .......... .......... 136 ATTGTGTTTA ATTTTTCAGA TTGAGATGGC ATCTTTGTTT GAGCACCACC TAAAGCCTTC 11494 | |||||||||| |||||||||| |||||||||| |||||||||| .......... .........A TTGAGATGGC ATCTTTGTTT GAGCACCACC TAAAGCCTTC 177 AGAGTAGACA ATCCTTCTTA TCGCGACAAG GTTCTGGATT TTCACCTTTA AACAGAATCT 11434 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGAGTAGACA ATCCTTCTTA TCGCGACAAG GTTCTGGATT TTCACCTTTA AACAGAATCT 237 CACTCGCATT ACCATCCCAA CTAGTTACTC CGACTAATAA GGTTAGACAT CGAGGGGGAA 11374 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CACTCGCATT ACCATCCCAA CTAGTTACTC CGACTAATAA GGTTAGACAT CGAGGGGGAA 297 AAAGAGTTCC CTCTACTTCT CGGTTTCCTT CCTCGTCTAC TCATTGATGC TGGATGTAAA 11314 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAAGAGTTCC CTCTACTTCT CGGTTTCCTT CCTCGTCTAC TCATTGATGC TGGATGTAAA 357 CATTCTTGTA AAAGCTGCAC TTGTTTGAGA AAATGTTGCA CTTTGTTTCA AGTTTGAGTT 11254 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATTCTTGTA AAAGCTGCAC TTGTTTGAGA AAATGTTGCA CTTTGTTTCA AGTTTGAGTT 417 TTGGATAATA GTATTCTTTC AAGTTTTGAG TTTTGGACTT CAAA 11210 |||||||||| |||||||||| |||||||||| |||||||||| |||| TTGGATAATA GTATTCTTTC AAGTTTTGAG TTTTGGACTT CAAA 461 hqPGS_C12HBa0093P12.1-3-_SGN-E356653+ (12873 12845,12149 12103,11807 11748,11534 11210) ******************************************************************************** EST sequence 2 -strand 682 n (File: SGN-E540633-) 1 GTTTTAATGA TCTGATCACC AAGAAGCTTT TGGAGGGAGC TTTGAACACA TTCAAGAGTT 61 ACTCAGTTAG AGAAGAAGAC ATTGATGTTG TGTGGGTTCC TGGTAGTTTT GAAATTGGTG 121 TTGTCGCTCA GCAACTTGGC AAGTCAAGAA AGTATCAATC AATATTGTGT ATCGGTGCGG 181 TGATTAGAGG TGATACCTCC CATTATGATG CTGTGGTTAA TGCTGCCACG TCTGGAGTAC 241 TTTCCGCAGG TTTAAATTCA GGTACTCCTT GCATATTTGG TGTCCTGACA TGTGATACCT 301 TGGAACAGGC TTTCGACCGT GTTGGTGGGA AGGCTGGGAA TAAAGGTGCT GAAGCAGCAT 361 TGACAGCCAT TGAGATGGCA TCTTTGTTTG AGCACCATCT GAAGCCATTA CAGTAGGGAA 421 ATTAAAGACC TCTTTTTTTT TTCTTTTTTT TTTTTTGCTT CAGCTTTATT GTTGGAAGAA 481 AGTTGTGTAG TAATTTGGCA ATTGTGTTCT TTGAGGAATC AGAATTGCTC TAAATTGTTT 541 GCAGGGATAA CATTTGTGAA CCAAACTGCA GTTGTTCCAA TGTTGTATTT TCAAGTAATA 601 ATAATAACAA CAATAATACA TCTTTTCTCT ATTAAAAAAA AAAAAAAAAA AAAAAAAAAA 661 AAAAAAACTC GAGGGGGGCC CG Predicted gene structure (within gDNA segment 14857 to 7727): Exon 1 14247 14162 ( 86 n); cDNA 1 86 ( 86 n); score: 0.919 Intron 1 14161 13886 ( 276 n); Pd: 0.991 (s: 0.86), Pa: 0.987 (s: 0.80) Exon 2 13885 13790 ( 96 n); cDNA 87 182 ( 96 n); score: 0.766 Intron 2 13789 12924 ( 866 n); Pd: 0.993 (s: 0.72), Pa: 0.995 (s: 0.84) Exon 3 12923 12845 ( 79 n); cDNA 183 261 ( 79 n); score: 0.873 Intron 3 12844 12150 ( 695 n); Pd: 0.993 (s: 0.90), Pa: 0.928 (s: 0.94) Exon 4 12149 12103 ( 47 n); cDNA 262 308 ( 47 n); score: 0.936 Intron 4 12102 11808 ( 295 n); Pd: 0.988 (s: 0.94), Pa: 0.398 (s: 0.86) Exon 5 11807 11748 ( 60 n); cDNA 309 368 ( 60 n); score: 0.850 Intron 5 11747 11535 ( 213 n); Pd: 0.998 (s: 0.86), Pa: 0.999 (s: 0.90) Exon 6 11534 11487 ( 48 n); cDNA 369 416 ( 48 n); score: 0.896 PPA cDNA 634 668 MATCH C12HBa0093P12.1-3- SGN-E540633- 0.849 416 0.610 C PGS_C12HBa0093P12.1-3-_SGN-E540633- (14247 14162,13885 13790,12923 12845,12149 12103,11807 11748,11534 11487) Alignment (genomic DNA sequence = upper lines): GTTTTAATGA TCTGATCACC AAGAAGCTTT TGGAGGGAGC TTTGGAGACT TTCAAGAATT 14188 |||||||||| |||||||||| |||||||||| |||||||||| |||| | || ||||||| || GTTTTAATGA TCTGATCACC AAGAAGCTTT TGGAGGGAGC TTTGAACACA TTCAAGAGTT 60 ACTCGGTTAG AGAGGAAGAT ATTGATGTAC GTTTTCTATA TCTAATCCTC CGTTCCCTCA 14128 |||| ||||| ||| ||||| |||||| ACTCAGTTAG AGAAGAAGAC ATTGAT.... .......... .......... .......... 86 AAACCTGTAT TTTCTCTTAA ACTTGATGCA ATTACTACAG AAGGTCCGCT TGCCATGAAA 14068 .......... .......... .......... .......... .......... .......... 86 AATCTAAATG AGAGATTGAT TGCTACATAA TACCCCTCTA TGACACGCTA TACTGGAGCA 14008 .......... .......... .......... .......... .......... .......... 86 TAAAGATCCA CTATGGGATA AATGGAGAAT GTTGTATCAA TAGATTAAAT ATAGAATAAG 13948 .......... .......... .......... .......... .......... .......... 86 GCGTGGTGGG AAGTTGTCTC TTCCATTTCA TGGCGTCATC TAGTTTTTTC CTCATAATGC 13888 .......... .......... .......... .......... .......... .......... 86 AGGTTGTGTG GGTTCCTGGT TGTTTTGAAA TCGGCGTGAC TGCACAGCTT CTTGGAAAGT 13828 |||||||| |||||||||| ||||||||| | || || || |||| ||||| |||| ..GTTGTGTG GGTTCCTGGT AGTTTTGAAA TTGGTGTTGT CGCTCAGCAA CTTGGCAAGT 144 CACAGAAA-T ATCACGCAAT ACTCTGCATT GGGGCTGTGG TAAGCTCACT TCCAAACATA 13769 || ||||| | |||| |||| | | || || || || ||| CA-AGAAAGT ATCAATCAAT ATTGTGTATC GGTGCGGTG. .......... .......... 182 AGTTTACCTT GTCGTTTTTC CTTCTACATT ATTTGCCTAC TACGATTCTG TTTCATATGA 13709 .......... .......... .......... .......... .......... .......... 182 TGATATTTTT TGTATCGTGT ATGGAAATAT GGAAGATTAC AGTCAGATGC TTAAGTTATT 13649 .......... .......... .......... .......... .......... .......... 182 CTTTTGGTAT ATTCTGTTTT GGATCGTCTT ATGTGTTCAT CATTGTAAGT GAACTAATCT 13589 .......... .......... .......... .......... .......... .......... 182 TTTGAGTTAC GGAAGGGGAA GCAGCTTAGA GTATGATGAG AAGCATTAAC TGTGTCTTTT 13529 .......... .......... .......... .......... .......... .......... 182 GTTCCTGTTG AGAAGACTAG CATTTATTTG TGACGCTCTT CCCTCAACAA TGGAACTATA 13469 .......... .......... .......... .......... .......... .......... 182 AAGCTGAACA AATAAAACTC AAGCACTATT TTGACAGATA GGCAAAATTA TTAAACAAAG 13409 .......... .......... .......... .......... .......... .......... 182 AAGCTATTGA ACTTTAGATT CCTTACTGAA GATATATACT TGGATTCATT AAATGGAAAT 13349 .......... .......... .......... .......... .......... .......... 182 CTTTAACTAA TTTCCATTGA TGTGAGGGAG GGAAAAATCT GTGGATTTAG CTCAGTGTAT 13289 .......... .......... .......... .......... .......... .......... 182 TCTAATTTTT CTTTGTGGGA CGGGCTAAAC GTTTAGTTGT TGTAGTTGGA CATAGTAGGG 13229 .......... .......... .......... .......... .......... .......... 182 AGGCATGCGT GAATTTAGTG CCTTATTGTG AAGTAACCTC ATCTGAACAT ACTGATAAAC 13169 .......... .......... .......... .......... .......... .......... 182 ACTATATTAT TTTTGCCCCG GTGGATTTCT TTCTCAGTTA ATAAATATCT GGAAAGGAAA 13109 .......... .......... .......... .......... .......... .......... 182 AGGTAATATA AAGCTAAAAC TCTGAGCATT ACAAACCCGA AGAAAGAAAA AATCCATCAC 13049 .......... .......... .......... .......... .......... .......... 182 AGACAAGAGA TTGAAGTGCC TTTCTGCTTT TCCCTCTGTC CGAATCATAA TCTAATATTT 12989 .......... .......... .......... .......... .......... .......... 182 CTTTTTTTCT TGATGGTGAT GTGTATCATT GTTTTTAATT GTATTTTTTG CTTTGTGGCT 12929 .......... .......... .......... .......... .......... .......... 182 AGTAGATCAG AGGTGATACA TCTCACTACG ATGCAGTCGT TAATGCTGCC ACATCTGGAG 12869 || || ||||||||| || || || | |||| || || |||||||||| || ||||||| .....ATTAG AGGTGATACC TCCCATTATG ATGCTGTGGT TAATGCTGCC ACGTCTGGAG 237 TACTTTCAGC AGGTCTAAAT TCAGGTGAGA TAACATACTC CATTATAATC CACATGAGCA 12809 ||||||| || |||| ||||| |||| TACTTTCCGC AGGTTTAAAT TCAG...... .......... .......... .......... 261 ATAACAAGTA TGCTGATTTT TAATCTTTTG CAACTCTTCA GTACTTTATT GATAGATAAG 12749 .......... .......... .......... .......... .......... .......... 261 ATTTTGGAAC ATTGTTAAAA CTTCGAATGT AATATCTGGA ATTAAATATG ATGGTTGAAA 12689 .......... .......... .......... .......... .......... .......... 261 TAGAGGATTG TTATGGGACC ATGATCCAAC AAAGTGATAG GCAAGGTTTA GTAAGGTTGT 12629 .......... .......... .......... .......... .......... .......... 261 GAGACTAAGC ATGTAATATG GGGTTGTTAG GCTTCTAAAG TCCGAACATA TTCACAGGAT 12569 .......... .......... .......... .......... .......... .......... 261 GAATTTGTTA ATCTGGATGT TCGGACATAC AAGACTGATA AGATTAGAAA TGATAACTGA 12509 .......... .......... .......... .......... .......... .......... 261 CAGAAGGTGT AAGTACCCTA AAAGATGAAA AGAAGGGTCG TTGAAGACTA CTTATGTCTT 12449 .......... .......... .......... .......... .......... .......... 261 AAACAGTCAT CTTAGATAGT TAGGCTGTGT CGAACACTCA GGCATAGTAC AACAACTTGA 12389 .......... .......... .......... .......... .......... .......... 261 AGATCCTATG TTAGAATCCT AGTTAAAAAC TGTGTGAGTT CATATGCTCT AGTCCTGGTG 12329 .......... .......... .......... .......... .......... .......... 261 TCATATTTAC TCGATGGTTG TGCTTCTGAG CAGTATAATA AGTTCCCCGA TGAATTTATC 12269 .......... .......... .......... .......... .......... .......... 261 AAGGTAAAAG GTAGTTTGAT TACCTCTGTT ACACATCTTC TGTCATTATA CGCTATTTTC 12209 .......... .......... .......... .......... .......... .......... 261 AGTTCATCTT TTTCAATTCG CTGGCCAAAC TAAAATCTCC CTTATGTAAT GAGTTGCAGG 12149 | .......... .......... .......... .......... .......... .........G 262 TACTCCTTGC ATATTTGGTG TTTTGACATG TGATACCTTG GAGCAGGTAA GCAATTAGCC 12089 |||||||||| |||||||||| | ||||||| |||||||||| || ||| TACTCCTTGC ATATTTGGTG TCCTGACATG TGATACCTTG GAACAG.... .......... 308 GTTATACACG TGTGTTGGTG TTCTATCTCC TTTGTCCATT GTATAAATAG TGGTTGGAAT 12029 .......... .......... .......... .......... .......... .......... 308 AACAGATTCA CGGAAAGTAT AATCTGCATC AGCTGAGAGG TTATTATACC TTGTAGCTTT 11969 .......... .......... .......... .......... .......... .......... 308 GTACAGCGAG ATAGTCATTG TTGTTTCGTG TACTTGAACT GTTAGTTTTT TATGGATTTG 11909 .......... .......... .......... .......... .......... .......... 308 TGTAACATAT ACTCTCCTTG GAACTACATT TACAATATTC TTCTTGGCTG TGTGAGGCTA 11849 .......... .......... .......... .......... .......... .......... 308 TCTTGCTCAT GCATGCTAAC CAACATCGCT ATATAATGCA GGCTTTCAAT CGCGTTGGTG 11789 |||||| | || ||||||| .......... .......... .......... .......... .GCTTTCGAC CGTGTTGGTG 327 GGAAGGCTGG GAATAAAGGT TCCGAAACTG CATTGACTGC TGTAAGTTGC CTGTGATTGT 11729 |||||||||| |||||||||| | ||| | | ||||||| || GGAAGGCTGG GAATAAAGGT GCTGAAGCAG CATTGACAGC C......... .......... 368 GTGACAATTA GATTTTATCC TCAAAACAAG TTCATTTGTG TCTGATTCTG CTCAAATATA 11669 .......... .......... .......... .......... .......... .......... 368 TCTGCCTTAA TATGCGATTA AAAGGGTCTG ACTAATATAA TTGTTGAAAC TTAAACATCA 11609 .......... .......... .......... .......... .......... .......... 368 ATAACTTAAG ACTCAGTGTT CTGTTAGCAA TAAGTGACTG ATCTTGAATG CATATATTGT 11549 .......... .......... .......... .......... .......... .......... 368 GTTTAATTTT TCAGATTGAG ATGGCATCTT TGTTTGAGCA CCACCTAAAG CCTTCAGAGT 11489 |||||| |||||||||| |||||||||| ||| || ||| || | | ||| .......... ....ATTGAG ATGGCATCTT TGTTTGAGCA CCATCTGAAG CCATTACAGT 414 AG 11487 || AG 416 hqPGS_C12HBa0093P12.1-3-_SGN-E540633- (14247 14162,13885 13790,12923 12845,12149 12103,11807 11748,11534 11487) ******************************************************************************** EST sequence 17 +strand 823 n (File: SGN-E540635+) 1 TTAAACTTCA ATCCCTTTCC TTTTTCTTCA TTTTCATACA ATTTGGTGAA AGCAATAAAA 61 TTCTCTTTAC TTTGTGAACA TCAGAGTAAT GGCAACTACG GCATTTGTAG AATTCAGTGT 121 TGTTGTTCGT CCTCAATCGA ATTTCGTTAA TACTTCATAC TTGCAGCAGC TTCATAGTCA 181 TCGTCCCTTT TTATTCCTCA GCATTCCCAA ATCAACTGCT GCTCTCTCAT TCACTCAATC 241 TCAAGGTTTT GGATGTGGAA TTGAGAGACA ACAGTGTGAT CGTCGGGATT TTGTTCAAAC 301 ATCAGCTGTT CGAGAGTTGG CTGGTTCTCT TATTTCTGCC CAAGGACATC GTTTTGCTAT 361 TGTGGTGGCA CGTTTTAATG ATCTGATCAC CAAGAAGCTT TTGGAGGGAG CTTTGAACAC 421 ATTCAAGAGT TACTCAGTTA GAGAAGAAGA CATTGATGTT GTGTGGGTTC CTGGTAGTTT 481 TGAAATTGGT GTTGTCGCTC AGCAACTTGG CAAGTCAAGA AAGTATCAAT CAATATTGTG 541 TATCGGTGCG GTGATTAGAG GTGATACCTC CCATTATGAT GCTGTGGTTA ATGCTGCCAC 601 GTCTGGAGTA CTTTCCGCAG GTTTAAATTC AGGTACTCCT TGCATATTTG GTGTCCTGAC 661 ATGTGATACC TTGGAACAGG CTTTCGACCG TGTTGGTGGG AAAGGCTGGA AATAAGGTGC 721 TGAAGCAGCA TTGACAGCCA TTGAGATGGG CATCTTGTTT GAGCACCATC TGAAGCCATT 781 ACAGTAGGGA AATTTAAGAC CTCTTTTTTT TTTCTTTTTT TTT Predicted gene structure (within gDNA segment 18521 to 10017): Exon 1 16726 16721 ( 6 n); cDNA 209 214 ( 6 n); score: 1.000 Intron 1 16720 15210 (1511 n); Pd: 0.000 (s: 0), Pa: 0.000 (s: 0) Exon 2 15209 15178 ( 32 n); cDNA 215 245 ( 31 n); score: 0.781 Intron 2 15177 15099 ( 79 n); Pd: 0.952 (s: 0), Pa: 0.999 (s: 0.68) Exon 3 15098 14977 ( 122 n); cDNA 246 361 ( 116 n); score: 0.717 Intron 3 14976 14258 ( 719 n); Pd: 1.000 (s: 0.76), Pa: 0.972 (s: 0.98) Exon 4 14257 14162 ( 96 n); cDNA 362 457 ( 96 n); score: 0.917 Intron 4 14161 13886 ( 276 n); Pd: 0.991 (s: 0.86), Pa: 0.987 (s: 0.80) Exon 5 13885 13790 ( 96 n); cDNA 458 553 ( 96 n); score: 0.766 Intron 5 13789 12924 ( 866 n); Pd: 0.993 (s: 0.72), Pa: 0.995 (s: 0.84) Exon 6 12923 12845 ( 79 n); cDNA 554 632 ( 79 n); score: 0.873 Intron 6 12844 12150 ( 695 n); Pd: 0.993 (s: 0.90), Pa: 0.928 (s: 0.94) Exon 7 12149 12103 ( 47 n); cDNA 633 679 ( 47 n); score: 0.936 Intron 7 12102 11808 ( 295 n); Pd: 0.988 (s: 0.94), Pa: 0.398 (s: 0.77) Exon 8 11807 11748 ( 60 n); cDNA 680 739 ( 60 n); score: 0.775 Intron 8 11747 11535 ( 213 n); Pd: 0.998 (s: 0.77), Pa: 0.999 (s: 0) Exon 9 11534 11497 ( 38 n); cDNA 740 777 ( 38 n); score: 0.855 MATCH C12HBa0093P12.1-3- SGN-E540635+ 0.805 576 0.700 C PGS_C12HBa0093P12.1-3-_SGN-E540635+ (16726 16721,15209 15178,15098 14977,14257 14162,13885 13790,12923 12845,12149 12103,11807 11748,11534 11497) Alignment (genomic DNA sequence = upper lines): AAATCAGTTA AAGGGGTAAA AGAGGTATGA AACAAATTAA TTGTTGAGTA TCCCATACGA 16667 |||||| AAATCA.... .......... .......... .......... .......... .......... 214 ATCTCAGGCT GTGACATGAA CTCATTTGTG TGTATTAGGT ACAGGAAGAA ATCTAGAAAT 16607 .......... .......... .......... .......... .......... .......... 214 GGGATTCCTC ATCGCTTAAA CTTAAAGTTA ATTGCTTTAA AGTAGGATAA CTCTTTCCTG 16547 .......... .......... .......... .......... .......... .......... 214 CCTGCTTATT GACCACAACC ACACCAACCC ACAGTGACGG AGCCAGGAGT GTTACTAAGG 16487 .......... .......... .......... .......... .......... .......... 214 CGTATCACAA TATCAAAACT AGACAGACGT AAAAAAATAA GGAGAGTTAA CATATAGTAT 16427 .......... .......... .......... .......... .......... .......... 214 ATACATACAT ACATAAAATT CTAAACTACT ACCTGACTAA GGAAGACAAT GCCAGGCTCT 16367 .......... .......... .......... .......... .......... .......... 214 TGTGATTCCA ACAAGCGAAG ACCGGTTTTA ACTCCTTTCA AAACTCCATC ATCAGCATTC 16307 .......... .......... .......... .......... .......... .......... 214 TCATAATGAG TTTCTAGCAA TAAAGGCAAT CTCAAAAGCA ACTTAGCCAA TTGTGGTACC 16247 .......... .......... .......... .......... .......... .......... 214 ACTTCTTCGA ACCATTTTGC AGCTTCATCC CTAGGAAGCA ACTGCACAAA CAATTCAGCA 16187 .......... .......... .......... .......... .......... .......... 214 TTTAGTAAAC AATCGTATCA GTTCAAACTT TTTTTTTTCA TAATAAACCA GTAGCTTGTT 16127 .......... .......... .......... .......... .......... .......... 214 CGATCAAGTT TTTGAGAAGT CAAAAGTGTT TATTTTTAAA GAATCGAGGT GTTCGGCCAA 16067 .......... .......... .......... .......... .......... .......... 214 GATCTTACGA AAAACAACAC TCCTTCTCCC TCCGTTTAAA TTTATTTGCC TTTATTCAGA 16007 .......... .......... .......... .......... .......... .......... 214 GTGCACAAGC AAACGAAATT CGGACATAAA AACCTCAATC TATTTGAAAA CGTACTATAA 15947 .......... .......... .......... .......... .......... .......... 214 ATCACGATAA TAAAAAATAG AAACTTTTTT TCACAACCTA TCAACATCGA TCAAACACAA 15887 .......... .......... .......... .......... .......... .......... 214 ATTATTGCAC TAATAGTATC TTTTCAAATT AAGAATCGAA ACAACAAACT AATACTATAC 15827 .......... .......... .......... .......... .......... .......... 214 TTTAAAACAA AAAAAAAATG AAATTTATAT GTTCATGAAC AGAGAATTAT GGGAAACGAA 15767 .......... .......... .......... .......... .......... .......... 214 ATTTACATCA TCGAAGAACA GAGCAAAACC TTGAGAAGCA GAAGTATCAA TTGAAAAACT 15707 .......... .......... .......... .......... .......... .......... 214 GTGGAACGAG AGAGAGTTTC GAATATCGGA AATGGCGATG AATAGGCCTT CGCCGGAGTC 15647 .......... .......... .......... .......... .......... .......... 214 AATTTTGCTG TGATTAGGAC CTTTCGAGAT ATCGGTAAGT GCTTCAGTTA CCGGCGCCGG 15587 .......... .......... .......... .......... .......... .......... 214 CCAAAAAAGT GAAGATGATC GGAGTGAAAA GGGTAAAAAT GGAAGAATTG ACTTCAAGTC 15527 .......... .......... .......... .......... .......... .......... 214 TTCTCTGTTC TCCATTAAAG TTCGTTTTTT TCGACGTTAC ATAATATCAC ATATTTATAT 15467 .......... .......... .......... .......... .......... .......... 214 TTAATTTAAA AATAAAAATT ATTTTAAAAC AGATTAAAAA GAAAATAAAA TAATAACGAA 15407 .......... .......... .......... .......... .......... .......... 214 GATATAAATC AGAAACGCTA CTGTTACCAA AAACTCAACT GGAAGAAGAA AGTCAAATTA 15347 .......... .......... .......... .......... .......... .......... 214 CAGTCTGAGA AGATAAAAAT GGCGGCTTCA GCTTTCGGAC AGTGTAGTCT TCTTCCTCGT 15287 .......... .......... .......... .......... .......... .......... 214 ACAGTATCTT TGAATCCTCA GCAGTCTCAT CGTCAGCTCT GCAGTTTGTC TTTCCATAGA 15227 .......... .......... .......... .......... .......... .......... 214 CAAACTGTAA ATTCTTCACT TCCTGCACTG TCATTCACTC AGTCTATAGG TTAATATTAT 15167 || | |||| || |||||||||| | ||| || .......... .......AC- TGCTGCTCTC TCATTCACTC AATCTCAAG. .......... 245 TTCTTTCGAT TTCGATTCGA TTTGATACTG TTTGGGTTTA AATTGGGAAT TTTTTTGAAT 15107 .......... .......... .......... .......... .......... .......... 245 TTGATTAGGT TTTGGGTCTG CAATTGAGAG AC-ATTGTGT GGATCGAAAC GGGTCGGATT 15048 || ||||| | || ||||||||| || | |||| ||||| | | ||||| ........GT TTTGGATGTG GAATTGAGAG ACAACAGTGT -GATCG--TC --G--GGATT 290 TGTTTAAAAC GGATGCTGTT CGTCAGTTGA ATGGTTCAGT TATCTCTGCT AAGGGGCATC 14988 | || |||| |||||| || ||||| |||||| | ||| ||||| | || |||| TTGTTCAAAC ATCAGCTGTT CGAGAGTTGG CTGGTTCTCT TATTTCTGCC CAAGGACATC 350 GGTTTGCTAT TGCAAGTTTT TTCTCTCTTT TTCTTATAAT TATCCGTTTG ATTGCTCGAA 14928 | |||||||| | GTTTTGCTAT T......... .......... .......... .......... .......... 361 TTCTTCAAAA ATGTTGCTTC ACTTGTGTCG AATCAGCTAA AAATGACTAC TTTTGGAGTA 14868 .......... .......... .......... .......... .......... .......... 361 TCTGATAAGA CCCGTGGAGT TTTTTTAGAG TCTGAGCAAC ATCACTTTAC GATACTCTTT 14808 .......... .......... .......... .......... .......... .......... 361 GCATTTCATT TTATGTGGTG ATAAGAAATG AAGGATTGTA TCTCCATCAG ACTTGATTTA 14748 .......... .......... .......... .......... .......... .......... 361 TGTGATGCTT GTGTAGCTAT ATATATATAT ATATATATAT ATATGTCATT CTTTTTGGGA 14688 .......... .......... .......... .......... .......... .......... 361 CTGCAACAGA AGAAGTAGCT CTTTTCTTAC ATATAGGACT GCGATAGAGT ACATAGGAGG 14628 .......... .......... .......... .......... .......... .......... 361 GAGGAATAAA CATGCTAAAA GTACCTTAGT ATATGGTGTT TCATGCTTTG TTAATTCCAA 14568 .......... .......... .......... .......... .......... .......... 361 TAATAGCATG TCAATTGTCA TTAAGGGCAA CCTGAAATGT TAAAAAAGCG AAGAGTATTC 14508 .......... .......... .......... .......... .......... .......... 361 AAGAAAAGAA ATATCTCGTG TAAATTGAAA AGGAGGGAAC CGTAGTTTAA AGTGGTGATT 14448 .......... .......... .......... .......... .......... .......... 361 TTTAATTGCG AGAGTTAACA GTTTCATTTA AAGAAATTGT GAAAAGTCAA TTGATACAAT 14388 .......... .......... .......... .......... .......... .......... 361 TCGTGTTTGG TCTGTAGTTC TGTTATGTTT TTGGTTTTAC TTGAAGCCGT ATAGAAGTTG 14328 .......... .......... .......... .......... .......... .......... 361 AGTTCATTTC ATTGACAATG TTTCTAATAG AGTGTTATTG GAACTGACTG ACCGTCTGTT 14268 .......... .......... .......... .......... .......... .......... 361 GGTTTTAAAG GTGGTTGCAC GTTTTAATGA TCTGATCACC AAGAAGCTTT TGGAGGGAGC 14208 ||||| |||| |||||||||| |||||||||| |||||||||| |||||||||| .......... GTGGTGGCAC GTTTTAATGA TCTGATCACC AAGAAGCTTT TGGAGGGAGC 411 TTTGGAGACT TTCAAGAATT ACTCGGTTAG AGAGGAAGAT ATTGATGTAC GTTTTCTATA 14148 |||| | || ||||||| || |||| ||||| ||| ||||| |||||| TTTGAACACA TTCAAGAGTT ACTCAGTTAG AGAAGAAGAC ATTGAT.... .......... 457 TCTAATCCTC CGTTCCCTCA AAACCTGTAT TTTCTCTTAA ACTTGATGCA ATTACTACAG 14088 .......... .......... .......... .......... .......... .......... 457 AAGGTCCGCT TGCCATGAAA AATCTAAATG AGAGATTGAT TGCTACATAA TACCCCTCTA 14028 .......... .......... .......... .......... .......... .......... 457 TGACACGCTA TACTGGAGCA TAAAGATCCA CTATGGGATA AATGGAGAAT GTTGTATCAA 13968 .......... .......... .......... .......... .......... .......... 457 TAGATTAAAT ATAGAATAAG GCGTGGTGGG AAGTTGTCTC TTCCATTTCA TGGCGTCATC 13908 .......... .......... .......... .......... .......... .......... 457 TAGTTTTTTC CTCATAATGC AGGTTGTGTG GGTTCCTGGT TGTTTTGAAA TCGGCGTGAC 13848 |||||||| |||||||||| ||||||||| | || || .......... .......... ..GTTGTGTG GGTTCCTGGT AGTTTTGAAA TTGGTGTTGT 495 TGCACAGCTT CTTGGAAAGT CACAGAAA-T ATCACGCAAT ACTCTGCATT GGGGCTGTGG 13789 || |||| ||||| |||| || ||||| | |||| |||| | | || || || || ||| CGCTCAGCAA CTTGGCAAGT CA-AGAAAGT ATCAATCAAT ATTGTGTATC GGTGCGGTG. 553 TAAGCTCACT TCCAAACATA AGTTTACCTT GTCGTTTTTC CTTCTACATT ATTTGCCTAC 13729 .......... .......... .......... .......... .......... .......... 553 TACGATTCTG TTTCATATGA TGATATTTTT TGTATCGTGT ATGGAAATAT GGAAGATTAC 13669 .......... .......... .......... .......... .......... .......... 553 AGTCAGATGC TTAAGTTATT CTTTTGGTAT ATTCTGTTTT GGATCGTCTT ATGTGTTCAT 13609 .......... .......... .......... .......... .......... .......... 553 CATTGTAAGT GAACTAATCT TTTGAGTTAC GGAAGGGGAA GCAGCTTAGA GTATGATGAG 13549 .......... .......... .......... .......... .......... .......... 553 AAGCATTAAC TGTGTCTTTT GTTCCTGTTG AGAAGACTAG CATTTATTTG TGACGCTCTT 13489 .......... .......... .......... .......... .......... .......... 553 CCCTCAACAA TGGAACTATA AAGCTGAACA AATAAAACTC AAGCACTATT TTGACAGATA 13429 .......... .......... .......... .......... .......... .......... 553 GGCAAAATTA TTAAACAAAG AAGCTATTGA ACTTTAGATT CCTTACTGAA GATATATACT 13369 .......... .......... .......... .......... .......... .......... 553 TGGATTCATT AAATGGAAAT CTTTAACTAA TTTCCATTGA TGTGAGGGAG GGAAAAATCT 13309 .......... .......... .......... .......... .......... .......... 553 GTGGATTTAG CTCAGTGTAT TCTAATTTTT CTTTGTGGGA CGGGCTAAAC GTTTAGTTGT 13249 .......... .......... .......... .......... .......... .......... 553 TGTAGTTGGA CATAGTAGGG AGGCATGCGT GAATTTAGTG CCTTATTGTG AAGTAACCTC 13189 .......... .......... .......... .......... .......... .......... 553 ATCTGAACAT ACTGATAAAC ACTATATTAT TTTTGCCCCG GTGGATTTCT TTCTCAGTTA 13129 .......... .......... .......... .......... .......... .......... 553 ATAAATATCT GGAAAGGAAA AGGTAATATA AAGCTAAAAC TCTGAGCATT ACAAACCCGA 13069 .......... .......... .......... .......... .......... .......... 553 AGAAAGAAAA AATCCATCAC AGACAAGAGA TTGAAGTGCC TTTCTGCTTT TCCCTCTGTC 13009 .......... .......... .......... .......... .......... .......... 553 CGAATCATAA TCTAATATTT CTTTTTTTCT TGATGGTGAT GTGTATCATT GTTTTTAATT 12949 .......... .......... .......... .......... .......... .......... 553 GTATTTTTTG CTTTGTGGCT AGTAGATCAG AGGTGATACA TCTCACTACG ATGCAGTCGT 12889 || || ||||||||| || || || | |||| || || .......... .......... .....ATTAG AGGTGATACC TCCCATTATG ATGCTGTGGT 588 TAATGCTGCC ACATCTGGAG TACTTTCAGC AGGTCTAAAT TCAGGTGAGA TAACATACTC 12829 |||||||||| || ||||||| ||||||| || |||| ||||| |||| TAATGCTGCC ACGTCTGGAG TACTTTCCGC AGGTTTAAAT TCAG...... .......... 632 CATTATAATC CACATGAGCA ATAACAAGTA TGCTGATTTT TAATCTTTTG CAACTCTTCA 12769 .......... .......... .......... .......... .......... .......... 632 GTACTTTATT GATAGATAAG ATTTTGGAAC ATTGTTAAAA CTTCGAATGT AATATCTGGA 12709 .......... .......... .......... .......... .......... .......... 632 ATTAAATATG ATGGTTGAAA TAGAGGATTG TTATGGGACC ATGATCCAAC AAAGTGATAG 12649 .......... .......... .......... .......... .......... .......... 632 GCAAGGTTTA GTAAGGTTGT GAGACTAAGC ATGTAATATG GGGTTGTTAG GCTTCTAAAG 12589 .......... .......... .......... .......... .......... .......... 632 TCCGAACATA TTCACAGGAT GAATTTGTTA ATCTGGATGT TCGGACATAC AAGACTGATA 12529 .......... .......... .......... .......... .......... .......... 632 AGATTAGAAA TGATAACTGA CAGAAGGTGT AAGTACCCTA AAAGATGAAA AGAAGGGTCG 12469 .......... .......... .......... .......... .......... .......... 632 TTGAAGACTA CTTATGTCTT AAACAGTCAT CTTAGATAGT TAGGCTGTGT CGAACACTCA 12409 .......... .......... .......... .......... .......... .......... 632 GGCATAGTAC AACAACTTGA AGATCCTATG TTAGAATCCT AGTTAAAAAC TGTGTGAGTT 12349 .......... .......... .......... .......... .......... .......... 632 CATATGCTCT AGTCCTGGTG TCATATTTAC TCGATGGTTG TGCTTCTGAG CAGTATAATA 12289 .......... .......... .......... .......... .......... .......... 632 AGTTCCCCGA TGAATTTATC AAGGTAAAAG GTAGTTTGAT TACCTCTGTT ACACATCTTC 12229 .......... .......... .......... .......... .......... .......... 632 TGTCATTATA CGCTATTTTC AGTTCATCTT TTTCAATTCG CTGGCCAAAC TAAAATCTCC 12169 .......... .......... .......... .......... .......... .......... 632 CTTATGTAAT GAGTTGCAGG TACTCCTTGC ATATTTGGTG TTTTGACATG TGATACCTTG 12109 | |||||||||| |||||||||| | ||||||| |||||||||| .......... .........G TACTCCTTGC ATATTTGGTG TCCTGACATG TGATACCTTG 673 GAGCAGGTAA GCAATTAGCC GTTATACACG TGTGTTGGTG TTCTATCTCC TTTGTCCATT 12049 || ||| GAACAG.... .......... .......... .......... .......... .......... 679 GTATAAATAG TGGTTGGAAT AACAGATTCA CGGAAAGTAT AATCTGCATC AGCTGAGAGG 11989 .......... .......... .......... .......... .......... .......... 679 TTATTATACC TTGTAGCTTT GTACAGCGAG ATAGTCATTG TTGTTTCGTG TACTTGAACT 11929 .......... .......... .......... .......... .......... .......... 679 GTTAGTTTTT TATGGATTTG TGTAACATAT ACTCTCCTTG GAACTACATT TACAATATTC 11869 .......... .......... .......... .......... .......... .......... 679 TTCTTGGCTG TGTGAGGCTA TCTTGCTCAT GCATGCTAAC CAACATCGCT ATATAATGCA 11809 .......... .......... .......... .......... .......... .......... 679 GGCTTTCAAT CGCGTTGGTG GG-AAGGCTG GGAATAAAGG TTCCGAAACT GCATTGACTG 11750 |||||| | || ||||||| || ||||||| | ||| |||| | | ||| | |||||||| | .GCTTTCGAC CGTGTTGGTG GGAAAGGCTG GAAAT-AAGG TGCTGAAGCA GCATTGACAG 737 CTGTAAGTTG CCTGTGATTG TGTGACAATT AGATTTTATC CTCAAAACAA GTTCATTTGT 11690 | CC........ .......... .......... .......... .......... .......... 739 GTCTGATTCT GCTCAAATAT ATCTGCCTTA ATATGCGATT AAAAGGGTCT GACTAATATA 11630 .......... .......... .......... .......... .......... .......... 739 ATTGTTGAAA CTTAAACATC AATAACTTAA GACTCAGTGT TCTGTTAGCA ATAAGTGACT 11570 .......... .......... .......... .......... .......... .......... 739 GATCTTGAAT GCATATATTG TGTTTAATTT TTCAGATTGA GAT-GGCATC TTTGTTTGAG 11511 ||||| ||| |||||| ||||||||| .......... .......... .......... .....ATTGA GATGGGCATC -TTGTTTGAG 763 CACCACCTAA AGCC 11497 ||||| || | |||| CACCATCTGA AGCC 777 hqPGS_C12HBa0093P12.1-3-_SGN-E540635+ (15209 15178,15098 14977,14257 14162,13885 13790,12923 12845,12149 12103,11807 11748,11534 11497) ******************************************************************************** EST sequence 3 -strand 306 n (File: SGN-E226588-) 1 ATTTTCACCT TTAAACAGAA TCTCACTCGC ATTACCATCC CAACTAGTTA CTCCGACTAA 61 TAAGGTTAGA CATCGAGGGG GAAAAAGAGT TCCCTCTACT TCTCGGTTTC CTTCCTCGTC 121 TACTCATTGA TGCTGGATGT AAACATTCTT GTAAAAGCTG CACTTGTTTG AGAAAATGTT 181 GCACTTTGTT TCAAGTTTGA GTTTTGGATA ATAGTATTCT TTCAAGTTTT GAGTTTTGGA 241 CTTCAAATTG GTTCTATGTG AATTGGAACT TCAGCATTGA ATATAAAAAA AAAAAAAAAA 301 AAAAAA Predicted gene structure (within gDNA segment 12066 to 10254): Exon 1 11456 11174 ( 283 n); cDNA 1 283 ( 283 n); score: 0.993 PPA cDNA 285 306 MATCH C12HBa0093P12.1-3- SGN-E226588- 0.993 283 0.925 C PGS_C12HBa0093P12.1-3-_SGN-E226588- (11456 11174) Alignment (genomic DNA sequence = upper lines): ATTTTCACCT TTAAACAGAA TCTCACTCGC ATTACCATCC CAACTAGTTA CTCCGACTAA 11397 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTTTCACCT TTAAACAGAA TCTCACTCGC ATTACCATCC CAACTAGTTA CTCCGACTAA 60 TAAGGTTAGA CATCGAGGGG GAAAAAGAGT TCCCTCTACT TCTCGGTTTC CTTCCTCGTC 11337 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAAGGTTAGA CATCGAGGGG GAAAAAGAGT TCCCTCTACT TCTCGGTTTC CTTCCTCGTC 120 TACTCATTGA TGCTGGATGT AAACATTCTT GTAAAAGCTG CACTTGTTTG AGAAAATGTT 11277 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TACTCATTGA TGCTGGATGT AAACATTCTT GTAAAAGCTG CACTTGTTTG AGAAAATGTT 180 GCACTTTGTT TCAAGTTTGA GTTTTGGATA ATAGTATTCT TTCAAGTTTT GAGTTTTGGA 11217 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCACTTTGTT TCAAGTTTGA GTTTTGGATA ATAGTATTCT TTCAAGTTTT GAGTTTTGGA 240 CTTCAAATTG GTTCTATGTG AATTGGAACT TCAACATTGA TTA 11174 |||||||||| |||||||||| |||||||||| ||| |||||| || CTTCAAATTG GTTCTATGTG AATTGGAACT TCAGCATTGA ATA 283 hqPGS_C12HBa0093P12.1-3-_SGN-E226588- (11456 11174) ******************************************************************************** EST sequence 22 +strand 565 n (File: SGN-E288074+) 1 TCAAATTACA GTCTGAGAAG ATAAAAATGG CGGCTTCAGC TTTCGGACAG TGTAGTCTTC 61 TTCCTCGTAC AGTATCTTTG AATCCTCAGC AGTCTCATCG TCAGCTCTGC AGTTTGTCTT 121 TCCATAGACA AACTGTAAAT TCTTCACTTA CTGCACTGTC ATTCACTCAG TCTATAGGTT 181 TTGGGTCTGC AATTGAGAGA CATTGTGTGG ATCGAAACGG GTCGGATTTG TTTAAAACGG 241 ATGCTGTTCG TCAGTTGAAT GGTTCAGTTA TCTCTGCTAA GGGGCATCGG TTTGCTATTG 301 TGGTTGCACG TTTTAATGAT CTGATCACCA AGAAGCTTTT GGAGGGAGCT TTGGAGACTT 361 TCAAGAATTA CTCGGTTAGA GAGGAAGATA TTGATGTTGT GTGGGTTCCT GGTTGTTTTG 421 AAATCGGCGT GACTGCACAG CTTCTTGGAA AGTCACAGAA ATATCACGCA ATACTCTGCA 481 TTGGGGCTGT GATCAGAGGT GATACATCTC ACTACGATGC AGTCGTTAAT GCTGCCACAT 541 CTGGAGTACT TTCAGCAGGT CTAAA Predicted gene structure (within gDNA segment 15954 to 12240): Exon 1 15354 15178 ( 177 n); cDNA 1 177 ( 177 n); score: 0.994 Intron 1 15177 15099 ( 79 n); Pd: 0.952 (s: 0.98), Pa: 0.999 (s: 1.00) Exon 2 15098 14977 ( 122 n); cDNA 178 299 ( 122 n); score: 1.000 Intron 2 14976 14258 ( 719 n); Pd: 1.000 (s: 1.00), Pa: 0.972 (s: 1.00) Exon 3 14257 14162 ( 96 n); cDNA 300 395 ( 96 n); score: 1.000 Intron 3 14161 13886 ( 276 n); Pd: 0.991 (s: 1.00), Pa: 0.987 (s: 1.00) Exon 4 13885 13790 ( 96 n); cDNA 396 491 ( 96 n); score: 1.000 Intron 4 13789 12924 ( 866 n); Pd: 0.993 (s: 1.00), Pa: 0.995 (s: 1.00) Exon 5 12923 12850 ( 74 n); cDNA 492 565 ( 74 n); score: 1.000 MATCH C12HBa0093P12.1-3- SGN-E288074+ 0.998 565 1.000 C PGS_C12HBa0093P12.1-3-_SGN-E288074+ (15354 15178,15098 14977,14257 14162,13885 13790,12923 12850) Alignment (genomic DNA sequence = upper lines): TCAAATTACA GTCTGAGAAG ATAAAAATGG CGGCTTCAGC TTTCGGACAG TGTAGTCTTC 15295 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAAATTACA GTCTGAGAAG ATAAAAATGG CGGCTTCAGC TTTCGGACAG TGTAGTCTTC 60 TTCCTCGTAC AGTATCTTTG AATCCTCAGC AGTCTCATCG TCAGCTCTGC AGTTTGTCTT 15235 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCCTCGTAC AGTATCTTTG AATCCTCAGC AGTCTCATCG TCAGCTCTGC AGTTTGTCTT 120 TCCATAGACA AACTGTAAAT TCTTCACTTC CTGCACTGTC ATTCACTCAG TCTATAGGTT 15175 |||||||||| |||||||||| ||||||||| |||||||||| |||||||||| ||||||| TCCATAGACA AACTGTAAAT TCTTCACTTA CTGCACTGTC ATTCACTCAG TCTATAG... 177 AATATTATTT CTTTCGATTT CGATTCGATT TGATACTGTT TGGGTTTAAA TTGGGAATTT 15115 .......... .......... .......... .......... .......... .......... 177 TTTTGAATTT GATTAGGTTT TGGGTCTGCA ATTGAGAGAC ATTGTGTGGA TCGAAACGGG 15055 |||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ......GTTT TGGGTCTGCA ATTGAGAGAC ATTGTGTGGA TCGAAACGGG 221 TCGGATTTGT TTAAAACGGA TGCTGTTCGT CAGTTGAATG GTTCAGTTAT CTCTGCTAAG 14995 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCGGATTTGT TTAAAACGGA TGCTGTTCGT CAGTTGAATG GTTCAGTTAT CTCTGCTAAG 281 GGGCATCGGT TTGCTATTGC AAGTTTTTTC TCTCTTTTTC TTATAATTAT CCGTTTGATT 14935 |||||||||| |||||||| GGGCATCGGT TTGCTATT.. .......... .......... .......... .......... 299 GCTCGAATTC TTCAAAAATG TTGCTTCACT TGTGTCGAAT CAGCTAAAAA TGACTACTTT 14875 .......... .......... .......... .......... .......... .......... 299 TGGAGTATCT GATAAGACCC GTGGAGTTTT TTTAGAGTCT GAGCAACATC ACTTTACGAT 14815 .......... .......... .......... .......... .......... .......... 299 ACTCTTTGCA TTTCATTTTA TGTGGTGATA AGAAATGAAG GATTGTATCT CCATCAGACT 14755 .......... .......... .......... .......... .......... .......... 299 TGATTTATGT GATGCTTGTG TAGCTATATA TATATATATA TATATATATA TGTCATTCTT 14695 .......... .......... .......... .......... .......... .......... 299 TTTGGGACTG CAACAGAAGA AGTAGCTCTT TTCTTACATA TAGGACTGCG ATAGAGTACA 14635 .......... .......... .......... .......... .......... .......... 299 TAGGAGGGAG GAATAAACAT GCTAAAAGTA CCTTAGTATA TGGTGTTTCA TGCTTTGTTA 14575 .......... .......... .......... .......... .......... .......... 299 ATTCCAATAA TAGCATGTCA ATTGTCATTA AGGGCAACCT GAAATGTTAA AAAAGCGAAG 14515 .......... .......... .......... .......... .......... .......... 299 AGTATTCAAG AAAAGAAATA TCTCGTGTAA ATTGAAAAGG AGGGAACCGT AGTTTAAAGT 14455 .......... .......... .......... .......... .......... .......... 299 GGTGATTTTT AATTGCGAGA GTTAACAGTT TCATTTAAAG AAATTGTGAA AAGTCAATTG 14395 .......... .......... .......... .......... .......... .......... 299 ATACAATTCG TGTTTGGTCT GTAGTTCTGT TATGTTTTTG GTTTTACTTG AAGCCGTATA 14335 .......... .......... .......... .......... .......... .......... 299 GAAGTTGAGT TCATTTCATT GACAATGTTT CTAATAGAGT GTTATTGGAA CTGACTGACC 14275 .......... .......... .......... .......... .......... .......... 299 GTCTGTTGGT TTTAAAGGTG GTTGCACGTT TTAATGATCT GATCACCAAG AAGCTTTTGG 14215 ||| |||||||||| |||||||||| |||||||||| |||||||||| .......... .......GTG GTTGCACGTT TTAATGATCT GATCACCAAG AAGCTTTTGG 342 AGGGAGCTTT GGAGACTTTC AAGAATTACT CGGTTAGAGA GGAAGATATT GATGTACGTT 14155 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||| AGGGAGCTTT GGAGACTTTC AAGAATTACT CGGTTAGAGA GGAAGATATT GAT....... 395 TTCTATATCT AATCCTCCGT TCCCTCAAAA CCTGTATTTT CTCTTAAACT TGATGCAATT 14095 .......... .......... .......... .......... .......... .......... 395 ACTACAGAAG GTCCGCTTGC CATGAAAAAT CTAAATGAGA GATTGATTGC TACATAATAC 14035 .......... .......... .......... .......... .......... .......... 395 CCCTCTATGA CACGCTATAC TGGAGCATAA AGATCCACTA TGGGATAAAT GGAGAATGTT 13975 .......... .......... .......... .......... .......... .......... 395 GTATCAATAG ATTAAATATA GAATAAGGCG TGGTGGGAAG TTGTCTCTTC CATTTCATGG 13915 .......... .......... .......... .......... .......... .......... 395 CGTCATCTAG TTTTTTCCTC ATAATGCAGG TTGTGTGGGT TCCTGGTTGT TTTGAAATCG 13855 | |||||||||| |||||||||| |||||||||| .......... .......... .........G TTGTGTGGGT TCCTGGTTGT TTTGAAATCG 426 GCGTGACTGC ACAGCTTCTT GGAAAGTCAC AGAAATATCA CGCAATACTC TGCATTGGGG 13795 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCGTGACTGC ACAGCTTCTT GGAAAGTCAC AGAAATATCA CGCAATACTC TGCATTGGGG 486 CTGTGGTAAG CTCACTTCCA AACATAAGTT TACCTTGTCG TTTTTCCTTC TACATTATTT 13735 ||||| CTGTG..... .......... .......... .......... .......... .......... 491 GCCTACTACG ATTCTGTTTC ATATGATGAT ATTTTTTGTA TCGTGTATGG AAATATGGAA 13675 .......... .......... .......... .......... .......... .......... 491 GATTACAGTC AGATGCTTAA GTTATTCTTT TGGTATATTC TGTTTTGGAT CGTCTTATGT 13615 .......... .......... .......... .......... .......... .......... 491 GTTCATCATT GTAAGTGAAC TAATCTTTTG AGTTACGGAA GGGGAAGCAG CTTAGAGTAT 13555 .......... .......... .......... .......... .......... .......... 491 GATGAGAAGC ATTAACTGTG TCTTTTGTTC CTGTTGAGAA GACTAGCATT TATTTGTGAC 13495 .......... .......... .......... .......... .......... .......... 491 GCTCTTCCCT CAACAATGGA ACTATAAAGC TGAACAAATA AAACTCAAGC ACTATTTTGA 13435 .......... .......... .......... .......... .......... .......... 491 CAGATAGGCA AAATTATTAA ACAAAGAAGC TATTGAACTT TAGATTCCTT ACTGAAGATA 13375 .......... .......... .......... .......... .......... .......... 491 TATACTTGGA TTCATTAAAT GGAAATCTTT AACTAATTTC CATTGATGTG AGGGAGGGAA 13315 .......... .......... .......... .......... .......... .......... 491 AAATCTGTGG ATTTAGCTCA GTGTATTCTA ATTTTTCTTT GTGGGACGGG CTAAACGTTT 13255 .......... .......... .......... .......... .......... .......... 491 AGTTGTTGTA GTTGGACATA GTAGGGAGGC ATGCGTGAAT TTAGTGCCTT ATTGTGAAGT 13195 .......... .......... .......... .......... .......... .......... 491 AACCTCATCT GAACATACTG ATAAACACTA TATTATTTTT GCCCCGGTGG ATTTCTTTCT 13135 .......... .......... .......... .......... .......... .......... 491 CAGTTAATAA ATATCTGGAA AGGAAAAGGT AATATAAAGC TAAAACTCTG AGCATTACAA 13075 .......... .......... .......... .......... .......... .......... 491 ACCCGAAGAA AGAAAAAATC CATCACAGAC AAGAGATTGA AGTGCCTTTC TGCTTTTCCC 13015 .......... .......... .......... .......... .......... .......... 491 TCTGTCCGAA TCATAATCTA ATATTTCTTT TTTTCTTGAT GGTGATGTGT ATCATTGTTT 12955 .......... .......... .......... .......... .......... .......... 491 TTAATTGTAT TTTTTGCTTT GTGGCTAGTA GATCAGAGGT GATACATCTC ACTACGATGC 12895 ||||||||| |||||||||| |||||||||| .......... .......... .......... .ATCAGAGGT GATACATCTC ACTACGATGC 520 AGTCGTTAAT GCTGCCACAT CTGGAGTACT TTCAGCAGGT CTAAA 12850 |||||||||| |||||||||| |||||||||| |||||||||| ||||| AGTCGTTAAT GCTGCCACAT CTGGAGTACT TTCAGCAGGT CTAAA 565 hqPGS_C12HBa0093P12.1-3-_SGN-E288074+ (15354 15178,15098 14977,14257 14162,13885 13790,12923 12850) ******************************************************************************** EST sequence 20 +strand 427 n (File: SGN-E270528+) 1 TGTTACCAAA AACTCAACTG GAAGAAGAAA GTCAAATTAC AGTCTGAGAA GATAAAAATG 61 GCGGCTTCAG CTTTCGGACA GTGTAGTCTT CTTCCTCGTA CAGTATCTTT GAATCCTCAG 121 CAGTCTCATC GTCAGCTCTG CAGTTTGTCT TTCCATAGAC AAACTGTAAA TTCTTCACTT 181 CCTGCACTGT CATTCACTCA GTCTATAGGT TTTGGGTCTG CAATTGAGAG ACATTGTGTG 241 GATCGAAACG GGTCGGATTT GTTTAAAACG GATGCTGTTC GTCAGTTGAA TGGTTCAGTT 301 ATCTCTGCTA AGGGGCATCG GTTTGCTATT GTGGTTGCAC GTTTTAATGA TCTGATCACC 361 AAGAAGCTTT TGGAGGGAGC TTTGGAGACT TTCAAGAATT ACTCGGTTAG AGAGGAAGAT 421 ATTGATG Predicted gene structure (within gDNA segment 15985 to 13551): Exon 1 15385 15178 ( 208 n); cDNA 1 208 ( 208 n); score: 1.000 Intron 1 15177 15099 ( 79 n); Pd: 0.952 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 2 15098 14977 ( 122 n); cDNA 209 330 ( 122 n); score: 1.000 Intron 2 14976 14258 ( 719 n); Pd: 1.000 (s: 1.00), Pa: 0.972 (s: 1.00) Exon 3 14257 14162 ( 96 n); cDNA 331 426 ( 96 n); score: 1.000 MATCH C12HBa0093P12.1-3- SGN-E270528+ 1.000 426 0.998 C PGS_C12HBa0093P12.1-3-_SGN-E270528+ (15385 15178,15098 14977,14257 14162) Alignment (genomic DNA sequence = upper lines): TGTTACCAAA AACTCAACTG GAAGAAGAAA GTCAAATTAC AGTCTGAGAA GATAAAAATG 15326 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTACCAAA AACTCAACTG GAAGAAGAAA GTCAAATTAC AGTCTGAGAA GATAAAAATG 60 GCGGCTTCAG CTTTCGGACA GTGTAGTCTT CTTCCTCGTA CAGTATCTTT GAATCCTCAG 15266 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GCGGCTTCAG CTTTCGGACA GTGTAGTCTT CTTCCTCGTA CAGTATCTTT GAATCCTCAG 120 CAGTCTCATC GTCAGCTCTG CAGTTTGTCT TTCCATAGAC AAACTGTAAA TTCTTCACTT 15206 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGTCTCATC GTCAGCTCTG CAGTTTGTCT TTCCATAGAC AAACTGTAAA TTCTTCACTT 180 CCTGCACTGT CATTCACTCA GTCTATAGGT TAATATTATT TCTTTCGATT TCGATTCGAT 15146 |||||||||| |||||||||| |||||||| CCTGCACTGT CATTCACTCA GTCTATAG.. .......... .......... .......... 208 TTGATACTGT TTGGGTTTAA ATTGGGAATT TTTTTGAATT TGATTAGGTT TTGGGTCTGC 15086 ||| |||||||||| .......... .......... .......... .......... .......GTT TTGGGTCTGC 221 AATTGAGAGA CATTGTGTGG ATCGAAACGG GTCGGATTTG TTTAAAACGG ATGCTGTTCG 15026 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTGAGAGA CATTGTGTGG ATCGAAACGG GTCGGATTTG TTTAAAACGG ATGCTGTTCG 281 TCAGTTGAAT GGTTCAGTTA TCTCTGCTAA GGGGCATCGG TTTGCTATTG CAAGTTTTTT 14966 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| TCAGTTGAAT GGTTCAGTTA TCTCTGCTAA GGGGCATCGG TTTGCTATT. .......... 330 CTCTCTTTTT CTTATAATTA TCCGTTTGAT TGCTCGAATT CTTCAAAAAT GTTGCTTCAC 14906 .......... .......... .......... .......... .......... .......... 330 TTGTGTCGAA TCAGCTAAAA ATGACTACTT TTGGAGTATC TGATAAGACC CGTGGAGTTT 14846 .......... .......... .......... .......... .......... .......... 330 TTTTAGAGTC TGAGCAACAT CACTTTACGA TACTCTTTGC ATTTCATTTT ATGTGGTGAT 14786 .......... .......... .......... .......... .......... .......... 330 AAGAAATGAA GGATTGTATC TCCATCAGAC TTGATTTATG TGATGCTTGT GTAGCTATAT 14726 .......... .......... .......... .......... .......... .......... 330 ATATATATAT ATATATATAT ATGTCATTCT TTTTGGGACT GCAACAGAAG AAGTAGCTCT 14666 .......... .......... .......... .......... .......... .......... 330 TTTCTTACAT ATAGGACTGC GATAGAGTAC ATAGGAGGGA GGAATAAACA TGCTAAAAGT 14606 .......... .......... .......... .......... .......... .......... 330 ACCTTAGTAT ATGGTGTTTC ATGCTTTGTT AATTCCAATA ATAGCATGTC AATTGTCATT 14546 .......... .......... .......... .......... .......... .......... 330 AAGGGCAACC TGAAATGTTA AAAAAGCGAA GAGTATTCAA GAAAAGAAAT ATCTCGTGTA 14486 .......... .......... .......... .......... .......... .......... 330 AATTGAAAAG GAGGGAACCG TAGTTTAAAG TGGTGATTTT TAATTGCGAG AGTTAACAGT 14426 .......... .......... .......... .......... .......... .......... 330 TTCATTTAAA GAAATTGTGA AAAGTCAATT GATACAATTC GTGTTTGGTC TGTAGTTCTG 14366 .......... .......... .......... .......... .......... .......... 330 TTATGTTTTT GGTTTTACTT GAAGCCGTAT AGAAGTTGAG TTCATTTCAT TGACAATGTT 14306 .......... .......... .......... .......... .......... .......... 330 TCTAATAGAG TGTTATTGGA ACTGACTGAC CGTCTGTTGG TTTTAAAGGT GGTTGCACGT 14246 || |||||||||| .......... .......... .......... .......... ........GT GGTTGCACGT 342 TTTAATGATC TGATCACCAA GAAGCTTTTG GAGGGAGCTT TGGAGACTTT CAAGAATTAC 14186 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTAATGATC TGATCACCAA GAAGCTTTTG GAGGGAGCTT TGGAGACTTT CAAGAATTAC 402 TCGGTTAGAG AGGAAGATAT TGAT 14162 |||||||||| |||||||||| |||| TCGGTTAGAG AGGAAGATAT TGAT 426 hqPGS_C12HBa0093P12.1-3-_SGN-E270528+ (15385 15178,15098 14977,14257 14162) ******************************************************************************** EST sequence 18 +strand 369 n (File: SGN-E289600+) 1 AATTACAGTC TGAGAAGATA AAAATGGCGG CTTCAGCTTT CGGACAGTGT AGTCTTCTTC 61 CTCGTACAGT ATCTTTGAAT CCTCAGCAGT CTCATCGTCA GCTCTGCAGT TTGTCTTTCC 121 ATAGACAAAC TGTAAATTCT TCACTTCCTG CACTGTCATT CACTCAGTCT ATAGGTTTTG 181 GGTCTGCAAT TGAGAGACAT TGTGTGGATC GAAACGGGTC GGATTTGTTT AAAACGGATG 241 CTGTTCGTCA GTTGAATGGT TCAGTTATCT CTGCTAAGGG GCATCGGTTT GCTATTGTGG 301 TTGCACGTTT TAATGATCTG ATCACCAAGA AGCTTTTGGA GGGAGCTTTG GAGACTTTCA 361 AGAATTACT Predicted gene structure (within gDNA segment 15951 to 13575): Exon 1 15351 15178 ( 174 n); cDNA 1 174 ( 174 n); score: 1.000 Intron 1 15177 15099 ( 79 n); Pd: 0.952 (s: 1.00), Pa: 0.999 (s: 1.00) Exon 2 15098 14977 ( 122 n); cDNA 175 296 ( 122 n); score: 1.000 Intron 2 14976 14258 ( 719 n); Pd: 1.000 (s: 1.00), Pa: 0.972 (s: 1.00) Exon 3 14257 14185 ( 73 n); cDNA 297 369 ( 73 n); score: 1.000 MATCH C12HBa0093P12.1-3- SGN-E289600+ 1.000 369 1.000 C PGS_C12HBa0093P12.1-3-_SGN-E289600+ (15351 15178,15098 14977,14257 14185) Alignment (genomic DNA sequence = upper lines): AATTACAGTC TGAGAAGATA AAAATGGCGG CTTCAGCTTT CGGACAGTGT AGTCTTCTTC 15292 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTACAGTC TGAGAAGATA AAAATGGCGG CTTCAGCTTT CGGACAGTGT AGTCTTCTTC 60 CTCGTACAGT ATCTTTGAAT CCTCAGCAGT CTCATCGTCA GCTCTGCAGT TTGTCTTTCC 15232 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTCGTACAGT ATCTTTGAAT CCTCAGCAGT CTCATCGTCA GCTCTGCAGT TTGTCTTTCC 120 ATAGACAAAC TGTAAATTCT TCACTTCCTG CACTGTCATT CACTCAGTCT ATAGGTTAAT 15172 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||| ATAGACAAAC TGTAAATTCT TCACTTCCTG CACTGTCATT CACTCAGTCT ATAG...... 174 ATTATTTCTT TCGATTTCGA TTCGATTTGA TACTGTTTGG GTTTAAATTG GGAATTTTTT 15112 .......... .......... .......... .......... .......... .......... 174 TGAATTTGAT TAGGTTTTGG GTCTGCAATT GAGAGACATT GTGTGGATCG AAACGGGTCG 15052 ||||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ...GTTTTGG GTCTGCAATT GAGAGACATT GTGTGGATCG AAACGGGTCG 221 GATTTGTTTA AAACGGATGC TGTTCGTCAG TTGAATGGTT CAGTTATCTC TGCTAAGGGG 14992 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATTTGTTTA AAACGGATGC TGTTCGTCAG TTGAATGGTT CAGTTATCTC TGCTAAGGGG 281 CATCGGTTTG CTATTGCAAG TTTTTTCTCT CTTTTTCTTA TAATTATCCG TTTGATTGCT 14932 |||||||||| ||||| CATCGGTTTG CTATT..... .......... .......... .......... .......... 296 CGAATTCTTC AAAAATGTTG CTTCACTTGT GTCGAATCAG CTAAAAATGA CTACTTTTGG 14872 .......... .......... .......... .......... .......... .......... 296 AGTATCTGAT AAGACCCGTG GAGTTTTTTT AGAGTCTGAG CAACATCACT TTACGATACT 14812 .......... .......... .......... .......... .......... .......... 296 CTTTGCATTT CATTTTATGT GGTGATAAGA AATGAAGGAT TGTATCTCCA TCAGACTTGA 14752 .......... .......... .......... .......... .......... .......... 296 TTTATGTGAT GCTTGTGTAG CTATATATAT ATATATATAT ATATATATGT CATTCTTTTT 14692 .......... .......... .......... .......... .......... .......... 296 GGGACTGCAA CAGAAGAAGT AGCTCTTTTC TTACATATAG GACTGCGATA GAGTACATAG 14632 .......... .......... .......... .......... .......... .......... 296 GAGGGAGGAA TAAACATGCT AAAAGTACCT TAGTATATGG TGTTTCATGC TTTGTTAATT 14572 .......... .......... .......... .......... .......... .......... 296 CCAATAATAG CATGTCAATT GTCATTAAGG GCAACCTGAA ATGTTAAAAA AGCGAAGAGT 14512 .......... .......... .......... .......... .......... .......... 296 ATTCAAGAAA AGAAATATCT CGTGTAAATT GAAAAGGAGG GAACCGTAGT TTAAAGTGGT 14452 .......... .......... .......... .......... .......... .......... 296 GATTTTTAAT TGCGAGAGTT AACAGTTTCA TTTAAAGAAA TTGTGAAAAG TCAATTGATA 14392 .......... .......... .......... .......... .......... .......... 296 CAATTCGTGT TTGGTCTGTA GTTCTGTTAT GTTTTTGGTT TTACTTGAAG CCGTATAGAA 14332 .......... .......... .......... .......... .......... .......... 296 GTTGAGTTCA TTTCATTGAC AATGTTTCTA ATAGAGTGTT ATTGGAACTG ACTGACCGTC 14272 .......... .......... .......... .......... .......... .......... 296 TGTTGGTTTT AAAGGTGGTT GCACGTTTTA ATGATCTGAT CACCAAGAAG CTTTTGGAGG 14212 |||||| |||||||||| |||||||||| |||||||||| |||||||||| .......... ....GTGGTT GCACGTTTTA ATGATCTGAT CACCAAGAAG CTTTTGGAGG 342 GAGCTTTGGA GACTTTCAAG AATTACT 14185 |||||||||| |||||||||| ||||||| GAGCTTTGGA GACTTTCAAG AATTACT 369 hqPGS_C12HBa0093P12.1-3-_SGN-E289600+ (15351 15178,15098 14977,14257 14185) ******************************************************************************** EST sequence 5 +strand 581 n (File: SGN-E320920+) 1 TTGTGTTGGA TGGATTTTTA AGTATTCCTT TTGGAGACTG TTAGTTGATT TTAAGTGTTG 61 AATATTCACA TTTCCAATCA CCCTGTATAG TTTAACGGCA TAATCCACAG TGTGTCATGG 121 ACACCTGATG TTGATGTGGC AAATATATTT GGTTGGTGCC TAGATGATCA TTTTGTAAGT 181 TGGAGTGTTC AACTGACACA GTGGAGATGA GTTGAGGTGT AGATGTGCAT ACTCAAAGTT 241 GGAGTGTTTT CTTGCTAGTT GATGCCGAGT TTGAGTGTTT TCTTGCTAGT TGATGCCGAG 301 TTCAAGTGTC TGTTTCTGCA TTATGACTAG TTAAATTATC GACTTAACCT TATTATTGAA 361 CTCAAGACTT TGGTTCTTCT GTTCGTTATC TTATTTTCAT GCCTCATAAT TAATGTCAGG 421 ATTCTTTGGT TCAACTATTT AAAATGCCAC ACACAATACT TTGCTTGGCT GTTGATGAGT 481 GGAAGTATGT TATTACTATT GCTTGTTTTG TTTTATCTTT TATCTAAACC AATGGACCTA 541 TGTCGAATGG AGATAAAATA GGTATACTTA GTCTATGGGA T Predicted gene structure (within gDNA segment 18301 to 24719): Exon 1 20641 20802 ( 162 n); cDNA 103 262 ( 160 n); score: 0.784 MATCH C12HBa0093P12.1-3+ SGN-E320920+ 0.784 162 0.279 C PGS_C12HBa0093P12.1-3+_SGN-E320920+ (20641 20802) Alignment (genomic DNA sequence = upper lines): ATCGCCACTG TCTCATGGAC ACCCATCACT GGCGTGTCAC ATAAATTTTG GAGGTATTCA 20700 ||| || || | |||||||| ||| | | ||| || ||| |||| | ||| | ATCCACAGTG TGTCATGGAC ACCTGATGTT GATGTGGCAA ATATATTTGG TTGGTGCCTA 162 GAGGATCGTT TTGTAAGTTT GAGTGTTCAA CTGACACAAT TGAGATGAGT TGAGGTGCCT 20760 || |||| || ||||||||| |||||||||| |||||||| | ||||||||| ||||||| | GATGATCATT TTGTAAGTTG GAGTGTTCAA CTGACACAGT GGAGATGAGT TGAGGTG--T 220 AGATATGCAT ACTCAAAGTT GGAGTGTTTA TTTGCCATAT GA 20802 |||| ||||| |||||||||| ||||||||| |||| | | || AGATGTGCAT ACTCAAAGTT GGAGTGTTTT CTTGCTAGTT GA 262 hqPGS_C12HBa0093P12.1-3+_SGN-E320920+ (20641 20802) ******************************************************************************** EST sequence 24 +strand 635 n (File: SGN-E301519+) 1 TTTAATACAC ATTTTCAATG ATGAAAGGGA GATATTTAAA CTTTTAATAC CGAAAGGGCA 61 TGGTCTAGCT ATCAATGAAG TGAGTGAATT ATGAGTGATC GTGATTCAAA TCTCGGCAGA 121 AACAAAAAAA AAACACAATT TCTTCACGTG TGTTGACATT GGTTGATAAA GTTACTTGAT 181 ACTGTAGTAA GAAGTACTAG GCATCCGATG GAACATCGAG GTATGTGCAA GCTGATGACC 241 CTAAACATAT CATTAGTTAT CAAAGACTTA AAATTGTAAT CCAACCAAAT ATACCATATG 301 CTTCTTATTG GGGATGTTTA AAAAAAAAAG ACGTTCTACC TTCACGAGCT AGGGGTAAAG 361 TGTACGTACA CTCTATCCTC TCCTAACCCC ACCTATGAGA TTACATTGGG TACGTTGTTG 421 TTGCTTCTTA GTGGGACACC TTAGGAATAA ATAGCTTTTC TTCCTTTCAA ACCTTTCACC 481 CAGTTAAAAT ACCAATCAAT CAATTATACC ACAATATTAC AATATACAAG TATCTAACTT 541 GACTAGTATA CAAGTGTGAT AATAACAACT AGAACTTCAT ATAACCAACA ACTCCTCCTA 601 TGTGAGGTTC ACATGGTTGG TCTCAAGTTC GAATG Predicted gene structure (within gDNA segment 22796 to 20952): Exon 1 22196 21562 ( 635 n); cDNA 1 635 ( 635 n); score: 1.000 MATCH C12HBa0093P12.1-3- SGN-E301519+ 1.000 635 1.000 C PGS_C12HBa0093P12.1-3-_SGN-E301519+ (22196 21562) Alignment (genomic DNA sequence = upper lines): TTTAATACAC ATTTTCAATG ATGAAAGGGA GATATTTAAA CTTTTAATAC CGAAAGGGCA 22137 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTAATACAC ATTTTCAATG ATGAAAGGGA GATATTTAAA CTTTTAATAC CGAAAGGGCA 60 TGGTCTAGCT ATCAATGAAG TGAGTGAATT ATGAGTGATC GTGATTCAAA TCTCGGCAGA 22077 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGTCTAGCT ATCAATGAAG TGAGTGAATT ATGAGTGATC GTGATTCAAA TCTCGGCAGA 120 AACAAAAAAA AAACACAATT TCTTCACGTG TGTTGACATT GGTTGATAAA GTTACTTGAT 22017 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AACAAAAAAA AAACACAATT TCTTCACGTG TGTTGACATT GGTTGATAAA GTTACTTGAT 180 ACTGTAGTAA GAAGTACTAG GCATCCGATG GAACATCGAG GTATGTGCAA GCTGATGACC 21957 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ACTGTAGTAA GAAGTACTAG GCATCCGATG GAACATCGAG GTATGTGCAA GCTGATGACC 240 CTAAACATAT CATTAGTTAT CAAAGACTTA AAATTGTAAT CCAACCAAAT ATACCATATG 21897 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTAAACATAT CATTAGTTAT CAAAGACTTA AAATTGTAAT CCAACCAAAT ATACCATATG 300 CTTCTTATTG GGGATGTTTA AAAAAAAAAG ACGTTCTACC TTCACGAGCT AGGGGTAAAG 21837 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTCTTATTG GGGATGTTTA AAAAAAAAAG ACGTTCTACC TTCACGAGCT AGGGGTAAAG 360 TGTACGTACA CTCTATCCTC TCCTAACCCC ACCTATGAGA TTACATTGGG TACGTTGTTG 21777 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTACGTACA CTCTATCCTC TCCTAACCCC ACCTATGAGA TTACATTGGG TACGTTGTTG 420 TTGCTTCTTA GTGGGACACC TTAGGAATAA ATAGCTTTTC TTCCTTTCAA ACCTTTCACC 21717 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGCTTCTTA GTGGGACACC TTAGGAATAA ATAGCTTTTC TTCCTTTCAA ACCTTTCACC 480 CAGTTAAAAT ACCAATCAAT CAATTATACC ACAATATTAC AATATACAAG TATCTAACTT 21657 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAGTTAAAAT ACCAATCAAT CAATTATACC ACAATATTAC AATATACAAG TATCTAACTT 540 GACTAGTATA CAAGTGTGAT AATAACAACT AGAACTTCAT ATAACCAACA ACTCCTCCTA 21597 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GACTAGTATA CAAGTGTGAT AATAACAACT AGAACTTCAT ATAACCAACA ACTCCTCCTA 600 TGTGAGGTTC ACATGGTTGG TCTCAAGTTC GAATG 21562 |||||||||| |||||||||| |||||||||| ||||| TGTGAGGTTC ACATGGTTGG TCTCAAGTTC GAATG 635 hqPGS_C12HBa0093P12.1-3-_SGN-E301519+ (22196 21562) ******************************************************************************** EST sequence 6 +strand 624 n (File: SGN-E542827+) 1 TTTTCTTTTT TTTTGGTTAA TGACTAATTT ATAATTATTA TTTTGATAAT CAAATTTATT 61 TATATTTCAC TAATATTCTT GTAAAACTTA TTGTAGATGA CCAAAATTTT TCTTTGAATA 121 CCAAATTAAA TTACAATACA CACAAAAAAA ATAGTTTAGT TTTTTTTCTC TTTAAACTAA 181 GGAATGAAAG AAAAAAAATT AGAATAACAA ACTCAAATAA TTATAATAAA AGAAGTCAAA 241 CAATAATTTA TGTATAAAAA AAATTAAATA TAACCTCGAA CTTTGATAGA AGAATAATAT 301 ATACCTTTAA ATAATTTTTT TAAAAACAAT CAAAAGTAAT AAATATAAAT TTAAAATTAA 361 TTTTTTAATA TATATTAGCC ATTTTGTAAC GACAGTGCTG CAAATGACAA AATTGAGAAA 421 TATATCAAAC TTTTTCCGTA AAATAGATAA ACTTAAAAGA GGATATTTGT AAACAACACA 481 AAATCTTCAA TCAAATACAA AGTTCAAACA CTAGCAGTGA ACCAAATCAT CGAAAAGTTA 541 TAAAATTGTT AGAAATTTCT ACCATATATT GGATGAAAAC ATTGAAATTT CCTGGAATTT 601 TGAACTTTAG GTTGCTGGTT TTGG Predicted gene structure (within gDNA segment 22234 to 27618): Exon 1 22995 23359 ( 365 n); cDNA 1 365 ( 365 n); score: 0.856 Intron 1 23360 23407 ( 48 n); Pd: 0.000 (s: 0.81), Pa: 0.985 (s: 0) Exon 2 23408 23434 ( 27 n); cDNA 366 391 ( 26 n); score: 0.704 Intron 2 23435 23471 ( 37 n); Pd: 0.347 (s: 0), Pa: 0.000 (s: 0) Exon 3 23472 23491 ( 20 n); cDNA 392 410 ( 19 n); score: 0.650 Intron 3 23492 24032 ( 541 n); Pd: 0.393 (s: 0), Pa: 0.000 (s: 0.64) Exon 4 24033 24101 ( 69 n); cDNA 411 477 ( 67 n); score: 0.652 Intron 4 24102 24705 ( 604 n); Pd: 0.000 (s: 0.66), Pa: 0.000 (s: 0) Exon 5 24706 24720 ( 15 n); cDNA 478 492 ( 15 n); score: 0.667 Intron 5 24721 27133 (2413 n); Pd: 0.288 (s: 0), Pa: 0.753 (s: 0) Exon 6 27134 27172 ( 39 n); cDNA 493 528 ( 36 n); score: 0.590 Intron 6 27173 27395 ( 223 n); Pd: 0.893 (s: 0), Pa: 0.416 (s: 0) Exon 7 27396 27404 ( 9 n); cDNA 529 537 ( 9 n); score: 0.667 MATCH C12HBa0093P12.1-3+ SGN-E542827+ 0.824 544 0.872 C PGS_C12HBa0093P12.1-3+_SGN-E542827+ (22995 23359,23408 23434,23472 23491,24033 24101,24706 24720,27134 27172,27396 27404) Alignment (genomic DNA sequence = upper lines): TTTTGACTTT TTTGGTTTCA ATGACTAATT TATAATTATT ATTTTGATAA TCAAATTTAT 23054 |||| ||| ||| | || | |||||||||| |||||||||| |||||||||| |||||||||| TTTTCTTTTT TTTTGGTT-A ATGACTAATT TATAATTATT ATTTTGATAA TCAAATTTAT 59 TTATGTTTCA CTAATATTCT TGTAAAACTT GTTGTAGATG ACCAAATTTT TTCTTCGAAT 23114 |||| ||||| |||||||||| |||||||||| ||||||||| |||||| ||| ||||| |||| TTATATTTCA CTAATATTCT TGTAAAACTT ATTGTAGATG ACCAAAATTT TTCTTTGAAT 119 ACAAAATTAA ATTACAAT-- ACACAAAAAA AATAGTTTAA TTTTTTTCTT TAAACTAAGG 23172 || ||||||| |||||||| |||||||||| ||||||||| ||||||| | | || ACCAAATTAA ATTACAATAC ACACAAAAAA AATAGTTTAG TTTTTTT-TC TCTTTAAACT 178 AATGAAAGAA AAAAAACAAA ATAAGAATAA GAAACTCAAA TAATTATAAT AAAAGAAGTT 23232 || ||| ||| | |||| ||| || ||||||| ||||||||| |||||||||| ||||||||| AAGGAATGAA AGAAAA-AAA ATTAGAATAA CAAACTCAAA TAATTATAAT AAAAGAAGTC 237 AAAAAATAAT TTATGTATCA AAAAAAATTA AA-ATATACC TTGAACTTTG ATAGAAGAAT 23291 ||| |||||| |||||||| | |||||||||| || ||| ||| | |||||||| |||||||||| AAACAATAAT TTATGTAT-A AAAAAAATTA AATATA-ACC TCGAACTTTG ATAGAAGAAT 295 CATATATACC CCTAAATCAT TTTTTTTAAA AAAAT--GAA GTAAAAAATA TAAATTTAAA 23349 ||||||||| ||||| || |||||| ||| | ||| || |||| ||||| |||||||||| AATATATACC TTTAAATAAT TTTTTTAAAA ACAATCAAAA GTAATAAATA TAAATTTAAA 355 ATTAATTTTT TAACATCCGT TAAATGAAGG GTATATGTGA GCCATTTTGT AACGGCAGGG 23409 |||||||||| ATTAATTTTT .......... .......... .......... .......... ........TA 367 GTATATGTGA GCCGATTGTA TAACGGTAAG GGCATATATG AGCCACTTTT ATAACGAGGG 23469 ||||| | | ||| ||| | ||||| ATATATATTA GCC-ATTTTG TAACG..... .......... .......... .......... 391 ATATATTAGC TCCAAATGAG GGGTATATCA GACCCTTTTC CCTTGAATAT ATGTTACCTT 23529 | | | || | ||||||| ..ACAGT-GC TGCAAATGAC AA........ .......... .......... .......... 410 CCACCAGTAT TGGATAACTC AAGCTTTTAT TATTTTTTTC TTATATCTTA GTAAAAGGTA 23589 .......... .......... .......... .......... .......... .......... 410 GTGATTTTGC TACTAGAATC TTTTGCATTT TCTTTTTAGG CATTTATATA TTTACTTAAT 23649 .......... .......... .......... .......... .......... .......... 410 TTTTCATTTC CTCCAAATTT CTCATGCCAT GAACAATCTT AGCTCCAATG CTTTATGAGG 23709 .......... .......... .......... .......... .......... .......... 410 AATTTGAGTT ATCAAAAAAT TTAATTGATT TATGTCTATT TTGTAAATAT ACAAACTCAT 23769 .......... .......... .......... .......... .......... .......... 410 ATTTGATCAT ATCAAAGGGT TTTCCAACTA ATATTTGTAA TTTTGATGTT ATGGTATGAT 23829 .......... .......... .......... .......... .......... .......... 410 AAAAAAAAAT CATTTTTAGT GGATAAGGTT TAATTACTAA ACAATAATTA TTCTTTTATT 23889 .......... .......... .......... .......... .......... .......... 410 ATTATTTTTG TTTATCCATA CCTAAGATGG AGAAGACAGC ACACATCTCA TCATAGTCGC 23949 .......... .......... .......... .......... .......... .......... 410 AAAAATACAA ATACCCCACC CCCACTCCAA AACAAAAATA AAAATCGTGA AATTATTAAT 24009 .......... .......... .......... .......... .......... .......... 410 ACATCGTATT TAAATTTCTA TAAAATTGAA TAAATAAAAA TCACATTTTA AGTAAAATAA 24069 ||||| | ||||| | || |||| |||||||| .......... .......... ...AATTG-A GAAATATATC AAACTTTTTC CGTAAAATAG 446 ACATATAAAA CACGATAAAA GTTGTATAAA ACGTGTATAT CTACTTGTAG AAATTTTATA 24129 | | | || | || | | ||||| | | || ATAAACTTAA -AAGAGGATA TTTGTAAACA AC........ .......... .......... 477 CAAATTTAAA TATCTCCTTC TGCTTAATCT TGCAAGATCA TCATGCAATA CAAGTTATTA 24189 .......... .......... .......... .......... .......... .......... 477 AATTAAATCT TATAACTTTA TTGTTTTATT ATAAATAAAG TTCAATTATT ATCTCATACG 24249 .......... .......... .......... .......... .......... .......... 477 TAAAAATCAA CTGAAATTAC AACACAACAA TTATCTTTTC TCATTATTCA TATTCATTTC 24309 .......... .......... .......... .......... .......... .......... 477 AAACTACAAC ACATTTCATC AAAATCAACG AACTCAAGTA CAAAGAAAAT CATGAAAAAG 24369 .......... .......... .......... .......... .......... .......... 477 ACACCCAAAT AACCAAAACA AATTAAACAA AAACCTTAAA AATTCAATAA CTAGATAATG 24429 .......... .......... .......... .......... .......... .......... 477 CACTCGAATT TCTAAGTCCA TAACACCCAA ATTTTGGAAG CATATTGTTG ATAACAACGT 24489 .......... .......... .......... .......... .......... .......... 477 TAGAAATACG ATGATACGCT TCATCTGTCA GATGAAGCCC GTCCCAATTA ACGTATTGGG 24549 .......... .......... .......... .......... .......... .......... 477 ACGGGTTAGG ACATACATTG GTACTAGCTG AACTACATCC CCCGGCATTG TATCGTCCTC 24609 .......... .......... .......... .......... .......... .......... 477 CGCTCCCACA ACATGCCGAT ACCAATGTAC TTGGATTAAA TCCCAACCAA GACGCGTATC 24669 .......... .......... .......... .......... .......... .......... 477 GAAAAACAAG CCTAAAACCA CCATAGTAAT CCCCATACAC AATTTTAACA CGTGGGAACT 24729 ||| ||| || | | .......... .......... .......... ......ACAA AATCTTCAAT C......... 492 CACATCGTAG ATTTTCTAGA GCCTTCTTTA GCTCGATATT ATGATACGTA GCGAAATCAT 24789 .......... .......... .......... .......... .......... .......... 492 TATAAAATTT CAAACAACCA TATTGATCAT AAGCATTTGG ATTCGTGTCA GCGAATCTTG 24849 .......... .......... .......... .......... .......... .......... 492 TTAGATATGA TGATAGACAC CCAAAAGGAA AAACTCCTGG AACCAAAATT CGAGTTGATC 24909 .......... .......... .......... .......... .......... .......... 492 CTAATTGGAT CACCTCTTTG ATGGCACTTA TAATGCCATC AATAATAAAA GGTACGTACG 24969 .......... .......... .......... .......... .......... .......... 492 TACGTACCTC AGGTTCAGGT TTATTTCCCG CTAAAGCATT CCAATAATCA ACTCCACCAA 25029 .......... .......... .......... .......... .......... .......... 492 ACTCTCCTAA TACTACAAGA GAGTTTCGTA GAGTTGTAGA ATATTTTGAC CCATAGGTTG 25089 .......... .......... .......... .......... .......... .......... 492 ACTGGAGGTG AGATTTAAAC CACTCTAATT GACTCGGGAG AGGAACGTTG AACGGGACAT 25149 .......... .......... .......... .......... .......... .......... 492 GTCCAATGCC CCTCTCCTCC AAAAAAGAGT TATTCATCGC CGTTGCACCA GCAACGGCGA 25209 .......... .......... .......... .......... .......... .......... 492 AATTAGCACC TTGACTAAAG GAAACACCTG ATTTGTCCAT GTAAGGATTG AGGAATGGGA 25269 .......... .......... .......... .......... .......... .......... 492 GACTGAGAGC CGTGGCGATA TAGTCAGCGA TAATACGACC GTCAGAAAAA CGTCCAGTAG 25329 .......... .......... .......... .......... .......... .......... 492 GTTTATGAAA AAAGGTTTCA CCATAAGGTA GACCCCATGC CTGGGCCGAT ATGACGGCAC 25389 .......... .......... .......... .......... .......... .......... 492 CAGGTATGCG GATCACGTTT CCAGCATCCG CGATAGAATC GCCGAACTGA AAAAGTGATG 25449 .......... .......... .......... .......... .......... .......... 492 TGATGTGACA TCTTGTTAGG ACATCACAAT GAGAAGAGTG AAAAAATAAA GTTATTATTA 25509 .......... .......... .......... .......... .......... .......... 492 TTAGTAAAAG AAAGAAAGAA AATTTCGATA GAAAAAGAGA AGCCATAATT ATGAATGATT 25569 .......... .......... .......... .......... .......... .......... 492 TTTTTTTGGT TGAAAACTTT GGAATAGTAA AGTGGTATTT ATAGTGAAAA AATAAAACGT 25629 .......... .......... .......... .......... .......... .......... 492 ACATTGCATG TTAGTTTAGA TCGTGATTCA ATACAACACA TATACTAATA AATATATTAT 25689 .......... .......... .......... .......... .......... .......... 492 TTAATGATTT TTTTAATTCA AACATCAAAT GTCTGAATTA TATTTCAATT TTGACCGAAA 25749 .......... .......... .......... .......... .......... .......... 492 TTATTGTAAC AATATCGAAC TTTGGAAATG ACTTTTTACC CTGCACTATT TAATAGTGTA 25809 .......... .......... .......... .......... .......... .......... 492 TTTTAAAGGT ATATATGTGT TCACGTGAAC ATCATAAATA TTACATCATT ATAAATAGTT 25869 .......... .......... .......... .......... .......... .......... 492 ATGTGTCCAT GTTGTCACAT ATATACCTTT AAAATACACT ATTAAATAGT GCAATGGTAA 25929 .......... .......... .......... .......... .......... .......... 492 AAGATCCTCC ATAAAGTTTG GTATCGTAAT AACAATTTCG ATCAAAGTTA AAATATTTTT 25989 .......... .......... .......... .......... .......... .......... 492 CAGACTATTT TTCTACGATT TCATTATTTA GTGAATGTTT TAGGTTAAAT ATTAAACAAA 26049 .......... .......... .......... .......... .......... .......... 492 CAACTAACAC ACACCGACAC AATATACGTA CTTACACAAT TACGATATCG ATATTTATTA 26109 .......... .......... .......... .......... .......... .......... 492 TTTTCAAACT AAATTTTGGA AGTTCGATAT TTAATACCTA ATATTAATTG GGATCCTATT 26169 .......... .......... .......... .......... .......... .......... 492 TATCCTCACG TAGTGTAAGG TGTAAGATAA CCTTTGGCTA ATCGAAAGTA CAAAATACAT 26229 .......... .......... .......... .......... .......... .......... 492 ATATAATTAA AAAGGTATAC TTACCAAGTC TTATCTTATA ATATATATAT ATATATATAT 26289 .......... .......... .......... .......... .......... .......... 492 ATATAGTCTT CTAATTATAA TATTACAAAA TAATTATTAT TAATTTCCAA TTAAGAAAAT 26349 .......... .......... .......... .......... .......... .......... 492 TAGCAGCCTA AATACAGAAC ATTTGTCTAA ATAAAATTTA ATATAGTGCA TGTGCTTGTA 26409 .......... .......... .......... .......... .......... .......... 492 TAACAAGAAG ACAAGATGCA TTTAGTTAGG GACAAAATTT ATAGTTAATT TACTTAAATT 26469 .......... .......... .......... .......... .......... .......... 492 ATGATATGAT TTATCTAAAA AAGGACTCAA AATAGACTTT GGTTGAGGCT TTTGGTAGGG 26529 .......... .......... .......... .......... .......... .......... 492 AGTGATTTTT CGAATAGTTA AATTTTAATG TAGATAATAT TGAATATTAG GTGAAATTTT 26589 .......... .......... .......... .......... .......... .......... 492 TTTTTAAGAA AAAAAATTTG AGTTGTATAG CTATTCGTTT ATATATATAT ATAATATATT 26649 .......... .......... .......... .......... .......... .......... 492 GGTCGGAATT TTGGGTGCGT TTGGTATGGA GGAAAATATT TTTGGATTTT TTATGTTAGC 26709 .......... .......... .......... .......... .......... .......... 492 TAATAAAAAA TATTGATTTT TTTTTATTTT GAGAAAACAA GGTTTATTTT TTAGAAAAAA 26769 .......... .......... .......... .......... .......... .......... 492 AAGTGAGGAA AATGATTTTT CGCAATAGCT ATACAATAAA ATTGTTATTT CTTAGCAAAC 26829 .......... .......... .......... .......... .......... .......... 492 AAGCTCACCG GCAGATTTCA CTATCATCCA CACGACAATT TAACTTTTCG AAAAATTACT 26889 .......... .......... .......... .......... .......... .......... 492 CACAACCAAA AGCCTCCACC AAACGTCATT ACGCCACCTC CCTAGACATA AATAAAGGAG 26949 .......... .......... .......... .......... .......... .......... 492 TCAGTTGGTT TTCATATCGT CTGGCCACAG ATTACACCCC TCTGCATCTT CTCATTAAAA 27009 .......... .......... .......... .......... .......... .......... 492 TCATCCAAAT ATTTGTCTAT TTCCACAAAC TAACACCGCA CGATATTACC TTCATATCTA 27069 .......... .......... .......... .......... .......... .......... 492 CTCCTGATTA TCATTGCCAA CAAATAAATC TACATTCTCA GCCCTACCTT CTCTGATTTA 27129 .......... .......... .......... .......... .......... .......... 492 TTAGATACAC AAATGAATAC AACAACGATT ATACATCAAA ATCGCATCGT TTCAATACTA 27189 | | || ||| | | | || || | | | | || ||| ....AAATAC AAA-G--TTC AAACACTAGC AGTGAACCAA ATC....... .......... 528 CATCACCATC ATTCTCTCTC CTACTTACCC GCGCCCCACG TTGACCACAG ATGTCCTTAT 27249 .......... .......... .......... .......... .......... .......... 528 TAACTTACTC GCTCCCCACC TTCTGCATAC TGGGCCTTTT GGCCACTAGA AAGACTACTC 27309 .......... .......... .......... .......... .......... .......... 528 ACTGTGTCCG ACACCTCACG ACCACTCCAC TACCCCAGTC CCGTGTAATA CAATGATATT 27369 .......... .......... .......... .......... .......... .......... 528 CACTTTTTTC ACTCTCACTC GCACAGACGG ATAAG 27404 | | | ||| .......... .......... ......ATCG AAAAG 537 hqPGS_C12HBa0093P12.1-3+_SGN-E542827+ (22995 23359,23408 23434) ******************************************************************************** EST sequence 4 +strand 460 n (File: SGN-E243215+) 1 TTATTGTAGA TGACCAATTT TTTCTTCGAA TACGAAATTA AATTACAATA CACACAAAAA 61 AAATATTTGA ATTTTTTTTA TTTAAACTAA GGAATGAAAG AAAAAAACAA AATAAGAATA 121 AGAAACTCAA ATTATTATAA TAAAAGAAGT CAAAAAATAA TTTTTGTATG AAAAAATTAA 181 AATATACCTT GAACTTTGAT AGAAGAATCA TATATATCCC TAAATATTTT TTTTAAAAAA 241 AAATTAGAAG TAACAAATAT AAATTTAAAA CTAATTTTTT AACTTTCGTT AAATGAAGGG 301 TATATGTGAG CCATTTTCTA ACGGCAGGGG TATATGTGAG CCGTTTGTAT AACGATAAGG 361 GCATATATGA ACCACTTTTA TTACGAGGGA TATATCAGCT CTAAATGACA AAGTTGAGAG 421 GTATATCAGA CCCTTTTCCC TATTTTTTAA AATTTCATAC Predicted gene structure (within gDNA segment 22349 to 25203): Exon 1 23083 23488 ( 406 n); cDNA 1 408 ( 408 n); score: 0.894 MATCH C12HBa0093P12.1-3+ SGN-E243215+ 0.894 406 0.883 C PGS_C12HBa0093P12.1-3+_SGN-E243215+ (23083 23488) Alignment (genomic DNA sequence = upper lines): TTGTTGTAGA TGACCAAATT TTTTCTTCGA ATACAAAATT AAATTACAAT --ACACAAAA 23140 || ||||||| ||||| |||| |||||||||| |||| ||||| |||||||||| |||||||| TTATTGTAGA TGACC-AATT TTTTCTTCGA ATACGAAATT AAATTACAAT ACACACAAAA 59 AAAATAGTTT AA-TTTTTTT CTTTAAACTA AGGAATGAAA GAAAAAAAAC AAAATAAGAA 23199 |||||| || || ||||||| ||||||||| |||||||||| || ||||||| |||||||||| AAAATATTTG AATTTTTTTT ATTTAAACTA AGGAATGAAA GA-AAAAAAC AAAATAAGAA 118 TAAGAAACTC AAATAATTAT AATAAAAGAA GTTAAAAAAT AATTTATGTA TCAAAAAAAA 23259 |||||||||| |||| ||||| |||||||||| || ||||||| ||||| |||| | |||||| TAAGAAACTC AAATTATTAT AATAAAAGAA GTCAAAAAAT AATTTTTGTA T--GAAAAAA 176 TTAAAATATA CCTTGAACTT TGATAGAAGA ATCATATATA CCCCTAAATC ATTTTTTT-T 23318 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| ||||||| TTAAAATATA CCTTGAACTT TGATAGAAGA ATCATATATA TCCCTAAATA TTTTTTTTAA 236 AAAAAAA-T- GAAGTAAAAA ATATAAATTT AAAATTAATT TTTTAACATC CGTTAAATGA 23376 ||||||| | ||||||| || |||||||||| |||| ||||| ||||||| | |||||||||| AAAAAAATTA GAAGTAACAA ATATAAATTT AAAACTAATT TTTTAACTTT CGTTAAATGA 296 AGGGTATATG TGAGCCATTT TGTAACGGCA GGGGTATATG TGAGCCGATT GTATAACGGT 23436 |||||||||| |||||||||| | |||||||| |||||||||| ||||||| || |||||||| | AGGGTATATG TGAGCCATTT TCTAACGGCA GGGGTATATG TGAGCCGTTT GTATAACGAT 356 AAGGGCATAT ATGAGCCACT TTTATAACGA GGGATATATT AGCTCCAAAT GA 23488 |||||||||| |||| ||||| ||||| |||| ||||||||| ||||| |||| || AAGGGCATAT ATGAACCACT TTTATTACGA GGGATATATC AGCTCTAAAT GA 408 hqPGS_C12HBa0093P12.1-3+_SGN-E243215+ (23083 23488) ******************************************************************************** EST sequence 11 -strand 701 n (File: SGN-E578389-) 1 TCAAATAATT ATAATAAATA AGTCAAAAAA ATAATTTATG TATTAAAAAA ATTTGAAATA 61 TACCTTGAAC TTTGAAAAAA GAATCATATA TGCCCCTAAA TATATTTTTT TTTAAAATTA 121 AAGTAAAATT ATAAATTTAA AAGTAATTTT TTCACTTTCG TTAAATGAAG GGTATATATG 181 AGCTCATTTT GTAACGGCAG AGGTATATGT GAACCATTTG TATAACGGTA AGGGTATATA 241 TGAGCCACTT TCATAACGAG GGGTATATCA GTTTCAAATG ACAAAGTTGA GGGGTATATC 301 ATACCCTTTT CCCATAATAT TATTCATTTT TGGGTTGACG GGTCAAACCT TGGGCTGCTT 361 AGGACTTGAT TAGACCGCTA TTTTATTGAC TCTTTAATTA ATGGGCAACT TTCACATATA 421 ACAAACAAAA AATTCATATT TGTATGCTAT AACAAAGTTT GCATAATTGC GCTCCATAGC 481 AAACATAAAA TTGTATAATT CGCTGACCTA AATTGTATAA TTCGCTGGCC TATTTCGCTG 541 CAATTGTATA ATTCGCTATC CTATTTAACT ACAATTGTAT AATTCGCTGC CTATTTCGCT 601 GCAATATTAT TATAAAATTT GCTTTGCATA TAATTGAACC GAATTAAAAT GTATGTATAT 661 TGCATAATTA TAAGTGTATA GCAATAAGAT ATATGTTTTT C Predicted gene structure (within gDNA segment 22598 to 27618): Exon 1 23208 23488 ( 281 n); cDNA 1 281 ( 281 n); score: 0.863 MATCH C12HBa0093P12.1-3+ SGN-E578389- 0.863 281 0.401 C PGS_C12HBa0093P12.1-3+_SGN-E578389- (23208 23488) Alignment (genomic DNA sequence = upper lines): TCAAATAATT ATAATAAAAG AAGTTAAAAA ATAATTTATG TATCAAAAAA AATTAAAATA 23267 |||||||||| |||||||| | ||||| |||||||||| ||| |||||| | || ||||| TCAAATAATT ATAATAAATA AGTCAAAAAA ATAATTTATG TATTAAAAAA ATTTGAAATA 60 TACCTTGAAC TTTGATAGAA GAATCATATA TACCCCTAAA TCATTTTTTT TAAAAAAATG 23327 |||||||||| ||||| | || |||||||||| | |||||||| | |||||| | |||| | TACCTTGAAC TTTGAAAAAA GAATCATATA TGCCCCTAAA TATATTTTTT TTTAAAATTA 120 AAGTAAAAAA TATAAATTTA AAATTAATTT TTTAACATCC GTTAAATGAA GGGTATATGT 23387 |||| |||| |||||||||| ||| |||||| ||| || | | |||||||||| |||||||| | AAGT-AAAAT TATAAATTTA AAAGTAATTT TTTCACTTTC GTTAAATGAA GGGTATATAT 179 GAGC-CATTT TGTAACGGCA GGGGTATATG TGAGCCGATT GTATAACGGT AAGGGCATAT 23446 |||| ||||| |||||||||| | |||||||| ||| || || |||||||||| ||||| |||| GAGCTCATTT TGTAACGGCA GAGGTATATG TGAACCATTT GTATAACGGT AAGGGTATAT 239 ATGAGCCACT TTTATAACGA GGGATATATT AGCTCCAAAT GA 23488 |||||||||| || ||||||| ||| ||||| || | ||||| || ATGAGCCACT TTCATAACGA GGGGTATATC AGTTTCAAAT GA 281 hqPGS_C12HBa0093P12.1-3+_SGN-E578389- (23208 23488) ******************************************************************************** EST sequence 10 -strand 799 n (File: SGN-E543825-) 1 AAACTCTCTT GTAGTATTAG GAGAGTTTGT GGGAGTTGAT TATTGGAATG CTTTAGCGGG 61 AAATAAACCT GAACCTGAGG TACGTACGTA CGTACCTTTT ATTATTGATG GCATTATAAG 121 TGCCATCAAA GAGGTGATCC AATTAGGATC AACTCGAATT TTGGTTCCAG GAGTTTTTCC 181 TTTTGGGTGT CTATCATCAT ATCTAACAAG ATTCGCTGAC ACGAATCCAA ATGCTTATGA 241 TCAATATGGT TGTTTGAAAT TTTATAATGA TTTCGCTACG TATCATAATA TCGAGCTAAA 301 GAAGGCTCTA GAAAATCTAC GATGTGAGTT CCCACGTGTT AAAATTGTGT ATGGGGATTA 361 CTATGGTGGT TTTAGGCTTG TTTTTCGATA CGCGTCTTGG TTGGGATTTA ATCCAAGTAC 421 ATTGGTATCG GCATGTTGTG GGAGCGGAGG ACGATACAAT GCCGGGGGAT GTAGTTCAGC 481 TAGTACCAAT GTATGTCCTA ACCCGTCCCA ATACGTTAAT TGGGACGGGC TTCATCTGAC 541 AGATGAAGCG TATCATCGTA TTTCTAACGT TGTTATCAAC AATATGCTTC CAAAATTTGG 601 GTGTTATGGA CTTAGAAATT CGAGTGCATT ATCTAGTTAT TGAATTTTTA AGGTTTTTGT 661 TTAATTTGTT TTGGTTATTT GGGTGTCTTT TTCATGATTT TCTTTGTACT TGAGTTCGTT 721 GATTTTGATG AAATGTGTTG TAGTTTGAAA TGAATATGAA TAATGAGAAA AGATAATTGT 781 TGTGTTGTAA AAAAAAAAA Predicted gene structure (within gDNA segment 25665 to 23576): Exon 1 25055 24267 ( 789 n); cDNA 1 789 ( 789 n); score: 0.997 PPA cDNA 790 799 MATCH C12HBa0093P12.1-3- SGN-E543825- 0.997 789 0.987 C PGS_C12HBa0093P12.1-3-_SGN-E543825- (25055 24267) Alignment (genomic DNA sequence = upper lines): AAACTCTCTT GTAGTATTAG GAGAGTTTGG TGGAGTTGAT TATTGGAATG CTTTAGCGGG 24996 |||||||||| |||||||||| ||||||||| ||||||||| |||||||||| |||||||||| AAACTCTCTT GTAGTATTAG GAGAGTTTGT GGGAGTTGAT TATTGGAATG CTTTAGCGGG 60 AAATAAACCT GAACCTGAGG TACGTACGTA CGTACCTTTT ATTATTGATG GCATTATAAG 24936 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AAATAAACCT GAACCTGAGG TACGTACGTA CGTACCTTTT ATTATTGATG GCATTATAAG 120 TGCCATCAAA GAGGTGATCC AATTAGGATC AACTCGAATT TTGGTTCCAG GAGTTTTTCC 24876 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCCATCAAA GAGGTGATCC AATTAGGATC AACTCGAATT TTGGTTCCAG GAGTTTTTCC 180 TTTTGGGTGT CTATCATCAT ATCTAACAAG ATTCGCTGAC ACGAATCCAA ATGCTTATGA 24816 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTGGGTGT CTATCATCAT ATCTAACAAG ATTCGCTGAC ACGAATCCAA ATGCTTATGA 240 TCAATATGGT TGTTTGAAAT TTTATAATGA TTTCGCTACG TATCATAATA TCGAGCTAAA 24756 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAATATGGT TGTTTGAAAT TTTATAATGA TTTCGCTACG TATCATAATA TCGAGCTAAA 300 GAAGGCTCTA GAAAATCTAC GATGTGAGTT CCCACGTGTT AAAATTGTGT ATGGGGATTA 24696 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAGGCTCTA GAAAATCTAC GATGTGAGTT CCCACGTGTT AAAATTGTGT ATGGGGATTA 360 CTATGGTGGT TTTAGGCTTG TTTTTCGATA CGCGTCTTGG TTGGGATTTA ATCCAAGTAC 24636 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTATGGTGGT TTTAGGCTTG TTTTTCGATA CGCGTCTTGG TTGGGATTTA ATCCAAGTAC 420 ATTGGTATCG GCATGTTGTG GGAGCGGAGG ACGATACAAT GCCGGGGGAT GTAGTTCAGC 24576 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATTGGTATCG GCATGTTGTG GGAGCGGAGG ACGATACAAT GCCGGGGGAT GTAGTTCAGC 480 TAGTACCAAT GTATGTCCTA ACCCGTCCCA ATACGTTAAT TGGGACGGGC TTCATCTGAC 24516 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAGTACCAAT GTATGTCCTA ACCCGTCCCA ATACGTTAAT TGGGACGGGC TTCATCTGAC 540 AGATGAAGCG TATCATCGTA TTTCTAACGT TGTTATCAAC AATATGCTTC CAAAATTTGG 24456 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGATGAAGCG TATCATCGTA TTTCTAACGT TGTTATCAAC AATATGCTTC CAAAATTTGG 600 GTGTTATGGA CTTAGAAATT CGAGTGCATT ATCTAGTTAT TGAATTTTTA AGGTTTTTGT 24396 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTGTTATGGA CTTAGAAATT CGAGTGCATT ATCTAGTTAT TGAATTTTTA AGGTTTTTGT 660 TTAATTTGTT TTGGTTATTT GGGTGTCTTT TTCATGATTT TCTTTGTACT TGAGTTCGTT 24336 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTAATTTGTT TTGGTTATTT GGGTGTCTTT TTCATGATTT TCTTTGTACT TGAGTTCGTT 720 GATTTTGATG AAATGTGTTG TAGTTTGAAA TGAATATGAA TAATGAGAAA AGATAATTGT 24276 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GATTTTGATG AAATGTGTTG TAGTTTGAAA TGAATATGAA TAATGAGAAA AGATAATTGT 780 TGTGTTGTA 24267 ||||||||| TGTGTTGTA 789 hqPGS_C12HBa0093P12.1-3-_SGN-E543825- (25055 24267) ******************************************************************************** EST sequence 19 +strand 727 n (File: SGN-E543826+) 1 TAATTATGGC TTCTCTTTTT CTATCGAAAT TTTCTTTCTT TCTTTTACTA ATAATAATAA 61 CTTTATTTTT TCACTCTTCT CATTGTGATG TCCTAACAAG ATGTCACATC ACATCACTTT 121 TTCAGTTCGG CGATTCTATC GCGGATGCTG GAAACGTGAT CCGCATACCT GGTGCCGTCA 181 TATCGGCCCA GGCATGGGGT CTACCTTATG GTGAAACCTT TTTTCATAAA CCTACTGGAC 241 GTTTTTCTGA CGGTCGTATT ATCGCTGACT ATATCGCCAC GGCTCTCAGT CTCCCATTCC 301 TCAATCCTTA CATGGACAAA TCAGGTGTTT CCTTTAGTCA AGGTGCTAAT TTCGCCGTTG 361 CTGGTGCAAC GGCGATGAAT AACTCTTTTT TGGAGGAGAG GGGCATTGGA CATGTCCCGT 421 TCAACGTTCC TCTCCCGAGT CAATTAGAGT GGTTTAAATC TCACCTCCAG TCAACCTATG 481 GGTCAAAATA TTCTACAACT CTACGAAACT CTCTTGTAGT ATTAGGAGAG TTTGGTGGAG 541 TTGATTATTG GAATGCTTTA GCGGGAAATA AACCTGAACC TGAGGTACGT ACGTACGTAC 601 CTTTTATTAT TGATGGCATT ATAAGTGCCA TCAAAGAGGT GATCCAATTA GGATCGACTC 661 GAATTTTGGT TCCAGGAGTT TTTCCTTTTG GGTGTCTATC ATCATATCTA ACAAGATTCG 721 CTGACAC Predicted gene structure (within gDNA segment 26160 to 24224): Exon 1 25560 24834 ( 727 n); cDNA 1 727 ( 727 n); score: 0.999 MATCH C12HBa0093P12.1-3- SGN-E543826+ 0.999 727 1.000 C PGS_C12HBa0093P12.1-3-_SGN-E543826+ (25560 24834) Alignment (genomic DNA sequence = upper lines): TAATTATGGC TTCTCTTTTT CTATCGAAAT TTTCTTTCTT TCTTTTACTA ATAATAATAA 25501 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAATTATGGC TTCTCTTTTT CTATCGAAAT TTTCTTTCTT TCTTTTACTA ATAATAATAA 60 CTTTATTTTT TCACTCTTCT CATTGTGATG TCCTAACAAG ATGTCACATC ACATCACTTT 25441 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTATTTTT TCACTCTTCT CATTGTGATG TCCTAACAAG ATGTCACATC ACATCACTTT 120 TTCAGTTCGG CGATTCTATC GCGGATGCTG GAAACGTGAT CCGCATACCT GGTGCCGTCA 25381 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAGTTCGG CGATTCTATC GCGGATGCTG GAAACGTGAT CCGCATACCT GGTGCCGTCA 180 TATCGGCCCA GGCATGGGGT CTACCTTATG GTGAAACCTT TTTTCATAAA CCTACTGGAC 25321 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TATCGGCCCA GGCATGGGGT CTACCTTATG GTGAAACCTT TTTTCATAAA CCTACTGGAC 240 GTTTTTCTGA CGGTCGTATT ATCGCTGACT ATATCGCCAC GGCTCTCAGT CTCCCATTCC 25261 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTTTTCTGA CGGTCGTATT ATCGCTGACT ATATCGCCAC GGCTCTCAGT CTCCCATTCC 300 TCAATCCTTA CATGGACAAA TCAGGTGTTT CCTTTAGTCA AGGTGCTAAT TTCGCCGTTG 25201 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAATCCTTA CATGGACAAA TCAGGTGTTT CCTTTAGTCA AGGTGCTAAT TTCGCCGTTG 360 CTGGTGCAAC GGCGATGAAT AACTCTTTTT TGGAGGAGAG GGGCATTGGA CATGTCCCGT 25141 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGGTGCAAC GGCGATGAAT AACTCTTTTT TGGAGGAGAG GGGCATTGGA CATGTCCCGT 420 TCAACGTTCC TCTCCCGAGT CAATTAGAGT GGTTTAAATC TCACCTCCAG TCAACCTATG 25081 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAACGTTCC TCTCCCGAGT CAATTAGAGT GGTTTAAATC TCACCTCCAG TCAACCTATG 480 GGTCAAAATA TTCTACAACT CTACGAAACT CTCTTGTAGT ATTAGGAGAG TTTGGTGGAG 25021 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GGTCAAAATA TTCTACAACT CTACGAAACT CTCTTGTAGT ATTAGGAGAG TTTGGTGGAG 540 TTGATTATTG GAATGCTTTA GCGGGAAATA AACCTGAACC TGAGGTACGT ACGTACGTAC 24961 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTGATTATTG GAATGCTTTA GCGGGAAATA AACCTGAACC TGAGGTACGT ACGTACGTAC 600 CTTTTATTAT TGATGGCATT ATAAGTGCCA TCAAAGAGGT GATCCAATTA GGATCAACTC 24901 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||| |||| CTTTTATTAT TGATGGCATT ATAAGTGCCA TCAAAGAGGT GATCCAATTA GGATCGACTC 660 GAATTTTGGT TCCAGGAGTT TTTCCTTTTG GGTGTCTATC ATCATATCTA ACAAGATTCG 24841 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAATTTTGGT TCCAGGAGTT TTTCCTTTTG GGTGTCTATC ATCATATCTA ACAAGATTCG 720 CTGACAC 24834 ||||||| CTGACAC 727 hqPGS_C12HBa0093P12.1-3-_SGN-E543826+ (25560 24834) ******************************************************************************** EST sequence 14 +strand 661 n (File: SGN-E305600+) 1 ACAAATAATT CATCTTCCAT CCAAAAAAAA ATCATTCATA ATTATGGCTT CTCTTTTTCT 61 ATCGAAATTT TCTTTCTTTC TTTTACTAAT AATAATAACT TTATTTTTTC ACTCTTCTCA 121 TTGTGATGTC CTAACAAGAT GTCACATCAC ATCACTTTTT CAGTTCGGCG ATTCTATCGC 181 GGATGCTGGA AACGTGATCC GCATACCTGG TGCCGTCATA TCGGCCCAGG CATGGGGTCT 241 ACCTTATGGT GAAACCTTTT TTCATAAACC TACTGGACGT TTTTCTGACG GTCGTATTAT 301 CGCTGACTAT ATCGCCACGG CTCTCAGTCT CCCATTCCTC AATCCTTACA TGGACAAATC 361 AGGTGTTTCC TTTAGTCAAG GTGCTAATTT CGCCGTTGCT GGTGCAACGG CGATGAATAA 421 CTCTTTTTTG GAGGAGAGGG GCATTGGACA TGTCCCGTTC AACGTTCCTC TCCCGAGTCA 481 ATTAGAGTGG TTTAAATCTC ACCTCCAGTC AACCTATGGG TCAAAATATT CTACAACTCT 541 ACGAAACTCT CTTGTAGTAT TAGGAGAGTT TGGTGGAGTT GATTATTGGA ATGCTTTAGC 601 GGGAAATAAA CCTGAACCTG AGGTACGTAC GTACGTACCT TTTATTATTG ATGGCATTAT 661 A Predicted gene structure (within gDNA segment 26378 to 24328): Exon 1 25577 24938 ( 640 n); cDNA 22 661 ( 640 n); score: 1.000 MATCH C12HBa0093P12.1-3- SGN-E305600+ 1.000 640 0.968 C PGS_C12HBa0093P12.1-3-_SGN-E305600+ (25577 24938) Alignment (genomic DNA sequence = upper lines): CAAAAAAAAA TCATTCATAA TTATGGCTTC TCTTTTTCTA TCGAAATTTT CTTTCTTTCT 25518 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAAAAAAAA TCATTCATAA TTATGGCTTC TCTTTTTCTA TCGAAATTTT CTTTCTTTCT 81 TTTACTAATA ATAATAACTT TATTTTTTCA CTCTTCTCAT TGTGATGTCC TAACAAGATG 25458 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTACTAATA ATAATAACTT TATTTTTTCA CTCTTCTCAT TGTGATGTCC TAACAAGATG 141 TCACATCACA TCACTTTTTC AGTTCGGCGA TTCTATCGCG GATGCTGGAA ACGTGATCCG 25398 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCACATCACA TCACTTTTTC AGTTCGGCGA TTCTATCGCG GATGCTGGAA ACGTGATCCG 201 CATACCTGGT GCCGTCATAT CGGCCCAGGC ATGGGGTCTA CCTTATGGTG AAACCTTTTT 25338 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATACCTGGT GCCGTCATAT CGGCCCAGGC ATGGGGTCTA CCTTATGGTG AAACCTTTTT 261 TCATAAACCT ACTGGACGTT TTTCTGACGG TCGTATTATC GCTGACTATA TCGCCACGGC 25278 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATAAACCT ACTGGACGTT TTTCTGACGG TCGTATTATC GCTGACTATA TCGCCACGGC 321 TCTCAGTCTC CCATTCCTCA ATCCTTACAT GGACAAATCA GGTGTTTCCT TTAGTCAAGG 25218 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCAGTCTC CCATTCCTCA ATCCTTACAT GGACAAATCA GGTGTTTCCT TTAGTCAAGG 381 TGCTAATTTC GCCGTTGCTG GTGCAACGGC GATGAATAAC TCTTTTTTGG AGGAGAGGGG 25158 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTAATTTC GCCGTTGCTG GTGCAACGGC GATGAATAAC TCTTTTTTGG AGGAGAGGGG 441 CATTGGACAT GTCCCGTTCA ACGTTCCTCT CCCGAGTCAA TTAGAGTGGT TTAAATCTCA 25098 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATTGGACAT GTCCCGTTCA ACGTTCCTCT CCCGAGTCAA TTAGAGTGGT TTAAATCTCA 501 CCTCCAGTCA ACCTATGGGT CAAAATATTC TACAACTCTA CGAAACTCTC TTGTAGTATT 25038 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTCCAGTCA ACCTATGGGT CAAAATATTC TACAACTCTA CGAAACTCTC TTGTAGTATT 561 AGGAGAGTTT GGTGGAGTTG ATTATTGGAA TGCTTTAGCG GGAAATAAAC CTGAACCTGA 24978 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGAGAGTTT GGTGGAGTTG ATTATTGGAA TGCTTTAGCG GGAAATAAAC CTGAACCTGA 621 GGTACGTACG TACGTACCTT TTATTATTGA TGGCATTATA 24938 |||||||||| |||||||||| |||||||||| |||||||||| GGTACGTACG TACGTACCTT TTATTATTGA TGGCATTATA 661 hqPGS_C12HBa0093P12.1-3-_SGN-E305600+ (25577 24938) ******************************************************************************** EST sequence 16 +strand 656 n (File: SGN-E373690+) 1 ACAAATAATT CATCTTCCAT CCAAAAAAAA ATCATTCATA ATTATGGCTT CTCTTTTTCT 61 ATCGAAATTT TCTTTCTTTC TTTTACTAAT AATAATAACT TTATTTTTTC ACTCTTCTCA 121 TTGTGATGTC CTAACAAGAT GTCACATCAC ATCACTTTTT CAGTTCGGCG ATTCTATCGC 181 GGATGCTGGA AACGTGATCC GCATACCTGG TGCCGTCATA TCGGCCCAGG CATGGGGTCT 241 ACCTTATGGT GAAACCTTTT TTCATAAACC TACTGGACGT TTTTCTGACG GTCGTATTAT 301 CGCTGACTAT ATCGCCACGG CTCTCAGTCT CCCATTCCTC AATCCTTACA TGGACAAATC 361 AGGTGTTTCC TTTAGTCAAG GTGCTAATTT CGCCGTTGCT GGTGCAACGG CGATGAATAA 421 CTCTTTTTTG GAGGAGAGGG GCATTGGACA TGTCCCGTTC AACGTTCCTC TCCCGAGTCA 481 ATTAGAGTGG TTTAAATCTC ACCTCCAGTC AACCTATGGG TCAAAATATT CTACAACTCT 541 ACGAAACTCT CTTGTAGTAT TAGGAGAGTT TGGTGGAGTT GATTATTGGA ATGCTTTAGC 601 GGGAAATAAA CCTGAACCTG AGGTACGTAC GTACGTACCT TTTATTATTG ATGGCA Predicted gene structure (within gDNA segment 26378 to 24333): Exon 1 25577 24943 ( 635 n); cDNA 22 656 ( 635 n); score: 1.000 MATCH C12HBa0093P12.1-3- SGN-E373690+ 1.000 635 0.968 C PGS_C12HBa0093P12.1-3-_SGN-E373690+ (25577 24943) Alignment (genomic DNA sequence = upper lines): CAAAAAAAAA TCATTCATAA TTATGGCTTC TCTTTTTCTA TCGAAATTTT CTTTCTTTCT 25518 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAAAAAAAAA TCATTCATAA TTATGGCTTC TCTTTTTCTA TCGAAATTTT CTTTCTTTCT 81 TTTACTAATA ATAATAACTT TATTTTTTCA CTCTTCTCAT TGTGATGTCC TAACAAGATG 25458 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTACTAATA ATAATAACTT TATTTTTTCA CTCTTCTCAT TGTGATGTCC TAACAAGATG 141 TCACATCACA TCACTTTTTC AGTTCGGCGA TTCTATCGCG GATGCTGGAA ACGTGATCCG 25398 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCACATCACA TCACTTTTTC AGTTCGGCGA TTCTATCGCG GATGCTGGAA ACGTGATCCG 201 CATACCTGGT GCCGTCATAT CGGCCCAGGC ATGGGGTCTA CCTTATGGTG AAACCTTTTT 25338 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATACCTGGT GCCGTCATAT CGGCCCAGGC ATGGGGTCTA CCTTATGGTG AAACCTTTTT 261 TCATAAACCT ACTGGACGTT TTTCTGACGG TCGTATTATC GCTGACTATA TCGCCACGGC 25278 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCATAAACCT ACTGGACGTT TTTCTGACGG TCGTATTATC GCTGACTATA TCGCCACGGC 321 TCTCAGTCTC CCATTCCTCA ATCCTTACAT GGACAAATCA GGTGTTTCCT TTAGTCAAGG 25218 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCTCAGTCTC CCATTCCTCA ATCCTTACAT GGACAAATCA GGTGTTTCCT TTAGTCAAGG 381 TGCTAATTTC GCCGTTGCTG GTGCAACGGC GATGAATAAC TCTTTTTTGG AGGAGAGGGG 25158 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTAATTTC GCCGTTGCTG GTGCAACGGC GATGAATAAC TCTTTTTTGG AGGAGAGGGG 441 CATTGGACAT GTCCCGTTCA ACGTTCCTCT CCCGAGTCAA TTAGAGTGGT TTAAATCTCA 25098 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CATTGGACAT GTCCCGTTCA ACGTTCCTCT CCCGAGTCAA TTAGAGTGGT TTAAATCTCA 501 CCTCCAGTCA ACCTATGGGT CAAAATATTC TACAACTCTA CGAAACTCTC TTGTAGTATT 25038 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CCTCCAGTCA ACCTATGGGT CAAAATATTC TACAACTCTA CGAAACTCTC TTGTAGTATT 561 AGGAGAGTTT GGTGGAGTTG ATTATTGGAA TGCTTTAGCG GGAAATAAAC CTGAACCTGA 24978 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AGGAGAGTTT GGTGGAGTTG ATTATTGGAA TGCTTTAGCG GGAAATAAAC CTGAACCTGA 621 GGTACGTACG TACGTACCTT TTATTATTGA TGGCA 24943 |||||||||| |||||||||| |||||||||| ||||| GGTACGTACG TACGTACCTT TTATTATTGA TGGCA 656 hqPGS_C12HBa0093P12.1-3-_SGN-E373690+ (25577 24943) ******************************************************************************** EST sequence 15 +strand 588 n (File: SGN-E305486+) 1 GAAATTTTGT TTCTTTCTTT TACTAATAAT AATAACTTTA TTTTTTCACT CTTCTCATTG 61 TGATGTCCTA ACAAGATGTC ACATCACATC ACTTTTTCAG TTCGGCGATT CTATCGCGGA 121 TGCTGGAAAC GTGATCCGCA TACCTGGTGC CGTCATATCG GCCCAGGCAT GGGGTCTACC 181 TTATGGTGAA ACCTTTTTTC ATAAACCTAC TGGACGTTTT TCTGACGGTC GGATGATCGC 241 TGACTATATC GCCACGGCTC TCAGTCTCCC ATTCCTCAAT CCTTACATGG ACAAATCAAG 301 TGTTTCCTTT AGTCAAGGTG CTAATTTCGC CGTTGCTGGT GCAACGGCGA TGAATAACTC 361 TTTTTTGGAG GAGAGGGGCA TTGGACATGT CCCGTTCAAC GTTCCTCTCC CGAGTCAATT 421 AGAGCGGTTT AAATCTCACC TCCAGACAAC CTATGGGTCA AAATATTCTA CAACTGTACG 481 AAACTCTCTT GTAGTATTAG GAGAGTTTGG TGGAATTGAT TATTGGAATG CTTTANCGGG 541 AAATAAACCT GAACCTGAGG TACGTACGTA CGTACCTTTT ATTATTGA Predicted gene structure (within gDNA segment 26216 to 24338): Exon 1 25535 24948 ( 588 n); cDNA 1 588 ( 588 n); score: 0.985 MATCH C12HBa0093P12.1-3- SGN-E305486+ 0.985 588 1.000 C PGS_C12HBa0093P12.1-3-_SGN-E305486+ (25535 24948) Alignment (genomic DNA sequence = upper lines): GAAATTTTCT TTCTTTCTTT TACTAATAAT AATAACTTTA TTTTTTCACT CTTCTCATTG 25476 |||||||| | |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GAAATTTTGT TTCTTTCTTT TACTAATAAT AATAACTTTA TTTTTTCACT CTTCTCATTG 60 TGATGTCCTA ACAAGATGTC ACATCACATC ACTTTTTCAG TTCGGCGATT CTATCGCGGA 25416 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGATGTCCTA ACAAGATGTC ACATCACATC ACTTTTTCAG TTCGGCGATT CTATCGCGGA 120 TGCTGGAAAC GTGATCCGCA TACCTGGTGC CGTCATATCG GCCCAGGCAT GGGGTCTACC 25356 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGCTGGAAAC GTGATCCGCA TACCTGGTGC CGTCATATCG GCCCAGGCAT GGGGTCTACC 180 TTATGGTGAA ACCTTTTTTC ATAAACCTAC TGGACGTTTT TCTGACGGTC GTATTATCGC 25296 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| | || ||||| TTATGGTGAA ACCTTTTTTC ATAAACCTAC TGGACGTTTT TCTGACGGTC GGATGATCGC 240 TGACTATATC GCCACGGCTC TCAGTCTCCC ATTCCTCAAT CCTTACATGG ACAAATCAGG 25236 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||| | TGACTATATC GCCACGGCTC TCAGTCTCCC ATTCCTCAAT CCTTACATGG ACAAATCAAG 300 TGTTTCCTTT AGTCAAGGTG CTAATTTCGC CGTTGCTGGT GCAACGGCGA TGAATAACTC 25176 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGTTTCCTTT AGTCAAGGTG CTAATTTCGC CGTTGCTGGT GCAACGGCGA TGAATAACTC 360 TTTTTTGGAG GAGAGGGGCA TTGGACATGT CCCGTTCAAC GTTCCTCTCC CGAGTCAATT 25116 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTTGGAG GAGAGGGGCA TTGGACATGT CCCGTTCAAC GTTCCTCTCC CGAGTCAATT 420 AGAGTGGTTT AAATCTCACC TCCAGTCAAC CTATGGGTCA AAATATTCTA CAACTCTACG 25056 |||| ||||| |||||||||| ||||| |||| |||||||||| |||||||||| ||||| |||| AGAGCGGTTT AAATCTCACC TCCAGACAAC CTATGGGTCA AAATATTCTA CAACTGTACG 480 AAACTCTCTT GTAGTATTAG GAGAGTTTGG TGGAGTTGAT TATTGGAATG CTTTAGCGGG 24996 |||||||||| |||||||||| |||||||||| |||| ||||| |||||||||| ||||| |||| AAACTCTCTT GTAGTATTAG GAGAGTTTGG TGGAATTGAT TATTGGAATG CTTTANCGGG 540 AAATAAACCT GAACCTGAGG TACGTACGTA CGTACCTTTT ATTATTGA 24948 |||||||||| |||||||||| |||||||||| |||||||||| |||||||| AAATAAACCT GAACCTGAGG TACGTACGTA CGTACCTTTT ATTATTGA 588 hqPGS_C12HBa0093P12.1-3-_SGN-E305486+ (25535 24948) ******************************************************************************** EST sequence 12 +strand 529 n (File: SGN-E304360+) 1 TAATTATGGC TTCTCTTTTT CTATCGAAAT TTTCTTTCTT TCTTTTACTA ATAATAATAA 61 CTTTATTTTT TCACTCTTCT CATTGTGATG TCCTAACAAG ATGTCACATC ACATCACTTT 121 TTCAGTTCGG CGATTCTATC GCGGATGCTG GAAACGTGAT CCGCATACCT GGTGCCGTCA 181 TATCGGCCCA GGCATGGGGT CTACCTTATG GCGAAACCTT TTTTCATAAA CCTACTGGAC 241 GTTTTTCTGA CGGTCGTATT ATCGCTGACT ATATCGCCAC GGCTCTCAGT CTCCCATTCC 301 TCAATCCTTA CATGGACAAA TCAGGTGTTT CCTTTAGTCA AGGTGCTAAT TTCGCCGCTG 361 CTGGTGCAAC GGCGATGAAT AACTCTTTTT TGGAGGAGAG GGGCATTGGA CATGTCCCGT 421 TCAACGTTCC TCTCCCGAGT CAATTAGAGT GGTTTAAATC TCACCTGCAG TCAACCTATG 481 GGTCAAAATA TTCTACAACT CTACGAAACT CTCTTGTAGT ATTAGGAGA Predicted gene structure (within gDNA segment 26160 to 24422): Exon 1 25560 25032 ( 529 n); cDNA 1 529 ( 529 n); score: 0.994 MATCH C12HBa0093P12.1-3- SGN-E304360+ 0.994 529 1.000 C PGS_C12HBa0093P12.1-3-_SGN-E304360+ (25560 25032) Alignment (genomic DNA sequence = upper lines): TAATTATGGC TTCTCTTTTT CTATCGAAAT TTTCTTTCTT TCTTTTACTA ATAATAATAA 25501 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TAATTATGGC TTCTCTTTTT CTATCGAAAT TTTCTTTCTT TCTTTTACTA ATAATAATAA 60 CTTTATTTTT TCACTCTTCT CATTGTGATG TCCTAACAAG ATGTCACATC ACATCACTTT 25441 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTTTATTTTT TCACTCTTCT CATTGTGATG TCCTAACAAG ATGTCACATC ACATCACTTT 120 TTCAGTTCGG CGATTCTATC GCGGATGCTG GAAACGTGAT CCGCATACCT GGTGCCGTCA 25381 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTCAGTTCGG CGATTCTATC GCGGATGCTG GAAACGTGAT CCGCATACCT GGTGCCGTCA 180 TATCGGCCCA GGCATGGGGT CTACCTTATG GTGAAACCTT TTTTCATAAA CCTACTGGAC 25321 |||||||||| |||||||||| |||||||||| | |||||||| |||||||||| |||||||||| TATCGGCCCA GGCATGGGGT CTACCTTATG GCGAAACCTT TTTTCATAAA CCTACTGGAC 240 GTTTTTCTGA CGGTCGTATT ATCGCTGACT ATATCGCCAC GGCTCTCAGT CTCCCATTCC 25261 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| GTTTTTCTGA CGGTCGTATT ATCGCTGACT ATATCGCCAC GGCTCTCAGT CTCCCATTCC 300 TCAATCCTTA CATGGACAAA TCAGGTGTTT CCTTTAGTCA AGGTGCTAAT TTCGCCGTTG 25201 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ||||||| || TCAATCCTTA CATGGACAAA TCAGGTGTTT CCTTTAGTCA AGGTGCTAAT TTCGCCGCTG 360 CTGGTGCAAC GGCGATGAAT AACTCTTTTT TGGAGGAGAG GGGCATTGGA CATGTCCCGT 25141 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CTGGTGCAAC GGCGATGAAT AACTCTTTTT TGGAGGAGAG GGGCATTGGA CATGTCCCGT 420 TCAACGTTCC TCTCCCGAGT CAATTAGAGT GGTTTAAATC TCACCTCCAG TCAACCTATG 25081 |||||||||| |||||||||| |||||||||| |||||||||| |||||| ||| |||||||||| TCAACGTTCC TCTCCCGAGT CAATTAGAGT GGTTTAAATC TCACCTGCAG TCAACCTATG 480 GGTCAAAATA TTCTACAACT CTACGAAACT CTCTTGTAGT ATTAGGAGA 25032 |||||||||| |||||||||| |||||||||| |||||||||| ||||||||| GGTCAAAATA TTCTACAACT CTACGAAACT CTCTTGTAGT ATTAGGAGA 529 hqPGS_C12HBa0093P12.1-3-_SGN-E304360+ (25560 25032) ******************************************************************************** EST sequence 13 +strand 504 n (File: SGN-E306291+) 1 AATTATGGCT TCTCTTTTTC TATCGAAATT TTCTTTCTTT CTTTTACTAA TAATAATAAC 61 TTTATTTTTT CACTCTTCTC ATTGTGATGT CCTAACAAGA TGTCACATCA CATCACTTTT 121 TCAGTTCGGC GATTCTATCG CGGATGCTGG AAACGTGATC CGCATACCTG GTGCCGTCAT 181 ATCGGCCCAG GCATGGGGTC TACCTTATGG TGAAACCTTT TTTCATAAAC CTACTGGACG 241 TTTTTCTGAC GGTCGTATTA TCGCTGACTA TATCGCCACG GCTCTCAGTC TCCCATTCCT 301 CAATCCTTAC ATGGACAAAT CAGGTGTTTC CTTTAGTCAA GGTGCTAATT TCGCCGTTGC 361 TGGTGCAACG GCGATGAATA ACTCTTTTTT GGAGGAGAGG GGCATTGGAC ATGTCCCGTT 421 CAACGTTCCT CTCCCGAGTC AATTAGAGTG GTTTAAATCT CACCTCCAGT CAACCTATGG 481 GTCAAAATAT TCTACAACTC TACG Predicted gene structure (within gDNA segment 26159 to 24446): Exon 1 25559 25056 ( 504 n); cDNA 1 504 ( 504 n); score: 1.000 MATCH C12HBa0093P12.1-3- SGN-E306291+ 1.000 504 1.000 C PGS_C12HBa0093P12.1-3-_SGN-E306291+ (25559 25056) Alignment (genomic DNA sequence = upper lines): AATTATGGCT TCTCTTTTTC TATCGAAATT TTCTTTCTTT CTTTTACTAA TAATAATAAC 25500 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| AATTATGGCT TCTCTTTTTC TATCGAAATT TTCTTTCTTT CTTTTACTAA TAATAATAAC 60 TTTATTTTTT CACTCTTCTC ATTGTGATGT CCTAACAAGA TGTCACATCA CATCACTTTT 25440 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTATTTTTT CACTCTTCTC ATTGTGATGT CCTAACAAGA TGTCACATCA CATCACTTTT 120 TCAGTTCGGC GATTCTATCG CGGATGCTGG AAACGTGATC CGCATACCTG GTGCCGTCAT 25380 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TCAGTTCGGC GATTCTATCG CGGATGCTGG AAACGTGATC CGCATACCTG GTGCCGTCAT 180 ATCGGCCCAG GCATGGGGTC TACCTTATGG TGAAACCTTT TTTCATAAAC CTACTGGACG 25320 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| ATCGGCCCAG GCATGGGGTC TACCTTATGG TGAAACCTTT TTTCATAAAC CTACTGGACG 240 TTTTTCTGAC GGTCGTATTA TCGCTGACTA TATCGCCACG GCTCTCAGTC TCCCATTCCT 25260 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TTTTTCTGAC GGTCGTATTA TCGCTGACTA TATCGCCACG GCTCTCAGTC TCCCATTCCT 300 CAATCCTTAC ATGGACAAAT CAGGTGTTTC CTTTAGTCAA GGTGCTAATT TCGCCGTTGC 25200 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAATCCTTAC ATGGACAAAT CAGGTGTTTC CTTTAGTCAA GGTGCTAATT TCGCCGTTGC 360 TGGTGCAACG GCGATGAATA ACTCTTTTTT GGAGGAGAGG GGCATTGGAC ATGTCCCGTT 25140 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| TGGTGCAACG GCGATGAATA ACTCTTTTTT GGAGGAGAGG GGCATTGGAC ATGTCCCGTT 420 CAACGTTCCT CTCCCGAGTC AATTAGAGTG GTTTAAATCT CACCTCCAGT CAACCTATGG 25080 |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| |||||||||| CAACGTTCCT CTCCCGAGTC AATTAGAGTG GTTTAAATCT CACCTCCAGT CAACCTATGG 480 GTCAAAATAT TCTACAACTC TACG 25056 |||||||||| |||||||||| |||| GTCAAAATAT TCTACAACTC TACG 504 hqPGS_C12HBa0093P12.1-3-_SGN-E306291+ (25559 25056) Total number of EST alignments reported: 24 ________________________________________________________________________________ Predicted gene locations (6) in segment 1 to 27618: PGL 1 (+ strand): 6941 10408 AGS-1 (6941 7115,7599 7651,7869 7983,8077 8328,8995 10408) SCR (e 1.000 d 0.999 a 0.992,e 1.000 d 0.992 a 0.977,e 1.000 d 0.943 a 0.934,e 0.996 d 0.980 a 0.973,e 0.858) Exon 1 6941 7115 ( 175 n); score: 1.000 Intron 1 7116 7598 ( 483 n); Pd: 0.999 Pa: 0.992 Exon 2 7599 7651 ( 53 n); score: 1.000 Intron 2 7652 7868 ( 217 n); Pd: 0.992 Pa: 0.977 Exon 3 7869 7983 ( 115 n); score: 1.000 Intron 3 7984 8076 ( 93 n); Pd: 0.943 Pa: 0.934 Exon 4 8077 8328 ( 252 n); score: 0.996 Intron 4 8329 8994 ( 666 n); Pd: 0.980 Pa: 0.973 Exon 5 8995 10408 (1414 n); score: 0.858 PGS (6941 7115,7599 7651,7869 7983,8077 8328,8995 9040) SGN-E253402+ PGS (8998 9672) SGN-E332161+ PGS (9192 9786) SGN-E309637+ PGS (9484 10082) SGN-E309611+ PGS (9747 10408) SGN-E249331- 3-phase translation of AGS-1 (+strand): . . . . . . 6941 GCACCAATCGTGCTCATCTGGTTGATATCAATTATGATAATTGGGTTGTACAACACTATC A P I V L I W L I S I M I I G L Y N T I H Q S C S S G - Y Q L - - L G C T T L S T N R A H L V D I N Y D N W V V Q H Y . . . . . . 7001 ATTTGGAACCCCAAAATTGTGTCTGCTTTTTCGCCCTATTATATCATCAAGTTTTTTAGG I W N P K I V S A F S P Y Y I I K F F R F G T P K L C L L F R P I I S S S F L G H L E P Q N C V C F F A L L Y H Q V F - . . . . . . : 7061 GATACAGGAAAAGATGGTTGGATTTCTCTTGGAGGTATTCTCCTCTCAGTTGCAG : GTACT D T G K D G W I S L G G I L L S V A : G T I Q E K M V G F L L E V F S S Q L Q : V L G Y R K R W L D F S W R Y S P L S C R : Y . . . . . : . 7604 GAAGCTATGTATGCAGATCTTGGTCATTTCTCTGCCTTCTCCATGAGG : ATTACATTTGCA E A M Y A D L G H F S A F S M R : I T F A K L C M Q I L V I S L P S P - G : L H L H - S Y V C R S W S F L C L L H E : D Y I C . . . . . . 7881 TTTGTGGTGTATCCGTGCTTGGTGATACAGTACATGGGTCAAGCTGCTTTTCTGTCAAAA F V V Y P C L V I Q Y M G Q A A F L S K L W C I R A W - Y S T W V K L L F C Q K I C G V S V L G D T V H G S S C F S V K . . . . . : . 7941 AATCTAGATTCCATTCCAAATAGCTTCTATAGCTCAATACCTG : ATGGTGTATACTGGCCT N L D S I P N S F Y S S I P : D G V Y W P I - I P F Q I A S I A Q Y L : M V Y T G L K S R F H S K - L L - L N T - : W C I L A . . . . . . 8094 GTTTTTGTTATTGCAACCCTTGCAGCCATTGTAGGCAGCCAATCTATCATCACAGCCACA V F V I A T L A A I V G S Q S I I T A T F L L L Q P L Q P L - A A N L S S Q P H C F C Y C N P C S H C R Q P I Y H H S H . . . . . . 8154 TTCTCAATCGTCAAGCAATGTAATTCACTAGGTTGCTTCCCGCGGGTCAAGATTGTCCAC F S I V K Q C N S L G C F P R V K I V H S Q S S S N V I H - V A S R G S R L S T I L N R Q A M - F T R L L P A G Q D C P . . . . . . 8214 ACCTCAAAGCATAAAGGGCAGATCTATGTACCAGAAATAAATTGGATCCTGATGATTCTC T S K H K G Q I Y V P E I N W I L M I L P Q S I K G R S M Y Q K - I G S - - F S H L K A - R A D L C T R N K L D P D D S . . . . . . : 8274 ACTCTTGCTGTGGCTATCGGGTTCCAAGATACAACTTTGATTGGAAATGCATACG : GGCTA T L A V A I G F Q D T T L I G N A Y : G L L L L W L S G S K I Q L - L E M H T : G - H S C C G Y R V P R Y N F D W K C I R : A . . . . . . 9000 GCTTGCATGACAGTTATGTTTATCACAACATTCCTCATGACACTTGTTATAATCTTTGTG A C M T V M F I T T F L M T L V I I F V L A - Q L C L S Q H S S - H L L - S L C S L H D S Y V Y H N I P H D T C Y N L C . . . . . . 9060 TGGCAAAGAAGTTTAGTATTTGCTGCTGCTTTTCTCCTTTTCTTCTGGTTCATCGAAGGT W Q R S L V F A A A F L L F F W F I E G G K E V - Y L L L L F S F S S G S S K V V A K K F S I C C C F S P F L L V H R R . . . . . . 9120 CTCTACCTATCTTCCGCAGCCATTAAGGCTCCACAGGGAGGATGGGTATCCCTTTTGCTC L Y L S S A A I K A P Q G G W V S L L L S T Y L P Q P L R L H R E D G Y P F C S S L P I F R S H - G S T G R M G I P F A . . . . . . 9180 TCTTTTATCCTCTTAGCCATCATGCTTGTGTGGCACTATGGAACTTGCAAGAAGTACAAA S F I L L A I M L V W H Y G T C K K Y K L L S S - P S C L C G T M E L A R S T N L F Y P L S H H A C V A L W N L Q E V Q . . . . . . 9240 TATGACCTGCACAACAAAGTTCCATTGAAATGGATCCTTGGCTTGGGTCCAAGCCTTGGT Y D L H N K V P L K W I L G L G P S L G M T C T T K F H - N G S L A W V Q A L V I - P A Q Q S S I E M D P W L G S K P W . . . . . . 9300 ATTGTCCGCGTCCCAGGGATAGGGCTAATATACTCTGAACTGGTAACAGGAGTTCCACCT I V R V P G I G L I Y S E L V T G V P P L S A S Q G - G - Y T L N W - Q E F H L Y C P R P R D R A N I L - T G N R S S T . . . . . . 9360 ATCTTCTCTCACTTTGTCACAAATCTCCCTGCATTTCATAATGTAATGGTGTTTGTATGC I F S H F V T N L P A F H N V M V F V C S S L T L S Q I S L H F I M - W C L Y A Y L L S L C H K S P C I S - C N G V C M . . . . . . 9420 GTCAAATCTGTTCCTGTACCTCATGTCTCATCCGATGAGCGCTTCCTCATTGGTCGTGTT V K S V P V P H V S S D E R F L I G R V S N L F L Y L M S H P M S A S S L V V L R Q I C S C T S C L I R - A L P H W S C . . . . . . 9480 GGCCCAAGATCATATCGCATGTATCGTTGCATTGTTCGATATGGTTACAAGGACGCACAG G P R S Y R M Y R C I V R Y G Y K D A Q A Q D H I A C I V A L F D M V T R T H S W P K I I S H V S L H C S I W L Q G R T . . . . . . 9540 CAAGGTACTGGGAACTTTGAGGACCTTCTCATCCAAAGTCTAGCAGAGTTCATCCAAATG Q G T G N F E D L L I Q S L A E F I Q M K V L G T L R T F S S K V - Q S S S K W A R Y W E L - G P S H P K S S R V H P N . . . . . . 9600 GAAGCTGTGGAACCACAATTATCAAGCCCCGATAGTTCATCACTTGATGGTAGGATGGCA E A V E P Q L S S P D S S S L D G R M A K L W N H N Y Q A P I V H H L M V G W Q G S C G T T I I K P R - F I T - W - D G . . . . . . 9660 GTTATAAGCACAAATCTACAGTCACACTCACCATTTATCATAGATGATGATGATTTTGAA V I S T N L Q S H S P F I I D D D D F E L - A Q I Y S H T H H L S - M M M I L K S Y K H K S T V T L T I Y H R - - - F - . . . . . . 9720 ACATGTTCCACCATTCAAAGCAGCAAGTCACTGACACTTCAAAGTGTAAGATCTTTTTAT T C S T I Q S S K S L T L Q S V R S F Y H V P P F K A A S H - H F K V - D L F M N M F H H S K Q Q V T D T S K C K I F L . . . . . . 9780 GATGATGGGAACCATGAAAACAGAAAACGACGAATCAGGTTCAACTTGCCAGAGAACTCT D D G N H E N R K R R I R F N L P E N S M M G T M K T E N D E S G S T C Q R T L - - W E P - K Q K T T N Q V Q L A R E L . . . . . . 9840 GGCATGGATCCTGAAGTTAGGGATGAGCTTATAGATTTGGTTCAGGCAAAGGAGTCAGGG G M D P E V R D E L I D L V Q A K E S G A W I L K L G M S L - I W F R Q R S Q G W H G S - S - G - A Y R F G S G K G V R . . . . . . 9900 GTTGCATATATAATGGGACACTCATATGTCAAGGCACGTAGATTGTCCTCTTGCTGGAAG V A Y I M G H S Y V K A R R L S S C W K L H I - W D T H M S R H V D C P L A G R G C I Y N G T L I C Q G T - I V L L L E . . . . . . 9960 AAATTTGTCATTGACGTTGCATATTCATTTCTGCGTAAGAACTGCAGAGCTTCCGCTGTT K F V I D V A Y S F L R K N C R A S A V N L S L T L H I H F C V R T A E L P L L E I C H - R C I F I S A - E L Q S F R C . . . . . . 10020 GCACTTAACATTCCTCACATTAGTCTTATTGAAGTTGGCATGATATACTATGTCTAGAGA A L N I P H I S L I E V G M I Y Y V - R H L T F L T L V L L K L A - Y T M S R E C T - H S S H - S Y - S W H D I L C L E . . . . . . 10080 GAGGCTTGGAGCCAAGAGAACATTGAACGCCTTCATCGGAGTTACTTGCAGAATCTTTTC E A W S Q E N I E R L H R S Y L Q N L F R L G A K R T L N A F I G V T C R I F S R G L E P R E H - T P S S E L L A E S F . . . . . . 10140 ACAGGTAGATGAACTTTTTCTTGAATATTTTTGCCCCAAATACCAAGTCTTGCTCATCAC T G R - T F S - I F L P Q I P S L A H H Q V D E L F L E Y F C P K Y Q V L L I T H R - M N F F L N I F A P N T K S C S S . . . . . . 10200 TAATCTTTGTATTGTTAGTATATATATATTTGTTATATCATCTCTTTCACATATGACCTG - S L Y C - Y I Y I C Y I I S F T Y D L N L C I V S I Y I F V I S S L S H M T C L I F V L L V Y I Y L L Y H L F H I - P . . . . . . 10260 TATTTATTTGTGTATTTTATAGTTAGATAGAGACAGTAGTTATAATTTTAATAGGTGATT Y L F V Y F I V R - R Q - L - F - - V I I Y L C I L - L D R D S S Y N F N R - F V F I C V F Y S - I E T V V I I L I G D . . . . . . 10320 TAAGCTATGATTGAAATAGCTATATAGGCTTTTGTAATCTAACATTTTGATTTCTTTTTA - A M I E I A I - A F V I - H F D F F L K L - L K - L Y R L L - S N I L I S F - L S Y D - N S Y I G F C N L T F - F L F . . . 10380 AAAAAATTGTAAACTATTTATTAACATTA K K L - T I Y - H K N C K L F I N I K K I V N Y L L T L Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-3+_PGL-1_AGS-1_PPS_1 (6941 7115,7599 7651,7869 7983,8077 8328,8995 10076) (frame '1'; 1674 bp, 558 residues) 1 APIVLIWLIS IMIIGLYNTI IWNPKIVSAF SPYYIIKFFR DTGKDGWISL GGILLSVAGT 61 EAMYADLGHF SAFSMRITFA FVVYPCLVIQ YMGQAAFLSK NLDSIPNSFY SSIPDGVYWP 121 VFVIATLAAI VGSQSIITAT FSIVKQCNSL GCFPRVKIVH TSKHKGQIYV PEINWILMIL 181 TLAVAIGFQD TTLIGNAYGL ACMTVMFITT FLMTLVIIFV WQRSLVFAAA FLLFFWFIEG 241 LYLSSAAIKA PQGGWVSLLL SFILLAIMLV WHYGTCKKYK YDLHNKVPLK WILGLGPSLG 301 IVRVPGIGLI YSELVTGVPP IFSHFVTNLP AFHNVMVFVC VKSVPVPHVS SDERFLIGRV 361 GPRSYRMYRC IVRYGYKDAQ QGTGNFEDLL IQSLAEFIQM EAVEPQLSSP DSSSLDGRMA 421 VISTNLQSHS PFIIDDDDFE TCSTIQSSKS LTLQSVRSFY DDGNHENRKR RIRFNLPENS 481 GMDPEVRDEL IDLVQAKESG VAYIMGHSYV KARRLSSCWK KFVIDVAYSF LRKNCRASAV 541 ALNIPHISLI EVGMIYYV- >C12HBa0093P12.1-3+_PGL-1_AGS-1_PPS_2 (10063 10281) (frame '2'; 216 bp, 72 residues) 1 YTMSRERLGA KRTLNAFIGV TCRIFSQVDE LFLEYFCPKY QVLLITNLCI VSIYIFVISS 61 LSHMTCIYLC IL- PGL 2 (- strand): 15385 11174 AGS-1 (15385 15178,15098 14977,14257 14162,13885 13790,12923 12845,12149 12103,11807 11748,11534 11174) SCR (e 1.000 d 0.952 a 0.999,e 1.000 d 1.000 a 0.972,e 1.000 d 0.991 a 0.987,e 1.000 d 0.993 a 0.995,e 1.000 d 0.993 a 0.928,e 1.000 d 0.988 a 0.398,e 1.000 d 0.998 a 0.999,e 1.000) Exon 1 15385 15178 ( 208 n); score: 1.000 Intron 1 15177 15099 ( 79 n); Pd: 0.952 Pa: 0.999 Exon 2 15098 14977 ( 122 n); score: 1.000 Intron 2 14976 14258 ( 719 n); Pd: 1.000 Pa: 0.972 Exon 3 14257 14162 ( 96 n); score: 1.000 Intron 3 14161 13886 ( 276 n); Pd: 0.991 Pa: 0.987 Exon 4 13885 13790 ( 96 n); score: 1.000 Intron 4 13789 12924 ( 866 n); Pd: 0.993 Pa: 0.995 Exon 5 12923 12845 ( 79 n); score: 1.000 Intron 5 12844 12150 ( 695 n); Pd: 0.993 Pa: 0.928 Exon 6 12149 12103 ( 47 n); score: 1.000 Intron 6 12102 11808 ( 295 n); Pd: 0.988 Pa: 0.398 Exon 7 11807 11748 ( 60 n); score: 1.000 Intron 7 11747 11535 ( 213 n); Pd: 0.998 Pa: 0.999 Exon 8 11534 11174 ( 361 n); score: 1.000 PGS (11456 11174) SGN-E226588- PGS (12873 12845,12149 12103,11807 11748,11534 11210) SGN-E356653+ PGS (14247 14162,13885 13790,12923 12845,12149 12103,11807 11748,11534 11487) SGN-E540633- PGS (15209 15178,15098 14977,14257 14162,13885 13790,12923 12845,12149 12103,11807 11748,11534 11497) SGN-E540635+ PGS (15354 15178,15098 14977,14257 14162,13885 13790,12923 12850) SGN-E288074+ PGS (15385 15178,15098 14977,14257 14162) SGN-E270528+ PGS (15351 15178,15098 14977,14257 14185) SGN-E289600+ 3-phase translation of AGS-1 (-strand): . . . . . . 15385 TGTTACCAAAAACTCAACTGGAAGAAGAAAGTCAAATTACAGTCTGAGAAGATAAAAATG C Y Q K L N W K K K V K L Q S E K I K M V T K N S T G R R K S N Y S L R R - K W L P K T Q L E E E S Q I T V - E D K N . . . . . . 15325 GCGGCTTCAGCTTTCGGACAGTGTAGTCTTCTTCCTCGTACAGTATCTTTGAATCCTCAG A A S A F G Q C S L L P R T V S L N P Q R L Q L S D S V V F F L V Q Y L - I L S G G F S F R T V - S S S S Y S I F E S S . . . . . . 15265 CAGTCTCATCGTCAGCTCTGCAGTTTGTCTTTCCATAGACAAACTGTAAATTCTTCACTT Q S H R Q L C S L S F H R Q T V N S S L S L I V S S A V C L S I D K L - I L H F A V S S S A L Q F V F P - T N C K F F T . . . : . . . 15205 CCTGCACTGTCATTCACTCAGTCTATAG : GTTTTGGGTCTGCAATTGAGAGACATTGTGTG P A L S F T Q S I : G F G S A I E R H C V L H C H S L S L - : V L G L Q L R D I V W S C T V I H S V Y R : F W V C N - E T L C . . . . . . 15066 GATCGAAACGGGTCGGATTTGTTTAAAACGGATGCTGTTCGTCAGTTGAATGGTTCAGTT D R N G S D L F K T D A V R Q L N G S V I E T G R I C L K R M L F V S - M V Q L G S K R V G F V - N G C C S S V E W F S . . . : . . . 15006 ATCTCTGCTAAGGGGCATCGGTTTGCTATT : GTGGTTGCACGTTTTAATGATCTGATCACC I S A K G H R F A I : V V A R F N D L I T S L L R G I G L L L : W L H V L M I - S P Y L C - G A S V C Y : C G C T F - - S D H . . . . . . 14227 AAGAAGCTTTTGGAGGGAGCTTTGGAGACTTTCAAGAATTACTCGGTTAGAGAGGAAGAT K K L L E G A L E T F K N Y S V R E E D R S F W R E L W R L S R I T R L E R K I Q E A F G G S F G D F Q E L L G - R G R . : . . . . . 14167 ATTGAT : GTTGTGTGGGTTCCTGGTTGTTTTGAAATCGGCGTGACTGCACAGCTTCTTGGA I D : V V W V P G C F E I G V T A Q L L G L M : L C G F L V V L K S A - L H S F L E Y - : C C V G S W L F - N R R D C T A S W . . . . . : . 13831 AAGTCACAGAAATATCACGCAATACTCTGCATTGGGGCTGTG : ATCAGAGGTGATACATCT K S Q K Y H A I L C I G A V : I R G D T S S H R N I T Q Y S A L G L - : S E V I H L K V T E I S R N T L H W G C : D Q R - Y I . . . . . . 12905 CACTACGATGCAGTCGTTAATGCTGCCACATCTGGAGTACTTTCAGCAGGTCTAAATTCA H Y D A V V N A A T S G V L S A G L N S T T M Q S L M L P H L E Y F Q Q V - I Q S L R C S R - C C H I W S T F S R S K F . : . . . . : . 12845 G : GTACTCCTTGCATATTTGGTGTTTTGACATGTGATACCTTGGAGCAG : GCTTTCAATCGC : G T P C I F G V L T C D T L E Q : A F N R : V L L A Y L V F - H V I P W S R : L S I A R : Y S L H I W C F D M - Y L G A : G F Q S . . . . . : . 11795 GTTGGTGGGAAGGCTGGGAATAAAGGTTCCGAAACTGCATTGACTGCT : ATTGAGATGGCA V G G K A G N K G S E T A L T A : I E M A L V G R L G I K V P K L H - L L : L R W H R W W E G W E - R F R N C I D C : Y - D G . . . . . . 11522 TCTTTGTTTGAGCACCACCTAAAGCCTTCAGAGTAGACAATCCTTCTTATCGCGACAAGG S L F E H H L K P S E - T I L L I A T R L C L S T T - S L Q S R Q S F L S R Q G I F V - A P P K A F R V D N P S Y R D K . . . . . . 11462 TTCTGGATTTTCACCTTTAAACAGAATCTCACTCGCATTACCATCCCAACTAGTTACTCC F W I F T F K Q N L T R I T I P T S Y S S G F S P L N R I S L A L P S Q L V T P V L D F H L - T E S H S H Y H P N - L L . . . . . . 11402 GACTAATAAGGTTAGACATCGAGGGGGAAAAAGAGTTCCCTCTACTTCTCGGTTTCCTTC D - - G - T S R G K K S S L Y F S V S F T N K V R H R G G K R V P S T S R F P S R L I R L D I E G E K E F P L L L G F L . . . . . . 11342 CTCGTCTACTCATTGATGCTGGATGTAAACATTCTTGTAAAAGCTGCACTTGTTTGAGAA L V Y S L M L D V N I L V K A A L V - E S S T H - C W M - T F L - K L H L F E K P R L L I D A G C K H S C K S C T C L R . . . . . . 11282 AATGTTGCACTTTGTTTCAAGTTTGAGTTTTGGATAATAGTATTCTTTCAAGTTTTGAGT N V A L C F K F E F W I I V F F Q V L S M L H F V S S L S F G - - Y S F K F - V K C C T L F Q V - V L D N S I L S S F E . . . . . 11222 TTTGGACTTCAAATTGGTTCTATGTGAATTGGAACTTCAACATTGATTA F G L Q I G S M - I G T S T L I L D F K L V L C E L E L Q H - L F W T S N W F Y V N W N F N I D Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-3-_PGL-2_AGS-1_PPS_1 (15385 15178,15098 14977,14257 14162,13885 13790,12923 12845,12149 12103,11807 11748,11534 11487) (frame '1'; 753 bp, 251 residues) 1 CYQKLNWKKK VKLQSEKIKM AASAFGQCSL LPRTVSLNPQ QSHRQLCSLS FHRQTVNSSL 61 PALSFTQSIG FGSAIERHCV DRNGSDLFKT DAVRQLNGSV ISAKGHRFAI VVARFNDLIT 121 KKLLEGALET FKNYSVREED IDVVWVPGCF EIGVTAQLLG KSQKYHAILC IGAVIRGDTS 181 HYDAVVNAAT SGVLSAGLNS GTPCIFGVLT CDTLEQAFNR VGGKAGNKGS ETALTAIEMA 241 SLFEHHLKPS E- PGL 3 (+ strand): 20641 20802 AGS-1 (20641 20802) SCR (e 0.784) Exon 1 20641 20802 ( 162 n); score: 0.784 PGS (20641 20802) SGN-E320920+ 3-phase translation of AGS-1 (+strand): . . . . . . 20641 ATCGCCACTGTCTCATGGACACCCATCACTGGCGTGTCACATAAATTTTGGAGGTATTCA I A T V S W T P I T G V S H K F W R Y S S P L S H G H P S L A C H I N F G G I Q R H C L M D T H H W R V T - I L E V F . . . . . . 20701 GAGGATCGTTTTGTAAGTTTGAGTGTTCAACTGACACAATTGAGATGAGTTGAGGTGCCT E D R F V S L S V Q L T Q L R - V E V P R I V L - V - V F N - H N - D E L R C L R G S F C K F E C S T D T I E M S - G A . . . . . 20761 AGATATGCATACTCAAAGTTGGAGTGTTTATTTGCCATATGA R Y A Y S K L E C L F A I - D M H T Q S W S V Y L P Y - I C I L K V G V F I C H M Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (-strand): . . . . . . 20802 TCATATGGCAAATAAACACTCCAACTTTGAGTATGCATATCTAGGCACCTCAACTCATCT S Y G K - T L Q L - V C I S R H L N S S H M A N K H S N F E Y A Y L G T S T H L I W Q I N T P T L S M H I - A P Q L I . . . . . . 20742 CAATTGTGTCAGTTGAACACTCAAACTTACAAAACGATCCTCTGAATACCTCCAAAATTT Q L C Q L N T Q T Y K T I L - I P P K F N C V S - T L K L T K R S S E Y L Q N L S I V S V E H S N L Q N D P L N T S K I . . . . . 20682 ATGTGACACGCCAGTGATGGGTGTCCATGAGACAGTGGCGAT M - H A S D G C P - D S G D C D T P V M G V H E T V A Y V T R Q - W V S M R Q W R Maximal non-overlapping open reading frames (>= 64 codons): none PGL 4 (- strand): 22196 21562 AGS-1 (22196 21562) SCR (e 1.000) Exon 1 22196 21562 ( 635 n); score: 1.000 PGS (22196 21562) SGN-E301519+ 3-phase translation of AGS-1 (-strand): . . . . . . 22196 TTTAATACACATTTTCAATGATGAAAGGGAGATATTTAAACTTTTAATACCGAAAGGGCA F N T H F Q - - K G D I - T F N T E R A L I H I F N D E R E I F K L L I P K G H - Y T F S M M K G R Y L N F - Y R K G . . . . . . 22136 TGGTCTAGCTATCAATGAAGTGAGTGAATTATGAGTGATCGTGATTCAAATCTCGGCAGA W S S Y Q - S E - I M S D R D S N L G R G L A I N E V S E L - V I V I Q I S A E M V - L S M K - V N Y E - S - F K S R Q . . . . . . 22076 AACAAAAAAAAAACACAATTTCTTCACGTGTGTTGACATTGGTTGATAAAGTTACTTGAT N K K K T Q F L H V C - H W L I K L L D T K K K H N F F T C V D I G - - S Y L I K Q K K N T I S S R V L T L V D K V T - . . . . . . 22016 ACTGTAGTAAGAAGTACTAGGCATCCGATGGAACATCGAGGTATGTGCAAGCTGATGACC T V V R S T R H P M E H R G M C K L M T L - - E V L G I R W N I E V C A S - - P Y C S K K Y - A S D G T S R Y V Q A D D . . . . . . 21956 CTAAACATATCATTAGTTATCAAAGACTTAAAATTGTAATCCAACCAAATATACCATATG L N I S L V I K D L K L - S N Q I Y H M - T Y H - L S K T - N C N P T K Y T I C P K H I I S Y Q R L K I V I Q P N I P Y . . . . . . 21896 CTTCTTATTGGGGATGTTTAAAAAAAAAAGACGTTCTACCTTCACGAGCTAGGGGTAAAG L L I G D V - K K K T F Y L H E L G V K F L L G M F K K K R R S T F T S - G - S A S Y W G C L K K K D V L P S R A R G K . . . . . . 21836 TGTACGTACACTCTATCCTCTCCTAACCCCACCTATGAGATTACATTGGGTACGTTGTTG C T Y T L S S P N P T Y E I T L G T L L V R T L Y P L L T P P M R L H W V R C C V Y V H S I L S - P H L - D Y I G Y V V . . . . . . 21776 TTGCTTCTTAGTGGGACACCTTAGGAATAAATAGCTTTTCTTCCTTTCAAACCTTTCACC L L L S G T P - E - I A F L P F K P F T C F L V G H L R N K - L F F L S N L S P V A S - W D T L G I N S F S S F Q T F H . . . . . . 21716 CAGTTAAAATACCAATCAATCAATTATACCACAATATTACAATATACAAGTATCTAACTT Q L K Y Q S I N Y T T I L Q Y T S I - L S - N T N Q S I I P Q Y Y N I Q V S N L P V K I P I N Q L Y H N I T I Y K Y L T . . . . . . 21656 GACTAGTATACAAGTGTGATAATAACAACTAGAACTTCATATAACCAACAACTCCTCCTA D - Y T S V I I T T R T S Y N Q Q L L L T S I Q V - - - Q L E L H I T N N S S Y - L V Y K C D N N N - N F I - P T T P P . . . . 21596 TGTGAGGTTCACATGGTTGGTCTCAAGTTCGAATG C E V H M V G L K F E V R F T W L V S S S N M - G S H G W S Q V R M Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-1 (+strand): . . . . . . 21562 CATTCGAACTTGAGACCAACCATGTGAACCTCACATAGGAGGAGTTGTTGGTTATATGAA H S N L R P T M - T S H R R S C W L Y E I R T - D Q P C E P H I G G V V G Y M K F E L E T N H V N L T - E E L L V I - . . . . . . 21622 GTTCTAGTTGTTATTATCACACTTGTATACTAGTCAAGTTAGATACTTGTATATTGTAAT V L V V I I T L V Y - S S - I L V Y C N F - L L L S H L Y T S Q V R Y L Y I V I S S S C Y Y H T C I L V K L D T C I L - . . . . . . 21682 ATTGTGGTATAATTGATTGATTGGTATTTTAACTGGGTGAAAGGTTTGAAAGGAAGAAAA I V V - L I D W Y F N W V K G L K G R K L W Y N - L I G I L T G - K V - K E E K Y C G I I D - L V F - L G E R F E R K K . . . . . . 21742 GCTATTTATTCCTAAGGTGTCCCACTAAGAAGCAACAACAACGTACCCAATGTAATCTCA A I Y S - G V P L R S N N N V P N V I S L F I P K V S H - E A T T T Y P M - S H S Y L F L R C P T K K Q Q Q R T Q C N L . . . . . . 21802 TAGGTGGGGTTAGGAGAGGATAGAGTGTACGTACACTTTACCCCTAGCTCGTGAAGGTAG - V G L G E D R V Y V H F T P S S - R - R W G - E R I E C T Y T L P L A R E G R I G G V R R G - S V R T L Y P - L V K V . . . . . . 21862 AACGTCTTTTTTTTTTAAACATCCCCAATAAGAAGCATATGGTATATTTGGTTGGATTAC N V F F F - T S P I R S I W Y I W L D Y T S F F F K H P Q - E A Y G I F G W I T E R L F F L N I P N K K H M V Y L V G L . . . . . . 21922 AATTTTAAGTCTTTGATAACTAATGATATGTTTAGGGTCATCAGCTTGCACATACCTCGA N F K S L I T N D M F R V I S L H I P R I L S L - - L M I C L G S S A C T Y L D Q F - V F D N - - Y V - G H Q L A H T S . . . . . . 21982 TGTTCCATCGGATGCCTAGTACTTCTTACTACAGTATCAAGTAACTTTATCAACCAATGT C S I G C L V L L T T V S S N F I N Q C V P S D A - Y F L L Q Y Q V T L S T N V M F H R M P S T S Y Y S I K - L Y Q P M . . . . . . 22042 CAACACACGTGAAGAAATTGTGTTTTTTTTTTGTTTCTGCCGAGATTTGAATCACGATCA Q H T - R N C V F F L F L P R F E S R S N T R E E I V F F F C F C R D L N H D H S T H V K K L C F F F V S A E I - I T I . . . . . . 22102 CTCATAATTCACTCACTTCATTGATAGCTAGACCATGCCCTTTCGGTATTAAAAGTTTAA L I I H S L H - - L D H A L S V L K V - S - F T H F I D S - T M P F R Y - K F K T H N S L T S L I A R P C P F G I K S L . . . . 22162 ATATCTCCCTTTCATCATTGAAAATGTGTATTAAA I S P F H H - K C V L Y L P F I I E N V Y - N I S L S S L K M C I K Maximal non-overlapping open reading frames (>= 64 codons): none PGL 5 (+ strand): 22995 23488 AGS-1 (22995 23359,23408 23434) SCR (e 0.856 d 0.000 a 0.985,e 0.704) Exon 1 22995 23359 ( 365 n); score: 0.856 Intron 1 23360 23407 ( 48 n); Pd: 0.000 Pa: 0.985 Exon 2 23408 23434 ( 27 n); score: 0.704 PGS (22995 23359,23408 23434) SGN-E542827+ 3-phase translation of AGS-1 (+strand): . . . . . . 22995 TTTTGACTTTTTTGGTTTCAATGACTAATTTATAATTATTATTTTGATAATCAAATTTAT F - L F W F Q - L I Y N Y Y F D N Q I Y F D F F G F N D - F I I I I L I I K F I L T F L V S M T N L - L L F - - S N L . . . . . . 23055 TTATGTTTCACTAATATTCTTGTAAAACTTGTTGTAGATGACCAAATTTTTTCTTCGAAT L C F T N I L V K L V V D D Q I F S S N Y V S L I F L - N L L - M T K F F L R I F M F H - Y S C K T C C R - P N F F F E . . . . . . 23115 ACAAAATTAAATTACAATACACAAAAAAAATAGTTTAATTTTTTTCTTTAAACTAAGGAA T K L N Y N T Q K K - F N F F L - T K E Q N - I T I H K K N S L I F F F K L R N Y K I K L Q Y T K K I V - F F S L N - G . . . . . . 23175 TGAAAGAAAAAAAACAAAATAAGAATAAGAAACTCAAATAATTATAATAAAAGAAGTTAA - K K K N K I R I R N S N N Y N K R S - E R K K T K - E - E T Q I I I I K E V K M K E K K Q N K N K K L K - L - - K K L . . . . . . 23235 AAAATAATTTATGTATCAAAAAAAATTAAAATATACCTTGAACTTTGATAGAAGAATCAT K I I Y V S K K I K I Y L E L - - K N H K - F M Y Q K K L K Y T L N F D R R I I K N N L C I K K N - N I P - T L I E E S . . . . . . 23295 ATATACCCCTAAATCATTTTTTTTAAAAAAATGAAGTAAAAAATATAAATTTAAAATTAA I Y P - I I F F K K M K - K I - I - N - Y T P K S F F L K K - S K K Y K F K I N Y I P L N H F F - K N E V K N I N L K L . : . . . 23355 TTTTT : GGGTATATGTGAGCCGATTGTATAACG F L : G I C E P I V - F : W V Y V S R L Y N I F : G Y M - A D C I T Maximal non-overlapping open reading frames (>= 64 codons): none AGS-2 (23083 23488) SCR (e 0.894) Exon 1 23083 23488 ( 406 n); score: 0.894 PGS (23083 23488) SGN-E243215+ PGS (23208 23488) SGN-E578389- 3-phase translation of AGS-2 (+strand): . . . . . . 23083 TTGTTGTAGATGACCAAATTTTTTCTTCGAATACAAAATTAAATTACAATACACAAAAAA L L - M T K F F L R I Q N - I T I H K K C C R - P N F F F E Y K I K L Q Y T K K V V D D Q I F S S N T K L N Y N T Q K . . . . . . 23143 AATAGTTTAATTTTTTTCTTTAAACTAAGGAATGAAAGAAAAAAAACAAAATAAGAATAA N S L I F F F K L R N E R K K T K - E - I V - F F S L N - G M K E K K Q N K N K K - F N F F L - T K E - K K K N K I R I . . . . . . 23203 GAAACTCAAATAATTATAATAAAAGAAGTTAAAAAATAATTTATGTATCAAAAAAAATTA E T Q I I I I K E V K K - F M Y Q K K L K L K - L - - K K L K N N L C I K K N - R N S N N Y N K R S - K I I Y V S K K I . . . . . . 23263 AAATATACCTTGAACTTTGATAGAAGAATCATATATACCCCTAAATCATTTTTTTTAAAA K Y T L N F D R R I I Y T P K S F F L K N I P - T L I E E S Y I P L N H F F - K K I Y L E L - - K N H I Y P - I I F F K . . . . . . 23323 AAATGAAGTAAAAAATATAAATTTAAAATTAATTTTTTAACATCCGTTAAATGAAGGGTA K - S K K Y K F K I N F L T S V K - R V N E V K N I N L K L I F - H P L N E G Y K M K - K I - I - N - F F N I R - M K G . . . . . . 23383 TATGTGAGCCATTTTGTAACGGCAGGGGTATATGTGAGCCGATTGTATAACGGTAAGGGC Y V S H F V T A G V Y V S R L Y N G K G M - A I L - R Q G Y M - A D C I T V R A I C E P F C N G R G I C E P I V - R - G . . . . . 23443 ATATATGAGCCACTTTTATAACGAGGGATATATTAGCTCCAAATGA I Y E P L L - R G I Y - L Q M Y M S H F Y N E G Y I S S K - H I - A T F I T R D I L A P N Maximal non-overlapping open reading frames (>= 64 codons): none 3-phase translation of AGS-2 (-strand): . . . . . . 23488 TCATTTGGAGCTAATATATCCCTCGTTATAAAAGTGGCTCATATATGCCCTTACCGTTAT S F G A N I S L V I K V A H I C P Y R Y H L E L I Y P S L - K W L I Y A L T V I I W S - Y I P R Y K S G S Y M P L P L . . . . . . 23428 ACAATCGGCTCACATATACCCCTGCCGTTACAAAATGGCTCACATATACCCTTCATTTAA T I G S H I P L P L Q N G S H I P F I - Q S A H I Y P C R Y K M A H I Y P S F N Y N R L T Y T P A V T K W L T Y T L H L . . . . . . 23368 CGGATGTTAAAAAATTAATTTTAAATTTATATTTTTTACTTCATTTTTTTAAAAAAAATG R M L K N - F - I Y I F Y F I F L K K M G C - K I N F K F I F F T S F F - K K - T D V K K L I L N L Y F L L H F F K K N . . . . . . 23308 ATTTAGGGGTATATATGATTCTTCTATCAAAGTTCAAGGTATATTTTAATTTTTTTTGAT I - G Y I - F F Y Q S S R Y I L I F F D F R G I Y D S S I K V Q G I F - F F L I D L G V Y M I L L S K F K V Y F N F F - . . . . . . 23248 ACATAAATTATTTTTTAACTTCTTTTATTATAATTATTTGAGTTTCTTATTCTTATTTTG T - I I F - L L L L - L F E F L I L I L H K L F F N F F Y Y N Y L S F L F L F C Y I N Y F L T S F I I I I - V S Y S Y F . . . . . . 23188 TTTTTTTTCTTTCATTCCTTAGTTTAAAGAAAAAAATTAAACTATTTTTTTTGTGTATTG F F F F H S L V - R K K L N Y F F C V L F F S F I P - F K E K N - T I F F V Y C V F F L S F L S L K K K I K L F F L C I . . . . . 23128 TAATTTAATTTTGTATTCGAAGAAAAAATTTGGTCATCTACAACAA - F N F V F E E K I W S S T T N L I L Y S K K K F G H L Q Q V I - F C I R R K N L V I Y N Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-3-_PGL-5_AGS-2_PPS_1 (23474 23250) (frame '0'; 222 bp, 74 residues) 1 YIPRYKSGSY MPLPLYNRLT YTPAVTKWLT YTLHLTDVKK LILNLYFLLH FFKKNDLGVY 61 MILLSKFKVY FNFF- PGL 6 (- strand): 25577 24267 AGS-1 (25577 24267) SCR (e 0.997) Exon 1 25577 24267 (1311 n); score: 0.997 PGS (25055 24267) SGN-E543825- PGS (25560 24834) SGN-E543826+ PGS (25577 24938) SGN-E305600+ PGS (25577 24943) SGN-E373690+ PGS (25535 24948) SGN-E305486+ PGS (25560 25032) SGN-E304360+ PGS (25559 25056) SGN-E306291+ 3-phase translation of AGS-1 (-strand): . . . . . . 25577 CAAAAAAAAATCATTCATAATTATGGCTTCTCTTTTTCTATCGAAATTTTCTTTCTTTCT Q K K I I H N Y G F S F S I E I F F L S K K K S F I I M A S L F L S K F S F F L K K N H S - L W L L F F Y R N F L S F . . . . . . 25517 TTTACTAATAATAATAACTTTATTTTTTCACTCTTCTCATTGTGATGTCCTAACAAGATG F T N N N N F I F S L F S L - C P N K M L L I I I T L F F H S S H C D V L T R C F Y - - - - L Y F F T L L I V M S - Q D . . . . . . 25457 TCACATCACATCACTTTTTCAGTTCGGCGATTCTATCGCGGATGCTGGAAACGTGATCCG S H H I T F S V R R F Y R G C W K R D P H I T S L F Q F G D S I A D A G N V I R V T S H H F F S S A I L S R M L E T - S . . . . . . 25397 CATACCTGGTGCCGTCATATCGGCCCAGGCATGGGGTCTACCTTATGGTGAAACCTTTTT H T W C R H I G P G M G S T L W - N L F I P G A V I S A Q A W G L P Y G E T F F A Y L V P S Y R P R H G V Y L M V K P F . . . . . . 25337 TCATAAACCTACTGGACGTTTTTCTGACGGTCGTATTATCGCTGACTATATCGCCACGGC S - T Y W T F F - R S Y Y R - L Y R H G H K P T G R F S D G R I I A D Y I A T A F I N L L D V F L T V V L S L T I S P R . . . . . . 25277 TCTCAGTCTCCCATTCCTCAATCCTTACATGGACAAATCAGGTGTTTCCTTTAGTCAAGG S Q S P I P Q S L H G Q I R C F L - S R L S L P F L N P Y M D K S G V S F S Q G L S V S H S S I L T W T N Q V F P L V K . . . . . . 25217 TGCTAATTTCGCCGTTGCTGGTGCAACGGCGATGAATAACTCTTTTTTGGAGGAGAGGGG C - F R R C W C N G D E - L F F G G E G A N F A V A G A T A M N N S F L E E R G V L I S P L L V Q R R - I T L F W R R G . . . . . . 25157 CATTGGACATGTCCCGTTCAACGTTCCTCTCCCGAGTCAATTAGAGTGGTTTAAATCTCA H W T C P V Q R S S P E S I R V V - I S I G H V P F N V P L P S Q L E W F K S H A L D M S R S T F L S R V N - S G L N L . . . . . . 25097 CCTCCAGTCAACCTATGGGTCAAAATATTCTACAACTCTACGAAACTCTCTTGTAGTATT P P V N L W V K I F Y N S T K L S C S I L Q S T Y G S K Y S T T L R N S L V V L T S S Q P M G Q N I L Q L Y E T L L - Y . . . . . . 25037 AGGAGAGTTTGGTGGAGTTGATTATTGGAATGCTTTAGCGGGAAATAAACCTGAACCTGA R R V W W S - L L E C F S G K - T - T - G E F G G V D Y W N A L A G N K P E P E - E S L V E L I I G M L - R E I N L N L . . . . . . 24977 GGTACGTACGTACGTACCTTTTATTATTGATGGCATTATAAGTGCCATCAAAGAGGTGAT G T Y V R T F Y Y - W H Y K C H Q R G D V R T Y V P F I I D G I I S A I K E V I R Y V R T Y L L L L M A L - V P S K R - . . . . . . 24917 CCAATTAGGATCAACTCGAATTTTGGTTCCAGGAGTTTTTCCTTTTGGGTGTCTATCATC P I R I N S N F G S R S F S F W V S I I Q L G S T R I L V P G V F P F G C L S S S N - D Q L E F W F Q E F F L L G V Y H . . . . . . 24857 ATATCTAACAAGATTCGCTGACACGAATCCAAATGCTTATGATCAATATGGTTGTTTGAA I S N K I R - H E S K C L - S I W L F E Y L T R F A D T N P N A Y D Q Y G C L K H I - Q D S L T R I Q M L M I N M V V - . . . . . . 24797 ATTTTATAATGATTTCGCTACGTATCATAATATCGAGCTAAAGAAGGCTCTAGAAAATCT I L - - F R Y V S - Y R A K E G S R K S F Y N D F A T Y H N I E L K K A L E N L N F I M I S L R I I I S S - R R L - K I . . . . . . 24737 ACGATGTGAGTTCCCACGTGTTAAAATTGTGTATGGGGATTACTATGGTGGTTTTAGGCT T M - V P T C - N C V W G L L W W F - A R C E F P R V K I V Y G D Y Y G G F R L Y D V S S H V L K L C M G I T M V V L G . . . . . . 24677 TGTTTTTCGATACGCGTCTTGGTTGGGATTTAATCCAAGTACATTGGTATCGGCATGTTG C F S I R V L V G I - S K Y I G I G M L V F R Y A S W L G F N P S T L V S A C C L F F D T R L G W D L I Q V H W Y R H V . . . . . . 24617 TGGGAGCGGAGGACGATACAATGCCGGGGGATGTAGTTCAGCTAGTACCAATGTATGTCC W E R R T I Q C R G M - F S - Y Q C M S G S G G R Y N A G G C S S A S T N V C P V G A E D D T M P G D V V Q L V P M Y V . . . . . . 24557 TAACCCGTCCCAATACGTTAATTGGGACGGGCTTCATCTGACAGATGAAGCGTATCATCG - P V P I R - L G R A S S D R - S V S S N P S Q Y V N W D G L H L T D E A Y H R L T R P N T L I G T G F I - Q M K R I I . . . . . . 24497 TATTTCTAACGTTGTTATCAACAATATGCTTCCAAAATTTGGGTGTTATGGACTTAGAAA Y F - R C Y Q Q Y A S K I W V L W T - K I S N V V I N N M L P K F G C Y G L R N V F L T L L S T I C F Q N L G V M D L E . . . . . . 24437 TTCGAGTGCATTATCTAGTTATTGAATTTTTAAGGTTTTTGTTTAATTTGTTTTGGTTAT F E C I I - L L N F - G F C L I C F G Y S S A L S S Y - I F K V F V - F V L V I I R V H Y L V I E F L R F L F N L F W L . . . . . . 24377 TTGGGTGTCTTTTTCATGATTTTCTTTGTACTTGAGTTCGTTGATTTTGATGAAATGTGT L G V F F M I F F V L E F V D F D E M C W V S F S - F S L Y L S S L I L M K C V F G C L F H D F L C T - V R - F - - N V . . . . . . 24317 TGTAGTTTGAAATGAATATGAATAATGAGAAAAGATAATTGTTGTGTTGTA C S L K - I - I M R K D N C C V V V V - N E Y E - - E K I I V V L L - F E M N M N N E K R - L L C C Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-3-_PGL-6_AGS-1_PPS_1 (25576 24413) (frame '2'; 1161 bp, 387 residues) 1 KKKSFIIMAS LFLSKFSFFL LLIIITLFFH SSHCDVLTRC HITSLFQFGD SIADAGNVIR 61 IPGAVISAQA WGLPYGETFF HKPTGRFSDG RIIADYIATA LSLPFLNPYM DKSGVSFSQG 121 ANFAVAGATA MNNSFLEERG IGHVPFNVPL PSQLEWFKSH LQSTYGSKYS TTLRNSLVVL 181 GEFGGVDYWN ALAGNKPEPE VRTYVPFIID GIISAIKEVI QLGSTRILVP GVFPFGCLSS 241 YLTRFADTNP NAYDQYGCLK FYNDFATYHN IELKKALENL RCEFPRVKIV YGDYYGGFRL 301 VFRYASWLGF NPSTLVSACC GSGGRYNAGG CSSASTNVCP NPSQYVNWDG LHLTDEAYHR 361 ISNVVINNML PKFGCYGLRN SSALSSY- 3-phase translation of AGS-1 (+strand): . . . . . . 24267 TACAACACAACAATTATCTTTTCTCATTATTCATATTCATTTCAAACTACAACACATTTC Y N T T I I F S H Y S Y S F Q T T T H F T T Q Q L S F L I I H I H F K L Q H I S Q H N N Y L F S L F I F I S N Y N T F . . . . . . 24327 ATCAAAATCAACGAACTCAAGTACAAAGAAAATCATGAAAAAGACACCCAAATAACCAAA I K I N E L K Y K E N H E K D T Q I T K S K S T N S S T K K I M K K T P K - P K H Q N Q R T Q V Q R K S - K R H P N N Q . . . . . . 24387 ACAAATTAAACAAAAACCTTAAAAATTCAATAACTAGATAATGCACTCGAATTTCTAAGT T N - T K T L K I Q - L D N A L E F L S Q I K Q K P - K F N N - I M H S N F - V N K L N K N L K N S I T R - C T R I S K . . . . . . 24447 CCATAACACCCAAATTTTGGAAGCATATTGTTGATAACAACGTTAGAAATACGATGATAC P - H P N F G S I L L I T T L E I R - Y H N T Q I L E A Y C - - Q R - K Y D D T S I T P K F W K H I V D N N V R N T M I . . . . . . 24507 GCTTCATCTGTCAGATGAAGCCCGTCCCAATTAACGTATTGGGACGGGTTAGGACATACA A S S V R - S P S Q L T Y W D G L G H T L H L S D E A R P N - R I G T G - D I H R F I C Q M K P V P I N V L G R V R T Y . . . . . . 24567 TTGGTACTAGCTGAACTACATCCCCCGGCATTGTATCGTCCTCCGCTCCCACAACATGCC L V L A E L H P P A L Y R P P L P Q H A W Y - L N Y I P R H C I V L R S H N M P I G T S - T T S P G I V S S S A P T T C . . . . . . 24627 GATACCAATGTACTTGGATTAAATCCCAACCAAGACGCGTATCGAAAAACAAGCCTAAAA D T N V L G L N P N Q D A Y R K T S L K I P M Y L D - I P T K T R I E K Q A - N R Y Q C T W I K S Q P R R V S K N K P K . . . . . . 24687 CCACCATAGTAATCCCCATACACAATTTTAACACGTGGGAACTCACATCGTAGATTTTCT P P - - S P Y T I L T R G N S H R R F S H H S N P H T Q F - H V G T H I V D F L T T I V I P I H N F N T W E L T S - I F . . . . . . 24747 AGAGCCTTCTTTAGCTCGATATTATGATACGTAGCGAAATCATTATAAAATTTCAAACAA R A F F S S I L - Y V A K S L - N F K Q E P S L A R Y Y D T - R N H Y K I S N N - S L L - L D I M I R S E I I I K F Q T . . . . . . 24807 CCATATTGATCATAAGCATTTGGATTCGTGTCAGCGAATCTTGTTAGATATGATGATAGA P Y - S - A F G F V S A N L V R Y D D R H I D H K H L D S C Q R I L L D M M I D T I L I I S I W I R V S E S C - I - - - . . . . . . 24867 CACCCAAAAGGAAAAACTCCTGGAACCAAAATTCGAGTTGATCCTAATTGGATCACCTCT H P K G K T P G T K I R V D P N W I T S T Q K E K L L E P K F E L I L I G S P L T P K R K N S W N Q N S S - S - L D H L . . . . . . 24927 TTGATGGCACTTATAATGCCATCAATAATAAAAGGTACGTACGTACGTACCTCAGGTTCA L M A L I M P S I I K G T Y V R T S G S - W H L - C H Q - - K V R T Y V P Q V Q F D G T Y N A I N N K R Y V R T Y L R F . . . . . . 24987 GGTTTATTTCCCGCTAAAGCATTCCAATAATCAACTCCACCAAACTCTCCTAATACTACA G L F P A K A F Q - S T P P N S P N T T V Y F P L K H S N N Q L H Q T L L I L Q R F I S R - S I P I I N S T K L S - Y Y . . . . . . 25047 AGAGAGTTTCGTAGAGTTGTAGAATATTTTGACCCATAGGTTGACTGGAGGTGAGATTTA R E F R R V V E Y F D P - V D W R - D L E S F V E L - N I L T H R L T G G E I - K R V S - S C R I F - P I G - L E V R F . . . . . . 25107 AACCACTCTAATTGACTCGGGAGAGGAACGTTGAACGGGACATGTCCAATGCCCCTCTCC N H S N - L G R G T L N G T C P M P L S T T L I D S G E E R - T G H V Q C P S P K P L - L T R E R N V E R D M S N A P L . . . . . . 25167 TCCAAAAAAGAGTTATTCATCGCCGTTGCACCAGCAACGGCGAAATTAGCACCTTGACTA S K K E L F I A V A P A T A K L A P - L P K K S Y S S P L H Q Q R R N - H L D - L Q K R V I H R R C T S N G E I S T L T . . . . . . 25227 AAGGAAACACCTGATTTGTCCATGTAAGGATTGAGGAATGGGAGACTGAGAGCCGTGGCG K E T P D L S M - G L R N G R L R A V A R K H L I C P C K D - G M G D - E P W R K G N T - F V H V R I E E W E T E S R G . . . . . . 25287 ATATAGTCAGCGATAATACGACCGTCAGAAAAACGTCCAGTAGGTTTATGAAAAAAGGTT I - S A I I R P S E K R P V G L - K K V Y S Q R - Y D R Q K N V Q - V Y E K R F D I V S D N T T V R K T S S R F M K K G . . . . . . 25347 TCACCATAAGGTAGACCCCATGCCTGGGCCGATATGACGGCACCAGGTATGCGGATCACG S P - G R P H A W A D M T A P G M R I T H H K V D P M P G P I - R H Q V C G S R F T I R - T P C L G R Y D G T R Y A D H . . . . . . 25407 TTTCCAGCATCCGCGATAGAATCGCCGAACTGAAAAAGTGATGTGATGTGACATCTTGTT F P A S A I E S P N - K S D V M - H L V F Q H P R - N R R T E K V M - C D I L L V S S I R D R I A E L K K - C D V T S C . . . . . . 25467 AGGACATCACAATGAGAAGAGTGAAAAAATAAAGTTATTATTATTAGTAAAAGAAAGAAA R T S Q - E E - K N K V I I I S K R K K G H H N E K S E K I K L L L L V K E R K - D I T M R R V K K - S Y Y Y - - K K E . . . . . . 25527 GAAAATTTCGATAGAAAAAGAGAAGCCATAATTATGAATGATTTTTTTTTG E N F D R K R E A I I M N D F F L K I S I E K E K P - L - M I F F R K F R - K K R S H N Y E - F F F Maximal non-overlapping open reading frames (>= 64 codons): >C12HBa0093P12.1-3+_PGL-6_AGS-1_PPS_1 (24822 25016) (frame '1'; 192 bp, 64 residues) 1 AFGFVSANLV RYDDRHPKGK TPGTKIRVDP NWITSLMALI MPSIIKGTYV RTSGSGLFPA 61 KAFQ- ... finished at: Thu Jul 27 14:06:08 2006