38
o programa InterPro, de forma a aumentar a confiabilidade de nossos resultados. O
InterPro é uma ferramenta que integra vários bancos de dados dentre eles o Pfam
1441 cgttttggtgtggaagaacgcctacaatgtggtttgacaaataag
R F G V E E R L Q C G L T N K
1486 gtctcgtacaaaactgatcatgaatggattctttcagttccagtt
V S Y K T D H E W I L S V P V
1531 ccaatggaagcggtaacgaatcaaaaagcattagacaattatgag
P M E A V T N Q K A L D N Y E
1576 aaacgacgtgctgacgctgagttaaatggtatacatttatcacct
K R R A D A E L N G I H L S P
1621 gaagaggttgtacgaccaattattcctatggaagcttgcctgtcg
E E V V R P I I P M E A C L S
1666 gcttgggctcaaacagaacgaattactgattttaaaacaccagca
A W A Q T E R I T D F K T P A
1711 agtcaacctccaggtgccaaaacattcgctttacgttctaatcgt
S Q P P G A K T F A L R S N R
1756 ttaacaaattttccaaattatttatgtatacaagttggacgattt
L T N F P N Y L C I Q V G R F
1801 actgttggtacagattggcttccgaaaaaattggatgtggacatc
T V G T D W L P K K L D V D I
1846 gagttgaaatcatctgttggaaatcatggtaatacagaatggttt
E L K S S V G N H G N T E W F
1891 attgatttaaataatttacgtgcaccaaatggtagtagaccgttg
I D L N N L R A P N G S R P L
1936 cctggtgaacaattgatgccaactggagatgaaataaaatctgaa
P G E Q L M P T G D E I K S E
1981 caaatgaaaattgatgatgatgtacagtcagatgtaacaattatc
Q M K I D D D V Q S D V T I I
2026 aatgatctgttagttatgggtttcactttagaagcagcacagaaa
N D L L V M G F T L E A A Q K
2071 gcttgtaaatttacacaaaatagtagtgtagaaaatgctaccaac
A C K F T Q N S S V E N A T N
2116 tggttaatggaacatttagatgatccagatttaaatgatccgttg
W L M E H L D D P D L N D P L
2161 ccgccgcctagtgattatcaacaacaaaataaacagaaacaatca
P P P S D Y Q Q Q N K Q K Q S
2206 gtggttggatcggtcacttctacagatattgacgaaaacgcagtt
V V G S V T S T D I D E N A V
2251 gaaatgatgttggctatgggattcagtagacaacagtcaatcgtc
E M M L A M G F S R Q Q S I V
2296 gcattacaacataaaaatggtaatttggaacaagctgctgactgg
A L Q H K N G N L E Q A A D W
2341 gcattgagtagtccagatgaattagaaactttattacttcagtct
A L S S P D E L E T L L L Q S
2386 gaaataaatatgactaataaatcttccgctgatgctgggaataat
E I N M T N K S S A D A G N N
2431 tcaagtcagacatcatcatcttcatcaggtccattgctatcagat
S S Q T S S S S S G P L L S D
2476 ggtagttcaaagtatgaattattcgcttttattagtcatatggga
G S S K Y E L F A F I S H M G
2521 aagtcgaccaatgatggtcattacgtagctcatattaaacgatca
K S T N D G H Y V A H I K R S
2566 tttctagctaaatgtattcctcatgaaccgccagtatatcatgtt
F L A K C I P H E P P V Y H V
2611 ggctcaccaatttgtggcggttccgatcaagaatggattatattc
G S P I C G G S D Q E W I I F
2656 aatgatgagaaagtagcaaaaagtgaatgccctccacatcgtcat
N D E K V A K S E C P P H R H
2701 gcttatttatatttctttcgaagattagatgctactgatcagtct
A Y L Y F F R R L D A T D Q S
2746 gactaa 2751
D *
1 atgtccgaatctatttttgaggacttatctgctcaagtcaggatt
M S E S I F E D L S A Q V R I
46 cctggtgacggggaaaaggtattcaaggatgaatgtccgttttcg
P G D G E K V F K D E C P F S
91 tttgagacacctgagacagaaaaaggaatatacatatgcatgcga
F E T P E T E K G I Y I C M R
136 cacttcgttgcgatcggctctaaaacacttaggctttattatgag
H F V A I G S K T L R L Y Y E
181 aaaactggctgtcgtgctttcctaagatacaagatcgaaaagcaa
K T G C R A F L R Y K I E K Q
226 ttcaaagataagggtactagtgctcaagaacgcccgacaaaactc
F K D K G T S A Q E R P T K L
271 gccgttggagtttctggaggttttgactttccccaagatatgtat
A V G V S G G F D F P Q D M Y
316 acagtcactgaacattggtcgctagtacgtcttccgcaaggcgac
T V T E H W S L V R L P Q G D
361 tcatttgatatcccgaatccagaacccggacaaggacaatctgat
S F D I P N P E P G Q G Q S D
406 atatctcacttagagattttggaactccctaaacgcctggctact
I S H L E I L E L P K R L A T
451 tcgatcgcattaattcaacgtgctgagtcagtattactagcagaa
S I A L I Q R A E S V L L A E
496 gaacgagctaactcacttaaagcttgggaagatgataatatttgt
E R A N S L K A W E D D N I C
541 tttatttcctcgtatgccatgaatttagcacaaataaataattct
F I S S Y A M N L A Q I N N S
586 gttcggattcctccgtctggatggaaatgtgccaaatgtgatttg
V R I P P S G W K C A K C D L
631 agagataatctgtggatgaatcttactgatggaaccatactttgt
R D N L W M N L T D G T I L C
676 ggccgaaaattttgggatgggtctggagggaataatcacgctctg
G R K F W D G S G G N N H A L
721 gagcattatgagaagaccaagtatccactagctgtcaaacttgga
E H Y E K T K Y P L A V K L G
766 actataactccgaagggtggagaagtgttttcatatcctgaagat
T I T P K G G E V F S Y P E D
811 tcaatggtaaccgacccaaaattagcagaacatcttgctcacttc
S M V T D P K L A E H L A H F
856 ggtattgatgtaatgcttatgcaaaaaaccgataaaacaatggcc
G I D V M L M Q K T D K T M A
901 gagttagaagtggatgcaaatgaaagacttggagaatggctaaca
E L E V D A N E R L G E W L T
946 ttgcaggaatccaaccatacacttgaagcacgttatggtccaggt
L Q E S N H T L E A R Y G P G
991 atgactggacttagaaatctcggtaatacttgttacatgaatgct
M T G L R N L G N T C Y M N A
1036 gtgttacaggttctgttttcaatttcacaatttcgttcgtactat
V L Q V L F S I S Q F R S Y Y
1081 gcctatcaattaccagtttggtgtgaggaagcattggatcaattc
A Y Q L P V W C E E A L D Q F
1126 acttctgaaaatcatccacttctaccagtagatcatattggtcta
T S E N H P L L P V D H I G L
1171 caattttcgaaattaggacatggtctttgttcaggcgctcattca
Q F S K L G H G L C S G A H S
1216 tggctagttccaaataattttagtcatactactacaggtggccct
W L V P N N F S H T T T G G P
1261 gtgcctattctcccgggtatacgtccaattttgtttcgtcgatta
V P I L P G I R P I L F R R L
1306 atgggtgctcataactccacatttgccacaagacagcaacaagat
M G A H N S T F A T R Q Q Q D
1351 gcttgcgaatttttaatatatctattagatttattagaacaaaaa
A C E F L I Y L L D L L E Q K
1396 gccaataaacaggaacacggaaaagtacatccaagtaatgtgatg
A N K Q E H G K V H P S N V M
– Seqüência da ORF predita pelo programa ORF finder de SmDUB5, janela+1 contendo 916 resíduos de
aminoácidos. A seqüência de nucleotídeos inicia no nucleotídeo 1 e termina no nucleotídeo 2751, ela está
representada em letra minúscula, a seqüência de aminoácidos está representada em letra maiúscula.