BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= FNP_0993
(200 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Sequences producing significant alignments:    Score (bits)    E Value
gi|148323539|gb|EDK88789.1| possible intracellular protease... 357 3e-97
gi|19705181|ref|NP_602676.1| 4-methyl-5(B-hydroxyethyl)-thi... 330 4e-89
gi|47568298|ref|ZP_00239000.1| 4-methyl-5(B-hydroxyethyl)-t... 134 3e-30
gi|71279213|ref|YP_267551.1| DJ-1/PfpI family protein [Colw... 131 2e-29
gi|126652384|ref|ZP_01724557.1| DJ-1/PfpI family protein [B... 131 3e-29
gi|90412401|ref|ZP_01220405.1| hypothetical intracellular p... 130 6e-29
gi|42781363|ref|NP_978610.1| DJ-1/PfpI family [Bacillus cer... 128 2e-28
gi|54302174|ref|YP_132167.1| hypothetical intracellular pro... 125 1e-27
gi|15896593|ref|NP_349942.1| Putative intracellular proteas... 112 2e-23
gi|145953864|ref|ZP_01802872.1| hypothetical protein CdifQ_... 108 1e-22
gi|126700977|ref|YP_001089874.1| putative protease [Clostri... 108 3e-22
gi|153938279|ref|YP_001390376.1| DJ-1/PfpI family protein [... 106 9e-22
gi|148379015|ref|YP_001253556.1| protease I [Clostridium bo... 106 1e-21
gi|29347781|ref|NP_811284.1| putative protease/amidase [Bac... 94 3e-18
gi|18311049|ref|NP_562983.1| 4-methyl-5(beta-hydroxyethyl)-... 87 8e-16
gi|110801657|ref|YP_699347.1| DJ-1 family protein [Clostrid... 85 2e-15
gi|110799272|ref|YP_696747.1| DJ-1 family protein [Clostrid... 84 3e-15
gi|66808601|ref|XP_638023.1| hypothetical protein DDBDRAFT_... 82 2e-14
gi|150019134|ref|YP_001311388.1| DJ-1 family protein [Clost... 73 1e-11
gi|34764182|ref|ZP_00145046.1| 4-methyl-5(B-hydroxyethyl)-t... 72 3e-11
gi|15894907|ref|NP_348256.1| Putative intracellular proteas... 71 3e-11
gi|153939756|ref|YP_001392047.1| DJ-1 family protein [Clost... 70 8e-11
gi|148380727|ref|YP_001255268.1| 4-methyl-5(b-hydroxyethyl)... 70 8e-11
gi|53715247|ref|YP_101239.1| putative ThiJ family intracell... 70 1e-10
gi|156546896|ref|XP_001599104.1| PREDICTED: similar to GH09... 69 1e-10
gi|153954760|ref|YP_001395525.1| hypothetical protein CKL_2... 69 1e-10
gi|15902757|ref|NP_358307.1| 4-methyl-5(b-hydroxyethyl)-thi... 67 7e-10
gi|149003439|ref|ZP_01828328.1| 4-methyl-5(b-hydroxyethyl)-... 67 7e-10
gi|15900697|ref|NP_345301.1| 4-methyl-5(b-hydroxyethyl)-thi... 67 8e-10
gi|148997116|ref|ZP_01824770.1| 4-methyl-5(b-hydroxyethyl)-... 67 8e-10
gi|67484652|ref|XP_657546.1| 4-methyl-5(B-hydroxyethyl)-thi... 66 1e-09
gi|66531474|ref|XP_624271.1| PREDICTED: similar to dj-1 CG1... 66 1e-09
gi|148985843|ref|ZP_01818937.1| 4-methyl-5(b-hydroxyethyl)-... 65 3e-09
gi|28211406|ref|NP_782350.1| 4-methyl-5(B-hydroxyethyl)-thi... 65 3e-09
gi|28571932|ref|NP_651825.3| dj-1beta CG1349-PA [Drosophila... 63 9e-09
gi|156719857|ref|ZP_02061463.1| DJ-1 family protein [Hydrog... 62 2e-08
gi|157104409|ref|XP_001648396.1| dj-1 protein (park7) [Aede... 62 2e-08
gi|29349330|ref|NP_812833.1| putative ThiJ family intracell... 61 3e-08
gi|126330567|ref|XP_001362447.1| PREDICTED: similar to CAP1... 61 3e-08
gi|153813466|ref|ZP_01966134.1| hypothetical protein RUMOBE... 61 4e-08
gi|146302026|ref|YP_001196617.1| intracellular protease, Pf... 61 4e-08
gi|62752059|ref|NP_001015851.1| MGC108042 protein [Xenopus ... 60 7e-08
gi|147900143|ref|NP_001083896.1| SP22 [Xenopus laevis] >gi|... 60 7e-08
gi|90422501|ref|YP_530871.1| Peptidase C56, PfpI [Rhodopseu... 60 7e-08
gi|89210012|ref|ZP_01188405.1| DJ-1 [Halothermothrix orenii... 60 1e-07
gi|16303786|gb|AAL16803.1|AF394958_1 SP22 [Xenopus laevis] 60 1e-07
gi|66267686|dbj|BAD98544.1| DJ-1 [Crocodylus niloticus] 59 2e-07
gi|66267682|dbj|BAD98542.1| DJ-1 [Alligator mississippiensis] 59 2e-07
gi|38234592|ref|NP_940359.1| Putative protease [Corynebacte... 59 2e-07
gi|116672030|ref|YP_832963.1| intracellular protease, PfpI ... 59 2e-07
gi|52079280|ref|YP_078071.1| putative intracellular proteas... 59 2e-07
gi|52784646|ref|YP_090475.1| hypothetical protein BLi00848 ... 59 2e-07
gi|145902805|gb|AAU22433.2| putative intracellular protease... 59 2e-07
gi|91978428|ref|YP_571087.1| Peptidase C56, PfpI [Rhodopseu... 59 2e-07
gi|121997998|ref|YP_001002785.1| DJ-1 family protein [Halor... 58 2e-07
gi|73748071|ref|YP_307310.1| DJ-1 family protein [Dehalococ... 58 3e-07
gi|147668899|ref|YP_001213717.1| DJ-1 family protein [Dehal... 58 3e-07
gi|126657553|ref|ZP_01728709.1| proteinase I [Cyanothece sp... 58 3e-07
gi|120435844|ref|YP_861530.1| peptidase, family C56 [Gramel... 58 4e-07
gi|45383015|ref|NP_989916.1| Parkinson disease (autosomal r... 58 4e-07
gi|125772586|ref|XP_001357594.1| GA12322-PA [Drosophila pse... 58 4e-07
gi|86751228|ref|YP_487724.1| Peptidase C56, PfpI [Rhodopseu... 57 5e-07
gi|74212240|dbj|BAE40278.1| unnamed protein product [Mus mu... 57 5e-07
gi|110601832|ref|ZP_01389998.1| Peptidase C56, PfpI [Geobac... 57 6e-07
gi|74317863|ref|YP_315603.1| putative protease [Thiobacillu... 57 6e-07
gi|150003061|ref|YP_001297805.1| putative ThiJ family intra... 57 7e-07
gi|148255686|ref|YP_001240271.1| putative intracellular pro... 57 7e-07
gi|39934376|ref|NP_946652.1| putative intracellular proteas... 57 7e-07
gi|62751849|ref|NP_001015572.1| Parkinson disease (autosoma... 57 8e-07
gi|153854555|ref|ZP_01995825.1| hypothetical protein DORLON... 57 8e-07
gi|55741460|ref|NP_065594.2| DJ-1 protein [Mus musculus] >g... 57 8e-07
gi|118580535|ref|YP_901785.1| metal dependent phosphohydrol... 57 9e-07
gi|121535032|ref|ZP_01666850.1| ThiJ/PfpI domain protein [T... 56 1e-06
gi|149689074|gb|ABR27864.1| DJ-1 [Triatoma infestans] 56 1e-06
gi|147775474|emb|CAN62882.1| hypothetical protein [Vitis vi... 56 1e-06
gi|152992160|ref|YP_001357881.1| 4-methyl-5(beta-hydroxyeth... 56 1e-06
gi|154175067|ref|YP_001407863.1| DJ-1 family protein [Campy... 56 1e-06
gi|152981191|ref|YP_001354772.1| transcriptional regulator,... 56 1e-06
gi|119469330|ref|ZP_01612269.1| proteinase [Alteromonadales... 56 2e-06
gi|145596702|ref|YP_001160999.1| intracellular protease, Pf... 55 2e-06
gi|147905238|ref|NP_001086295.1| MGC84701 protein [Xenopus ... 55 2e-06
gi|18404397|ref|NP_564626.1| DJ-1 family protein [Arabidops... 55 2e-06
gi|21536528|gb|AAM60860.1| 4-methyl-5(b-hydroxyethyl)-thiaz... 55 2e-06
gi|62319084|dbj|BAD94229.1| hypothetical protein [Arabidops... 55 2e-06
gi|39586901|emb|CAE62836.1| Hypothetical protein CBG07015 [... 55 2e-06
gi|156370244|ref|XP_001628381.1| predicted protein [Nematos... 55 2e-06
gi|110633837|ref|YP_674045.1| intracellular protease, PfpI ... 55 2e-06
gi|81865403|sp|Q7TQ35|PARK7_MESAU Protein DJ-1 (Parkinson d... 55 2e-06
gi|56201615|dbj|BAD73062.1| putative 4-methyl-5(B-hydroxyet... 55 2e-06
gi|115435298|ref|NP_001042407.1| Os01g0217800 [Oryza sativa... 55 2e-06
gi|117924316|ref|YP_864933.1| DJ-1 family protein [Magnetoc... 55 2e-06
gi|125524923|gb|EAY73037.1| hypothetical protein OsI_000884... 55 2e-06
gi|118474065|ref|YP_892402.1| 4-methyl-5(B-hydroxyethyl)-th... 55 2e-06
gi|154506032|ref|ZP_02042770.1| hypothetical protein RUMGNA... 55 3e-06
gi|24653499|ref|NP_610916.1| DJ-1alpha CG6646-PA [Drosophil... 55 3e-06
gi|57086915|ref|XP_536733.1| PREDICTED: similar to DJ-1 pro... 55 3e-06
gi|66267684|dbj|BAD98543.1| DJ-1 [Pseudemys nelsoni] 55 3e-06
gi|114707002|ref|ZP_01439901.1| proteinase [Fulvimarina pel... 55 3e-06
gi|74310993|ref|YP_309412.1| 4-methyl-5(beta-hydroxyethyl)-... 54 4e-06
gi|75512485|ref|ZP_00735024.1| COG0693: Putative intracellu... 54 4e-06
gi|75176898|ref|ZP_00697014.1| COG0693: Putative intracellu... 54 4e-06
gi|1100872|gb|AAA82704.1| ThiJ >gi|1773108|gb|AAB40180.1| 4... 54 4e-06
gi|89107294|ref|AP_001074.1| hypothetical protein [Escheric... 54 4e-06
gi|15800154|ref|NP_286166.1| 4-methyl-5(beta-hydroxyethyl)-... 54 4e-06
gi|82407737|pdb|2AB0|A Chain A, Crystal Structure Of E. Col... 54 4e-06
gi|26246430|ref|NP_752469.1| 4-methyl-5(B-hydroxyethyl)-thi... 54 4e-06
gi|38703865|ref|NP_308505.2| 4-methyl-5(beta-hydroxyethyl)-... 54 4e-06
gi|118403904|ref|NP_001072131.1| DJ-1 protein [Sus scrofa] ... 54 4e-06
gi|16924002|ref|NP_476484.1| DJ-1 protein [Rattus norvegicu... 54 5e-06
gi|83941938|ref|ZP_00954400.1| putative intracellular prote... 54 5e-06
gi|150389262|ref|YP_001319311.1| ThiJ/PfpI domain protein [... 54 5e-06
gi|149185899|ref|ZP_01864214.1| protease [Erythrobacter sp.... 54 5e-06
gi|54302922|ref|YP_132915.1| hypothetical protein PBPRB1243... 54 5e-06
gi|146340883|ref|YP_001205931.1| putative intracellular pro... 54 6e-06
gi|90418768|ref|ZP_01226679.1| putative intracellular prote... 54 6e-06
gi|83855414|ref|ZP_00948944.1| putative intracellular prote... 54 6e-06
gi|20807466|ref|NP_622637.1| putative intracellular proteas... 54 6e-06
gi|126354153|ref|ZP_01711164.1| intracellular protease, Pfp... 54 7e-06
gi|114690169|ref|XP_521268.2| PREDICTED: similar to DJ-1 is... 54 7e-06
gi|86130142|ref|ZP_01048742.1| proteinase [Cellulophaga sp.... 54 7e-06
gi|31543380|ref|NP_009193.2| DJ-1 protein [Homo sapiens] >g... 54 7e-06
gi|75761540|ref|ZP_00741499.1| Transcriptional regulator, A... 54 7e-06
gi|86143389|ref|ZP_01061791.1| proteinase [Flavobacterium s... 54 8e-06
gi|15669157|ref|NP_247962.1| intracellular protease (pfpI) ... 54 8e-06
gi|33358055|pdb|1PE0|A Chain A, Crystal Structure Of The K1... 54 8e-06
gi|89275119|gb|ABD66014.1| SP22 [Xenopus laevis] 53 8e-06
gi|149695427|ref|XP_001495448.1| PREDICTED: similar to DJ-1... 53 8e-06
gi|42543006|pdb|1J42|A Chain A, Crystal Structure Of Human ... 53 9e-06
gi|92115071|ref|YP_574999.1| Peptidase C56, PfpI [Chromohal... 53 9e-06
>gi|148323539|gb|EDK88789.1| possible intracellular protease/amidase [Fusobacterium nucleatum
subsp. polymorphum ATCC 10953]
Length = 200
Score = 357 bits (915), Expect = 3e-97, Method: Composition-based stats.
Identities = 200/200 (100%), Positives = 200/200 (100%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE
Sbjct: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN
Sbjct: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL
Sbjct: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
Query: 181 LEKLTSNENVNIIKDNMFLK 200
LEKLTSNENVNIIKDNMFLK
Sbjct: 181 LEKLTSNENVNIIKDNMFLK 200
>gi|19705181|ref|NP_602676.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Fusobacterium nucleatum subsp. nucleatum ATCC
25586]
gi|19713122|gb|AAL93975.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Fusobacterium nucleatum subsp. nucleatum ATCC
25586]
Length = 200
Score = 330 bits (845), Expect = 4e-89, Method: Composition-based stats.
Identities = 180/200 (90%), Positives = 193/200 (96%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE
Sbjct: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
KIITEDNVE+F++YD L+IPGGFGKANFFKD +N+IFKKLIKYF ENNK+IVAICSAVIN
Sbjct: 61 KIITEDNVENFYEYDALVIPGGFGKANFFKDNDNEIFKKLIKYFSENNKVIVAICSAVIN 120
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
LLE+TYIR KKVTTYLLDNKRYFNQLKNYN+IP+EEEIV DNNLFTCSGPGNALELSFR+
Sbjct: 121 LLETTYIRDKKVTTYLLDNKRYFNQLKNYNIIPVEEEIVIDNNLFTCSGPGNALELSFRV 180
Query: 181 LEKLTSNENVNIIKDNMFLK 200
LEKLTS ENV II++NMFLK
Sbjct: 181 LEKLTSKENVKIIQNNMFLK 200
>gi|47568298|ref|ZP_00239000.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Bacillus cereus G9241]
gi|47554991|gb|EAL13340.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Bacillus cereus G9241]
Length = 193
Score = 134 bits (337), Expect = 3e-30, Method: Composition-based stats.
Identities = 76/197 (38%), Positives = 112/197 (56%), Gaps = 6/197 (3%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKKI + L +G E E + FTDV GWN G +V T+ + + CTW + E
Sbjct: 1 MKKILLLLADGFEAVEASVFTDVLGWNKWEGDGS---TEVVTVGLRNKLTCTWNFTVIPE 57
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K + + +++F D L IPGGF +A F++D ++ F +I++F+ K I +IC A +
Sbjct: 58 KTVDDIQLDEF---DALAIPGGFEEAGFYRDAYSREFSHVIQHFYAKQKPIASICVASLT 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L +S + GKK TTY + QLKN+ I + IV+D N+ T S PG A +++F L
Sbjct: 115 LGKSGILTGKKATTYSHPTSKRKEQLKNFGAIIQNDLIVQDGNIITSSNPGTAFDVAFLL 174
Query: 181 LEKLTSNENVNIIKDNM 197
LEKLTS +N +KD M
Sbjct: 175 LEKLTSKKNAEHVKDLM 191
>gi|71279213|ref|YP_267551.1| DJ-1/PfpI family protein [Colwellia psychrerythraea 34H]
gi|71144953|gb|AAZ25426.1| DJ-1/PfpI family protein [Colwellia psychrerythraea 34H]
Length = 207
Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats.
Identities = 73/197 (37%), Positives = 113/197 (57%), Gaps = 6/197 (3%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKKI + L G E E++ FTDV GW ++G + I++ ++ I+ T+G ++
Sbjct: 1 MKKIMMLLANGVEPLEMSVFTDVMGWATILGDEA---IELTDVALHTEIETTFGLTIKPS 57
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K++ + D DYD + IPGGF + F+ D ++ F K IKYF E K I ++C + I
Sbjct: 58 KMLQDI---DLADYDAIAIPGGFEPSGFYVDALSEPFIKAIKYFNEQGKTIASVCVSSIA 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L + + GKK TTY + QL+ I I+ IV+D ++ T +GPG A+E++F L
Sbjct: 115 LGNAGILTGKKATTYHQVGGKRKQQLEESGAIFIDRPIVQDQHIITSTGPGTAIEVAFSL 174
Query: 181 LEKLTSNENVNIIKDNM 197
LE++TS ENV I+ M
Sbjct: 175 LEQVTSAENVAEIRRKM 191
>gi|126652384|ref|ZP_01724557.1| DJ-1/PfpI family protein [Bacillus sp. B14905]
gi|126590805|gb|EAZ84919.1| DJ-1/PfpI family protein [Bacillus sp. B14905]
Length = 194
Score = 131 bits (329), Expect = 3e-29, Method: Composition-based stats.
Identities = 71/197 (36%), Positives = 111/197 (56%), Gaps = 6/197 (3%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKKI + L G E E + FTDV GWN G +V T+ ++CTW ++ E
Sbjct: 1 MKKIMLLLANGFEAVEASVFTDVLGWNKWEGDGS---TEVVTVGLHTQLQCTWNFKVAPE 57
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K++ + ++ DF D L IPGGF +A+F++D ++ F+ ++++F E K I IC A +
Sbjct: 58 KLLHDIDLADF---DALAIPGGFEEADFYEDAFSEEFQAVVRHFHEQQKPIATICVASLI 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L S + ++ TTY + QL++Y I + IV+ +++ T S PG A +++FRL
Sbjct: 115 LGHSGILHNRQATTYNHPTSKRLAQLESYGAIIVNGRIVQTDHIITSSNPGTAFDVAFRL 174
Query: 181 LEKLTSNENVNIIKDNM 197
LE LTS N +KD M
Sbjct: 175 LETLTSTTNTARVKDLM 191
>gi|90412401|ref|ZP_01220405.1| hypothetical intracellular protease/amidase [Photobacterium
profundum 3TCK]
gi|90326663|gb|EAS43062.1| hypothetical intracellular protease/amidase [Photobacterium
profundum 3TCK]
Length = 199
Score = 130 bits (326), Expect = 6e-29, Method: Composition-based stats.
Identities = 70/197 (35%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKK+ +FL +G E +E + FTD GW GL+ IK+ T+ + +KC W + E
Sbjct: 1 MKKVILFLCQGVEEYEASVFTDALGWTTTYGLEP---IKLVTVGLRSKVKCAWNFTIEPE 57
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
++E ++ F D L+IPGG +A F++D ++ LI+ F K+I ++C I
Sbjct: 58 CQLSEIDINHF---DALVIPGGMSRAGFYEDAYDERLLSLIRDFDSQGKLIASVCVGAIP 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
+ +S + G+ TTY L KR Q+K V I++ +V DNN+ T P A+ ++F +
Sbjct: 115 IAKSGVLNGRNGTTYHLSEKRQ-TQMKEMGVNIIQQPVVIDNNIITSRSPSAAMNVAFAV 173
Query: 181 LEKLTSNENVNIIKDNM 197
+EKLTS N+N IK+ M
Sbjct: 174 VEKLTSTANLNRIKEGM 190
>gi|42781363|ref|NP_978610.1| DJ-1/PfpI family [Bacillus cereus ATCC 10987]
gi|42737285|gb|AAS41218.1| DJ-1/PfpI family [Bacillus cereus ATCC 10987]
Length = 193
Score = 128 bits (321), Expect = 2e-28, Method: Composition-based stats.
Identities = 78/197 (39%), Positives = 111/197 (56%), Gaps = 6/197 (3%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKKI + L +G E E + FTDV GWN G +V T+ ++ + CTW + E
Sbjct: 1 MKKILLLLADGFEAVEASVFTDVLGWNKWEGDGS---TEVITVGLRDKLTCTWNFTIIPE 57
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K T DN++ ++D L IPGGF +A F++D +K F +I++F K I +IC A +
Sbjct: 58 K--TVDNIQ-LDEFDALAIPGGFEEAGFYRDAYSKEFLHVIQHFHVKQKPIASICVASLA 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L +S + GKK TTY QLKN+ + IV+D N+ T S PG A +++F L
Sbjct: 115 LGKSGILIGKKATTYSHPTSERKEQLKNFGAKVQNDLIVQDGNIITSSNPGTAFDVAFLL 174
Query: 181 LEKLTSNENVNIIKDNM 197
LEKLTS +N +KD M
Sbjct: 175 LEKLTSKQNAKHVKDLM 191
>gi|54302174|ref|YP_132167.1| hypothetical intracellular protease/amidase [Photobacterium
profundum SS9]
gi|46915595|emb|CAG22367.1| hypothetical intracellular protease/amidase [Photobacterium
profundum SS9]
Length = 198
Score = 125 bits (315), Expect = 1e-27, Method: Composition-based stats.
Identities = 68/197 (34%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKK+ +FL +G E +E + FTD GW GL+ I++ T+ + +KC W + E
Sbjct: 1 MKKVILFLCQGVEEYEASVFTDALGWTTTYGLEP---IELVTVGLRSKVKCAWNFTIEPE 57
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
++E + DF D L IPGG +A F++D ++ LI+ F +K+I ++C +
Sbjct: 58 FQLSEIDSNDF---DALAIPGGMSRAGFYEDAYDERLLSLIRDFDSQDKLIASVCVGALP 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
+ +S + G+ TTY L KR Q+K V I++ +V D N+ T P A++++F +
Sbjct: 115 IAKSGVLNGRNGTTYHLSEKRQ-AQMKEMGVNIIQQPVVIDKNIITSRSPSAAMDVAFTV 173
Query: 181 LEKLTSNENVNIIKDNM 197
+EKLTS N+N IK+ M
Sbjct: 174 VEKLTSTANLNRIKEGM 190
>gi|15896593|ref|NP_349942.1| Putative intracellular protease/amidase (ThiJ family) [Clostridium
acetobutylicum ATCC 824]
gi|15026433|gb|AAK81282.1|AE007832_3 Putative intracellular protease/amidase (ThiJ family) [Clostridium
acetobutylicum ATCC 824]
Length = 195
Score = 112 bits (279), Expect = 2e-23, Method: Composition-based stats.
Identities = 74/198 (37%), Positives = 106/198 (53%), Gaps = 7/198 (3%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKKI + L G E E + FTDV GWN + G V T + IKCTW + E
Sbjct: 1 MKKILLLLANGFEAVEASVFTDVLGWNMLEGDGS---TLVVTAGMHDKIKCTWNFTVLPE 57
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
I NV+DF + L+IPGGF +A FF D + F LI+ F KII ++C ++
Sbjct: 58 IKIKNVNVDDF---EALVIPGGFEEAGFFIDAYSNSFLDLIRTFNAKGKIIASVCVGALS 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEE-IVEDNNLFTCSGPGNALELSFR 179
+ +S ++G+ TTY L++++ +L + V +E + IV D N+ T P A ++F
Sbjct: 115 IGKSGILKGRTATTYNLNDRKRQYELSKFGVKILENQPIVIDKNVITSYNPSTAFNVAFT 174
Query: 180 LLEKLTSNENVNIIKDNM 197
LLE LTS EN +K M
Sbjct: 175 LLEMLTSTENCTKVKKLM 192
>gi|145953864|ref|ZP_01802872.1| hypothetical protein CdifQ_04003881 [Clostridium difficile
QCD-32g58]
Length = 194
Score = 108 bits (271), Expect = 1e-22, Method: Composition-based stats.
Identities = 72/199 (36%), Positives = 107/199 (53%), Gaps = 8/199 (4%)
Query: 3 KIAIFLFEGAELFEIASFTDVFGW-NNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ +FL +G E E + F DV GW N G DI V T +K+ + T+ ++ +K
Sbjct: 2 KVLVFLAKGFETMEFSVFVDVMGWARNDYG----HDIDVVTCGFKKQVMSTFNIQVLVDK 57
Query: 62 IITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINL 121
I E V+D YD L IPGGF + F+ + + F LI+ F KII +IC A + +
Sbjct: 58 TIEEVCVDD---YDALAIPGGFEEFGFYDEAYDSSFLNLIREFNSKEKIIASICVAALPV 114
Query: 122 LESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLL 181
+S ++ +K TTY L N + QL ++V + E IV D N+ T P A ++F+LL
Sbjct: 115 GKSGVLKNRKATTYHLKNGKRQRQLSEFDVNVVNEPIVVDKNIITSYCPETAPHVAFKLL 174
Query: 182 EKLTSNENVNIIKDNMFLK 200
E LTS E ++ +K M K
Sbjct: 175 EMLTSREQMDEVKLAMGFK 193
>gi|126700977|ref|YP_001089874.1| putative protease [Clostridium difficile 630]
gi|115252414|emb|CAJ70256.1| putative protease [Clostridium difficile 630]
Length = 194
Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats.
Identities = 72/199 (36%), Positives = 107/199 (53%), Gaps = 8/199 (4%)
Query: 3 KIAIFLFEGAELFEIASFTDVFGW-NNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ +FL +G E E + F DV GW N G DI V T +K+ + T+ ++ +K
Sbjct: 2 KVLVFLAKGFETMEFSVFVDVMGWARNDYG----HDIDVVTCGFKKQVMSTFNIQVLVDK 57
Query: 62 IITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINL 121
I E V+D YD L IPGGF + F+ + + F LI+ F KII +IC A + +
Sbjct: 58 TIEEVCVDD---YDALAIPGGFEEFGFYDEAYDSSFLNLIREFNSKEKIIASICVAALPV 114
Query: 122 LESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLL 181
+S ++ +K TTY L N + QL ++V + E IV D N+ T P A ++F+LL
Sbjct: 115 GKSGVLKNRKATTYHLKNGKRQRQLSEFDVNVVNEPIVVDKNIITSYCPETAPHVAFKLL 174
Query: 182 EKLTSNENVNIIKDNMFLK 200
E LTS E ++ +K M K
Sbjct: 175 EMLTSKEQMDEVKLVMGFK 193
>gi|153938279|ref|YP_001390376.1| DJ-1/PfpI family protein [Clostridium botulinum F str. Langeland]
gi|152934175|gb|ABS39673.1| DJ-1/PfpI family protein [Clostridium botulinum F str. Langeland]
Length = 194
Score = 106 bits (264), Expect = 9e-22, Method: Composition-based stats.
Identities = 69/198 (34%), Positives = 107/198 (54%), Gaps = 7/198 (3%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKK+ + L G E E + FTDV GWN + G I T+ +E +KCT+ + E
Sbjct: 1 MKKVLLLLANGFEAVEASVFTDVIGWNKLEGDGTTELI---TVGIREKLKCTFNFTVTPE 57
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
++E N+++F D L IPGGF +A F++D ++ F +I+ F + K I +IC +
Sbjct: 58 MHVSEVNIDEF---DALAIPGGFEEAGFYEDAYSEDFLNIIREFDKAEKTIASICVGALP 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEE-IVEDNNLFTCSGPGNALELSFR 179
+ +S + + TTY L N R NQL + + ++ +V D N+ T P A ++F+
Sbjct: 115 IGKSGVLVNRNATTYNLGNGRRQNQLSEFGANVLRDKPVVVDKNIITSYNPSTAFHVAFK 174
Query: 180 LLEKLTSNENVNIIKDNM 197
LLE LTS EN +K M
Sbjct: 175 LLELLTSKENCVNVKRLM 192
>gi|148379015|ref|YP_001253556.1| protease I [Clostridium botulinum A str. ATCC 3502]
gi|153932182|ref|YP_001383399.1| DJ-1/PfpI family protein [Clostridium botulinum A str. ATCC 19397]
gi|153937372|ref|YP_001386946.1| DJ-1/PfpI family protein [Clostridium botulinum A str. Hall]
gi|148288499|emb|CAL82578.1| putative protease I [Clostridium botulinum A str. ATCC 3502]
gi|152928226|gb|ABS33726.1| DJ-1/PfpI family protein [Clostridium botulinum A str. ATCC 19397]
gi|152933286|gb|ABS38785.1| DJ-1/PfpI family protein [Clostridium botulinum A str. Hall]
Length = 194
Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats.
Identities = 69/198 (34%), Positives = 106/198 (53%), Gaps = 7/198 (3%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKK+ + L G E E + FTDV GWN + G I T+ +E +KCT+ + E
Sbjct: 1 MKKVLLLLANGFEAVEASVFTDVIGWNKLEGDGTTELI---TVGIREKLKCTFNFTVTPE 57
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
++E N+++F D L IPGGF +A F++D ++ F +I+ F + K I +IC +
Sbjct: 58 MHVSEVNIDEF---DALAIPGGFEEAGFYEDAYSEDFLNIIREFDKARKTIASICVGALP 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEE-IVEDNNLFTCSGPGNALELSFR 179
+ +S + + TTY L N R QL + + +E +V D N+ T P A ++F+
Sbjct: 115 IGKSGVLVNRNATTYNLGNGRRQKQLSEFGANVLRDEPVVVDKNIITSYNPSTAFHVAFK 174
Query: 180 LLEKLTSNENVNIIKDNM 197
LLE LTS EN +K M
Sbjct: 175 LLELLTSKENCVNVKRLM 192
>gi|29347781|ref|NP_811284.1| putative protease/amidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29339682|gb|AAO77478.1| putative protease/amidase [Bacteroides thetaiotaomicron VPI-5482]
Length = 192
Score = 94.4 bits (233), Expect = 3e-18, Method: Composition-based stats.
Identities = 61/196 (31%), Positives = 103/196 (52%), Gaps = 8/196 (4%)
Query: 3 KIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFR-DIKVETISYKESIKCTWGGELRAEK 61
K+ +FL +G E E + F DV GW +F D++V T + E + ++ + +K
Sbjct: 2 KLLVFLAKGFETIEFSGFIDVMGWAKT----DFGCDVEVVTGGFNEKVISSFNIPVLVDK 57
Query: 62 IITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINL 121
I E +V++ YD L IPGGF F+++ + LI+ F K I +C + +
Sbjct: 58 TIDEISVDE---YDALAIPGGFEVFGFYEEAYEEKLLNLIRQFDARKKWIATVCVGALPV 114
Query: 122 LESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLL 181
+S ++ +K TTY L L+++ I + E IV D+N+ T GP A ++ LL
Sbjct: 115 GKSGVLKDRKATTYHLGGAVKQKVLQSFGAIIVHEPIVVDDNIITSYGPQTASGVALLLL 174
Query: 182 EKLTSNENVNIIKDNM 197
EKLTS+ ++++K+ M
Sbjct: 175 EKLTSHREMSLVKEAM 190
>gi|18311049|ref|NP_562983.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
protein [Clostridium perfringens str. 13]
gi|18145731|dbj|BAB81773.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
protein [Clostridium perfringens str. 13]
Length = 193
Score = 86.7 bits (213), Expect = 8e-16, Method: Composition-based stats.
Identities = 62/199 (31%), Positives = 101/199 (50%), Gaps = 17/199 (8%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKK+ +FL EG E E S DV V +++ ++ G + +
Sbjct: 3 MKKVLVFLAEGFETIEALSVVDVCNRAKVT-------CHACSLTENRTVNSAHGTMVLCD 55
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K+I+++++E YD +I+PGG A +D N+ + LIK + + NKI+ AIC+A I
Sbjct: 56 KLISDNDLET---YDAIILPGGMPGATNLRD--NERVQSLIKKYNKENKIVAAICAAPIA 110
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ I GKKVT+Y + +L N N + E+ +V D N+ T GP AL +
Sbjct: 111 LAKAGVIEGKKVTSY----PGFKEELGNVNYVE-EDTVVVDGNIITSRGPATALVFGLEI 165
Query: 181 LEKLTSNENVNIIKDNMFL 199
L+KL + I++ M +
Sbjct: 166 LKKLGYEKEAEEIREGMLI 184
>gi|110801657|ref|YP_699347.1| DJ-1 family protein [Clostridium perfringens SM101]
gi|110682158|gb|ABG85528.1| DJ-1 family protein [Clostridium perfringens SM101]
Length = 191
Score = 85.1 bits (209), Expect = 2e-15, Method: Composition-based stats.
Identities = 61/199 (30%), Positives = 100/199 (50%), Gaps = 17/199 (8%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKK+ +FL EG E E S DV V +++ ++ G + +
Sbjct: 1 MKKVLVFLAEGFETIEALSVVDVCNRAKVT-------CHACSLTENRTVNSAHGTMVLCD 53
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K+I+++++E YD +++PGG + +D N+ + LIK + E NKI+ AIC+A I
Sbjct: 54 KLISDNDLET---YDAIVLPGGMPGSTNLRD--NEKVQSLIKKYNEENKIVAAICAAPIA 108
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ I GKKVT+Y + +L N N + E+ +V D N T GP AL +
Sbjct: 109 LAKAGVIEGKKVTSY----PGFKEELGNVNYVE-EDTVVVDGNTITSRGPATALVFGLEI 163
Query: 181 LEKLTSNENVNIIKDNMFL 199
L+KL + I++ M +
Sbjct: 164 LKKLGYEKEAEEIREGMLI 182
>gi|110799272|ref|YP_696747.1| DJ-1 family protein [Clostridium perfringens ATCC 13124]
gi|110673919|gb|ABG82906.1| DJ-1 family protein [Clostridium perfringens ATCC 13124]
Length = 191
Score = 84.3 bits (207), Expect = 3e-15, Method: Composition-based stats.
Identities = 60/199 (30%), Positives = 101/199 (50%), Gaps = 17/199 (8%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKK+ +FL EG E E S DV V +++ ++ G + +
Sbjct: 1 MKKVLVFLAEGFETIEALSVVDVCNRAKVT-------CHACSLTENRTVNSAHGTMVLCD 53
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K+I+++++E YD +++PGG + +D N+ + LIK + + NKI+ AIC+A I
Sbjct: 54 KLISDNDLET---YDAIVLPGGMPGSTNLRD--NEKVQSLIKKYNKENKIVAAICAAPIA 108
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ I GKKVT+Y + +L N N + E+ +V D N+ T GP AL +
Sbjct: 109 LAKAGVIEGKKVTSY----PGFKEELGNVNYVE-EDTVVVDGNIITSRGPATALVFGLEI 163
Query: 181 LEKLTSNENVNIIKDNMFL 199
L+KL + I++ M +
Sbjct: 164 LKKLGYEKEAEEIREGMLI 182
>gi|66808601|ref|XP_638023.1| hypothetical protein DDBDRAFT_0218762 [Dictyostelium discoideum
AX4]
gi|60466464|gb|EAL64519.1| hypothetical protein DDBDRAFT_0218762 [Dictyostelium discoideum
AX4]
Length = 205
Score = 82.0 bits (201), Expect = 2e-14, Method: Composition-based stats.
Identities = 62/203 (30%), Positives = 107/203 (52%), Gaps = 9/203 (4%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFR-DIKVETIS-YKESIKCTWGGELRA 59
KKI + L +G E+ E F DV GW E + DI+V T Y + + T+G +++
Sbjct: 3 KKILLLLCKGFEVMEFTPFVDVMGWAREDDNNEDKADIQVVTCGLYNKMVTSTFGVKVQV 62
Query: 60 EKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVI 119
+ ++ E V+ ++D L IPGGF +F+++ ++ +LI+ F K I ++C A +
Sbjct: 63 DVLLGE-VVKSLDEFDALAIPGGFENYSFYEEAYSEDVSQLIRDFDSKGKHIASVCVAAL 121
Query: 120 NLLESTYIRGKKVTTY---LLDNKRYFNQLKNY--NVIPIEEEIVEDNNLFTCSGPGNAL 174
L +S ++G+ TTY L ++ QL+++ NVI ++ IV D N+ T P A
Sbjct: 122 ALGKSGILKGRNATTYRNSLREHSVRQQQLRDFGANVIA-DQSIVIDKNVITSYNPQTAP 180
Query: 175 ELSFRLLEKLTSNENVNIIKDNM 197
++F LL +L+ +K M
Sbjct: 181 YVAFELLSRLSDENKAKKVKTLM 203
>gi|150019134|ref|YP_001311388.1| DJ-1 family protein [Clostridium beijerinckii NCIMB 8052]
gi|149905599|gb|ABR36432.1| DJ-1 family protein [Clostridium beijerinckii NCIMB 8052]
Length = 183
Score = 72.8 bits (177), Expect = 1e-11, Method: Composition-based stats.
Identities = 56/201 (27%), Positives = 95/201 (47%), Gaps = 23/201 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKE-SIKCTWGGELRA 59
MKK+ + L EG E E + +D+ D+ + +S E +K + G ++A
Sbjct: 1 MKKVCVLLAEGFEEIEALTVSDII---------RRADVTCDLVSIAEKQVKSSHGVVVQA 51
Query: 60 EKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVI 119
+K+ E +YD+++IPGG A +D I K +K ++ K+I AIC+ I
Sbjct: 52 DKLFDEK-----MEYDLVVIPGGIPGATNLRDDERVI--KFVKKQNKDGKLIGAICAGPI 104
Query: 120 NLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFR 179
L + G+ +T+Y Y ++L N + E+ +V D N+ T GP A+ +++
Sbjct: 105 VLGRAGITEGRNITSY----PGYEDELPNCEYL--EDAVVVDKNIVTSRGPATAMAFAYK 158
Query: 180 LLEKLTSNENVNIIKDNMFLK 200
LL+ L V I M K
Sbjct: 159 LLDILGYGNKVESISSGMLYK 179
>gi|34764182|ref|ZP_00145046.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
gi|27886050|gb|EAA23362.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
Length = 39
Score = 71.6 bits (174), Expect = 3e-11, Method: Composition-based stats.
Identities = 36/39 (92%), Positives = 39/39 (100%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIK 39
MKKIA+FLFEGAELFEIA+FTD+FGWNNVVGLKEFRDIK
Sbjct: 1 MKKIAVFLFEGAELFEIATFTDIFGWNNVVGLKEFRDIK 39
>gi|15894907|ref|NP_348256.1| Putative intracellular protease/amidase, ThiJ family [Clostridium
acetobutylicum ATCC 824]
gi|15024587|gb|AAK79596.1|AE007672_3 Putative intracellular protease/amidase, ThiJ family [Clostridium
acetobutylicum ATCC 824]
Length = 188
Score = 71.2 bits (173), Expect = 3e-11, Method: Composition-based stats.
Identities = 59/199 (29%), Positives = 97/199 (48%), Gaps = 21/199 (10%)
Query: 3 KIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEKI 62
KI +FL EG E E + D+ +++ + ++ + + IK A+K
Sbjct: 2 KIVLFLAEGFEEVEALTVVDILRRADIIC--DMCSLEAKEVVGAHKIKVC------ADKT 53
Query: 63 ITEDNVEDFFDYDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINL 121
I + D +YD L++PGG G N +N++ +K F + KI+ AIC+A I L
Sbjct: 54 IEDI---DIAEYDGLVLPGGMPGAENL---RNSEFVINAVKKFNKEKKIVAAICAAPIVL 107
Query: 122 LESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLL 181
++ + G+ T+Y Y +++ N N + E+ V+D N+ T GP A+ RL+
Sbjct: 108 GKAEVLEGRDATSY----PGYGDEMGNCNYL--EKITVKDGNILTSRGPATAIYFGLRLV 161
Query: 182 EKLTSNENVNIIKDNMFLK 200
E L E N +KD M LK
Sbjct: 162 EILKGKEVANGLKDGMMLK 180
>gi|153939756|ref|YP_001392047.1| DJ-1 family protein [Clostridium botulinum F str. Langeland]
gi|152935652|gb|ABS41150.1| DJ-1 family protein [Clostridium botulinum F str. Langeland]
Length = 183
Score = 70.1 bits (170), Expect = 8e-11, Method: Composition-based stats.
Identities = 59/198 (29%), Positives = 97/198 (48%), Gaps = 18/198 (9%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M K+ +F+ EG E E + DV N+ + +I+ + +K + +
Sbjct: 1 MTKVLVFIAEGFEEIEALTVVDVLRRANI-------RCDMCSITSNKEVKGAHNILVNVD 53
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K + E D Y+ L+IPGG A +D NNK+ L+K F + K+I AIC+ I
Sbjct: 54 KTLEEIKTND---YNSLVIPGGMPGAANLRD-NNKVIN-LVKEFNRDEKLIAAICAGPIV 108
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ I+GK+VT+Y + LK I E+ +V+D N+ T GP A+ +F++
Sbjct: 109 LSKANIIKGKEVTSY----PGFEEDLK--ECIYKEDLVVQDGNIITSRGPSAAMYFAFKI 162
Query: 181 LEKLTSNENVNIIKDNMF 198
LE + I +D +F
Sbjct: 163 LENFKKDSAKEIKEDMLF 180
>gi|148380727|ref|YP_001255268.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Clostridium botulinum A str. ATCC 3502]
gi|153932262|ref|YP_001385011.1| DJ-1 family protein [Clostridium botulinum A str. ATCC 19397]
gi|153935560|ref|YP_001388481.1| DJ-1 family protein [Clostridium botulinum A str. Hall]
gi|148290211|emb|CAL84330.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Clostridium botulinum A str. ATCC 3502]
gi|152928306|gb|ABS33806.1| DJ-1 family protein [Clostridium botulinum A str. ATCC 19397]
gi|152931474|gb|ABS36973.1| DJ-1 family protein [Clostridium botulinum A str. Hall]
Length = 183
Score = 69.7 bits (169), Expect = 8e-11, Method: Composition-based stats.
Identities = 61/199 (30%), Positives = 99/199 (49%), Gaps = 19/199 (9%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M K+ +F+ EG E E + DV N+ + +I+ + +K + +
Sbjct: 1 MTKVLVFIAEGFEEIEALTVVDVLRRANI-------RCDMCSITSNKEVKGAHNILVNVD 53
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K + E D Y L+IPGG A +D NNK+ L+K F + K+I AIC+ I
Sbjct: 54 KTLEEIKTND---YSSLVIPGGMPGAANLRD-NNKVIN-LVKEFNRDEKLIAAICAGPIV 108
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ I+GK+VT+Y + LK I E+ +V+D N+ T GP A+ +F++
Sbjct: 109 LSKANIIKGKEVTSY----PGFEEDLKEG--IYKEDLVVQDGNIITSRGPSAAMYFAFKI 162
Query: 181 LEKLTSNENVNIIKDNMFL 199
LE L ++ IK++M L
Sbjct: 163 LENL-KKDSAKGIKEDMLL 180
>gi|53715247|ref|YP_101239.1| putative ThiJ family intracellular protease [Bacteroides fragilis
YCH46]
gi|60683181|ref|YP_213325.1| putative thiamine biosynthesis related protein [Bacteroides
fragilis NCTC 9343]
gi|52218112|dbj|BAD50705.1| putative ThiJ family intracellular protease [Bacteroides fragilis
YCH46]
gi|60494615|emb|CAH09416.1| putative thiamine biosynthesis related protein [Bacteroides
fragilis NCTC 9343]
Length = 183
Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats.
Identities = 47/142 (33%), Positives = 76/142 (53%), Gaps = 12/142 (8%)
Query: 62 IITEDNVE--DFFDYDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAV 118
++ + N E DFFD ++L +PGG G A K + +KLI F E NK I AIC+A
Sbjct: 50 VLCDKNFENCDFFDAELLFLPGGMPGAATLDKHEG---LRKLILSFAEKNKPIAAICAAP 106
Query: 119 INLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSF 178
+ L + ++G++VT Y ++Y + N E +V D N+ T GPG A+E +
Sbjct: 107 MVLGKLGLLKGRRVTCYP-SFEQYLDGADCTN-----EPVVRDGNIITGMGPGAAMEFAL 160
Query: 179 RLLEKLTSNENVNIIKDNMFLK 200
+++ L E VN + + M ++
Sbjct: 161 TIVDTLLGKEKVNELVEAMCVR 182
>gi|156546896|ref|XP_001599104.1| PREDICTED: similar to GH09983p [Nasonia vitripennis]
Length = 188
Score = 69.3 bits (168), Expect = 1e-10, Method: Composition-based stats.
Identities = 64/204 (31%), Positives = 95/204 (46%), Gaps = 26/204 (12%)
Query: 2 KKIAI-FLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
KK A+ L EGAE E D+ V + V +I+ KE IKC+ R
Sbjct: 3 KKTAVCLLAEGAEEMEAIVTVDILRRAGV-------SVTVASITDKECIKCS-----RDV 50
Query: 61 KIITEDNVEDF--FDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAV 118
KI T+ + D YD +I+PGG G N +++K +K+I AIC+A
Sbjct: 51 KICTDAKIGDIEGQKYDAVILPGGVGWKNLAASAR---VGEILKAQESESKVIAAICAAP 107
Query: 119 INLLESTYI-RGKKVTTYLLDNKRYFNQL-KNYNVIPIEEEIVEDNNLFTCSGPGNALEL 176
N+L++ I +GKK+T+Y N L +Y+ I ++ +V D NL T GP A
Sbjct: 108 -NVLKAHGIAKGKKITSY----PSVKNDLTSDYSYID-DQIVVTDGNLITSKGPATAYAF 161
Query: 177 SFRLLEKLTSNENVNIIKDNMFLK 200
++EKL E + D + K
Sbjct: 162 GLAIVEKLVDKETAQKVADGLLYK 185
>gi|153954760|ref|YP_001395525.1| hypothetical protein CKL_2142 [Clostridium kluyveri DSM 555]
gi|146347618|gb|EDK34154.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
Length = 186
Score = 68.9 bits (167), Expect = 1e-10, Method: Composition-based stats.
Identities = 69/199 (34%), Positives = 106/199 (53%), Gaps = 18/199 (9%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MKK I L EG E E + DV N+ D ++ +IS KE +K G ++AE
Sbjct: 1 MKKAIILLAEGFEEIEALTCVDVLRRGNI-------DCRICSISGKEDVKGAHGVVVKAE 53
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
++ N ED +Y+ +I+PGG A +D N K+ + ++K F NKII AIC+A I
Sbjct: 54 ILLQNIN-ED--EYEAIILPGGMPGAVNLRD-NEKVIE-IVKKFDRENKIIAAICAAPIV 108
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ I +KVT+Y + +L+ YN EE +V++ NL T GP A + ++
Sbjct: 109 LKKAGIIYNRKVTSY----PGFEEELQAYNY--SEEIVVQERNLITSRGPATAPYFALKV 162
Query: 181 LEKLTSNENVNIIKDNMFL 199
LE L+ E V ++ +M L
Sbjct: 163 LENLSGTEGVENLRKDMLL 181
>gi|15902757|ref|NP_358307.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein [Streptococcus pneumoniae R6]
gi|116516953|ref|YP_816201.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae D39]
gi|148990417|ref|ZP_01821583.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP6-BS73]
gi|149021674|ref|ZP_01835705.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP23-BS72]
gi|15458304|gb|AAK99517.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein [Streptococcus pneumoniae R6]
gi|116077529|gb|ABJ55249.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae D39]
gi|147924322|gb|EDK75415.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP6-BS73]
gi|147930135|gb|EDK81121.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP23-BS72]
Length = 184
Score = 66.6 bits (161), Expect = 7e-10, Method: Composition-based stats.
Identities = 47/184 (25%), Positives = 90/184 (48%), Gaps = 21/184 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M K+A+ L +G E E + DV N I + + ++E + + ++RA+
Sbjct: 1 MVKVAVMLAQGFEEIEALTVVDVLRRAN---------ITCDMVGFEEQVTGSHAIQVRAD 51
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ D DYD++++PGG + +D N+ + ++ F + K + AIC+A I
Sbjct: 52 HVFDGD----LSDYDMIVLPGGMPGSAHLRD--NQTLIQELQSFEQEGKKLAAICAAPIA 105
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ ++ K+ T Y ++ + +Y ++E +V D L T GP AL ++ L
Sbjct: 106 LNQAEILKNKRYTCYDGVQEQILD--GHY----VKETVVVDGQLTTSRGPSTALAFAYEL 159
Query: 181 LEKL 184
+E+L
Sbjct: 160 VEQL 163
>gi|149003439|ref|ZP_01828328.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP14-BS69]
gi|149010563|ref|ZP_01831934.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP19-BS75]
gi|147758622|gb|EDK65620.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP14-BS69]
gi|147765044|gb|EDK71973.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP19-BS75]
Length = 184
Score = 66.6 bits (161), Expect = 7e-10, Method: Composition-based stats.
Identities = 47/184 (25%), Positives = 90/184 (48%), Gaps = 21/184 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M K+A+ L +G E E + DV N I + + ++E + + ++RA+
Sbjct: 1 MVKVAVMLAQGFEEIEALTVVDVLRRAN---------ITCDMVGFEEQVTGSHAIQVRAD 51
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ D DYD++++PGG + +D N+ + ++ F + K + AIC+A I
Sbjct: 52 HVFDGD----LSDYDMIVLPGGMPGSAHLRD--NQTLIQELQSFEQEGKKLAAICAAPIA 105
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ ++ K+ T Y ++ + +Y ++E +V D L T GP AL ++ L
Sbjct: 106 LNQAEILKNKRYTCYDGVQEQILD--GHY----VKETVVVDGQLTTSRGPSTALAFAYEL 159
Query: 181 LEKL 184
+E+L
Sbjct: 160 VEQL 163
>gi|15900697|ref|NP_345301.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae TIGR4]
gi|111658145|ref|ZP_01408842.1| hypothetical protein SpneT_02000670 [Streptococcus pneumoniae
TIGR4]
gi|14972281|gb|AAK74941.1| putative 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate
biosynthesis protein [Streptococcus pneumoniae TIGR4]
Length = 184
Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats.
Identities = 47/184 (25%), Positives = 90/184 (48%), Gaps = 21/184 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M K+A+ L +G E E + DV N I + + ++E + + ++RA+
Sbjct: 1 MVKVAVILAQGFEEIEALTVVDVLRRAN---------ITCDMVGFEEQVTGSHAIQVRAD 51
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ D DYD++++PGG + +D N+ + ++ F + K + AIC+A I
Sbjct: 52 HVFDGD----LSDYDMIVLPGGMPGSAHLRD--NQTLIQELQSFEQEGKKLAAICAAPIA 105
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ ++ K+ T Y ++ + +Y ++E +V D L T GP AL ++ L
Sbjct: 106 LNQAEILKNKRYTCYDGVQEQILD--GHY----VKETVVVDGQLTTSRGPSTALAFAYEL 159
Query: 181 LEKL 184
+E+L
Sbjct: 160 VEQL 163
>gi|148997116|ref|ZP_01824770.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP11-BS70]
gi|149007690|ref|ZP_01831307.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP18-BS74]
gi|147756816|gb|EDK63856.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP11-BS70]
gi|147760845|gb|EDK67816.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP18-BS74]
Length = 184
Score = 66.6 bits (161), Expect = 8e-10, Method: Composition-based stats.
Identities = 47/184 (25%), Positives = 90/184 (48%), Gaps = 21/184 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M K+A+ L +G E E + DV N I + + ++E + + ++RA+
Sbjct: 1 MVKVAVILAQGFEEIEALTVVDVLRRAN---------ITCDMVGFEEQVTGSHAIQVRAD 51
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ D DYD++++PGG + +D N+ + ++ F + K + AIC+A I
Sbjct: 52 HVFDGD----LSDYDMIVLPGGMPGSAHLRD--NQTLIQELQSFEQEGKKLAAICAAPIA 105
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ ++ K+ T Y ++ + +Y ++E +V D L T GP AL ++ L
Sbjct: 106 LNQAEILKNKRYTCYDGVQEQILD--GHY----VKETVVVDGQLTTSRGPSTALAFAYEL 159
Query: 181 LEKL 184
+E+L
Sbjct: 160 VEQL 163
>gi|67484652|ref|XP_657546.1| 4-methyl-5(B-hydroxyethyl)-thiazol monophosphate biosynthesis
enzyme [Entamoeba histolytica HM-1:IMSS]
Length = 184
Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats.
Identities = 49/198 (24%), Positives = 90/198 (45%), Gaps = 20/198 (10%)
Query: 3 KIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEKI 62
K + + G+E E + D+ + + TI+ C+ G ++ A+K
Sbjct: 2 KALVVIANGSEELEAVTIIDILARAKI-------QVTTATINSNLETACSRGVKIMADKF 54
Query: 63 ITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLL 122
++E N + YDV+ IPGG A+ +++ + IK N+ + AIC++ +L
Sbjct: 55 LSECNEQ----YDVIAIPGGLPGADNLA--GSQLLIQKIKEQLAANRFVAAICASPAIVL 108
Query: 123 EST-YIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLL 181
E I G+K T Y + NQ + + +V DN+L T PG+A+E S ++
Sbjct: 109 EGNGIIEGRKCTAYPSFQPKLANQ------SAVHQRVVVDNHLITSQAPGSAIEFSLEII 162
Query: 182 EKLTSNENVNIIKDNMFL 199
+L E + ++ + L
Sbjct: 163 RQLKGEEAMREVEKPLVL 180
>gi|66531474|ref|XP_624271.1| PREDICTED: similar to dj-1 CG1349-PA [Apis mellifera]
Length = 186
Score = 65.9 bits (159), Expect = 1e-09, Method: Composition-based stats.
Identities = 56/206 (27%), Positives = 89/206 (43%), Gaps = 34/206 (16%)
Query: 2 KKIAIFLF-EGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
KK AI L +G+E E TD+ V D+ V ++ + C+ R
Sbjct: 3 KKTAILLIADGSEEMEAVITTDILRRAGV-------DVTVAGLTENPYVNCS-----RNV 50
Query: 61 KIITEDNVEDFFD--YDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAV 118
KI + ++D + YDV+I+PGG + F KL++ E N+ I AIC+A
Sbjct: 51 KIHVDAKLQDVINQKYDVVILPGGLDGSKAFASSAE--VGKLLQRQQEENRFIAAICAAP 108
Query: 119 INLLESTYIRGKKVTTY------LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
L +GK++T+Y L+D +Y +E ++V D+NL T GP
Sbjct: 109 TALKAHGIAKGKQITSYPAMKDQLVDYYKY-----------LENKVVIDDNLITSRGPAT 157
Query: 173 ALELSFRLLEKLTSNENVNIIKDNMF 198
A + EKL + + + M
Sbjct: 158 AFAFGLAIAEKLIDKQTADNVAQAML 183
>gi|148985843|ref|ZP_01818937.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP3-BS71]
gi|148992474|ref|ZP_01822169.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP9-BS68]
gi|147921989|gb|EDK73113.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP3-BS71]
gi|147928791|gb|EDK79804.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Streptococcus pneumoniae SP9-BS68]
Length = 184
Score = 65.1 bits (157), Expect = 3e-09, Method: Composition-based stats.
Identities = 47/184 (25%), Positives = 92/184 (50%), Gaps = 21/184 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M K+A+ L +G E E + DV N I + + ++E + + ++RA+
Sbjct: 1 MVKVAVMLAQGFEEIEALTVVDVLRRAN---------ITCDMVGFEEQVTGSHAIQVRAD 51
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ + N+ D YD++++PGG + +D N+ + ++ F + K + AIC+A I
Sbjct: 52 HVF-DGNLSD---YDMIVLPGGMPGSAHLRD--NQTLIQELQSFEQEGKKLAAICAAPIA 105
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L ++ ++ K+ T Y ++ + +Y ++E +V D L T GP AL ++ L
Sbjct: 106 LNQAEILKNKRYTCYDGVQEQILD--GHY----VKETVVVDGQLTTSRGPSTALAFAYEL 159
Query: 181 LEKL 184
+E+L
Sbjct: 160 VEQL 163
>gi|28211406|ref|NP_782350.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Clostridium tetani E88]
gi|28203847|gb|AAO36287.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Clostridium tetani E88]
Length = 188
Score = 64.7 bits (156), Expect = 3e-09, Method: Composition-based stats.
Identities = 55/200 (27%), Positives = 103/200 (51%), Gaps = 22/200 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
+KKI + L +G E E + D+ VG+ D K +I+ ++ +K ++++
Sbjct: 7 VKKIVVMLAKGFEEIEALTVVDIL---RRVGV----DCKTCSITEEKMVKGAHNIYVKSD 59
Query: 61 KIITEDNVEDFFDYDV--LIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAV 118
++ +DF +Y +++PGG A +D NK +IK F + NK+I AIC+A
Sbjct: 60 TLL-----KDFKEYGFSGIVLPGGMPGATNLRD--NKEVIGIIKEFNDENKLIAAICAAP 112
Query: 119 INLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSF 178
I L E+ + K +T+Y + +LK N E+++V+ N+ T GP A++ +F
Sbjct: 113 IVLKEADIVENKNITSY----PGFEEELKGSNY--KEDKVVQHGNIITSRGPSTAIDFTF 166
Query: 179 RLLEKLTSNENVNIIKDNMF 198
++LE + + + +K +M
Sbjct: 167 KILENIIDEKELEELKKSML 186
>gi|28571932|ref|NP_651825.3| dj-1beta CG1349-PA [Drosophila melanogaster]
gi|16767998|gb|AAL28218.1| GH09983p [Drosophila melanogaster]
gi|18642508|dbj|BAB84672.1| DJ-1 beta [Drosophila melanogaster]
gi|28381503|gb|AAF57086.2| CG1349-PA [Drosophila melanogaster]
Length = 205
Score = 63.2 bits (152), Expect = 9e-09, Method: Composition-based stats.
Identities = 50/198 (25%), Positives = 92/198 (46%), Gaps = 16/198 (8%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K + L GAE E DV G+K + V ++ E++KC+ ++ +
Sbjct: 21 KSALVILAPGAEEMEFIIAADVL---RRAGIK----VTVAGLNGGEAVKCSRDVQILPDT 73
Query: 62 IITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINL 121
+ + + F DV+++PGG G +N + + + L++ +I AIC+A L
Sbjct: 74 SLAQVASDKF---DVVVLPGGLGGSNAMGESS--LVGDLLRSQESGGGLIAAICAAPTVL 128
Query: 122 LESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLL 181
+ GK +T+Y + N NY+ + ++ +V+D NL T GPG A E + ++
Sbjct: 129 AKHGVASGKSLTSYPSMKPQLVN---NYSYVD-DKTVVKDGNLITSRGPGTAYEFALKIA 184
Query: 182 EKLTSNENVNIIKDNMFL 199
E+L E V + + +
Sbjct: 185 EELAGKEKVQEVAKGLLV 202
>gi|156719857|ref|ZP_02061463.1| DJ-1 family protein [Hydrogenobaculum sp. Y04AAS1]
gi|156709878|gb|EDO50281.1| DJ-1 family protein [Hydrogenobaculum sp. Y04AAS1]
Length = 183
Score = 62.0 bits (149), Expect = 2e-08, Method: Composition-based stats.
Identities = 57/200 (28%), Positives = 87/200 (43%), Gaps = 20/200 (10%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M K+A+ L G E E + D+ V L +K + I ++K E
Sbjct: 1 MAKVAVLLAPGFEEVEAIAPIDILRRGGVEVL--IVGVKDKVIPSARNVKI--------E 50
Query: 61 KIITEDNVEDFFDYDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVI 119
+T D ++D + D++IIPGG G N K ++ K LI K + AIC+ +
Sbjct: 51 VDVTIDELKDVDNLDMIIIPGGMIGVENL---KKSEEVKNLINQMNAKKKYVSAICAGPL 107
Query: 120 NLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFR 179
L + + K +T++ + L EE +VED N+ + GP A+ FR
Sbjct: 108 VLKNAGVVENKHITSHPSVKLEFNEHLYK------EESVVEDENIISSRGPATAMVFGFR 161
Query: 180 LLEKLTSNENVNIIKDNMFL 199
LLEKLTS E + M
Sbjct: 162 LLEKLTSKEKAKEVAKAMLF 181
>gi|157104409|ref|XP_001648396.1| dj-1 protein (park7) [Aedes aegypti]
gi|108880380|gb|EAT44605.1| dj-1 protein (park7) [Aedes aegypti]
Length = 186
Score = 62.0 bits (149), Expect = 2e-08, Method: Composition-based stats.
Identities = 53/201 (26%), Positives = 86/201 (42%), Gaps = 25/201 (12%)
Query: 2 KKIAIFLFEGAELFEIASFTDVF---GWN-NVVGLKEFRDIKVETISYKESIKCTWGGEL 57
KK+ + L GAE E DV G N V GL + +++KC+ +
Sbjct: 3 KKLLMLLPHGAEEMEFVICVDVLRRCGVNVTVAGLTD------------KTVKCSRDVVI 50
Query: 58 RAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
+A+ + E EDF D + +PGG G + +++K F K+I AIC+A
Sbjct: 51 KADTTLEEAANEDF---DAIALPGGLGGSKAMSGSTK--LGEVLKSFESKGKLITAICAA 105
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL + GK +T+Y + ++ ++ +V D NL T GPG A + +
Sbjct: 106 PTVLLTHSVALGKTLTSY----PSFKDEFAGKYTYVEDKTVVVDGNLVTSRGPGTAFDFA 161
Query: 178 FRLLEKLTSNENVNIIKDNMF 198
+L E L + + M
Sbjct: 162 LKLGEILVGLDKTKQVAKGML 182
>gi|29349330|ref|NP_812833.1| putative ThiJ family intracellular protease/amidase [Bacteroides
thetaiotaomicron VPI-5482]
gi|29341238|gb|AAO79027.1| putative ThiJ family intracellular protease/amidase [Bacteroides
thetaiotaomicron VPI-5482]
Length = 183
Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats.
Identities = 44/139 (31%), Positives = 71/139 (51%), Gaps = 11/139 (7%)
Query: 63 ITEDNVEDFFDYDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINL 121
I DN DFFD D+L++PGG G A K + +KL+ F K I AIC+A + L
Sbjct: 54 INFDNC-DFFDADLLLLPGGMPGAATLDKHEG---LRKLLLDFAAKGKPIAAICAAPMVL 109
Query: 122 LESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLL 181
+ ++G+K T Y + L+ + E +V D N+ T GPG A+E + ++
Sbjct: 110 GKLGLLKGRKATCY----PSFEQYLEGAECV--SEPVVRDGNIITGMGPGAAMEFALAIV 163
Query: 182 EKLTSNENVNIIKDNMFLK 200
+ L + V+ + + M +K
Sbjct: 164 DLLVGKDKVDELVEAMCVK 182
>gi|126330567|ref|XP_001362447.1| PREDICTED: similar to CAP1 protein [Monodelphis domestica]
Length = 189
Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats.
Identities = 55/203 (27%), Positives = 94/203 (46%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
KK + L +GAE E D+ G+K + + +S K+ ++C+ R
Sbjct: 4 KKALVILAKGAEEMETVIPVDLM---RRAGIK----VVLAGLSGKDPVQCS-----RDVF 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I ++++ED YDV+++PGG G N + + + K L+K +N +I A+C+
Sbjct: 52 ICPDESLEDAKKQGPYDVIVLPGGNLGAQNLCE---SPVVKTLLKEQEKNKGLIAAVCAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y E + +D N+ T GPG + E
Sbjct: 109 PTALLAHEIGFGSKVTTHPLAKDKMMNG-SHYTYT--ESRVEKDGNILTSRGPGTSFEFG 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++ +L V+ +K + LK
Sbjct: 166 LAIIAELMGKSVVDQVKGPLVLK 188
>gi|153813466|ref|ZP_01966134.1| hypothetical protein RUMOBE_03886 [Ruminococcus obeum ATCC 29174]
gi|149830410|gb|EDM85502.1| hypothetical protein RUMOBE_03886 [Ruminococcus obeum ATCC 29174]
Length = 185
Score = 60.8 bits (146), Expect = 4e-08, Method: Composition-based stats.
Identities = 53/204 (25%), Positives = 90/204 (44%), Gaps = 34/204 (16%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKES--IKCTWGGELRA 59
KK+ IFL +G F D+ G VV L DI ++T+S KES I + G +
Sbjct: 3 KKVYIFLADG--------FEDIEGLT-VVDLMRRADIDIKTVSIKESKEITTSHGISMLT 53
Query: 60 EKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVI 119
+ + E DF D D+L++PGG + + + + L+ F+ + AIC+A
Sbjct: 54 DLVFVE---TDFSDADMLVLPGGMPGTKYLNEYQS--LRDLLADFYRKGGKVAAICAAPT 108
Query: 120 NLLESTYIRGKKVTTY------LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNA 173
++ G+K T Y L +R E +V D N+ T G G A
Sbjct: 109 VFASLGFLEGRKATAYPSCMDGLAGAERSL------------ESVVVDGNVTTSRGLGTA 156
Query: 174 LELSFRLLEKLTSNENVNIIKDNM 197
++ + L+ +L + + I +++
Sbjct: 157 VDFALSLIGQLLGEKKADEIAESV 180
>gi|146302026|ref|YP_001196617.1| intracellular protease, PfpI family [Flavobacterium johnsoniae
UW101]
gi|146156444|gb|ABQ07298.1| intracellular protease, PfpI family [Flavobacterium johnsoniae
UW101]
Length = 182
Score = 60.8 bits (146), Expect = 4e-08, Method: Composition-based stats.
Identities = 53/187 (28%), Positives = 90/187 (48%), Gaps = 21/187 (11%)
Query: 2 KKIAIFLFEGAELFEIAS---FTDVFGWN-NVVGLKEFRDIKVETISYKESIKCTWGGEL 57
K IAI G E E+AS + + GWN ++V LK IK S+K+ W E
Sbjct: 3 KNIAILATNGFEESELASPKAYLEEQGWNADIVSLKS-GTIK----SWKDG---NWSKEY 54
Query: 58 RAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
+ ++ + N D YD L++PGG + + + + ++ FFE+ K + AIC
Sbjct: 55 NVDVVLDQANEAD---YDALVLPGGVINPDLLRREETAV--NFVRSFFESKKPVAAICHG 109
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
L+++ + G+KVT++ N LKN + E+V DN L T P + +
Sbjct: 110 PQILVDADVLEGRKVTSFF----SVKNDLKNAGAQWEDSEVVVDNGLVTSRNPNDLPAFN 165
Query: 178 FRLLEKL 184
+++E++
Sbjct: 166 KKMVEEI 172
>gi|62752059|ref|NP_001015851.1| MGC108042 protein [Xenopus tropicalis]
gi|62859573|ref|NP_001017109.1| Parkinson disease (autosomal recessive, early onset) 7 [Xenopus
tropicalis]
gi|60422832|gb|AAH90355.1| MGC108042 protein [Xenopus tropicalis]
gi|89270947|emb|CAJ81253.1| Parkinson disease (autosomal recessive [Xenopus tropicalis]
Length = 189
Score = 60.1 bits (144), Expect = 7e-08, Method: Composition-based stats.
Identities = 53/200 (26%), Positives = 89/200 (44%), Gaps = 16/200 (8%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + + +S K+ + C+ L +
Sbjct: 4 KRALLILAKGAEEMETVIPADVM---RRAGIK----VTIAGLSGKDPVLCSRDVVLCPDT 56
Query: 62 IITEDNVEDFFDYDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ E + YDV+++PGG G N + + K+++K N +I AIC+
Sbjct: 57 SLEEARTQG--PYDVVVLPGGNLGAQNL---SESPVVKEVLKEQEAKNGLIAAICAGPTA 111
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
L GK +TT+ L + N +Y EE +V+D N T GPG + E + +
Sbjct: 112 LTVHGVGIGKTITTHPLAKDKIVNA-DHYKYS--EERVVKDGNFITSRGPGTSFEFALMI 168
Query: 181 LEKLTSNENVNIIKDNMFLK 200
+ L E + +K + LK
Sbjct: 169 VSTLVGKEVADQVKSPLLLK 188
>gi|147900143|ref|NP_001083896.1| SP22 [Xenopus laevis]
gi|46329781|gb|AAH68860.1| Park7 protein [Xenopus laevis]
Length = 189
Score = 60.1 bits (144), Expect = 7e-08, Method: Composition-based stats.
Identities = 56/202 (27%), Positives = 91/202 (45%), Gaps = 20/202 (9%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E TDV G+K + V +S K+ ++C+ L +
Sbjct: 4 KRALVILAKGAEEMETVIPTDVM---RRAGIK----VTVAGLSGKDPVQCSRDVMLCPDT 56
Query: 62 IITEDNVEDFFDYDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ E + YDV+++PGG G N + + K+++K +I AIC+
Sbjct: 57 SLEEARTQG--PYDVVVLPGGNLGAQNL---SESPVVKEVLKEQEAKKGLIAAICAGPTA 111
Query: 121 LLESTYIRGKKVTTYLLDNKRYFN--QLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSF 178
L GK +TT+ L + N Q K Y+ EE +V+D N T GPG + E +
Sbjct: 112 LTVHGVGIGKTITTHPLAKDKIVNPDQYK-YS----EERVVKDENFITSRGPGTSFEFAL 166
Query: 179 RLLEKLTSNENVNIIKDNMFLK 200
++ L E +K + LK
Sbjct: 167 EIVCTLLGKEVAEQVKTPLLLK 188
>gi|90422501|ref|YP_530871.1| Peptidase C56, PfpI [Rhodopseudomonas palustris BisB18]
gi|90104515|gb|ABD86552.1| Peptidase C56, PfpI [Rhodopseudomonas palustris BisB18]
Length = 187
Score = 60.1 bits (144), Expect = 7e-08, Method: Composition-based stats.
Identities = 34/132 (25%), Positives = 66/132 (50%), Gaps = 9/132 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG ++ +K ++ +D YD L++PGG + + + KLIK F++ K++
Sbjct: 55 WGRPVKVDKALSAAKADD---YDALVLPGGQINPDLLRVNAEAL--KLIKAFYDGGKVVA 109
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
A+C A L+E+ +GKK+T+Y N + + +V D + T PG+
Sbjct: 110 AVCHAPWLLIETGIAKGKKMTSYHAIKTDVINAGAQWE----DSSVVTDQGVITSRNPGD 165
Query: 173 ALELSFRLLEKL 184
+ S +++E++
Sbjct: 166 LEDFSNKIIEEI 177
>gi|89210012|ref|ZP_01188405.1| DJ-1 [Halothermothrix orenii H 168]
gi|89160352|gb|EAR80007.1| DJ-1 [Halothermothrix orenii H 168]
Length = 181
Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats.
Identities = 60/199 (30%), Positives = 94/199 (47%), Gaps = 20/199 (10%)
Query: 3 KIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEKI 62
KI I L EG E E + DV I+V T S ES + +++
Sbjct: 2 KILIPLAEGFEEIEAITSIDVL---------RRAGIEVITSSLTESTEVMGSHDVKVTAD 52
Query: 63 ITEDNVEDFFDYDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINL 121
T D V + D +++PGG G AN K++ KLIK + + +I AIC+A I L
Sbjct: 53 TTLDKVS-VDNLDGILLPGGMPGSANL---KDDIRIIKLIKRLNKKSGLIAAICAAPIVL 108
Query: 122 LESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLL 181
++ I+ K+ T+Y +K + NY E +V D N+ T GPG A+E + ++
Sbjct: 109 EKAGVIKEKRATSYPGFDKEM--KTCNYQ----ENRVVVDGNIITGRGPGVAMEFALTVV 162
Query: 182 EKLTSNENVNIIKDNMFLK 200
LTS + V + + M ++
Sbjct: 163 NYLTSEDMVKELSEKMMVE 181
>gi|16303786|gb|AAL16803.1|AF394958_1 SP22 [Xenopus laevis]
Length = 189
Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats.
Identities = 56/202 (27%), Positives = 91/202 (45%), Gaps = 20/202 (9%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E TDV G+K + V +S K+ ++C+ L +
Sbjct: 4 KRALVILAKGAEEMETVIPTDVM---RRAGIK----VTVAGLSGKDPVQCSRDVMLCPDT 56
Query: 62 IITEDNVEDFFDYDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ E + YDV+++PGG G N + + K+++K +I AIC+
Sbjct: 57 SLEEARTQG--PYDVVVLPGGNLGAQNL---SESPVVKEVLKEQEAKKGLIAAICAGPTA 111
Query: 121 LLESTYIRGKKVTTYLLDNKRYFN--QLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSF 178
L GK +TT+ L + N Q K Y+ EE +V+D N T GPG + E +
Sbjct: 112 LTVHGVGIGKTITTHPLAKDKIVNPDQYK-YS----EERVVKDENFITSRGPGTSFEFAL 166
Query: 179 RLLEKLTSNENVNIIKDNMFLK 200
++ L E +K + LK
Sbjct: 167 EIVCTLLGKEVAEQVKTPLVLK 188
>gi|66267686|dbj|BAD98544.1| DJ-1 [Crocodylus niloticus]
Length = 189
Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats.
Identities = 55/203 (27%), Positives = 91/203 (44%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E TD+ G+K + V ++ KE ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPTDLM---RRAGIK----VTVAGLTGKEPVQCS-----RDVF 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
+ + ++ED YDV+++PGG G N + K ++K +I AIC+
Sbjct: 52 VCPDTSLEDARKEGPYDVVVLPGGNLGAQNL---SESSAVKDILKDQEMRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N ++Y E + +D N+ T GPG + E
Sbjct: 109 PTALLAHGIGFGSKVTTHPLAKDKMMNG-EHYKYS--ENRVEKDGNILTSRGPGTSFEFG 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L E + +K + LK
Sbjct: 166 LAIIETLMGKEVSDQVKSPLILK 188
>gi|66267682|dbj|BAD98542.1| DJ-1 [Alligator mississippiensis]
Length = 189
Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats.
Identities = 55/203 (27%), Positives = 91/203 (44%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E TD+ G+K + V ++ KE ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPTDLM---RRAGIK----VTVAGLTGKEPVQCS-----RDVF 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
+ + ++ED YDV+++PGG G N + K ++K +I AIC+
Sbjct: 52 VCPDTSLEDARKEGPYDVVVLPGGNLGAQNL---SESSAVKDILKDQEMRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N ++Y E + +D N+ T GPG + E
Sbjct: 109 PTALLAHGIGFGSKVTTHPLAKDKMMNG-EHYKYS--ENRVEKDGNILTSRGPGTSFEFG 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L E + +K + LK
Sbjct: 166 LAIIETLMGKEVSDQVKSPLILK 188
>gi|38234592|ref|NP_940359.1| Putative protease [Corynebacterium diphtheriae NCTC 13129]
gi|38200855|emb|CAE50560.1| Putative protease [Corynebacterium diphtheriae]
Length = 178
Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats.
Identities = 40/119 (33%), Positives = 63/119 (52%), Gaps = 9/119 (7%)
Query: 67 NVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTY 126
NVE+F D L++ GG A+ + N + + FFE K + AIC A L++S
Sbjct: 69 NVEEF---DALVLAGGTLNADAMRI--NPEARSITVQFFEAEKPVAAICHAPWLLIDSKK 123
Query: 127 IRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLT 185
+ GKK+T+Y + L+N I ++EE+VED NL T PG+ + ++ KL+
Sbjct: 124 VEGKKLTSY----TSVKSDLENAGAIWVDEEVVEDGNLITSRNPGDLEAFNKAIIAKLS 178
>gi|116672030|ref|YP_832963.1| intracellular protease, PfpI family [Arthrobacter sp. FB24]
gi|116612139|gb|ABK04863.1| intracellular protease, PfpI family [Arthrobacter sp. FB24]
Length = 188
Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats.
Identities = 52/190 (27%), Positives = 83/190 (43%), Gaps = 17/190 (8%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGE-LRAE 60
KK+A L +G E E+ S WN V + + T GE +
Sbjct: 9 KKVAFLLTDGVEQVELTS-----PWNAVKEAGGEPTLVAPKAGKLQGYDGTEKGETFDVD 63
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFK-DKNNKIFKKLIKYFFENNKIIVAICSAVI 119
+ E N DF L+IPGG A+ + DK+ + F + FFE +K + +IC
Sbjct: 64 ITVAEANASDF---HALVIPGGVVNADHLRVDKDAQAFAR---SFFEQHKPVASICHGPW 117
Query: 120 NLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFR 179
L+++ +RG+K+T+Y LKN + E+V D T PG+ + +
Sbjct: 118 LLIDAGVVRGRKLTSY----HTLQTDLKNAGADWSDAEVVVDQGFVTSRHPGDLNAFNDK 173
Query: 180 LLEKLTSNEN 189
LLE++ E+
Sbjct: 174 LLEEIEEGEH 183
>gi|52079280|ref|YP_078071.1| putative intracellular protease [Bacillus licheniformis ATCC 14580]
Length = 211
Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats.
Identities = 53/202 (26%), Positives = 85/202 (42%), Gaps = 23/202 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETI--SYKESIKCTWGGELR 58
MKK +FL +G E E + DV R +VET+ S S G ++
Sbjct: 29 MKKAYVFLIDGFEEIEAIATIDVL-----------RRAEVETVTVSLDPSRSVKGGHDIV 77
Query: 59 AEKIITEDNVEDFFDYDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
E + D+ D+ + D+LI+PGG G + ++ K++ K + AIC+A
Sbjct: 78 VEADVMFDDA-DWQEADMLILPGGNVGSKKMLE---HQALHKMLTEAANAGKYVAAICAA 133
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
+ L ++ + GKK T Y L +V E +V D N+ T GP + +
Sbjct: 134 TMTLGKTGLVSGKKATCY----PGVEEHLTGADVTA-HENVVVDGNIITSRGPATTIPFA 188
Query: 178 FRLLEKLTSNENVNIIKDNMFL 199
+L E L E + M +
Sbjct: 189 LKLAELLNGKEKAGAVAKGMLV 210
>gi|52784646|ref|YP_090475.1| hypothetical protein BLi00848 [Bacillus licheniformis ATCC 14580]
gi|52347148|gb|AAU39782.1| putative protein [Bacillus licheniformis DSM 13]
Length = 257
Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats.
Identities = 53/202 (26%), Positives = 85/202 (42%), Gaps = 23/202 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETI--SYKESIKCTWGGELR 58
MKK +FL +G E E + DV R +VET+ S S G ++
Sbjct: 75 MKKAYVFLIDGFEEIEAIATIDVL-----------RRAEVETVTVSLDPSRSVKGGHDIV 123
Query: 59 AEKIITEDNVEDFFDYDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
E + D+ D+ + D+LI+PGG G + ++ K++ K + AIC+A
Sbjct: 124 VEADVMFDDA-DWQEADMLILPGGNVGSKKMLE---HQALHKMLTEAANAGKYVAAICAA 179
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
+ L ++ + GKK T Y L +V E +V D N+ T GP + +
Sbjct: 180 TMTLGKTGLVSGKKATCY----PGVEEHLTGADVTA-HENVVVDGNIITSRGPATTIPFA 234
Query: 178 FRLLEKLTSNENVNIIKDNMFL 199
+L E L E + M +
Sbjct: 235 LKLAELLNGKEKAGAVAKGMLV 256
>gi|145902805|gb|AAU22433.2| putative intracellular protease [Bacillus licheniformis ATCC 14580]
Length = 183
Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats.
Identities = 53/202 (26%), Positives = 85/202 (42%), Gaps = 23/202 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETI--SYKESIKCTWGGELR 58
MKK +FL +G E E + DV R +VET+ S S G ++
Sbjct: 1 MKKAYVFLIDGFEEIEAIATIDVL-----------RRAEVETVTVSLDPSRSVKGGHDIV 49
Query: 59 AEKIITEDNVEDFFDYDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
E + D+ D+ + D+LI+PGG G + ++ K++ K + AIC+A
Sbjct: 50 VEADVMFDDA-DWQEADMLILPGGNVGSKKMLE---HQALHKMLTEAANAGKYVAAICAA 105
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
+ L ++ + GKK T Y L +V E +V D N+ T GP + +
Sbjct: 106 TMTLGKTGLVSGKKATCY----PGVEEHLTGADVTA-HENVVVDGNIITSRGPATTIPFA 160
Query: 178 FRLLEKLTSNENVNIIKDNMFL 199
+L E L E + M +
Sbjct: 161 LKLAELLNGKEKAGAVAKGMLV 182
>gi|91978428|ref|YP_571087.1| Peptidase C56, PfpI [Rhodopseudomonas palustris BisB5]
gi|91684884|gb|ABE41186.1| Peptidase C56, PfpI [Rhodopseudomonas palustris BisB5]
Length = 187
Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats.
Identities = 34/132 (25%), Positives = 67/132 (50%), Gaps = 9/132 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG ++ +K ++ +D YD +++PGG + + + + KLIK FF+ K +
Sbjct: 55 WGRPVKVDKALSAVKADD---YDAIVLPGGQINPDLLRVNADAL--KLIKSFFDAGKTVA 109
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
A+C A L+E+ +G+K+T+Y + N + + +V DN + T PG+
Sbjct: 110 AVCHAPWLLIEAGIAKGRKMTSYNSIKQDVINAGAKWE----DSAVVTDNGVITSRNPGD 165
Query: 173 ALELSFRLLEKL 184
S +++E++
Sbjct: 166 LEAFSDKIIEEI 177
>gi|121997998|ref|YP_001002785.1| DJ-1 family protein [Halorhodospira halophila SL1]
gi|121589403|gb|ABM61983.1| DJ-1 family protein [Halorhodospira halophila SL1]
Length = 188
Score = 58.2 bits (139), Expect = 2e-07, Method: Composition-based stats.
Identities = 32/115 (27%), Positives = 62/115 (53%), Gaps = 9/115 (7%)
Query: 74 YDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVT 133
+D++++PGG G A + + + ++++ E I AIC+A L E ++G++ T
Sbjct: 65 FDLIVLPGGLGGAE--RLEGDARIARMLQAQNERGGWIAAICAAPRVLAEVGVLQGRRAT 122
Query: 134 TYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTSNE 188
+ QL+ + + P + +V D+NL T GPG A++ + RL+E + +E
Sbjct: 123 AFP-------TQLERHGIEPEDSAVVIDDNLITSRGPGTAMDFALRLIEVVYGDE 170
>gi|73748071|ref|YP_307310.1| DJ-1 family protein [Dehalococcoides sp. CBDB1]
gi|73659787|emb|CAI82394.1| DJ-1 family protein [Dehalococcoides sp. CBDB1]
Length = 180
Score = 58.2 bits (139), Expect = 3e-07, Method: Composition-based stats.
Identities = 50/202 (24%), Positives = 91/202 (45%), Gaps = 25/202 (12%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M + A+ L EG E E + TD+ D++V+ + K + G R
Sbjct: 1 MSRFAVLLAEGFEEIEFCTITDIL---------RRADLEVKIVGLKNDLT----GGSRGI 47
Query: 61 KIITEDNVEDF--FDYDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
+I+ + +++D DY+VL++PGG G N KD+ +LI+ NK + AIC+
Sbjct: 48 RIMPDMHIDDLKTTDYEVLVLPGGNPGFINMGKDQR---VLELIRTAHAENKYLAAICAG 104
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
L + I GK+V Y + LKN + ++ + L T P A++ +
Sbjct: 105 PAVLSRAGVIDGKEVAIY----PGVKHLLKNCTACDLRVKV--EGRLITGRSPQAAMDFA 158
Query: 178 FRLLEKLTSNENVNIIKDNMFL 199
L++ ++ +++D M +
Sbjct: 159 LTLMDMFAKPQSAKVVRDEMLV 180
>gi|147668899|ref|YP_001213717.1| DJ-1 family protein [Dehalococcoides sp. BAV1]
gi|146269847|gb|ABQ16839.1| DJ-1 family protein [Dehalococcoides sp. BAV1]
Length = 180
Score = 58.2 bits (139), Expect = 3e-07, Method: Composition-based stats.
Identities = 50/202 (24%), Positives = 91/202 (45%), Gaps = 25/202 (12%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M + A+ L EG E E + TD+ D++V+ + K + G R
Sbjct: 1 MSRFAVLLAEGFEEIEFCTITDIL---------RRADLEVKIVGLKNDLT----GGSRGI 47
Query: 61 KIITEDNVEDF--FDYDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
+I+ + +++D DY+VL++PGG G N KD+ +LI+ NK + AIC+
Sbjct: 48 RIMPDIHIDDLKTTDYEVLVLPGGNPGFINMGKDQR---VLELIRTAHAENKYLAAICAG 104
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
L + I GK+V Y + LKN + ++ + L T P A++ +
Sbjct: 105 PAVLSRAGVIDGKEVAIY----PGVKHLLKNCTACDLRVKV--EGRLITGRSPQAAMDFA 158
Query: 178 FRLLEKLTSNENVNIIKDNMFL 199
L++ ++ +++D M +
Sbjct: 159 LTLMDMFAKPQSAKVVRDEMLV 180
>gi|126657553|ref|ZP_01728709.1| proteinase I [Cyanothece sp. CCY0110]
gi|126621257|gb|EAZ91970.1| proteinase I [Cyanothece sp. CCY0110]
Length = 184
Score = 58.2 bits (139), Expect = 3e-07, Method: Composition-based stats.
Identities = 49/173 (28%), Positives = 81/173 (46%), Gaps = 21/173 (12%)
Query: 2 KKIAIFLFEGAELFEIA----SFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGEL 57
KKIAI + +G E E+ +F D +++ K D V ++ + E
Sbjct: 8 KKIAILVTDGFEQVEMTKPRQAFDDAGATTHLISPK---DKTVRGWNHYDK-----ADEF 59
Query: 58 RAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
+ + + N +D YD L++PGG AN + + N + IK FF +K + AIC
Sbjct: 60 NVDVALNQANPDD---YDALLLPGGV--ANPDQLRTNPSVVEFIKAFFTADKPVAAICHG 114
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGP 170
L+E+ ++G+K+T++ LKN ++EE+V D NL T P
Sbjct: 115 PWTLVEAEAVQGRKITSW----PSLKTDLKNAGANWVDEEVVIDGNLVTSRNP 163
>gi|120435844|ref|YP_861530.1| peptidase, family C56 [Gramella forsetii KT0803]
gi|117577994|emb|CAL66463.1| peptidase, family C56 [Gramella forsetii KT0803]
Length = 182
Score = 57.8 bits (138), Expect = 4e-07, Method: Composition-based stats.
Identities = 47/193 (24%), Positives = 91/193 (47%), Gaps = 23/193 (11%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESI-----KCTWGGE 56
K+IAI G E E+AS + E KVE +S ++ K WG E
Sbjct: 3 KRIAILATHGFEESELASPKEAM---------EKEGFKVEIVSLEKGKIKSWDKDNWGKE 53
Query: 57 LRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICS 116
+K + E + +D Y+ L++PGG + + + + + ++ FF+ +K + AIC
Sbjct: 54 YNVDKTLDEVSAKD---YNALVLPGGVINPDKLRREESALI--FVRDFFKQSKPVAAICH 108
Query: 117 AVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALEL 176
A L+ + + G+ +T++ K L+N + ++EE+V D L T P +
Sbjct: 109 AAWTLISADVVEGRTMTSFNSIKK----DLENAGALWVDEEVVVDEALVTSRNPDDLPAF 164
Query: 177 SFRLLEKLTSNEN 189
+ +++E++ ++
Sbjct: 165 NAKVIEEIKEGKH 177
>gi|45383015|ref|NP_989916.1| Parkinson disease (autosomal recessive, early onset) 7 [Gallus
gallus]
gi|82106351|sp|Q8UW59|PARK7_CHICK Protein DJ-1 (Parkinson disease protein 7 homolog)
gi|17974316|dbj|BAB79527.1| DJ-1 [Gallus gallus]
Length = 189
Score = 57.8 bits (138), Expect = 4e-07, Method: Composition-based stats.
Identities = 55/203 (27%), Positives = 87/203 (42%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E TDV G+K + V ++ KE ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPTDVM---RRAGIK----VTVAGLTGKEPVQCS-----RDVL 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + K ++K +I AIC+
Sbjct: 52 ICPDASLEDARKEGPYDVIVLPGGNLGAQNL---SESAAVKDILKDQESRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KV T+ L + N + E + +D N+ T GPG + E
Sbjct: 109 PTALLAHGIGFGSKVITHPLAKDKMMN---GAHYCYSESRVEKDGNILTSRGPGTSFEFG 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L E +K + LK
Sbjct: 166 LAIVEALMGKEVAEQVKAPLILK 188
>gi|125772586|ref|XP_001357594.1| GA12322-PA [Drosophila pseudoobscura]
gi|54637326|gb|EAL26728.1| GA12322-PA [Drosophila pseudoobscura]
Length = 187
Score = 57.8 bits (138), Expect = 4e-07, Method: Composition-based stats.
Identities = 54/206 (26%), Positives = 89/206 (43%), Gaps = 29/206 (14%)
Query: 1 MKKIA-IFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRA 59
M K A I L GAE E DV G+K + V + E +KC+ +
Sbjct: 1 MSKTALIILAPGAEEMEFVIAADVL---RRAGIK----VTVAGLKDSEPVKCSRDVVIVP 53
Query: 60 EKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVI 119
+ + + + F DV+++PGG G +N D + L++ +I AIC+A
Sbjct: 54 DTSLAKAACDKF---DVVVLPGGLGGSNAMGD--SAAVGDLLRAQESAGGLIAAICAAPT 108
Query: 120 NLLESTYIRGKKVTTY------LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNA 173
L + GK +T+Y L+D Y + ++ +V+D NL T GPG A
Sbjct: 109 VLAKHGIAAGKSLTSYPSMKEQLVDKYCYVD----------DKSVVKDGNLITSRGPGTA 158
Query: 174 LELSFRLLEKLTSNENVNIIKDNMFL 199
+ + ++ E+L E V + + L
Sbjct: 159 YDFALKIAEELAGLEKVKEVAKGLLL 184
>gi|86751228|ref|YP_487724.1| Peptidase C56, PfpI [Rhodopseudomonas palustris HaA2]
gi|86574256|gb|ABD08813.1| Peptidase C56, PfpI [Rhodopseudomonas palustris HaA2]
Length = 187
Score = 57.4 bits (137), Expect = 5e-07, Method: Composition-based stats.
Identities = 33/132 (25%), Positives = 66/132 (50%), Gaps = 9/132 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG ++ +K + +D YD +++PGG + + + + KLIK FF+ K +
Sbjct: 55 WGRPVKVDKALGSAKADD---YDAIVLPGGQINPDLLRVNADAL--KLIKSFFDAGKTVA 109
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
A+C A L+++ +G+K+T+Y + N + + +V DN + T PG+
Sbjct: 110 AVCHAPWLLIDTGIAKGRKMTSYNSIKQDVINAGAKWE----DSAVVTDNGVITSRNPGD 165
Query: 173 ALELSFRLLEKL 184
S +++E++
Sbjct: 166 LEAFSAKIIEEI 177
>gi|74212240|dbj|BAE40278.1| unnamed protein product [Mus musculus]
Length = 189
Score = 57.4 bits (137), Expect = 5e-07, Method: Composition-based stats.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + V ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTVAGLAGKDPVQCS-----RDVM 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + + K+++K +I AIC+
Sbjct: 52 ICPDTSLEDAKTQGPYDVVVLPGGNLGAQNL---SESPMVKEILKEQESRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y+ E + +D + T GPG + E +
Sbjct: 109 PTALLAHEVGFGCKVTTHTLAKDKMMNG-SHYSY--SESRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L + N +K + LK
Sbjct: 166 LAIVEALVGKDMANQVKAPLVLK 188
>gi|110601832|ref|ZP_01389998.1| Peptidase C56, PfpI [Geobacter sp. FRC-32]
gi|110547449|gb|EAT60709.1| Peptidase C56, PfpI [Geobacter sp. FRC-32]
Length = 166
Score = 57.0 bits (136), Expect = 6e-07, Method: Composition-based stats.
Identities = 33/112 (29%), Positives = 55/112 (49%), Gaps = 7/112 (6%)
Query: 73 DYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKV 132
DY +L++PGG K+ + ++ ++FFE NK + AIC L+ + +RG+K
Sbjct: 61 DYTILVLPGGKAPETVRKEAKAQ---EIARFFFEQNKPVAAICHGPQTLISAGLLRGRKA 117
Query: 133 TTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKL 184
T Y ++LK + E+V D NL T PG+ +++KL
Sbjct: 118 TCY----NTVVDELKEAGARYEDTEVVVDGNLVTSREPGDLPAFMREMMKKL 165
>gi|74317863|ref|YP_315603.1| putative protease [Thiobacillus denitrificans ATCC 25259]
gi|74057358|gb|AAZ97798.1| putative protease [Thiobacillus denitrificans ATCC 25259]
Length = 181
Score = 57.0 bits (136), Expect = 6e-07, Method: Composition-based stats.
Identities = 36/113 (31%), Positives = 57/113 (50%), Gaps = 12/113 (10%)
Query: 74 YDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVT 133
YD++++PGG A KD + L+K K AIC+A + L E+ +RGK+ T
Sbjct: 63 YDMVVLPGGMPGAAHLKDDVRVV--DLLKKMASAGKYTAAICAAPMVLAEAGLLRGKQAT 120
Query: 134 TY--LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKL 184
+Y LD +V E +V+D + T GPG A++ + +L+E L
Sbjct: 121 SYPGFLDGVP--------DVTLRAEAVVQDGTVLTSRGPGTAMDFALQLVETL 165
>gi|150003061|ref|YP_001297805.1| putative ThiJ family intracellular protease [Bacteroides vulgatus
ATCC 8482]
gi|149931485|gb|ABR38183.1| putative ThiJ family intracellular protease [Bacteroides vulgatus
ATCC 8482]
Length = 183
Score = 57.0 bits (136), Expect = 7e-07, Method: Composition-based stats.
Identities = 57/200 (28%), Positives = 92/200 (46%), Gaps = 20/200 (10%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MK I +FL EG E E + DV GL +K +++ ++ G + A+
Sbjct: 1 MKTIYVFLAEGFEEVEALTPVDVL---RRAGLP----VKTVSVTGVLTVNGAHGVPVVAD 53
Query: 61 KIITEDNVEDFFDYDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVI 119
+ E V++ D +++++PGG G N ++ KLI F E + + AIC+A +
Sbjct: 54 MVFEE--VKEG-DAEMIVLPGGLPGATNL---DAHEGLGKLIMTFAEAGRPLSAICAAPL 107
Query: 120 NLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFR 179
+ ++GKKVT Y K + + Y +E+ D N T GPG A+ SF
Sbjct: 108 VYGKRGLLKGKKVTCYPGFEK--YLEGAEYTAALVEK----DGNFITGKGPGAAMAFSFA 161
Query: 180 LLEKLTSNENVNIIKDNMFL 199
+ EK E V +K M +
Sbjct: 162 IAEKYVGAEKVTELKQGMMI 181
>gi|148255686|ref|YP_001240271.1| putative intracellular proteinase [Bradyrhizobium sp. BTAi1]
gi|146407859|gb|ABQ36365.1| putative intracellular proteinase [Bradyrhizobium sp. BTAi1]
Length = 186
Score = 56.6 bits (135), Expect = 7e-07, Method: Composition-based stats.
Identities = 34/138 (24%), Positives = 68/138 (49%), Gaps = 9/138 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG ++ +K + + + D YD +++PGG + + + + K IK FE KI+
Sbjct: 53 WGRPVKVDKTLDQASASD---YDAIVLPGGQINPDLLRLEPKAL--KFIKDIFEAKKIVA 107
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
A+C A L+E+ +G+K+T+Y N ++ +E +V D + T PG+
Sbjct: 108 AVCHAPWLLIETGIAKGRKMTSYKSIKTDVVNAGADWQ----DEAVVVDQGVITSRNPGD 163
Query: 173 ALELSFRLLEKLTSNENV 190
S +++E++ ++
Sbjct: 164 LEAFSAKIIEEVKEGRHL 181
>gi|39934376|ref|NP_946652.1| putative intracellular protease, PfpI family [Rhodopseudomonas
palustris CGA009]
gi|39648225|emb|CAE26744.1| putative intracellular protease, PfpI family [Rhodopseudomonas
palustris CGA009]
Length = 187
Score = 56.6 bits (135), Expect = 7e-07, Method: Composition-based stats.
Identities = 32/132 (24%), Positives = 67/132 (50%), Gaps = 9/132 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG ++ +K+++ +D YD +++PGG + + + + KLIK F+ K +
Sbjct: 55 WGHLVKVDKLLSAVKADD---YDAIVLPGGQINPDLLRVNQDAL--KLIKSLFDAGKTVA 109
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
A+C A L+++ +G+K+T+Y + N + + +V DN + T PG+
Sbjct: 110 AVCHAPWLLIDTGIAKGRKMTSYNSIKQDVINAGAKWE----DSAVVTDNGVITSRNPGD 165
Query: 173 ALELSFRLLEKL 184
S +++E++
Sbjct: 166 LEAFSAKIIEEM 177
>gi|62751849|ref|NP_001015572.1| Parkinson disease (autosomal recessive, early onset) 7 [Bos taurus]
gi|75040204|sp|Q5E946|PARK7_BOVIN Protein DJ-1 (Parkinson disease protein 7 homolog)
gi|59858513|gb|AAX09091.1| DJ-1 protein [Bos taurus]
Length = 189
Score = 56.6 bits (135), Expect = 8e-07, Method: Composition-based stats.
Identities = 54/203 (26%), Positives = 93/203 (45%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + V ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTVAGLAGKDPVQCS-----RDVV 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + K+++K + +I AIC+
Sbjct: 52 ICPDASLEDAKKEGPYDVVVLPGGNLGAQNL---SESAAVKEILKEQEKRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y+ E + +D + T GPG + E +
Sbjct: 109 PTALLAHEIGFGSKVTTHPLAKDKMMNG-SHYSY--SENRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
+++E L E + +K + LK
Sbjct: 166 LKIVEVLVGKEVADQVKAPLVLK 188
>gi|153854555|ref|ZP_01995825.1| hypothetical protein DORLON_01820 [Dorea longicatena DSM 13814]
gi|149752864|gb|EDM62795.1| hypothetical protein DORLON_01820 [Dorea longicatena DSM 13814]
Length = 181
Score = 56.6 bits (135), Expect = 8e-07, Method: Composition-based stats.
Identities = 54/202 (26%), Positives = 94/202 (46%), Gaps = 28/202 (13%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKES--IKCTWGGELR 58
MKK+ + L +G E EI T VV L I V+T+S + + G ++
Sbjct: 1 MKKVCVLLADGFE--EIEGLT-------VVDLLRRAKIYVDTVSIMDDYIVHGAHGINVQ 51
Query: 59 AEKIITEDNVEDFFDYDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
E + E DF ++D++++PGG G N K + + ++K + + + + AIC+A
Sbjct: 52 TEDLFDE---VDFEEFDMVVLPGGMPGTLNL---KEHDGVRYVVKQYAKEGRFVGAICAA 105
Query: 118 VINLLESTYIRGKKVTTY--LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALE 175
L + G++ T Y + D NVI E +V D+N+ T G G A++
Sbjct: 106 PTILKSLGLLEGRRATCYPGVEDEME--------NVILTETAVVVDDNIITSQGVGTAID 157
Query: 176 LSFRLLEKLTSNENVNIIKDNM 197
+ +L+E L E I +++
Sbjct: 158 FALKLIEVLDGEEKAKEIAESI 179
>gi|55741460|ref|NP_065594.2| DJ-1 protein [Mus musculus]
gi|56404944|sp|Q99LX0|PARK7_MOUSE Protein DJ-1 (Parkinson disease protein 7 homolog)
gi|12805429|gb|AAH02187.1| Parkinson disease (autosomal recessive, early onset) 7 [Mus
musculus]
gi|54792586|dbj|BAA29063.2| DJ-1 [Mus musculus]
gi|74150475|dbj|BAE32271.1| unnamed protein product [Mus musculus]
gi|74226952|dbj|BAE27118.1| unnamed protein product [Mus musculus]
gi|123246552|emb|CAM19230.1| Parkinson disease (autosomal recessive, early onset) 7 [Mus
musculus]
gi|148682949|gb|EDL14896.1| Parkinson disease (autosomal recessive, early onset) 7, isoform
CRA_a [Mus musculus]
gi|148682950|gb|EDL14897.1| Parkinson disease (autosomal recessive, early onset) 7, isoform
CRA_a [Mus musculus]
Length = 189
Score = 56.6 bits (135), Expect = 8e-07, Method: Composition-based stats.
Identities = 54/203 (26%), Positives = 92/203 (45%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + V ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTVAGLAGKDPVQCS-----RDVM 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + + K+++K +I AIC+
Sbjct: 52 ICPDTSLEDAKTQGPYDVVVLPGGNLGAQNL---SESPMVKEILKEQESRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y+ E + +D + T GPG + E +
Sbjct: 109 PTALLAHEVGFGCKVTTHPLAKDKMMNG-SHYSY--SESRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L + N +K + LK
Sbjct: 166 LAIVEALVGKDMANQVKAPLVLK 188
>gi|118580535|ref|YP_901785.1| metal dependent phosphohydrolase [Pelobacter propionicus DSM 2379]
gi|118503245|gb|ABK99727.1| metal dependent phosphohydrolase [Pelobacter propionicus DSM 2379]
Length = 388
Score = 56.6 bits (135), Expect = 9e-07, Method: Composition-based stats.
Identities = 38/128 (29%), Positives = 63/128 (49%), Gaps = 10/128 (7%)
Query: 74 YDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKV 132
+D++I+PGG G AN D +L+ F ++NK+I AIC+A L E+ IRGK+V
Sbjct: 63 FDMVILPGGQPGAANLSADVR---VIRLLNDFSKDNKLIGAICAATTVLSEAGLIRGKRV 119
Query: 133 TTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTSNENVNI 192
T Y Y ++L + +V D + T GPG A+ + ++ + +
Sbjct: 120 TAY----PDYRDRLPGAQY--EDSAVVIDGKIITSQGPGTAMAFALAIVSRFAGKHTADE 173
Query: 193 IKDNMFLK 200
I M ++
Sbjct: 174 IAGKMLVQ 181
>gi|121535032|ref|ZP_01666850.1| ThiJ/PfpI domain protein [Thermosinus carboxydivorans Nor1]
gi|121306445|gb|EAX47369.1| ThiJ/PfpI domain protein [Thermosinus carboxydivorans Nor1]
Length = 193
Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats.
Identities = 44/189 (23%), Positives = 84/189 (44%), Gaps = 15/189 (7%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M I I +F E + +V + N + R + ++ E+I
Sbjct: 1 MVTIGILIFPQVEELDFVGPFEVLSYPN-----KLRSESTKVLTVAETINPVQA--FNGL 53
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
K+I + + + D++++PGG G+ K ++ + I + + I ++C+
Sbjct: 54 KVIPDIDFANCPPLDIIVVPGGKGR---MKAMHDPAIRDFILQQAKTARYITSVCTGAFI 110
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVI-PIEEEIVEDNNLFTCSGPGNALELSFR 179
L E+ + GK+ TTY +L Y I P++ ++V+D ++ T +G + LEL F
Sbjct: 111 LAEAGILDGKRATTYF----AALPELAGYPAIHPVKSKVVQDGSVITAAGVSSGLELGFY 166
Query: 180 LLEKLTSNE 188
LL+ L E
Sbjct: 167 LLKLLFGRE 175
>gi|149689074|gb|ABR27864.1| DJ-1 [Triatoma infestans]
Length = 194
Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats.
Identities = 51/202 (25%), Positives = 86/202 (42%), Gaps = 34/202 (16%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K + + EG+E E DV V ++ + + E KC+ R
Sbjct: 4 KSALVLVAEGSEEMECIISVDVLRRGGV-------NVTLAGLKGNEPTKCS-----RDVV 51
Query: 62 IITEDNVEDFFD---YDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAV 118
++ + ++E+ YD +++PGG + F D + L+K + KI+ AIC+A
Sbjct: 52 VVPDKSMEEAIKCGPYDAIVLPGGLQGSKSFADCST--LGNLLKEQEKCGKIVAAICAAP 109
Query: 119 INLLESTYIRGKKVTTY------LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
L GK+VT Y L+D+ +Y E+++V D NL T GPG
Sbjct: 110 TALKAHGIGLGKRVTCYPGLEKELVDSYKY-----------SEDKVVIDGNLITSRGPGT 158
Query: 173 ALELSFRLLEKLTSNENVNIIK 194
A + L+E+L + +K
Sbjct: 159 AFDFGLALVEQLVGTDTSCSVK 180
>gi|147775474|emb|CAN62882.1| hypothetical protein [Vitis vinifera]
Length = 427
Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats.
Identities = 35/114 (30%), Positives = 58/114 (50%), Gaps = 9/114 (7%)
Query: 72 FDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLES-TYIRGK 130
YD++++PGG G A F + L+K E+NK AIC++ +LE ++GK
Sbjct: 280 LSYDLIVLPGGLGGAQAFASSEKLV--NLLKNQRESNKPYGAICASPALVLEPHGLLKGK 337
Query: 131 KVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKL 184
K T + + +Q + IE ++ D NL T GPG ++E + ++EK
Sbjct: 338 KATAFPALCSKLSDQSE------IENRVLVDGNLITSRGPGTSMEFALAIIEKF 385
>gi|152992160|ref|YP_001357881.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
protein [Sulfurovum sp. NBC37-1]
gi|151424021|dbj|BAF71524.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
protein [Sulfurovum sp. NBC37-1]
Length = 186
Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats.
Identities = 50/204 (24%), Positives = 101/204 (49%), Gaps = 26/204 (12%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
M + I L +G E E + DV + E R +E + + G ++A+
Sbjct: 1 MASVLIPLAKGFEELEAVALIDVMRRGGI----EVRVAYLEDEMQSDLVLGANGITVKAD 56
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
I ++ + D D+D++++PGG+G + +N ++ ++L++ F + KI+ A+C+A
Sbjct: 57 TSI-KNVISD--DFDMMVLPGGWG-GTYALAENTRV-QELLREF-KAKKIVGAMCAAPFA 110
Query: 121 LLESTYIRGKKVTTY-----LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALE 175
L ++ + G++ T Y +D+ Y +E++VED N+ T GPG A+
Sbjct: 111 LKQAG-VLGERYTAYPGAVEEIDHPGYV----------ADEKVVEDGNVMTSQGPGTAVC 159
Query: 176 LSFRLLEKLTSNENVNIIKDNMFL 199
++++L E++ +K+ M L
Sbjct: 160 FGLAIVKRLVGEESMQAVKEGMLL 183
>gi|154175067|ref|YP_001407863.1| DJ-1 family protein [Campylobacter curvus 525.92]
gi|112803406|gb|EAU00750.1| DJ-1 family protein [Campylobacter curvus 525.92]
Length = 185
Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats.
Identities = 45/202 (22%), Positives = 90/202 (44%), Gaps = 25/202 (12%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNV----VGLKEFRDIKVETISYKESIKCTWGGE 56
MK++A+ L G E E S D+ ++ VGL + +S K + + E
Sbjct: 1 MKRVAVILANGFEEIEALSVVDILRRADIDALCVGLDRALVVGAHGVSVKVDLLLS---E 57
Query: 57 LRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICS 116
LR + D +++PGG A D +K ++++ F +N K+I AIC+
Sbjct: 58 LRE------------IELDAIVLPGGLPGAQNLAD--SKELGEILRRFDDNGKLICAICA 103
Query: 117 AVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALEL 176
A + L ++ ++G + N + N ++ ++ D+N+ T GP A+E
Sbjct: 104 APMALAKAGVLKGAFTCYPGFET----NVRSDKNGYISDKNVICDHNIITSRGPATAMEF 159
Query: 177 SFRLLEKLTSNENVNIIKDNMF 198
+ ++++L + ++D +
Sbjct: 160 ALEIVKELNGTSSYESVRDGLL 181
>gi|152981191|ref|YP_001354772.1| transcriptional regulator, AraC family [Janthinobacterium sp.
Marseille]
gi|151281268|gb|ABR89678.1| transcriptional regulator, AraC family [Janthinobacterium sp.
Marseille]
Length = 328
Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats.
Identities = 53/191 (27%), Positives = 90/191 (47%), Gaps = 22/191 (11%)
Query: 1 MKKIAIFL--FEGAELFEIASFTDVFGWNNVV--GLKEFRDIKVETISYKESIKCTWGGE 56
M+KI I L F ++ +IA+ D F V+ G E+ + + T + I+ + G
Sbjct: 1 MRKITIGLVVFPRFQMLDIAAPGDAFAEVKVLSNGECEYEILTIATT--RGPIQSSSGLT 58
Query: 57 LRAEKIITEDNVEDFFD----YDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
+ ++ I FD +D LI+PGG G + +D + E + I
Sbjct: 59 IMPDRTI--------FDPCPHFDTLIVPGGLGVFDILEDTT---LTDWLAAQGEGCRRIG 107
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
AIC+ V L + I GK VTT+ +D R + + V P + V+D++L+T +G
Sbjct: 108 AICNGVFALGAAGMINGKTVTTHWMDAARLASMFRKATVEP-DRIYVKDDSLYTTAGVTA 166
Query: 173 ALELSFRLLEK 183
++LS L+E+
Sbjct: 167 GIDLSLALIEE 177
>gi|119469330|ref|ZP_01612269.1| proteinase [Alteromonadales bacterium TW-7]
gi|119447194|gb|EAW28463.1| proteinase [Alteromonadales bacterium TW-7]
Length = 192
Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats.
Identities = 56/186 (30%), Positives = 84/186 (45%), Gaps = 17/186 (9%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
KKIAI +G E E+ S D N E IK I+ K WG ++ +K
Sbjct: 13 KKIAILATDGFEQSELFSPRDAL--LNAGAEIEIVSIKEGQITGWNEDK--WGEKVSVDK 68
Query: 62 IITEDNVEDFFDYDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFF--ENNKIIVAICSAV 118
++T N D YD L++PGG F + +DK+ K F I FF E NK + AIC A
Sbjct: 69 LVTNTNSAD---YDALMLPGGLFNPDSLRQDKHAKAF---IDGFFGAEKNKPVAAICHAP 122
Query: 119 INLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSF 178
L E +R + +T++ N N+ + EE+ D L T P + +
Sbjct: 123 WLLAEINKLRDRTITSFPSIKSDLMNAGANW----VNEEVCVDRGLVTSRSPEDLDAFNA 178
Query: 179 RLLEKL 184
+ +E++
Sbjct: 179 KFIEEV 184
>gi|145596702|ref|YP_001160999.1| intracellular protease, PfpI family [Salinispora tropica CNB-440]
gi|145306039|gb|ABP56621.1| intracellular protease, PfpI family [Salinispora tropica CNB-440]
Length = 187
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 29/101 (28%), Positives = 50/101 (49%), Gaps = 6/101 (5%)
Query: 70 DFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRG 129
D DYD L++PGG +F + N + + ++ F E+ K + AIC L+E+ +RG
Sbjct: 69 DPVDYDALVLPGGVANPDFLRADANVV--RFVRTFVESGKPVAAICHGPWTLVEANVVRG 126
Query: 130 KKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGP 170
+ +T++ L N +++E+ DN L T P
Sbjct: 127 RTLTSW----PSLRTDLVNAGATWVDQEVFVDNGLITSRRP 163
>gi|147905238|ref|NP_001086295.1| MGC84701 protein [Xenopus laevis]
gi|49522782|gb|AAH74440.1| MGC84701 protein [Xenopus laevis]
Length = 189
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 53/202 (26%), Positives = 90/202 (44%), Gaps = 20/202 (9%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + + ++ K+ ++C+ L +
Sbjct: 4 KRALVILAKGAEETETVIPADVM---RRAGIK----VTIAGLNGKDPVQCSRDVMLCPDT 56
Query: 62 IITEDNVEDFFDYDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ E + YDV+++PGG G N + + K+++K +I AIC+
Sbjct: 57 SLEEARTQG--PYDVVVLPGGNLGAQNL---SESPVVKEVLKEQEAKKGLIAAICAGPTA 111
Query: 121 LLESTYIRGKKVTTYLLDNKRYFN--QLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSF 178
L GK +TT+ L + N Q K Y+ EE +V+D N T GPG + E +
Sbjct: 112 LTVHGVGIGKSITTHPLAKDKIVNPDQYK-YS----EERVVKDENFITSRGPGTSFEFAL 166
Query: 179 RLLEKLTSNENVNIIKDNMFLK 200
++ L E +K + LK
Sbjct: 167 EIVCTLLGKEVAEQVKTPLLLK 188
>gi|18404397|ref|NP_564626.1| DJ-1 family protein [Arabidopsis thaliana]
gi|7769869|gb|AAF69547.1|AC008007_22 F12M16.18 [Arabidopsis thaliana]
gi|15810459|gb|AAL07117.1| unknown protein [Arabidopsis thaliana]
gi|20259561|gb|AAM14123.1| unknown protein [Arabidopsis thaliana]
Length = 438
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 9/116 (7%)
Query: 74 YDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLES-TYIRGKKV 132
YD++++PGG G A F + ++K E+NK AIC++ + E ++GKK
Sbjct: 320 YDLIVLPGGLGGAEAFASSEKLV--NMLKKQAESNKPYGAICASPALVFEPHGLLKGKKA 377
Query: 133 TTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTSNE 188
T + + +Q IE ++ D NL T GPG +LE + ++EK E
Sbjct: 378 TAFPAMCSKLTDQSH------IEHRVLVDGNLITSRGPGTSLEFALAIVEKFYGRE 427
>gi|21536528|gb|AAM60860.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein, putative [Arabidopsis thaliana]
Length = 438
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 9/116 (7%)
Query: 74 YDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLES-TYIRGKKV 132
YD++++PGG G A F + ++K E+NK AIC++ + E ++GKK
Sbjct: 320 YDLIVLPGGLGGAEAFASSEKLV--NMLKKQAESNKPYGAICASPALVFEPHGLLKGKKA 377
Query: 133 TTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTSNE 188
T + + +Q IE ++ D NL T GPG +LE + ++EK E
Sbjct: 378 TAFPAMCSKLTDQSH------IEHRVLVDGNLITSRGPGTSLEFALAIVEKFYGRE 427
>gi|62319084|dbj|BAD94229.1| hypothetical protein [Arabidopsis thaliana]
Length = 147
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 9/116 (7%)
Query: 74 YDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLES-TYIRGKKV 132
YD++++PGG G A F + ++K E+NK AIC++ + E ++GKK
Sbjct: 29 YDLIVLPGGLGGAEAFASSEKLV--NMLKKQAESNKPYGAICASPALVFEPHGLLKGKKA 86
Query: 133 TTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTSNE 188
T + + +Q IE ++ D NL T GPG +LE + ++EK E
Sbjct: 87 TAFPAMCSKLTDQSH------IEHRVLVDGNLITSRGPGTSLEFALAIVEKFYGRE 136
>gi|39586901|emb|CAE62836.1| Hypothetical protein CBG07015 [Caenorhabditis briggsae]
Length = 192
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 53/202 (26%), Positives = 86/202 (42%), Gaps = 13/202 (6%)
Query: 1 MKKIAIFLF--EGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELR 58
M K A+ L EGAE E+ DV ++ + + K + +KC G E+
Sbjct: 1 MSKSALILLAPEGAEESEVIIPGDVLTRGDIQVVYASLEGKCPKTGMMKPVKCAKGAEIM 60
Query: 59 AEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAV 118
D+V+D YD++IIPGG G + K N L+K F++ +I AIC+
Sbjct: 61 PSAAF--DDVKDK-KYDIVIIPGGPGSS---KLAENSCVGSLLKDQFKSGGLIGAICAGP 114
Query: 119 INLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSF 178
LL + + Y + +K K E+ +V + T GPG A E +
Sbjct: 115 TVLLSHGIMVDEVTGHYTVKDKLVDGGYKFS-----EDRVVVSGKVITSQGPGTAFEFAL 169
Query: 179 RLLEKLTSNENVNIIKDNMFLK 200
+++E + E +K + K
Sbjct: 170 KIVEIMQGAEKAESLKKPLCFK 191
>gi|156370244|ref|XP_001628381.1| predicted protein [Nematostella vectensis]
gi|156215356|gb|EDO36318.1| predicted protein [Nematostella vectensis]
Length = 192
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 53/201 (26%), Positives = 86/201 (42%), Gaps = 25/201 (12%)
Query: 6 IFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEKIITE 65
+ L EGAE E DV V + V ++ + + C+ R ++ +
Sbjct: 11 VILAEGAEEMEAVITADVLRRGKV-------NTVVAGLTGPDPVVCS-----RQVQVKPD 58
Query: 66 DNVEDFFD---YDVLIIPGGF-GKANFFK-DKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ED YD +I+PGG G N K D+ +I ++ +E +I+ AIC+
Sbjct: 59 MGLEDALKKVPYDAVILPGGLTGAQNLAKSDQVGQILREQ----YEAGRIVAAICAGPTA 114
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
LL GK+VT+Y + + Y E+ +V D NL T GPG A E L
Sbjct: 115 LLAHGVGGGKRVTSYPSFKDKMTGK---YGYTYSEDRVVRDGNLITSRGPGTAFEFGIEL 171
Query: 181 LEKLTSNEN-VNIIKDNMFLK 200
+ + ++ + + M LK
Sbjct: 172 VRAIRGDDGAADGLASQMLLK 192
>gi|110633837|ref|YP_674045.1| intracellular protease, PfpI family [Mesorhizobium sp. BNC1]
gi|110284821|gb|ABG62880.1| intracellular protease, PfpI family [Mesorhizobium sp. BNC1]
Length = 186
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 35/132 (26%), Positives = 65/132 (49%), Gaps = 9/132 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG ++ +K + E ED YD L++PGG + + + + IK F+ ++K+I
Sbjct: 54 WGRPVKVDKTLDEARPED---YDALVLPGGQINPDLLRVEKAAL--DFIKSFWNDSKVIG 108
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
A+C A L+E+ ++G++VT+Y N + + ++V D L T PG+
Sbjct: 109 AVCHAPWLLVETGILKGRRVTSYHSIKTDVINAGGKWE----DSQVVTDQGLVTSRNPGD 164
Query: 173 ALELSFRLLEKL 184
+L E++
Sbjct: 165 LDAFCDKLAEEI 176
>gi|81865403|sp|Q7TQ35|PARK7_MESAU Protein DJ-1 (Parkinson disease protein 7 homolog)
(Contraception-associated protein 1)
gi|32452351|emb|CAD24072.2| CAP1 protein [Mesocricetus auratus]
Length = 189
Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats.
Identities = 52/203 (25%), Positives = 92/203 (45%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E D+ G+K + V ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDIM---RRAGIK----VTVAGLAGKDPVQCS-----RDVM 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + + K+++K +I AIC+
Sbjct: 52 ICPDTSLEDAKKQGPYDVVVLPGGNLGAQNL---SESPVVKEILKEQESRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ + N +Y+ E + +D + T GPG + E +
Sbjct: 109 PTALLAHEIGFGSKVTTHPGAKDKMMNG-SHYSY--SESRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L+ E + +K + LK
Sbjct: 166 LAIVEALSGKEAADQVKAPLVLK 188
>gi|56201615|dbj|BAD73062.1| putative 4-methyl-5(B-hydroxyethyl)-thiazol monophosphate
biosynthesis enzyme [Oryza sativa (japonica
cultivar-group)]
Length = 426
Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats.
Identities = 37/144 (25%), Positives = 68/144 (47%), Gaps = 12/144 (8%)
Query: 61 KIITEDNVEDFFD--YDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
K++ + V D +D++ +PGG G AN ++ K+ +K++K E + AIC+
Sbjct: 86 KLVADGRVADLEGEAFDLIALPGGMPGSANL---RDCKVLEKMVKKQAEQGGLYAAICAT 142
Query: 118 -VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALEL 176
+ L ++G K T Y F + +IP+ +V D N T GP A+E
Sbjct: 143 PAVTLAHWGLLKGLKATCY-----PSFMEKFTAEIIPVNSRVVVDRNAVTSQGPATAIEY 197
Query: 177 SFRLLEKLTSNENVNIIKDNMFLK 200
+ L+E+L E + ++++
Sbjct: 198 ALALVEQLYGKEKSEEVAGPLYVR 221
Score = 53.9 bits (128), Expect = 6e-06, Method: Composition-based stats.
Identities = 40/134 (29%), Positives = 69/134 (51%), Gaps = 20/134 (14%)
Query: 73 DYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLES-TYIRGKK 131
++D++++PGG A K + K+ L+K E+NK AIC++ +LE ++GKK
Sbjct: 306 EFDLIVMPGGLPGAQ--KLSSTKVLVDLLKKQAESNKPYGAICASPAYVLEPHGLLKGKK 363
Query: 132 VTTY-----LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTS 186
T++ LL ++ + +V D NL T PG+A E + ++EKL
Sbjct: 364 ATSFPPMAHLLTDQS-----------ACDSRVVVDGNLITSKAPGSATEFALAIVEKLFG 412
Query: 187 NEN-VNIIKDNMFL 199
E V+I K+ +F+
Sbjct: 413 REKAVSIAKELIFM 426
>gi|115435298|ref|NP_001042407.1| Os01g0217800 [Oryza sativa (japonica cultivar-group)]
gi|113531938|dbj|BAF04321.1| Os01g0217800 [Oryza sativa (japonica cultivar-group)]
Length = 513
Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats.
Identities = 37/144 (25%), Positives = 68/144 (47%), Gaps = 12/144 (8%)
Query: 61 KIITEDNVEDFFD--YDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
K++ + V D +D++ +PGG G AN ++ K+ +K++K E + AIC+
Sbjct: 173 KLVADGRVADLEGEAFDLIALPGGMPGSANL---RDCKVLEKMVKKQAEQGGLYAAICAT 229
Query: 118 -VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALEL 176
+ L ++G K T Y F + +IP+ +V D N T GP A+E
Sbjct: 230 PAVTLAHWGLLKGLKATCY-----PSFMEKFTAEIIPVNSRVVVDRNAVTSQGPATAIEY 284
Query: 177 SFRLLEKLTSNENVNIIKDNMFLK 200
+ L+E+L E + ++++
Sbjct: 285 ALALVEQLYGKEKSEEVAGPLYVR 308
Score = 53.9 bits (128), Expect = 6e-06, Method: Composition-based stats.
Identities = 40/134 (29%), Positives = 69/134 (51%), Gaps = 20/134 (14%)
Query: 73 DYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLES-TYIRGKK 131
++D++++PGG A K + K+ L+K E+NK AIC++ +LE ++GKK
Sbjct: 393 EFDLIVMPGGLPGAQ--KLSSTKVLVDLLKKQAESNKPYGAICASPAYVLEPHGLLKGKK 450
Query: 132 VTTY-----LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTS 186
T++ LL ++ + +V D NL T PG+A E + ++EKL
Sbjct: 451 ATSFPPMAHLLTDQS-----------ACDSRVVVDGNLITSKAPGSATEFALAIVEKLFG 499
Query: 187 NEN-VNIIKDNMFL 199
E V+I K+ +F+
Sbjct: 500 REKAVSIAKELIFM 513
>gi|117924316|ref|YP_864933.1| DJ-1 family protein [Magnetococcus sp. MC-1]
gi|117608072|gb|ABK43527.1| DJ-1 family protein [Magnetococcus sp. MC-1]
Length = 183
Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats.
Identities = 47/166 (28%), Positives = 73/166 (43%), Gaps = 22/166 (13%)
Query: 31 GLKEFRDIKVETISYKESIKCTWGGEL-------RAEKIITEDNVEDFFD--YDVLIIPG 81
G +E I + I + I CT G + R I+ + +E D +D++ +PG
Sbjct: 12 GSEEMEAITIVNILRRAQIDCTLAGTVEGPIRCSRGSVIVPDTTLEAVMDMPFDLIALPG 71
Query: 82 GF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTY--LLD 138
G G + +D L++ K I AIC+A L + + GKK T Y LLD
Sbjct: 72 GQPGTTHLDEDPR---MHTLLQRMHAEGKFITAICAAPTILAHAGLLTGKKATCYPTLLD 128
Query: 139 NKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKL 184
L + I +V D N+ T +GPG A++ + L+E L
Sbjct: 129 T------LHGAETVAIHG-VVCDGNIITSTGPGTAMDFALTLVETL 167
>gi|125524923|gb|EAY73037.1| hypothetical protein OsI_000884 [Oryza sativa (indica
cultivar-group)]
Length = 427
Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats.
Identities = 37/144 (25%), Positives = 68/144 (47%), Gaps = 12/144 (8%)
Query: 61 KIITEDNVEDFFD--YDVLIIPGGF-GKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
K++ + V D +D++ +PGG G AN ++ K+ +K++K E + AIC+
Sbjct: 78 KLVADGRVADLEGEAFDLIALPGGMPGSANL---RDCKVLEKMVKKQAEQGGLYAAICAT 134
Query: 118 -VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALEL 176
+ L ++G K T Y F + +IP+ +V D N T GP A+E
Sbjct: 135 PAVTLAHWGLLKGLKATCY-----PSFMEKFTAEIIPVNSRVVVDRNAVTSQGPATAIEY 189
Query: 177 SFRLLEKLTSNENVNIIKDNMFLK 200
+ L+E+L E + ++++
Sbjct: 190 ALALVEQLYGKEKSEEVAGPLYVR 213
>gi|118474065|ref|YP_892402.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Campylobacter fetus subsp. fetus 82-40]
gi|118413291|gb|ABK81711.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Campylobacter fetus subsp. fetus 82-40]
Length = 179
Score = 55.1 bits (131), Expect = 2e-06, Method: Composition-based stats.
Identities = 55/201 (27%), Positives = 93/201 (46%), Gaps = 27/201 (13%)
Query: 3 KIAIFLFEGAELFEIASFTDVF---GWNNV-VGLKEFRDIKVETISYKESIKCTWGGELR 58
K+A+ L +G E E + DV G + V VGL I IS ++
Sbjct: 2 KVAVMLVDGFEEIEATTIIDVLRRAGIDAVFVGLNSDTAIGAHNIS------------MK 49
Query: 59 AEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAV 118
A+ + N ++F D++++PGG A + K+ K+ +K++K F E +K I AIC+A
Sbjct: 50 ADTAFDDINFDNF---DMIVLPGGLPGAEYLA-KSEKL-QKVLKDFDEKDKFIGAICAAP 104
Query: 119 INLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSF 178
L ++ + G T Y F ++ ++ +V D N+ T GP A+E +
Sbjct: 105 W-ALSTSNVLGDSYTCY-----PGFEKVVAKGGYVSDKNVVIDGNIITSKGPATAMEFAL 158
Query: 179 RLLEKLTSNENVNIIKDNMFL 199
L++ L NE +KD +
Sbjct: 159 ELVKVLQGNEKYIEVKDGLLF 179
>gi|154506032|ref|ZP_02042770.1| hypothetical protein RUMGNA_03574 [Ruminococcus gnavus ATCC 29149]
gi|153793531|gb|EDN75951.1| hypothetical protein RUMGNA_03574 [Ruminococcus gnavus ATCC 29149]
Length = 205
Score = 54.7 bits (130), Expect = 3e-06, Method: Composition-based stats.
Identities = 47/193 (24%), Positives = 85/193 (44%), Gaps = 18/193 (9%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE 60
MK+IA+FL EG E E + TD+ V + +++ ++++ + G + A+
Sbjct: 24 MKQIAVFLAEGFEEIEGLTVTDLLRRAGVT-------VTNVSVTGEKTVHGSHGIGVEAD 76
Query: 61 KIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ E +F D+L++PGG K+ + L+K F+ + + AIC+A
Sbjct: 77 ALFEE---MEFEGMDMLVLPGGMPGTKHLKEHRD--LCALLKEFYAKERYLAAICAAPTV 131
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
E ++ G+K Y + N EE + D ++ T G G A+ + +L
Sbjct: 132 FGELGFLEGRKACCYPGMESGLSHAETN------EEPVNVDGHMITSRGLGTAIPFALKL 185
Query: 181 LEKLTSNENVNII 193
+E L E I
Sbjct: 186 IELLCGKEKAEEI 198
>gi|24653499|ref|NP_610916.1| DJ-1alpha CG6646-PA [Drosophila melanogaster]
gi|21627206|gb|AAF58316.2| CG6646-PA [Drosophila melanogaster]
Length = 217
Score = 54.7 bits (130), Expect = 3e-06, Method: Composition-based stats.
Identities = 48/190 (25%), Positives = 84/190 (44%), Gaps = 21/190 (11%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K I L GAE E DV ++ + V + E +KC+ R+
Sbjct: 31 KNALIILAPGAEEMEFTISADVLRRGKIL-------VTVAGLHDCEPVKCS-----RSVV 78
Query: 62 IITEDNVEDFF---DYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAV 118
I+ + ++E+ DYDV+++PGG N+ +++ +I AIC+A
Sbjct: 79 IVPDTSLEEAVTRGDYDVVVLPGGLAGNKALM--NSSAVGDVLRCQESKGGLIAAICAAP 136
Query: 119 INLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSF 178
L + +GK +T++ D K QLK ++ +V+D N+ T GPG + +
Sbjct: 137 TALAKHGIGKGKSITSHP-DMK---PQLKELYCYIDDKTVVQDGNIITSRGPGTTFDFAL 192
Query: 179 RLLEKLTSNE 188
++ E+L E
Sbjct: 193 KITEQLVGAE 202
>gi|57086915|ref|XP_536733.1| PREDICTED: similar to DJ-1 protein isoform 1 [Canis familiaris]
gi|73956704|ref|XP_858995.1| PREDICTED: similar to DJ-1 protein isoform 5 [Canis familiaris]
gi|73956706|ref|XP_859031.1| PREDICTED: similar to DJ-1 protein isoform 6 [Canis familiaris]
Length = 189
Score = 54.7 bits (130), Expect = 3e-06, Method: Composition-based stats.
Identities = 54/203 (26%), Positives = 93/203 (45%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + V ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTVAGLAGKDPVQCS-----RDVI 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+I+PGG G N + + K+++K +I AIC+
Sbjct: 52 ICPDASLEDAKKEGPYDVVILPGGNLGAQNLCE---SAAVKEILKEQENRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y+ E + +D + T GPG + E +
Sbjct: 109 PTALLAHEIGFGSKVTTHPLAKDKMMNG-SHYSY--SENRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L+ + + +K + LK
Sbjct: 166 LAIVEALSGKDVADQVKAPLVLK 188
>gi|66267684|dbj|BAD98543.1| DJ-1 [Pseudemys nelsoni]
Length = 189
Score = 54.7 bits (130), Expect = 3e-06, Method: Composition-based stats.
Identities = 55/204 (26%), Positives = 93/204 (45%), Gaps = 24/204 (11%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E TDV G+K + + ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPTDVM---RRAGIK----VTIAGLTGKDPVQCS-----RDVF 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNK-IIVAICS 116
I + ++ED YDV+++PGG G N ++ + L+ EN K +I AIC+
Sbjct: 52 ICPDASLEDARKEGPYDVVVLPGGNLGAQNL--SESPAVKDILVDQ--ENRKGLIAAICA 107
Query: 117 AVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALEL 176
L+ G+KVTT+ L + +K + E + +D N T GPG + E
Sbjct: 108 GPTALMAHGIGFGRKVTTHPLAKDK---MMKGEHYKYSESRVEKDGNFLTSRGPGTSFEF 164
Query: 177 SFRLLEKLTSNENVNIIKDNMFLK 200
++E L E + +K + LK
Sbjct: 165 GLAIVEILMGKEVADQVKAPLILK 188
>gi|114707002|ref|ZP_01439901.1| proteinase [Fulvimarina pelagi HTCC2506]
gi|114537552|gb|EAU40677.1| proteinase [Fulvimarina pelagi HTCC2506]
Length = 258
Score = 54.7 bits (130), Expect = 3e-06, Method: Composition-based stats.
Identities = 49/187 (26%), Positives = 82/187 (43%), Gaps = 23/187 (12%)
Query: 3 KIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESI-----KCTWGGEL 57
KIAI +G FE T+ G G V IS K + W E+
Sbjct: 79 KIAILATDG---FEEVELTEPLGKLQAAG------ADVHVISNKSGTIRGWDQDHWNREI 129
Query: 58 RAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
+ +K ++E V D YD L++PGG + + + ++ FF + K + AIC A
Sbjct: 130 KVDKQLSEIRVTD---YDALVLPGGQINPDVLRADPKVV--SFVREFFNSKKPLAAICHA 184
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
L+E+ +RG+ VT+Y N N+ +++E+V L T PG+
Sbjct: 185 PWLLIEADVVRGRDVTSYKSIRTDIVNAGGNW----LDQEVVCHEALITSRNPGDLPAFI 240
Query: 178 FRLLEKL 184
+++E++
Sbjct: 241 DKIIEEV 247
>gi|74310993|ref|YP_309412.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
[Shigella sonnei Ss046]
gi|73854470|gb|AAZ87177.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
[Shigella sonnei Ss046]
Length = 198
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 82/172 (47%), Gaps = 24/172 (13%)
Query: 38 IKVETISYKE----SIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKN 93
IKV T S +I C+ G +L A+ + E V D +YDV+++PGG A F+D
Sbjct: 32 IKVTTASVASDGNLAITCSRGVKLLADAPLVE--VADG-EYDVIVLPGGIKGAECFRD-- 86
Query: 94 NKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIP 153
+ + + +K F + +I+ AIC+A +L I + + N F LK+ IP
Sbjct: 87 STLLVETVKQFHRSGRIVAAICAAPATVLVPHDI-------FPIGNMTGFPTLKDK--IP 137
Query: 154 IEE----EIVEDN--NLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFL 199
E+ +V D L T GPG A++ ++++ L E + + + +
Sbjct: 138 AEQWQDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKTHEVASQLVM 189
>gi|75512485|ref|ZP_00735024.1| COG0693: Putative intracellular protease/amidase [Escherichia coli
53638]
Length = 196
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 82/172 (47%), Gaps = 24/172 (13%)
Query: 38 IKVETISYKE----SIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKN 93
IKV T S +I C+ G +L A+ + E V D +YDV+++PGG A F+D
Sbjct: 30 IKVTTASVASDGNLAITCSRGVKLLADTPLVE--VADG-EYDVIVLPGGIKGAECFRD-- 84
Query: 94 NKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIP 153
+ + + +K F + +I+ AIC+A +L I + + N F LK+ IP
Sbjct: 85 STLLVETVKQFHRSGRIVAAICAAPATVLVPHDI-------FPIGNMTGFPTLKDK--IP 135
Query: 154 IEE----EIVEDN--NLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFL 199
E+ +V D L T GPG A++ ++++ L E + + + +
Sbjct: 136 AEQWLDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEVASQLVM 187
>gi|75176898|ref|ZP_00697014.1| COG0693: Putative intracellular protease/amidase [Shigella boydii
BS512]
gi|75189491|ref|ZP_00702758.1| COG0693: Putative intracellular protease/amidase [Escherichia coli
E24377A]
gi|75194643|ref|ZP_00704713.1| COG0693: Putative intracellular protease/amidase [Escherichia coli
HS]
gi|75209980|ref|ZP_00710169.1| COG0693: Putative intracellular protease/amidase [Escherichia coli
B171]
gi|75230149|ref|ZP_00716653.1| COG0693: Putative intracellular protease/amidase [Escherichia coli
B7A]
gi|75237194|ref|ZP_00721241.1| COG0693: Putative intracellular protease/amidase [Escherichia coli
E110019]
gi|75238751|ref|ZP_00722739.1| COG0693: Putative intracellular protease/amidase [Escherichia coli
F11]
gi|75256796|ref|ZP_00728401.1| COG0693: Putative intracellular protease/amidase [Escherichia coli
E22]
gi|83585657|ref|ZP_00924299.1| COG0693: Putative intracellular protease/amidase [Escherichia coli
101-1]
gi|157065626|gb|ABV04881.1| protein ThiJ [Escherichia coli HS]
gi|157080669|gb|ABV20377.1| protein ThiJ [Escherichia coli E24377A]
Length = 196
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 82/172 (47%), Gaps = 24/172 (13%)
Query: 38 IKVETISYKE----SIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKN 93
IKV T S +I C+ G +L A+ + E V D +YDV+++PGG A F+D
Sbjct: 30 IKVTTASVASDGNLAITCSRGVKLLADAPLVE--VADG-EYDVIVLPGGIKGAECFRD-- 84
Query: 94 NKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIP 153
+ + + +K F + +I+ AIC+A +L I + + N F LK+ IP
Sbjct: 85 STLLVETVKQFHRSGRIVAAICAAPATVLVPHDI-------FPIGNMTGFPTLKDK--IP 135
Query: 154 IEE----EIVEDN--NLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFL 199
E+ +V D L T GPG A++ ++++ L E + + + +
Sbjct: 136 AEQWQDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEVASQLVM 187
>gi|1100872|gb|AAA82704.1| ThiJ
gi|1773108|gb|AAB40180.1| 4-methyl-5(b-hydroxyethyl)-thiazole monophosphate biosynthesis
protein [Escherichia coli]
Length = 198
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 82/172 (47%), Gaps = 24/172 (13%)
Query: 38 IKVETISYKE----SIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKN 93
IKV T S +I C+ G +L A+ + E V D +YDV+++PGG A F+D
Sbjct: 32 IKVTTASVASDGNLAITCSRGVKLLADAPLVE--VADG-EYDVIVLPGGIKGAECFRD-- 86
Query: 94 NKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIP 153
+ + + +K F + +I+ AIC+A +L I + + N F LK+ IP
Sbjct: 87 STLLVETVKQFHRSGRIVAAICAAPATVLVPHDI-------FPIGNMTGFPTLKDK--IP 137
Query: 154 IEE----EIVEDN--NLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFL 199
E+ +V D L T GPG A++ ++++ L E + + + +
Sbjct: 138 AEQWLDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEVASQLVM 189
>gi|89107294|ref|AP_001074.1| hypothetical protein [Escherichia coli W3110]
gi|90111131|ref|NP_414958.4| conserved protein [Escherichia coli K12]
gi|124528865|ref|ZP_01700050.1| DJ-1 family protein [Escherichia coli B]
gi|6686342|sp|Q46948|THIJ_ECOLI Protein thiJ
gi|85674564|dbj|BAE76204.1| conserved hypothetical protein [Escherichia coli W3110]
gi|87081736|gb|AAC73527.2| conserved protein [Escherichia coli K12]
gi|124500202|gb|EAY47678.1| DJ-1 family protein [Escherichia coli B]
Length = 196
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 82/172 (47%), Gaps = 24/172 (13%)
Query: 38 IKVETISYKE----SIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKN 93
IKV T S +I C+ G +L A+ + E V D +YDV+++PGG A F+D
Sbjct: 30 IKVTTASVASDGNLAITCSRGVKLLADAPLVE--VADG-EYDVIVLPGGIKGAECFRD-- 84
Query: 94 NKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIP 153
+ + + +K F + +I+ AIC+A +L I + + N F LK+ IP
Sbjct: 85 STLLVETVKQFHRSGRIVAAICAAPATVLVPHDI-------FPIGNMTGFPTLKDK--IP 135
Query: 154 IEE----EIVEDN--NLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFL 199
E+ +V D L T GPG A++ ++++ L E + + + +
Sbjct: 136 AEQWLDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEVASQLVM 187
>gi|15800154|ref|NP_286166.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
[Escherichia coli O157:H7 EDL933]
gi|12513280|gb|AAG54774.1|AE005221_11 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
[Escherichia coli O157:H7 EDL933]
gi|13359935|dbj|BAB33901.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
[Escherichia coli O157:H7 str. Sakai]
Length = 198
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 82/172 (47%), Gaps = 24/172 (13%)
Query: 38 IKVETISYKE----SIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKN 93
IKV T S +I C+ G +L A+ + E V D +YDV+++PGG A F+D
Sbjct: 32 IKVTTASVASDGNLAITCSRGVKLLADAPLVE--VADG-EYDVIVLPGGIKGAECFRD-- 86
Query: 94 NKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIP 153
+ + + +K F + +I+ AIC+A +L I + + N F LK+ IP
Sbjct: 87 STLLVETVKQFHRSGRIVAAICAAPATVLVPHDI-------FPIGNMTGFPTLKDK--IP 137
Query: 154 IEE----EIVEDN--NLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFL 199
E+ +V D L T GPG A++ ++++ L E + + + +
Sbjct: 138 AEQWQDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEVASQLVM 189
>gi|82407737|pdb|2AB0|A Chain A, Crystal Structure Of E. Coli Protein Yajl (Thij)
gi|82407738|pdb|2AB0|B Chain B, Crystal Structure Of E. Coli Protein Yajl (Thij)
Length = 205
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 82/172 (47%), Gaps = 24/172 (13%)
Query: 38 IKVETISYKE----SIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKN 93
IKV T S +I C+ G +L A+ + E V D +YDV+++PGG A F+D
Sbjct: 30 IKVTTASVASDGNLAITCSRGVKLLADAPLVE--VADG-EYDVIVLPGGIKGAECFRD-- 84
Query: 94 NKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIP 153
+ + + +K F + +I+ AIC+A +L I + + N F LK+ IP
Sbjct: 85 STLLVETVKQFHRSGRIVAAICAAPATVLVPHDI-------FPIGNMTGFPTLKDK--IP 135
Query: 154 IEE----EIVEDN--NLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFL 199
E+ +V D L T GPG A++ ++++ L E + + + +
Sbjct: 136 AEQWLDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEVASQLVM 187
>gi|26246430|ref|NP_752469.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Escherichia coli CFT073]
gi|91209493|ref|YP_539479.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Escherichia coli UTI89]
gi|117622684|ref|YP_851597.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
[Escherichia coli APEC O1]
gi|26106828|gb|AAN79013.1|AE016756_196 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Escherichia coli CFT073]
gi|91071067|gb|ABE05948.1| 4-methyl-5(B-hydroxyethyl)-thiazole monophosphate biosynthesis
enzyme [Escherichia coli UTI89]
gi|115511808|gb|ABI99882.1| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
[Escherichia coli APEC O1]
Length = 198
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 82/172 (47%), Gaps = 24/172 (13%)
Query: 38 IKVETISYKE----SIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKN 93
IKV T S +I C+ G +L A+ + E V D +YDV+++PGG A F+D
Sbjct: 32 IKVTTASVASDGNLAITCSRGVKLLADAPLVE--VADG-EYDVIVLPGGIKGAECFRD-- 86
Query: 94 NKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIP 153
+ + + +K F + +I+ AIC+A +L I + + N F LK+ IP
Sbjct: 87 STLLVETVKQFHRSGRIVAAICAAPATVLVPHDI-------FPIGNMTGFPTLKDK--IP 137
Query: 154 IEE----EIVEDN--NLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFL 199
E+ +V D L T GPG A++ ++++ L E + + + +
Sbjct: 138 AEQWQDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEVASQLVM 189
>gi|38703865|ref|NP_308505.2| 4-methyl-5(beta-hydroxyethyl)-thiazole monophosphate synthesis
[Escherichia coli O157:H7 str. Sakai]
Length = 196
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 82/172 (47%), Gaps = 24/172 (13%)
Query: 38 IKVETISYKE----SIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKN 93
IKV T S +I C+ G +L A+ + E V D +YDV+++PGG A F+D
Sbjct: 30 IKVTTASVASDGNLAITCSRGVKLLADAPLVE--VADG-EYDVIVLPGGIKGAECFRD-- 84
Query: 94 NKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIP 153
+ + + +K F + +I+ AIC+A +L I + + N F LK+ IP
Sbjct: 85 STLLVETVKQFHRSGRIVAAICAAPATVLVPHDI-------FPIGNMTGFPTLKDK--IP 135
Query: 154 IEE----EIVEDN--NLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFL 199
E+ +V D L T GPG A++ ++++ L E + + + +
Sbjct: 136 AEQWQDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEVASQLVM 187
>gi|118403904|ref|NP_001072131.1| DJ-1 protein [Sus scrofa]
gi|67038668|gb|AAY63803.1| DJ-1 protein [Sus scrofa]
Length = 189
Score = 54.3 bits (129), Expect = 4e-06, Method: Composition-based stats.
Identities = 53/203 (26%), Positives = 91/203 (44%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + V ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTVAGLAGKDPVQCS-----RDVV 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + K ++K + +I AIC+
Sbjct: 52 ICPDASLEDAKKEGPYDVVVLPGGNLGAQNL---SESAAVKDILKEQEKRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y+ E + +D + T GPG + E +
Sbjct: 109 PTALLAHEIGFGSKVTTHPLAKDKMMNG-SHYSY--SENRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L E + +K + L+
Sbjct: 166 LAIVEALAGKEVADQVKAPLVLR 188
>gi|16924002|ref|NP_476484.1| DJ-1 protein [Rattus norvegicus]
gi|56404680|sp|O88767|PARK7_RAT Protein DJ-1 (Parkinson disease protein 7 homolog)
(Contraception-associated protein 1) (Protein CAP1)
(Fertility protein SP22)
gi|5478755|gb|AAD43956.1|AF157511_1 fertility protein SP22 [Rattus norvegicus]
gi|5478757|gb|AAD43957.1|AF157512_1 fertility protein SP22 [Rattus norvegicus]
gi|3250916|emb|CAA07434.1| CAP1 [Rattus norvegicus]
gi|149024696|gb|EDL81193.1| rCG30883, isoform CRA_a [Rattus norvegicus]
gi|149024697|gb|EDL81194.1| rCG30883, isoform CRA_a [Rattus norvegicus]
gi|149024698|gb|EDL81195.1| rCG30883, isoform CRA_a [Rattus norvegicus]
Length = 189
Score = 54.3 bits (129), Expect = 5e-06, Method: Composition-based stats.
Identities = 49/200 (24%), Positives = 91/200 (45%), Gaps = 16/200 (8%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E D+ G+K + V ++ K+ ++C+ + +
Sbjct: 4 KRALVILAKGAEEMETVIPVDIM---RRAGIK----VTVAGLAGKDPVQCSRDVVICPDT 56
Query: 62 IITEDNVEDFFDYDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVIN 120
+ E + YDV+++PGG G N + + K+++K +I AIC+
Sbjct: 57 SLEEAKTQG--PYDVVVLPGGNLGAQNL---SESALVKEILKEQENRKGLIAAICAGPTA 111
Query: 121 LLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRL 180
LL G KVT++ L + N +Y+ E + +D + T GPG + E + +
Sbjct: 112 LLAHEVGFGCKVTSHPLAKDKMMNG-SHYSY--SESRVEKDGLILTSRGPGTSFEFALAI 168
Query: 181 LEKLTSNENVNIIKDNMFLK 200
+E L+ + N +K + LK
Sbjct: 169 VEALSGKDMANQVKAPLVLK 188
>gi|83941938|ref|ZP_00954400.1| putative intracellular protease, PfpI family protein [Sulfitobacter
sp. EE-36]
gi|83847758|gb|EAP85633.1| putative intracellular protease, PfpI family protein [Sulfitobacter
sp. EE-36]
Length = 186
Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats.
Identities = 37/137 (27%), Positives = 66/137 (48%), Gaps = 9/137 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG + A+K +++ V+D YD +++PGG + + NK LIK F + K +
Sbjct: 54 WGNIVAADKALSDVTVDD---YDAIVLPGGQINPDLLR--ANKDAVSLIKSFADAGKTVA 108
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
AIC A L+E+ I+G+ T+Y +KN + E+V D + T P +
Sbjct: 109 AICHAPWLLIEAGIIKGRAATSY----ASIATDVKNAGAHYEDSEVVVDQGIITSRSPED 164
Query: 173 ALELSFRLLEKLTSNEN 189
+++E++ E+
Sbjct: 165 LDAFIAKIVEEVEEGEH 181
>gi|150389262|ref|YP_001319311.1| ThiJ/PfpI domain protein [Alkaliphilus metalliredigens QYMF]
gi|149949124|gb|ABR47652.1| ThiJ/PfpI domain protein [Alkaliphilus metalliredigens QYMF]
Length = 198
Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats.
Identities = 46/188 (24%), Positives = 91/188 (48%), Gaps = 13/188 (6%)
Query: 3 KIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEKI 62
K+ I +F+ E+ + A +VF + + V TIS K ++ G LR +
Sbjct: 7 KVGILIFDDVEVLDFAGPFEVFSVTTIA--NQMNPFHVSTISEKGNMITARNG-LRVQP- 62
Query: 63 ITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLL 122
+ + ED D+LIIPGG G ++ +N + I E +++ ++C+ + L
Sbjct: 63 --DYSFEDMPQLDILIIPGGLGARE--REIHNDTLIRWISNQIEKVELMTSVCTGALLLA 118
Query: 123 ESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEE--EIVEDNNLFTCSGPGNALELSFRL 180
++ +RGKK TT+ +R + + I ++ + V++ N+ T G + +SF +
Sbjct: 119 KAGLLRGKKATTHWASLERL---QREFPEIYVQHGVKFVDEGNIVTSGGISAGINMSFHI 175
Query: 181 LEKLTSNE 188
+++L +E
Sbjct: 176 VKRLLGSE 183
>gi|149185899|ref|ZP_01864214.1| protease [Erythrobacter sp. SD-21]
gi|148830460|gb|EDL48896.1| protease [Erythrobacter sp. SD-21]
Length = 185
Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats.
Identities = 42/135 (31%), Positives = 60/135 (44%), Gaps = 8/135 (5%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG ++ +K T D V D YD L++PGG + + I +++ F K I
Sbjct: 50 WGDSVKVDK--TVDEVSDCSGYDALLLPGGQMNPDILRMNERAI--AIVREFNMAGKPIA 105
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
AIC A L E+ I+ K VT + LKN +++E V D NL T P +
Sbjct: 106 AICHAPWLLAEADLIKDKTVTAW----PSIRTDLKNAGANVVDKEAVVDGNLITSRNPDD 161
Query: 173 ALELSFRLLEKLTSN 187
S L+E L N
Sbjct: 162 IPAFSKALIEMLGEN 176
>gi|54302922|ref|YP_132915.1| hypothetical protein PBPRB1243 [Photobacterium profundum SS9]
gi|46916350|emb|CAG23115.1| hypothetical protein [Photobacterium profundum SS9]
Length = 216
Score = 53.9 bits (128), Expect = 5e-06, Method: Composition-based stats.
Identities = 48/182 (26%), Positives = 83/182 (45%), Gaps = 23/182 (12%)
Query: 14 LFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE---KIITEDNVED 70
LFE DVFG + GL + Y+ + GG + + K++T+ + +D
Sbjct: 22 LFEQFELLDVFGPLEMFGLLPDK--------YQLKLVSEQGGAISSSQGIKVLTDYSFQD 73
Query: 71 FFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGK 130
F D+LIIPGG G N + NN + N + I ++C+ + L + +
Sbjct: 74 IFLTDILIIPGGEGIKN---EVNNSNLLAWLNKTAPNIQYICSVCTGAVILASAGLLEDC 130
Query: 131 KVTTYLLDNKRYFNQLKNY----NVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTS 186
K TT NK++++ + Y + P+ V+D +FT SG +++S L+ + S
Sbjct: 131 KATT----NKKHYHWVTRYGKDIDWQPV-ARWVQDGTVFTSSGTAAGIDMSLALIAEQYS 185
Query: 187 NE 188
E
Sbjct: 186 EE 187
>gi|146340883|ref|YP_001205931.1| putative intracellular proteinase [Bradyrhizobium sp. ORS278]
gi|146193689|emb|CAL77706.1| putative intracellular proteinase [Bradyrhizobium sp. ORS278]
Length = 186
Score = 53.5 bits (127), Expect = 6e-06, Method: Composition-based stats.
Identities = 33/138 (23%), Positives = 66/138 (47%), Gaps = 9/138 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG ++ +K + + D YD +++PGG + + + + + IK F KI+
Sbjct: 53 WGRPVKVDKTLDQAQASD---YDAIVLPGGQINPDLLRLEPKAL--QFIKDIFNAKKIVA 107
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
A+C A L+E+ +G+K+T+Y K + N + E+V D + T PG+
Sbjct: 108 AVCHAPWLLIETGIAKGRKMTSY----KSIKTDVANAGAQWQDAEVVVDQGVITSRNPGD 163
Query: 173 ALELSFRLLEKLTSNENV 190
S +++E++ ++
Sbjct: 164 LEAFSAKIIEEVKEGRHL 181
>gi|90418768|ref|ZP_01226679.1| putative intracellular protease/amidase [Aurantimonas sp. SI85-9A1]
gi|90336848|gb|EAS50553.1| putative intracellular protease/amidase [Aurantimonas sp. SI85-9A1]
Length = 187
Score = 53.5 bits (127), Expect = 6e-06, Method: Composition-based stats.
Identities = 34/132 (25%), Positives = 64/132 (48%), Gaps = 9/132 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
W E+ +K ++E V D YD L++PGG + + + ++ FF + K +
Sbjct: 54 WDKEITVDKTLSEVRVTD---YDALVLPGGQINPDVLRADPKVV--SFVREFFNSKKPLA 108
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
AIC A L+E+ +RG+ +T+Y +KN +++E+V L T PG+
Sbjct: 109 AICHAPWLLIEADVVRGRNITSY----NSIKTDVKNAGGNWLDQEVVVHEALITSRNPGD 164
Query: 173 ALELSFRLLEKL 184
+++E++
Sbjct: 165 IPAFVAKIIEEV 176
>gi|83855414|ref|ZP_00948944.1| putative intracellular protease, PfpI family protein [Sulfitobacter
sp. NAS-14.1]
gi|83843257|gb|EAP82424.1| putative intracellular protease, PfpI family protein [Sulfitobacter
sp. NAS-14.1]
Length = 186
Score = 53.5 bits (127), Expect = 6e-06, Method: Composition-based stats.
Identities = 36/137 (26%), Positives = 66/137 (48%), Gaps = 9/137 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG + A++ +++ V+D YD +++PGG + + NK LIK F + K +
Sbjct: 54 WGNTVAADQALSDVTVDD---YDAIVLPGGQINPDLLR--ANKDAVSLIKSFADAGKTVA 108
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
AIC A L+E+ I+G+ T+Y +KN + E+V D + T P +
Sbjct: 109 AICHAPWLLIEAGIIKGRAATSY----ASIATDVKNAGAHYEDSEVVVDQGIITSRSPED 164
Query: 173 ALELSFRLLEKLTSNEN 189
+++E++ E+
Sbjct: 165 LDAFIAKIVEEVEEGEH 181
>gi|20807466|ref|NP_622637.1| putative intracellular protease/amidase [Thermoanaerobacter
tengcongensis MB4]
gi|20515992|gb|AAM24241.1| putative intracellular protease/amidase [Thermoanaerobacter
tengcongensis MB4]
Length = 168
Score = 53.5 bits (127), Expect = 6e-06, Method: Composition-based stats.
Identities = 33/112 (29%), Positives = 55/112 (49%), Gaps = 7/112 (6%)
Query: 73 DYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKV 132
DYD ++IPGG+ + + ++ F +K + KII AIC + S ++GK+V
Sbjct: 63 DYDAVVIPGGYSPDHMRRCQDTVNF---VKEMCQQQKIIAAICHGPWMMASSCDLKGKRV 119
Query: 133 TTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKL 184
T++ N Y ++EE+V D NL T P + + ++EKL
Sbjct: 120 TSFFSIKDDLINAGAQY----VDEEVVIDGNLITSRTPNDLVAFVKAIIEKL 167
>gi|126354153|ref|ZP_01711164.1| intracellular protease, PfpI family [Caldivirga maquilingensis
IC-167]
gi|126312807|gb|EAZ65261.1| intracellular protease, PfpI family [Caldivirga maquilingensis
IC-167]
Length = 191
Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats.
Identities = 41/151 (27%), Positives = 73/151 (48%), Gaps = 22/151 (14%)
Query: 25 GWNNVVGLKEFRDIKV---------ETISYKESIKCTWGGELRAEKIITEDNVEDFFDYD 75
GW+ V +D++ ET S K W K ++E E+ YD
Sbjct: 30 GWDVDVAAPSRKDLRTVVHDFEPGWETYSEKPGYLFKW-----VTKTLSEVKPEE---YD 81
Query: 76 VLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTY 135
L+IPGG + + ++ K+++++FFE K + AIC A L + ++G+++T+Y
Sbjct: 82 GLVIPGG-RMPEYVRVVASEDVKRIVRHFFETKKPVAAICHAPQILAAAGVVKGRRMTSY 140
Query: 136 LLDNKRYFNQLKNYNVIPIEEEIVEDNNLFT 166
+ +++N I ++EE+V D NL T
Sbjct: 141 IAVRP----EVENNGGIWVDEEVVVDGNLVT 167
>gi|114690169|ref|XP_521268.2| PREDICTED: similar to DJ-1 isoform 2 [Pan troglodytes]
Length = 189
Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats.
Identities = 53/203 (26%), Positives = 89/203 (43%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + + ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTIAGLAGKDPVQCS-----RDVV 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + K+++K +I AIC+
Sbjct: 52 ICPDASLEDAKKEGPYDVVVLPGGNLGAQNL---SESAAVKEILKEQENRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y E + +D + T GPG + E +
Sbjct: 109 PTALLAHEIGFGSKVTTHPLAKDKMMNG-GHYTY--SENRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L E +K + LK
Sbjct: 166 LAIVEALNGKEVAAQVKSPLVLK 188
>gi|86130142|ref|ZP_01048742.1| proteinase [Cellulophaga sp. MED134]
gi|85818817|gb|EAQ39976.1| proteinase [Dokdonia donghaensis MED134]
Length = 182
Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats.
Identities = 37/137 (27%), Positives = 69/137 (50%), Gaps = 9/137 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
W GE T DNV DY+ L++PGG + + ++ + I+ FF+ +K +
Sbjct: 50 WSGEYDVTD--TVDNVSAK-DYNALMLPGGVINPDKLRRNDDALI--FIRDFFKQSKPVA 104
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
AIC A L+E+ + G+ +T++ LKN + +++E+V D L T PG+
Sbjct: 105 AICHAPQLLIEADVVNGRTMTSF----NSIKTDLKNAGALWVDKEVVVDEALVTSRNPGD 160
Query: 173 ALELSFRLLEKLTSNEN 189
+ +L+E++ ++
Sbjct: 161 LEAFNAKLIEEIKEGKH 177
>gi|31543380|ref|NP_009193.2| DJ-1 protein [Homo sapiens]
gi|56404943|sp|Q99497|PARK7_HUMAN Protein DJ-1 (Oncogene DJ1) (Parkinson disease protein 7)
gi|34810587|pdb|1UCF|A Chain A, The Crystal Structure Of Dj-1, A Protein Related To Male
Fertility And Parkinson's Disease
gi|34810588|pdb|1UCF|B Chain B, The Crystal Structure Of Dj-1, A Protein Related To Male
Fertility And Parkinson's Disease
gi|34810650|pdb|1P5F|A Chain A, Crystal Structure Of Human Dj-1
gi|37927769|pdb|1Q2U|A Chain A, Crystal Structure Of Dj-1RS AND IMPLICATION ON FAMILIAL
Parkinson's Disease
gi|39654550|pdb|1PS4|A Chain A, Crystal Structure Of Dj-1
gi|134105362|pdb|2OR3|A Chain A, Pre-Oxidation Complex Of Human Dj-1
gi|134105363|pdb|2OR3|B Chain B, Pre-Oxidation Complex Of Human Dj-1
gi|2460318|gb|AAC12806.1| RNA-binding protein regulatory subunit [Homo sapiens]
gi|5731801|emb|CAB52550.1| Parkinson disease (autosomal recessive, early onset) 7 [Homo
sapiens]
gi|14198257|gb|AAH08188.1| Parkinson disease (autosomal recessive, early onset) 7 [Homo
sapiens]
gi|30038760|dbj|BAA09603.2| DJ-1 protein [Homo sapiens]
gi|119591997|gb|EAW71591.1| Parkinson disease (autosomal recessive, early onset) 7 [Homo
sapiens]
Length = 189
Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats.
Identities = 54/203 (26%), Positives = 89/203 (43%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + V ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTVAGLAGKDPVQCS-----RDVV 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + K+++K +I AIC+
Sbjct: 52 ICPDASLEDAKKEGPYDVVVLPGGNLGAQNL---SESAAVKEILKEQENRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y E + +D + T GPG + E +
Sbjct: 109 PTALLAHEIGFGSKVTTHPLAKDKMMNG-GHYTY--SENRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L E +K + LK
Sbjct: 166 LAIVEALNGKEVAAQVKAPLVLK 188
>gi|75761540|ref|ZP_00741499.1| Transcriptional regulator, AraC family [Bacillus thuringiensis
serovar israelensis ATCC 35646]
gi|74490970|gb|EAO54227.1| Transcriptional regulator, AraC family [Bacillus thuringiensis
serovar israelensis ATCC 35646]
Length = 198
Score = 53.5 bits (127), Expect = 7e-06, Method: Composition-based stats.
Identities = 44/181 (24%), Positives = 91/181 (50%), Gaps = 10/181 (5%)
Query: 4 IAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEKII 63
+ IFLF E+ + A +VF +V + E + V T+S + G K+
Sbjct: 7 VGIFLFNEVEVLDFAGPFEVF---SVTEVNEEKPFTVYTVSENGEMITARNGL----KVQ 59
Query: 64 TEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLE 123
+ ++E+ D+LIIPGG G + + N+I K I+ + K++ ++C+ + L +
Sbjct: 60 PDYSIENLPPVDILIIPGGLGARKY--EIKNEIVIKWIRQQMKEVKLMTSVCTGALLLAK 117
Query: 124 STYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEK 183
+ + G K TT+ +++ N+ +N VI + V++ ++ T +G + ++F +++
Sbjct: 118 AGLLEGLKATTHWASIEKFKNEFQNVEVIE-NVKFVDEGHIITSAGISAGINMAFHIVKN 176
Query: 184 L 184
L
Sbjct: 177 L 177
>gi|86143389|ref|ZP_01061791.1| proteinase [Flavobacterium sp. MED217]
gi|85830294|gb|EAQ48754.1| proteinase [Leeuwenhoekiella blandensis MED217]
Length = 181
Score = 53.5 bits (127), Expect = 8e-06, Method: Composition-based stats.
Identities = 44/194 (22%), Positives = 89/194 (45%), Gaps = 23/194 (11%)
Query: 1 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESI-----KCTWGG 55
MKK+AI G E E+ S + + +V+ +S K + WG
Sbjct: 1 MKKVAILATNGFEESELTSPLEAM---------KKEGFQVDIVSEKSGTIKAWAETDWGK 51
Query: 56 ELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAIC 115
+ +K + E + +D Y+ L++PGG + + N + I+ FF+ +K + AIC
Sbjct: 52 DYNVDKTLDEVSAKD---YNALVLPGGVINPDQLRRNENALV--FIRDFFKQHKPVAAIC 106
Query: 116 SAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALE 175
A L+ + + G+ +T++ K L+N + +++E+V D L T P +
Sbjct: 107 HAPQVLISADVVEGRTLTSFSSIKK----DLENAGALWVDKEVVVDEALVTSRNPNDLPA 162
Query: 176 LSFRLLEKLTSNEN 189
+ +++E++ ++
Sbjct: 163 FNAKVIEEINEGKH 176
>gi|15669157|ref|NP_247962.1| intracellular protease (pfpI) [Methanocaldococcus jannaschii DSM
2661]
gi|3024948|sp|Q58377|Y967_METJA Uncharacterized protein MJ0967
gi|1499805|gb|AAB98972.1| intracellular protease (pfpI) [Methanocaldococcus jannaschii DSM
2661]
Length = 205
Score = 53.5 bits (127), Expect = 8e-06, Method: Composition-based stats.
Identities = 47/171 (27%), Positives = 75/171 (43%), Gaps = 16/171 (9%)
Query: 29 VVGLKEFRDIKV-------ETISYKESIKCTWGGE---LRAEKIITEDNVEDFF--DYDV 76
V+ K+FRD ++ E+ K + T GE + KI E + D DY
Sbjct: 37 VIAPKDFRDEELFEPMAVFESNGLKVDVVSTTKGECVGMLGNKITVEKTIYDVNPDDYVA 96
Query: 77 LIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYL 136
++I GG G + NN +L+K F+ NK++ AIC + + L + ++GKK T Y
Sbjct: 97 IVIVGGIGSKEYLW--NNTKLIELVKEFYNKNKVVSAICLSPVVLARAGILKGKKATVY- 153
Query: 137 LDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTSN 187
+LK I + +V D N+ T P A +L+ + N
Sbjct: 154 -PAPEAIEELKKAGAIYEDRGVVVDGNVITAKSPDYARLFGLEVLKAIEKN 203
>gi|33358055|pdb|1PE0|A Chain A, Crystal Structure Of The K130r Mutant Of Human Dj-1
gi|33358056|pdb|1PE0|B Chain B, Crystal Structure Of The K130r Mutant Of Human Dj-1
Length = 197
Score = 53.5 bits (127), Expect = 8e-06, Method: Composition-based stats.
Identities = 54/203 (26%), Positives = 89/203 (43%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + V ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTVAGLAGKDPVQCS-----RDVV 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + K+++K +I AIC+
Sbjct: 52 ICPDASLEDAKKEGPYDVVVLPGGNLGAQNL---SESAAVKEILKEQENRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y E + +D + T GPG + E +
Sbjct: 109 PTALLAHEIGFGSKVTTHPLARDKMMNG-GHYTY--SENRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L E +K + LK
Sbjct: 166 LAIVEALNGKEVAAQVKAPLVLK 188
>gi|89275119|gb|ABD66014.1| SP22 [Xenopus laevis]
Length = 163
Score = 53.1 bits (126), Expect = 8e-06, Method: Composition-based stats.
Identities = 47/173 (27%), Positives = 79/173 (45%), Gaps = 17/173 (9%)
Query: 31 GLKEFRDIKVETISYKESIKCTWGGELRAEKIITEDNVEDFFDYDVLIIPGG-FGKANFF 89
GLK + V ++ K+ ++C+ L + + E + YDV+++PGG G N
Sbjct: 4 GLK----VTVAGLNGKDPVQCSRDVMLCPDTSLEEARTQG--PYDVVVLPGGNLGAQNL- 56
Query: 90 KDKNNKIFKKLIKYFFENNKIIVAICSAVINLLESTYIRGKKVTTYLLDNKRYFN--QLK 147
+ + K+++K +I AIC+ L GK +TT+ L + N Q K
Sbjct: 57 --SESPVVKEVLKEQEAKKGLIAAICAGPTALTVHGVGIGKSITTHPLAKDKIVNPDQYK 114
Query: 148 NYNVIPIEEEIVEDNNLFTCSGPGNALELSFRLLEKLTSNENVNIIKDNMFLK 200
Y+ EE +V+D N T GPG + E + ++ L E +K + LK
Sbjct: 115 -YS----EERVVKDENFITSRGPGTSFEFALEIVCTLLGKEVAEQVKSPLLLK 162
>gi|149695427|ref|XP_001495448.1| PREDICTED: similar to DJ-1 protein [Equus caballus]
Length = 189
Score = 53.1 bits (126), Expect = 8e-06, Method: Composition-based stats.
Identities = 52/203 (25%), Positives = 92/203 (45%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + + ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTIAGLAGKDPVQCS-----RDVV 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + K+++K + +I AIC+
Sbjct: 52 ICPDASLEDAKKQGPYDVVVLPGGNLGAQNL---SESAAVKEILKEQEKRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ + N +Y+ E + +D + T GPG + E +
Sbjct: 109 PTALLAHEIGFGSKVTTHPQAKDKIMNG-SHYSY--SENRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L+ E + +K + LK
Sbjct: 166 LAIVEALSGKEVADQVKAPLVLK 188
>gi|42543006|pdb|1J42|A Chain A, Crystal Structure Of Human Dj-1
gi|16751471|dbj|BAB71782.1| DJ-1 [Homo sapiens]
Length = 189
Score = 53.1 bits (126), Expect = 9e-06, Method: Composition-based stats.
Identities = 54/203 (26%), Positives = 89/203 (43%), Gaps = 22/203 (10%)
Query: 2 KKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAEK 61
K+ + L +GAE E DV G+K + V ++ K+ ++C+ R
Sbjct: 4 KRALVILAKGAEEMETVIPVDVM---RRAGIK----VTVAGLAGKDPVQCS-----RDVV 51
Query: 62 IITEDNVEDFFD---YDVLIIPGG-FGKANFFKDKNNKIFKKLIKYFFENNKIIVAICSA 117
I + ++ED YDV+++PGG G N + K+++K +I AIC+
Sbjct: 52 ICPDASLEDAKKEGPYDVVVLPGGNLGAQNL---SESAAVKEILKEQENRKGLIAAICAG 108
Query: 118 VINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGNALELS 177
LL G KVTT+ L + N +Y E + +D + T GPG + E +
Sbjct: 109 PTALLAHEIGCGSKVTTHPLAKDKMMNG-GHYTY--SENRVEKDGLILTSRGPGTSFEFA 165
Query: 178 FRLLEKLTSNENVNIIKDNMFLK 200
++E L E +K + LK
Sbjct: 166 LAIVEALNGKEVAAQVKAPLVLK 188
>gi|92115071|ref|YP_574999.1| Peptidase C56, PfpI [Chromohalobacter salexigens DSM 3043]
gi|91798161|gb|ABE60300.1| Peptidase C56, PfpI [Chromohalobacter salexigens DSM 3043]
Length = 204
Score = 53.1 bits (126), Expect = 9e-06, Method: Composition-based stats.
Identities = 35/132 (26%), Positives = 64/132 (48%), Gaps = 9/132 (6%)
Query: 53 WGGELRAEKIITEDNVEDFFDYDVLIIPGGFGKANFFKDKNNKIFKKLIKYFFENNKIIV 112
WG A+K +++ D DY L++PGG + + + + ++ FFE K +
Sbjct: 72 WGDTYEADKALSD---VDSTDYHALVLPGGLFNPDELRLNDQAL--DFVRGFFEAGKPVA 126
Query: 113 AICSAVINLLESTYIRGKKVTTYLLDNKRYFNQLKNYNVIPIEEEIVEDNNLFTCSGPGN 172
AIC A L+ + + G+++T+ LKN ++E++V DN L T P +
Sbjct: 127 AICHAPWILINAGVVEGRRMTSV----ASVAEDLKNAGAEWVDEKVVVDNGLVTSRTPKD 182
Query: 173 ALELSFRLLEKL 184
+ +L+E+L
Sbjct: 183 LDAFNDKLIEEL 194
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.321 0.141 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 748,963,966
Number of Sequences: 5470121
Number of extensions: 33010424
Number of successful extensions: 83535
Number of sequences better than 1.0e-05: 182
Number of HSP's better than 0.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 167
Number of HSP's that attempted gapping in prelim test: 83282
Number of HSP's gapped (non-prelim): 192
length of query: 200
length of database: 1,894,087,724
effective HSP length: 126
effective length of query: 74
effective length of database: 1,204,852,478
effective search space: 89159083372
effective search space used: 89159083372
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 126 (53.1 bits)
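
The footer figures above can be cross-checked by hand. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (lambda*S - ln K) / ln 2, E = search space * 2^-bits) and assumes the second Lambda/K/H row holds the gapped parameters. The function names are illustrative only. Per-hit Expect values in the report can differ slightly from this estimate because composition-based statistics rescale scores for each subject.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 200
DB_LETTERS = 1_894_087_724
DB_SEQS = 5_470_121
HSP_LEN = 126          # "effective HSP length"
LAMBDA_GAPPED = 0.267  # assumed: second Lambda/K/H row is the gapped set
K_GAPPED = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(raw_score, lam, k, search_space):
    """Approximate E-value for a raw score in the given search space."""
    return search_space * 2 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
    )
    print("effective length of query:", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:", space)
    for raw in (126, 127):
        bits = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
        e = expect(raw, LAMBDA_GAPPED, K_GAPPED, space)
        print(f"raw {raw}: {bits:.1f} bits, E ~ {e:.0e}")
```

The three lengths reproduce the footer exactly, and raw scores 126 and 127 map to the 53.1 and 53.5 bit values printed on the alignment headers.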