BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PG0810
(163 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34540669|ref|NP_905148.1| G/U mismatch-specific DNA glyc... 332 4e-90
gi|91217499|ref|ZP_01254458.1| G/U mismatch-specific DNA gl... 164 2e-39
gi|126650554|ref|ZP_01722777.1| hypothetical protein BB1490... 132 8e-30
gi|74317075|ref|YP_314815.1| hypothetical protein Tbd_1057 ... 128 1e-28
gi|146302058|ref|YP_001196649.1| G:T/U mismatch-specific DN... 127 2e-28
gi|88803537|ref|ZP_01119062.1| hypothetical protein PI23P_0... 125 1e-27
gi|150384749|ref|ZP_01923421.1| Uracil-DNA glycosylase supe... 123 3e-27
gi|17545710|ref|NP_519112.1| hypothetical protein RSc0991 [... 120 3e-26
gi|88602308|ref|YP_502486.1| hypothetical protein Mhun_1017... 120 3e-26
gi|153888898|ref|ZP_02010035.1| Uracil-DNA glycosylase supe... 119 4e-26
gi|150389112|ref|YP_001319161.1| hypothetical protein Amet_... 119 5e-26
gi|114797714|ref|YP_761300.1| hypothetical protein HNE_2611... 119 8e-26
gi|67479591|ref|XP_655177.1| conserved hypothetical protein... 118 1e-25
gi|73542027|ref|YP_296547.1| Uracil-DNA glycosylase superfa... 117 2e-25
gi|121530110|ref|ZP_01662720.1| uracil-DNA glycosylase supe... 117 2e-25
gi|113868612|ref|YP_727101.1| G:T/U mismatch-specific DNA g... 117 2e-25
gi|118032494|ref|ZP_01503943.1| conserved hypothetical prot... 117 3e-25
gi|115422711|emb|CAJ49238.1| conserved hypothetical protein... 116 3e-25
gi|113932522|ref|ZP_01418424.1| Uracil-DNA glycosylase supe... 116 5e-25
gi|83745749|ref|ZP_00942806.1| hypothetical cytosolic prote... 115 6e-25
gi|91782505|ref|YP_557711.1| hypothetical protein Bxe_A3324... 115 7e-25
gi|56552478|ref|YP_163317.1| G:T/U mismatch-specific DNA gl... 115 8e-25
gi|20089353|ref|NP_615428.1| hypothetical protein MA0462 [M... 114 2e-24
gi|71909478|ref|YP_287065.1| hypothetical protein Daro_3867... 114 2e-24
gi|21227753|ref|NP_633675.1| hypothetical protein MM_1651 [... 113 4e-24
gi|156864151|gb|EDO57582.1| hypothetical protein CLOL250_01... 113 4e-24
gi|119899892|ref|YP_935105.1| hypothetical protein azo3603 ... 113 5e-24
gi|118696705|ref|ZP_01554786.1| G-T/U mismatch-specific DNA... 112 6e-24
gi|152981569|ref|YP_001352380.1| G:T/U mismatch-specific DN... 112 6e-24
gi|82491806|gb|ABB77797.1| Mug G:T/U mismatch-specific DNA ... 112 7e-24
gi|106886756|ref|ZP_01354087.1| conserved hypothetical prot... 112 7e-24
gi|152990863|ref|YP_001356585.1| hypothetical protein NIS_1... 111 1e-23
gi|118719341|ref|ZP_01571872.1| uracil-DNA glycosylase supe... 111 1e-23
gi|124894388|gb|EAY68268.1| hypothetical protein BDAG_00976... 111 1e-23
gi|73668280|ref|YP_304295.1| hypothetical protein Mbar_A073... 111 1e-23
gi|118039985|ref|ZP_01511383.1| conserved hypothetical prot... 110 2e-23
gi|94311441|ref|YP_584651.1| Uracil-DNA glycosylase superfa... 110 2e-23
gi|84362465|ref|ZP_00987086.1| COG3663: G:T/U mismatch-spec... 110 2e-23
gi|23098520|ref|NP_691986.1| hypothetical protein OB1065 [O... 110 2e-23
gi|16126728|ref|NP_421292.1| hypothetical protein CC_2489 [... 110 2e-23
gi|91776836|ref|YP_546592.1| T/U mismatch-specific DNA glyc... 110 2e-23
gi|115352432|ref|YP_774271.1| G-T/U mismatch-specific DNA g... 110 2e-23
gi|153811093|ref|ZP_01963761.1| hypothetical protein RUMOBE... 110 3e-23
gi|114776245|ref|ZP_01451290.1| hypothetical protein SPV1_0... 110 3e-23
gi|94496341|ref|ZP_01302918.1| Uracil-DNA glycosylase super... 110 4e-23
gi|87199194|ref|YP_496451.1| Uracil-DNA glycosylase superfa... 109 4e-23
gi|118707225|ref|ZP_01559811.1| G-T/U mismatch-specific DNA... 109 4e-23
gi|134296531|ref|YP_001120266.1| G-T/U mismatch-specific DN... 108 7e-23
gi|107023282|ref|YP_621609.1| G-T/U mismatch-specific DNA g... 108 8e-23
gi|53718725|ref|YP_107711.1| hypothetical protein BPSL1089 ... 108 1e-22
gi|67643813|ref|ZP_00442556.1| COG3663: G:T/U mismatch-spec... 108 1e-22
gi|78067153|ref|YP_369922.1| G-T/U mismatch-specific DNA gl... 107 2e-22
gi|84355380|ref|ZP_00980265.1| COG3663: G:T/U mismatch-spec... 107 2e-22
gi|148556860|ref|YP_001264442.1| Uracil-DNA glycosylase sup... 107 2e-22
gi|124872348|gb|EAY64064.1| conserved hypothetical protein ... 107 3e-22
gi|82533860|ref|ZP_00892913.1| hypothetical protein Bpse110... 107 3e-22
gi|152992204|ref|YP_001357925.1| hypothetical protein SUN_0... 106 5e-22
gi|118052267|ref|ZP_01520814.1| conserved hypothetical prot... 105 7e-22
gi|21244388|ref|NP_643970.1| hypothetical protein XAC3663 [... 105 8e-22
gi|121610938|ref|YP_998745.1| hypothetical protein Veis_402... 104 1e-21
gi|33596028|ref|NP_883671.1| hypothetical protein BPP1370 [... 104 2e-21
gi|78049337|ref|YP_365512.1| hypothetical protein XCV3781 [... 104 2e-21
gi|88797134|ref|ZP_01112724.1| hypothetical protein MED297_... 103 4e-21
gi|30249529|ref|NP_841599.1| hypothetical protein NE1560 [N... 103 4e-21
gi|134094007|ref|YP_001099082.1| hypothetical protein HEAR0... 103 4e-21
gi|58580341|ref|YP_199357.1| hypothetical protein XOO0718 [... 102 5e-21
gi|118728844|ref|ZP_01577361.1| conserved hypothetical prot... 102 6e-21
gi|78485657|ref|YP_391582.1| hypothetical protein Tcr_1313 ... 102 8e-21
gi|32265840|ref|NP_859872.1| hypothetical protein HH0341 [H... 100 2e-20
gi|119878933|ref|ZP_01645796.1| conserved hypothetical prot... 100 3e-20
gi|83720475|ref|YP_441511.1| hypothetical protein BTH_I0957... 99 7e-20
gi|120609759|ref|YP_969437.1| hypothetical protein Aave_106... 99 8e-20
gi|91789726|ref|YP_550678.1| hypothetical protein Bpro_3884... 97 3e-19
gi|56477748|ref|YP_159337.1| T/U mismatch-specific DNA glyc... 97 3e-19
gi|154151743|ref|YP_001405361.1| hypothetical protein Mboo_... 97 3e-19
gi|34497666|ref|NP_901881.1| hypothetical protein CV_2211 [... 97 3e-19
gi|121595712|ref|YP_987608.1| hypothetical protein Ajs_3415... 97 4e-19
gi|157101474|gb|EAT97723.2| Mug G:T/U mismatch-specific DNA... 96 5e-19
gi|86135463|ref|ZP_01054044.1| hypothetical protein MED152_... 96 8e-19
gi|153160315|ref|ZP_01925940.1| conserved hypothetical prot... 96 8e-19
gi|86152465|ref|ZP_01070670.1| T/U mismatch-specific DNA gl... 96 9e-19
gi|89901985|ref|YP_524456.1| hypothetical protein Rfer_3216... 95 1e-18
gi|149193946|ref|ZP_01871044.1| hypothetical protein CMTB2_... 95 1e-18
gi|150017648|ref|YP_001309902.1| Mug G:T/U mismatch-specifi... 95 2e-18
gi|113938045|ref|ZP_01423916.1| conserved hypothetical prot... 95 2e-18
gi|121613018|ref|YP_001000930.1| hypothetical protein CJJ81... 95 2e-18
gi|153951695|ref|YP_001397656.1| hypothetical protein JJD26... 94 2e-18
gi|153852709|ref|ZP_01994146.1| hypothetical protein DORLON... 94 2e-18
gi|15792578|ref|NP_282401.1| hypothetical protein Cj1254 [C... 94 2e-18
gi|33601416|ref|NP_888976.1| hypothetical protein BB2436 [B... 94 3e-18
gi|86150913|ref|ZP_01069129.1| conserved hypothetical prote... 93 5e-18
gi|118462625|ref|YP_880656.1| T/U mismatch-specific DNA gly... 92 8e-18
gi|57238128|ref|YP_179378.1| hypothetical protein CJE1391 [... 92 1e-17
gi|118475705|ref|YP_892419.1| Mug G:T/U mismatch-specific D... 91 2e-17
gi|120405405|ref|YP_955234.1| hypothetical protein Mvan_445... 91 2e-17
gi|21230015|ref|NP_635932.1| hypothetical protein XCC0540 [... 91 2e-17
gi|66769991|ref|YP_244753.1| hypothetical protein XC_3693 [... 91 3e-17
gi|108800914|ref|YP_641111.1| hypothetical protein Mmcs_395... 91 3e-17
gi|41408606|ref|NP_961442.1| hypothetical protein MAP2508 [... 90 4e-17
gi|126436539|ref|YP_001072230.1| hypothetical protein Mjls_... 90 4e-17
gi|121606132|ref|YP_983461.1| hypothetical protein Pnap_324... 90 4e-17
gi|118473922|ref|YP_889275.1| T/U mismatch-specific DNA gly... 89 9e-17
gi|154175180|ref|YP_001407814.1| hypothetical protein CCV52... 87 3e-16
gi|118586810|ref|ZP_01544245.1| hypothetical protein OENOO_... 87 4e-16
gi|57168156|ref|ZP_00367295.1| conserved hypothetical prote... 86 8e-16
gi|91772987|ref|YP_565679.1| hypothetical protein Mbur_0989... 86 8e-16
gi|116490596|ref|YP_810140.1| G:T/U mismatch-specific DNA g... 85 2e-15
gi|119025890|ref|YP_909735.1| hypothetical protein BAD_0872... 83 5e-15
gi|57241603|ref|ZP_00369548.1| G/U mismatch-specific DNA gl... 81 2e-14
gi|145222830|ref|YP_001133508.1| hypothetical protein Mflv_... 75 1e-12
gi|121530107|ref|ZP_01662717.1| uracil-DNA glycosylase supe... 74 2e-12
gi|153878422|ref|ZP_02004581.1| conserved hypothetical prot... 63 7e-09
gi|139438549|ref|ZP_01772065.1| Hypothetical protein COLAER... 59 1e-07
>gi|34540669|ref|NP_905148.1| G/U mismatch-specific DNA glycosylase, putative [Porphyromonas
gingivalis W83]
gi|34396983|gb|AAQ66047.1| G/U mismatch-specific DNA glycosylase, putative [Porphyromonas
gingivalis W83]
Length = 163
Score = 332 bits (851), Expect = 4e-90, Method: Composition-based stats.
Identities = 163/163 (100%), Positives = 163/163 (100%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY
Sbjct: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE
Sbjct: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
Query: 121 AMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFSL 163
AMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFSL
Sbjct: 121 AMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFSL 163
>gi|91217499|ref|ZP_01254458.1| G/U mismatch-specific DNA glycosylase, putative [Psychroflexus
torquis ATCC 700755]
gi|91184384|gb|EAS70768.1| G/U mismatch-specific DNA glycosylase, putative [Psychroflexus
torquis ATCC 700755]
Length = 254
Score = 164 bits (415), Expect = 2e-39, Method: Composition-based stats.
Identities = 74/161 (45%), Positives = 104/161 (64%)
Query: 2 ASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYA 61
++ K SF PI + ILILG++PGD+S+ G+YY HPRNRFW +++ + LP Y+
Sbjct: 93 STYSKASFDPISNSDTAILILGTMPGDKSLELGEYYGHPRNRFWKIISTITNNELPLTYS 152
Query: 62 ERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEA 121
++ E+LL IG+WD+AH A RKGS D I DEEPND+ I R+ L I FNG K+E
Sbjct: 153 DKKELLLKFKIGIWDIAHKANRKGSLDSAIEDEEPNDLDDFIARHKNLKVIGFNGTKSET 212
Query: 122 MFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFS 162
+F K+F ++ + +PSTSPAN G D + K W +IF+
Sbjct: 213 LFDKYFDRKSGLKYVYLPSTSPANTGIDFDNICKLWRQIFT 253
>gi|126650554|ref|ZP_01722777.1| hypothetical protein BB14905_08188 [Bacillus sp. B14905]
gi|126592710|gb|EAZ86709.1| hypothetical protein BB14905_08188 [Bacillus sp. B14905]
Length = 173
Score = 132 bits (331), Expect = 8e-30, Method: Composition-based stats.
Identities = 69/161 (42%), Positives = 98/161 (60%), Gaps = 4/161 (2%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
++S K P+ D ++LI+GS+PG +S+ + QYY +PRN FWP++ +LL +PDDY
Sbjct: 4 VSSEVKNVLLPVVDSSTKVLIVGSMPGKQSLEKQQYYGNPRNHFWPIIGELLQIEVPDDY 63
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
+ER +L IGLWD + RKGS D I +E+PND ++L + P + + FNG KA
Sbjct: 64 SERMALLRKNAIGLWDTIEACERKGSLDAAIRNEKPNDFQTLFQSYPNIELVLFNGAKAF 123
Query: 121 AMFRKHFP-ETLAIRCL-LMPSTSPANAG--KTLDLLVKDW 157
+F+KH E LA R MPSTSP K+ D V+DW
Sbjct: 124 DVFKKHIGLELLAGRAYKKMPSTSPIPGKNIKSFDEKVEDW 164
>gi|74317075|ref|YP_314815.1| hypothetical protein Tbd_1057 [Thiobacillus denitrificans ATCC
25259]
gi|74056570|gb|AAZ97010.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC
25259]
Length = 171
Score = 128 bits (322), Expect = 1e-28, Method: Composition-based stats.
Identities = 70/165 (42%), Positives = 92/165 (55%), Gaps = 11/165 (6%)
Query: 5 HKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERT 64
H QSFPPI D H +LILGSLPG S+R QYYA P+N FW +M +L G YAER
Sbjct: 3 HAQSFPPIADTHARVLILGSLPGQTSLRMQQYYAQPQNAFWKIMGRLFGAGADVPYAERY 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAM 122
L+ + + LWDV SA R GS D I PND + + +P +H + FNG KA +
Sbjct: 63 RKLVRSGVALWDVCASAERPGSLDSAIVTSSVVPNDFAAFLASHPHIHLVCFNGAKAAGL 122
Query: 123 FRK-------HFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+R+ P +A R L PSTSPA+AG + + W+ +
Sbjct: 123 YRRLVLPDLPEAPRLVAYRTL--PSTSPAHAGMRFEEKLAHWSVV 165
>gi|146302058|ref|YP_001196649.1| G:T/U mismatch-specific DNA glycosylase-like protein
[Flavobacterium johnsoniae UW101]
gi|146156476|gb|ABQ07330.1| G:T/U mismatch-specific DNA glycosylase-like protein
[Flavobacterium johnsoniae UW101]
Length = 160
Score = 127 bits (320), Expect = 2e-28, Method: Composition-based stats.
Identities = 62/156 (39%), Positives = 87/156 (55%)
Query: 8 SFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEIL 67
SF PI ++LILG++PG +S+ QYY H +N FW M + + DY + +L
Sbjct: 5 SFSPISSPDSKVLILGTMPGTKSLELNQYYGHSQNNFWKFMFTIFKEEFSTDYETKKALL 64
Query: 68 LSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHF 127
L I LWDV R GS D I +E ND + ++++P +HTIAFNG+KA A F+K+
Sbjct: 65 LKNKIALWDVLQYCDRVGSLDSAIKNEIANDFETFLKQHPNIHTIAFNGQKAAAFFKKYV 124
Query: 128 PETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFSL 163
+ +PSTSPANA K + +W I SL
Sbjct: 125 HLKKDYTLITLPSTSPANASKAFQSKLDEWKVIASL 160
>gi|88803537|ref|ZP_01119062.1| hypothetical protein PI23P_00955 [Polaribacter irgensii 23-P]
gi|88780549|gb|EAR11729.1| hypothetical protein PI23P_00955 [Polaribacter irgensii 23-P]
Length = 159
Score = 125 bits (313), Expect = 1e-27, Method: Composition-based stats.
Identities = 62/149 (41%), Positives = 88/149 (59%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILL 68
F PI + +I ILG++PG +S+R+ QYYA+ RN FW +M L + ++Y ER + L
Sbjct: 6 FDPILPANPKIFILGTMPGKQSLRKQQYYANNRNSFWKIMYALTEEVFSENYTERIDTLK 65
Query: 69 SAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFP 128
HI +WD+ RK S D I +EEPN I +IE P + I FNG+KAE ++ K+F
Sbjct: 66 RNHIAIWDICQFGERKTSLDSDIKNEEPNPINEIIEMFPSVKQIVFNGKKAEKLYSKYFS 125
Query: 129 ETLAIRCLLMPSTSPANAGKTLDLLVKDW 157
E IR + STSPANA + + +W
Sbjct: 126 EIAHIRYDTVLSTSPANARYSFREKLNNW 154
>gi|150384749|ref|ZP_01923421.1| Uracil-DNA glycosylase superfamily [Victivallis vadensis ATCC
BAA-548]
gi|150258845|gb|EDM96100.1| Uracil-DNA glycosylase superfamily [Victivallis vadensis ATCC
BAA-548]
Length = 169
Score = 123 bits (309), Expect = 3e-27, Method: Composition-based stats.
Identities = 65/144 (45%), Positives = 82/144 (56%), Gaps = 3/144 (2%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTE 65
K SFPP E+L LGSLPGDES+R+ QYYAHPRN FW +M +L G Y ER
Sbjct: 5 KHSFPPAAAPDAELLFLGSLPGDESLRQAQYYAHPRNAFWRIMGELFGFDHLLPYGERLA 64
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
L + LWDV S +R GS D I + PNDI L+ R P + I NG + + ++
Sbjct: 65 ELRRNRVALWDVVGSGVRPGSLDTAIREHRPNDIPGLLARCPGIRAIGCNGSTSHKLLKR 124
Query: 126 HFPETL---AIRCLLMPSTSPANA 146
H+PE + L +PSTSPA A
Sbjct: 125 HYPELFGRTGLTILALPSTSPAAA 148
>gi|17545710|ref|NP_519112.1| hypothetical protein RSc0991 [Ralstonia solanacearum GMI1000]
gi|17428004|emb|CAD14693.1| probable cog3663, g:t/u mismatch-specific dna glycosylase protein
[Ralstonia solanacearum]
Length = 166
Score = 120 bits (300), Expect = 3e-26, Method: Composition-based stats.
Identities = 65/157 (41%), Positives = 90/157 (57%), Gaps = 3/157 (1%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
++ PI D + +L+LGS PG+ S+ QYYAHPRN+FWPLM LL PLP YA R
Sbjct: 9 QRGLAPILDAEVRVLVLGSFPGEASLAAQQYYAHPRNQFWPLMGALLDLPLPAMPYAARV 68
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
LL+ HIG+WDV + +R+GS D I + ND L R PRL + FNG A F
Sbjct: 69 RALLARHIGVWDVLGACVREGSLDAAIRHPQANDFDLLHRRAPRLRRVGFNGGTA-GRFA 127
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIF 161
K F +++PS+SPA+A ++ + + W +
Sbjct: 128 KDFARA-GFETVVLPSSSPAHAARSFEQKLALWRSLL 163
>gi|88602308|ref|YP_502486.1| hypothetical protein Mhun_1017 [Methanospirillum hungatei JF-1]
gi|88187770|gb|ABD40767.1| conserved hypothetical protein [Methanospirillum hungatei JF-1]
Length = 159
Score = 120 bits (300), Expect = 3e-26, Method: Composition-based stats.
Identities = 65/157 (41%), Positives = 93/157 (59%), Gaps = 2/157 (1%)
Query: 4 VHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAER 63
+HK P+ +LILGS P SI+ G+YYAHPRN FW +M +L P Y R
Sbjct: 1 MHKCGLSPLLSSDCRVLILGSYPSVLSIKAGEYYAHPRNMFWKIMEIILNIPHNQTYTTR 60
Query: 64 TEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMF 123
+LL+ IGLWDV + R+ SAD I D PN++++LI + + I NGR AE F
Sbjct: 61 ISLLLNHGIGLWDVYAACSREKSADAHIKDPVPNELKTLILNHQDIRIIFLNGRAAEKGF 120
Query: 124 RKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
RK FP+ ++++ + +PS+SPA+A L+ +K W I
Sbjct: 121 RKFFPD-ISLQTIYLPSSSPAHAVH-LEEKIKRWREI 155
>gi|153888898|ref|ZP_02010035.1| Uracil-DNA glycosylase superfamily [Ralstonia pickettii 12D]
gi|151574646|gb|EDN39065.1| Uracil-DNA glycosylase superfamily [Ralstonia pickettii 12D]
Length = 165
Score = 119 bits (299), Expect = 4e-26, Method: Composition-based stats.
Identities = 65/144 (45%), Positives = 89/144 (61%), Gaps = 2/144 (1%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
K+ FPP+ D H +LILGSLPG+ S+ + QYYAH +N+FW L +++G+ LP +YA R
Sbjct: 4 KRCFPPVVDAHTRVLILGSLPGEASLAQSQYYAHKQNQFWRLAGEVIGQDLPGMEYAARL 63
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
++LL+ IGLWDV A R+GS D +I D ND+ LI P L IAFNG A +
Sbjct: 64 QVLLAHRIGLWDVVAEAKREGSLDSKIRDHAGNDLAGLIASLPELALIAFNGGTASRIGV 123
Query: 125 KHFPETLAIRCLL-MPSTSPANAG 147
K + +L +PS+SPA A
Sbjct: 124 KALGDIGDRHTILKLPSSSPAYAA 147
>gi|150389112|ref|YP_001319161.1| hypothetical protein Amet_1297 [Alkaliphilus metalliredigens QYMF]
gi|149948974|gb|ABR47502.1| conserved hypothetical protein [Alkaliphilus metalliredigens QYMF]
Length = 164
Score = 119 bits (299), Expect = 5e-26, Method: Composition-based stats.
Identities = 55/143 (38%), Positives = 88/143 (61%), Gaps = 1/143 (0%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
+SF PI D + ILILGS+P +S++ G+YY +PRN+FW ++ ++LG+P +DY ++ +
Sbjct: 5 ESFSPIIDANSHILILGSIPSVQSLQEGKYYGNPRNQFWKIIYEVLGQPFQEDYGKKVQF 64
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH 126
+ I +WDV S R+GS D I +E PND + ++ PR+ + FNG KA FR+
Sbjct: 65 IKEHGIAIWDVIESCHREGSLDSNIKEERPNDFNTFLKSYPRIKYVLFNGTKAHDTFRRK 124
Query: 127 FPETLAIRCL-LMPSTSPANAGK 148
+ + +PS+SPA+ K
Sbjct: 125 VGFSFEDKIFHKLPSSSPAHTMK 147
>gi|114797714|ref|YP_761300.1| hypothetical protein HNE_2611 [Hyphomonas neptunium ATCC 15444]
gi|114737888|gb|ABI76013.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444]
Length = 162
Score = 119 bits (297), Expect = 8e-26, Method: Composition-based stats.
Identities = 64/153 (41%), Positives = 82/153 (53%), Gaps = 2/153 (1%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
+ FPPI EILILGS+PG S+ GQYYAHPRN FWP+M L G Y +R +
Sbjct: 4 RGFPPIAAPDAEILILGSMPGIASLEAGQYYAHPRNAFWPIMCALFGACPDLSYPDRVRV 63
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH 126
L + +WDV R+GS D I E PND + +P + I NG KA A FRKH
Sbjct: 64 LARRKVAVWDVLQLCQREGSLDAAIRGEVPNDFATFFAAHPNIRHIGLNGGKAAASFRKH 123
Query: 127 FP--ETLAIRCLLMPSTSPANAGKTLDLLVKDW 157
+ + +++PSTSPA A L+ W
Sbjct: 124 AAQHQPASAEAVVLPSTSPAYASLNLEAKCALW 156
>gi|67479591|ref|XP_655177.1| conserved hypothetical protein [Entamoeba histolytica HM-1:IMSS]
Length = 164
Score = 118 bits (295), Expect = 1e-25, Method: Composition-based stats.
Identities = 61/153 (39%), Positives = 87/153 (56%), Gaps = 1/153 (0%)
Query: 8 SFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP-LPDDYAERTEI 66
SF PI ILILGS+PG ES+++ QYY HP+N FW +++ L +P Y R E+
Sbjct: 7 SFDPIIPSKPLILILGSMPGTESLKKQQYYGHPQNCFWSIISSLKSMTNVPPRYEHRIEL 66
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH 126
+ S I LWDV RKGS D I + +PN I L+ +P + T+ FNG+ ++ K
Sbjct: 67 IKSCQIALWDVCCQCERKGSLDSDIKEVKPNKINKLLIEHPTIKTVLFNGQGPSKLYHKF 126
Query: 127 FPETLAIRCLLMPSTSPANAGKTLDLLVKDWNR 159
F E I+ ++MP TSPANA ++ W +
Sbjct: 127 FKELPNIKYIIMPGTSPANARTKKQTKIEIWKK 159
>gi|73542027|ref|YP_296547.1| Uracil-DNA glycosylase superfamily [Ralstonia eutropha JMP134]
gi|72119440|gb|AAZ61703.1| Uracil-DNA glycosylase superfamily [Ralstonia eutropha JMP134]
Length = 167
Score = 117 bits (294), Expect = 2e-25, Method: Composition-based stats.
Identities = 66/145 (45%), Positives = 88/145 (60%), Gaps = 8/145 (5%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
K+ FPP+ D +LILGSLPG+ S+ +GQYYAH +N+FW L+ +G+ L DY R
Sbjct: 3 KRCFPPVVDAETRVLILGSLPGEASLAQGQYYAHKQNQFWRLVGDTIGEALVQMDYPLRL 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAM-- 122
+ LL+ HIGLWDV A R+GS D I D ND+ LI+ P L IAFNG AE +
Sbjct: 63 QTLLAHHIGLWDVVAEAKREGSLDSNIRDHAGNDLIGLIDSLPSLAVIAFNGGTAERIGL 122
Query: 123 --FRKHFPETLAIRCLLMPSTSPAN 145
R+H +R +PS+SPA+
Sbjct: 123 KALREHGDRHHIVR---LPSSSPAH 144
>gi|121530110|ref|ZP_01662720.1| uracil-DNA glycosylase superfamily [Ralstonia pickettii 12J]
gi|121302925|gb|EAX43893.1| uracil-DNA glycosylase superfamily [Ralstonia pickettii 12J]
Length = 167
Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats.
Identities = 63/142 (44%), Positives = 89/142 (62%), Gaps = 2/142 (1%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
K+ FPP+ D H +LILGSLPG+ S+ + QYYAH +N+FW L+ +++G+ L +Y R
Sbjct: 4 KRCFPPVVDAHTRVLILGSLPGEASLAQSQYYAHKQNQFWRLVGEVIGQDLAGMEYGARL 63
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
+ LL+ +GLWDV A R+GS D +I D ND+ SLI P L IAFNG AE +
Sbjct: 64 QALLAHRLGLWDVVAEARREGSLDSKIRDHAGNDLSSLIASLPNLAVIAFNGGTAERIGL 123
Query: 125 KHF-PETLAIRCLLMPSTSPAN 145
K + R + +PS+SPA+
Sbjct: 124 KALGSHGGSYRIVRLPSSSPAH 145
>gi|113868612|ref|YP_727101.1| G:T/U mismatch-specific DNA glycosylase [Ralstonia eutropha H16]
gi|113527388|emb|CAJ93733.1| G:T/U mismatch-specific DNA glycosylase [Ralstonia eutropha H16]
Length = 170
Score = 117 bits (293), Expect = 2e-25, Method: Composition-based stats.
Identities = 67/142 (47%), Positives = 89/142 (62%), Gaps = 2/142 (1%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
K+ FPP+ D ILILGSLPG+ S+ + QYYAH +N+FW L ++LG+ L +YA R
Sbjct: 4 KRCFPPVVDAGTRILILGSLPGEASLAQCQYYAHRQNQFWRLTGEVLGEDLVRMEYAARL 63
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
+ LL+ H+GLWDV A R+GS D I D ND+ SLIE PRL IAFNG A +
Sbjct: 64 QTLLAHHVGLWDVVAQAKREGSLDSNIRDHAGNDLASLIESLPRLAVIAFNGGTAARIGL 123
Query: 125 KHFPE-TLAIRCLLMPSTSPAN 145
K + R + +PS+SPA+
Sbjct: 124 KALGQHGDHHRIIRLPSSSPAH 145
>gi|118032494|ref|ZP_01503943.1| conserved hypothetical protein [Burkholderia phymatum STM815]
gi|117981792|gb|EAU96181.1| conserved hypothetical protein [Burkholderia phymatum STM815]
Length = 157
Score = 117 bits (292), Expect = 3e-25, Method: Composition-based stats.
Identities = 64/160 (40%), Positives = 91/160 (56%), Gaps = 9/160 (5%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERTE 65
+ FPP+ LILGS PG+ S+ QYYAHPRN+FW L+ ++G+PLP+ DY R E
Sbjct: 3 RGFPPVVAESTHTLILGSFPGEASLTATQYYAHPRNQFWRLLGAVIGEPLPELDYPTRLE 62
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
+L +G+WDV + R+GS D I + PND S E P L + FNG+ A
Sbjct: 63 RVLRHGLGVWDVLAACTREGSLDAAIRNATPNDFASFREYAPLLKKVCFNGKTA-----G 117
Query: 126 HFPETLA---IRCLLMPSTSPANAGKTLDLLVKDWNRIFS 162
F +A + L++PS+SPANA + D ++ W I +
Sbjct: 118 RFAPIIAAAGYQTLVLPSSSPANAILSFDQKLRLWRDILT 157
>gi|115422711|emb|CAJ49238.1| conserved hypothetical protein [Bordetella avium 197N]
Length = 168
Score = 116 bits (291), Expect = 3e-25, Method: Composition-based stats.
Identities = 60/158 (37%), Positives = 85/158 (53%), Gaps = 2/158 (1%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
Q P+ ILILG++PG S+ RGQYY +PRN FW + A L G DY R +
Sbjct: 11 QGLAPVARPDARILILGTMPGVVSLARGQYYGNPRNAFWGIAAALFGVDAAADYDARLQA 70
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
L++A + LWDV R+GS D I PND+ +P+L +A NG KA ++R
Sbjct: 71 LMAARVALWDVLADCERRGSLDSAIVASSVAPNDLPGFFAAHPQLEVVALNGAKAAQLYR 130
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFS 162
+H +R L +PSTSPA+A + + + W + S
Sbjct: 131 RHVVHPAGLRLLALPSTSPAHAAQRFEDKLTAWRPLIS 168
>gi|113932522|ref|ZP_01418424.1| Uracil-DNA glycosylase superfamily [Caulobacter sp. K31]
gi|113732743|gb|EAU13811.1| Uracil-DNA glycosylase superfamily [Caulobacter sp. K31]
Length = 166
Score = 116 bits (290), Expect = 5e-25, Method: Composition-based stats.
Identities = 66/153 (43%), Positives = 88/153 (57%), Gaps = 3/153 (1%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPL-PDDYAERT 64
K FPP+ D + +LILGSLPG+ S+ QYY +PRN FW LM +L PL P Y +R
Sbjct: 4 KSGFPPVTDANTRVLILGSLPGEASLAVQQYYGNPRNAFWRLMESVLDAPLVPLAYEDRL 63
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
LLS +GLWDV A R GS D I D ND+ +LIE P L +AFNG A +
Sbjct: 64 AALLSRGVGLWDVIAQAERPGSLDAAIRDPAANDLLALIETLPALRLVAFNGGAAAKLGS 123
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLDLLVKDW 157
+ + +A L +PS+SPA+A ++ + W
Sbjct: 124 RLIGDRVA--ALALPSSSPAHAARSFAQKAEIW 154
>gi|83745749|ref|ZP_00942806.1| hypothetical cytosolic protein [Ralstonia solanacearum UW551]
gi|83727439|gb|EAP74560.1| hypothetical cytosolic protein [Ralstonia solanacearum UW551]
Length = 186
Score = 115 bits (289), Expect = 6e-25, Method: Composition-based stats.
Identities = 64/157 (40%), Positives = 88/157 (56%), Gaps = 3/157 (1%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
++ PI D + +L+LGS PG+ S+ QYYAHPRN+FWPLM LL PLP YA R
Sbjct: 30 QRGLAPILDTEVRVLVLGSFPGEASLAAQQYYAHPRNQFWPLMGALLDLPLPAMSYAARV 89
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
LL HIG+WDV + +R+GS D I + ND L R P L + FNG A F
Sbjct: 90 RALLDRHIGVWDVLGACVREGSLDSAIRHPQANDFDLLHRRAPGLRRVGFNGGTA-GRFA 148
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIF 161
K F +++PS+SPA+A ++ + + W +
Sbjct: 149 KDFARA-GFETVVLPSSSPAHAARSFEQKLVLWRALL 184
>gi|91782505|ref|YP_557711.1| hypothetical protein Bxe_A3324 [Burkholderia xenovorans LB400]
gi|91686459|gb|ABE29659.1| Conserved hypothetical protein [Burkholderia xenovorans LB400]
Length = 157
Score = 115 bits (289), Expect = 7e-25, Method: Composition-based stats.
Identities = 64/160 (40%), Positives = 89/160 (55%), Gaps = 9/160 (5%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERTE 65
+ FPP+ LILGS PG+ S+ QYYAHPRN+FW L+ +L +PL D Y ER
Sbjct: 3 RGFPPVVAADTHTLILGSFPGEASLAATQYYAHPRNQFWRLLGTVLDEPLADLGYEERLR 62
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
+L+ IG+WDV + R+GS D I + PND SL E P+L + FNG+ A
Sbjct: 63 RVLAHGIGVWDVLAACTREGSLDAAIRNASPNDFASLREHAPKLAKVCFNGKTA-----G 117
Query: 126 HFPETLAIR---CLLMPSTSPANAGKTLDLLVKDWNRIFS 162
F + L++PS+SPANA + D ++ W I +
Sbjct: 118 RFAPVIGAAGYVTLVLPSSSPANAMLSFDQKLRLWRDILT 157
>gi|56552478|ref|YP_163317.1| G:T/U mismatch-specific DNA glycosylase [Zymomonas mobilis subsp.
mobilis ZM4]
gi|6580777|gb|AAF18283.1| unknown; orf1 [Zymomonas mobilis]
gi|56544052|gb|AAV90206.1| G:T/U mismatch-specific DNA glycosylase [Zymomonas mobilis subsp.
mobilis ZM4]
Length = 163
Score = 115 bits (288), Expect = 8e-25, Method: Composition-based stats.
Identities = 63/142 (44%), Positives = 89/142 (62%), Gaps = 2/142 (1%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDD-YAERTE 65
+ F P+ + + +LILGSLPG S+ + QYY HPRN+FW LM+ ++G+ + Y ER E
Sbjct: 3 RCFAPVTNINSRLLILGSLPGVASLEKAQYYGHPRNQFWRLMSDIIGEDIESAAYPERLE 62
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
LL HIGLWDV +A R+GS D I + PN + LI + P L +AFNG+KA + K
Sbjct: 63 GLLRHHIGLWDVIGTAKRQGSLDSNIKEVSPNPLGDLIGKLPNLKALAFNGQKAAQLGIK 122
Query: 126 HFPETLA-IRCLLMPSTSPANA 146
+ A + ++PSTSPA+A
Sbjct: 123 ELQKIGAKLPYYILPSTSPAHA 144
>gi|20089353|ref|NP_615428.1| hypothetical protein MA0462 [Methanosarcina acetivorans C2A]
gi|19914245|gb|AAM03908.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length = 158
Score = 114 bits (285), Expect = 2e-24, Method: Composition-based stats.
Identities = 67/144 (46%), Positives = 86/144 (59%), Gaps = 9/144 (6%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
K+ FP + D + EILILGSLP DESIR+ QYY +P N FW L+ +G+ L D Y ++
Sbjct: 3 KRGFPAVLDENTEILILGSLPSDESIRKQQYYGNPGNDFWRLVGHAIGENLQDMAYEKKL 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKA---EA 121
+ L IGLWDV + R+GS D +I DEE ND L E P+L I FNGRKA E
Sbjct: 63 KTLKHNRIGLWDVFKAGSREGSQDSKIGDEEINDFSGLKEMVPKLRLICFNGRKAGEYEP 122
Query: 122 MFRKHFPETLAIRCLLMPSTSPAN 145
+ R ET ++PS+S AN
Sbjct: 123 LLRGMGYET-----KVLPSSSGAN 141
>gi|71909478|ref|YP_287065.1| hypothetical protein Daro_3867 [Dechloromonas aromatica RCB]
gi|71849099|gb|AAZ48595.1| conserved hypothetical protein [Dechloromonas aromatica RCB]
Length = 176
Score = 114 bits (285), Expect = 2e-24, Method: Composition-based stats.
Identities = 65/159 (40%), Positives = 89/159 (55%), Gaps = 5/159 (3%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
Q F P+ D ILILGS+PG+ S+R GQYYA+ RN FW ++ L+G YA+R E
Sbjct: 8 QGFSPVVDARARILILGSMPGEASLRAGQYYANERNAFWRIIGDLIGAGPSLPYAQRLES 67
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
L +A I LWDV + R GS D I ND + +P + ++ FNG AEA FR
Sbjct: 68 LQAAGIALWDVIATCERYGSLDSDIVRGSVCVNDFPAFFAAHPGIESVYFNGATAEASFR 127
Query: 125 KHFPETLA---IRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+H L +R L +PSTSPA+A ++ + W+ I
Sbjct: 128 RHVLPGLGGKELRLLRLPSTSPAHAARSYLEKLAAWSAI 166
>gi|21227753|ref|NP_633675.1| hypothetical protein MM_1651 [Methanosarcina mazei Go1]
gi|20906154|gb|AAM31347.1| conserved protein [Methanosarcina mazei Go1]
Length = 158
Score = 113 bits (282), Expect = 4e-24, Method: Composition-based stats.
Identities = 58/115 (50%), Positives = 70/115 (60%), Gaps = 1/115 (0%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
K+ FPP+ D + EILILGSLPGDES+RR QYY HP N FW L+ +G+ L D DY R
Sbjct: 3 KRGFPPVFDNNTEILILGSLPGDESLRRRQYYGHPGNDFWRLVGNAIGENLQDMDYETRL 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKA 119
L IGLWDV + RKGS D I +EE N L E P L + FNG+ +
Sbjct: 63 TTLKRHRIGLWDVFKAGKRKGSQDSNIREEEVNQFSELKEMAPELKKVFFNGKAS 117
>gi|156864151|gb|EDO57582.1| hypothetical protein CLOL250_01672 [Clostridium sp. L2-50]
Length = 166
Score = 113 bits (282), Expect = 4e-24, Method: Composition-based stats.
Identities = 56/152 (36%), Positives = 93/152 (61%), Gaps = 1/152 (0%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTE 65
+ F P+ + ++L+LGSLP +S +G YY HPRNRFW ++A +LG P P E+
Sbjct: 8 EHEFAPVYNEESQVLVLGSLPSVKSREQGFYYGHPRNRFWKVVAAVLGVPEPLTIEEKKR 67
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
+LL+ H+ ++DV + GS+D I + P DI+S++ P + + NG+ A +++K
Sbjct: 68 MLLAGHVAVYDVIRACEIIGSSDSSIRNVVPADIKSIVAHTP-IRAVFTNGKTAGRLYKK 126
Query: 126 HFPETLAIRCLLMPSTSPANAGKTLDLLVKDW 157
+ E +A+ + +PSTSPANA +L+ L + W
Sbjct: 127 YQAEHIALPMIELPSTSPANAAFSLERLEEIW 158
>gi|119899892|ref|YP_935105.1| hypothetical protein azo3603 [Azoarcus sp. BH72]
gi|119672305|emb|CAL96219.1| conserved hypothetical protein [Azoarcus sp. BH72]
Length = 186
Score = 113 bits (282), Expect = 5e-24, Method: Composition-based stats.
Identities = 67/163 (41%), Positives = 86/163 (52%), Gaps = 9/163 (5%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
+SF P+ +L+LGS+PG S+ QYYAHPRN FW +M L G YAER +
Sbjct: 14 RSFAPVWRSDARVLVLGSMPGVASLTAAQYYAHPRNAFWTIMGALFGAGPALPYAERLQR 73
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
LL A + LWDV S R GS D I + PND+ LI P L + FNG AE FR
Sbjct: 74 LLDAGVALWDVIASCRRPGSLDSAIAPDSVTPNDLPGLITACPALEHLFFNGTAAETAFR 133
Query: 125 KHFPETLAIRCL-------LMPSTSPANAGKTLDLLVKDWNRI 160
+HF +R L +PSTSPA+A + L + W +
Sbjct: 134 RHFARGDRLRVLRPTLKLQRLPSTSPAHAARPLAEKLSAWEAV 176
>gi|118696705|ref|ZP_01554786.1| G-T/U mismatch-specific DNA glycosylase-like [Burkholderia
ambifaria MC40-6]
gi|118645535|gb|EAV52367.1| G-T/U mismatch-specific DNA glycosylase-like [Burkholderia
ambifaria MC40-6]
Length = 158
Score = 112 bits (281), Expect = 6e-24, Method: Composition-based stats.
Identities = 63/159 (39%), Positives = 90/159 (56%), Gaps = 10/159 (6%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDD--YAERT 64
Q FPP+ + +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P + Y R
Sbjct: 3 QGFPPVVGPNTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPGLHELAYDARL 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
E +LS IG+WDV + R+GS D I + +PND SL E P L + FNG+ A
Sbjct: 63 ERVLSHGIGIWDVLDACHRQGSLDSAIRNAKPNDFDSLREHAPLLKRVCFNGKTA----- 117
Query: 125 KHFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F + + L++PS+SPANA + + ++ W I
Sbjct: 118 GRFADVIGQAGYETLVLPSSSPANAMLSFEQKLEQWRHI 156
>gi|152981569|ref|YP_001352380.1| G:T/U mismatch-specific DNA glycosylase [Janthinobacterium sp.
Marseille]
gi|151281646|gb|ABR90056.1| G:T/U mismatch-specific DNA glycosylase [Janthinobacterium sp.
Marseille]
Length = 165
Score = 112 bits (281), Expect = 6e-24, Method: Composition-based stats.
Identities = 69/159 (43%), Positives = 93/159 (58%), Gaps = 11/159 (6%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERTEIL 67
FPP+ D H EILILGS PG+ S+ QYYAHPRN+FW L L+ + L Y ER +
Sbjct: 13 FPPVIDQHTEILILGSFPGEASLSAQQYYAHPRNQFWKLTGALINEDLVTLPYTERL-LR 71
Query: 68 LSAH-IGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGR---KAEAMF 123
LSAH IGLWDV + R+GS D I + ND L ++ P+L + FNG+ K EA+F
Sbjct: 72 LSAHRIGLWDVIAACAREGSLDSAIRQAQHNDFALLKQQCPQLRRVCFNGKTSGKQEALF 131
Query: 124 RKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFS 162
+ ET +++PS+SPA A T + + W I +
Sbjct: 132 AQAGFET-----IILPSSSPAYALLTFEQKLAAWRGIMA 165
>gi|82491806|gb|ABB77797.1| Mug G:T/U mismatch-specific DNA glycosylase [Burkholderia
cenocepacia]
Length = 159
Score = 112 bits (280), Expect = 7e-24, Method: Composition-based stats.
Identities = 65/161 (40%), Positives = 89/161 (55%), Gaps = 14/161 (8%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP----LPDDYAE 62
Q FPP+ H +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P LP Y
Sbjct: 3 QGFPPVVAPHTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPTLHELP--YDA 60
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAM 122
R +LS IG+WDV + R+GS D I + +PND SL E P L + FNG+ A
Sbjct: 61 RLARVLSHGIGIWDVLDACHRQGSLDSAIRNAKPNDFASLREHAPLLKRVCFNGKTA--- 117
Query: 123 FRKHFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F + + L++PS+SPANA + + + W I
Sbjct: 118 --GRFADVIGQAGYETLVLPSSSPANAMLSFEQKLAQWRAI 156
>gi|106886756|ref|ZP_01354087.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
gi|106765716|gb|EAT22464.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
Length = 173
Score = 112 bits (280), Expect = 7e-24, Method: Composition-based stats.
Identities = 57/154 (37%), Positives = 90/154 (58%), Gaps = 1/154 (0%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
+F P+ D + +ILILGS P +S + YY HP NRFW +++ +L K +P D + E
Sbjct: 9 HTFEPVFDKNSKILILGSFPSVKSREQEFYYGHPMNRFWRMLSSVLKKEIPSDSLRKREF 68
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH 126
LL HI LWDV S GS+D I + P D+ ++ +P H I NG+ A+ ++ ++
Sbjct: 69 LLENHIALWDVIDSCDIIGSSDSSIKNVVPTDLSRILAISPISH-IYVNGKTAKKLYDRY 127
Query: 127 FPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
++ + +PSTSPANA +L+ L+++W I
Sbjct: 128 SESVTGLKAICLPSTSPANAMFSLEKLIEEWKVI 161
>gi|152990863|ref|YP_001356585.1| hypothetical protein NIS_1119 [Nitratiruptor sp. SB155-2]
gi|151422724|dbj|BAF70228.1| conserved hypothetical protein [Nitratiruptor sp. SB155-2]
Length = 182
Score = 111 bits (278), Expect = 1e-23, Method: Composition-based stats.
Identities = 57/153 (37%), Positives = 81/153 (52%), Gaps = 1/153 (0%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILL 68
F P D ++LILGS P S+ G YY HP+N+FW ++A + + +PD E+ E L
Sbjct: 7 FEPFIDKESKVLILGSFPSFASLEIGFYYGHPQNQFWKIVADVFKQKVPDTLEEKKEFLK 66
Query: 69 SAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFP 128
HI LWD+ IR+ S D + E NDI L++ P + I F GRK++ +F K F
Sbjct: 67 RHHIALWDMVKGCIRENSLDSSLSSIEVNDIEKLLKTYPNIEVIFFTGRKSQQLFEKQFS 126
Query: 129 ETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIF 161
R L PS SPA + + V+ W +I
Sbjct: 127 HVNLPRYYL-PSPSPAYRAMSFEEKVQQWKKIL 158
>gi|118719341|ref|ZP_01571872.1| uracil-DNA glycosylase superfamily [Burkholderia multivorans ATCC
17616]
gi|118657647|gb|EAV64396.1| uracil-DNA glycosylase superfamily [Burkholderia multivorans ATCC
17616]
Length = 163
Score = 111 bits (278), Expect = 1e-23, Method: Composition-based stats.
Identities = 67/162 (41%), Positives = 91/162 (56%), Gaps = 2/162 (1%)
Query: 4 VHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAE 62
++K+ FPP+ D + LILGSLPG+ S+ QYY HP+N+FW L+ ++ + L Y
Sbjct: 1 MNKRCFPPVVDANTRSLILGSLPGERSLALRQYYGHPQNKFWSLVGDVIERDLVGMSYPA 60
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAM 122
R + LL IGLWDV A R GS D +I D ND+ +LIE P L TIAFNG A +
Sbjct: 61 RLDTLLEHRIGLWDVVAQASRIGSLDSRIRDHASNDLIALIETLPNLTTIAFNGGTAAKI 120
Query: 123 FRKHFPETLA-IRCLLMPSTSPANAGKTLDLLVKDWNRIFSL 163
E A R L +PS+SPA A + + W ++ L
Sbjct: 121 GMTALGEEGARYRLLRLPSSSPAYASISYAEKLAAWRQLRPL 162
>gi|124894388|gb|EAY68268.1| hypothetical protein BDAG_00976 [Burkholderia dolosa AUO158]
Length = 185
Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats.
Identities = 63/160 (39%), Positives = 88/160 (55%), Gaps = 10/160 (6%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDD--YAERT 64
Q FPP+ +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P + Y R
Sbjct: 30 QGFPPVVAPDTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPRLHELAYDARL 89
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
E +LS IG+WDV + R+GS D I + +PND SL E P L + FNG+ A
Sbjct: 90 ERVLSHGIGIWDVLDACHRQGSLDSAIRNAKPNDFTSLREHAPLLKKVCFNGKTA----- 144
Query: 125 KHFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRIF 161
F + + L++PS+SPANA + + + W I
Sbjct: 145 GRFADVIGRAGYDTLVLPSSSPANAMLSFEQKLAQWRAIL 184
>gi|73668280|ref|YP_304295.1| hypothetical protein Mbar_A0737 [Methanosarcina barkeri str.
Fusaro]
gi|72395442|gb|AAZ69715.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 158
Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats.
Identities = 56/115 (48%), Positives = 72/115 (62%), Gaps = 1/115 (0%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
KQ FPP+ D + EILILGSLPGD SIR+ QYY HP N FW L+ ++G+ L +Y R
Sbjct: 3 KQGFPPVIDENTEILILGSLPGDVSIRKHQYYGHPGNDFWRLLGSIIGEDLQSINYQNRL 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKA 119
E L IGLWDV + R+G+ D +I DEE N L + P L + FNG+K+
Sbjct: 63 EALKRNKIGLWDVFKAGKREGNEDTKIKDEEINQFSILKDMAPNLKLVLFNGKKS 117
>gi|118039985|ref|ZP_01511383.1| conserved hypothetical protein [Burkholderia phytofirmans PsJN]
gi|117989792|gb|EAV04092.1| conserved hypothetical protein [Burkholderia phytofirmans PsJN]
Length = 158
Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats.
Identities = 62/160 (38%), Positives = 88/160 (55%), Gaps = 9/160 (5%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERTE 65
+ FPP+ LILGS PG+ S+ QYYAHPRN+FW L+ +L +PL + Y ER
Sbjct: 4 RGFPPVAAADTHTLILGSFPGEASLAATQYYAHPRNQFWRLLGAVLDEPLAELAYDERLR 63
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
+L+ IG+WDV + R+GS D I + PND SL P+L + FNG+ A
Sbjct: 64 RVLAHGIGVWDVLAACTREGSLDAAIRNASPNDFASLRAHAPKLAKVCFNGKTA-----G 118
Query: 126 HFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRIFS 162
F + L++PS+SPANA + D ++ W I +
Sbjct: 119 RFAPVIGAAGYATLVLPSSSPANAMLSFDQKLRLWRDILT 158
>gi|94311441|ref|YP_584651.1| Uracil-DNA glycosylase superfamily [Ralstonia metallidurans CH34]
gi|93355293|gb|ABF09382.1| Uracil-DNA glycosylase superfamily [Ralstonia metallidurans CH34]
Length = 164
Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats.
Identities = 62/141 (43%), Positives = 84/141 (59%), Gaps = 2/141 (1%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
K+ FPP+ D H +L+LGSLPG+ S+ + +YYAH +NRFW LM+ +LG L +Y R
Sbjct: 4 KRCFPPVVDIHTRVLVLGSLPGEVSLAQSRYYAHKQNRFWHLMSDVLGVDLVGLEYEARL 63
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
+ LL +GLWDV A R+GS D I D + ND+ LI P L IAFNG A +
Sbjct: 64 QALLDHRVGLWDVVAEARREGSLDSSIRDHQGNDLIGLIATLPTLKAIAFNGGTAAKIGE 123
Query: 125 KHFPETLAIRCLL-MPSTSPA 144
K + +L +PS+SPA
Sbjct: 124 KTLADHGDRHAILRLPSSSPA 144
>gi|84362465|ref|ZP_00987086.1| COG3663: G:T/U mismatch-specific DNA glycosylase [Burkholderia
dolosa AUO158]
Length = 158
Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats.
Identities = 63/160 (39%), Positives = 88/160 (55%), Gaps = 10/160 (6%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDD--YAERT 64
Q FPP+ +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P + Y R
Sbjct: 3 QGFPPVVAPDTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPRLHELAYDARL 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
E +LS IG+WDV + R+GS D I + +PND SL E P L + FNG+ A
Sbjct: 63 ERVLSHGIGIWDVLDACHRQGSLDSAIRNAKPNDFTSLREHAPLLKKVCFNGKTA----- 117
Query: 125 KHFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRIF 161
F + + L++PS+SPANA + + + W I
Sbjct: 118 GRFADVIGRAGYDTLVLPSSSPANAMLSFEQKLAQWRAIL 157
>gi|23098520|ref|NP_691986.1| hypothetical protein OB1065 [Oceanobacillus iheyensis HTE831]
gi|22776746|dbj|BAC13021.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831]
Length = 167
Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats.
Identities = 62/158 (39%), Positives = 89/158 (56%), Gaps = 4/158 (2%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
QSF P+ +LI+GS+PG +S+ QYY +PRN FW +M LL + D+Y ++ +
Sbjct: 5 QSFAPVIPDDPRVLIVGSMPGVKSLAEQQYYGNPRNHFWSIMFTLLEEKRVDNYQDKLSM 64
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH 126
L I LWD S R+GS D+ I D EPNDI L++ P + I NG KA +FRK+
Sbjct: 65 LKRHGIALWDTIGSCYREGSLDVNIQDAEPNDIIGLVQDYPTIKLIGCNGTKAYDIFRKN 124
Query: 127 FP-ETLA-IRCLLMPSTS--PANAGKTLDLLVKDWNRI 160
F E+ + + +PSTS P K L+ ++ W I
Sbjct: 125 FSLESFPNVDVVKLPSTSPIPGKYNKNLEEKIEAWKII 162
>gi|16126728|ref|NP_421292.1| hypothetical protein CC_2489 [Caulobacter crescentus CB15]
gi|13424044|gb|AAK24460.1| conserved hypothetical protein [Caulobacter crescentus CB15]
Length = 191
Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats.
Identities = 63/146 (43%), Positives = 85/146 (58%), Gaps = 3/146 (2%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPL-PDDYAERT 64
K+ FPP+ D +LILGSLPGD S+ GQYY HPRN FW LM ++ PL Y +R
Sbjct: 5 KRGFPPVVDAKTCVLILGSLPGDASLAAGQYYGHPRNAFWRLMEGVIDAPLVVRAYEDRL 64
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
LL+ +GLWDV +A R GS D I + ND+ +LI+ P L +AFNG A +
Sbjct: 65 VTLLAHGVGLWDVIETARRPGSLDAAIREPAANDLMALIDTLPALRLVAFNGGTAGRLGG 124
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTL 150
+ ++ L +PS+SPA A +T
Sbjct: 125 RLIGTRVST--LALPSSSPAYAARTF 148
>gi|91776836|ref|YP_546592.1| T/U mismatch-specific DNA glycosylase [Methylobacillus flagellatus
KT]
gi|91710823|gb|ABE50751.1| T/U mismatch-specific DNA glycosylase [Methylobacillus flagellatus
KT]
Length = 174
Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats.
Identities = 64/162 (39%), Positives = 82/162 (50%), Gaps = 6/162 (3%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
SFPP +LILGS+PG S+ YYAHP+N FWP + L P Y ER +
Sbjct: 9 HSFPPAITPGCRVLILGSMPGKRSLEMQAYYAHPQNLFWPFITHLFNIPPESGYQERLRL 68
Query: 67 LLSAHIGLWDVAHSAIRKGS--ADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
L HIG+WDV +R S DI+ ND +L + P + I FNG KAE FR
Sbjct: 69 LNQQHIGIWDVLKECVRTSSLDGDIEETSIVANDFSALFMQFPHIRHIFFNGSKAEQAFR 128
Query: 125 KHFPETLAIRCLL----MPSTSPANAGKTLDLLVKDWNRIFS 162
+H + I L +PSTSPANA + + W +I S
Sbjct: 129 RHVLKRQIIPAHLHYHKLPSTSPANASIKKEHKLAQWQQIIS 170
>gi|115352432|ref|YP_774271.1| G-T/U mismatch-specific DNA glycosylase-like protein [Burkholderia
cepacia AMMD]
gi|115282420|gb|ABI87937.1| G-T/U mismatch-specific DNA glycosylase-like protein [Burkholderia
cepacia AMMD]
Length = 158
Score = 110 bits (275), Expect = 2e-23, Method: Composition-based stats.
Identities = 62/159 (38%), Positives = 89/159 (55%), Gaps = 10/159 (6%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDD--YAERT 64
Q F P+ + +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P + Y R
Sbjct: 3 QGFAPVVGPNTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPGLHELVYDARL 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
E +LS IG+WDV + R+GS D I + +PND SL E P L + FNG+ A
Sbjct: 63 ERVLSHGIGIWDVLDACHRQGSLDSAIRNAKPNDFASLREHAPLLKKVCFNGKTA----- 117
Query: 125 KHFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F + + L++PS+SPANA + + ++ W I
Sbjct: 118 GRFADVIGQAGYETLVLPSSSPANAMLSFEQKLEQWRHI 156
>gi|153811093|ref|ZP_01963761.1| hypothetical protein RUMOBE_01484 [Ruminococcus obeum ATCC 29174]
gi|149832981|gb|EDM88064.1| hypothetical protein RUMOBE_01484 [Ruminococcus obeum ATCC 29174]
Length = 162
Score = 110 bits (275), Expect = 3e-23, Method: Composition-based stats.
Identities = 59/155 (38%), Positives = 88/155 (56%), Gaps = 1/155 (0%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTE 65
K + PI D + ILILGS P +S + +Y H +NRFW ++A +LG +P E+ E
Sbjct: 3 KHTIAPIYDKNSRILILGSFPSVKSREQQFFYGHKQNRFWKVLAGILGYEVPQTIEEKKE 62
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
+LL+ HI +WDV S +GS+D I + PND+ ++ + I NG KA ++ K
Sbjct: 63 MLLANHIAVWDVIQSCEIEGSSDASIHEVIPNDLSEILA-TADIRAIYTNGGKAFELYDK 121
Query: 126 HFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ I+ +PSTSPANAG +L+ L + W I
Sbjct: 122 YIYPVNGIKAQKLPSTSPANAGYSLERLKEAWMEI 156
>gi|114776245|ref|ZP_01451290.1| hypothetical protein SPV1_00957 [Mariprofundus ferrooxydans PV-1]
gi|114553075|gb|EAU55473.1| hypothetical protein SPV1_00957 [Mariprofundus ferrooxydans PV-1]
Length = 168
Score = 110 bits (275), Expect = 3e-23, Method: Composition-based stats.
Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 7/165 (4%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTE 65
++ FP + ++LILGS+PG +S+ QYYAHPRN FWP+MA L DY +R
Sbjct: 3 EEGFPCSANPDAKVLILGSMPGRKSLDEQQYYAHPRNAFWPIMASLFEFSADIDYEKRLG 62
Query: 66 ILLSAHIGLWDVAHSAIRKGSAD--IQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMF 123
+L S I LWDVA IR GS D I++ ND + + +P++ + FNGRKA ++
Sbjct: 63 MLRSNGIALWDVARQCIRPGSLDAAIRLPSVITNDFTAFFDTHPQIRAVCFNGRKAADLY 122
Query: 124 RKHFPETLAIR-----CLLMPSTSPANAGKTLDLLVKDWNRIFSL 163
++ LA+ +PSTSP +A + D + W I +
Sbjct: 123 QRRVIPELALNQTQLPLYPLPSTSPTHAAISFDSKLAQWRTILDI 167
>gi|94496341|ref|ZP_01302918.1| Uracil-DNA glycosylase superfamily protein [Sphingomonas sp. SKA58]
gi|94424087|gb|EAT09111.1| Uracil-DNA glycosylase superfamily protein [Sphingomonas sp. SKA58]
Length = 164
Score = 110 bits (274), Expect = 4e-23, Method: Composition-based stats.
Identities = 63/153 (41%), Positives = 88/153 (57%), Gaps = 1/153 (0%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPL-PDDYAERT 64
K +FPP D +L+LGSLPGD SIR+G+YYAH N FW L+ ++G+ L YA R
Sbjct: 4 KFAFPPSVDAATRVLVLGSLPGDASIRQGEYYAHRGNAFWHLIGDVVGENLRAQPYAMRL 63
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
+ L + H+GLWDV SA R+GS D +I + ++ +E P L IAFNG+ A R
Sbjct: 64 KRLRAHHVGLWDVMESAQREGSLDSRIRQADLRNLADFVEHLPDLRAIAFNGKTAALHGR 123
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLDLLVKDW 157
+ + I L +PSTS A A + + + W
Sbjct: 124 RQLTDRPGITLLDLPSTSGAYASLSREAKRERW 156
>gi|87199194|ref|YP_496451.1| Uracil-DNA glycosylase superfamily [Novosphingobium aromaticivorans
DSM 12444]
gi|87134875|gb|ABD25617.1| Uracil-DNA glycosylase superfamily [Novosphingobium aromaticivorans
DSM 12444]
Length = 168
Score = 109 bits (273), Expect = 4e-23, Method: Composition-based stats.
Identities = 61/160 (38%), Positives = 89/160 (55%), Gaps = 1/160 (0%)
Query: 2 ASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLP-DDY 60
A K SF P+ D +LILGSLPG+ S+ +YYA+PRN+FW L+ +++G L Y
Sbjct: 4 AEGRKSSFAPVVDERTRVLILGSLPGEASLAAARYYANPRNQFWRLVGEVIGSDLAVRGY 63
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
ER +L IGLWD SA R+GS D I + E N + L+ P L +AFNG K+
Sbjct: 64 DERLALLREHGIGLWDAIGSARRRGSLDTAIREVEANPLGDLVTHLPALRCVAFNGAKSA 123
Query: 121 AMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
A+ + + + +PS+SPA+A T + W+ +
Sbjct: 124 AIGARELAGAGQVELVRLPSSSPAHAALTYAEKLVHWSNL 163
>gi|118707225|ref|ZP_01559811.1| G-T/U mismatch-specific DNA glycosylase-like [Burkholderia
cenocepacia MC0-3]
gi|118654888|gb|EAV61667.1| G-T/U mismatch-specific DNA glycosylase-like [Burkholderia
cenocepacia MC0-3]
Length = 158
Score = 109 bits (273), Expect = 4e-23, Method: Composition-based stats.
Identities = 65/161 (40%), Positives = 90/161 (55%), Gaps = 14/161 (8%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP----LPDDYAE 62
Q FPP+ + +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P LP Y
Sbjct: 3 QGFPPVVAPNTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPSLHELP--YDA 60
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAM 122
R E +LS IG+WDV + R+GS D I + +PND SL E P L + FNG+ A
Sbjct: 61 RLERVLSHGIGIWDVLDACHRQGSLDSAIRNAKPNDFDSLREHAPLLKRVCFNGKTA--- 117
Query: 123 FRKHFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F + + L++PS+SPANA + + + W I
Sbjct: 118 --GRFADVIGQAGYETLVLPSSSPANAMLSFEQKLVLWRGI 156
>gi|134296531|ref|YP_001120266.1| G-T/U mismatch-specific DNA glycosylase-like protein [Burkholderia
vietnamiensis G4]
gi|134139688|gb|ABO55431.1| G-T/U mismatch-specific DNA glycosylase-like protein [Burkholderia
vietnamiensis G4]
Length = 158
Score = 108 bits (271), Expect = 7e-23, Method: Composition-based stats.
Identities = 62/156 (39%), Positives = 86/156 (55%), Gaps = 4/156 (2%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDD--YAERT 64
Q FPP+ +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P + Y R
Sbjct: 3 QGFPPVVARDTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPGLHELAYDARL 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
E +LS +G+WDV + R+GS D I + +PND SL P L + FNG+ A F
Sbjct: 63 ERVLSHGVGIWDVLDACHRQGSLDSAIRNAKPNDFASLRAHAPLLRKVCFNGKTA-GRFA 121
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
E L++PS+SPA A + + + W RI
Sbjct: 122 DVIGEA-GYETLVLPSSSPAYAMLSFEQKLVQWRRI 156
>gi|107023282|ref|YP_621609.1| G-T/U mismatch-specific DNA glycosylase-like [Burkholderia
cenocepacia AU 1054]
gi|116690365|ref|YP_835988.1| G-T/U mismatch-specific DNA glycosylase-like protein [Burkholderia
cenocepacia HI2424]
gi|105893471|gb|ABF76636.1| G-T/U mismatch-specific DNA glycosylase-like [Burkholderia
cenocepacia AU 1054]
gi|116648454|gb|ABK09095.1| G-T/U mismatch-specific DNA glycosylase-like protein [Burkholderia
cenocepacia HI2424]
Length = 158
Score = 108 bits (271), Expect = 8e-23, Method: Composition-based stats.
Identities = 65/161 (40%), Positives = 89/161 (55%), Gaps = 14/161 (8%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP----LPDDYAE 62
Q FPP+ + +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P LP Y
Sbjct: 3 QGFPPVVAPNTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPSLHELP--YDA 60
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAM 122
R E +LS IG+WDV + R+GS D I + +PND SL P L + FNG+ A
Sbjct: 61 RLERVLSHGIGIWDVLDACHRQGSLDSAIRNAKPNDFDSLRAHAPLLKRVCFNGKTA--- 117
Query: 123 FRKHFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F E + L++PS+SPANA + + + W I
Sbjct: 118 --GRFAEVIGQAGYETLVLPSSSPANAMLSFEQKLVLWRGI 156
>gi|53718725|ref|YP_107711.1| hypothetical protein BPSL1089 [Burkholderia pseudomallei K96243]
gi|52209139|emb|CAH35083.1| conserved hypothetical protein [Burkholderia pseudomallei K96243]
Length = 159
Score = 108 bits (269), Expect = 1e-22, Method: Composition-based stats.
Identities = 65/161 (40%), Positives = 89/161 (55%), Gaps = 14/161 (8%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP----LPDDYAE 62
+ FPP+ LILGS PG+ S+ QYYAHPRN+FW L+ +LG+P LP Y
Sbjct: 4 RGFPPVASPGTHTLILGSFPGEASLAAAQYYAHPRNQFWRLLGAVLGEPQLHALP--YDA 61
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKA--- 119
R +L+ G+WDV + R+GS D I +PND SL E P+L + FNG+ A
Sbjct: 62 RLARVLAHGFGIWDVLAACHREGSLDAAIRHAQPNDFASLREVAPKLTKVCFNGKTAGRF 121
Query: 120 EAMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
E R +T L++PS+SPANA + + + W RI
Sbjct: 122 EPTIRAAGFDT-----LVLPSSSPANAMLSFEQKLVFWQRI 157
>gi|67643813|ref|ZP_00442556.1| COG3663: G:T/U mismatch-specific DNA glycosylase [Burkholderia
mallei GB8 horse 4]
gi|76811184|ref|YP_332730.1| hypothetical protein BURPS1710b_1320 [Burkholderia pseudomallei
1710b]
gi|83621549|ref|ZP_00931957.1| COG3663: G:T/U mismatch-specific DNA glycosylase [Burkholderia
mallei FMH]
gi|83626404|ref|ZP_00936628.1| COG3663: G:T/U mismatch-specific DNA glycosylase [Burkholderia
mallei JHU]
gi|90294040|ref|ZP_01213637.1| hypothetical protein Bpse17_02001345 [Burkholderia pseudomallei
1710a]
gi|99910552|ref|ZP_01317543.1| hypothetical protein Bpse1_03003037 [Burkholderia pseudomallei
1655]
gi|100063392|ref|ZP_01324923.1| hypothetical protein BpseP_03001149 [Burkholderia pseudomallei
Pasteur]
gi|100236388|ref|ZP_01337005.1| hypothetical protein Bpse4_03000554 [Burkholderia pseudomallei
406e]
gi|100267388|ref|ZP_01341621.1| hypothetical protein Bmal2_03000440 [Burkholderia mallei
2002721280]
gi|100917862|ref|ZP_01346339.1| hypothetical protein Bmal10_03001044 [Burkholderia mallei 10399]
gi|121599540|ref|YP_992347.1| hypothetical protein BMASAVP1_A1007 [Burkholderia mallei SAVP1]
gi|124386336|ref|YP_001026847.1| hypothetical protein BMA10299_A0857 [Burkholderia mallei NCTC
10229]
gi|126440188|ref|YP_001058209.1| hypothetical protein BURPS668_1160 [Burkholderia pseudomallei 668]
gi|126448605|ref|YP_001079866.1| hypothetical protein BMA10247_0291 [Burkholderia mallei NCTC 10247]
gi|134281094|ref|ZP_01767803.1| conserved hypothetical protein [Burkholderia pseudomallei 305]
gi|76580637|gb|ABA50112.1| conserved hypothetical protein [Burkholderia pseudomallei 1710b]
gi|121228350|gb|ABM50868.1| conserved hypothetical protein [Burkholderia mallei SAVP1]
gi|124294356|gb|ABN03625.1| conserved hypothetical protein [Burkholderia mallei NCTC 10229]
gi|126219681|gb|ABN83187.1| conserved hypothetical protein [Burkholderia pseudomallei 668]
gi|126241475|gb|ABO04568.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247]
gi|134247400|gb|EBA47485.1| conserved hypothetical protein [Burkholderia pseudomallei 305]
gi|147746469|gb|EDK53546.1| conserved hypothetical protein [Burkholderia mallei FMH]
gi|147751449|gb|EDK58516.1| conserved hypothetical protein [Burkholderia mallei JHU]
gi|148029208|gb|EDK87113.1| conserved hypothetical protein [Burkholderia mallei 2002721280]
Length = 158
Score = 108 bits (269), Expect = 1e-22, Method: Composition-based stats.
Identities = 65/161 (40%), Positives = 89/161 (55%), Gaps = 14/161 (8%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP----LPDDYAE 62
+ FPP+ LILGS PG+ S+ QYYAHPRN+FW L+ +LG+P LP Y
Sbjct: 3 RGFPPVASPGTHTLILGSFPGEASLAAAQYYAHPRNQFWRLLGAVLGEPQLHALP--YDA 60
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKA--- 119
R +L+ G+WDV + R+GS D I +PND SL E P+L + FNG+ A
Sbjct: 61 RLARVLAHGFGIWDVLAACHREGSLDAAIRHAQPNDFASLREVAPKLTKVCFNGKTAGRF 120
Query: 120 EAMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
E R +T L++PS+SPANA + + + W RI
Sbjct: 121 EPTIRAAGFDT-----LVLPSSSPANAMLSFEQKLVFWQRI 156
>gi|78067153|ref|YP_369922.1| G-T/U mismatch-specific DNA glycosylase-like [Burkholderia sp. 383]
gi|77967898|gb|ABB09278.1| G-T/U mismatch-specific DNA glycosylase-like [Burkholderia sp. 383]
Length = 158
Score = 107 bits (268), Expect = 2e-22, Method: Composition-based stats.
Identities = 62/161 (38%), Positives = 88/161 (54%), Gaps = 10/161 (6%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDD--YAERT 64
+ FPP+ + +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+ + Y R
Sbjct: 3 RGFPPVVAANTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGTVLGEHGLHELAYDARL 62
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
E +LS IG+WDV + R+GS D I +PND +L E P L + FNG+ A
Sbjct: 63 ERVLSHGIGIWDVLAACHREGSLDSAIRHAKPNDFDALREHAPLLKRVCFNGKTA----- 117
Query: 125 KHFPETLAIR---CLLMPSTSPANAGKTLDLLVKDWNRIFS 162
F E + L++PS+SPANA T + + W + S
Sbjct: 118 GRFAEVIGAAGYDTLVLPSSSPANAMLTFEQKLAQWRGLLS 158
>gi|84355380|ref|ZP_00980265.1| COG3663: G:T/U mismatch-specific DNA glycosylase [Burkholderia
cenocepacia PC184]
Length = 158
Score = 107 bits (267), Expect = 2e-22, Method: Composition-based stats.
Identities = 64/161 (39%), Positives = 89/161 (55%), Gaps = 14/161 (8%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP----LPDDYAE 62
Q FPP+ + +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P LP Y
Sbjct: 3 QGFPPVVAPNTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPSLHELP--YDA 60
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAM 122
R E +LS IG+WDV + R+GS D I + +PND SL P L + FNG+ A
Sbjct: 61 RLERVLSHGIGIWDVLDACHRQGSLDSAIRNAKPNDFDSLRAHAPLLKRVCFNGKTA--- 117
Query: 123 FRKHFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F + + L++PS+SPANA + + + W I
Sbjct: 118 --GRFADVIGQAGYETLVLPSSSPANAMLSFEQKLVLWRGI 156
>gi|148556860|ref|YP_001264442.1| Uracil-DNA glycosylase superfamily [Sphingomonas wittichii RW1]
gi|148502050|gb|ABQ70304.1| Uracil-DNA glycosylase superfamily [Sphingomonas wittichii RW1]
Length = 190
Score = 107 bits (267), Expect = 2e-22, Method: Composition-based stats.
Identities = 58/141 (41%), Positives = 81/141 (57%), Gaps = 2/141 (1%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP--LPDDYAER 63
K SFPP+ +L+LGSLPG+ S+R +YYAHP+N+FW L ++G DY R
Sbjct: 22 KASFPPVVSADTRLLLLGSLPGEASLRAARYYAHPQNQFWRLTGAVIGDEGLALLDYPAR 81
Query: 64 TEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMF 123
+ L +A +GLWDV A+R GS D I D PND+ +L+ P L I FNG A +
Sbjct: 82 LDRLRAAGVGLWDVVADALRDGSLDGAIRDHRPNDLAALVATLPALRAIGFNGGTAARLG 141
Query: 124 RKHFPETLAIRCLLMPSTSPA 144
R+ + + +PS+SPA
Sbjct: 142 RRMIGAPAGLALVDLPSSSPA 162
>gi|124872348|gb|EAY64064.1| conserved hypothetical protein [Burkholderia cenocepacia PC184]
Length = 185
Score = 107 bits (266), Expect = 3e-22, Method: Composition-based stats.
Identities = 64/161 (39%), Positives = 89/161 (55%), Gaps = 14/161 (8%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP----LPDDYAE 62
Q FPP+ + +ILGS PG+ S+ QYYAHPRN+FW L+ +LG+P LP Y
Sbjct: 30 QGFPPVVAPNTHTMILGSFPGEASLDAAQYYAHPRNQFWRLLGAVLGEPSLHELP--YDA 87
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAM 122
R E +LS IG+WDV + R+GS D I + +PND SL P L + FNG+ A
Sbjct: 88 RLERVLSHGIGIWDVLDACHRQGSLDSAIRNAKPNDFDSLRAHAPLLKRVCFNGKTA--- 144
Query: 123 FRKHFPETL---AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F + + L++PS+SPANA + + + W I
Sbjct: 145 --GRFADVIGQAGYETLVLPSSSPANAMLSFEQKLVLWRGI 183
>gi|82533860|ref|ZP_00892913.1| hypothetical protein Bpse110_02004878 [Burkholderia pseudomallei
1106b]
gi|100122456|ref|ZP_01328262.1| hypothetical protein BpseS_03003710 [Burkholderia pseudomallei S13]
gi|126453694|ref|YP_001065448.1| hypothetical protein BURPS1106A_1169 [Burkholderia pseudomallei
1106a]
gi|126227336|gb|ABN90876.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a]
Length = 158
Score = 107 bits (266), Expect = 3e-22, Method: Composition-based stats.
Identities = 65/161 (40%), Positives = 89/161 (55%), Gaps = 14/161 (8%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP----LPDDYAE 62
+ FPP+ LILGS PG+ S+ QYYAHPRN+FW L+ +LG+P LP Y
Sbjct: 3 RGFPPVASPGTHTLILGSFPGEASLAAAQYYAHPRNQFWRLLGAVLGEPQLHALP--YDA 60
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKA--- 119
R +L+ G+WDV + R+GS D I +PND SL E P+L + FNG+ A
Sbjct: 61 RLARVLAHGFGIWDVLAACHREGSLDAAIRHAQPNDFASLREVAPKLTKVCFNGKTAGRF 120
Query: 120 EAMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
E R +T L++PS+SPANA + + + W RI
Sbjct: 121 EPPIRAAGFDT-----LVLPSSSPANAMLSFEQKLVFWQRI 156
>gi|152992204|ref|YP_001357925.1| hypothetical protein SUN_0609 [Sulfurovum sp. NBC37-1]
gi|151424065|dbj|BAF71568.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
Length = 175
Score = 106 bits (264), Expect = 5e-22, Method: Composition-based stats.
Identities = 58/143 (40%), Positives = 82/143 (57%), Gaps = 2/143 (1%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILL 68
F PI + LILGS P +S YYAHPRN+FW ++ KL G P+ ++ ++ +L
Sbjct: 10 FKPIVFNDTKTLILGSFPSIKSFENNFYYAHPRNQFWKILEKLTGYPV-NNRDQKIWLLK 68
Query: 69 SAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFP 128
+GLWD+ S R+ S D + DE NDI +L+ P + IAF G+KA+A+F HF
Sbjct: 69 ECKLGLWDMIGSCQRENSLDSSLEDEVVNDIPALLSAYPSIDKIAFTGKKAQALFETHF- 127
Query: 129 ETLAIRCLLMPSTSPANAGKTLD 151
L I + +PS SPA A T +
Sbjct: 128 SYLEIERVYLPSPSPAYAAMTFE 150
>gi|118052267|ref|ZP_01520814.1| conserved hypothetical protein [Comamonas testosteroni KF-1]
gi|118000395|gb|EAV14552.1| conserved hypothetical protein [Comamonas testosteroni KF-1]
Length = 178
Score = 105 bits (263), Expect = 7e-22, Method: Composition-based stats.
Identities = 63/154 (40%), Positives = 85/154 (55%), Gaps = 5/154 (3%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGK-PLPD--DYAER 63
Q + D + L+LGS PG +S++ QYYAHP+N+FWPL+ L + P P DYA R
Sbjct: 17 QGLAAVADARTQALVLGSFPGLKSLQLAQYYAHPQNQFWPLLQALWPRHPQPSRADYAGR 76
Query: 64 TEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMF 123
E LL +GLWDV S R+GS D I + ND L +R P L NG ++
Sbjct: 77 CEWLLERGLGLWDVYASCEREGSLDSAIRCAQVNDFAGLRQRCPDLQLALHNGGESYKHA 136
Query: 124 RKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDW 157
R+ E L + L +PSTSPANA + D ++ W
Sbjct: 137 RQ--TEALGLTALKLPSTSPANASWSFDKKLQAW 168
>gi|21244388|ref|NP_643970.1| hypothetical protein XAC3663 [Xanthomonas axonopodis pv. citri str.
306]
gi|21110047|gb|AAM38506.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 173
Score = 105 bits (262), Expect = 8e-22, Method: Composition-based stats.
Identities = 64/146 (43%), Positives = 80/146 (54%), Gaps = 7/146 (4%)
Query: 19 ILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILLSAHIGLWDVA 78
+L+LGS+PG S+ +YYAHPRNRFWPLM LLG YA R LL +GLWDV
Sbjct: 20 VLLLGSMPGVASLEAARYYAHPRNRFWPLMHALLGIDTAASYALRLHALLDHRVGLWDVI 79
Query: 79 HSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFRKHF-PETLAIRC 135
R+GS D I N + +L+ P+L +A NG A +R+H P A RC
Sbjct: 80 GQCERRGSLDTSIVAASIVVNPLPALLATLPQLRMVACNGTAAAQAWRRHVQPLLSAQRC 139
Query: 136 LL----MPSTSPANAGKTLDLLVKDW 157
L +PSTSPANA +L L W
Sbjct: 140 ALPVVALPSTSPANAAWSLPRLAAAW 165
>gi|121610938|ref|YP_998745.1| hypothetical protein Veis_4022 [Verminephrobacter eiseniae EF01-2]
gi|121555578|gb|ABM59727.1| conserved hypothetical protein [Verminephrobacter eiseniae EF01-2]
Length = 196
Score = 104 bits (260), Expect = 1e-21, Method: Composition-based stats.
Identities = 64/169 (37%), Positives = 92/169 (54%), Gaps = 9/169 (5%)
Query: 3 SVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGK-PLPDDYA 61
S Q P + +LILGS PG S++ GQYYAHPRN FWP++ L + PLP+ Y
Sbjct: 23 SARWQGLPAVVSAQTVVLILGSFPGAASLQAGQYYAHPRNHFWPILQALWPQHPLPNAYP 82
Query: 62 ERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNG----R 117
R + LL+ +GLWDV S R+GS D +I + + ND L P+L +A NG +
Sbjct: 83 ARCDWLLARGLGLWDVHASCEREGSLDSRIRNPQTNDFARLHGCCPQLLAVAHNGSASFK 142
Query: 118 KAEA----MFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFS 162
A+A + R T+ + + +PSTSPANA + + + W + S
Sbjct: 143 YADAVRACLARPLSDATVQMAAIRLPSTSPANASWSFERKLTAWAALMS 191
>gi|33596028|ref|NP_883671.1| hypothetical protein BPP1370 [Bordetella parapertussis 12822]
gi|33573031|emb|CAE36672.1| conserved hypothetical protein [Bordetella parapertussis]
Length = 177
Score = 104 bits (259), Expect = 2e-21, Method: Composition-based stats.
Identities = 56/158 (35%), Positives = 81/158 (51%), Gaps = 6/158 (3%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILL 68
F + +L+LGS+PG S+R QYY HPRN FWP+ A+L G YA R + L
Sbjct: 7 FAALARADARVLVLGSMPGVASLRAQQYYGHPRNAFWPIAAELFGLDAAQPYAHRVQALQ 66
Query: 69 SAHIGLWDVAHSAIRKGSADIQICDE--EPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH 126
A + LWDV + R GS D I + PND ++ +P + + NG A ++R+H
Sbjct: 67 RAGVALWDVLQACERPGSLDAAIVADSVRPNDFQAFFASHPGIACVGLNGAAAATLYRRH 126
Query: 127 FPETL----AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
L +R + +PSTSPA+A ++ W I
Sbjct: 127 VLPRLHGLAHLRYVALPSTSPAHAAMDFAAKLRAWRAI 164
>gi|78049337|ref|YP_365512.1| hypothetical protein XCV3781 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78037767|emb|CAJ25512.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 186
Score = 104 bits (259), Expect = 2e-21, Method: Composition-based stats.
Identities = 65/158 (41%), Positives = 82/158 (51%), Gaps = 7/158 (4%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
Q P +L+LGS+PG S+ +YYAHPRNRFWPLM LLG YA R +
Sbjct: 21 QGLPAHVRSDCRVLLLGSMPGVASLEAARYYAHPRNRFWPLMHALLGIDAAASYALRLQA 80
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
LL +GLWDV R+GS D I N + +L+ P+L +A NG A +R
Sbjct: 81 LLDHRVGLWDVIGQCERRGSLDTSIVAASIVVNPLPALLATLPQLRMVACNGAAAAQAWR 140
Query: 125 KHF-PETLAIRCLL----MPSTSPANAGKTLDLLVKDW 157
+H P A C L +PSTSPANA +L L W
Sbjct: 141 RHVQPRLSAQLCALPVVALPSTSPANAAWSLPRLAAAW 178
>gi|88797134|ref|ZP_01112724.1| hypothetical protein MED297_19912 [Reinekea sp. MED297]
gi|88780003|gb|EAR11188.1| hypothetical protein MED297_19912 [Reinekea sp. MED297]
Length = 171
Score = 103 bits (256), Expect = 4e-21, Method: Composition-based stats.
Identities = 58/145 (40%), Positives = 78/145 (53%), Gaps = 5/145 (3%)
Query: 19 ILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILLSAHIGLWDVA 78
+L+LGS+PG S+ QYYAHPRN FWP+MA G Y +R LL+ + LWDV
Sbjct: 17 VLVLGSMPGVASLTAVQYYAHPRNAFWPIMADFFGFDRDAPYDDRLAALLAKRVALWDVL 76
Query: 79 HSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFPETLA---I 133
S +R GS D I + PN I S + P L T+ NG +A FR++F L+ I
Sbjct: 77 QSCVRPGSLDSAIERDSMVPNPIDSWMATQPSLQTVLLNGGRAATEFRRYFKAILSNPNI 136
Query: 134 RCLLMPSTSPANAGKTLDLLVKDWN 158
+ +PSTSPA A + W+
Sbjct: 137 EVVTLPSTSPAYAAMRFEEKRDRWH 161
>gi|30249529|ref|NP_841599.1| hypothetical protein NE1560 [Nitrosomonas europaea ATCC 19718]
gi|30138892|emb|CAD85471.1| conserved hypothetical protein [Nitrosomonas europaea ATCC 19718]
Length = 167
Score = 103 bits (256), Expect = 4e-21, Method: Composition-based stats.
Identities = 60/163 (36%), Positives = 83/163 (50%), Gaps = 6/163 (3%)
Query: 4 VHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAER 63
+H +SFPPI +LILG++PG S+R QYYAHP+N FW + A++LG Y R
Sbjct: 1 MHCRSFPPIASPGSWVLILGTMPGKVSLREQQYYAHPQNLFWRITAEILGFDATSAYPLR 60
Query: 64 TEILLSAHIGLWDVAHSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEA 121
L + LWDV S R+ S D I PND P + + FNG KA A
Sbjct: 61 VSSLKDHGVALWDVLQSCTRESSLDADIVAHTIVPNDFGRFFTACPDIRRVCFNGAKAAA 120
Query: 122 MFRKHFPETL----AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
++ +H L + + +PSTSPANA ++ W+ I
Sbjct: 121 LYARHVKPFLQDAPTVEYVQLPSTSPANAAIPRADKLRAWSVI 163
>gi|134094007|ref|YP_001099082.1| hypothetical protein HEAR0761 [Herminiimonas arsenicoxydans]
gi|133737910|emb|CAL60955.1| conserved hypothetical protein [Herminiimonas arsenicoxydans]
Length = 169
Score = 103 bits (256), Expect = 4e-21, Method: Composition-based stats.
Identities = 66/156 (42%), Positives = 85/156 (54%), Gaps = 9/156 (5%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERTEIL 67
F P+ D EILILGS PG S+ QYYAHPRN+FW L++ L+ + L YA+R L
Sbjct: 17 FAPVIDRRTEILILGSFPGLASLTAQQYYAHPRNQFWRLLSALINEDLVALPYADRLGTL 76
Query: 68 LSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGR---KAEAMFR 124
LS IG+WDV R+GS D I ND L + P L IAFNG+ K A+F
Sbjct: 77 LSHRIGVWDVIDVCTRQGSLDSAIRAAIHNDYAQLRQLAPALTRIAFNGKTAGKQAALFA 136
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ ET ++PS+SPA A T + + W I
Sbjct: 137 EAGFET-----FVLPSSSPAYAAMTFEQKLAVWRGI 167
>gi|58580341|ref|YP_199357.1| hypothetical protein XOO0718 [Xanthomonas oryzae pv. oryzae
KACC10331]
gi|84622311|ref|YP_449683.1| hypothetical protein XOO_0654 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58424935|gb|AAW73972.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
KACC10331]
gi|84366251|dbj|BAE67409.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 186
Score = 102 bits (255), Expect = 5e-21, Method: Composition-based stats.
Identities = 62/161 (38%), Positives = 83/161 (51%), Gaps = 7/161 (4%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
Q P +L+LGS+PG S+ +YYAHPRNRFWPLM LLG YA R +
Sbjct: 21 QGLPAHIRSDCRVLLLGSMPGVASLEAARYYAHPRNRFWPLMHALLGLDAAASYASRLQA 80
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
LL +G+WDV R+GS D I N + +L+ P+L +A NG A +R
Sbjct: 81 LLDHRVGVWDVIGQCERRGSLDTSIVAASIVVNPLPALLVTLPQLRLVACNGAAAAQAWR 140
Query: 125 KHFPETL-----AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+H L A+ + +PSTSPANA +L L W +
Sbjct: 141 RHVQPLLSAKLRALPVVALPSTSPANAAWSLPRLATAWQPV 181
>gi|118728844|ref|ZP_01577361.1| conserved hypothetical protein [Delftia acidovorans SPH-1]
gi|118670289|gb|EAV76882.1| conserved hypothetical protein [Delftia acidovorans SPH-1]
Length = 181
Score = 102 bits (255), Expect = 6e-21, Method: Composition-based stats.
Identities = 64/154 (41%), Positives = 83/154 (53%), Gaps = 5/154 (3%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLL-GKPLP--DDYAER 63
Q P+ D L+LGS PG S++ QYYAHP+N+FWPL+ L G P P DDYA R
Sbjct: 20 QGLAPVADARTRALVLGSFPGLRSLQLQQYYAHPQNQFWPLLQALWPGHPQPGRDDYAGR 79
Query: 64 TEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMF 123
E LL+ +GLWDV S R+GS D I E ND +L P+L NG E+
Sbjct: 80 CEWLLARGLGLWDVYASCEREGSLDSAIRAAEVNDFVALRTHCPQLALALHNG--GESYK 137
Query: 124 RKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDW 157
+ L + + +PSTSPANA + V+ W
Sbjct: 138 HAGRTQALGLAAVKLPSTSPANASWSFARKVQAW 171
>gi|78485657|ref|YP_391582.1| hypothetical protein Tcr_1313 [Thiomicrospira crunogena XCL-2]
gi|78363943|gb|ABB41908.1| Uracil DNA glycosylase superfamily protein [Thiomicrospira
crunogena XCL-2]
Length = 168
Score = 102 bits (254), Expect = 8e-21, Method: Composition-based stats.
Identities = 58/151 (38%), Positives = 79/151 (52%), Gaps = 9/151 (5%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
+ F PI ++LILG+ P S+ + YYAHPRN FWP++A +P+ D + +
Sbjct: 5 EGFAPIVPTQPKVLILGTAPSVVSLEKSFYYAHPRNAFWPILADYFNQPV-DSVEAKKAL 63
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH 126
L I LWDV + R+GS D I E ND +L + P L + FNG+ A+ +F K
Sbjct: 64 CLENDILLWDVLSACERQGSLDSAIKQPEANDFEALFQTYPDLKAVLFNGKAAQNLFEKQ 123
Query: 127 ------FPETLAIRCLLMPSTSPANAGKTLD 151
PE L L PSTSPANA T +
Sbjct: 124 VIKKQSLPEDLMFYSL--PSTSPANARLTFE 152
>gi|32265840|ref|NP_859872.1| hypothetical protein HH0341 [Helicobacter hepaticus ATCC 51449]
gi|32261889|gb|AAP76938.1| conserved hypothetical protein [Helicobacter hepaticus ATCC 51449]
Length = 188
Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats.
Identities = 59/161 (36%), Positives = 87/161 (54%), Gaps = 8/161 (4%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPL---PDDYAERTE 65
F P+ D + +LILGS P S YYAH NRFW ++A L + D + +
Sbjct: 27 FAPVFDTYSRVLILGSFPSVISRDEQFYYAHSHNRFWHILATLFTPEIDIRAQDRDAKKQ 86
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
LL+ HI LWD+ S+D + + +PNDI S+I + +++ I NGRKA +F +
Sbjct: 87 FLLTHHIALWDIVRECDICNSSDSTLSNAKPNDI-SVILSSAKINAIFCNGRKAYELFIR 145
Query: 126 HFP----ETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFS 162
F ET I+ L+PS+SPANA +L+ LV W + +
Sbjct: 146 FFGSQMCETQEIKVFLLPSSSPANARYSLEKLVDSWRVVLN 186
>gi|119878933|ref|ZP_01645796.1| conserved hypothetical protein [Stenotrophomonas maltophilia
R551-3]
gi|119818604|gb|EAX21286.1| conserved hypothetical protein [Stenotrophomonas maltophilia
R551-3]
Length = 186
Score = 100 bits (249), Expect = 3e-20, Method: Composition-based stats.
Identities = 62/146 (42%), Positives = 83/146 (56%), Gaps = 4/146 (2%)
Query: 19 ILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILLSAHIGLWDVA 78
+L+LGS+PG S++ +YYAHPRNRFWPL+A + G Y +R + L +A +GLWDV
Sbjct: 35 VLLLGSMPGVASLQEQRYYAHPRNRFWPLLADICGFDAGLAYPQRLQALAAAKVGLWDVI 94
Query: 79 HSAIRKGS--ADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH-FPETLAIRC 135
R+GS ADI E PN + +LI P L I NG + F++ P A
Sbjct: 95 GQCERRGSLDADIVRGTEVPNALPALIASLPELELIGCNGGASLQAFQRWVLPRLEAPPT 154
Query: 136 LL-MPSTSPANAGKTLDLLVKDWNRI 160
LL +PSTSPANA +L L W +
Sbjct: 155 LLALPSTSPANAAWSLPRLRLAWQPV 180
>gi|83720475|ref|YP_441511.1| hypothetical protein BTH_I0957 [Burkholderia thailandensis E264]
gi|83654300|gb|ABC38363.1| conserved hypothetical protein [Burkholderia thailandensis E264]
Length = 158
Score = 99.4 bits (246), Expect = 7e-20, Method: Composition-based stats.
Identities = 62/163 (38%), Positives = 87/163 (53%), Gaps = 14/163 (8%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGK----PLPDDYAE 62
+ FPP+ LILGS PG+ S+ QYYAHPRN+FW L+ +L + LP Y
Sbjct: 3 RGFPPVVSPDTHTLILGSFPGEASLAAAQYYAHPRNQFWRLLGVVLDEAQLHALP--YDA 60
Query: 63 RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKA--- 119
R +L+ G+WDV + R+GS D I +PND +L E P+L + FNG+ A
Sbjct: 61 RLARVLAHGFGIWDVLAACHREGSLDSAIRHAKPNDFDALREVAPKLTKVCFNGKTAGRF 120
Query: 120 EAMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIFS 162
E R +T L++PS+SPANA + + W RI +
Sbjct: 121 EPTIRAAGFDT-----LVLPSSSPANATLSFEQKRACWQRIVA 158
>gi|120609759|ref|YP_969437.1| hypothetical protein Aave_1066 [Acidovorax avenae subsp. citrulli
AAC00-1]
gi|120588223|gb|ABM31663.1| conserved hypothetical protein [Acidovorax avenae subsp. citrulli
AAC00-1]
Length = 196
Score = 99.0 bits (245), Expect = 8e-20, Method: Composition-based stats.
Identities = 63/152 (41%), Positives = 82/152 (53%), Gaps = 11/152 (7%)
Query: 11 PIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGK-PLP--DDYAERTEIL 67
P+ +L+LGS PG S+R GQYY HP+N FWP++ L + PLP D YAER L
Sbjct: 28 PVVSERTVVLVLGSFPGVASLRAGQYYGHPQNHFWPILQALWPRHPLPGRDRYAERCAWL 87
Query: 68 LSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK-- 125
L +GLWDV + R+GS D I + ND +L R PRL IA NG ++ R
Sbjct: 88 LDRGLGLWDVYAACEREGSLDTSIRNAAVNDFAALRSRCPRLAAIAHNGGESFKHARAVR 147
Query: 126 ---HFPETL---AIRCLLMPSTSPANAGKTLD 151
P L + L +PSTSPANA + +
Sbjct: 148 TALQLPAGLQDGPVPSLRLPSTSPANASWSFE 179
>gi|91789726|ref|YP_550678.1| hypothetical protein Bpro_3884 [Polaromonas sp. JS666]
gi|91698951|gb|ABE45780.1| conserved hypothetical protein [Polaromonas sp. JS666]
Length = 173
Score = 97.1 bits (240), Expect = 3e-19, Method: Composition-based stats.
Identities = 55/157 (35%), Positives = 81/157 (51%), Gaps = 6/157 (3%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLL-GKPL---PDDYAERT 64
PP+ G +LILGS PG S+ QYY HP+N FW ++ + P+ D Y R+
Sbjct: 13 LPPLVCGQTRLLILGSFPGAASLAAQQYYGHPQNHFWKILQAIWPDGPIGTGADGYQIRS 72
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
E LLS +G+WDV + R GS D I + ND ++ + P+L +A NG E+
Sbjct: 73 EWLLSKRLGVWDVYAACERDGSLDSNIRNAVVNDFAAVRQLCPKLRAVAHNG--GESFKH 130
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIF 161
+ L + +PSTSPANA + + + W +F
Sbjct: 131 SKYTVLLGLPVYRLPSTSPANASWSFERKLSSWREVF 167
>gi|56477748|ref|YP_159337.1| T/U mismatch-specific DNA glycosylase [Azoarcus sp. EbN1]
gi|56313791|emb|CAI08436.1| T/U mismatch-specific DNA glycosylase [Azoarcus sp. EbN1]
Length = 166
Score = 97.1 bits (240), Expect = 3e-19, Method: Composition-based stats.
Identities = 59/146 (40%), Positives = 77/146 (52%), Gaps = 6/146 (4%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
+ FPP+ LILGS+PG S+ +YYAH RN FWP++ ++LG YAERT+
Sbjct: 5 EGFPPVAAPGAHTLILGSMPGAASLAALEYYAHRRNLFWPILGEVLGIDPALSYAERTQR 64
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDE--EPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
L A LWDV + R GS D I ++ E ND +P + + NG AE FR
Sbjct: 65 LADAGYALWDVLGACRRHGSLDSNIENDSIEVNDFAIFFATHPSIDRVFCNGATAEQCFR 124
Query: 125 KHFPETLAIRCLL----MPSTSPANA 146
+H L R +PSTSPANA
Sbjct: 125 RHVLPALEGRFEFEVRRLPSTSPANA 150
>gi|154151743|ref|YP_001405361.1| hypothetical protein Mboo_2204 [Candidatus Methanoregula boonei
6A8]
gi|154000295|gb|ABS56718.1| conserved hypothetical protein [Candidatus Methanoregula boonei
6A8]
Length = 162
Score = 97.1 bits (240), Expect = 3e-19, Method: Composition-based stats.
Identities = 56/152 (36%), Positives = 76/152 (50%), Gaps = 3/152 (1%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILL 68
PPI EILILGS P S+ + +YY +P+N+FW ++ L G Y RT +L
Sbjct: 8 LPPIVGRDPEILILGSFPSLISLGKSEYYGNPKNQFWRIIEILFGIDHALSYPLRTHLLT 67
Query: 69 SAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFP 128
I LWD R+GS D ICD NDI + +P + IA NG A F+ P
Sbjct: 68 EHRIALWDALAFCRREGSMDSAICDPVANDITGFVAAHPTIRCIALNGSTAGRYFKSMNP 127
Query: 129 ETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ + ++PSTSPA A + +W RI
Sbjct: 128 D---LNGHILPSTSPAYASMSFAEKTLEWARI 156
>gi|34497666|ref|NP_901881.1| hypothetical protein CV_2211 [Chromobacterium violaceum ATCC 12472]
gi|34103521|gb|AAQ59883.1| conserved hypothetical protein [Chromobacterium violaceum ATCC
12472]
Length = 167
Score = 97.1 bits (240), Expect = 3e-19, Method: Composition-based stats.
Identities = 63/141 (44%), Positives = 81/141 (57%), Gaps = 3/141 (2%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-DYAERT 64
K+ FPP+ DG +LILGSLPGD S+ QYYAHPRN+FW L+ +LG+ L Y R
Sbjct: 7 KRCFPPVADGDTRLLILGSLPGDASLAASQYYAHPRNQFWRLLGDVLGRDLAALPYDARL 66
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
+L +A +GLWDV A RKGS D + D ND+ P L +AFNG A
Sbjct: 67 AVLRAAGVGLWDVVAQARRKGSLDADLRDIRHNDLADFAASLPSLRAVAFNGAAAAKAEA 126
Query: 125 KHFPETLAIRCLLMPSTSPAN 145
+ LA+ L PS+SPA+
Sbjct: 127 QLAGLGLAVARL--PSSSPAH 145
>gi|121595712|ref|YP_987608.1| hypothetical protein Ajs_3415 [Acidovorax sp. JS42]
gi|120607792|gb|ABM43532.1| conserved hypothetical protein [Acidovorax sp. JS42]
Length = 207
Score = 96.7 bits (239), Expect = 4e-19, Method: Composition-based stats.
Identities = 62/160 (38%), Positives = 86/160 (53%), Gaps = 14/160 (8%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGK-PLP--DDYAERTE 65
PP+ +L+LGS PG S+ QYY HP+N FW ++A L + PLP D YA R +
Sbjct: 41 LPPMVGSGTRVLVLGSFPGVASLAAQQYYGHPQNHFWRILAALWPQHPLPGRDQYAARCD 100
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR- 124
LL+ +GLWDV + R+GS D I + + ND L R P L IA NG ++ FR
Sbjct: 101 WLLARGLGLWDVYAACEREGSLDTSIRNAQVNDFAGLHARAPGLAAIAHNGGES---FRH 157
Query: 125 -KHFPETLAIRC------LLMPSTSPANAGKTLDLLVKDW 157
K +L ++ L +PSTSPANA + + + W
Sbjct: 158 AKAVRASLGLKADAPMPSLRLPSTSPANASWSFEKKLHAW 197
>gi|157101474|gb|EAT97723.2| Mug G:T/U mismatch-specific DNA glycosylase [Campylobacter concisus
13826]
Length = 168
Score = 96.3 bits (238), Expect = 5e-19, Method: Composition-based stats.
Identities = 57/164 (34%), Positives = 93/164 (56%), Gaps = 7/164 (4%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M+ +H F PI D + +ILILGS P S + G YY +P+NRFW ++A +L PLP
Sbjct: 1 MSQMH--PFKPIFDKNSKILILGSFPSVISRKFGFYYTNPQNRFWRVLAGILNAPLPAST 58
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
E+ + LLS HI ++D A S +GS+D ++ P +++ L + ++ + NG KA
Sbjct: 59 DEKIKFLLSHHIAIYDAAISCEIEGSSDAKMSKIVPVNLKPLF-KEAKIKQVYANGGKAY 117
Query: 121 AMFRKHFPETL----AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ +K+ + + +PSTSPANA +L+ L+ +W +
Sbjct: 118 EICKKYLENEIIKATKNEVIKLPSTSPANAKFSLERLMCEWQVV 161
>gi|86135463|ref|ZP_01054044.1| hypothetical protein MED152_12184 [Tenacibaculum sp. MED152]
gi|85819636|gb|EAQ40793.1| hypothetical protein MED152_12184 [Polaribacter dokdonensis MED152]
Length = 159
Score = 95.9 bits (237), Expect = 8e-19, Method: Composition-based stats.
Identities = 49/154 (31%), Positives = 83/154 (53%)
Query: 4 VHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAER 63
+ + S P+ + I ILG+ PG+ +R+ +YYA+ N FW ++ +L + +YAER
Sbjct: 1 MERVSLDPVLPHNPTIFILGTKPGNLELRKQEYYANNGNSFWKIIYELTEEVFSKNYAER 60
Query: 64 TEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMF 123
++L HI +WD+ S R + +E PN + ++E P + I FNG+KA ++
Sbjct: 61 IDLLKRNHIAVWDIIESGERVKPGAQNVKNEIPNPVNEIVEMYPSVKQIIFNGQKAHDLY 120
Query: 124 RKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDW 157
KHF + I ++ STSPAN + + +W
Sbjct: 121 LKHFSKIEGIIYEVVLSTSPANTRFSFREKLNNW 154
>gi|153160315|ref|ZP_01925940.1| conserved hypothetical protein [Campylobacter concisus 13826]
Length = 165
Score = 95.5 bits (236), Expect = 8e-19, Method: Composition-based stats.
Identities = 55/156 (35%), Positives = 89/156 (57%), Gaps = 5/156 (3%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILL 68
F PI D + +ILILGS P S + G YY +P+NRFW ++A +L PLP E+ + LL
Sbjct: 4 FKPIFDKNSKILILGSFPSVISRKFGFYYTNPQNRFWRVLAGILNAPLPASTDEKIKFLL 63
Query: 69 SAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFP 128
S HI ++D A S +GS+D ++ P +++ L + ++ + NG KA + +K+
Sbjct: 64 SHHIAIYDAAISCEIEGSSDAKMSKIVPVNLKPLF-KEAKIKQVYANGGKAYEICKKYLE 122
Query: 129 ETL----AIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ + +PSTSPANA +L+ L+ +W +
Sbjct: 123 NEIIKATKNEVIKLPSTSPANAKFSLERLMCEWQVV 158
>gi|86152465|ref|ZP_01070670.1| T/U mismatch-specific DNA glycosylase [Campylobacter jejuni subsp.
jejuni HB93-13]
gi|85843350|gb|EAQ60560.1| T/U mismatch-specific DNA glycosylase [Campylobacter jejuni subsp.
jejuni HB93-13]
Length = 160
Score = 95.5 bits (236), Expect = 9e-19, Method: Composition-based stats.
Identities = 60/160 (37%), Positives = 81/160 (50%), Gaps = 5/160 (3%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M+ F P D +ILILGS P +S + G YY HPRNRFWP++ L L ++
Sbjct: 1 MSEFLTHPFEPFFDKDSKILILGSFPSIKSRQDGFYYQHPRNRFWPILETLFNAKL-ENI 59
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
E+ L HI LWDV S K S D I + ND+ SLI ++ I G+ A
Sbjct: 60 TEQQAFLRKKHIALWDVLQSCKIKNSDDKTISYAKANDL-SLILSQTKIQAIFTTGQSAY 118
Query: 121 AMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F K P AI +PSTSPAN + + L++++ I
Sbjct: 119 RFFVKFHPHLEAIA---LPSTSPANLNFSFEQLLQNYEII 155
>gi|89901985|ref|YP_524456.1| hypothetical protein Rfer_3216 [Rhodoferax ferrireducens T118]
gi|89346722|gb|ABD70925.1| conserved hypothetical protein [Rhodoferax ferrireducens DSM 15236]
Length = 180
Score = 95.1 bits (235), Expect = 1e-18, Method: Composition-based stats.
Identities = 55/147 (37%), Positives = 78/147 (53%), Gaps = 6/147 (4%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP----LPDDYAERT 64
PP+ + +L+LGS P S+ R QYYAHP+N+FW ++ + Y ER+
Sbjct: 20 LPPLVSENTRLLVLGSFPSITSLARQQYYAHPQNQFWRILQAIFPSSPRGISAASYQERS 79
Query: 65 EILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
LL+ +G+WDV S R+GS D I + ND L R P+L IA NG E+
Sbjct: 80 AWLLAQGLGIWDVYASCEREGSLDTAIRQPQVNDFSGLRLRCPQLQLIAHNG--GESFRH 137
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLD 151
+ TL + L +PSTSPANA + +
Sbjct: 138 AKYTATLGLPVLKLPSTSPANASWSFE 164
>gi|149193946|ref|ZP_01871044.1| hypothetical protein CMTB2_02638 [Caminibacter mediatlanticus TB-2]
gi|149135899|gb|EDM24377.1| hypothetical protein CMTB2_02638 [Caminibacter mediatlanticus TB-2]
Length = 171
Score = 94.7 bits (234), Expect = 1e-18, Method: Composition-based stats.
Identities = 53/167 (31%), Positives = 81/167 (48%), Gaps = 6/167 (3%)
Query: 3 SVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAE 62
S+ F PI EILILG+ P +S YY+HP+N+FW +++ + + P E
Sbjct: 4 SILTHPFKPIVFDTSEILILGTFPSIKSFENNFYYSHPQNQFWKILSTIFNEKTPKTIEE 63
Query: 63 RTEILLSAHIGLWDVAHSAIR--KGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
+ L I LWD R K S D + +P DI +L+++ P + +A R AE
Sbjct: 64 KINFLKKHKIALWDTICECKRKEKNSKDSNLEIIKPCDIENLLKQYPNIKKVAVTSRTAE 123
Query: 121 AMFRKHFPET----LAIRCLLMPSTSPANAGKTLDLLVKDWNRIFSL 163
+ +KHF + L + + +PS SP A K VK+W + L
Sbjct: 124 KILKKHFKNSQFSILNSQLIYLPSPSPLYASKNFQTKVKEWKELLQL 170
>gi|150017648|ref|YP_001309902.1| Mug G:T/U mismatch-specific DNA glycosylase [Clostridium
beijerinckii NCIMB 8052]
gi|149904113|gb|ABR34946.1| Mug G:T/U mismatch-specific DNA glycosylase [Clostridium
beijerinckii NCIMB 8052]
Length = 160
Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats.
Identities = 57/155 (36%), Positives = 82/155 (52%), Gaps = 1/155 (0%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTE 65
KQ PI D ILILG+ P +S YYA+ RNRFW ++A + + ++ +
Sbjct: 5 KQMLEPIYDEKSRILILGTFPSVQSREAQFYYANARNRFWKVIAGIFDCNEANTTFDKKD 64
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
+LL+ I LWDV S GS+D I + P D+ S++ N + I NG KA ++ K
Sbjct: 65 MLLNNGIALWDVIESCEITGSSDSSIKNVVPADL-SIVLNNASIEKIYANGDKAFKLYMK 123
Query: 126 HFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ + + +PSTSPANA TLD L+ W I
Sbjct: 124 YTYPIIKRDIIKLPSTSPANAKFTLDTLIDYWKCI 158
>gi|113938045|ref|ZP_01423916.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC
23779]
gi|113900292|gb|EAU19283.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC
23779]
Length = 168
Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats.
Identities = 54/146 (36%), Positives = 73/146 (50%), Gaps = 6/146 (4%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTE 65
K + P I LILG+LPGD+S+ YY HPRN+FW +M + R
Sbjct: 3 KFALPAIVHSTSRCLILGTLPGDQSLAAQFYYLHPRNQFWRIMQAVFADEPGLTSVTRAA 62
Query: 66 ILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRK 125
L + LWDV H+A R GS D I + PND ++ NP + + NG KA+ +F +
Sbjct: 63 FLARHQLALWDVLHAAERVGSLDTAITNPIPNDFQTFFTANPAIQKVGLNGMKAQQLFMR 122
Query: 126 ----HFPETLAIRCLL--MPSTSPAN 145
+ L R L +PSTSPAN
Sbjct: 123 LVWPNLTSELQSRLELVGLPSTSPAN 148
>gi|121613018|ref|YP_001000930.1| hypothetical protein CJJ81176_1270 [Campylobacter jejuni subsp.
jejuni 81-176]
gi|87249748|gb|EAQ72707.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni
81-176]
Length = 160
Score = 94.7 bits (234), Expect = 2e-18, Method: Composition-based stats.
Identities = 60/160 (37%), Positives = 82/160 (51%), Gaps = 5/160 (3%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M+ F P D +ILILGS P +S + G YY HPRNRFWP++ L L ++
Sbjct: 1 MSEFLTHPFEPFFDKDSKILILGSFPSIKSRQDGFYYQHPRNRFWPILETLFDVKL-ENI 59
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
AE+ L HI LWDV S K S D I + ND+ +LI ++ I G+ A
Sbjct: 60 AEQQAFLRKKHIALWDVLQSCKIKNSDDKTISYAKANDL-NLILSQTKIQAIFTTGQSAY 118
Query: 121 AMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F K P AI +PSTSPAN + + L++++ I
Sbjct: 119 RFFVKFHPRLEAIA---LPSTSPANLNFSFEQLLQNYEII 155
>gi|153951695|ref|YP_001397656.1| hypothetical protein JJD26997_0471 [Campylobacter jejuni subsp.
doylei 269.97]
gi|152939141|gb|ABS43882.1| conserved hypothetical protein [Campylobacter jejuni subsp. doylei
269.97]
Length = 160
Score = 94.4 bits (233), Expect = 2e-18, Method: Composition-based stats.
Identities = 60/162 (37%), Positives = 83/162 (51%), Gaps = 9/162 (5%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M+ F P D +ILILGS P +S + G YY HPRNRFWP++ L L ++
Sbjct: 1 MSEFLTHPFKPFFDKDSKILILGSFPSVKSRQYGFYYQHPRNRFWPILETLFNVKL-ENI 59
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
E+ L HI LWDV S K S D I + ND+ +LI ++ I G+ A
Sbjct: 60 TEQQAFLRKKHIALWDVLQSCKIKNSDDKTISYAKANDL-NLILSQAKIQAIFTTGQSAY 118
Query: 121 AMFRKHFP--ETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F K P ET+A +PSTSPAN + + L++++ I
Sbjct: 119 KFFVKFHPRLETIA-----LPSTSPANLNFSFEQLLQNYEII 155
>gi|153852709|ref|ZP_01994146.1| hypothetical protein DORLON_00128 [Dorea longicatena DSM 13814]
gi|149754351|gb|EDM64282.1| hypothetical protein DORLON_00128 [Dorea longicatena DSM 13814]
Length = 191
Score = 94.0 bits (232), Expect = 2e-18, Method: Composition-based stats.
Identities = 52/155 (33%), Positives = 86/155 (55%), Gaps = 3/155 (1%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
+F PI + H +IL+LG++P +S G YY HPRNRFW +++ + G+ P+ +
Sbjct: 35 HNFEPIYNEHSKILMLGTMPSPKSREVGFYYGHPRNRFWKVVSDVCGEAYPETREGKIAF 94
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH 126
L I +WDV KG+ D I + +PND+ ++E + I G KA +++K
Sbjct: 95 ALRNGIAVWDVLAGCDIKGADDSSIRNPQPNDLSRILE-CADIKAIFTTGTKAFQLYKKF 153
Query: 127 -FPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+P+T I + +PSTSPAN + + L + ++ I
Sbjct: 154 CYPKT-GIMAIGLPSTSPANCRTSYEQLFEAYSEI 187
>gi|15792578|ref|NP_282401.1| hypothetical protein Cj1254 [Campylobacter jejuni subsp. jejuni
NCTC 11168]
gi|86150575|ref|ZP_01068798.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni
CF93-6]
gi|88596277|ref|ZP_01099514.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni
84-25]
gi|85838926|gb|EAQ56192.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni
CF93-6]
gi|88191118|gb|EAQ95090.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni
84-25]
gi|112360572|emb|CAL35369.1| hypothetical protein Cj1254 [Campylobacter jejuni subsp. jejuni
NCTC 11168]
Length = 160
Score = 94.0 bits (232), Expect = 2e-18, Method: Composition-based stats.
Identities = 59/160 (36%), Positives = 81/160 (50%), Gaps = 5/160 (3%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M+ F P D +ILILGS P +S + G YY HPRNRFWP++ L L ++
Sbjct: 1 MSEFLTHPFEPFFDKDSKILILGSFPSIKSRQDGFYYQHPRNRFWPILETLFNAKL-ENI 59
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
E+ L HI LWDV S K S D I + ND+ +LI ++ I G+ A
Sbjct: 60 TEQQAFLRKKHIALWDVLQSCKIKNSDDKTISYAKANDL-NLILSQTKIQAIFTTGQSAY 118
Query: 121 AMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F K P AI +PSTSPAN + + L++++ I
Sbjct: 119 RFFVKFHPRLEAIA---LPSTSPANLNFSFEQLLQNYEII 155
>gi|33601416|ref|NP_888976.1| hypothetical protein BB2436 [Bordetella bronchiseptica RB50]
gi|33575852|emb|CAE32930.1| conserved hypothetical protein [Bordetella bronchiseptica RB50]
Length = 155
Score = 94.0 bits (232), Expect = 3e-18, Method: Composition-based stats.
Identities = 51/142 (35%), Positives = 73/142 (51%), Gaps = 6/142 (4%)
Query: 25 LPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILLSAHIGLWDVAHSAIRK 84
+PG S+R QYY HPRN FWP+ A+L G YA R + L A + LWDV + R
Sbjct: 1 MPGVASLRAQQYYGHPRNAFWPIAAELFGLDAAQPYAHRVQALQRAGVALWDVLQACERP 60
Query: 85 GSADIQICDE--EPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFPETL----AIRCLLM 138
GS D I + PND ++ +P + + NG A ++R+H L +R + +
Sbjct: 61 GSLDAAIVADSVRPNDFQAFFAAHPGIACVGLNGAAAATLYRRHVLPRLHGLAHLRYVAL 120
Query: 139 PSTSPANAGKTLDLLVKDWNRI 160
PSTSPA+A ++ W I
Sbjct: 121 PSTSPAHAAMDFAAKLRAWRAI 142
>gi|86150913|ref|ZP_01069129.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni
260.94]
gi|85842083|gb|EAQ59329.1| conserved hypothetical protein [Campylobacter jejuni subsp. jejuni
260.94]
Length = 160
Score = 93.2 bits (230), Expect = 5e-18, Method: Composition-based stats.
Identities = 59/160 (36%), Positives = 81/160 (50%), Gaps = 5/160 (3%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M+ F P D +ILILGS P +S + G YY HPRNRFWP++ L L ++
Sbjct: 1 MSEFLTHPFEPFFDKDSKILILGSFPSIKSRQDGFYYQHPRNRFWPILETLFDVKL-ENI 59
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
E+ L HI LWDV S K S D I + ND+ +LI ++ I G+ A
Sbjct: 60 TEQQAFLRKKHIALWDVLQSCKIKNSDDKTISYAKANDL-NLILSQTKIQAIFTTGQSAY 118
Query: 121 AMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F K P AI +PSTSPAN + + L++++ I
Sbjct: 119 RFFVKFHPRLEAIA---LPSTSPANLNFSFEQLLQNYEII 155
>gi|118462625|ref|YP_880656.1| T/U mismatch-specific DNA glycosylase [Mycobacterium avium 104]
gi|118163912|gb|ABK64809.1| T/U mismatch-specific DNA glycosylase [Mycobacterium avium 104]
Length = 165
Score = 92.4 bits (228), Expect = 8e-18, Method: Composition-based stats.
Identities = 56/150 (37%), Positives = 75/150 (50%), Gaps = 8/150 (5%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M++ + FPP+ D LILGS P +S+ GQYYA+PRN FW + +L G Y
Sbjct: 1 MSTPLLRGFPPVVDERARTLILGSFPSAQSLLTGQYYANPRNAFWSITGELFGFDAAAPY 60
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEP-----NDIRSLIERNPRLHTIAFN 115
R L I LWDV H+ R GSAD I EP ND +P + + FN
Sbjct: 61 PRRLAQLRRHRIALWDVLHACRRAGSADSAI---EPNSLVVNDFGEFFAEHPGITRVYFN 117
Query: 116 GRKAEAMFRKHFPETLAIRCLLMPSTSPAN 145
G KA ++R+ + +PSTSPA+
Sbjct: 118 GAKAAELYRRLATAPDHVCFQRLPSTSPAH 147
>gi|57238128|ref|YP_179378.1| hypothetical protein CJE1391 [Campylobacter jejuni RM1221]
gi|148926250|ref|ZP_01809935.1| hypothetical protein Cj8486_1298 [Campylobacter jejuni subsp.
jejuni CG8486]
gi|57166932|gb|AAW35711.1| conserved hypothetical protein [Campylobacter jejuni RM1221]
gi|145845421|gb|EDK22514.1| hypothetical protein Cj8486_1298 [Campylobacter jejuni subsp.
jejuni CG8486]
Length = 160
Score = 91.7 bits (226), Expect = 1e-17, Method: Composition-based stats.
Identities = 55/160 (34%), Positives = 82/160 (51%), Gaps = 5/160 (3%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M+ F P D +ILILGS P +S + G YY HPRNRFWP++ L L ++
Sbjct: 1 MSEFLTHPFEPFFDKDSKILILGSFPSVKSRQDGFYYQHPRNRFWPILETLFDVRL-ENI 59
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAE 120
E+ L HI LWDV S K S D I + ND+ ++ + ++ I G+ A
Sbjct: 60 TEQQAFLRKKHIALWDVLQSCKIKNSDDKTISYAKANDLSPILSQT-KIQAIFTTGQSAY 118
Query: 121 AMFRKHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
F K P ++ + +PSTSPAN + + L++++ I
Sbjct: 119 KFFIKFHPR---LKAIALPSTSPANLNFSFEQLLQNYEII 155
>gi|118475705|ref|YP_892419.1| Mug G:T/U mismatch-specific DNA glycosylase [Campylobacter fetus
subsp. fetus 82-40]
gi|118414931|gb|ABK83351.1| Mug G:T/U mismatch-specific DNA glycosylase [Campylobacter fetus
subsp. fetus 82-40]
Length = 157
Score = 91.3 bits (225), Expect = 2e-17, Method: Composition-based stats.
Identities = 53/153 (34%), Positives = 81/153 (52%), Gaps = 5/153 (3%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILL 68
F P+ D +ILILGS P +S G YYAH +NRFW ++ L G + ++ + LL
Sbjct: 6 FKPVFDSRSKILILGSFPSVKSREMGFYYAHSQNRFWKVLIALFGSEHLRNVQDKIDFLL 65
Query: 69 SAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFP 128
+ + D A+S KGS+D I +P ++ LI N + + NG + A+ +K +
Sbjct: 66 KHQVAIHDAAYSCSVKGSSDNSISSVKPANL-DLIFENADIKAVFANGNTSYAICKKFY- 123
Query: 129 ETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIF 161
+ +PSTSPANA +L+ LV +W I
Sbjct: 124 ---NFEVIKLPSTSPANAKFSLEKLVDEWKIIL 153
>gi|120405405|ref|YP_955234.1| hypothetical protein Mvan_4453 [Mycobacterium vanbaalenii PYR-1]
gi|119958223|gb|ABM15228.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length = 167
Score = 90.9 bits (224), Expect = 2e-17, Method: Composition-based stats.
Identities = 53/156 (33%), Positives = 78/156 (50%), Gaps = 3/156 (1%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
QSFPP+ ILILG++PG S++ +YYAH RN FW + +L G Y R +
Sbjct: 7 QSFPPLAAPGARILILGNMPGVASLQAQRYYAHTRNAFWGITGELFGFDPAAPYDHRVAV 66
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
L +A + +WDV R GS D + + PND S +P + + FNG AE +
Sbjct: 67 LTAAGVAVWDVLRHCRRAGSLDSAVEPDSMVPNDFASFFADHPAITHVYFNGAAAEKNYG 126
Query: 125 KHFPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ + +PSTSPA+ + D + W +I
Sbjct: 127 RLVTAPATVSYTRLPSTSPAHT-MSFDRKLAAWRQI 161
>gi|21230015|ref|NP_635932.1| hypothetical protein XCC0540 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|21111534|gb|AAM39856.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 174
Score = 90.9 bits (224), Expect = 2e-17, Method: Composition-based stats.
Identities = 55/149 (36%), Positives = 75/149 (50%), Gaps = 7/149 (4%)
Query: 19 ILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILLSAHIGLWDVA 78
+L+LGS+PG S+ YYAHPRNRFWP+M +LLG Y R + L +GLWDV
Sbjct: 21 VLVLGSMPGSASLHAHAYYAHPRNRFWPVMQQLLGIDADAPYDARLQQLAERGVGLWDVI 80
Query: 79 HSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFPETLA---- 132
R+GS D I N + + P+L + NG A +R+H L+
Sbjct: 81 GECARRGSLDAAIVPGSIVVNPLPERLATLPQLRLVVCNGSAAAQAWRRHVQPALSPPLT 140
Query: 133 -IRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ +PSTSPANA +L L W +
Sbjct: 141 RLPVQAVPSTSPANAAWSLPRLCAAWQPV 169
>gi|66769991|ref|YP_244753.1| hypothetical protein XC_3693 [Xanthomonas campestris pv. campestris
str. 8004]
gi|66575323|gb|AAY50733.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 160
Score = 90.5 bits (223), Expect = 3e-17, Method: Composition-based stats.
Identities = 55/149 (36%), Positives = 75/149 (50%), Gaps = 7/149 (4%)
Query: 19 ILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILLSAHIGLWDVA 78
+L+LGS+PG S+ YYAHPRNRFWP+M +LLG Y R + L +GLWDV
Sbjct: 7 VLVLGSMPGSASLHAHAYYAHPRNRFWPVMQQLLGIDADAPYDARLQQLAERGVGLWDVI 66
Query: 79 HSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFPETLA---- 132
R+GS D I N + + P+L + NG A +R+H L+
Sbjct: 67 GECARRGSLDAAIVPGSIVVNPLPERLATLPQLRLVVCNGSAAAQAWRRHVQPALSPPLT 126
Query: 133 -IRCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ +PSTSPANA +L L W +
Sbjct: 127 RLPVQAVPSTSPANAAWSLPRLCAAWQPV 155
>gi|108800914|ref|YP_641111.1| hypothetical protein Mmcs_3950 [Mycobacterium sp. MCS]
gi|119870054|ref|YP_940006.1| hypothetical protein Mkms_4024 [Mycobacterium sp. KMS]
gi|108771333|gb|ABG10055.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119696143|gb|ABL93216.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length = 165
Score = 90.5 bits (223), Expect = 3e-17, Method: Composition-based stats.
Identities = 51/140 (36%), Positives = 69/140 (49%), Gaps = 2/140 (1%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
Q F PI G L+LG+ P ++ + QYY +PRN FW + L G D YA RT
Sbjct: 8 QGFAPIVGGAPRTLVLGNAPSVLALAKHQYYGNPRNAFWRIAGALYGFDADDPYAVRTAA 67
Query: 67 LLSAHIGLWDVAHSAIRKGSAD--IQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
L++ + +WDV R GS D +Q ND S +P + + FNG AE FR
Sbjct: 68 LIAHGVAVWDVLRECRRTGSLDSAVQPDSMVANDFGSFFGEHPTIRRVLFNGAAAERNFR 127
Query: 125 KHFPETLAIRCLLMPSTSPA 144
+ P + +PSTSPA
Sbjct: 128 RLVPGAFEVEFSRLPSTSPA 147
>gi|41408606|ref|NP_961442.1| hypothetical protein MAP2508 [Mycobacterium avium subsp.
paratuberculosis K-10]
gi|41396964|gb|AAS04825.1| hypothetical protein MAP_2508 [Mycobacterium avium subsp.
paratuberculosis K-10]
Length = 165
Score = 90.1 bits (222), Expect = 4e-17, Method: Composition-based stats.
Identities = 55/150 (36%), Positives = 75/150 (50%), Gaps = 8/150 (5%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M++ + FPP+ D LILGS P +S+ GQYYA+PRN FW + +L G Y
Sbjct: 1 MSTPLLRGFPPVVDERARTLILGSFPSAQSLLTGQYYANPRNAFWSITGELFGFDAAAPY 60
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDI-----RSLIERNPRLHTIAFN 115
R L I LWDV H+ R GSAD I EPN + +P + + FN
Sbjct: 61 PRRLAQLRRHRIALWDVLHACRRAGSADSAI---EPNSLVVNGFGEFFAEHPGITRVYFN 117
Query: 116 GRKAEAMFRKHFPETLAIRCLLMPSTSPAN 145
G KA ++R+ + +PSTSPA+
Sbjct: 118 GAKAAELYRRLAMAPDHVCFQRLPSTSPAH 147
>gi|126436539|ref|YP_001072230.1| hypothetical protein Mjls_3964 [Mycobacterium sp. JLS]
gi|126236339|gb|ABN99739.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length = 165
Score = 90.1 bits (222), Expect = 4e-17, Method: Composition-based stats.
Identities = 51/140 (36%), Positives = 69/140 (49%), Gaps = 2/140 (1%)
Query: 7 QSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEI 66
Q F PI G L+LG+ P ++ + QYY +PRN FW + L G D YA RT
Sbjct: 8 QGFAPIVGGAPRTLVLGNAPSVLALAKHQYYGNPRNAFWRIAGALYGFDADDPYAVRTAA 67
Query: 67 LLSAHIGLWDVAHSAIRKGSAD--IQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFR 124
L++ + +WDV R GS D +Q ND S +P + + FNG AE FR
Sbjct: 68 LIAHGVAVWDVLRECRRAGSLDSAVQPDSVVANDFGSFFGEHPTIRRVLFNGAAAERNFR 127
Query: 125 KHFPETLAIRCLLMPSTSPA 144
+ P + +PSTSPA
Sbjct: 128 RLVPGAFEVEFSRLPSTSPA 147
>gi|121606132|ref|YP_983461.1| hypothetical protein Pnap_3242 [Polaromonas naphthalenivorans CJ2]
gi|120595101|gb|ABM38540.1| conserved hypothetical protein [Polaromonas naphthalenivorans CJ2]
Length = 165
Score = 90.1 bits (222), Expect = 4e-17, Method: Composition-based stats.
Identities = 55/155 (35%), Positives = 78/155 (50%), Gaps = 6/155 (3%)
Query: 11 PIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGK-PLP---DDYAERTEI 66
P+ + +LILGS PG S+ QYY HP+N+FW ++ + +P D Y R+E
Sbjct: 7 PLVSSNTRLLILGSFPGAASLAAQQYYGHPQNQFWRILQAIWPSCAMPTGADSYKIRSEW 66
Query: 67 LLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKH 126
LL+ +G+WDV + R+GS D I ND L P L IA NG E+
Sbjct: 67 LLARGLGVWDVYAACEREGSLDSSIRKPVVNDFAQLRRWCPELQAIAHNG--GESFKHAR 124
Query: 127 FPETLAIRCLLMPSTSPANAGKTLDLLVKDWNRIF 161
E L + +PSTSPANA + + + W +F
Sbjct: 125 HTELLGLPVYRLPSTSPANASWSFERKLAAWREVF 159
>gi|118473922|ref|YP_889275.1| T/U mismatch-specific DNA glycosylase [Mycobacterium smegmatis str.
MC2 155]
gi|118175209|gb|ABK76105.1| T/U mismatch-specific DNA glycosylase [Mycobacterium smegmatis str.
MC2 155]
Length = 162
Score = 89.0 bits (219), Expect = 9e-17, Method: Composition-based stats.
Identities = 57/155 (36%), Positives = 78/155 (50%), Gaps = 4/155 (2%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDY 60
M+SV PPI +LILG++P S+ +YY +PRN FW +M L G Y
Sbjct: 1 MSSV-LHGLPPIVGEAPRVLILGNMPSVMSLASSEYYGNPRNAFWRIMGSLYGFAPDAAY 59
Query: 61 AERTEILLSAHIGLWDVAHSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRK 118
+ER + S I +WDV S R GS D + + PND +P L + FNG
Sbjct: 60 SERVSAVTSHGIAVWDVLRSCRRVGSLDSAVERDSMVPNDFAGFFAAHPSLERVVFNGAA 119
Query: 119 AEAMFRKHFPETLAIRCLLMPSTSPANAGKTLDLL 153
AEA +R+ E + + + MPSTSPA + D L
Sbjct: 120 AEANYRRLVGE-VPLPYVRMPSTSPAQTMRFEDKL 153
>gi|154175180|ref|YP_001407814.1| hypothetical protein CCV52592_0946 [Campylobacter curvus 525.92]
gi|112802668|gb|EAU00012.1| protein of unknown function [Campylobacter curvus 525.92]
Length = 171
Score = 87.4 bits (215), Expect = 3e-16, Method: Composition-based stats.
Identities = 55/157 (35%), Positives = 84/157 (53%), Gaps = 5/157 (3%)
Query: 8 SFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEIL 67
+F P+ D + +ILILGS P S + G YYA+P+NRFW ++A++ PLP E+ L
Sbjct: 11 NFRPVFDENSKILILGSFPSVVSRKLGFYYANPQNRFWRVLAQIFDSPLPSSKDEKINFL 70
Query: 68 LSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHF 127
LS I ++D A S KGS+D + + P ++ S I + + NG KA + +K
Sbjct: 71 LSHKIAIYDAAISCEIKGSSDADMREVTPVNL-SAIFSVAGISQVFANGAKAYEISKKFL 129
Query: 128 PETLAI----RCLLMPSTSPANAGKTLDLLVKDWNRI 160
+ + + +PSTS ANA +L LV W +
Sbjct: 130 EADIEKATKNKIIQLPSTSSANAKFSLPELVSRWQVV 166
>gi|118586810|ref|ZP_01544245.1| hypothetical protein OENOO_55006 [Oenococcus oeni ATCC BAA-1163]
gi|118432730|gb|EAV39461.1| hypothetical protein OENOO_55006 [Oenococcus oeni ATCC BAA-1163]
Length = 188
Score = 86.7 bits (213), Expect = 4e-16, Method: Composition-based stats.
Identities = 52/154 (33%), Positives = 81/154 (52%), Gaps = 3/154 (1%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAE-RTEIL 67
+PPI D + ++L+LGSLP +S+ G YY P+N FW +AK LG P+ E R
Sbjct: 30 WPPIYDENSKVLLLGSLPSPKSLASGFYYGSPQNIFWSTLAKTLGVDEPEKSKEARLSFA 89
Query: 68 LSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHF 127
L + +W A G+ D I + PND ++ +N ++ TI G+KA +F K
Sbjct: 90 LKHRVAIWITLKEADIIGAQDSTIKNPVPNDFSDIL-KNSQIRTIFTEGKKATELFNKLA 148
Query: 128 PETLAIRCLLMPSTSPAN-AGKTLDLLVKDWNRI 160
+ + +PSTSPAN A + + ++ WN +
Sbjct: 149 APKIGQESVYLPSTSPANRATQAKPIFMERWNLV 182
>gi|57168156|ref|ZP_00367295.1| conserved hypothetical protein [Campylobacter coli RM2228]
gi|57020530|gb|EAL57199.1| conserved hypothetical protein [Campylobacter coli RM2228]
Length = 162
Score = 85.9 bits (211), Expect = 8e-16, Method: Composition-based stats.
Identities = 52/152 (34%), Positives = 78/152 (51%), Gaps = 5/152 (3%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILL 68
F P D +ILILGS P +S G YY HPRNRFWP++ L L ++ ++ L
Sbjct: 11 FKPFFDKDSKILILGSFPSVKSREDGFYYQHPRNRFWPILETLFDTRL-ENITKQQAFLK 69
Query: 69 SAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFP 128
+I +WDV S K S D I + ND+ +LI ++ I G+ A F K P
Sbjct: 70 EKNIAIWDVLQSCKIKNSDDKTISYAKANDL-NLILSQAQIQAIFTTGKSAYRFFTKFHP 128
Query: 129 ETLAIRCLLMPSTSPANAGKTLDLLVKDWNRI 160
++ + +PS+SPAN + + L++ + I
Sbjct: 129 H---LKAIALPSSSPANLNFSFEQLLQSYKII 157
>gi|91772987|ref|YP_565679.1| hypothetical protein Mbur_0989 [Methanococcoides burtonii DSM 6242]
gi|91712002|gb|ABE51929.1| conserved hypothetical protein [Methanococcoides burtonii DSM 6242]
Length = 160
Score = 85.5 bits (210), Expect = 8e-16, Method: Composition-based stats.
Identities = 53/148 (35%), Positives = 74/148 (50%), Gaps = 5/148 (3%)
Query: 1 MASVHKQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPD-D 59
M S+ + I + EILI G+ P ++S + QYYA RN+FW LM+ +LG L +
Sbjct: 1 MVSLRSKGLDQIVNEETEILITGTFPSEKSRNQKQYYADSRNQFWNLMSFVLGIDLKKIE 60
Query: 60 YAERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKA 119
Y R IL IGLWD S GS+D IC+ + ND L + + I NG K
Sbjct: 61 YNNRVSILKKHKIGLWDTIESCEIGGSSDKNICNPQYNDFSHLTQ----IKKIVCNGGKV 116
Query: 120 EAMFRKHFPETLAIRCLLMPSTSPANAG 147
E KH ++ + + S+S AN G
Sbjct: 117 EETCNKHCNVLDGVQVVRVLSSSSANNG 144
>gi|116490596|ref|YP_810140.1| G:T/U mismatch-specific DNA glycosylase [Oenococcus oeni PSU-1]
gi|116091321|gb|ABJ56475.1| G:T/U mismatch-specific DNA glycosylase [Oenococcus oeni PSU-1]
Length = 194
Score = 84.7 bits (208), Expect = 2e-15, Method: Composition-based stats.
Identities = 49/138 (35%), Positives = 73/138 (52%), Gaps = 2/138 (1%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAE-RTEIL 67
+PPI D + ++L+LGSLP +S+ G YY P+N FW +AK LG P+ E R
Sbjct: 25 WPPIYDENSKVLLLGSLPSPKSLASGFYYGSPQNIFWSTLAKTLGVDEPEKSKEARLSFA 84
Query: 68 LSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHF 127
L + +W A G+ D I + PND ++ +N ++ TI G+KA +F K
Sbjct: 85 LKHRVAIWITLKEADIIGAQDSTIKNPVPNDFSDIL-KNSQIRTIFTEGKKATELFNKLA 143
Query: 128 PETLAIRCLLMPSTSPAN 145
+ + +PSTSPAN
Sbjct: 144 APKIGQESVYLPSTSPAN 161
>gi|119025890|ref|YP_909735.1| hypothetical protein BAD_0872 [Bifidobacterium adolescentis ATCC
15703]
gi|118765474|dbj|BAF39653.1| hypothetical protein [Bifidobacterium adolescentis ATCC 15703]
Length = 179
Score = 83.2 bits (204), Expect = 5e-15, Method: Composition-based stats.
Identities = 58/166 (34%), Positives = 79/166 (47%), Gaps = 15/166 (9%)
Query: 6 KQSFPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKP-------LPD 58
+ F P+ D +L+LGS+P +S YY HPRNRFWP+MA + PD
Sbjct: 4 EHGFGPVWDAGSRVLVLGSVPSPKSREMRFYYGHPRNRFWPVMAAVFHDDGCLRECRDPD 63
Query: 59 DYAE-RTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGR 117
R + L H+ LWDV S G++D I + PND+ +I R+ H G
Sbjct: 64 QIVGLRRDFALRHHMALWDVIESCDIDGASDASIRNAVPNDLSPIIMRSKIAHVFT-TGA 122
Query: 118 KAEAMFRKHFPETLAIRCL------LMPSTSPANAGKTLDLLVKDW 157
KA ++RK L L +PSTSPANA L LV+ +
Sbjct: 123 KAAQLYRKLCVPLLRAHGLDGMPMTQLPSTSPANAKADLPGLVEAY 168
>gi|57241603|ref|ZP_00369548.1| G/U mismatch-specific DNA glycosylase, putative [Campylobacter lari
RM2100]
gi|57017496|gb|EAL54273.1| G/U mismatch-specific DNA glycosylase, putative [Campylobacter lari
RM2100]
Length = 157
Score = 80.9 bits (198), Expect = 2e-14, Method: Composition-based stats.
Identities = 51/150 (34%), Positives = 75/150 (50%), Gaps = 5/150 (3%)
Query: 9 FPPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILL 68
F P D + ++L+LGS P +S YY H +NRFW + L L ++T L
Sbjct: 8 FEPFFDENSKVLVLGSFPSIKSREENFYYQHSKNRFWRIFEILFECELKTVQEQKT-FLK 66
Query: 69 SAHIGLWDVAHSAIRKGSADIQICDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFP 128
+ HI LWDV S K S D I + NDI ++++ + I G+ A F K +P
Sbjct: 67 ANHIALWDVIASCKIKNSDDKTISYAKANDINIILDK-ANIEKICVLGKVASKYFTKFYP 125
Query: 129 ETLAIRCLLMPSTSPANAGKTLDLLVKDWN 158
E +PS+SPAN +L+ LV+ +N
Sbjct: 126 EQ---EFFELPSSSPANMNYSLENLVEKYN 152
>gi|145222830|ref|YP_001133508.1| hypothetical protein Mflv_2242 [Mycobacterium gilvum PYR-GCK]
gi|145215316|gb|ABP44720.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length = 155
Score = 75.1 bits (183), Expect = 1e-12, Method: Composition-based stats.
Identities = 52/146 (35%), Positives = 71/146 (48%), Gaps = 7/146 (4%)
Query: 19 ILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILLSAHIGLWDVA 78
ILILG++PG S+R +YY + RN FW + A L Y ER L A + +WDV
Sbjct: 7 ILILGNMPGVASLRAQRYYWYDRNAFWRITAALFDFDPASAYDERVAALTGAGVAVWDVL 66
Query: 79 HSAIRKGSADIQICDEE--PNDIRSLIERNPRLHTIAFNGRKAEAMFRKHF--PETLAIR 134
R GS D + + PND + E + L + FNG AE + + P+TL
Sbjct: 67 KHCRRVGSLDSAVEPDSMVPNDFVAFFETHRTLTHVYFNGAAAERNYARLVGGPDTLGYT 126
Query: 135 CLLMPSTSPANAGKTLDLLVKDWNRI 160
L PSTSPA + D + W +I
Sbjct: 127 RL--PSTSPAQT-MSFDAKLDAWRQI 149
>gi|121530107|ref|ZP_01662717.1| uracil-DNA glycosylase superfamily [Ralstonia pickettii 12J]
gi|121302922|gb|EAX43890.1| uracil-DNA glycosylase superfamily [Ralstonia pickettii 12J]
Length = 137
Score = 74.3 bits (181), Expect = 2e-12, Method: Composition-based stats.
Identities = 45/108 (41%), Positives = 62/108 (57%), Gaps = 2/108 (1%)
Query: 42 NRFWPLMAKLLGKPLPD-DYAERTEILLSAHIGLWDVAHSAIRKGSADIQICDEEPNDIR 100
N+FW L +++G+ LP +YA R ++LL+ IGLWDV A R+GS D +I D ND+
Sbjct: 12 NQFWRLAGEVIGQDLPGMEYAARLQVLLAHRIGLWDVVAEAKREGSLDSKIRDHAGNDLA 71
Query: 101 SLIERNPRLHTIAFNGRKAEAMFRKHFPETLAIRCLL-MPSTSPANAG 147
LI P L IAFNG A + K + +L +PS+SPA A
Sbjct: 72 GLIASLPELALIAFNGGTASRIGVKALGDIGDRHTILKLPSSSPAYAA 119
>gi|153878422|ref|ZP_02004581.1| conserved hypothetical protein [Beggiatoa sp. PS]
gi|152065114|gb|EDN65419.1| conserved hypothetical protein [Beggiatoa sp. PS]
Length = 131
Score = 62.8 bits (151), Expect = 7e-09, Method: Composition-based stats.
Identities = 41/120 (34%), Positives = 60/120 (50%), Gaps = 7/120 (5%)
Query: 48 MAKLLGKPLPDDYAERTEILLSAHIGLWDVAHSAIRKGSAD--IQICDEEPNDIRSLIER 105
M +L G DY ER +L I +WDV + IR+GS D IQ ND +S
Sbjct: 1 MERLFGIDSSLDYDERVILLQKKGISVWDVLQTCIRQGSLDSAIQESTMIANDFQSFYSN 60
Query: 106 NPRLHTIAFNGRKAEAMFRKHFPETLAIRCL-----LMPSTSPANAGKTLDLLVKDWNRI 160
+ + ++ FNG KAE +++KH +L+ + L +PSTSPA A + + W I
Sbjct: 61 HSNIKSVFFNGAKAEMVYKKHVIPSLSEKWLKLEYFRLPSTSPALASLNREEKLLKWQII 120
>gi|139438549|ref|ZP_01772065.1| Hypothetical protein COLAER_01063 [Collinsella aerofaciens ATCC
25986]
gi|133776088|gb|EBA39908.1| Hypothetical protein COLAER_01063 [Collinsella aerofaciens ATCC
25986]
Length = 134
Score = 58.5 bits (140), Expect = 1e-07, Method: Composition-based stats.
Identities = 37/126 (29%), Positives = 60/126 (47%), Gaps = 17/126 (13%)
Query: 48 MAKLLGKPLPDD----------------YAERTEILLSAHIGLWDVAHSAIRKGSADIQI 91
MA LG P+P + A + +LL+ + LWDV S KGS+D I
Sbjct: 1 MAACLGAPVPPNEGDSLASGEPATLDASIAAKRVMLLNGGVALWDVIESCDIKGSSDASI 60
Query: 92 CDEEPNDIRSLIERNPRLHTIAFNGRKAEAMFRKHFPETLAIRCLLMPSTSPANAGKTLD 151
+ P + + P + + NG A +++++ E + +++PSTSPANA L+
Sbjct: 61 KNVVPAHVERITGAAP-IGQVVCNGGTAGRLYKRYLQERTGLSAIVLPSTSPANAAWRLE 119
Query: 152 LLVKDW 157
LV+ W
Sbjct: 120 RLVERW 125
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.322 0.139 0.429
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 642,541,534
Number of Sequences: 5470121
Number of extensions: 25871661
Number of successful extensions: 49147
Number of sequences better than 1.0e-05: 113
Number of HSP's better than 0.0 without gapping: 94
Number of HSP's successfully gapped in prelim test: 19
Number of HSP's that attempted gapping in prelim test: 48890
Number of HSP's gapped (non-prelim): 113
length of query: 163
length of database: 1,894,087,724
effective HSP length: 122
effective length of query: 41
effective length of database: 1,226,732,962
effective search space: 50296051442
effective search space used: 50296051442
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 124 (52.4 bits)