BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SMu1431
(352 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|24379958|ref|NP_721913.1| hypothetical protein SMU.1574c... 671 0.0
gi|156864181|gb|EDO57612.1| hypothetical protein CLOL250_01... 377 e-103
gi|150004920|ref|YP_001299664.1| hypothetical protein BVU_2... 375 e-102
gi|67938412|ref|ZP_00530938.1| conserved hypothetical prote... 231 7e-59
gi|68550445|ref|ZP_00589894.1| conserved hypothetical prote... 222 3e-56
gi|83859753|ref|ZP_00953273.1| hypothetical protein OA2633_... 210 1e-52
gi|84702455|ref|ZP_01017030.1| hypothetical protein PB2503_... 203 1e-50
gi|149377018|ref|ZP_01894769.1| hypothetical protein MDG893... 194 9e-48
gi|154252790|ref|YP_001413614.1| hypothetical protein Plav_... 191 7e-47
gi|75677300|ref|YP_319721.1| hypothetical protein Nwi_3122 ... 189 3e-46
gi|156864460|gb|EDO57891.1| hypothetical protein CLOL250_01... 188 4e-46
gi|75675594|ref|YP_318015.1| hypothetical protein Nwi_1402 ... 164 8e-39
gi|33602014|ref|NP_889574.1| hypothetical protein BB3038 [B... 152 3e-35
gi|33592772|ref|NP_880416.1| hypothetical protein BP1699 [B... 151 7e-35
gi|33597611|ref|NP_885254.1| hypothetical protein BPP3075 [... 150 1e-34
gi|115423073|emb|CAJ49604.1| conserved hypothetical protein... 148 6e-34
gi|77920059|ref|YP_357874.1| hypothetical protein Pcar_2465... 144 7e-33
gi|114775543|ref|ZP_01451111.1| hypothetical protein SPV1_0... 142 3e-32
gi|153891169|ref|ZP_02012204.1| conserved hypothetical prot... 120 1e-25
gi|91221717|ref|ZP_01257405.1| hypothetical protein P700755... 118 5e-25
gi|121528230|ref|ZP_01660844.1| conserved hypothetical prot... 116 2e-24
gi|153888605|ref|ZP_02009746.1| conserved hypothetical prot... 112 3e-23
gi|83746293|ref|ZP_00943346.1| Hypothetical protein RRSL_03... 110 1e-22
gi|126661496|ref|ZP_01732548.1| hypothetical protein CY0110... 105 3e-21
gi|116669547|ref|YP_830480.1| hypothetical protein Arth_098... 105 6e-21
gi|119026383|ref|YP_910228.1| hypothetical protein BAD_1365... 103 2e-20
gi|17546785|ref|NP_520187.1| hypothetical protein RSc2066 [... 100 2e-19
gi|152993883|ref|YP_001359604.1| hypothetical protein SUN_2... 98 6e-19
gi|51598270|ref|YP_072458.1| hypothetical protein BG0007 [B... 78 8e-13
gi|111114829|ref|YP_709447.1| hypothetical protein BAPKO_00... 77 1e-12
gi|15594353|ref|NP_212141.1| hypothetical protein BB0007 [B... 74 1e-11
gi|114704592|ref|ZP_01437500.1| hypothetical protein FP2506... 72 4e-11
>gi|24379958|ref|NP_721913.1| hypothetical protein SMU.1574c [Streptococcus mutans UA159]
gi|24377941|gb|AAN59219.1|AE014988_8 conserved hypothetical protein [Streptococcus mutans UA159]
Length = 352
Score = 671 bits (1731), Expect = 0.0, Method: Composition-based stats.
Identities = 352/352 (100%), Positives = 352/352 (100%)
Query: 1 MIGDFWKESNGLATANDMDKNLAYMYTMKKKARGQLLFTKEKLAEYGSKVGLFPGVKDWF 60
MIGDFWKESNGLATANDMDKNLAYMYTMKKKARGQLLFTKEKLAEYGSKVGLFPGVKDWF
Sbjct: 1 MIGDFWKESNGLATANDMDKNLAYMYTMKKKARGQLLFTKEKLAEYGSKVGLFPGVKDWF 60
Query: 61 RRIRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVN 120
RRIRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVN
Sbjct: 61 RRIRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVN 120
Query: 121 YTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGG 180
YTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGG
Sbjct: 121 YTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGG 180
Query: 181 YSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDRTVFNEQL 240
YSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDRTVFNEQL
Sbjct: 181 YSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDRTVFNEQL 240
Query: 241 ERKHYEYKNEALKQSKQKSEEEQEKIDLIDALESSGNFKNTHNIIRKLSKYENWQDDEII 300
ERKHYEYKNEALKQSKQKSEEEQEKIDLIDALESSGNFKNTHNIIRKLSKYENWQDDEII
Sbjct: 241 ERKHYEYKNEALKQSKQKSEEEQEKIDLIDALESSGNFKNTHNIIRKLSKYENWQDDEII 300
Query: 301 DLLSIGFHNSQVRYILGDQDIKVFYKKILEKAPSIDENAAKVAAIIEASEEE 352
DLLSIGFHNSQVRYILGDQDIKVFYKKILEKAPSIDENAAKVAAIIEASEEE
Sbjct: 301 DLLSIGFHNSQVRYILGDQDIKVFYKKILEKAPSIDENAAKVAAIIEASEEE 352
>gi|156864181|gb|EDO57612.1| hypothetical protein CLOL250_01702 [Clostridium sp. L2-50]
Length = 376
Score = 377 bits (969), Expect = e-103, Method: Composition-based stats.
Identities = 202/345 (58%), Positives = 260/345 (75%), Gaps = 6/345 (1%)
Query: 2 IGDFWKESNGLATANDMDKNLAYMYTMKKKARGQLLFTKEKLAEYGSKVGLFPGVKDWFR 61
+ DFWKESN LA+ NDMD+NLAYMY M+ K+RG++LFTKE L + G KV LFPGV WF
Sbjct: 35 VADFWKESNKLASDNDMDQNLAYMYMMRDKSRGKVLFTKETLRQDGGKVRLFPGVSTWFD 94
Query: 62 RIRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNY 121
RI +YG + VI+EHYIISSGLKEMIEGT +AK+FK+IYA+SFY++D G AVWPAQVVNY
Sbjct: 95 RINEYGKSKGVIVEHYIISSGLKEMIEGTEVAKEFKKIYASSFYYNDAGEAVWPAQVVNY 154
Query: 122 TNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGY 181
TNKTQFLFRI KGVL+VND+ VN F P++ RVPF NMIYIGDSDTDIPCMKLVN +GG+
Sbjct: 155 TNKTQFLFRIEKGVLDVNDQEVNSFFEPNQYRVPFRNMIYIGDSDTDIPCMKLVNINGGH 214
Query: 182 SIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDRTVFNEQLE 241
SIGV++ ++ K K +V++M+ +NRI YF PADY E L++LVK IIDRT+ NE LE
Sbjct: 215 SIGVYD----SDSKDKSKVFRMLDENRIKYFAPADYEEDSTLERLVKKIIDRTISNEILE 270
Query: 242 RKHYEYKNEALKQSKQKSEEEQEKIDLIDALESSGNFKNTHNIIRKLSKYENWQDDEIID 301
H++ +E + ++K +SEEE++K +LID LE S +F NTH +I +LSK ++W +
Sbjct: 271 EIHFDCVSEKISETKGQSEEERKKEELIDKLEDSESFANTHTVIGQLSKIKDWSVKQKNK 330
Query: 302 LLSIGFHNSQVRYILGDQDIKVFYKKILEKAPSIDENAAKVAAII 346
L I N+QV YIL D+D+K FY I + +E+A K+ II
Sbjct: 331 LYKIALENTQVMYILKDKDVKKFYSMICKD--DNNEDAVKIKEII 373
>gi|150004920|ref|YP_001299664.1| hypothetical protein BVU_2383 [Bacteroides vulgatus ATCC 8482]
gi|149933344|gb|ABR40042.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 368
Score = 375 bits (962), Expect = e-102, Method: Composition-based stats.
Identities = 194/328 (59%), Positives = 250/328 (76%), Gaps = 5/328 (1%)
Query: 2 IGDFWKESNGLATANDMDKNLAYMYTMKKKARGQLLFTKEKLAEYGSKVGLFPGVKDWFR 61
+ FWKESNGLA NDMD+NLAYM+TM +KA G+++F K+ L +YG+KV LFPGV+ WF+
Sbjct: 44 VESFWKESNGLAEENDMDQNLAYMFTMIQKAHGKVIFNKKALMDYGAKVQLFPGVETWFK 103
Query: 62 RIRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNY 121
RIR YG +R VI+EHYIISSGLKEMIEGT +A +F++IYA+SFY+D DGVA WPAQV+NY
Sbjct: 104 RIRDYGMERGVIVEHYIISSGLKEMIEGTKVANEFEKIYASSFYYDKDGVAQWPAQVINY 163
Query: 122 TNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGY 181
T+KTQFLFRI KG L+VND VND F P++IR+PF NM+YIGDSDTDIPCMKL+NS+ G+
Sbjct: 164 TSKTQFLFRIEKGTLDVNDSGVNDYFKPEDIRIPFRNMVYIGDSDTDIPCMKLINSYSGH 223
Query: 182 SIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDRTVFNEQLE 241
SIGV+NPK ++ K++VYKM+ D RI Y+TPADY+EG ELD+LVK IID T NE+L
Sbjct: 224 SIGVYNPKTKD----KRKVYKMMEDKRIKYYTPADYTEGSELDKLVKTIIDTTASNEKLM 279
Query: 242 RKHYEYKNEALKQSKQ-KSEEEQEKIDLIDALESSGNFKNTHNIIRKLSKYENWQDDEII 300
HY K E + + Q ++E++EK LI LE+S +FK TH+II +L K +NW +E
Sbjct: 280 AVHYINKQEQVSHNGQIDNKEDKEKEKLIMDLENSNSFKQTHSIISELKKIKNWTLEEKK 339
Query: 301 DLLSIGFHNSQVRYILGDQDIKVFYKKI 328
L I N Q+ YI+ D D+ FY +
Sbjct: 340 QLKIIAEKNKQISYIMKDGDVASFYSSL 367
>gi|67938412|ref|ZP_00530938.1| conserved hypothetical protein [Chlorobium phaeobacteroides BS1]
gi|67915389|gb|EAM64711.1| conserved hypothetical protein [Chlorobium phaeobacteroides BS1]
Length = 277
Score = 231 bits (588), Expect = 7e-59, Method: Composition-based stats.
Identities = 114/231 (49%), Positives = 159/231 (68%), Gaps = 5/231 (2%)
Query: 5 FWKESNGLATANDMDKNLAYMYTMKKKAR-GQLLFTKEKLAEYGSKVGLFPGVKDWFRRI 63
FW E A + D+ LAYM+ M KKA ++ K +YG ++ LFPGV +WF+RI
Sbjct: 39 FWDEVFSHAREQNADQILAYMHLMLKKAESAEVQVRKSDFHKYGEQIKLFPGVAEWFQRI 98
Query: 64 RQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNYTN 123
+YG ++ + +EHYI+SSGL+EM+EGTSIAK+F+ IYA+ F +D GVA WPA +NYT
Sbjct: 99 NEYGREKNIRVEHYIVSSGLREMVEGTSIAKEFQAIYASGFMYDHHGVACWPALAINYTT 158
Query: 124 KTQFLFRISKGVLNVNDEAVNDSFAPDEIR-VPFHNMIYIGDSDTDIPCMKLVNSHGGYS 182
KTQ+LFRI+KG L+V D +V + + P E R VPF NMI+IGD +TDIPCM+LV + GG+S
Sbjct: 159 KTQYLFRINKGSLDVYDNSVINRYVPHEERPVPFENMIFIGDGETDIPCMRLVKNQGGHS 218
Query: 183 IGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDR 233
I V+N ++ AKK +++ D R+ PADY EG+ +D VK +ID+
Sbjct: 219 ISVYNSRKNG---AKKTAEQLLNDQRVTLIAPADYREGKTIDLAVKAMIDK 266
>gi|68550445|ref|ZP_00589894.1| conserved hypothetical protein [Pelodictyon phaeoclathratiforme
BU-1]
gi|68242696|gb|EAN24908.1| conserved hypothetical protein [Pelodictyon phaeoclathratiforme
BU-1]
Length = 277
Score = 222 bits (566), Expect = 3e-56, Method: Composition-based stats.
Identities = 113/231 (48%), Positives = 155/231 (67%), Gaps = 5/231 (2%)
Query: 5 FWKESNGLATANDMDKNLAYMYTMKKKAR-GQLLFTKEKLAEYGSKVGLFPGVKDWFRRI 63
FW E + A + D+ LAYMY M KKA ++ K +YG ++ LF GV +WF+RI
Sbjct: 39 FWDEVSCYAREQNADQILAYMYLMLKKAESAEVQVRKSDFHKYGEQIQLFLGVAEWFQRI 98
Query: 64 RQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNYTN 123
+YG + + +EHYI+SSGL+EM+EGTSI K+FK IYA+ F +D GVA WPA +NYT
Sbjct: 99 NEYGRAKNIRVEHYIVSSGLREMVEGTSIVKEFKAIYASGFMYDHHGVARWPALAINYTT 158
Query: 124 KTQFLFRISKGVLNVNDEAVNDSFA-PDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGYS 182
KTQ+LFRI+KG L V D +V + + P+E VPF NMI+IGD +TDIPCM+LV GG+S
Sbjct: 159 KTQYLFRINKGSLEVYDNSVINRYVLPEERPVPFENMIFIGDGETDIPCMRLVKEQGGHS 218
Query: 183 IGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDR 233
I V+N N+ AKK +++ D R+ PADY EG+ +D VK ++D+
Sbjct: 219 ISVYN---SNKNGAKKAAEQLLFDKRVTLIAPADYREGKIIDLAVKAMVDK 266
>gi|83859753|ref|ZP_00953273.1| hypothetical protein OA2633_07129 [Oceanicaulis alexandrii
HTCC2633]
gi|83852112|gb|EAP89966.1| hypothetical protein OA2633_07129 [Oceanicaulis alexandrii
HTCC2633]
Length = 278
Score = 210 bits (534), Expect = 1e-52, Method: Composition-based stats.
Identities = 111/243 (45%), Positives = 151/243 (62%), Gaps = 6/243 (2%)
Query: 2 IGDFWKESNGLATANDMDKNLAYMYTMKKKA-RGQLLFTKEKLAEYGSKVGLFPGVKD-W 59
IG FW NG A D L YM+ M K A R + F +E + +G V FPGV + W
Sbjct: 35 IGAFWGRVNGEAARLGADNILIYMHEMVKAAQREDVRFRREDIERHGQSVSFFPGVAEGW 94
Query: 60 FRRIRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVV 119
F+R+R YG R V ++HYIISSGLKEMI +++ K+F I+A+ F ++ D V VW A V
Sbjct: 95 FQRLRDYGEARGVRVQHYIISSGLKEMIAASAVGKEFDAIFASEFKYNADDVPVWAASAV 154
Query: 120 NYTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIR-VPFHNMIYIGDSDTDIPCMKLVNSH 178
NYTNKTQFLFRI+KG L+++D A +++ P++ R VPF NMIY+GD DTD+PCM+ V
Sbjct: 155 NYTNKTQFLFRINKGALDLSDHAEVNAYVPEDDRPVPFRNMIYVGDGDTDVPCMRTVKEQ 214
Query: 179 GGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDRTVFNE 238
GG SI V + K ++ K+ D R+ + ADYS+G LD VK ID+ E
Sbjct: 215 GGVSIAV---HPAGDAKGAEKTAKLKADRRVHFTADADYSDGAALDVYVKAAIDKMAAVE 271
Query: 239 QLE 241
+L+
Sbjct: 272 RLK 274
>gi|84702455|ref|ZP_01017030.1| hypothetical protein PB2503_05817 [Parvularcula bermudensis
HTCC2503]
gi|84691701|gb|EAQ17541.1| hypothetical protein PB2503_05817 [Parvularcula bermudensis
HTCC2503]
Length = 278
Score = 203 bits (517), Expect = 1e-50, Method: Composition-based stats.
Identities = 104/234 (44%), Positives = 155/234 (66%), Gaps = 6/234 (2%)
Query: 4 DFWKESNGLATANDMDKNLAYMYTMKKKA-RGQLLFTKEKLAEYGSKVGLFPGVKD--WF 60
+FWKE L +D D+ L YM M ++A R L T++KL +G LF G+ D WF
Sbjct: 37 EFWKEVKRLTREHDADEILVYMQLMLREAQRKGLRVTRDKLKSHGGSSELFDGLADHSWF 96
Query: 61 RRIRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVN 120
RI + ++R + +EHYI+SSG +EMIEG+ IA DFK I+A+ + ++++G A WP+ +N
Sbjct: 97 ERINAFASERGLQVEHYIVSSGTQEMIEGSPIAGDFKGIFASRYIYNENGEAEWPSLAIN 156
Query: 121 YTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIR-VPFHNMIYIGDSDTDIPCMKLVNSHG 179
YT KTQFLFRI+KG+ +V D ++F P+ R +PF MI+IGD DTDIP MK+ + +G
Sbjct: 157 YTTKTQFLFRINKGIDSVWDNDAINAFMPEAERPIPFSRMIFIGDGDTDIPAMKMTSHYG 216
Query: 180 GYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDR 233
G SI ++PK + +A ++++++I D R+ + PADYSE LD L+K I+ R
Sbjct: 217 GQSIAAYDPKR--DSRALEKIHRLISDGRVNFVAPADYSENAHLDILLKGILGR 268
>gi|149377018|ref|ZP_01894769.1| hypothetical protein MDG893_08681 [Marinobacter algicola DG893]
gi|149358676|gb|EDM47147.1| hypothetical protein MDG893_08681 [Marinobacter algicola DG893]
Length = 276
Score = 194 bits (492), Expect = 9e-48, Method: Composition-based stats.
Identities = 99/233 (42%), Positives = 144/233 (61%), Gaps = 5/233 (2%)
Query: 4 DFWKESNGLATANDMDKNLAYMYTMKKKARG---QLLFTKEKLAEYGSKVGLFPGVKDWF 60
DFW E N D D+ L Y+ + K+AR Q EKL YG + LFPGV DWF
Sbjct: 34 DFWPEVNRKNRERDGDEILTYLGELAKRARDEGKQDELKPEKLQAYGKSIPLFPGVLDWF 93
Query: 61 RRIRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDD-DGVAVWPAQVV 119
I ++ +D+ + + HYI+SSGL+EMI GT +AK FK+I+ +++D+ G A WPA +
Sbjct: 94 DAINRFASDQGIALSHYIVSSGLEEMIRGTPVAKHFKKIFGCRYHYDEATGHAKWPAVAI 153
Query: 120 NYTNKTQFLFRISKGVLNVNDE-AVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSH 178
+YT KTQ+LFRI+KG+ N D +N+ P + VPF +MIY GD DTDIP MK+V +
Sbjct: 154 DYTTKTQYLFRINKGIENSWDNVTINEYIEPGDRPVPFDHMIYFGDGDTDIPAMKMVKAQ 213
Query: 179 GGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLII 231
GG S+ VF+ + + K ++++ K+I + R Y DY+ G +LD V+ I+
Sbjct: 214 GGCSLAVFDGDKWGQGKTQEKIEKLISEERANYVVQGDYTSGSQLDVTVRGIL 266
>gi|154252790|ref|YP_001413614.1| hypothetical protein Plav_2345 [Parvibaculum lavamentivorans DS-1]
gi|154156740|gb|ABS63957.1| conserved hypothetical protein [Parvibaculum lavamentivorans DS-1]
Length = 282
Score = 191 bits (485), Expect = 7e-47, Method: Composition-based stats.
Identities = 97/231 (41%), Positives = 135/231 (58%), Gaps = 5/231 (2%)
Query: 4 DFWKESNGLATANDMDKNLAYMYTM-KKKARGQLLFTKEKLAEYGSKVGLFPGVKDWFRR 62
DFW E L + D+ L YM M +K A + ++ G + LF GV+DWF R
Sbjct: 38 DFWAEVKRLTKQHQADEVLVYMNLMLRKAAAAGVPVRRDDFKARGKAIQLFEGVEDWFDR 97
Query: 63 IRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNYT 122
I YG + V + HY++SSG E+ GT IA F ++YA+ F FD +GVA WPA VNYT
Sbjct: 98 ITGYGKAQGVRVTHYLVSSGNAEIFAGTPIASRFAQVYASKFMFDQNGVAAWPALAVNYT 157
Query: 123 NKTQFLFRISKGVLNVNDEAVNDSFAPDEIR-VPFHNMIYIGDSDTDIPCMKLVNSHGGY 181
KTQ+LFRI+KG +++D + + F R VPF NM++IGD TDIPC +LV GG
Sbjct: 158 TKTQYLFRINKGAFDLSDNSKVNQFVEKRDRPVPFENMVFIGDGSTDIPCFRLVKEQGGL 217
Query: 182 SIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIID 232
S+ VF P + A+ + I+D R+ PA Y++G ELD ++K I+
Sbjct: 218 SVAVFKPHTKG---ARGKADNYIKDGRVHCAVPAIYTDGSELDHVIKASIN 265
>gi|75677300|ref|YP_319721.1| hypothetical protein Nwi_3122 [Nitrobacter winogradskyi Nb-255]
gi|74422170|gb|ABA06369.1| hypothetical protein Nwi_3122 [Nitrobacter winogradskyi Nb-255]
Length = 284
Score = 189 bits (479), Expect = 3e-46, Method: Composition-based stats.
Identities = 94/226 (41%), Positives = 134/226 (59%), Gaps = 5/226 (2%)
Query: 5 FWKESNGLATANDMDKNLAYMYTM-KKKARGQLLFTKEKLAEYGSKVGLFPGVKDWFRRI 63
FW E L + D+ L YM M +K A + ++ G + LF GV+DWF RI
Sbjct: 41 FWAEVKRLTKEHQADEVLVYMNLMLRKAAAANVPVRRDDFKARGKAIQLFEGVEDWFDRI 100
Query: 64 RQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNYTN 123
YG + V +EHY++SSG E+ GT I F ++YA+ F FD +GVA WPA VNYT
Sbjct: 101 TGYGKAQGVRVEHYLVSSGNAEIFAGTPIVSKFAQVYASKFMFDQNGVAAWPALAVNYTT 160
Query: 124 KTQFLFRISKGVLNVNDEAVNDSFAPDEIR-VPFHNMIYIGDSDTDIPCMKLVNSHGGYS 182
KTQ+LFRI+KG +++D + + F R VPF N+++IGD TDIPC +LV GG S
Sbjct: 161 KTQYLFRINKGAFDLSDNSKVNQFVEKRDRPVPFENIVFIGDGSTDIPCFRLVKEQGGLS 220
Query: 183 IGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVK 228
+ VF P + A+ + I+D+R+ PA Y++G ELD+++K
Sbjct: 221 VAVFKPHTKG---ARGKADSYIKDDRVHCVAPAIYTDGSELDRIIK 263
>gi|156864460|gb|EDO57891.1| hypothetical protein CLOL250_01431 [Clostridium sp. L2-50]
Length = 162
Score = 188 bits (478), Expect = 4e-46, Method: Composition-based stats.
Identities = 91/128 (71%), Positives = 108/128 (84%)
Query: 2 IGDFWKESNGLATANDMDKNLAYMYTMKKKARGQLLFTKEKLAEYGSKVGLFPGVKDWFR 61
+ DFWKESN LA+ NDMD+NLAYMY M+ K+RG++LFTKE L + G KV LFPGV WF
Sbjct: 35 VADFWKESNKLASDNDMDQNLAYMYMMRDKSRGKVLFTKETLRQDGGKVRLFPGVSTWFD 94
Query: 62 RIRQYGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNY 121
RI +YG + VI+EHYIISSGLKEMIEGT +AK+FK+IYA+SFY++D G AVWPAQVVNY
Sbjct: 95 RINEYGKSKGVIVEHYIISSGLKEMIEGTEVAKEFKKIYASSFYYNDAGEAVWPAQVVNY 154
Query: 122 TNKTQFLF 129
TNKTQFLF
Sbjct: 155 TNKTQFLF 162
>gi|75675594|ref|YP_318015.1| hypothetical protein Nwi_1402 [Nitrobacter winogradskyi Nb-255]
gi|74420464|gb|ABA04663.1| conserved hypothetical protein [Nitrobacter winogradskyi Nb-255]
Length = 307
Score = 164 bits (415), Expect = 8e-39, Method: Composition-based stats.
Identities = 90/254 (35%), Positives = 151/254 (59%), Gaps = 10/254 (3%)
Query: 5 FWKESNGLATANDMDKNLAYMYTMKKK-ARGQLLFTKEKLAEYGSKVGLFPGVKDWFRRI 63
FW E + N+ + L YM M ++ +G+L +E+L GS + FPGV+ WF R+
Sbjct: 58 FWSEVKRVTKENEASEVLTYMRMMAERIVQGKLAINRERLGALGSHIEYFPGVETWFDRM 117
Query: 64 RQYGADRE---VIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVN 120
+ ++ V + HYI+SSGL+E+++GTSIAK F+ I+A+ +++D G V+ +V+
Sbjct: 118 NGFVRNQTLGYVRLNHYIVSSGLREILQGTSIAKHFRSIFASQYHYDSFGRPVFVDRVIT 177
Query: 121 YTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGG 180
+KTQ++FRI+KGV N+ E+VND A DE +PF NMIY+GD DTD+P M + +GG
Sbjct: 178 DVSKTQYIFRINKGVENLA-ESVNDHMAEDERPIPFSNMIYLGDGDTDVPSMAVTRKNGG 236
Query: 181 YSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDRTVFNEQL 240
++ V++ R E+ K + + + NR+ + A+YS L+ VK ++ + + +
Sbjct: 237 HAFAVYS---RGEDPQKCEI--LFKANRVDAYFEANYSPNSRLEIFVKNLLKKMIAEIRY 291
Query: 241 ERKHYEYKNEALKQ 254
+ + +K E Q
Sbjct: 292 KAMLHLFKEERSTQ 305
>gi|33602014|ref|NP_889574.1| hypothetical protein BB3038 [Bordetella bronchiseptica RB50]
gi|33576452|emb|CAE33530.1| conserved hypothetical protein [Bordetella bronchiseptica RB50]
Length = 282
Score = 152 bits (384), Expect = 3e-35, Method: Composition-based stats.
Identities = 83/230 (36%), Positives = 134/230 (58%), Gaps = 12/230 (5%)
Query: 5 FWKES-NGLATANDMDKNLAYMYTMKKKAR--GQLLFTKEKLAEYGSKVGLFPGVKDWFR 61
FWK+ + L + D D AY+Y M + +R L T+E+L E+G+++ L GV+ F
Sbjct: 34 FWKDQVDPLLSQQDWDPVPAYLYQMIQLSRQGSHGLITRERLREWGARLALHDGVQTLFG 93
Query: 62 RIRQ--YGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVV 119
R+R +V +E Y+ISSG+ +++ T IA +F EI+A+ F + +DG +P ++V
Sbjct: 94 RLRAAVRAEHPKVQLEFYLISSGIGDVVRATPIAHEFTEIWASEFVYGEDGGISFPRRIV 153
Query: 120 NYTNKTQFLFRISKGVLNVNDE----AVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLV 175
++T+KT++LF I KG++ VN D +RVPF M+++GD TDIPC L+
Sbjct: 154 SFTDKTRYLFHIQKGLIGREYRNKPFEVNRKVPEDRLRVPFDQMVFVGDGYTDIPCFSLI 213
Query: 176 NSHGGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQ 225
S GG++ GV++PK R++ + R + I + R+ A Y E EL Q
Sbjct: 214 RSAGGFAFGVWDPKHRDK---RSRAWGFIEEGRVSNLNQARYDEQAELYQ 260
>gi|33592772|ref|NP_880416.1| hypothetical protein BP1699 [Bordetella pertussis Tohama I]
gi|33572420|emb|CAE41986.1| conserved hypothetical protein [Bordetella pertussis Tohama I]
Length = 282
Score = 151 bits (381), Expect = 7e-35, Method: Composition-based stats.
Identities = 83/230 (36%), Positives = 133/230 (57%), Gaps = 12/230 (5%)
Query: 5 FWKES-NGLATANDMDKNLAYMYTMKKKAR--GQLLFTKEKLAEYGSKVGLFPGVKDWFR 61
FWK+ + L + D D AY+Y M + +R L T+E+L E+G ++ L GV+ F
Sbjct: 34 FWKDQVDPLLSQQDWDPVPAYLYQMIQLSRQGSHGLITRERLREWGVRLALHDGVQTLFG 93
Query: 62 RIRQ--YGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVV 119
R+R +V +E Y+ISSG+ +++ T IA +F EI+A+ F + +DG +P ++V
Sbjct: 94 RLRAAVRAEHPKVQLEFYLISSGIGDVVRATPIAHEFTEIWASEFVYGEDGGISFPRRIV 153
Query: 120 NYTNKTQFLFRISKGVLNVNDE----AVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLV 175
++T+KT++LF I KG++ VN D +RVPF M+++GD TDIPC L+
Sbjct: 154 SFTDKTRYLFHIQKGLIGREYRNKPFEVNRKVPEDRLRVPFDQMVFVGDGYTDIPCFSLI 213
Query: 176 NSHGGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQ 225
S GG++ GV++PK R++ + R + I + R+ A Y E EL Q
Sbjct: 214 RSAGGFAFGVWDPKHRDK---RSRAWGFIEEGRVSNLNQARYDEQAELYQ 260
>gi|33597611|ref|NP_885254.1| hypothetical protein BPP3075 [Bordetella parapertussis 12822]
gi|33574039|emb|CAE38362.1| conserved hypothetical protein [Bordetella parapertussis]
Length = 282
Score = 150 bits (379), Expect = 1e-34, Method: Composition-based stats.
Identities = 82/230 (35%), Positives = 133/230 (57%), Gaps = 12/230 (5%)
Query: 5 FWKES-NGLATANDMDKNLAYMYTMKKKAR--GQLLFTKEKLAEYGSKVGLFPGVKDWFR 61
FWK+ + L + D D AY+Y M + +R L T+E+L E+G+++ L GV+ F
Sbjct: 34 FWKDQVDPLLSQQDWDPVPAYLYQMIQLSRQGSHGLITRERLREWGARLALHDGVQTLFG 93
Query: 62 RIRQ--YGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVV 119
R+R +V +E Y+ISSG+ +++ T IA +F E +A+ F + +DG +P ++V
Sbjct: 94 RLRAAVRAEHPKVQLEFYLISSGIGDVVRATPIAHEFTETWASEFVYGEDGGISFPRRIV 153
Query: 120 NYTNKTQFLFRISKGVLNVNDE----AVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLV 175
++T+KT++LF I KG++ VN D +RVPF M+++GD TDIPC L+
Sbjct: 154 SFTDKTRYLFHIQKGLIGREYRNKPFEVNRKVPEDRLRVPFDQMVFVGDGYTDIPCFSLI 213
Query: 176 NSHGGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQ 225
S GG++ GV++PK R++ + R + I + R+ A Y E EL Q
Sbjct: 214 RSAGGFAFGVWDPKHRDK---RSRAWGFIEEGRVSNLNQARYDEQAELYQ 260
>gi|115423073|emb|CAJ49604.1| conserved hypothetical protein [Bordetella avium 197N]
Length = 282
Score = 148 bits (373), Expect = 6e-34, Method: Composition-based stats.
Identities = 80/235 (34%), Positives = 134/235 (57%), Gaps = 16/235 (6%)
Query: 5 FWKES-NGLATANDMDKNLAYMYTM----KKKARGQLLFTKEKLAEYGSKVGLFPGVKDW 59
FWK++ + L + D D AY+Y M ++ G + T+E+L ++G+++ L GV
Sbjct: 34 FWKDAVDPLLSQGDWDPVPAYLYQMIALSRRGTHGAI--TRERLQQWGARLPLHKGVTTL 91
Query: 60 FRRIRQ--YGADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQ 117
F R+R A + +E Y+ISSG+ +++ T IA F +I+A+ F +D+ G +P +
Sbjct: 92 FDRLRDAVRAAHPRIQLEFYLISSGIGDIVRATPIAAAFTDIWASEFIYDEMGGICFPRR 151
Query: 118 VVNYTNKTQFLFRISKGVLNVNDE----AVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMK 173
+V++T+KT++LF I KG++ VN D +RVPF M+++GD TDIPC
Sbjct: 152 IVSFTDKTRYLFHIQKGLVGPEFRNKPFEVNRKVPGDRLRVPFDQMVFVGDGYTDIPCFS 211
Query: 174 LVNSHGGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVK 228
L+ GGY+ GV++P R++ + R + + D R+ A Y E EL QL++
Sbjct: 212 LIRRAGGYAFGVWDPNHRDK---RSRAWGFVEDGRVSNLNFARYDEEAELYQLLE 263
>gi|77920059|ref|YP_357874.1| hypothetical protein Pcar_2465 [Pelobacter carbinolicus DSM 2380]
gi|77546142|gb|ABA89704.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380]
Length = 279
Score = 144 bits (363), Expect = 7e-33, Method: Composition-based stats.
Identities = 74/230 (32%), Positives = 128/230 (55%), Gaps = 9/230 (3%)
Query: 23 AYMYTMKKKARGQLLFTKEKLAEYGSKVGLFPGVKDWFRRIRQYGA--DREVIIEHYIIS 80
AY+Y M + ++ +G ++ F G F R+R++ + +V++E Y+IS
Sbjct: 52 AYLYEMIVLSDSGSPIRRDDFVRWGKRIKPFTGATRIFDRVRRHAESINEKVVVEFYLIS 111
Query: 81 SGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNYTNKTQFLFRISKGVLNVND 140
SG+ +++ T +A+ F +I+A F++ DDG V+P +V++T+KT+FLF+I+KG+
Sbjct: 112 SGIGDILRHTRLARQFSDIWACDFHYGDDGGIVYPKNIVSFTDKTRFLFQIAKGITGPEC 171
Query: 141 E----AVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKERNEEKA 196
AVN + ++RVPF MI +GD TDIPC LV GGY++GVF +R+ +
Sbjct: 172 RREPFAVNRKISQRQLRVPFDQMIVVGDGLTDIPCFSLVRRSGGYALGVF---DRDNKAK 228
Query: 197 KKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIIDRTVFNEQLERKHYE 246
R + I D R+ PAD+S+ L + + ++ +L + Y+
Sbjct: 229 WGRAWGFIEDGRVSNLVPADFSKNSALSLSLMMAVENLARKLKLRAQTYQ 278
>gi|114775543|ref|ZP_01451111.1| hypothetical protein SPV1_04423 [Mariprofundus ferrooxydans PV-1]
gi|114553654|gb|EAU56035.1| hypothetical protein SPV1_04423 [Mariprofundus ferrooxydans PV-1]
Length = 281
Score = 142 bits (358), Expect = 3e-32, Method: Composition-based stats.
Identities = 80/238 (33%), Positives = 134/238 (56%), Gaps = 12/238 (5%)
Query: 2 IGDFWKESNGLATANDMDKNLAYMYTMKKKARGQLL--FTKEKLAEYGSKVGLFPGVKDW 59
I FWKE G+ A D AY++ M +R + T + L +G + LF GV+
Sbjct: 32 IEGFWKEVGGM-MAEGWDPVPAYLHHMIHASRSGRIKPMTCDALMAWGKTLPLFEGVEQV 90
Query: 60 FRRIRQYGADR--EVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQ 117
F ++R AD V +E Y+ISSG+ +++ SIA +F +I+A+ F++D+ G AV P +
Sbjct: 91 FSQLRDVVADANPRVSLEFYLISSGIGDVLRQMSIADEFTDIWASEFHYDEQGHAVAPKR 150
Query: 118 VVNYTNKTQFLFRISKGVLNVNDE----AVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMK 173
++++T+KT++LF+I KGV+ AVN D++R+P + MI++GD TDIPC
Sbjct: 151 IISFTDKTRYLFQIQKGVIGPASRAKPFAVNMKVPSDQLRIPLNQMIFVGDGYTDIPCFS 210
Query: 174 LVNSHGGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLII 231
L+ GG I V++ +R+ EK ++ + D R+ A+Y G +L + + +
Sbjct: 211 LIKKEGGIPIAVYD--QRHVEKWGN-AFQFVADGRVSNLHSANYQAGSDLTNFLSMAV 265
>gi|153891169|ref|ZP_02012204.1| conserved hypothetical protein [Opitutaceae bacterium TAV2]
gi|151582772|gb|EDN46290.1| conserved hypothetical protein [Opitutaceae bacterium TAV2]
Length = 344
Score = 120 bits (301), Expect = 1e-25, Method: Composition-based stats.
Identities = 76/255 (29%), Positives = 128/255 (50%), Gaps = 33/255 (12%)
Query: 5 FWKESNGLAT-----ANDMDKNLAYM-YTMKKKARGQLL-FTKEKLAEYGSKVGLFPGVK 57
FW E+N L + +AY+ + + G + L E G ++ +PG+
Sbjct: 41 FWAETNSLVAHYLKRGYHLSGEIAYLNHLLTSVLTGTMPGLNNRVLRECGGELKFYPGIP 100
Query: 58 DWFRRIRQYGADR------EVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFY------ 105
D+F R R + ++R ++ +EHY++S+GL EMI G ++A ++ F
Sbjct: 101 DFFARSRAWVSERPEYEKHDIQLEHYVVSTGLAEMIRGCAVADHIDGVWGCEFIENPLRP 160
Query: 106 -------FDD----DGVAVWPAQVVNYTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRV 154
F++ + + V++ T KT+ LF I+KG VN + +P++ R+
Sbjct: 161 GFLQQAEFEEFNAAEALIAQIGMVIDNTTKTRALFEINKGTNKNPAIDVNANISPEDRRI 220
Query: 155 PFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTP 214
PF NMIYI D +DIP +V GG + V+NP+ R +E A+ K+ D RI ++ P
Sbjct: 221 PFQNMIYIADGPSDIPSFSVVKKGGGRAYAVYNPR-RTDEFAQND--KLRADGRIDHYGP 277
Query: 215 ADYSEGQELDQLVKL 229
ADY+EG +Q ++L
Sbjct: 278 ADYTEGSSTEQWLRL 292
>gi|91221717|ref|ZP_01257405.1| hypothetical protein P700755_33885 [Psychroflexus torquis ATCC
700755]
gi|91180475|gb|EAS67787.1| hypothetical protein P700755_33885 [Psychroflexus torquis ATCC
700755]
Length = 168
Score = 118 bits (296), Expect = 5e-25, Method: Composition-based stats.
Identities = 62/150 (41%), Positives = 91/150 (60%), Gaps = 7/150 (4%)
Query: 88 EGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNYTNKTQFLFRISKGVLNVNDEAVNDSF 147
+G+ + FKEI+ F + G +P +V+++T KTQ+LFRI+KG+L+ + E VND
Sbjct: 1 DGSQLRSHFKEIFGCEFAENSSGRISFPRRVISHTAKTQYLFRINKGMLDPS-EDVNDHM 59
Query: 148 APDEIR-VPFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKERNEEKAKKRVYKMIRD 206
PDE+R +PF NMIY+GD TD+PC ++N GG+SI V+NP + + +K
Sbjct: 60 -PDELRPIPFPNMIYLGDGPTDVPCFTVMNRFGGHSIAVYNPGDESRTSFRKAFQLSGVS 118
Query: 207 NRIGYFTPADYSEGQELDQLVKLIIDRTVF 236
RI Y PADY G L +LI++ TV
Sbjct: 119 GRIKYIAPADYRAGSHL----RLILEETVL 144
>gi|121528230|ref|ZP_01660844.1| conserved hypothetical protein [Ralstonia pickettii 12J]
gi|121304998|gb|EAX45962.1| conserved hypothetical protein [Ralstonia pickettii 12J]
Length = 394
Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats.
Identities = 76/206 (36%), Positives = 112/206 (54%), Gaps = 20/206 (9%)
Query: 43 LAEYGSKVGLFPGVKDWFRRIRQYGADR------EVIIEHYIISSGLKEMIEGTSIAKDF 96
L E G+++ FPGV D+F R + + D ++ +EHYI+S+GL EMI+G+ IA
Sbjct: 162 LRELGAELEFFPGVTDFFERSKAWVRDNATYQAFDIKLEHYIVSTGLTEMIKGSPIAPYI 221
Query: 97 KEIYATSFY----FDDDGVAVWPAQV--VNYTNKTQFLFRISKGVLNVNDEAV--NDSFA 148
++ F + DG V + ++ T KT+ LF I+KGV N + EAV N S A
Sbjct: 222 DGVWGCEFLEASDSNADGRPVIAEVIYAIDNTTKTRALFEINKGV-NKHPEAVSVNSSIA 280
Query: 149 PDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKE-RNEEKAKKRVYKMIRDN 207
DE RVPF NMIY+ D +DIP + GG + V+NP+ R+ E+A + D+
Sbjct: 281 EDERRVPFANMIYVADGPSDIPAFSVARKGGGRTYAVYNPESPRSFEQAD----NLRADD 336
Query: 208 RIGYFTPADYSEGQELDQLVKLIIDR 233
R+ PADY G + +KL I +
Sbjct: 337 RVDMLGPADYRAGSPTEMWLKLHIGK 362
>gi|153888605|ref|ZP_02009746.1| conserved hypothetical protein [Ralstonia pickettii 12D]
gi|151574942|gb|EDN39357.1| conserved hypothetical protein [Ralstonia pickettii 12D]
Length = 317
Score = 112 bits (281), Expect = 3e-23, Method: Composition-based stats.
Identities = 75/206 (36%), Positives = 110/206 (53%), Gaps = 20/206 (9%)
Query: 43 LAEYGSKVGLFPGVKDWFRRIRQYGADR------EVIIEHYIISSGLKEMIEGTSIAKDF 96
L E G+++ FPGV D+F R + + D ++ +EHYI+S+GL EMI+G+ IA
Sbjct: 85 LRELGAELEFFPGVTDFFERSKAWVRDNAAYQAFDIKLEHYIVSTGLTEMIKGSPIAPYI 144
Query: 97 KEIYATSFYFDDDGVAVWPAQV------VNYTNKTQFLFRISKGVLNVNDEAV--NDSFA 148
++ F D A + ++ T KT+ LF I+KGV N + EAV N S A
Sbjct: 145 DGVWGCEFLEAPDSNANGRPVISEVIYAIDNTTKTRALFEINKGV-NKHPEAVSVNSSIA 203
Query: 149 PDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKE-RNEEKAKKRVYKMIRDN 207
DE RVPF NMIY+ D +DIP + GG + V+NP+ R+ E+A + D+
Sbjct: 204 EDERRVPFANMIYVADGPSDIPAFSVARKGGGRTYAVYNPESPRSFEQAD----NLRADD 259
Query: 208 RIGYFTPADYSEGQELDQLVKLIIDR 233
R+ PADY G + +KL I +
Sbjct: 260 RVDMLGPADYRAGSPTEMWLKLHIGK 285
>gi|83746293|ref|ZP_00943346.1| Hypothetical protein RRSL_03886 [Ralstonia solanacearum UW551]
gi|83727043|gb|EAP74168.1| Hypothetical protein RRSL_03886 [Ralstonia solanacearum UW551]
Length = 315
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 75/204 (36%), Positives = 109/204 (53%), Gaps = 18/204 (8%)
Query: 43 LAEYGSKVGLFPGVKDWFRRIRQYGADR------EVIIEHYIISSGLKEMIEGTSIAKDF 96
L G+++ FPGV D+F R + + A ++ +EHYI+S+GL EMI+G+ IA
Sbjct: 85 LRRLGAELEFFPGVADFFERSKAWVAGNPAYQAFDIKLEHYIVSTGLTEMIKGSPIAPHI 144
Query: 97 KEIYATSFYF--DDDGVAVWPAQV--VNYTNKTQFLFRISKGVLNVNDEAV--NDSFAPD 150
++ F D G V + ++ T KT+ LF I+KGV N + EAV N S A D
Sbjct: 145 DGVWGCEFLEVPDAGGRPVISEVIYAIDNTTKTRALFEINKGV-NKHPEAVSVNASMAED 203
Query: 151 EIRVPFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKE-RNEEKAKKRVYKMIRDNRI 209
E RVPF NMIY+ D +DIP + GG + V+NP R+ E+A + D+R+
Sbjct: 204 ERRVPFANMIYVADGPSDIPAFSVARKGGGRTYAVYNPDSPRSFEQAD----NLRADDRV 259
Query: 210 GYFTPADYSEGQELDQLVKLIIDR 233
PADY G + +KL I +
Sbjct: 260 DMLGPADYRVGSPTEMWLKLHIGK 283
>gi|126661496|ref|ZP_01732548.1| hypothetical protein CY0110_25813 [Cyanothece sp. CCY0110]
gi|126617223|gb|EAZ88040.1| hypothetical protein CY0110_25813 [Cyanothece sp. CCY0110]
Length = 290
Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats.
Identities = 66/228 (28%), Positives = 122/228 (53%), Gaps = 14/228 (6%)
Query: 15 ANDMDKNLAYMYTM---KKKARGQLLFTKEKLAEYGSKVGLFPGVKDWFRRIRQYGADRE 71
A K LA Y + K+ + T E+LA G K+ L GV + F +RQ RE
Sbjct: 52 AQGWQKYLARTYGLIQESKRRESKDKITYERLANIGQKLNLIEGVPEMFDSLRQKA--RE 109
Query: 72 VI----IEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNYTNKTQF 127
V+ +E Y+IS G ++ TSIAK FK+++ +D+ G + + + +T KT +
Sbjct: 110 VLEGVEVEFYLISGGFVDIARNTSIAKHFKQMWGCELAYDEKGEIEFLKKQMTHTEKTHY 169
Query: 128 LFRISKGVLNVNDEAVNDSF---APDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGYSIG 184
L+ +SKG N++ + ++ + +E+ +P + +IY+GD +DIPC ++N +GG ++G
Sbjct: 170 LYYLSKGNAEENEQDLMYNYQDLSLEELYIPLNQVIYVGDGTSDIPCFTVINKYGGIALG 229
Query: 185 VFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQELDQLVKLIID 232
+F ++ + K+ + ++ PADY + EL + ++L ++
Sbjct: 230 IFYSHSTAQDWEHRE--KVTKSQQLTNLVPADYHQQSELMRSLRLAVE 275
>gi|116669547|ref|YP_830480.1| hypothetical protein Arth_0983 [Arthrobacter sp. FB24]
gi|116609656|gb|ABK02380.1| conserved hypothetical protein [Arthrobacter sp. FB24]
Length = 315
Score = 105 bits (261), Expect = 6e-21, Method: Composition-based stats.
Identities = 80/248 (32%), Positives = 115/248 (46%), Gaps = 34/248 (13%)
Query: 5 FWKESNGLAT--AN---DMDKNLAYMYTMKKKARGQLL--FTKEKLAEYGSKVGLFPGVK 57
FW E N L AN + K+ AY+ + G T + L E G+ V L PG+
Sbjct: 41 FWDEVNALVDHYANRGLQVSKDTAYLGHILSYIDGGPFEGMTNQTLRELGADVPLAPGMP 100
Query: 58 DWFRRIRQY--GADR----EVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGV 111
+ R+R DR + +EHYI+S+GL++MIEG I ++A D G
Sbjct: 101 ECMDRMRSIVRNDDRFSHHAITVEHYIVSTGLRQMIEGNPIRAHVDGVWACELLSDPPGS 160
Query: 112 AVWPAQ--------------VVNYTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFH 157
+ V++ T KT+ +F I+KGV VN PDE RVP
Sbjct: 161 GYLSSPTADGQSDKLTQVGYVLDNTTKTRAIFEINKGVNAEPSLDVNARMDPDERRVPIK 220
Query: 158 NMIYIGDSDTDIPCMKLVNSHGGYSIGVF-NPKERNEEKAKKRVYKMIRDNRIGYFTPAD 216
NMIYI D +D+P +V+ +GG ++GV+ NP + K + D R+ AD
Sbjct: 221 NMIYIADGPSDVPVFSVVSGNGGKTLGVWTNPGNYDGVK------DLEEDGRVHSIAKAD 274
Query: 217 YSEGQELD 224
YSEG+ D
Sbjct: 275 YSEGEAAD 282
>gi|119026383|ref|YP_910228.1| hypothetical protein BAD_1365 [Bifidobacterium adolescentis ATCC
15703]
gi|118765967|dbj|BAF40146.1| hypothetical protein [Bifidobacterium adolescentis ATCC 15703]
Length = 303
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 80/265 (30%), Positives = 134/265 (50%), Gaps = 28/265 (10%)
Query: 4 DFWKESNGLATAN------DMDKNLAYMYTMKKKARGQLLF---TKEKLAEYGSKVGLFP 54
D+W+ N L +++K+ Y+ K AR F E+L ++G ++ +
Sbjct: 34 DYWRMINNLPQKYKEEQDVEVNKDTIYLNQFIKDARPDGCFPGLNNEQLKDFGKELKFYK 93
Query: 55 GVKDWFRRIRQY-----GADREVIIEHYIISSGLKEMIEGTSIAKDFKEIYATSFYFDDD 109
GV D F +++ +R++ +E+Y++S+G+ ++I+G+ I K I+ F D++
Sbjct: 94 GVPDIFEAVKRVVDQDAYRERDIHVENYVVSTGMSKVIQGSKIEPYMKHIWGCEFIEDEN 153
Query: 110 ----GVAVWPAQVVNYTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFHNMIYIGDS 165
V ++ T+KT+ LF I+KG+ + VN + RV F NMIYI D
Sbjct: 154 ENGQKVISELGYTIDNTSKTRALFEINKGIPETPNIDVNAKVPEELRRVQFKNMIYIADG 213
Query: 166 DTDIPCMKLVNSHGGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSEGQE--- 222
+DIP +V +GG + V+ +++EKA +V +M RD RI + ADYSEG
Sbjct: 214 PSDIPAFSVVKRYGGATFAVY---PKHDEKAFDQVEQMRRDERIDMYAEADYSEGTTAYM 270
Query: 223 -LDQLVKLIIDRTVFNEQLERKHYE 246
L VKL+ + V Q E+K E
Sbjct: 271 WLTHKVKLLAEGIV---QREKKRLE 292
>gi|17546785|ref|NP_520187.1| hypothetical protein RSc2066 [Ralstonia solanacearum GMI1000]
gi|17429085|emb|CAD15773.1| conserved hypothetical protein [Ralstonia solanacearum]
Length = 316
Score = 100 bits (248), Expect = 2e-19, Method: Composition-based stats.
Identities = 65/201 (32%), Positives = 105/201 (52%), Gaps = 11/201 (5%)
Query: 43 LAEYGSKVGLFPGVKDWFRRIRQYGA------DREVIIEHYIISSGLKEMIEGTSIAKDF 96
L E GS + FPGV ++F + + A ++ +EHYI+S+GL EMI G+ +A
Sbjct: 85 LRELGSTLEFFPGVVEFFAASKAWVAGVPAYQQFDIQLEHYIVSTGLTEMIRGSVLAPHI 144
Query: 97 KEIYATSFYFD--DDGVAVWPAQV--VNYTNKTQFLFRISKGVLNVNDEAVNDSFAPDEI 152
I+ F DG A + ++ T KT+ +F I+KGV ++ VN + A DE
Sbjct: 145 DGIWGCEFLESAAPDGQAEIAEVIYAIDNTTKTRAIFEINKGVNKHHEVNVNSTIAEDER 204
Query: 153 RVPFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYF 212
RVP NMIY+ D +DIP +V GG + V+N + + ++ ++ + + R+ F
Sbjct: 205 RVPMRNMIYVADGPSDIPAFSVVRKGGGRTYAVYNAASQ-DGRSFEQADDLRANQRVDSF 263
Query: 213 TPADYSEGQELDQLVKLIIDR 233
PADY ++ +KL I +
Sbjct: 264 GPADYCARSHTERWLKLHIGK 284
>gi|152993883|ref|YP_001359604.1| hypothetical protein SUN_2308 [Sulfurovum sp. NBC37-1]
gi|151425744|dbj|BAF73247.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
Length = 296
Score = 98.2 bits (243), Expect = 6e-19, Method: Composition-based stats.
Identities = 86/274 (31%), Positives = 125/274 (45%), Gaps = 40/274 (14%)
Query: 4 DFWKESNGLATANDMDKNLAYMYTMKKK-ARGQLLFTKEKLAEYGSKVGLFPGV------ 56
+FWK T + D Y+ + + A L + L G + L+ G+
Sbjct: 34 EFWKRCTKKVTDEEYDLEHGYIKVITEYIAEQSLSLSNNDLFNLGRSISLYDGLSRRESK 93
Query: 57 KDWFRRI------RQYGADREVIIEHYIISSGLKEMIEGT----SIAKDFKEIYATSFYF 106
K+ F + + YG D V +E Y IS GL EMI G + FK+IYA
Sbjct: 94 KNIFDDMLDIVNSKTYG-DLNVELECYCISGGLTEMINGAIESHELTPFFKKIYACRLVE 152
Query: 107 DDDGVAVWPAQVVNYTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFHNMIYIGDSD 166
+ DG +P + V +T KTQ +F+I+KG+ +E V D +PF +MI++GD
Sbjct: 153 NSDGNIAFPKETVGHTIKTQKIFQIAKGLEKDVNEVV------DGYEIPFEHMIFVGDGL 206
Query: 167 TDIPCMKLVNSHGGYSIGVFNPKER-----NEEKAKKRV---YKM-IRDNRIGYFTPADY 217
TD+P LV GG SI V+ + N E+ K Y++ I+ R PADY
Sbjct: 207 TDVPAFSLVQKMGGTSIAVYRESKNGDGSVNPEQTFKNYEAGYELAIKSKRAEQLLPADY 266
Query: 218 SEGQELDQLVKLIIDRTVFNEQLERKHYEYKNEA 251
SEG+ L + +DR KH +KNEA
Sbjct: 267 SEGKPLKMALLYHVDRIC-------KHIAHKNEA 293
>gi|51598270|ref|YP_072458.1| hypothetical protein BG0007 [Borrelia garinii PBi]
gi|51572841|gb|AAU06866.1| hypothetical protein BG0007 [Borrelia garinii PBi]
Length = 286
Score = 78.2 bits (191), Expect = 8e-13, Method: Composition-based stats.
Identities = 60/198 (30%), Positives = 94/198 (47%), Gaps = 25/198 (12%)
Query: 43 LAEYGSKVGLFPGVKDWFRRIRQYGADRE---VIIEHYIISSGLKEMIEGTSIAKDFKEI 99
L G+K+ F GV D F I + E I+ YI+SSG ++MI G+ IA ++
Sbjct: 84 LFNLGAKLRFFEGVIDLFDEISKINKKLENNNSQIKIYIVSSGFRQMILGSKIAPYLSKV 143
Query: 100 YATSFY--------------FDDDGVAVWPAQVVNYTNKTQFLFRISKGVLNVNDEAVND 145
+A F F + V V++T KT+ +F I+KG + E +ND
Sbjct: 144 WACEFIDSYLMPFYELRDEKFSKNKVLSSVCYFVDHTIKTRVIFEINKG----SYEKIND 199
Query: 146 SFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKERNEEKAKKRVYKMIR 205
+ +PF N+ YI D DIP +++N+ + N++ AKK Y
Sbjct: 200 RVPKSKREIPFKNIFYIADGFNDIPAFEILNNILNHCKNTLTVYHGNDKNAKKLFY---- 255
Query: 206 DNRIGYFTPADYSEGQEL 223
+NR+G F A+YS+G +L
Sbjct: 256 ENRVGDFAEANYSKGTKL 273
>gi|111114829|ref|YP_709447.1| hypothetical protein BAPKO_0006 [Borrelia afzelii PKo]
gi|110890103|gb|ABH01271.1| hypothetical protein BAPKO_0006 [Borrelia afzelii PKo]
Length = 285
Score = 77.4 bits (189), Expect = 1e-12, Method: Composition-based stats.
Identities = 63/200 (31%), Positives = 96/200 (48%), Gaps = 29/200 (14%)
Query: 43 LAEYGSKVGLFPGVKDWFRRIRQY-----GADREVIIEHYIISSGLKEMIEGTSIAK--- 94
L G+K+ F GV F I Q G + +V I YI+SSG ++MI G+ IA
Sbjct: 84 LFNLGAKLRFFKGVISLFDEISQINKKLEGNNSQVKI--YIVSSGFRQMILGSKIAPYVS 141
Query: 95 -----DFKEIYATSFY------FDDDGVAVWPAQVVNYTNKTQFLFRISKGVLNVNDEAV 143
+F + Y FY F + V V++T KT+ +F I+KG + E +
Sbjct: 142 KVWACEFIDSYLMPFYELRDEKFSKNKVLSSVCYFVDHTIKTRVIFEINKG----SYEKI 197
Query: 144 NDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKERNEEKAKKRVYKM 203
N + ++PF N+ YI D TDIP +++N+ + N+ AKK Y
Sbjct: 198 NQRVPKSKRQIPFKNIFYIADGFTDIPAFEILNNILNHCRNTLTVYHENDTNAKKLFY-- 255
Query: 204 IRDNRIGYFTPADYSEGQEL 223
+NR+G F A+YS+G +L
Sbjct: 256 --ENRVGDFAEANYSKGTKL 273
>gi|15594353|ref|NP_212141.1| hypothetical protein BB0007 [Borrelia burgdorferi B31]
gi|3915318|sp|O51040|Y007_BORBU Uncharacterized protein BB_0007
gi|2687893|gb|AAC66404.1| predicted coding region BB0007 [Borrelia burgdorferi B31]
Length = 285
Score = 73.9 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 65/244 (26%), Positives = 111/244 (45%), Gaps = 33/244 (13%)
Query: 5 FWKESNGLATA------NDMDKNLAYMYTMKKKARGQLLFTKEKLAEY--GSKVGLFPGV 56
FW+E GL N + + Y+ R + + A + G+K+ F GV
Sbjct: 38 FWREVEGLEYVYKQNGYNIISNEMIYLSHFLTYVREGFFESLDNRALFNLGAKLRFFEGV 97
Query: 57 KDWFRRIRQYGADRE---VIIEHYIISSGLKEMIEGTSIAK--------DFKEIYATSFY 105
F I + E I+ YI+SSG ++MI G+ IA +F + Y FY
Sbjct: 98 IGLFDEISEINKKLENSKSQIKIYIVSSGFRQMILGSKIAPYVSKVWACEFIDSYLMPFY 157
Query: 106 ------FDDDGVAVWPAQVVNYTNKTQFLFRISKGVLNVNDEAVNDSFAPDEIRVPFHNM 159
F + + V++T KT+ +F I+KG + E +N+ + ++PF N+
Sbjct: 158 ELRDDKFSKNKILSSVCYFVDHTIKTRVIFEINKG----SYEKINERVPKSKRQIPFKNI 213
Query: 160 IYIGDSDTDIPCMKLVNSHGGYSIGVFNPKERNEEKAKKRVYKMIRDNRIGYFTPADYSE 219
YI D +D+P +++N+ + N++ AKK Y +NR+G F A+Y++
Sbjct: 214 FYIADGFSDVPAFEILNNILKHCRNTLTVYHGNDKNAKKLFY----ENRVGDFAEANYTK 269
Query: 220 GQEL 223
G +L
Sbjct: 270 GTKL 273
>gi|114704592|ref|ZP_01437500.1| hypothetical protein FP2506_06646 [Fulvimarina pelagi HTCC2506]
gi|114539377|gb|EAU42497.1| hypothetical protein FP2506_06646 [Fulvimarina pelagi HTCC2506]
Length = 284
Score = 72.4 bits (176), Expect = 4e-11, Method: Composition-based stats.
Identities = 57/210 (27%), Positives = 104/210 (49%), Gaps = 16/210 (7%)
Query: 23 AYMYTMKKKARGQLLFTKEKLAEYGSKVGLFPGVKDWFRRIRQYGADREVI----IEHYI 78
AY + RG+ L ++ L + +++ L+ V RIR A REV ++ +
Sbjct: 60 AYALIKAGRERGRPL-NRDALRKAAAEIELYDEVTQMPERIRS--AAREVAPGVEVDFTV 116
Query: 79 ISSGLKEMIEGTSIAKDFKEIYATSFYFDDDGVAVWPAQVVNYTNKTQFLFRISKG--VL 136
ISSG ++IE T+I F I+ +S +FD+ G AV+ + + + K +++ +SKG +
Sbjct: 117 ISSGYVDIIEHTAIKDCFDRIWGSSLHFDESGEAVFIKRALTHPEKARYMEALSKGFSIH 176
Query: 137 NVN-DEAVNDSFAPDEIRVPFHNMIYIGDSDTDIPCMKLVNSHGGYSIGVFNPKERNEEK 195
N E+ + + ++ VP + MI++GD +D+ + V ++GG +I V ++
Sbjct: 177 GSNAPESTTKAVSDEDKAVPPNQMIFVGDGASDLQAFQYVEANGGIAIAV------RQDG 230
Query: 196 AKKRVYKMIRDNRIGYFTPADYSEGQELDQ 225
M R PA Y++G E+ Q
Sbjct: 231 GFAGAKSMHARQRPTNLAPASYAKGGEMMQ 260
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.316 0.135 0.385
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,292,182,663
Number of Sequences: 5470121
Number of extensions: 56457074
Number of successful extensions: 216125
Number of sequences better than 1.0e-05: 34
Number of HSP's better than 0.0 without gapping: 24
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 216021
Number of HSP's gapped (non-prelim): 34
length of query: 352
length of database: 1,894,087,724
effective HSP length: 134
effective length of query: 218
effective length of database: 1,161,091,510
effective search space: 253117949180
effective search space used: 253117949180
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 130 (54.7 bits)