BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= TF0657
(303 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|150007298|ref|YP_001302041.1| hypothetical protein BDI_0... 414 e-114
gi|154492793|ref|ZP_02032419.1| hypothetical protein PARMER... 405 e-111
gi|156862647|gb|EDO56078.1| hypothetical protein BACUNI_002... 337 7e-91
gi|150003108|ref|YP_001297852.1| hypothetical protein BVU_0... 331 4e-89
gi|29349097|ref|NP_812600.1| hypothetical protein BT_3689 [... 322 2e-86
gi|53711763|ref|YP_097755.1| hypothetical protein BF0472 [B... 319 1e-85
gi|60679996|ref|YP_210140.1| hypothetical protein BF0416 [B... 319 1e-85
gi|156109118|gb|EDO10863.1| hypothetical protein BACOVA_034... 318 4e-85
gi|34541455|ref|NP_905934.1| hypothetical protein PG1841 [P... 270 9e-71
gi|89894028|ref|YP_517515.1| hypothetical protein DSY1282 [... 198 4e-49
gi|154497094|ref|ZP_02035790.1| hypothetical protein BACCAP... 196 2e-48
gi|156868165|gb|EDO61537.1| hypothetical protein CLOLEP_019... 196 2e-48
gi|134300072|ref|YP_001113568.1| hypothetical protein Dred_... 185 3e-45
gi|153854175|ref|ZP_01995483.1| hypothetical protein DORLON... 183 1e-44
gi|34556601|ref|NP_906416.1| hypothetical protein WS0153 [W... 182 2e-44
gi|153814378|ref|ZP_01967046.1| hypothetical protein RUMTOR... 182 3e-44
gi|154502566|ref|ZP_02039626.1| hypothetical protein RUMGNA... 181 5e-44
gi|28210530|ref|NP_781474.1| hypothetical protein CTC00813 ... 161 5e-38
gi|156865105|gb|EDO58536.1| hypothetical protein CLOL250_00... 160 1e-37
gi|32267008|ref|NP_861040.1| hypothetical protein HH1509 [H... 160 1e-37
gi|118725200|ref|ZP_01573843.1| conserved hypothetical prot... 151 4e-35
gi|146295805|ref|YP_001179576.1| hypothetical protein Csac_... 146 1e-33
gi|78357692|ref|YP_389141.1| hypothetical protein Dde_2650 ... 146 2e-33
gi|15611347|ref|NP_222998.1| hypothetical protein jhp0277 [... 144 5e-33
gi|153808945|ref|ZP_01961613.1| hypothetical protein BACCAC... 144 7e-33
gi|15644920|ref|NP_207090.1| hypothetical protein HP0292 [H... 142 3e-32
gi|108562719|ref|YP_627035.1| hypothetical protein HPAG1_02... 139 2e-31
gi|21226283|ref|NP_632205.1| hypothetical protein MM_0181 [... 138 4e-31
gi|109947143|ref|YP_664371.1| hypothetical protein Hac_0552... 137 9e-31
gi|126178970|ref|YP_001046935.1| hypothetical protein Memar... 135 3e-30
gi|153938729|ref|YP_001390465.1| hypothetical protein CLI_1... 134 5e-30
gi|126700094|ref|YP_001088991.1| hypothetical protein CD247... 134 6e-30
gi|148379099|ref|YP_001253640.1| hypothetical protein CBO11... 134 9e-30
gi|147918986|ref|YP_687287.1| hypothetical protein RRC217 [... 132 3e-29
gi|73669627|ref|YP_305642.1| hypothetical protein Mbar_A213... 131 6e-29
gi|20090642|ref|NP_616717.1| hypothetical protein MA1791 [M... 130 9e-29
gi|46580958|ref|YP_011766.1| hypothetical protein DVU2554 [... 130 1e-28
gi|148322876|gb|EDK88126.1| hypothetical protein FNP_0312 [... 130 1e-28
gi|120601744|ref|YP_966144.1| hypothetical protein Dvul_069... 130 1e-28
gi|19703622|ref|NP_603184.1| hypothetical protein FN0277 [F... 130 1e-28
gi|154150103|ref|YP_001403721.1| hypothetical protein Mboo_... 127 6e-28
gi|125972764|ref|YP_001036674.1| hypothetical protein Cthe_... 127 8e-28
gi|39996148|ref|NP_952099.1| hypothetical protein GSU1046 [... 127 1e-27
gi|88602186|ref|YP_502364.1| hypothetical protein Mhun_0895... 124 6e-27
gi|153953652|ref|YP_001394417.1| hypothetical protein CKL_1... 124 7e-27
gi|121534719|ref|ZP_01666540.1| conserved hypothetical prot... 118 4e-25
gi|110800971|ref|YP_695841.1| hypothetical protein CPF_1396... 118 5e-25
gi|110803855|ref|YP_698530.1| hypothetical protein CPR_1208... 117 1e-24
gi|145618153|ref|ZP_01774214.1| conserved hypothetical prot... 116 2e-24
gi|148265328|ref|YP_001232034.1| hypothetical protein Gura_... 115 3e-24
gi|110601754|ref|ZP_01389925.1| conserved hypothetical prot... 112 3e-23
gi|124485248|ref|YP_001029864.1| putative transcriptional r... 112 3e-23
gi|42528175|ref|NP_973273.1| hypothetical protein TDE2675 [... 109 2e-22
gi|94987570|ref|YP_595503.1| hypothetical protein LI1128 [L... 107 1e-21
gi|154485046|ref|ZP_02027494.1| hypothetical protein EUBVEN... 106 2e-21
gi|85859315|ref|YP_461517.1| hypothetical cytosolic protein... 97 1e-18
gi|78221940|ref|YP_383687.1| hypothetical protein Gmet_0720... 91 1e-16
gi|78777400|ref|YP_393715.1| hypothetical protein Tmden_120... 84 1e-14
gi|154174955|ref|YP_001408913.1| ribonuclease HII (RNase HI... 77 2e-12
gi|150384601|ref|ZP_01923283.1| conserved hypothetical prot... 74 2e-11
gi|34762805|ref|ZP_00143791.1| hypothetical protein [Fusoba... 73 3e-11
gi|118475472|ref|YP_891393.1| hypothetical protein CFF8240_... 73 3e-11
gi|46446557|ref|YP_007922.1| hypothetical protein pc0923 [C... 60 2e-07
gi|26554339|ref|NP_758273.1| hypothetical protein MYPE8860 ... 57 2e-06
>gi|150007298|ref|YP_001302041.1| hypothetical protein BDI_0643 [Parabacteroides distasonis ATCC
8503]
gi|149935722|gb|ABR42419.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 305
Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats.
Identities = 190/301 (63%), Positives = 248/301 (82%), Gaps = 5/301 (1%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FKPI +EDKE+ITSFT+PS ++NCDF+F+NMCSWRFLY+SE+A+ +G LLIRF IE+K
Sbjct: 5 FKPIRLEDKEIITSFTFPSDYRNCDFSFANMCSWRFLYDSEFAIVDGYLLIRFMIEDK-- 62
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
R AYM PVG+GDL AV LE+DS + HPL +LG+T +A+++LE P F Y+
Sbjct: 63 SRFAYMMPVGDGDLAGAVRLLEEDSLKYR---HPLCMLGITPDAKEQLEGALPGSFFYIP 119
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWVR 182
ERDYFDYIYLREDL TL+GKK+Q+KRNHIN FKKQY Y ++PITP+IVPQC++LE +W +
Sbjct: 120 ERDYFDYIYLREDLATLRGKKYQSKRNHINNFKKQYSYEHVPITPDIVPQCLKLECKWYK 179
Query: 183 ANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHVE 242
AN + +EL++E RS+T+A+HHF LGL+GGA+ V+ EI+AF++G+PIN+ TFGVHVE
Sbjct: 180 ANNGDNDEEELNDERRSLTYALHHFDSLGLIGGALRVDNEIIAFSFGAPINHNTFGVHVE 239
Query: 243 KADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNGAIKR 302
KAD N++G +++INQEFA H+PEQY Y+NREEDLGIPGLRQ+KLSY P ILLEK+ AIK+
Sbjct: 240 KADVNFDGAYTVINQEFASHLPEQYTYVNREEDLGIPGLRQAKLSYQPTILLEKSAAIKK 299
Query: 303 R 303
+
Sbjct: 300 Q 300
>gi|154492793|ref|ZP_02032419.1| hypothetical protein PARMER_02432 [Parabacteroides merdae ATCC
43184]
gi|154087098|gb|EDN86143.1| hypothetical protein PARMER_02432 [Parabacteroides merdae ATCC
43184]
Length = 302
Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats.
Identities = 187/300 (62%), Positives = 250/300 (83%), Gaps = 5/300 (1%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FKPI IED+++ITSFT PS ++NCD++F+N+CSWRFLY+SE+A+ G LLIRF+IE K
Sbjct: 5 FKPIEIEDRDIITSFTIPSNYKNCDYSFANICSWRFLYDSEFAIVNGSLLIRFWIENK-- 62
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
R+AYM P G G+L+QA++ LE DS E +GHPL +LGVT +A+++LE+ P F Y+
Sbjct: 63 TRVAYMTPTGQGNLKQAIDLLEADSLE---QGHPLCMLGVTPDAKEELEKAIPGGFFYIP 119
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWVR 182
ER+YFDYIYLREDL TLKGKK+QAKRNHINKF K++ Y YIPITPE+VP+C++LE +W +
Sbjct: 120 ERNYFDYIYLREDLATLKGKKYQAKRNHINKFNKKFAYEYIPITPELVPECLQLECKWYK 179
Query: 183 ANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHVE 242
AN + ++L++E RS+ +A++H+ ELGL+GGAI V+ +IVAFT+G+PIN+ TFGVHVE
Sbjct: 180 ANREDNDEEDLNDERRSIIYALNHYDELGLIGGAICVDHQIVAFTFGAPINHNTFGVHVE 239
Query: 243 KADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNGAIKR 302
KA+ NYEG +++IN+EFA H+PE+Y Y+NREEDLGIPGLRQ+KLSYNP ILLEK+ AIK+
Sbjct: 240 KANVNYEGAYAVINKEFASHLPEKYTYVNREEDLGIPGLRQAKLSYNPFILLEKSAAIKK 299
>gi|156862647|gb|EDO56078.1| hypothetical protein BACUNI_00220 [Bacteroides uniformis ATCC 8492]
Length = 304
Score = 337 bits (863), Expect = 7e-91, Method: Composition-based stats.
Identities = 156/294 (53%), Positives = 218/294 (74%), Gaps = 8/294 (2%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FK I ++DKELITS+T S +NCD +FSN+CSWRFLY +++A+ +G LL++F+ ++
Sbjct: 4 FKDIELQDKELITSYTQNSPRRNCDLSFSNLCSWRFLYNTKFAIMDGFLLLKFWANDE-- 61
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
+ YM P+GNGDL + ++AL +D++ EG P +LG+ S +LE P +F +
Sbjct: 62 --LVYMMPIGNGDLTKVLDALVEDAHR---EGEPFCLLGICSGMCSELEAFMPGKFQFTA 116
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWVR 182
+RDY DY+YLR DL TL GKK+Q+KRNH+NKFK+ Y Y Y PITP+ + +C++LE W +
Sbjct: 117 DRDYADYLYLRTDLATLAGKKFQSKRNHVNKFKRTYNYEYTPITPDRIQECLDLEAEWCK 176
Query: 183 ANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHVE 242
AN N + + NE R++ +A+H+F+ELGL GG + V+G+I AFT+G PIN TFGVHVE
Sbjct: 177 AN-NCDQHEGTGNERRALVYALHNFEELGLTGGILHVDGKIAAFTFGMPINQDTFGVHVE 235
Query: 243 KADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
KADT+ +G +++IN EFA HIPEQYVY+NREEDLGI GLR++KLSY P I+LEK
Sbjct: 236 KADTSIDGAYAMINYEFANHIPEQYVYLNREEDLGIEGLRKAKLSYQPAIILEK 289
>gi|150003108|ref|YP_001297852.1| hypothetical protein BVU_0520 [Bacteroides vulgatus ATCC 8482]
gi|149931532|gb|ABR38230.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 300
Score = 331 bits (848), Expect = 4e-89, Method: Composition-based stats.
Identities = 165/295 (55%), Positives = 215/295 (72%), Gaps = 9/295 (3%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FK I + DK+LI SFT S +NCD +F+N+CSW FLY+++YAV + LL+RFY E+
Sbjct: 4 FKDITLADKDLIQSFTLGSLRRNCDLSFANLCSWIFLYQTKYAVMDNYLLLRFYAGEE-- 61
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
+AYM PVG GD++ +EAL KD+ EM G L +LGV + +E P F +
Sbjct: 62 --LAYMMPVGTGDVKPVLEALIKDAEEM---GAKLRMLGVCVGMKADIEAAMPGRFTFTE 116
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERRWV 181
+RDYFDYIYLR DL TLKGKK+QAKRNHINKFKKQY +Y Y P+TP++VP+C++LE W
Sbjct: 117 DRDYFDYIYLRTDLATLKGKKFQAKRNHINKFKKQYPDYEYKPLTPDLVPECLKLEEEWC 176
Query: 182 RANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHV 241
RAN N E L E +SMT+A++H + LGL GG + V G+I AFTYG+PIN++T+ V
Sbjct: 177 RAN-NCEEQLALGAERKSMTYALNHMEALGLTGGVLHVNGKIAAFTYGAPINHETWDTCV 235
Query: 242 EKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
EKADT EG +++IN E+A HI EQY+Y+NREEDLG+ GLR++KLSY PVILLEK
Sbjct: 236 EKADTGIEGSYAMINYEYANHIDEQYIYVNREEDLGLEGLRKAKLSYQPVILLEK 290
>gi|29349097|ref|NP_812600.1| hypothetical protein BT_3689 [Bacteroides thetaiotaomicron
VPI-5482]
gi|116667787|pdb|2HQY|A Chain A, Crystal Structure Of Conserved Hypothetical Protein From
Bacteroides Thetaiotaomicron Vpi-5482
gi|116667788|pdb|2HQY|B Chain B, Crystal Structure Of Conserved Hypothetical Protein From
Bacteroides Thetaiotaomicron Vpi-5482
gi|29341004|gb|AAO78794.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 305
Score = 322 bits (825), Expect = 2e-86, Method: Composition-based stats.
Identities = 154/295 (52%), Positives = 212/295 (71%), Gaps = 9/295 (3%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FK I + D++ IT+FT S +NCD +FSN+CSWRFLY++++AV + L+ +F+ E+
Sbjct: 4 FKDITLADRDTITAFTMKSDRRNCDLSFSNLCSWRFLYDTQFAVIDDFLVFKFWAGEQ-- 61
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
+AYM PVGNGDL+ + L +D+ + E H +LGV S R LE + P F +
Sbjct: 62 --LAYMMPVGNGDLKAVLRKLIEDADK---EKHNFCMLGVCSNMRADLEAILPERFIFTE 116
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERRWV 181
+R Y DYIYLR DL TLKGKK+QAKRNHIN+F+ Y +Y Y PITP+ + +C++LE W
Sbjct: 117 DRAYADYIYLRSDLATLKGKKFQAKRNHINRFRNTYPDYEYTPITPDRIQECLDLEAEWC 176
Query: 182 RANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHV 241
+ N N + + NE R++ +A+H+F+ LGL GG + V G+IVAFT+G PIN++TFGVHV
Sbjct: 177 KVN-NCDQQEGTGNERRALIYALHNFEALGLTGGILHVNGKIVAFTFGMPINHETFGVHV 235
Query: 242 EKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
EKADT+ +G +++IN EFA IPEQY+YINREEDLGI GLR++KLSY PV +LEK
Sbjct: 236 EKADTSIDGAYAMINYEFANRIPEQYIYINREEDLGIEGLRKAKLSYQPVTILEK 290
>gi|53711763|ref|YP_097755.1| hypothetical protein BF0472 [Bacteroides fragilis YCH46]
gi|52214628|dbj|BAD47221.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 305
Score = 319 bits (818), Expect = 1e-85, Method: Composition-based stats.
Identities = 153/297 (51%), Positives = 217/297 (73%), Gaps = 9/297 (3%)
Query: 1 MVFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEK 60
+ F+ I I+DK+ IT++T S +NCD +FSN+CSWRFLY +++A+ L+ +F+ ++
Sbjct: 2 IAFRDITIQDKDTITAYTMNSCRRNCDLSFSNLCSWRFLYHTKFAIINNFLVFKFWAGDE 61
Query: 61 DPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
+AYM PVG G+L + + L +D+ + EG P +LGV S R+ LE + P +F +
Sbjct: 62 ----LAYMMPVGEGNLEEVLNELIEDARQ---EGEPFCMLGVCSCMREDLEAIMPGQFGF 114
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERR 179
+RDY DYIYLR DL TLKGKK+Q+KRNHINKF+ Y +Y Y PIT + + +C+ELE +
Sbjct: 115 TVDRDYADYIYLRSDLATLKGKKFQSKRNHINKFRNTYPDYKYSPITKDRIQECLELEAK 174
Query: 180 WVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGV 239
W +AN+ + + NE R++ +A++HF+ELGL GG + V G+IVAFT+G PIN +TFGV
Sbjct: 175 WCKAND-CDQQEGTGNERRALIYALNHFEELGLTGGILHVNGQIVAFTFGMPINKETFGV 233
Query: 240 HVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
HVEKADT+ +G +++IN EFA HIPEQY+YINREEDLGI GLR++KLSY+P +LEK
Sbjct: 234 HVEKADTSIDGAYAMINYEFANHIPEQYIYINREEDLGIEGLRKAKLSYHPETILEK 290
>gi|60679996|ref|YP_210140.1| hypothetical protein BF0416 [Bacteroides fragilis NCTC 9343]
gi|60491430|emb|CAH06180.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 305
Score = 319 bits (817), Expect = 1e-85, Method: Composition-based stats.
Identities = 153/297 (51%), Positives = 217/297 (73%), Gaps = 9/297 (3%)
Query: 1 MVFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEK 60
+ F+ I I+DK+ IT++T S +NCD +FSN+CSWRFLY +++A+ L+ +F+ ++
Sbjct: 2 IAFRDITIQDKDTITAYTMNSCRRNCDLSFSNLCSWRFLYHTKFAIINNFLVFKFWAGDE 61
Query: 61 DPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
+AYM PVG G+L + + L +D+ + EG P +LGV S R+ LE + P +F +
Sbjct: 62 ----LAYMMPVGEGNLEEVLNELIEDARQ---EGEPFCMLGVCSCMREDLEAIMPGQFGF 114
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERR 179
+RDY DYIYLR DL TLKGKK+Q+KRNHINKF+ Y +Y Y PIT + + +C+ELE +
Sbjct: 115 TVDRDYADYIYLRSDLATLKGKKFQSKRNHINKFRNTYPDYEYSPITKDRIQECLELEAK 174
Query: 180 WVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGV 239
W +AN+ + + NE R++ +A++HF+ELGL GG + V G+IVAFT+G PIN +TFGV
Sbjct: 175 WCKAND-CDQQEGTGNERRALIYALNHFEELGLTGGILHVNGQIVAFTFGMPINKETFGV 233
Query: 240 HVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
HVEKADT+ +G +++IN EFA HIPEQY+YINREEDLGI GLR++KLSY+P +LEK
Sbjct: 234 HVEKADTSIDGAYAMINYEFANHIPEQYIYINREEDLGIEGLRKAKLSYHPETILEK 290
>gi|156109118|gb|EDO10863.1| hypothetical protein BACOVA_03496 [Bacteroides ovatus ATCC 8483]
Length = 305
Score = 318 bits (814), Expect = 4e-85, Method: Composition-based stats.
Identities = 151/295 (51%), Positives = 212/295 (71%), Gaps = 9/295 (3%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FK + + D++ ITSFT S +NCD +FSN+CSWRFLY++++AV + L+ +F+ ++
Sbjct: 4 FKDVTLADRDTITSFTMKSDRRNCDLSFSNLCSWRFLYDTQFAVVDNFLVFKFWAGDQ-- 61
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
+AYM PVG GDL+ + L +D+ + E +LGV S R LE + P +F +
Sbjct: 62 --LAYMMPVGTGDLKAILGELIEDARK---ENQHFCMLGVCSNMRADLEAILPGQFTFTE 116
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERRWV 181
+RDY DYIYLR DL TLKGKK+QAKRNHIN+F+ Y +Y Y PITP+ + +C++LE W
Sbjct: 117 DRDYADYIYLRSDLSTLKGKKFQAKRNHINRFRNTYPDYEYTPITPDRIQECLDLEAEWC 176
Query: 182 RANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHV 241
+ N + + + NE R++ +A+H+F+ LGL GG + V G+IVAFT+G PIN++TFGVHV
Sbjct: 177 KVN-HCDQQEGTGNERRALIYALHNFEALGLTGGILHVNGKIVAFTFGMPINHETFGVHV 235
Query: 242 EKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
EKADT+ EG +++IN EFA IPEQY+YINREEDLG+ GLR++KLSY PV +LEK
Sbjct: 236 EKADTSIEGAYAMINYEFANRIPEQYIYINREEDLGLEGLRKAKLSYQPVTILEK 290
>gi|34541455|ref|NP_905934.1| hypothetical protein PG1841 [Porphyromonas gingivalis W83]
gi|34397772|gb|AAQ66833.1| conserved hypothetical protein [Porphyromonas gingivalis W83]
Length = 307
Score = 270 bits (690), Expect = 9e-71, Method: Composition-based stats.
Identities = 145/304 (47%), Positives = 190/304 (62%), Gaps = 11/304 (3%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FKP+ D+ IT+ T+ S + CD AFSN+ W F+Y + +A+ EGCL+IRF + K
Sbjct: 5 FKPVEPADRSAITTITFSSVARICDLAFSNLYCWSFVYGTSWAIVEGCLIIRF--KPKSR 62
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
Y+FPVG D Q V A + E++ E +PL+ +GVT + + +EE E+ ++
Sbjct: 63 SHPVYLFPVG-ADPEQVVAAAHRLKAEVVHEDYPLIFMGVTPDIHRCIEEHCSAEYYFIE 121
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERRWV 181
+ Y DYIY RE L TL GKK QAKRNHINKF Y +YTY P+ +C+ L W+
Sbjct: 122 DEAYCDYIYERESLATLSGKKLQAKRNHINKFVSLYPDYTYDPVKDSDAEECLMLAHLWL 181
Query: 182 --RANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGV 239
R +++G EL E F H +ELGL GG + V G +VAF GSPIN TFGV
Sbjct: 182 DSRGDQDGR---ELEVEMVERAF--RHRQELGLSGGVLRVGGRVVAFCLGSPINTDTFGV 236
Query: 240 HVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNGA 299
H+EKAD EG F++IN+EFA IP+ + YINREEDLG+PGLRQ+KLSY P ILL K A
Sbjct: 237 HIEKADAMVEGAFAMINREFARSIPDTFRYINREEDLGLPGLRQAKLSYRPAILLPKQTA 296
Query: 300 IKRR 303
I RR
Sbjct: 297 ILRR 300
>gi|89894028|ref|YP_517515.1| hypothetical protein DSY1282 [Desulfitobacterium hafniense Y51]
gi|109646405|ref|ZP_01370309.1| conserved hypothetical protein [Desulfitobacterium hafniense DCB-2]
gi|89333476|dbj|BAE83071.1| hypothetical protein [Desulfitobacterium hafniense Y51]
gi|109641651|gb|EAT51205.1| conserved hypothetical protein [Desulfitobacterium hafniense DCB-2]
Length = 299
Score = 198 bits (503), Expect = 4e-49, Method: Composition-based stats.
Identities = 114/297 (38%), Positives = 172/297 (57%), Gaps = 11/297 (3%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FK I + DKE + + C F+N+ SW Y + A E L+I+ + E
Sbjct: 4 FKKIEMRDKEWVKPLLEAADLGGCHQNFTNLFSWSGTYHYQVAQVEDYLVIKGRLGET-- 61
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
Y +P G GD++ +EA++KD+ E GH ++LG++ E L+EL+P F Y
Sbjct: 62 --YYYFYPAGTGDVQPVLEAMKKDAQE---NGHEFIVLGISPENMATLKELYPEHFEYEE 116
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWVR 182
RD FDY+Y E L TL G+K QAKRNHIN+F+ + + + +TPE + +C E+ W R
Sbjct: 117 MRDSFDYVYQAEKLATLAGRKLQAKRNHINRFEANHIWAFELLTPENLAECWEMNLEWCR 176
Query: 183 ANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHVE 242
N+ ++ ++L E+ ++ +F +L L GG + +G IVAFT G +N T+ VHVE
Sbjct: 177 RNDCKDD-EQLRAEYCAVKRDFDYFTDLELEGGLLRADGRIVAFTMGERLNSDTYVVHVE 235
Query: 243 KADTNYEGIFSLINQEFALHIPEQY---VYINREEDLGIPGLRQSKLSYNPVILLEK 296
KA +G + +IN+EF I E Y +Y+NREED+G GLR++KLSY+P + EK
Sbjct: 236 KAFGEIQGAYQMINREFVRWIRETYPDMIYVNREEDMGYEGLRKAKLSYHPDKMEEK 292
>gi|154497094|ref|ZP_02035790.1| hypothetical protein BACCAP_01387 [Bacteroides capillosus ATCC
29799]
gi|150273493|gb|EDN00621.1| hypothetical protein BACCAP_01387 [Bacteroides capillosus ATCC
29799]
Length = 304
Score = 196 bits (498), Expect = 2e-48, Method: Composition-based stats.
Identities = 108/307 (35%), Positives = 178/307 (57%), Gaps = 18/307 (5%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
F+ + DK + S ++ C++ F+N+ +W+ Y + A + L++
Sbjct: 4 FRTPQLADKVWMDQLLSRSNYRGCEYNFTNLFAWKDAYHHKVARLDDFLVVHLC----GG 59
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGH--PLLILGVTSEARKKLEELFPNEFAY 120
++++P G+GD R +EAL D+ E H PL ++ +T E ++LEELFP F +
Sbjct: 60 LGCSFLYPAGSGDRRAVIEALRADA-----EAHDQPLRLVCLTREQTQELEELFPGRFRF 114
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKF-KKQYEYTYIPITPEIVPQCMELERR 179
++RD +DY+Y + L L GKK KRNHIN+F + Y PITP+ +P+C+E+++
Sbjct: 115 ESDRDGWDYLYEIDRLADLGGKKLHGKRNHINRFLDNNPTWVYEPITPDSLPECLEMDKE 174
Query: 180 WVR---ANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQT 236
W R E +L +E R++ A+ H+ +LGL GG I V E+VAFT G ++ T
Sbjct: 175 WYRRSMVREGAVEERDLGDEGRALRLAIEHYHDLGLEGGLIRVYSEVVAFTMGDMLSSDT 234
Query: 237 FGVHVEKADTNYEGIFSLINQEFALHIPEQYV---YINREEDLGIPGLRQSKLSYNPVIL 293
+ VH EKA +G +++IN+EFA + E + Y+NRE+D+G+ GLR++K SY P ++
Sbjct: 235 YDVHFEKAYGELQGAYAMINREFARWVREHHPNVRYLNREDDMGVEGLRKAKESYYPDLM 294
Query: 294 LEKNGAI 300
+EK A+
Sbjct: 295 VEKYSAV 301
>gi|156868165|gb|EDO61537.1| hypothetical protein CLOLEP_01934 [Clostridium leptum DSM 753]
Length = 335
Score = 196 bits (497), Expect = 2e-48, Method: Composition-based stats.
Identities = 107/296 (36%), Positives = 168/296 (56%), Gaps = 6/296 (2%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FKPI D + +C + F+N+ W +Y ++ A F+ ++ R + P
Sbjct: 30 FKPITAADGAWARPLLLKADCISCGYTFANLFVWSPVYHTDLARFQDFVIARCFDHPNTP 89
Query: 63 KR-MAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
Y+FP G+GD R A+ + +D+ G + ++ + LEE FP FA+
Sbjct: 90 DSPYLYLFPEGDGDKRAAIREIVQDA---AVHGKMPRLYSLSQRDKAWLEEQFPGIFAFT 146
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERRW 180
+ R FDYIY E+LQ L GKK+Q KRNH+++F +++ ++ + PITPE + + W
Sbjct: 147 SCRGTFDYIYSSEELQNLPGKKFQKKRNHVSRFLREHPDFVFEPITPENLEEVRAFNNAW 206
Query: 181 VRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVH 240
+ N ++T + EH ++ + HF EL L GG I +G+I+AF+YGSP++ + F VH
Sbjct: 207 SQLYGNSQDTG-IQTEHLAVEMGLKHFFELELSGGLIRADGKIIAFSYGSPVSGRVFDVH 265
Query: 241 VEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
VEKA + G +++IN+EFA Y +INRE+D+ GLR++KLSYNP L EK
Sbjct: 266 VEKALYDVNGAYNIINREFARRFCGDYQWINREDDVSEEGLRKAKLSYNPAFLQEK 321
>gi|134300072|ref|YP_001113568.1| hypothetical protein Dred_2230 [Desulfotomaculum reducens MI-1]
gi|134052772|gb|ABO50743.1| conserved hypothetical protein [Desulfotomaculum reducens MI-1]
Length = 301
Score = 185 bits (470), Expect = 3e-45, Method: Composition-based stats.
Identities = 108/302 (35%), Positives = 179/302 (59%), Gaps = 20/302 (6%)
Query: 3 FKPICIEDKE----LITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIE 58
FK + + DK+ LIT S+ QN F+N+ +W +Y A L+++ ++
Sbjct: 4 FKKVELADKQWMGPLITLGEMSSSHQN----FTNIFAWSEIYHYRVARVSDYLVVKGRLQ 59
Query: 59 EKDPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEF 118
+ Y +P G GD + +E +++D+ + GH ++LGV+ E L LFP F
Sbjct: 60 NGE---QYYFYPAGKGDPKSVIETMKQDAADC---GHKFIMLGVSPENIIVLNSLFPESF 113
Query: 119 AYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELER 178
Y RD FDY+YL + L TL G K +KRNHIN+FK+++ +++ I+ + + +C E+
Sbjct: 114 EYKEMRDSFDYVYLLDKLVTLSGNKLHSKRNHINRFKEKHVWSFEQISSDNLAECWEMNV 173
Query: 179 RWVRANENGENTD-ELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTF 237
W + ENG + D +L++E+ ++ ++F ELGL GG I + G ++A+T G +N T+
Sbjct: 174 IWCK--ENGCHEDKQLADENCAVCRCFNNFNELGLEGGLIRLNGRVIAYTMGEKLNSNTY 231
Query: 238 GVHVEKADTNYEGIFSLINQEFALHIPEQY---VYINREEDLGIPGLRQSKLSYNPVILL 294
+H+EKA + +G + +IN+EFA I E+Y +++NREED+G GLR++KLSY P +
Sbjct: 232 VIHIEKAFSKIQGAYQMINREFAAFIQEKYPQLMFVNREEDMGYEGLRKAKLSYYPYRME 291
Query: 295 EK 296
EK
Sbjct: 292 EK 293
>gi|153854175|ref|ZP_01995483.1| hypothetical protein DORLON_01474 [Dorea longicatena DSM 13814]
gi|149753224|gb|EDM63155.1| hypothetical protein DORLON_01474 [Dorea longicatena DSM 13814]
Length = 303
Score = 183 bits (465), Expect = 1e-44, Method: Composition-based stats.
Identities = 105/303 (34%), Positives = 173/303 (57%), Gaps = 11/303 (3%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FK +ED+E+I+ + ++C+ F+N+ W Y ++A+ + L+ + +D
Sbjct: 6 FKRAELEDQEIISHYFEHHTSRSCERTFANVYLWSRQYPVKWAIIKNALVFK----SEDE 61
Query: 63 KRMAYMFPVGNG-DLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
+++ +P G D++ A+E + + S +G P L+ VT E +LEE +P F
Sbjct: 62 NHVSFAYPAGAPEDVKNALEEMMEYS---KAKGRPFLMYNVTPEYFAQLEEWYPGRFQIE 118
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYE--YTYIPITPEIVPQCMELERR 179
+RD DY+Y E L TL GKK KRNHINKFK +E ++Y +T + + +C ++ +
Sbjct: 119 YDRDSADYVYESEKLATLSGKKLHGKRNHINKFKSLFEDRWSYESMTKDNLEECFQMALK 178
Query: 180 WVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGV 239
W R + E+ DE E ++ F+EL L GG + ++G++VAFT G PI T+ V
Sbjct: 179 W-RTENDCEDDDEKRGEMCVALNSLRLFEELHLTGGVLRIDGKVVAFTIGEPICEDTYVV 237
Query: 240 HVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNGA 299
H+EKA + +G +++INQ+F H Y Y+NRE+D G GLR++KLSY P ++EK
Sbjct: 238 HIEKAYADVQGAYTMINQQFVEHECMNYKYVNREDDTGAEGLRKAKLSYRPAFMVEKGDV 297
Query: 300 IKR 302
++
Sbjct: 298 TEK 300
>gi|34556601|ref|NP_906416.1| hypothetical protein WS0153 [Wolinella succinogenes DSM 1740]
gi|34482315|emb|CAE09316.1| conserved hypothetical protein [Wolinella succinogenes]
Length = 299
Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats.
Identities = 109/297 (36%), Positives = 165/297 (55%), Gaps = 8/297 (2%)
Query: 1 MVFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEK 60
M +KPI IED+E + F DF+F+N+ W F YA+ E L I+ +
Sbjct: 1 MQWKPIDIEDRETLEGFFRSEELSVSDFSFTNLYLWHFSRSISYAIIEDLLCIKTQYHGE 60
Query: 61 DPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
P FP+G G+ R +E L + +E P + + E + +LE L P +F +
Sbjct: 61 HP---FLFFPLGKGEKRGVIERL-MECFE--SRAIPFTMRSLGEEMKDELERLMPEKFEF 114
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERR 179
+ RD DY+YL E+L LKG+K+ K+NH+N+F + Y +++Y ++ E V + +E +
Sbjct: 115 IYNRDRSDYVYLTEELIELKGRKFHKKKNHLNRFFELYPDFSYESLSMENVDELLEAWKL 174
Query: 180 WVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGV 239
W E + L NE+ + + HF ++ GG + V+G+IVAFT G +N T V
Sbjct: 175 WF-GRIADEANEGLKNEYIGIVETLKHFGKMSYKGGILRVKGKIVAFTLGEQLNSDTVVV 233
Query: 240 HVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
H+EKADT Y G + INQ+F + Y+NREEDLGI GLR++KLSY P L++K
Sbjct: 234 HIEKADTEYHGAYQAINQQFLANEWSHLTYVNREEDLGIEGLRRAKLSYQPSHLIDK 290
>gi|153814378|ref|ZP_01967046.1| hypothetical protein RUMTOR_00588 [Ruminococcus torques ATCC 27756]
gi|145848774|gb|EDK25692.1| hypothetical protein RUMTOR_00588 [Ruminococcus torques ATCC 27756]
Length = 299
Score = 182 bits (461), Expect = 3e-44, Method: Composition-based stats.
Identities = 110/297 (37%), Positives = 170/297 (57%), Gaps = 13/297 (4%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FK +EDKE+I+S+ + ++C+ F N+ W Y+ +YA+ E L+ R
Sbjct: 6 FKRPELEDKEIISSYFEKAQSRSCERTFVNVYLWSRHYKVQYAIIEDTLVFR-----DSG 60
Query: 63 KRMAYMFPVGNGD-LRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
K +++ +P G + +++A+E L + E + P ++ VT +L++ +P +F
Sbjct: 61 KNLSFTYPAGEPENVKKALEFLMEYCKE---KDVPFILYNVTPHMFAQLDKWYPKKFFIE 117
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERRW 180
RD DY+Y E L +L GKK KRNHINKFK Y +++Y P+ + V C ++ +W
Sbjct: 118 YNRDLADYVYETEKLASLAGKKLHGKRNHINKFKALYPDWSYEPLNDDNVEDCFQMALKW 177
Query: 181 VRANENGENTDELSNEHRSMTF-AMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGV 239
N+NG D N +T ++ +KELGL GG + +G+IVAFT G P++ TF V
Sbjct: 178 --RNKNGCEDDPEKNAEMCVTLNSLRLYKELGLKGGVLKADGKIVAFTVGEPVSDDTFVV 235
Query: 240 HVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
H+EKA +G + +INQ+F H +Y Y+NREED G GLR++KLSY P L EK
Sbjct: 236 HIEKAFAEVDGAYPMINQQFVQHECMEYEYVNREEDTGAEGLRKAKLSYRPAFLEEK 292
>gi|154502566|ref|ZP_02039626.1| hypothetical protein RUMGNA_00379 [Ruminococcus gnavus ATCC 29149]
gi|153796758|gb|EDN79178.1| hypothetical protein RUMGNA_00379 [Ruminococcus gnavus ATCC 29149]
Length = 300
Score = 181 bits (459), Expect = 5e-44, Method: Composition-based stats.
Identities = 114/306 (37%), Positives = 174/306 (56%), Gaps = 14/306 (4%)
Query: 1 MVFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEK 60
++F+ +ED+ELI S+ + ++C+ F ++ W Y+ +AV E L+ R
Sbjct: 4 IIFRRPALEDQELIRSYFDKAPSRSCERTFVDVFLWARHYDVTFAVIEDTLVFR--DGGA 61
Query: 61 DPKRMAYMFPVGNGD-LRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFA 119
DP + +P G +++A+E L K S E +G+P + VT+E ++E FP EF
Sbjct: 62 DP---GFAYPAGEEVCVKRALEFLMKYSEE---QGYPFKLYNVTAENFAQIEAWFPGEFE 115
Query: 120 YVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELER 178
+RD DY+Y E L TL GKK KRNHINKF+K Y ++Y P+ E + +C ++
Sbjct: 116 LEYDRDQADYVYESEKLATLAGKKLHGKRNHINKFQKLYPNWSYEPLNDENMEECFQMAL 175
Query: 179 RWVRANENGENTDELSNEHRSMTF-AMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTF 237
+W N+NG D N + A+ +KEL GG + V+G++VAFT G + TF
Sbjct: 176 KW--RNQNGCEEDPGKNAEMCVALNALRLYKELDQRGGVLRVDGQVVAFTIGEELCEDTF 233
Query: 238 GVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKN 297
VH+EKA + +G + +INQ+F H Y Y+NRE+D G GLR++KLSY P L+EK
Sbjct: 234 VVHIEKAFADIDGAYPMINQQFVQHECTGYQYVNREDDAGSEGLRKAKLSYRPAFLVEK- 292
Query: 298 GAIKRR 303
G + R+
Sbjct: 293 GILTRK 298
>gi|28210530|ref|NP_781474.1| hypothetical protein CTC00813 [Clostridium tetani E88]
gi|28202967|gb|AAO35411.1| conserved protein [Clostridium tetani E88]
Length = 303
Score = 161 bits (407), Expect = 5e-38, Method: Composition-based stats.
Identities = 99/290 (34%), Positives = 151/290 (52%), Gaps = 11/290 (3%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
F P+ +EDKE+ + P F+ +++F+N WR + Y + L+I+ +
Sbjct: 13 FSPLTLEDKEIFDKYIKPYKFKTSEYSFTNQYLWRKGSDVTYTILNDVLIIKKVDYDGTT 72
Query: 63 KRMAYMFPVG--NGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
+ + P+G +L++ V+ L K + L E K +EL+ N F
Sbjct: 73 Q---FTQPIGYEKENLKEIVDELIKYRQK---HNMDYLFKDAEEEFVKDFKELYDNNFTI 126
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRW 180
+RD DYIYL+EDL+ L GKK+ +K+NH N F K YEY IT I +C+ W
Sbjct: 127 EEDRDNADYIYLKEDLKNLSGKKFHSKKNHYNAFIKNYEYRTAKITESIANECLNTAMEW 186
Query: 181 VRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVH 240
R N+ L +E + + +F +L ++G A+ V ++ AFT G +N +H
Sbjct: 187 CRQNDC---KGYLLHEISGIEDVLKNFDKLDVMGMAVYVNDKLSAFTLGEKVNSDMAIIH 243
Query: 241 VEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNP 290
+EKAD N G++ IN+ F + YINRE+DLGI GLR+SKLSYNP
Sbjct: 244 IEKADVNVRGLYPFINRTFIEEYLDDITYINREQDLGIEGLRKSKLSYNP 293
>gi|156865105|gb|EDO58536.1| hypothetical protein CLOL250_00560 [Clostridium sp. L2-50]
Length = 317
Score = 160 bits (405), Expect = 1e-37, Method: Composition-based stats.
Identities = 99/299 (33%), Positives = 152/299 (50%), Gaps = 17/299 (5%)
Query: 12 ELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDPKRMAYMFPV 71
+++ + S+ ++C F N W Y Y + L+ +I E+D K ++ FP+
Sbjct: 12 DILNRYLKGSSIKDCGFCTGNAILWAEEYHVSYVILSEMLV---FIAEEDGKPSSFTFPI 68
Query: 72 GNGD-----------LRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
G G L A ++ +G + T E + + + EF Y
Sbjct: 69 GIGTDFIKDIQNPDYLLNARSVFDEVCTYFHAQGVTPCVHCATPEIYEIITGWYGKEFPY 128
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERR 179
+ FDYIY E L L+GKK KRNHIN F + Y +Y Y IT E + C+ + R
Sbjct: 129 TLDPGDFDYIYTVEKLTYLRGKKLHGKRNHINNFMRNYPDYEYYTITDEHIDACLAIARY 188
Query: 180 WVRANE--NGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTF 237
WV ++ E +E + E+ + A+ + +++ + GG I V AFT G P+ TF
Sbjct: 189 WVEKHDVLQPELAEEHAYEYNIIRKALSNRRKMQMTGGIIYVNHIPSAFTLGEPLTNDTF 248
Query: 238 GVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
VH EKAD + +GI+ +IN+ F + + Y Y+NREEDLG+PGLR+SK+SY P IL +K
Sbjct: 249 DVHFEKADDSIDGIYPMINKTFVANELQNYTYVNREEDLGLPGLRKSKMSYVPDILYKK 307
>gi|32267008|ref|NP_861040.1| hypothetical protein HH1509 [Helicobacter hepaticus ATCC 51449]
gi|32263060|gb|AAP78106.1| conserved hypothetical protein [Helicobacter hepaticus ATCC 51449]
Length = 295
Score = 160 bits (404), Expect = 1e-37, Method: Composition-based stats.
Identities = 104/301 (34%), Positives = 161/301 (53%), Gaps = 11/301 (3%)
Query: 1 MVFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEK 60
M FK I +ED+ L+ FT D FSNM WR E Y + L+++ +
Sbjct: 1 MEFKKIELEDRTLLEPFTNQKGRWLSDMNFSNMFMWRHSREISYTFLQEHLIVQTRYPHQ 60
Query: 61 DPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
+P +P+G GD + +E+L + ++ PL + + S ++LE FP+ F
Sbjct: 61 NP---FVFYPLGAGDKKPIIESLIQFYKDL---SLPLELHSLQSNEVEELESYFPHTFEI 114
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYT-YIPITPEIVPQCMELERR 179
RD FDY+Y E+L TL G+K+ K+NH+N+F Y T Y + + + +++
Sbjct: 115 TQRRDRFDYVYNVEELITLSGRKFHKKKNHLNRFWLTYPQTQYESFSNANLTEVLKVNNM 174
Query: 180 WVRANENGENTDE-LSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFG 238
W A G + D+ L E+ + A+ HF +L L GG + +GEI+AF++G I+
Sbjct: 175 WFEA---GNSEDKGLYFENLGINDALSHFDKLSLRGGLLRCDGEIIAFSFGEEIDDDLAL 231
Query: 239 VHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNG 298
+H+EKA+ + G + INQ + + Y+NREEDLGI GLR++KLSY P LLEK
Sbjct: 232 IHIEKANIAFSGAYQAINQALLKNEFSNHRYVNREEDLGIEGLRKAKLSYQPSFLLEKYD 291
Query: 299 A 299
A
Sbjct: 292 A 292
>gi|118725200|ref|ZP_01573843.1| conserved hypothetical protein [Clostridium cellulolyticum H10]
gi|118665432|gb|EAV72051.1| conserved hypothetical protein [Clostridium cellulolyticum H10]
Length = 303
Score = 151 bits (382), Expect = 4e-35, Method: Composition-based stats.
Identities = 101/298 (33%), Positives = 157/298 (52%), Gaps = 8/298 (2%)
Query: 1 MVFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEK 60
++ K I I DK + S + +F+N+ W+ Y+ + + L I + +
Sbjct: 6 LIDKEISINDKGCFDKYFETSHVLASEMSFTNLFMWKDHYKIRFGIINDFLCI---MSVR 62
Query: 61 DPKRMAYMFPVGNGDLRQAVE-ALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFA 119
D FPVG+ + R+ ++ +E EG L+ VT + K +++ E+
Sbjct: 63 DISNPFCFFPVGDYNRREELKKTIETLKEYFTSEGWKLVFSRVTKQQLKIFDDI-KIEYN 121
Query: 120 YVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERR 179
+R+ DY+Y + L TL GKK KRNHINKFKK Y Y Y I+P + C + +
Sbjct: 122 ATEDRNNADYVYTVKSLSTLAGKKLDGKRNHINKFKKLYTYEYEEISPSNIKDCKNIVEK 181
Query: 180 WVRANENGENTDE-LSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFG 238
W R + +++DE L +E + + ++ LGL GG I V GE AFT G +N T
Sbjct: 182 WYR--QRVQSSDETLLHEKIANLGLLENYGILGLKGGLIRVYGEAQAFTVGEQLNQNTVV 239
Query: 239 VHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
+H+EKA+ GI+++INQ F + Y+NRE+DLG+ GLR++KLSYNP L+ K
Sbjct: 240 IHLEKANAEIRGIYTIINQMFINNQWLHMEYVNREQDLGVEGLRKAKLSYNPDHLIYK 297
>gi|146295805|ref|YP_001179576.1| hypothetical protein Csac_0766 [Caldicellulosiruptor
saccharolyticus DSM 8903]
gi|145409381|gb|ABP66385.1| conserved hypothetical protein [Caldicellulosiruptor
saccharolyticus DSM 8903]
Length = 298
Score = 146 bits (369), Expect = 1e-33, Method: Composition-based stats.
Identities = 101/300 (33%), Positives = 156/300 (52%), Gaps = 12/300 (4%)
Query: 1 MVFKPICIEDKELITSFTYPSAFQN--CDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIE 58
M F I I DK + Y AFQ D F+N+ W Y+ + +G LLI
Sbjct: 1 MKFYKIDISDKRIFDE--YFKAFQPEIADLTFTNLFMWDPFYDINFTEEDGFLLIMAKPY 58
Query: 59 EKDPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEF 118
+ P PVG D + +EK +G+ ++ + + L + +F
Sbjct: 59 NQPPFLHG---PVGV-DTNKLPIVIEKAKKYFETQGYKFMLKRASQKTIDMLTQC-GMKF 113
Query: 119 AYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY--EYTYIPITPEIVPQCMEL 176
+ ERD DY+Y +DL LKGKK+ AK+NHINKF + Y Y I EIV C +
Sbjct: 114 ESLLERDLSDYVYKVQDLVQLKGKKYHAKKNHINKFLRLYGQRYEVKKIDDEIVRLCWDF 173
Query: 177 ERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQT 236
E W NG+N L+ E ++ A+ +F +L G + ++G+I AFT+G P+N T
Sbjct: 174 ECEWYE-KRNGQNDVGLTFEKLAIERAIKNFDKLSYEGMIVFIDGKIKAFTFGEPLNKNT 232
Query: 237 FGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
+H+EKAD + EG+++ +N +F + + + ++NREEDLG G+R++K+SY+P EK
Sbjct: 233 VVIHIEKADPDIEGLYTFVNNKFLIEFWQNFEFVNREEDLGKEGIRKAKMSYHPFKFAEK 292
>gi|78357692|ref|YP_389141.1| hypothetical protein Dde_2650 [Desulfovibrio desulfuricans G20]
gi|78220097|gb|ABB39446.1| conserved hypothetical protein [Desulfovibrio desulfuricans G20]
Length = 293
Score = 146 bits (368), Expect = 2e-33, Method: Composition-based stats.
Identities = 91/296 (30%), Positives = 155/296 (52%), Gaps = 14/296 (4%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FKP+ ++ E + + + D++F+N+ W +EY + G F+I + P
Sbjct: 6 FKPLTMDVMEEYLTLFSGTPSRASDYSFTNLWGW----ANEYGLEVGRTPNLFWIRQTRP 61
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
+ + Y PVG+ + + + MM G + V + + +
Sbjct: 62 E-VRYWAPVGDWN-----AVADWANCPMMTAGRTFI--RVPQQLADLWQAALGDGVQVQE 113
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWVR 182
+RD +DY+Y E L TL G K+ K+NH+N+FKKQY++ Y P+T + + ++L++ W
Sbjct: 114 DRDQWDYLYDAEKLATLSGNKYHKKKNHVNQFKKQYDWQYHPMTADCIEAVLQLQQEWCL 173
Query: 183 ANENGENTDELSNEHRSMTFAMHHFKEL-GLLGGAIMVEGEIVAFTYGSPINYQTFGVHV 241
+ E E ++ L E+ ++ + ++ + GL GGA+ V +VA+T G + T VH
Sbjct: 174 SRE-CEESETLLAENEAVIRVLEYWDTIPGLKGGALYVNDVMVAYTVGEALTDDTLVVHF 232
Query: 242 EKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKN 297
EKA + Y G++ INQ F H Y Y+NRE+DLG GLR++KL+Y+P L K+
Sbjct: 233 EKARSEYRGVYQAINQIFVEHEGTGYTYVNREQDLGDEGLRKAKLTYHPADWLRKS 288
>gi|15611347|ref|NP_222998.1| hypothetical protein jhp0277 [Helicobacter pylori J99]
gi|4154812|gb|AAD05868.1| putative [Helicobacter pylori J99]
Length = 290
Score = 144 bits (364), Expect = 5e-33, Method: Composition-based stats.
Identities = 100/304 (32%), Positives = 162/304 (53%), Gaps = 18/304 (5%)
Query: 2 VFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKD 61
+F+ I + K+L + F D +F+N W+ + AV CL+I+ E +
Sbjct: 1 MFEEITLAHKDLFSRFLQTQKIVLSDVSFTNCFLWQHARLIQVAVIRDCLVIQTTYENQK 60
Query: 62 PKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
P Y +P+G E LE + L +T E + L++ F F +
Sbjct: 61 P---FYFYPIGKRPHECVKELLELEK--------NLRFHSLTLEQKDDLKDNFVGVFDFT 109
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERRW 180
RD DY+Y E+L LKGKK+ K+NH+N+F + + Y I+P+ + +E + W
Sbjct: 110 YNRDRSDYVYSIEELIALKGKKYHKKKNHLNQFLTNHANFVYEKISPQNRKEVLEASKAW 169
Query: 181 VRANENGENTDE--LSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFG 238
++ TD+ L NE++ + + +++ L L GG I V GEIV+F++G +N ++
Sbjct: 170 FLESQ----TDDIGLINENKGIQSVLENYESLDLKGGLIRVNGEIVSFSFGEVLNEESAL 225
Query: 239 VHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNG 298
+H+EKA T+ G + +INQ+ L+ Y NREEDLG+ GLR+SK+SYNPV L++K
Sbjct: 226 IHIEKARTDIAGAYQIINQQLLLNEFSHLTYANREEDLGLEGLRRSKMSYNPVFLIDKYE 285
Query: 299 AIKR 302
A+ R
Sbjct: 286 AVAR 289
>gi|153808945|ref|ZP_01961613.1| hypothetical protein BACCAC_03246 [Bacteroides caccae ATCC 43185]
gi|149128278|gb|EDM19497.1| hypothetical protein BACCAC_03246 [Bacteroides caccae ATCC 43185]
Length = 295
Score = 144 bits (363), Expect = 7e-33, Method: Composition-based stats.
Identities = 95/302 (31%), Positives = 161/302 (53%), Gaps = 15/302 (4%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
F+PI IE I + ++ CDF + W ++ EY + + L I+ Y E ++
Sbjct: 4 FQPITIESFHDILPYLKQQTYRTCDFTIGGIYMWIDYFQYEYCIADDLLFIKGYAENEN- 62
Query: 63 KRMAYMFPVGNGDLRQAVEALEK--DSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
K++++ P+G L + L++ D++ + PL + V E ++++E LF +
Sbjct: 63 KKLSFTIPIGASSLSHGISLLKEYCDNHHL-----PLFLSAVPEEGKRQIEMLFSCQSTP 117
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERR 179
+ D+ DY+Y ++DL L G+++ KRN +NKF K+Y +Y PIT + + R
Sbjct: 118 LP--DWSDYLYDQKDLSILPGRRFNKKRNRVNKFYKEYTNISYEPITINNIEAVKQFFRE 175
Query: 180 WVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGV 239
+ N+ + + + E + ++++ +L GG + EG I+ F+ G IN T V
Sbjct: 176 FYSINQ--KESLYFAYEEEMVNQVLNNYFKLNFTGGILKAEGCIIGFSIGEIIN-DTLFV 232
Query: 240 HVEKADTNYEGIFSLINQEFALH-IPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNG 298
H+EKA Y G + IN FA + I + +YINREED+G GLR++KLSYNP+ +L K
Sbjct: 233 HIEKALKQYSGAYETINFLFAQNAITPEVLYINREEDVGDVGLRKAKLSYNPIHILTKYN 292
Query: 299 AI 300
I
Sbjct: 293 II 294
>gi|15644920|ref|NP_207090.1| hypothetical protein HP0292 [Helicobacter pylori 26695]
gi|2313390|gb|AAD07362.1| predicted coding region HP0292 [Helicobacter pylori 26695]
Length = 290
Score = 142 bits (358), Expect = 3e-32, Method: Composition-based stats.
Identities = 97/304 (31%), Positives = 165/304 (54%), Gaps = 18/304 (5%)
Query: 2 VFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKD 61
+F+ I + K+L + F D +F+N W+ + AV CL+I+ E +
Sbjct: 1 MFEKITLAHKDLFSRFLSAQKIVLSDVSFTNCFLWQHARLIQVAVIRDCLVIQTTYENQK 60
Query: 62 PKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
P Y +P+G + A E ++ E++ L +T E + L++ F F +
Sbjct: 61 P---FYFYPIG----KNAFECVK----ELLKLEKNLRFHSLTLEQKDDLKDNFVGVFDFT 109
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERRW 180
RD DY+Y E+L LKGKK+ K+NH+N+F + + Y I+P+ + +E + W
Sbjct: 110 YNRDRSDYVYSIEELIALKGKKYHKKKNHLNQFLTNHANFVYEKISPQNKKEVLEASQAW 169
Query: 181 VRANENGENTDE--LSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFG 238
++ TD+ L NE++ + + +++ L + GG I V GEI +F++G +N ++
Sbjct: 170 FLESQ----TDDIGLINENKGIQSVLENYESLDVKGGLIRVNGEIASFSFGEVLNEESAL 225
Query: 239 VHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNG 298
+H+EKA T+ G + +INQ+ L+ Y NREEDLG+ GLR+SK+SYNPV L++K
Sbjct: 226 IHIEKARTDIAGAYQIINQQLLLNEFSHLTYANREEDLGLEGLRRSKMSYNPVFLIDKYE 285
Query: 299 AIKR 302
A+ +
Sbjct: 286 AVAK 289
>gi|108562719|ref|YP_627035.1| hypothetical protein HPAG1_0294 [Helicobacter pylori HPAG1]
gi|107836492|gb|ABF84361.1| hypothetical protein HPAG1_0294 [Helicobacter pylori HPAG1]
Length = 290
Score = 139 bits (351), Expect = 2e-31, Method: Composition-based stats.
Identities = 96/302 (31%), Positives = 162/302 (53%), Gaps = 14/302 (4%)
Query: 2 VFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKD 61
+F+ I + K+L + F D +F+N W+ + AV CL+I+ E +
Sbjct: 1 MFEKITLAHKDLFSRFLSTQKIVLSDVSFTNCFLWQHARLIQVAVIRDCLVIQTTYENQK 60
Query: 62 PKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
P Y +P+G ++ E ++ E++ L +T + L++ F F +
Sbjct: 61 P---FYFYPIG----KRPHECVK----ELLKLEKNLRFHSLTLGQKDDLKDNFVGVFDFT 109
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQCMELERRW 180
RD DY+Y E+L LKGKK+ K+NH+N+F Y + Y I+P+ + +E + W
Sbjct: 110 YNRDRSDYVYSIEELIALKGKKYHKKKNHLNQFLTNYANFVYEKISPQNKKEVLEAFQAW 169
Query: 181 VRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVH 240
E+ N L NE++ + + +++ L + GG + V GEIV+F++G +N +T +H
Sbjct: 170 FL--ESQTNDIGLINENKGIQSVLENYESLDVKGGLVRVNGEIVSFSFGEVLNEETALIH 227
Query: 241 VEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNGAI 300
+EKA + G + +INQ+ L+ Y NREEDLG+ GLR+SK+SYNPV L++K A+
Sbjct: 228 IEKARADIAGAYQIINQQLLLNEFSHLTYANREEDLGLEGLRRSKMSYNPVFLIDKYEAV 287
Query: 301 KR 302
+
Sbjct: 288 AK 289
>gi|21226283|ref|NP_632205.1| hypothetical protein MM_0181 [Methanosarcina mazei Go1]
gi|20904527|gb|AAM29877.1| conserved protein [Methanosarcina mazei Go1]
Length = 324
Score = 138 bits (348), Expect = 4e-31, Method: Composition-based stats.
Identities = 99/305 (32%), Positives = 155/305 (50%), Gaps = 34/305 (11%)
Query: 3 FKPICIEDKELIT--SFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLI------- 53
FKP+ + D+E S YP + F+NM W + YA G ++I
Sbjct: 7 FKPVTLADREFFERHSELYPQTHSSN--TFTNMVCWNHFTQYRYAYVNGNIIISGTTGGI 64
Query: 54 -RFY--IEEKDPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKL 110
RF+ I +DP+ M R+ ++ K S + PL+ + + +
Sbjct: 65 TRFHPPIGPRDPELM-----------RELIQLAMKVS-----DNTPLIF--IDPDTALWI 106
Query: 111 EELFPNEFAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIV 170
EL P + V +RD F+Y+Y DL L GKK+Q R+H+N+F++ T PITPE +
Sbjct: 107 RELEP-DLELVPDRDNFEYVYRASDLAELPGKKYQKIRSHLNRFRRNCMSTVEPITPENL 165
Query: 171 PQCMELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGS 230
+ MEL ++W R + + LS+E + +A+ HF EL L G + V+ E+ A +
Sbjct: 166 EEVMELLKKW-RDWKGCDKNPVLSHEVEAAFYAVEHFMELRLRGFLLRVDSEVGAISIFE 224
Query: 231 PINYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNP 290
+N T +H EK + EGI+ IN E A + ++ YINRE DLG+ GLR++KL Y+P
Sbjct: 225 RLNADTALIHFEKGLPDCEGIYKAINAETAAALADEVEYINRESDLGVGGLREAKLRYHP 284
Query: 291 VILLE 295
++E
Sbjct: 285 HHMVE 289
>gi|109947143|ref|YP_664371.1| hypothetical protein Hac_0552 [Helicobacter acinonychis str.
Sheeba]
gi|109714364|emb|CAJ99372.1| conserved hypothetical protein [Helicobacter acinonychis str.
Sheeba]
Length = 290
Score = 137 bits (344), Expect = 9e-31, Method: Composition-based stats.
Identities = 95/304 (31%), Positives = 163/304 (53%), Gaps = 18/304 (5%)
Query: 2 VFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKD 61
+F+ + +E K+L + F D +F+N W+ AV + CL+I+ E +
Sbjct: 1 MFEKLKLEHKDLFSRFLSAQKIVLSDVSFTNCFLWQHARLIRVAVIKDCLVIQTTYENQQ 60
Query: 62 PKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
P Y +P+G + A K E++ L +TSE + L + F F +
Sbjct: 61 P---FYFYPIG-----KRAHACVK---ELLKLEKNLKFHSLTSEQKDDLRDNFVGVFDFT 109
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYE-YTYIPITPEIVPQCMELERRW 180
RD DYIY ++L LKGKK+ K+NH+N+F + + Y I+ + + +E + W
Sbjct: 110 YNRDRSDYIYSIKELIALKGKKYHKKKNHLNQFLTNHAGFVYEKISSQNKREVLEASQEW 169
Query: 181 VRANENGENTDELS--NEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFG 238
++ TD+L NE++ + + +++ L L GG + V G+IV+F++G +N ++
Sbjct: 170 FLESQ----TDDLGLINENKGIQSVLENYESLDLKGGVVRVGGKIVSFSFGEILNEESAL 225
Query: 239 VHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNG 298
+H+EK + G + +INQ+ L+ Y+NREEDLG+ GLR++K+SYNPV L++K
Sbjct: 226 IHIEKVRADIAGAYQIINQQLLLNEFSHLTYVNREEDLGLEGLRRAKMSYNPVFLIDKYE 285
Query: 299 AIKR 302
A+ R
Sbjct: 286 AVAR 289
>gi|126178970|ref|YP_001046935.1| hypothetical protein Memar_1020 [Methanoculleus marisnigri JR1]
gi|125861764|gb|ABN56953.1| conserved hypothetical protein [Methanoculleus marisnigri JR1]
Length = 304
Score = 135 bits (340), Expect = 3e-30, Method: Composition-based stats.
Identities = 90/294 (30%), Positives = 148/294 (50%), Gaps = 11/294 (3%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FKP+ ++D++L + D F+NM W + + EG +++ I+
Sbjct: 7 FKPVSLDDRDLFREHYRQFPQVHSDNTFANMVCWNHYADYRFIEVEGSIVLSSTIDGVTA 66
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
RM P+G + + ++ + E G PL +L +EA + EL+P
Sbjct: 67 FRM----PIGPRNPELVGDVVDLAARE--GGDTPLWVLDPANEAL--IRELYP-ALPLHA 117
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWVR 182
R++FDYIY E L L GK + R +N+F ++Y+YT IT E + + E W
Sbjct: 118 NRNFFDYIYRTEALADLAGKGYATIRRQVNRFGREYQYTVEKITEENINEVWEFLVVWCE 177
Query: 183 ANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHVE 242
+ ++ L+ E ++ FA++HF +GL G + + G I A + P+N VH E
Sbjct: 178 W-RDCDSEPVLAAEKEAILFAVNHFFPIGLEGWIVRIGGTIGAISIVGPVNESMAVVHFE 236
Query: 243 KA-DTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLE 295
KA Y GI+ +I E A + ++Y Y+NRE D+G+PGLR+SK Y+P ++E
Sbjct: 237 KALPETYPGIYKVITTETAAGLRDRYRYVNRECDMGVPGLRESKTRYHPAYMVE 290
>gi|153938729|ref|YP_001390465.1| hypothetical protein CLI_1199 [Clostridium botulinum F str.
Langeland]
gi|152934625|gb|ABS40123.1| conserved hypothetical protein [Clostridium botulinum F str.
Langeland]
Length = 319
Score = 134 bits (338), Expect = 5e-30, Method: Composition-based stats.
Identities = 96/302 (31%), Positives = 155/302 (51%), Gaps = 26/302 (8%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKD- 61
FK + I DK+L S+ P F NC+++F+ + WR + +YA+++G L+I+ +KD
Sbjct: 4 FKRLTINDKKLFESYIKPYNFLNCEYSFTTLYIWRKALDIKYAIYKGALIIK----KKDF 59
Query: 62 PKRMAYMFPVG--NGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEAR----KKLEELFP 115
+M P+G +LR +E L MG I + + R K+L+ L+
Sbjct: 60 NDEYHFMQPLGYKKENLRDIIETL-------MGYKKEKCIKYLFKDLRYDFVKELKNLYK 112
Query: 116 NEFAYVT-----ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPE-I 169
+E Y +RD FDY+Y + L TL GKK K+NH N F K Y Y T E +
Sbjct: 113 DEEIYSNIMVEEDRDNFDYLYESKKLITLSGKKLHGKKNHYNYFIKNYNYDIRDFTEEGV 172
Query: 170 VPQCMELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYG 229
+ + + W N+ + L E +S+ ++ LGL G + ++GEI F+ G
Sbjct: 173 IEDSLRIAELWYEKNDTKDK--HLLYELQSIRDMCYNMDVLGLEGMGLYIDGEIAGFSIG 230
Query: 230 SPINYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYN 289
+ +H+EKA+ + G ++ +N+ F + INRE+DLGI GLR++KLSY+
Sbjct: 231 EKFSDNMAIIHIEKANKDLRGAYTFVNKAFVENYFSDIPIINREQDLGIEGLRKAKLSYS 290
Query: 290 PV 291
P+
Sbjct: 291 PL 292
>gi|126700094|ref|YP_001088991.1| hypothetical protein CD2477 [Clostridium difficile 630]
gi|145952918|ref|ZP_01801926.1| hypothetical protein CdifQ_04002886 [Clostridium difficile
QCD-32g58]
gi|115251531|emb|CAJ69364.1| conserved hypothetical protein [Clostridium difficile 630]
Length = 301
Score = 134 bits (338), Expect = 6e-30, Method: Composition-based stats.
Identities = 93/303 (30%), Positives = 160/303 (52%), Gaps = 15/303 (4%)
Query: 1 MVFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEK 60
M FK I + K+ + + ++ C++ FS + W+ +Y++ Y + E ++ + E
Sbjct: 1 MFFKDIELNSKKELDPYFDLVDYEACEYCFSTLYMWQHVYKTGYYIGEDFAVL---VGEY 57
Query: 61 DPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
+ + + P+ D V + + + + G+T+E + L+E +P F Y
Sbjct: 58 EGDSFS-ILPLAKKDKLPEVVDFVLEYFSK--NNKKIYLRGITTEVVEFLKEKYPGRFEY 114
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY--EYTYIPITPEIVPQCMELER 178
+ ERD FDYIY E L+TL GKK Q KRNHIN F K+Y Y + E +C+ L +
Sbjct: 115 IEERDLFDYIYDAESLRTLAGKKNQKKRNHINYFLKEYAGRYEAKLLDKENFDECLVLMK 174
Query: 179 RW-VRANENGENTDELSNEHRSMTFAMHHFK----ELGLLGGAIMVEGEIVAFTYGSPIN 233
W EN E + + +E + +H+ ++ + G + V+G++ AF+ G +N
Sbjct: 175 EWESNKEENNEFDESMDDELIGIKKIFNHYDILKDKVKVFG--VYVDGKLEAFSIGELLN 232
Query: 234 YQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVIL 293
+H+EKA+ + G++ INQ+F + + ++NREEDLGI GLR++KLSY+P
Sbjct: 233 PNMALIHIEKANPDIRGLYPFINQQFLVSEFKDVEFVNREEDLGIEGLRKAKLSYHPCRF 292
Query: 294 LEK 296
+EK
Sbjct: 293 VEK 295
>gi|148379099|ref|YP_001253640.1| hypothetical protein CBO1111 [Clostridium botulinum A str. ATCC
3502]
gi|153934027|ref|YP_001383482.1| hypothetical protein CLB_1150 [Clostridium botulinum A str. ATCC
19397]
gi|153936489|ref|YP_001387029.1| hypothetical protein CLC_1162 [Clostridium botulinum A str. Hall]
gi|148288583|emb|CAL82664.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
3502]
gi|152930071|gb|ABS35571.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
19397]
gi|152932403|gb|ABS37902.1| conserved hypothetical protein [Clostridium botulinum A str. Hall]
Length = 319
Score = 134 bits (336), Expect = 9e-30, Method: Composition-based stats.
Identities = 96/302 (31%), Positives = 155/302 (51%), Gaps = 26/302 (8%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKD- 61
FK + I DK+L S+ P F NC+++F+ + WR + +YA+++G L+I+ +KD
Sbjct: 4 FKRLTINDKKLFESYIKPYNFLNCEYSFTTLYIWRKALDIKYAIYKGALIIK----KKDF 59
Query: 62 PKRMAYMFPVG--NGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEAR----KKLEELFP 115
+M P+G +L+ +E L MG I + + R K+L+ L+
Sbjct: 60 NDEYHFMQPLGYKKENLKDIIETL-------MGYKKEKCIKYLFKDLRYDFVKELKNLYK 112
Query: 116 NEFAYVT-----ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPE-I 169
+E Y +RD FDY+Y + L TL GKK K+NH N F K Y Y T E +
Sbjct: 113 DEEIYSNIMVEEDRDNFDYLYESKKLITLSGKKLHGKKNHYNYFIKNYNYDIRDFTEEGV 172
Query: 170 VPQCMELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYG 229
+ + + W N+ + L E +S+ ++ LGL G + ++GEI F+ G
Sbjct: 173 IEDSLRIAELWYEKNDTKDK--HLLYELQSIRDMCYNMDVLGLEGMGLYIDGEIAGFSIG 230
Query: 230 SPINYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYN 289
+ VH+EKA+ + G ++ +N+ F + INRE+DLGI GLR++KLSY+
Sbjct: 231 EKFSDNMAIVHIEKANKDLRGAYTFVNKAFVENYFSDIPIINREQDLGIEGLRKAKLSYS 290
Query: 290 PV 291
P+
Sbjct: 291 PL 292
>gi|147918986|ref|YP_687287.1| hypothetical protein RRC217 [uncultured methanogenic archaeon RC-I]
gi|110622683|emb|CAJ37961.1| conserved hypothetical protein [uncultured methanogenic archaeon
RC-I]
Length = 316
Score = 132 bits (332), Expect = 3e-29, Method: Composition-based stats.
Identities = 87/297 (29%), Positives = 153/297 (51%), Gaps = 18/297 (6%)
Query: 3 FKPICIEDKELITSF--TYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEK 60
FKP+ ++D + + YP + D F+NM W YA + +++ I+
Sbjct: 7 FKPVTLDDADFFRQYYSRYPQV--HSDNTFTNMTCWNHYANYSYAKVDDNVILASTIDGV 64
Query: 61 DPKRMAYMFPVG--NGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEF 118
R P+G N DL + + +L ++ GE P++++ S + + ++P +
Sbjct: 65 TKFRP----PIGPYNPDLTRDLVSLAAET----GEEEPIILIDPASASL--ISYVYP-DM 113
Query: 119 AYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELER 178
+RD +Y+Y DL L+G+ + R +NKF++ + + PIT +
Sbjct: 114 EMSLDRDQSEYVYRATDLAELQGRDYLYIRRDLNKFRRNHSHRVEPITAANAAHVKDFLD 173
Query: 179 RWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFG 238
+W A N E++ +S+E +++ + + H ELGL G AI V+G I A + +N T
Sbjct: 174 QWFAA-RNPEDSGMISHEKKAIMYGLDHMAELGLSGLAIKVDGRIGAISMYERLNGDTAL 232
Query: 239 VHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLE 295
VH EK +Y GI+ IN E A + +++ YINRE D+G+PGLR++K+ Y+P ++E
Sbjct: 233 VHFEKGLQDYPGIYKAINAETAALLGKEFTYINRESDMGVPGLREAKMRYHPHHMVE 289
>gi|73669627|ref|YP_305642.1| hypothetical protein Mbar_A2131 [Methanosarcina barkeri str.
Fusaro]
gi|72396789|gb|AAZ71062.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 304
Score = 131 bits (329), Expect = 6e-29, Method: Composition-based stats.
Identities = 95/306 (31%), Positives = 155/306 (50%), Gaps = 20/306 (6%)
Query: 3 FKPICIEDKELITSF--TYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFY---I 57
FKP+ + D+ YP + D F++M W YA +G +++ +
Sbjct: 7 FKPVTLADRAFFERHYALYPQT--HSDNTFTSMICWNHFMHYRYAYVKGNVILACTAAGV 64
Query: 58 EEKDPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNE 117
P PVG D E + + + +M PL++ + E K ++E+ P +
Sbjct: 65 TRLHP-------PVGPRDPELMKEVI-RLALDMGNNNKPLML--IDPETAKCMKEIDP-D 113
Query: 118 FAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELE 177
V + ++F+Y+Y DL L GKK+ R+ +NKF+K Y +T PIT E + ME
Sbjct: 114 LILVPDLNHFEYVYRAFDLAELPGKKYLKIRSQLNKFRKNYRHTVEPITSENREEIMEFL 173
Query: 178 RRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTF 237
+W + + +N L++E + ++A+ H EL L G I V+ ++ A + +N T
Sbjct: 174 VKWCES-KRCKNNFTLAHEIEAFSYAVEHLTELPLRGLLIRVDSQVGAISLFERLNANTA 232
Query: 238 GVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKN 297
+H EK T+YEGI+ IN E A + + YINRE DLG GLR++KL Y+P ++E
Sbjct: 233 LIHFEKGLTDYEGIYKAINAETAAVLASEVEYINRESDLGASGLRKAKLRYHPHHMVEVY 292
Query: 298 GAIKRR 303
++KRR
Sbjct: 293 -SLKRR 297
>gi|20090642|ref|NP_616717.1| hypothetical protein MA1791 [Methanosarcina acetivorans C2A]
gi|19915686|gb|AAM05197.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length = 313
Score = 130 bits (328), Expect = 9e-29, Method: Composition-based stats.
Identities = 92/303 (30%), Positives = 151/303 (49%), Gaps = 30/303 (9%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLI--------R 54
FKP+ + D+ S + F+NM W YA G ++I R
Sbjct: 8 FKPVMLSDRAFFESHYAFFPQTHSSNTFTNMVCWNHFTPYRYAYVNGNVIISCTTEGVTR 67
Query: 55 FY--IEEKDPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEE 112
F+ I ++P+ M + +R A++ ++ E+ + + + L+E
Sbjct: 68 FHPPIGPRNPELMREL-------IRLALDVSDETPIEL-----------IDPDTAQWLQE 109
Query: 113 LFPNEFAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQ 172
L P E A + +R+ F+Y+Y DL L GKK+Q R+H+N+F+K T P+TP + +
Sbjct: 110 LDP-ELALMPDRNNFEYVYRASDLAELPGKKYQKIRSHLNRFRKNCSSTVEPVTPGNLKE 168
Query: 173 CMELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPI 232
++ ++W +N L++E + +A+ HF EL L G I V+ EI A T +
Sbjct: 169 VIKFLKKWSEWKGCRKNL-VLASEVGAARYAVEHFNELPLQGLLIRVDSEIGAITLYERL 227
Query: 233 NYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVI 292
N T +H EK + EGI+ IN+E A + + YINRE DLG+ GLR++KL Y+P
Sbjct: 228 NTDTALIHFEKGLPDCEGIYKAINEETAAVLVSEVEYINRESDLGVGGLREAKLRYHPHH 287
Query: 293 LLE 295
++E
Sbjct: 288 MIE 290
>gi|46580958|ref|YP_011766.1| hypothetical protein DVU2554 [Desulfovibrio vulgaris subsp.
vulgaris str. Hildenborough]
gi|46450378|gb|AAS97026.1| conserved hypothetical protein [Desulfovibrio vulgaris subsp.
vulgaris str. Hildenborough]
Length = 294
Score = 130 bits (327), Expect = 1e-28, Method: Composition-based stats.
Identities = 85/296 (28%), Positives = 147/296 (49%), Gaps = 17/296 (5%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
F P+ ++ + + + + D++F+N+ W Y E+ G +R + E
Sbjct: 6 FSPVTLDAMQDYLALFARTPRRASDYSFTNLWGWAEHYGLEWRFEHGLCWLRQTLPE--- 62
Query: 63 KRMAYMFPVGN-GDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
+ Y PVG GD+ A S +G+G + + V E E+ +
Sbjct: 63 --VRYWAPVGPWGDIDWA-------SCTCLGKG--MEFIRVPEELASLWREVLGDRVTVQ 111
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWV 181
+DY+Y DL LKG ++ K+NH+N + K Y Y P+TP+ + +E++ W
Sbjct: 112 ETPGQWDYLYDSADLADLKGNRFHRKKNHVNGYAKAYGIDYHPLTPDCIEAVIEMQEEWC 171
Query: 182 RANENGENTDELSNEHRSMTFAMHHFKEL-GLLGGAIMVEGEIVAFTYGSPINYQTFGVH 240
+ E E ++ L E+ ++ + + + GL+GGA+ EG +VA+T G ++ +T VH
Sbjct: 172 QWRECAE-SESLLAENEAVIRVLEEWDSIPGLVGGALYAEGRMVAYTVGEALDDETLVVH 230
Query: 241 VEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
EK +Y G++ INQ+F + ++ +NRE+DL GLR++K SY PV L K
Sbjct: 231 FEKGRGDYRGVYQAINQQFVQYEGARFRLVNREQDLDDEGLRKAKQSYYPVDYLRK 286
>gi|148322876|gb|EDK88126.1| hypothetical protein FNP_0312 [Fusobacterium nucleatum subsp.
polymorphum ATCC 10953]
Length = 290
Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats.
Identities = 93/304 (30%), Positives = 161/304 (52%), Gaps = 19/304 (6%)
Query: 2 VFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIR-FYIEEK 60
+++ + IE K I +T + F+ CD +FSN+ W +EY + L IR Y+ E
Sbjct: 1 MWQKLTIESKNSIEEYT-KNRFEICDLSFSNLLLWSIGENTEYEIENDILTIRSIYMGE- 58
Query: 61 DPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
+ Y P+ D + +E +++ E++ E + I T +KL+ N+F
Sbjct: 59 ----VYYYMPIPKNDTPENIEKIKEKIREILKEN--VAIHYFTEYWYEKLK----NDFNL 108
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRW 180
+RDY DYIY E L TLKG+ + K+N + F+K YEY+Y I + + + ++ +++W
Sbjct: 109 QEKRDYEDYIYSYESLSTLKGRHYAKKKNRVANFRKSYEYSYDSINKDNIDEVVDFQKKW 168
Query: 181 --VRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFG 238
+ + GE L NE+ + + ++++L L GG + V I+A++ G + +
Sbjct: 169 YEIHSESGGE---ILKNENEGILNLLKNYEKLDLKGGFLKVNNHIIAYSLGEALTDKMVL 225
Query: 239 VHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNG 298
VH EKA +Y G + IN + + Y +NRE+D G GLR++K+SY P + L+K
Sbjct: 226 VHTEKALIDYIGSYQAINMIYLQEEWQGYELVNREDDFGDEGLREAKMSYKP-LYLQKKY 284
Query: 299 AIKR 302
+I+R
Sbjct: 285 SIER 288
>gi|120601744|ref|YP_966144.1| hypothetical protein Dvul_0694 [Desulfovibrio vulgaris subsp.
vulgaris DP4]
gi|120561973|gb|ABM27717.1| conserved hypothetical protein [Desulfovibrio vulgaris subsp.
vulgaris DP4]
Length = 294
Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats.
Identities = 85/296 (28%), Positives = 147/296 (49%), Gaps = 17/296 (5%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
F P+ ++ + + + + D++F+N+ W Y E+ G +R + E
Sbjct: 6 FSPVTLDAMQDYLALFARTPRRASDYSFTNLWGWAEHYGLEWRFEHGLCWLRQTLPE--- 62
Query: 63 KRMAYMFPVGN-GDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
+ Y PVG GD+ A S +G+G + + V E E+ +
Sbjct: 63 --VRYWAPVGPWGDIDWA-------SCTCLGKG--MEFIRVPEELASLWREVLGDRVTVQ 111
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWV 181
+DY+Y DL LKG ++ K+NH+N + K Y Y P+TP+ + +E++ W
Sbjct: 112 ETPGQWDYLYDAADLADLKGNRFHRKKNHVNGYAKAYGIDYHPLTPDCIEAVIEMQEEWC 171
Query: 182 RANENGENTDELSNEHRSMTFAMHHFKEL-GLLGGAIMVEGEIVAFTYGSPINYQTFGVH 240
+ E E ++ L E+ ++ + + + GL+GGA+ EG +VA+T G ++ +T VH
Sbjct: 172 QWRECAE-SESLLAENEAVIRVLEEWDSIPGLVGGALYAEGRMVAYTVGEALDDETLVVH 230
Query: 241 VEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
EK +Y G++ INQ+F + ++ +NRE+DL GLR++K SY PV L K
Sbjct: 231 FEKGRGDYRGVYQAINQQFVQYEGARFRLVNREQDLDDEGLRKAKQSYYPVDYLRK 286
>gi|19703622|ref|NP_603184.1| hypothetical protein FN0277 [Fusobacterium nucleatum subsp.
nucleatum ATCC 25586]
gi|19713732|gb|AAL94483.1| Hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC
25586]
Length = 290
Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats.
Identities = 88/297 (29%), Positives = 162/297 (54%), Gaps = 16/297 (5%)
Query: 2 VFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIR-FYIEEK 60
+++ + IE K I +T + F+ CD +FSN+ W +EY + L IR Y+ +
Sbjct: 1 MWQKLTIESKSSIEEYT-KNRFEICDLSFSNLLLWSIGENTEYEIENDVLTIRSVYMGD- 58
Query: 61 DPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
+ Y P+ D + +E +++ E++ E + I T +KL++ +F
Sbjct: 59 ----VYYYMPIPKNDTPKNIEKMKEKIREILKEN--VAIHYFTEYWYEKLKD----DFNL 108
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRW 180
+RDY DYIY E L TLKG+ + K+N ++ FKK Y+++Y I+ + + + + +++W
Sbjct: 109 QEKRDYEDYIYSYESLSTLKGRHYAKKKNRVSNFKKSYQFSYESISKDNINEVVAFQKKW 168
Query: 181 VRANENGENTDE-LSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGV 239
+ E+++E L NE+ + + ++++LGL GG + V +++A++ G + + V
Sbjct: 169 YEIHS--ESSEEILKNENEGILNLLKNYEKLGLKGGFLKVNNQVIAYSLGEALTDKMILV 226
Query: 240 HVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
H EKA +Y G + IN + + Y +NRE+D G GLR++K+SY P+ L +K
Sbjct: 227 HTEKALIDYIGSYQAINMIYLQKEWQGYELVNREDDFGDEGLREAKMSYKPLYLQKK 283
>gi|154150103|ref|YP_001403721.1| hypothetical protein Mboo_0560 [Candidatus Methanoregula boonei
6A8]
gi|153998655|gb|ABS55078.1| conserved hypothetical protein [Candidatus Methanoregula boonei
6A8]
Length = 312
Score = 127 bits (320), Expect = 6e-28, Method: Composition-based stats.
Identities = 90/297 (30%), Positives = 149/297 (50%), Gaps = 28/297 (9%)
Query: 3 FKPICIEDKELITSF--TYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIR---FYI 57
F+P+ + D+ YP + + D F+NM W + + YA G +++ F +
Sbjct: 7 FRPVTLADRAFFERHYARYPQS--HSDNTFTNMVCWNHIAQYRYAYVNGSVILASTLFGV 64
Query: 58 EEKDPKRMAYMFPVGNGD---LRQAVE-ALEKDSYEMMGEGHPLLILGVTSEARKKLEEL 113
P P+G D +R + A+E + P+++ + E + ++EL
Sbjct: 65 TRFRP-------PIGPRDPDLMRDVIRFAME------FADDTPMVL--IDPETARWMKEL 109
Query: 114 FPNEFAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQC 173
P+ V +R++ +Y+YL DL L GK + RN INKF+K +T P+T E +
Sbjct: 110 DPS-LTLVPDRNHSEYVYLASDLAELPGKHYLKIRNQINKFRKNCSHTIEPVTGENREEV 168
Query: 174 MELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPIN 233
M+ +W + EN L++E ++ +A+ HF EL L G I V+ ++ A + +N
Sbjct: 169 MQFLVKWCEW-KGCENDLVLAHEKDAVFYAIEHFTELPLRGLMIRVDSQVAAISLFERLN 227
Query: 234 YQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNP 290
T VH EK + EGI+ +NQ A + + YINRE DLG+ GLR++KL Y+P
Sbjct: 228 EDTALVHFEKGLPDCEGIYKAVNQATAALLVHEVKYINRESDLGVAGLREAKLRYHP 284
>gi|125972764|ref|YP_001036674.1| hypothetical protein Cthe_0242 [Clostridium thermocellum ATCC
27405]
gi|125712989|gb|ABN51481.1| conserved hypothetical protein [Clostridium thermocellum ATCC
27405]
Length = 299
Score = 127 bits (319), Expect = 8e-28, Method: Composition-based stats.
Identities = 86/301 (28%), Positives = 152/301 (50%), Gaps = 17/301 (5%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FKPI ++D+EL + F +++F + WR +Y +E+A+ + ++I+
Sbjct: 4 FKPIELKDRELFHEYLKDYDFLTYEYSFLTLYIWRKMYNTEFAIVDDTIVIK-------- 55
Query: 63 KRMA-----YMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNE 117
KR A +M P+G + A L+ ++ L V + ++L E F N
Sbjct: 56 KRTANNGTYFMQPIGADKSKIADITLKLNTLRKNNPDFKYLYGDVETPFLEQLHENFGNL 115
Query: 118 FAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPI-TPEIVPQCMEL 176
+++ FDYI+ +DL L GKK+ K+N N+F K+Y+Y I +PE++ C++L
Sbjct: 116 VTSHEDKNNFDYIFNSKDLIKLSGKKYHRKKNQYNQFIKKYDYRIEEIQSPEVIKNCIDL 175
Query: 177 ERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGG-AIMVEGEIVAFTYGSPINYQ 235
+W + +++L NE +++ + K + G A+ V EI F G +N +
Sbjct: 176 SLKWY--DYKSLQSEQLKNEQKAIFDIFSNIKIFNNIKGIAVYVNNEIAGFAIGEKLNSK 233
Query: 236 TFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLE 295
VH EK + N+ GI+ +N+ +IN +EDLG+ GLR++K +Y P+ L +
Sbjct: 234 MATVHFEKGNYNFSGIYPFLNKSLVEIFFRDVEFINLQEDLGLEGLRRAKSAYQPIKLEK 293
Query: 296 K 296
K
Sbjct: 294 K 294
>gi|39996148|ref|NP_952099.1| hypothetical protein GSU1046 [Geobacter sulfurreducens PCA]
gi|39982913|gb|AAR34372.1| conserved hypothetical protein [Geobacter sulfurreducens PCA]
Length = 294
Score = 127 bits (318), Expect = 1e-27, Method: Composition-based stats.
Identities = 93/295 (31%), Positives = 143/295 (48%), Gaps = 21/295 (7%)
Query: 4 KPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLI--RFYIEEKD 61
+P+ + DK L+ + + + F+N+ +R +++ L++ R Y E
Sbjct: 10 RPLALADKPLLDALFTELQPRVSELTFANLYLFRGIHDYRLTRLGDALVVLGRGYGGE-- 67
Query: 62 PKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
AY P +GD+ A+ L D + + G L ++A +EE
Sbjct: 68 ----AYALPPLSGDVTGALRTLLADGFTIYGADDTFLERH-GADAAITVEE--------- 113
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWV 181
+RD FDY+YLR DL L G ++ K+N IN F ++ + P+ C+ L W
Sbjct: 114 -DRDGFDYLYLRSDLADLPGNRFHKKKNRINYFAARHPFEVRVFGPDHRQGCLALLDEWR 172
Query: 182 RANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHV 241
R + +T L E + A+ ELGL G I VEG + AF G +N +T H
Sbjct: 173 RVRDAAGST-SLDPETAAAAEAVTLSAELGLEGVVIAVEGRVGAFALGERLNRETAVCHF 231
Query: 242 EKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
EK+D EGI L+N+EF + ++NRE+DLG PGLR +KLSY+PV L+ K
Sbjct: 232 EKSDPFMEGISQLVNREFC-RLFTDCTFVNREQDLGEPGLRTAKLSYHPVELVRK 285
>gi|88602186|ref|YP_502364.1| hypothetical protein Mhun_0895 [Methanospirillum hungatei JF-1]
gi|88187648|gb|ABD40645.1| conserved hypothetical protein [Methanospirillum hungatei JF-1]
Length = 297
Score = 124 bits (312), Expect = 6e-27, Method: Composition-based stats.
Identities = 90/290 (31%), Positives = 147/290 (50%), Gaps = 14/290 (4%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
F PI +EDK++ Q+ + F + WR + EGCL+I+ E
Sbjct: 8 FHPIELEDKDIFDRIYRDYPIQHSENTFGTLFCWRSYGHYKICEHEGCLIIKGETENYH- 66
Query: 63 KRMAYMFPVG--NGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
+Y FP+G D+ A L ++ +GE PLLIL + E +P
Sbjct: 67 ---SYRFPIGPVKPDVFHATIKLAQN----LGEEAPLLIL--EPWQYSWMREHYPT-LKL 116
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRW 180
+R++FDY+Y E L +L G+ + + R +NKF+++ T I+ + + +E +W
Sbjct: 117 KPDREFFDYVYKSEILASLPGQDFLSVRKQLNKFRRKCPSTVELISESNMDEVLEFLVKW 176
Query: 181 VRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVH 240
+ E + T L +E ++ A+ +F +LG G + +GEI +N T VH
Sbjct: 177 CQQRECDKYTI-LKHEKEAINEAVQYFDQLGFSGITVNPKGEIGGIAIFEELNPTTAVVH 235
Query: 241 VEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNP 290
EKA + EGI+ +N + AL++ ++Y YINRE D+GIPGLR++K Y+P
Sbjct: 236 YEKALPDCEGIYKEVNIQTALYLKDRYHYINRESDMGIPGLREAKERYHP 285
>gi|153953652|ref|YP_001394417.1| hypothetical protein CKL_1027 [Clostridium kluyveri DSM 555]
gi|146346533|gb|EDK33069.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
Length = 297
Score = 124 bits (311), Expect = 7e-27, Method: Composition-based stats.
Identities = 86/300 (28%), Positives = 147/300 (49%), Gaps = 13/300 (4%)
Query: 1 MVFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEK 60
++FKPI I+DK + + F +++F+++ WR + Y +++ L+I+ +K
Sbjct: 2 LIFKPITIDDKSIFDKYLVNYNFDTSEYSFTSLIIWRKGCDITYTIYDNALIIK----KK 57
Query: 61 D-PKRMAYMFPVG--NGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNE 117
D +M P+G +++ ++ L + E P L + L+E++
Sbjct: 58 DFNGNYHFMQPIGYTKDNIKNIIKKLTEYKEE---NNMPYLFKDAEGKFLSDLKEIYGEA 114
Query: 118 FAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPI-TPEIVPQCMEL 176
+ + FDYIY + L TL GKK +K+NH N F K Y YT E+ +
Sbjct: 115 VHSKPDINNFDYIYTTQKLITLSGKKLHSKKNHYNYFIKTYNYTVKDFYDSEVKTDIISA 174
Query: 177 ERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQT 236
+ W + +G L E + + + ++L L A+ V+ +I AFT G +N
Sbjct: 175 AQSWYKQKNSGNKY--LKYELEGIKEIVFNMEKLNLKAMAVYVDNKISAFTIGEKVNSNM 232
Query: 237 FGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
+H+EK ++ GI++ N+ F + YINRE+DLGI GLR++K SY PV + EK
Sbjct: 233 AIIHIEKGSSSIRGIYTFTNKTFVENYLSDVKYINREQDLGIEGLRKAKRSYYPVKMGEK 292
>gi|121534719|ref|ZP_01666540.1| conserved hypothetical protein [Thermosinus carboxydivorans Nor1]
gi|121306739|gb|EAX47660.1| conserved hypothetical protein [Thermosinus carboxydivorans Nor1]
Length = 138
Score = 118 bits (296), Expect = 4e-25, Method: Composition-based stats.
Identities = 56/134 (41%), Positives = 88/134 (65%), Gaps = 3/134 (2%)
Query: 163 IPITPEIVPQCMELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGE 222
+PITP+++P C++ W ++ + +E ++ A HF+EL L+GGAI+++G+
Sbjct: 1 MPITPDLIPACVKSTLAWYE--RRCDDDPCMGHEKEAILRAFAHFRELKLVGGAIVIDGK 58
Query: 223 IVAFTYGSPINYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLR 282
+ AFT+G +N T VH+EK + ++ GI+ +INQ+F YINREED+GI GLR
Sbjct: 59 VEAFTFGEALNSDTVVVHIEKGNADFRGIYQMINQQFCRQW-RHMRYINREEDMGIEGLR 117
Query: 283 QSKLSYNPVILLEK 296
Q+KLSY PV ++EK
Sbjct: 118 QAKLSYRPVKMVEK 131
>gi|110800971|ref|YP_695841.1| hypothetical protein CPF_1396 [Clostridium perfringens ATCC 13124]
gi|110675618|gb|ABG84605.1| conserved hypothetical protein [Clostridium perfringens ATCC 13124]
Length = 300
Score = 118 bits (295), Expect = 5e-25, Method: Composition-based stats.
Identities = 82/293 (27%), Positives = 146/293 (49%), Gaps = 9/293 (3%)
Query: 2 VFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKD 61
+FK I ++DK L + + F +C+++F+ + W+ + EY + +I+ Y +K+
Sbjct: 1 MFKKITLKDKSLYYQYIDKNKFLSCEYSFATLFMWKDFNDIEYDIVNNIFIIKKY--DKN 58
Query: 62 PKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
+ +M P+G+ D + ++ E L V+ +LE+++ +
Sbjct: 59 NGKF-FMEPLGDIDDNSLINIIDYLESIRKKEKSKWLFGDVSINFLNRLEDIYKENLIFE 117
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEI----VPQCMELE 177
E + FDY+Y +DL+ L G+K++ KRN N+F K Y Y + +C+E
Sbjct: 118 EEINNFDYVYNFDDLRNLSGRKFRKKRNKYNQFIKNYNYKTAFFKSFLDNKEREECLEFL 177
Query: 178 RRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTF 237
+W EN E +E E +++ +L L + V+ +++ + G N T+
Sbjct: 178 DKWYL--ENKEMDEEFLAEIDGTRNLINYLGQLDLDLIKLYVDNKLIGISIGERFNDSTY 235
Query: 238 GVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNP 290
VHVEK ++G ++ IN E + Y+NREEDLGI GL++SK+SYNP
Sbjct: 236 IVHVEKCLKEFKGAYAFINNELLKNYFLDLKYVNREEDLGILGLKKSKMSYNP 288
>gi|110803855|ref|YP_698530.1| hypothetical protein CPR_1208 [Clostridium perfringens SM101]
gi|110684356|gb|ABG87726.1| conserved hypothetical protein [Clostridium perfringens SM101]
Length = 300
Score = 117 bits (292), Expect = 1e-24, Method: Composition-based stats.
Identities = 81/293 (27%), Positives = 148/293 (50%), Gaps = 9/293 (3%)
Query: 2 VFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKD 61
+FK I ++DK + + + F +C+++F+ + W+ EY + +IR Y +K
Sbjct: 1 MFKKITLKDKSIYYKYIDKNKFLSCEYSFTTLFMWKDFNNIEYDIVNNIFIIRKY--DKI 58
Query: 62 PKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYV 121
++ +M P+G+ D + ++ + E + L V+ +L++++ +
Sbjct: 59 NGKI-FMQPLGDIDDDSLINIIDYLEFIRKKENNKWLFGDVSINFLNRLKDIYKENLIFD 117
Query: 122 TERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTY----IPITPEIVPQCMELE 177
E+ FDY+Y +DL+ L G+K++ KRN N+F K Y Y + + +C+E
Sbjct: 118 EEKLNFDYLYDFDDLKNLSGRKFRNKRNKYNQFIKNYNYKTSFFKCFLNNKEKEECLEFL 177
Query: 178 RRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTF 237
+W EN E +E E +++ +L L + V+ +++ + G N T+
Sbjct: 178 NKWYL--ENKEIDEEFLAEINGTRNLINYLDQLDLDLIKLYVDNQLIGISIGERFNDITY 235
Query: 238 GVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNP 290
VHVEK ++G ++ IN E + Y+NREEDLGI GL++SK+SYNP
Sbjct: 236 IVHVEKCLKEFKGSYAFINNELLKNSFLDLKYVNREEDLGILGLKKSKMSYNP 288
>gi|145618153|ref|ZP_01774214.1| conserved hypothetical protein [Geobacter bemidjiensis Bem]
gi|144945613|gb|EDJ80677.1| conserved hypothetical protein [Geobacter bemidjiensis Bem]
Length = 297
Score = 116 bits (290), Expect = 2e-24, Method: Composition-based stats.
Identities = 72/231 (31%), Positives = 113/231 (48%), Gaps = 13/231 (5%)
Query: 67 YMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPN-EFAYVTERD 125
Y P GD+ A L + + G L PN V +RD
Sbjct: 69 YFLPPFGGDIAAATRRLLDEGLTLYGADDGFL------------SRYLPNFGLEVVPDRD 116
Query: 126 YFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWVRANE 185
FDY++L++++ L GKK+ K+N +N F+ ++ + + +EL +W R
Sbjct: 117 NFDYLHLKQEMAELNGKKYHKKKNRVNYFQLRHRHQVELFNEGHLDGALELLEQWRRVRA 176
Query: 186 NGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHVEKAD 245
L+ E T A+ + LGL G ++V+G + F G +N +T H EK D
Sbjct: 177 EFGEESSLAQEVEGATEALRLREALGLSGVVVLVDGAVRGFALGERLNRETAVCHFEKGD 236
Query: 246 TNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
EG++ L+++EF+ + + Y+NRE+DLG P LRQ+KLSY+PV LL K
Sbjct: 237 LFLEGLYQLLDREFSRLLFPECSYLNREQDLGEPALRQAKLSYHPVELLAK 287
>gi|148265328|ref|YP_001232034.1| hypothetical protein Gura_3304 [Geobacter uraniumreducens Rf4]
gi|146398828|gb|ABQ27461.1| conserved hypothetical protein [Geobacter uraniumreducens Rf4]
Length = 293
Score = 115 bits (288), Expect = 3e-24, Method: Composition-based stats.
Identities = 86/304 (28%), Positives = 139/304 (45%), Gaps = 24/304 (7%)
Query: 4 KPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLI---RFYIEEK 60
+P+ ++DK ++ + +F F+N+ +L+ S +A CL + + K
Sbjct: 10 RPLALDDKPMLDGIFAGLQPRVSEFTFANL----YLFRSVHAY---CLTMVGDALVVMGK 62
Query: 61 DPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPN-EFA 119
Y P GD+ A+ L D + G P + E F
Sbjct: 63 GYAGDDYFLPPLTGDIAAALSVLLNDGLTLYGADEPFV------------ERYFQGGNVD 110
Query: 120 YVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERR 179
ER+ FDY++LR ++ L G ++ K+N IN F +++ Y ++L
Sbjct: 111 VAAERNSFDYLHLRSEMAELPGNRFHKKKNRINYFARRHAYQVELYADSHREGALQLLDE 170
Query: 180 WVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGV 239
W R + L E + A+ +LGL G ++VEG++ AF G +N T
Sbjct: 171 WYRVRSE-VASGSLLPETEATRDALAMAGQLGLAGVVVLVEGKVRAFVLGERLNSDTSVC 229
Query: 240 HVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKNGA 299
H EKAD +G+ L ++EF + +Y+NRE+DLG P LR+SKLSY+PV L++K
Sbjct: 230 HFEKADPFLDGLNQLADREFNRLLFADCIYVNREQDLGEPNLRESKLSYHPVELIKKFKV 289
Query: 300 IKRR 303
RR
Sbjct: 290 RARR 293
>gi|110601754|ref|ZP_01389925.1| conserved hypothetical protein [Geobacter sp. FRC-32]
gi|110547537|gb|EAT60792.1| conserved hypothetical protein [Geobacter sp. FRC-32]
Length = 296
Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats.
Identities = 70/198 (35%), Positives = 110/198 (55%), Gaps = 6/198 (3%)
Query: 106 ARKKLEELFPNEFAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPI 165
AR+ L+ ++ +RD FDY+YLR DL L G ++ K+N IN F ++ +T P
Sbjct: 100 ARQHLQR---DDLEVTEDRDNFDYLYLRSDLAELPGNRYHKKKNRINYFAGRHPFTVEPY 156
Query: 166 TPEIVPQCMELERRWVRA-NENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIV 224
+ L W R ++ G + L E + A+ ++LGL G ++VEGE+
Sbjct: 157 GNQHRQGASALLDEWQRVHSQTGSGSFLL--EVDAAREALFMTEKLGLKGLVVLVEGEVK 214
Query: 225 AFTYGSPINYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQS 284
AF G +N QT H +KAD +G++ L ++EF + + Y+NRE+DLG LR+S
Sbjct: 215 AFVLGEKLNDQTSVCHFQKADPFLDGLYQLTDREFNRLLFTECTYVNREQDLGEANLRES 274
Query: 285 KLSYNPVILLEKNGAIKR 302
KLSY+P+ L++K A +R
Sbjct: 275 KLSYHPLELVKKYRAGRR 292
>gi|124485248|ref|YP_001029864.1| putative transcriptional regulator, CopG family [Methanocorpusculum
labreanum Z]
gi|124362789|gb|ABN06597.1| conserved hypothetical protein [Methanocorpusculum labreanum Z]
Length = 300
Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats.
Identities = 89/292 (30%), Positives = 139/292 (47%), Gaps = 16/292 (5%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
FK I + DK LI + + + F + W EYAV L++
Sbjct: 6 FKKITLADKPLIEDYFRRFPQYHSEHNFLTLLCWEHYSHCEYAVINDHLIL----SNMTC 61
Query: 63 KRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLE--ELFPNEFAY 120
+ P+G D E L + + + G+ VT K L +L+ E
Sbjct: 62 GQCTCHAPIGEFD-PALFEELLQYTKKHTGD------CAVTFHEDKYLPYMKLYHPETPV 114
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRW 180
R DY Y ++L L G+K+ R IN+F +Y+YT PITPE +P+ E+ +W
Sbjct: 115 YQSRGCSDYYYRTKELAELAGQKYLNIRKQINQFNTKYQYTVDPITPESIPEIHEMLDKW 174
Query: 181 VRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMV--EGEIVAFTYGSPINYQTFG 238
A +N E L+ E + A++ + ELG G I + + +I A +N +T
Sbjct: 175 SDA-KNTEVNSVLNEEVGAAHAALNQWDELGCEGLIIRILPKNKIGAVAIWGEMNNETAV 233
Query: 239 VHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNP 290
+H EK + Y+GI+ +INQE A + +Y +INRE D+ +PGLR++KL Y+P
Sbjct: 234 IHFEKGISQYKGIYKVINQETAKALLGKYSWINRESDMDVPGLREAKLRYHP 285
>gi|42528175|ref|NP_973273.1| hypothetical protein TDE2675 [Treponema denticola ATCC 35405]
gi|41819220|gb|AAS13192.1| conserved hypothetical protein [Treponema denticola ATCC 35405]
Length = 296
Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats.
Identities = 70/187 (37%), Positives = 104/187 (55%), Gaps = 9/187 (4%)
Query: 114 FPNEFAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYI-PITPEIVPQ 172
F E +RD FDY+YLR+DL LKGK + K+ HINKF+K Y+ I P+T E V
Sbjct: 110 FLKELKIEEDRDNFDYVYLRKDLAELKGKDFHKKKTHINKFEKSYDKIKIEPLTLENVED 169
Query: 173 CMELERRW--VRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGS 230
++ W + + N EN+D + + A+ ++G + V E VA+T
Sbjct: 170 AKKVLEEWNSTKPDSNPENSD-----YEAALEALSILSRTSMMGIILYVWNEPVAWTLAE 224
Query: 231 PI-NYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYN 289
N +T + EKA +Y+G F IN FA ++PE +INRE+DLG GLRQ+K++Y
Sbjct: 225 ITQNNKTAVILFEKALASYKGSFQYINYAFAGYLPEYIEFINREQDLGDEGLRQAKMTYK 284
Query: 290 PVILLEK 296
P+ ++K
Sbjct: 285 PIKFIKK 291
>gi|94987570|ref|YP_595503.1| hypothetical protein LI1128 [Lawsonia intracellularis PHE/MN1-00]
gi|94731819|emb|CAJ55182.1| uncharacterized conserved protein [Lawsonia intracellularis
PHE/MN1-00]
Length = 299
Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats.
Identities = 84/299 (28%), Positives = 135/299 (45%), Gaps = 31/299 (10%)
Query: 16 SFTYPSAFQN------------CDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDPK 63
S +PS F N D+ +N+ W+ Y E+ G IR D
Sbjct: 7 SPVHPSKFDNYTLLWEMTPQHSSDYTLTNIWGWKDFYGLEWMFESGLCWIR-QTRHVDGA 65
Query: 64 RMAYMFPVGNGDLRQAVE-ALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAYVT 122
+ P+G+ + V L K M L L L+E+ P +
Sbjct: 66 DYIFWAPIGDWNSIDWVNLPLFKRGLHMQRVPEMLCSL---------LQEILPRKINIEE 116
Query: 123 ERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWVR 182
++YIY +DL TL G+++ +KRNH+N F ++Y+ Y PI + + +E ++ W +
Sbjct: 117 TPGQWEYIYSTKDLTTLSGRRFDSKRNHLNSFFREYKEDYRPIGDSNILELIEFQKEWYK 176
Query: 183 ANENGENTDELSNEHRSMTFAMHHFKELGLL----GGAIMVEGEIVAFTYGSPINYQTFG 238
E + + L+ E + FK GL GGA+ +++AF+ G ++ T
Sbjct: 177 WRE-CKKSPSLAAEGEVLC---DIFKNWGLFPSLRGGALYSNNKVIAFSIGETLDENTVV 232
Query: 239 VHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEKN 297
VH EKA G + IN F + + +INRE+DLG GL+++K SY P+ L+KN
Sbjct: 233 VHFEKAKPGIRGAYQAINNCFVRYECNDFDFINREQDLGEEGLKKAKQSYYPIGFLKKN 291
>gi|154485046|ref|ZP_02027494.1| hypothetical protein EUBVEN_02767 [Eubacterium ventriosum ATCC
27560]
gi|149733999|gb|EDM50118.1| hypothetical protein EUBVEN_02767 [Eubacterium ventriosum ATCC
27560]
Length = 303
Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats.
Identities = 66/211 (31%), Positives = 117/211 (55%), Gaps = 9/211 (4%)
Query: 98 LILGVTSEARKKLEELFPNEFAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQ 157
L++ V A L +L +E+ V +R Y DY+Y E L+T GKK+ K+NH+N FK++
Sbjct: 96 LVMYVVDRAAVDLLQLPEDEYVVVPDRTYADYVYDAEKLRTFSGKKYHKKKNHLNAFKRE 155
Query: 158 YE--YTYIPITPEIVPQCMELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGG 215
YE Y + ++ + P+ ++ W + + E + + +E + + + H + G
Sbjct: 156 YEGRYEFKFLSKKDEPEILDFLEDWKKHKSDTEEHEFIDSEAVGIKYILEHEEVFDYKIG 215
Query: 216 AIMVEGEIVAFTYGSPINYQT----FGVHVEKADTNYEGIFSLINQEFALHIPEQYVYIN 271
A+ V+ ++ AFT G NY++ + VEKA+ G++ I +F + + N
Sbjct: 216 AVYVDNKLEAFTIG---NYESREDMVYIPVEKANPEIRGLYPYICSQFLIEAFPEAGKEN 272
Query: 272 REEDLGIPGLRQSKLSYNPVILLEKNGAIKR 302
RE+D+G+ GLR+SKLSYNP+ ++EK I++
Sbjct: 273 REDDMGLEGLRKSKLSYNPIYMVEKYTIIQK 303
>gi|85859315|ref|YP_461517.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
gi|85722406|gb|ABC77349.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
Length = 315
Score = 97.4 bits (241), Expect = 1e-18, Method: Composition-based stats.
Identities = 59/180 (32%), Positives = 98/180 (54%), Gaps = 6/180 (3%)
Query: 118 FAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY----EYTYIPITPEIVPQC 173
F+ + DY DYIYL++DL L+GK++ KRNHIN F++ + T ++P +P
Sbjct: 120 FSLTEQPDYSDYIYLKKDLTELQGKRYLKKRNHINYFRRIFLETGRTTIEALSPVNIPDT 179
Query: 174 MELERRWVRANE--NGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSP 231
+ RW + N E L E ++ + + + L G AI ++GE+ AF +
Sbjct: 180 LLFLDRWCAKDSRCNPETDPNLGQESQAARLMLENLELLEARGIAIRIDGEVCAFGIVTS 239
Query: 232 INYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPV 291
+ + ++ EKA ++ G++ +++E A + E +IN+E D+GIPGL SK SY PV
Sbjct: 240 LTDKIKVLNFEKAYSHIRGLYQFLDRECACRLFEDVEFINKESDMGIPGLAGSKKSYFPV 299
>gi|78221940|ref|YP_383687.1| hypothetical protein Gmet_0720 [Geobacter metallireducens GS-15]
gi|78193195|gb|ABB30962.1| conserved hypothetical protein [Geobacter metallireducens GS-15]
Length = 296
Score = 90.5 bits (223), Expect = 1e-16, Method: Composition-based stats.
Identities = 75/232 (32%), Positives = 113/232 (48%), Gaps = 16/232 (6%)
Query: 67 YMFPVGNGDLRQAVEALEKDSYEMMGEGHPLL--ILGVTSEARKKLEELFPNEFAYVTER 124
Y P GD+ A+ L + G P + L V A +EE +R
Sbjct: 73 YFLPPLGGDVAGALRVLLDAGMTLYGADEPFVSRYLAVGGVA---VEE----------DR 119
Query: 125 DYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRWVRAN 184
D FDY++LR+ L L G ++ K+N IN F ++ ++ + C+ L W R
Sbjct: 120 DAFDYLHLRQALAELPGNRFHKKKNRINYFTARHSFSVVLYGEAHRDGCLALLDEWRRV- 178
Query: 185 ENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPINYQTFGVHVEKA 244
NG ++ L E ++ A+ LGL G ++VEG + AF G +N T H EK
Sbjct: 179 RNGIDSPSLELETKAGAEALMLADRLGLEGVVVLVEGGVAAFALGERLNRDTSVCHFEKN 238
Query: 245 DTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVILLEK 296
D EG+ L+++EF + + NRE+DLG PGLR +KLSY+P L++K
Sbjct: 239 DPFMEGVSQLVDREFNRLLFTDCTWTNREQDLGEPGLRAAKLSYHPDELVKK 290
>gi|78777400|ref|YP_393715.1| hypothetical protein Tmden_1202 [Thiomicrospira denitrificans ATCC
33889]
gi|78497940|gb|ABB44480.1| conserved hypothetical protein [Thiomicrospira denitrificans ATCC
33889]
Length = 326
Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 61/189 (32%), Positives = 93/189 (49%), Gaps = 11/189 (5%)
Query: 119 AYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPI-TPEIVPQCMEL 176
+Y+ E+ DY+Y + L L+G + KR INKF K Y +Y + + + + M L
Sbjct: 132 SYIVEKKLVDYVYEVDALIDLRGNSYHTKRTEINKFMKSYPDYVIEKLDSAKHKDEIMHL 191
Query: 177 ERRWVR------ANENGENTDE-LSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYG 229
+WV E E E + E ++ + ++ EL LLG I + GE+ FT G
Sbjct: 192 FNKWVSDRVKYMPKEEAEVFLEGIHQERHAIKQMIKNYDELMLLGLVIYINGELKGFTVG 251
Query: 230 SPINYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQY--VYINREEDLGIPGLRQSKLS 287
I++ T V +EK D G I +EF+ + E Y YIN +D+G LR+ K+S
Sbjct: 252 ERISHDTATVIIEKTDFEILGCAQFIFREFSKMLKEHYEVAYINVGDDMGFENLRKVKMS 311
Query: 288 YNPVILLEK 296
Y P L+ K
Sbjct: 312 YRPFKLVPK 320
>gi|154174955|ref|YP_001408913.1| ribonuclease HII (RNase HII) [Campylobacter curvus 525.92]
gi|112802584|gb|EAT99928.1| ribonuclease HII (RNase HII) [Campylobacter curvus 525.92]
Length = 328
Score = 76.6 bits (187), Expect = 2e-12, Method: Composition-based stats.
Identities = 55/182 (30%), Positives = 89/182 (48%), Gaps = 11/182 (6%)
Query: 120 YVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVP-QCMELE 177
Y+ E+ DYIY E+L LKG + KR INKFK Y ++ + PE + +EL
Sbjct: 135 YLFEKRLADYIYKAENLIELKGNSYHTKRTEINKFKNTYPNFSVQTLMPEKHKGEIIELS 194
Query: 178 RRWVRANEN---GENTDE----LSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGS 230
+W + E D + EH ++ + H+ +L L+G + ++G + FT G
Sbjct: 195 NKWAKERMKYMPKEQADAFMEGIYQEHAAIKRMLDHYDKLELIGIVLYIDGVMQGFTVGE 254
Query: 231 PINYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYV--YINREEDLGIPGLRQSKLSY 288
IN V +EK + G I +EF+ + +Y +IN +D+G L++ K+SY
Sbjct: 255 RINEGVASVILEKTNFEILGCAQFIFREFSKILKSEYSCEFINVGDDMGFENLKKVKMSY 314
Query: 289 NP 290
P
Sbjct: 315 RP 316
>gi|150384601|ref|ZP_01923283.1| conserved hypothetical protein [Victivallis vadensis ATCC BAA-548]
gi|150258988|gb|EDM96237.1| conserved hypothetical protein [Victivallis vadensis ATCC BAA-548]
Length = 297
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 74/297 (24%), Positives = 130/297 (43%), Gaps = 28/297 (9%)
Query: 3 FKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIRFYIEEKDP 62
+P+ +ED++L A Q+C+ +F+N+ F+Y+ Y + R + E+
Sbjct: 8 LRPVTLEDRKLFDEKLAVLASQSCECSFANL----FMYQQPYGLEFVEAGDRLVVYERHA 63
Query: 63 KRMAYMFPVGN----GDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKK---LEELFP 115
+ + Y P+G +L++ ++A + G I V E K + F
Sbjct: 64 RTIHY--PIGKWTTPEELKEIIDAFNAEGLTDGG------IYDVPEEFLDKHSDCDAFFE 115
Query: 116 NEFAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYI-PITPEIVPQCM 174
EF + DY+Y + + T G K + K N + +F+ + Y + IT E +P
Sbjct: 116 LEF----DEGAIDYLYSIDKIATFAGPKLRKKHNLVKQFQTNWPYAEVRKITREEIPAAA 171
Query: 175 ELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVE-GEIVAFTYGSPIN 233
+L N E + L E ++ A +F ELGL G + E G + S +
Sbjct: 172 KLA---ADLNSRLEPCEFLEEEALALDRAWKNFDELGLGGIILYAEPGYPAGLSVYSHLP 228
Query: 234 YQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNP 290
T VH EKAD +G + + A+ + + ++NRE+D+ LR +K S +P
Sbjct: 229 SDTVDVHFEKADHTVKGAPQTLTWQLAIALRGKAKFMNREQDMNEESLRHAKRSLDP 285
>gi|34762805|ref|ZP_00143791.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
gi|27887507|gb|EAA24591.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
Length = 169
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 55/181 (30%), Positives = 92/181 (50%), Gaps = 13/181 (7%)
Query: 2 VFKPICIEDKELITSFTYPSAFQNCDFAFSNMCSWRFLYESEYAVFEGCLLIR-FYIEEK 60
+++ + IE K I +T + F+ CD +FSN+ W +EY + L IR Y+ E
Sbjct: 1 MWQKLTIESKSSIEEYT-KNRFEICDLSFSNLLLWSTGENTEYEIENDVLTIRSVYMGE- 58
Query: 61 DPKRMAYMFPVGNGDLRQAVEALEKDSYEMMGEGHPLLILGVTSEARKKLEELFPNEFAY 120
+ Y P+ D + +E +++ E++ E + I T +KL++ +F
Sbjct: 59 ----VYYYMPIPKNDTPENIEKMKEKIREILKEN--VAINYFTEYWYEKLKD----DFNL 108
Query: 121 VTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQCMELERRW 180
+RDY DYIY E L TLKG+ + K+N + FKK YEY+Y I + + ++ R+
Sbjct: 109 QEKRDYEDYIYSYESLSTLKGRHYAKKKNRVANFKKSYEYSYGSINKNNINEVVDFHRKM 168
Query: 181 V 181
V
Sbjct: 169 V 169
>gi|118475472|ref|YP_891393.1| hypothetical protein CFF8240_0188 [Campylobacter fetus subsp. fetus
82-40]
gi|118414698|gb|ABK83118.1| conserved hypothetical protein [Campylobacter fetus subsp. fetus
82-40]
Length = 330
Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats.
Identities = 52/182 (28%), Positives = 89/182 (48%), Gaps = 11/182 (6%)
Query: 120 YVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQY-EYTYIPITPEIVPQ-CMELE 177
Y+ E+ DYIY +DL LKG + KR INKFK Y + + P+ + ++L
Sbjct: 136 YIFEKQLVDYIYKSDDLIELKGNSYHTKRTEINKFKNTYPNFKIETLDPKAHREIIIDLA 195
Query: 178 RRW-------VRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGS 230
+W + A + + + + E ++ + H+++L L+G + ++ EI F+ G
Sbjct: 196 NKWAAERIKYMPAEKMDDFLEGIYQEKAAIKRMLDHYEKLELIGIVLFIDDEIKGFSVGE 255
Query: 231 PINYQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYV--YINREEDLGIPGLRQSKLSY 288
IN V +EK D G I +EF+ + Y +IN +D+G L++ K+SY
Sbjct: 256 KINDGVASVIIEKTDFATLGSAQFIFREFSKVLKNIYNCDFINVGDDMGFENLKKVKMSY 315
Query: 289 NP 290
P
Sbjct: 316 RP 317
>gi|46446557|ref|YP_007922.1| hypothetical protein pc0923 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46400198|emb|CAF23647.1| hypothetical protein [Candidatus Protochlamydia amoebophila UWE25]
Length = 285
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 50/186 (26%), Positives = 86/186 (46%), Gaps = 7/186 (3%)
Query: 114 FPNEFAYVTERDYFDYIYLREDLQTLKGKKWQAKRNHINKFKKQYEYTYIPITPEIVPQC 173
FP +F++ E DY++ E L G+ KRN + + +Y+ +T +
Sbjct: 100 FPLDFSF--EDADSDYLFKVEKLAFFSGRHLSKKRNLVKQLFDRYQIKTAQLTQKNKSDA 157
Query: 174 MELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSPIN 233
+++ W + ++ S A+ + L L G I ++G+ FT G +
Sbjct: 158 LKILVNWQDQQTQ-----QEQTDYFSCLEAIKLLECLNLEGVIIYIDGKPAGFTIGEYLT 212
Query: 234 YQTFGVHVEKADTNYEGIFSLINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNPVIL 293
F +H K D N G++ + Q A + +Q +IN E+DLG+P LRQ+K SY P +
Sbjct: 213 PTCFVIHFAKGDANIHGLYQYLYQYQAQILSKQTEWINLEQDLGLPFLRQAKHSYVPDRM 272
Query: 294 LEKNGA 299
+ K A
Sbjct: 273 ILKQRA 278
>gi|26554339|ref|NP_758273.1| hypothetical protein MYPE8860 [Mycoplasma penetrans HF-2]
gi|26454348|dbj|BAC44677.1| conserved hypothetical protein [Mycoplasma penetrans HF-2]
Length = 319
Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats.
Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 11/180 (6%)
Query: 116 NEFAYVTERDYFD--YIYLREDLQTLKGKKWQAKRNHINKFKKQYEYT--YIPITPEIVP 171
N F + D F+ Y+Y E L+ + GKK Q KRN IN +KK YE PE+
Sbjct: 129 NIFKKIELLDTFNSAYLYPTEQLKYMAGKKMQKKRNFINFYKKNYESESRIEKFNPELKQ 188
Query: 172 QCMELERRWVRANENGENTDELSNEHRSMTFAMHHFKELGLLGGAIMVEGEIVAFTYGSP 231
+ ++ + EN E + + + + F G + + +IV FTYG
Sbjct: 189 EIIDFCSKVSIDKENQETREYEIDALKKIL----EFNPNSSYGSTLFYKNKIVGFTYGF- 243
Query: 232 INYQTFGVHVEKADTNYEGIFS-LINQEFALHIPEQYVYINREEDLGIPGLRQSKLSYNP 290
IN + + +EK D +GI+ L+++ ++ Q ++R++D+ L QSK SY P
Sbjct: 244 INNDKYEIFIEKGDKELKGIYQYLLSKNLEIN-NIQTKLVDRQDDMYSENLAQSKQSYKP 302
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.321 0.139 0.417
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,197,607,439
Number of Sequences: 5470121
Number of extensions: 53301928
Number of successful extensions: 109007
Number of sequences better than 1.0e-05: 68
Number of HSP's better than 0.0 without gapping: 57
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 108805
Number of HSP's gapped (non-prelim): 68
length of query: 303
length of database: 1,894,087,724
effective HSP length: 132
effective length of query: 171
effective length of database: 1,172,031,752
effective search space: 200417429592
effective search space used: 200417429592
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 129 (54.3 bits)