BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_1195 hypothetical protein
(407 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34540831|ref|NP_905310.1| hypothetical protein PG1098 [P... 810 0.0
gi|150008084|ref|YP_001302827.1| hypothetical protein BDI_1... 288 4e-76
gi|154493663|ref|ZP_02032983.1| hypothetical protein PARMER... 274 1e-71
gi|153806987|ref|ZP_01959655.1| hypothetical protein BACCAC... 265 3e-69
gi|167763898|ref|ZP_02436025.1| hypothetical protein BACSTE... 263 2e-68
gi|29348417|ref|NP_811920.1| hypothetical protein BT_3008 [... 259 2e-67
gi|160884232|ref|ZP_02065235.1| hypothetical protein BACOVA... 259 3e-67
gi|60683732|ref|YP_213876.1| hypothetical protein BF4320 [B... 246 2e-63
gi|53715803|ref|YP_101795.1| hypothetical protein BF4524 [B... 244 6e-63
gi|160891672|ref|ZP_02072675.1| hypothetical protein BACUNI... 244 1e-62
gi|150003879|ref|YP_001298623.1| hypothetical protein BVU_1... 230 1e-58
gi|163754903|ref|ZP_02162024.1| hypothetical protein KAOT1_... 228 8e-58
gi|110639758|ref|YP_679968.1| hypothetical protein CHU_3389... 219 4e-55
gi|86130473|ref|ZP_01049073.1| hypothetical protein MED134_... 217 1e-54
gi|88712929|ref|ZP_01107014.1| hypothetical protein FB2170_... 214 1e-53
gi|163788239|ref|ZP_02182685.1| hypothetical protein FBALC1... 213 2e-53
gi|154486382|ref|ZP_02027789.1| hypothetical protein BIFADO... 212 3e-53
gi|124005819|ref|ZP_01690657.1| conserved hypothetical prot... 211 1e-52
gi|150025671|ref|YP_001296497.1| hypothetical protein FP162... 211 1e-52
gi|23335244|ref|ZP_00120482.1| COG0500: SAM-dependent methy... 204 8e-51
gi|119025051|ref|YP_908896.1| N6-adenine-specific methylase... 204 1e-50
gi|126663636|ref|ZP_01734632.1| hypothetical protein FBBAL3... 203 2e-50
gi|23466002|ref|NP_696605.1| hypothetical protein BL1446 [B... 201 1e-49
gi|183602204|ref|ZP_02963571.1| N6-adenine-specific methyla... 198 7e-49
gi|149372807|ref|ZP_01891828.1| hypothetical protein SCB49_... 196 2e-48
gi|126646426|ref|ZP_01718943.1| hypothetical protein ALPR1_... 195 6e-48
gi|88802617|ref|ZP_01118144.1| hypothetical protein PI23P_0... 194 8e-48
gi|86133982|ref|ZP_01052564.1| hypothetical protein MED152_... 194 9e-48
gi|120436004|ref|YP_861690.1| hypothetical protein GFO_1650... 186 3e-45
gi|146302326|ref|YP_001196917.1| hypothetical protein Fjoh_... 185 5e-45
gi|171741703|ref|ZP_02917510.1| hypothetical protein BIFDEN... 181 1e-43
gi|149280236|ref|ZP_01886359.1| hypothetical protein PBAL39... 180 2e-43
gi|83857756|ref|ZP_00951284.1| hypothetical protein CA2559_... 179 3e-43
gi|88805298|ref|ZP_01120817.1| hypothetical protein RB2501_... 177 1e-42
gi|86142666|ref|ZP_01061105.1| hypothetical protein MED217_... 173 3e-41
gi|21672941|ref|NP_661006.1| hypothetical protein CT0100 [C... 150 1e-34
gi|89891579|ref|ZP_01203083.1| putative methyltransferase [... 147 1e-33
gi|78188005|ref|YP_378343.1| hypothetical protein Cag_0021 ... 136 3e-30
gi|68549338|ref|ZP_00588803.1| conserved hypothetical prote... 134 1e-29
gi|91214828|ref|ZP_01251801.1| hypothetical protein P700755... 131 1e-28
gi|119358355|ref|YP_912999.1| hypothetical protein Cpha266_... 129 3e-28
gi|67918008|ref|ZP_00511610.1| conserved hypothetical prote... 128 9e-28
gi|67937956|ref|ZP_00530486.1| conserved hypothetical prote... 127 2e-27
gi|67940206|ref|ZP_00532663.1| conserved hypothetical prote... 125 5e-27
gi|145220508|ref|YP_001131217.1| hypothetical protein Cvib_... 120 1e-25
gi|110597296|ref|ZP_01385584.1| conserved hypothetical prot... 120 2e-25
gi|68552180|ref|ZP_00591572.1| conserved hypothetical prote... 120 2e-25
gi|78187905|ref|YP_375948.1| hypothetical protein Plut_2063... 118 7e-25
gi|167752881|ref|ZP_02425008.1| hypothetical protein ALIPUT... 108 1e-21
gi|159896610|ref|YP_001542857.1| hypothetical protein Haur_... 79 5e-13
gi|163847010|ref|YP_001635054.1| hypothetical protein Caur_... 76 5e-12
gi|116671434|ref|YP_832367.1| hypothetical protein Arth_288... 64 2e-08
gi|118046552|ref|ZP_01515201.1| conserved hypothetical prot... 64 3e-08
gi|119964237|ref|YP_948586.1| hypothetical protein AAur_287... 64 3e-08
gi|163840422|ref|YP_001624827.1| methyltransferase [Renibac... 60 2e-07
gi|168704435|ref|ZP_02736712.1| hypothetical protein GobsU_... 60 3e-07
gi|152964703|ref|YP_001360487.1| hypothetical protein Krad_... 59 6e-07
>gi|34540831|ref|NP_905310.1| hypothetical protein PG1098 [Porphyromonas gingivalis W83]
gi|34397145|gb|AAQ66209.1| hypothetical protein PG_1098 [Porphyromonas gingivalis W83]
Length = 407
Score = 810 bits (2091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/407 (97%), Positives = 399/407 (98%)
Query: 1 MLFDEKEILAITRWAKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPG 60
MLFDEKEILAITRWAKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQW G
Sbjct: 1 MLFDEKEILAITRWAKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWAG 60
Query: 61 ISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIER 120
ISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIER
Sbjct: 61 ISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIER 120
Query: 121 NDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYA 180
NDETAVAARHNIPLLLNEGKD+NILTGDFKEYLPLIKTFH DYIYVDPARRSGADKRVYA
Sbjct: 121 NDETAVAARHNIPLLLNEGKDVNILTGDFKEYLPLIKTFHPDYIYVDPARRSGADKRVYA 180
Query: 181 IADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVR 240
IADCEPDLIPLA ELLPFCSSILAKLSPMIDLWDTLQSL HVQELHVVAAHGEVKELLVR
Sbjct: 181 IADCEPDLIPLATELLPFCSSILAKLSPMIDLWDTLQSLLHVQELHVVAAHGEVKELLVR 240
Query: 241 MSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFK 300
MSLNEATIPPEKVPIHA+NLL EDTVIPFIFTMEEERSIS+PY DSIDKYVYEPHTAL K
Sbjct: 241 MSLNEATIPPEKVPIHAINLLLEDTVIPFIFTMEEERSISIPYTDSIDKYVYEPHTALLK 300
Query: 301 AGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRA 360
AGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQL KVVP+A
Sbjct: 301 AGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLRKVVPQA 360
Query: 361 SISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLRKAE 407
SISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLRKAE
Sbjct: 361 SISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLRKAE 407
>gi|150008084|ref|YP_001302827.1| hypothetical protein BDI_1448 [Parabacteroides distasonis ATCC
8503]
gi|149936508|gb|ABR43205.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 395
Score = 288 bits (738), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 161/400 (40%), Positives = 228/400 (57%), Gaps = 7/400 (1%)
Query: 6 KEILAITRWAKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLY 65
++I ++R+ K +A+ +R+LL ++ P V Q++ ++++KLP W L
Sbjct: 2 EQIEELSRFIKEHASDDLNRLLLSASRYPGIDIPFVVDQLKSRRQIKDKLPSWYQNDRLV 61
Query: 66 IPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETA 125
P++++ EQ S T+ YK R + V DLTGGLGID K Q YIER
Sbjct: 62 FPAKIAAEQCSSEQTALYKQRLVDPQAHVCDLTGGLGIDSYFFSRKVQQVTYIERFPAYC 121
Query: 126 VAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCE 185
AA HN +L E ++ +L GD E L I D Y+DPARR +KRV+A+ DCE
Sbjct: 122 EAAIHNFKVL--EADNVTVLNGDSTELLAEIDGI--DVFYIDPARRGEGNKRVFALQDCE 177
Query: 186 PDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNE 245
PDL L L ++AKLSPM D+ TL LP V ++ V++ E KELL +
Sbjct: 178 PDLTKLVPVLFSHAPRVIAKLSPMADIRMTLDLLPGVTDIQVLSVKNECKELLFVLERGS 237
Query: 246 ATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFK 305
P I VN +SE+ F F++ EER + +Y+YEP+ ++ KAGAFK
Sbjct: 238 RNGNPR---ICCVNFISEEESESFTFSLMEEREAVASIRGEVKRYLYEPNVSILKAGAFK 294
Query: 306 TVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCR 365
VA R GL KLH +SHLYTS+ FPGR+FV++E+IPF+ K L + +P+A+I+ R
Sbjct: 295 AVATRFGLSKLHVSSHLYTSDEVVPCFPGRSFVVDEVIPFTNKQCKTLSRQIPQANITVR 354
Query: 366 NFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLRK 405
NFPLS ELR+R+K+ADGG L TT+ +G KVL+ K
Sbjct: 355 NFPLSVEELRKRTKIADGGTIYLFATTLENGDKVLIKAHK 394
>gi|154493663|ref|ZP_02032983.1| hypothetical protein PARMER_03004 [Parabacteroides merdae ATCC
43184]
gi|154086873|gb|EDN85918.1| hypothetical protein PARMER_03004 [Parabacteroides merdae ATCC
43184]
Length = 394
Score = 274 bits (700), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 157/394 (39%), Positives = 225/394 (57%), Gaps = 8/394 (2%)
Query: 13 RWAKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSL 72
++ K +A R+LL + P + QI + ++R KLP W L P++++
Sbjct: 9 QFIKEHATDDLTRLLLSAAKYPGMDIPFLVDQIAVRRQIREKLPSWFENGQLVFPAKIAA 68
Query: 73 EQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNI 132
EQ S T++YK I E + DLTGGLGID L KA Q YIER AA+HN
Sbjct: 69 EQCSSEQTAAYKQELIGESWTICDLTGGLGIDSYFLSRKAKQLTYIERFPVYCEAAKHNF 128
Query: 133 PLLLNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLA 192
+L E ++ I+ D + + + D Y+DPARR ++KRV+A+ DCEP+L L
Sbjct: 129 SVL--EANNITIINADAAQVVDTLP--EVDAFYIDPARRGESNKRVFALQDCEPNLPGLL 184
Query: 193 AELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEK 252
LL ++AKLSPM D+ TL+ LP +HV++ E KELL + P
Sbjct: 185 PALLKRSPHVIAKLSPMADIQMTLELLPGTTSVHVLSVRNECKELLFVVEREADGREP-- 242
Query: 253 VPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLG 312
+ +N D + F FT+EEERS + A + Y+YEP+ ++ KAGAFK +A R G
Sbjct: 243 -LVRCINF-GLDGMQSFSFTLEEERSAVLVPAGQVGTYLYEPNASVLKAGAFKQIAVRTG 300
Query: 313 LRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPI 372
++KL NSHLYTS+ S FPGR F ++E++ F+ + K L K +P+A+I+ RNFPLS
Sbjct: 301 VKKLQVNSHLYTSDHLVSDFPGRRFRVDEVLSFTGKLCKGLSKTIPQANITVRNFPLSVE 360
Query: 373 ELRQRSKMADGGEKTLMGTTMADGKKVLLLLRKA 406
ELR+R+K+ DGG L TT+ DG+KVL+ KA
Sbjct: 361 ELRKRTKITDGGHVYLFATTLVDGEKVLVKCSKA 394
>gi|153806987|ref|ZP_01959655.1| hypothetical protein BACCAC_01264 [Bacteroides caccae ATCC 43185]
gi|149130107|gb|EDM21317.1| hypothetical protein BACCAC_01264 [Bacteroides caccae ATCC 43185]
Length = 396
Score = 265 bits (678), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 148/379 (39%), Positives = 214/379 (56%), Gaps = 22/379 (5%)
Query: 32 DIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREG 91
DIP A TQI K+P W I ++ P LSLEQ S +T+ YK+ + +G
Sbjct: 34 DIP-----AAITQIAGRQIAAEKIPSWKEIDDIWYPKHLSLEQCSSEITARYKASLL-QG 87
Query: 92 TKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLN---ILTGD 148
+ DLTGG GID L + Y+ER E A HN P L DLN + D
Sbjct: 88 ESLADLTGGFGIDCSFLATGFRSATYVERQAELCAIAAHNFPAL-----DLNHISVRNDD 142
Query: 149 FKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSP 208
YL + D I++DPARR+ + AI+DCEP++ L LL + I+ KLSP
Sbjct: 143 GVAYLEAMSPV--DCIFLDPARRNEHGGKTVAISDCEPNVAELEGLLLSKANRIMIKLSP 200
Query: 209 MIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLS--EDTV 266
M+DL L+ LPH QE+H+++ + E KELL+ + PE++P+H VNL + E
Sbjct: 201 MLDLSQALKELPHTQEVHIISVNNECKELLLLL----GQTAPEEIPVHCVNLSTKGEQEK 256
Query: 267 IPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSE 326
F+FT E+E+ Y D++ Y+YEP+ +L KAGAF+++A ++KLHPNSHLYTSE
Sbjct: 257 QLFVFTREQEQHSECSYTDTLGNYLYEPNASLLKAGAFRSIAAAYSVKKLHPNSHLYTSE 316
Query: 327 AYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEK 386
FPGRTF + + +K+ + +A+I+ RNFP + ELR+R K+A+GG+
Sbjct: 317 TQIEGFPGRTFRIINRCSLNKKEIKENLSDLKKANITVRNFPATVAELRKRIKLAEGGDT 376
Query: 387 TLMGTTMADGKKVLLLLRK 405
L +T+ DG+K+L+ K
Sbjct: 377 YLFASTLNDGQKILIRCGK 395
>gi|167763898|ref|ZP_02436025.1| hypothetical protein BACSTE_02280 [Bacteroides stercoris ATCC
43183]
gi|167698014|gb|EDS14593.1| hypothetical protein BACSTE_02280 [Bacteroides stercoris ATCC
43183]
Length = 404
Score = 263 bits (673), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 145/379 (38%), Positives = 217/379 (57%), Gaps = 15/379 (3%)
Query: 32 DIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIRE- 90
D+P V QI K+P W + P+ LS+EQ S VT++YK + + +
Sbjct: 35 DVPAAIIQIVGRQIA-----EEKIPAWAAREGILYPTHLSMEQCSSEVTANYKVKIVSDT 89
Query: 91 --GTK--VVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILT 146
G++ DLTGG GID L + Q Y+ER + A HN LL K + +
Sbjct: 90 GYGSRKTFTDLTGGFGIDCAFLSACFQQSAYVERQETLCTIAAHNFSLL--NLKQVMVCH 147
Query: 147 GDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKL 206
D YL +++ D+I++DPARR G + AI++CEPD+ L LL S +L KL
Sbjct: 148 EDSIRYLQMMEPV--DWIFIDPARRDGHGGKTIAISECEPDVSALENLLLEKASHVLVKL 205
Query: 207 SPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTV 266
SPM+DL L L HVQ HVV+ + E KELL+ + + +P E++PIH +NL + V
Sbjct: 206 SPMLDLTLALHDLKHVQAAHVVSVNNECKELLLVLERGKELVP-EEIPIHCINLTASQKV 264
Query: 267 IPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSE 326
+FT ++E+ + PY ++ Y+YEP+ ++ KAGAF++V+ + KLHPNSHLYTS+
Sbjct: 265 QKLLFTRQQEKEHTCPYTPTLKTYLYEPNASILKAGAFRSVSSIYNVEKLHPNSHLYTSD 324
Query: 327 AYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEK 386
Y FPGR F + + + LK+L +A+++ RNFP S ELR+R K+A+GG+
Sbjct: 325 EYIPDFPGRKFRITDSSSLNKKELKKLIGTEKKANLTVRNFPASVAELRKRLKLAEGGDM 384
Query: 387 TLMGTTMADGKKVLLLLRK 405
L TT+AD KK+L+ ++
Sbjct: 385 YLFATTLADEKKLLIACKQ 403
>gi|29348417|ref|NP_811920.1| hypothetical protein BT_3008 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340321|gb|AAO78114.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 395
Score = 259 bits (663), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/367 (38%), Positives = 212/367 (57%), Gaps = 13/367 (3%)
Query: 43 TQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLG 102
TQI K+P W ++ P LSLEQ S VT+ YK+ ++ G + DLTGG G
Sbjct: 39 TQIAGRQVAAEKIPSWRDTDDIWYPKHLSLEQCSSEVTARYKATLLK-GNSLTDLTGGFG 97
Query: 103 IDFIALMSKASQGIYIERNDETAVAARHNIPLL-LNEGKDLNILTGDFKEYLPLIKTFHS 161
ID L ++ Y+ER E A HN P+L LN +N+ D YL +
Sbjct: 98 IDCAFLAARFKSATYVERQQELCEIAAHNFPILNLNH---INVKNEDGVSYLQAMSPV-- 152
Query: 162 DYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPH 221
D I++DPARR+ + AI+DCEPD+ L LL ++ KLSPM+DL L+ L H
Sbjct: 153 DCIFLDPARRNEHGGKTVAISDCEPDVAELEELLLNKAEQVMVKLSPMLDLSLALKELQH 212
Query: 222 VQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIP--FIFTMEEERSI 279
VQE+H+++A+ E KELL+ + A E++ IH VNL + F+FT E+E+
Sbjct: 213 VQEVHIISANNECKELLLILGQASA----EEISIHCVNLPTRGAQEEQHFVFTREQEQCS 268
Query: 280 SVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVL 339
Y D+++ Y+YEP+ +L KAGAF+++A ++KLHPNSHLYTS+ +FPGR F +
Sbjct: 269 ECNYTDTLENYLYEPNASLLKAGAFRSIASAFPVKKLHPNSHLYTSDVLVESFPGRAFHI 328
Query: 340 EEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKV 399
+ LK+ + +A+I+ RNFP + ELR+R K+++GG+ L +T+ +G+KV
Sbjct: 329 ISQCSLNKKELKESLGDLKKANITVRNFPATVAELRKRIKLSEGGDTYLFASTLNNGQKV 388
Query: 400 LLLLRKA 406
L+ KA
Sbjct: 389 LIRCEKA 395
>gi|160884232|ref|ZP_02065235.1| hypothetical protein BACOVA_02210 [Bacteroides ovatus ATCC 8483]
gi|156110574|gb|EDO12319.1| hypothetical protein BACOVA_02210 [Bacteroides ovatus ATCC 8483]
Length = 396
Score = 259 bits (661), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/368 (38%), Positives = 214/368 (58%), Gaps = 13/368 (3%)
Query: 41 VATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGG 100
V TQI K+P W I ++ P LSLEQ S +T+ YK+R + +G + DLTGG
Sbjct: 38 VITQIAGRQVAAEKIPSWREIEEIWYPKHLSLEQCSSEITARYKARLL-QGDSLTDLTGG 96
Query: 101 LGIDFIALMSKASQGIYIERNDETAVAARHNIPLL-LNEGKDLNILTGDFKEYLPLIKTF 159
GID L + Y+ER +E A HN P+L LN +N+ D YL +
Sbjct: 97 FGIDCSFLATGFKSATYVERQEELCEIAAHNFPILNLNH---INVRNEDGVAYLQSMSPV 153
Query: 160 HSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSL 219
D I++DPARR+ + AI+DCEP++ L A LL + ++ KLSPM+DL L+ L
Sbjct: 154 --DCIFLDPARRNEHGGKTVAISDCEPNVAELEALLLNKANRVMIKLSPMLDLSLALKEL 211
Query: 220 PHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIP--FIFTMEEER 277
H QE+H+++ + E KELL+ + P ++ IH VNLL++ T +FT E+E+
Sbjct: 212 KHTQEIHILSVNNECKELLILL----GQTSPTEITIHCVNLLTKGTQEEQHLVFTREQEQ 267
Query: 278 SISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTF 337
Y DS+ Y+YEP+ +L KAGAF+++A +RKLHPNSHLYTS+++ FPGR F
Sbjct: 268 RSQCTYTDSLGNYLYEPNASLLKAGAFRSIAAAYPVRKLHPNSHLYTSDSFIENFPGRIF 327
Query: 338 VLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGK 397
+ F+ +K+ + +A+++ RNFP + ELR+R + +GG+ L +T+ +G+
Sbjct: 328 RIVNQCSFNKKEVKENLADLKKANVTVRNFPATVAELRKRLHLTEGGDTYLFASTLNNGQ 387
Query: 398 KVLLLLRK 405
KV++ K
Sbjct: 388 KVIIRCEK 395
>gi|60683732|ref|YP_213876.1| hypothetical protein BF4320 [Bacteroides fragilis NCTC 9343]
gi|60495166|emb|CAH09987.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 399
Score = 246 bits (629), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 142/366 (38%), Positives = 203/366 (55%), Gaps = 9/366 (2%)
Query: 44 QIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGI 103
QI W +K+P W L P LSLEQ S VT+ YK+ + G +VDLTGG GI
Sbjct: 40 QIAGWQATVSKIPSWHATEGLLYPRHLSLEQCSSEVTALYKASLVH-GEGLVDLTGGFGI 98
Query: 104 DFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDY 163
D L ++ YIER +E A HN PLL K + + GD EYL + D
Sbjct: 99 DCAFLATQFKTVTYIERQEELCELAMHNFPLL--GLKHIRVQNGDGVEYLQQMPAV--DC 154
Query: 164 IYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQ 223
I++DPARR+ ++ AI+DCEP++ L LL ++ KLSPM+DL ++ + HV
Sbjct: 155 IFLDPARRNEHGGKIVAISDCEPNVATLEKLLLEKGKQVMIKLSPMLDLSLAIRDMQHVS 214
Query: 224 ELHVVAAHGEVKELLV--RMSLNEATIPPE-KVPIHAVNLLSEDTVIPFIFTMEEERSIS 280
E H+V+ + E KELL+ R + IP PI +N +++ + F+FT E E++
Sbjct: 215 EAHIVSVNNECKELLLLLRPGDDSPEIPSAMSQPIVCINFANQE-IQRFVFTRESEQATE 273
Query: 281 VPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLE 340
Y I Y+YEP+ ++ KAGAF+++A L KLH NSHLYTS FPGR F +
Sbjct: 274 CSYTHEIGTYLYEPNASILKAGAFRSIASSFHLSKLHANSHLYTSNERIEKFPGRIFRIT 333
Query: 341 EIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVL 400
+ LK + + +A+I+ RNFP S ELR+R K+ DGG+ L TT+ D +K++
Sbjct: 334 GYSSLNKKELKNILNGLDKANITTRNFPQSVAELRKRLKLTDGGDIYLFATTLNDERKII 393
Query: 401 LLLRKA 406
+ KA
Sbjct: 394 IRCEKA 399
>gi|53715803|ref|YP_101795.1| hypothetical protein BF4524 [Bacteroides fragilis YCH46]
gi|52218668|dbj|BAD51261.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 399
Score = 244 bits (624), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 142/366 (38%), Positives = 202/366 (55%), Gaps = 9/366 (2%)
Query: 44 QIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGI 103
QI W +K+P W L P LSLEQ S VT+ YK+ + G +VDLTGG GI
Sbjct: 40 QIAGWQATVSKIPSWHATEGLLYPRHLSLEQCSSEVTALYKASLVH-GEGLVDLTGGFGI 98
Query: 104 DFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDY 163
D L ++ YIER +E A HN PLL K + + GD EYL + D
Sbjct: 99 DCAFLATQFKTVTYIERQEELCELAMHNFPLL--GLKHIRVQNGDGVEYLQQMPAV--DC 154
Query: 164 IYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQ 223
I++DPARR+ + AI+DCEP++ L LL ++ KLSPM+DL ++ + HV
Sbjct: 155 IFLDPARRNEHGGKTVAISDCEPNVATLEKLLLEKGKQVMIKLSPMLDLSLAIRDMQHVS 214
Query: 224 ELHVVAAHGEVKELLV--RMSLNEATIPPE-KVPIHAVNLLSEDTVIPFIFTMEEERSIS 280
E H+V+ + E KELL+ R + IP PI +N +++ + F+FT E E++
Sbjct: 215 EAHIVSVNNECKELLLLLRPGDDSPEIPSAMSQPIVCINFANQE-IQRFVFTRESEQATE 273
Query: 281 VPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLE 340
Y I Y+YEP+ ++ KAGAF+++A L KLH NSHLYTS FPGR F +
Sbjct: 274 CSYTHEIGTYLYEPNASILKAGAFRSIASSFHLSKLHANSHLYTSNERIEKFPGRIFRIT 333
Query: 341 EIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVL 400
+ LK + + +A+I+ RNFP S ELR+R K+ DGG+ L TT+ D +K++
Sbjct: 334 GYSGLNKKELKNILNGLDKANITTRNFPQSVAELRKRLKLTDGGDIYLFATTLNDERKII 393
Query: 401 LLLRKA 406
+ KA
Sbjct: 394 IRCEKA 399
>gi|160891672|ref|ZP_02072675.1| hypothetical protein BACUNI_04127 [Bacteroides uniformis ATCC 8492]
gi|156859079|gb|EDO52510.1| hypothetical protein BACUNI_04127 [Bacteroides uniformis ATCC 8492]
Length = 415
Score = 244 bits (622), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 142/381 (37%), Positives = 201/381 (52%), Gaps = 26/381 (6%)
Query: 40 AVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIRE--------- 90
A TQI W + K+P W + P+ LSLEQ S T+ YK+ I
Sbjct: 36 AAITQISGWQIAKEKIPAWAENEHILYPAHLSLEQCSSQATAQYKAEIITNLLHTEQEHP 95
Query: 91 ---------GTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLL-LNEGK 140
GT DLTGG GID L S+ Y+ER + A HN P+L LN
Sbjct: 96 AQNSTPTSAGT-FTDLTGGFGIDCAYLSSRFGHATYVERQETLCQIAAHNFPVLGLNH-- 152
Query: 141 DLNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCS 200
+++ D +L ++ D I++DPARR G + AI DCEPD+ L LL
Sbjct: 153 -ISVCHADSVRHLQEMEPV--DCIFIDPARRDGHGGKTVAIGDCEPDIAALEELLLRKAR 209
Query: 201 SILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNL 260
+L KLSPM+DL L L HV+E H+V+ E KELL+ + E +P + +PIH VN
Sbjct: 210 HVLVKLSPMLDLTLALNDLKHVREAHIVSVGNECKELLLLLGQGEG-VPADNIPIHCVNF 268
Query: 261 LSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNS 320
+FT +E+ + PY + Y+YEP+ ++ KAGAF++++ + KLHPNS
Sbjct: 269 TGVPAPQALVFTRRQEKERACPYTPQLKSYLYEPNASVLKAGAFRSLSSLYKVEKLHPNS 328
Query: 321 HLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKM 380
HLYTS + FPGR F + F +K++ +A+++ RNFP + ELR+R K+
Sbjct: 329 HLYTSGHFLPDFPGRKFQITSSCGFGKKEVKEMLAAEKKANLTVRNFPATVAELRKRLKL 388
Query: 381 ADGGEKTLMGTTMADGKKVLL 401
A+GG L TT+AD KKVL+
Sbjct: 389 AEGGGTYLFATTLADEKKVLI 409
>gi|150003879|ref|YP_001298623.1| hypothetical protein BVU_1312 [Bacteroides vulgatus ATCC 8482]
gi|149932303|gb|ABR39001.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 394
Score = 230 bits (587), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 141/385 (36%), Positives = 208/385 (54%), Gaps = 20/385 (5%)
Query: 28 LGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRF 87
L S P AA TQI K+P W +S+L+ P LS+EQ S T+ YK+
Sbjct: 24 LQSKKYPDVDMAAAVTQIAGRQVAARKIPSWYPVSALWYPPHLSMEQCSSEATALYKASL 83
Query: 88 IREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLL------LNEGKD 141
+ EG DLTGG GID + Q Y+ER A HN PLL ++
Sbjct: 84 L-EGDTFADLTGGFGIDCSFISRNFKQADYVERQSGLCELALHNFPLLGLGHIRIHNRDG 142
Query: 142 LNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSS 201
++ L +E LP+ D +++DPARR G + AI+DCEPD+ L L+
Sbjct: 143 ISYL----QEMLPV------DCLFLDPARRDGHGGKTVAISDCEPDVTVLEPLLVDKAKK 192
Query: 202 ILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLL 261
++ KLSPM+DL L L V+ +H+VA + E KELL + L + ++ E V IH ++
Sbjct: 193 VMVKLSPMLDLSLALNELKTVRAVHIVAVNNECKELL--LILQKESVSSE-VSIHCEHIA 249
Query: 262 SEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSH 321
+ FT+++E++ AD + Y+YEP+ A+ KAGAF+++ + KLH NSH
Sbjct: 250 GNGESRHYTFTLKQEKTSPCLLADEVGTYLYEPNAAILKAGAFRSLTQTYPVAKLHLNSH 309
Query: 322 LYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMA 381
LYTS + FPGR F +E + F LK + +A+I+ RNFPLS ELR+R K+
Sbjct: 310 LYTSVSLVPDFPGRRFRVEAVSGFGKKELKAFLMDMDKANITIRNFPLSVAELRKRLKLK 369
Query: 382 DGGEKTLMGTTMADGKKVLLLLRKA 406
+GG+ + TT++ G+KVL+ +K
Sbjct: 370 EGGDDYIFATTLSGGQKVLIRGKKC 394
>gi|163754903|ref|ZP_02162024.1| hypothetical protein KAOT1_02777 [Kordia algicida OT-1]
gi|161324970|gb|EDP96298.1| hypothetical protein KAOT1_02777 [Kordia algicida OT-1]
Length = 394
Score = 228 bits (580), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 137/397 (34%), Positives = 224/397 (56%), Gaps = 13/397 (3%)
Query: 11 ITRWAKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRL 70
I ++ + +P ++LL ++ + QI+ R KLP W ++Y P++L
Sbjct: 10 IQQFINDHLKDNPTKLLLKHKEVFGVPFTEIIEQIQAKTRCEKKLPTWFNAENIYYPNKL 69
Query: 71 SLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARH 130
++EQ+S T++YK++ + G ++DLTGG G+D A + Q + E + + + H
Sbjct: 70 NIEQTSSEKTANYKAKLL-SGNSLIDLTGGFGVDCYAFSKQFQQVTHCEISPKLSEIVTH 128
Query: 131 NIPLLLNEGKDLNILTGDFKEYLP-LIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLI 189
N L + +++ + D YL KT+ D IY+DP+RR+ +V+ + DC PD++
Sbjct: 129 NYNSL--QVENIQTVNQDGIAYLQNSSKTY--DCIYIDPSRRNDVKGKVFLLEDCLPDVV 184
Query: 190 PLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIP 249
L +IL K SP++D+ + L++L HV+E+HVVA EVKELL + N T
Sbjct: 185 SHQELLFAHADTILIKTSPLLDITNGLRALQHVKEVHVVAVQNEVKELLWLLDKNTNT-- 242
Query: 250 PEKVPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAY 309
E V I VNL +++ F F + E+ +V Y + KY+YEP+ A+ KAG F ++A
Sbjct: 243 -ENVSIKTVNLTNKEDEF-FNFQLSAEKEFTVVY-NHPKKYLYEPNAAIMKAGGFSSIAK 299
Query: 310 RLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPL 369
L KL +SHLYTS S FPGR F +++I+P++ ++K+ +P+A+I+ RNFP
Sbjct: 300 TFQLEKLAQHSHLYTSNQLIS-FPGRRFEVQKILPYTKKIIKKELN-LPKANITTRNFPE 357
Query: 370 SPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLRKA 406
S +R++ +ADGGE L TT+ GKK ++ KA
Sbjct: 358 SVSNIRKKLTIADGGEDFLFLTTVFTGKKAVIHCVKA 394
>gi|110639758|ref|YP_679968.1| hypothetical protein CHU_3389 [Cytophaga hutchinsonii ATCC 33406]
gi|110282439|gb|ABG60625.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 399
Score = 219 bits (557), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 131/370 (35%), Positives = 206/370 (55%), Gaps = 22/370 (5%)
Query: 41 VATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGG 100
+A QIE W + KLP W ++ P RLS+EQ S T+ YKS + G ++VDLTGG
Sbjct: 46 IAAQIEGWQKASQKLPLWASTENIIYPIRLSMEQCSSERTAHYKSTLM-SGERLVDLTGG 104
Query: 101 LGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFH 160
G+D L ++ IYIE+ ++ A A HN L ++ + T D EYL I
Sbjct: 105 FGVDSFYLSKSFNEVIYIEQQEDLAEIAAHNFSTL--NQNNIVVKTADSDEYLKDIAE-K 161
Query: 161 SDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLP 220
D I++DPARR ++VY ++DCEP+++ L + IL K SPM+D+ + + L
Sbjct: 162 IDVIFLDPARRKEM-RKVYKLSDCEPNVVALQSFYFSKADVILIKTSPMLDITEATRQLT 220
Query: 221 HVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSED---TVIPFIFTMEEER 277
++E+HVV+ E KE+L + N + A + + D + F FT+ E+
Sbjct: 221 GIKEIHVVSVDNECKEVLYLLEKNYSG--------KATYICAHDYKKQLQLFSFTIAAEQ 272
Query: 278 SISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTF 337
++++ Y+ + Y+YEP+ ++ KAG F ++ + + KLH +SHLYTS + S FPGRTF
Sbjct: 273 AVTLSYSAPL-AYLYEPNPSILKAGGFNSITQKYEVFKLHASSHLYTSASEVSDFPGRTF 331
Query: 338 VLEEIIPFSTSVLKQLGKVVP--RASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMAD 395
++E + +S K + VP +A+I RNFP S +R+++ + DGG + TT+ D
Sbjct: 332 KIKETVGYSK---KDIQAAVPGGKANIQTRNFPDSVEAIRKKTGLKDGGNIFIFATTLHD 388
Query: 396 GKKVLLLLRK 405
KVLL+ K
Sbjct: 389 LSKVLLICEK 398
>gi|86130473|ref|ZP_01049073.1| hypothetical protein MED134_06119 [Cellulophaga sp. MED134]
gi|85819148|gb|EAQ40307.1| hypothetical protein MED134_06119 [Dokdonia donghaensis MED134]
Length = 393
Score = 217 bits (553), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 130/368 (35%), Positives = 209/368 (56%), Gaps = 20/368 (5%)
Query: 41 VATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGG 100
+A Q+E + + KLP+W + + P +L++EQ+S T+SYK+ I +GT + D TGG
Sbjct: 40 LAQQLEGKRKAKGKLPRWHNTAGILFPPKLNMEQTSSEATASYKAS-IMQGTTLADGTGG 98
Query: 101 LGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFH 160
GID Q +IE + + A+ N +L E ++ + D Y T
Sbjct: 99 FGIDAYHFAQTNKQVTHIEMDASLSRFAQSNAEVLKQE--NIEFIVDDSMHYFETCDT-R 155
Query: 161 SDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELL-PFCSSILAKLSPMIDLWDTLQSL 219
D I++DP RR+ + +V+ + DC P+ +PL LL C+++ K SP++D+ ++ L
Sbjct: 156 FDTIFLDPGRRTDSKGKVFMLKDCLPN-VPLYKNLLLSKCNTLWIKTSPILDIAAGIEEL 214
Query: 220 PHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFT--MEEER 277
V E+H+VA EVKELL +++ + + VNL S D V+ F + + +
Sbjct: 215 RSVTEIHIVAVKNEVKELLWKLTSQRC----KDIKFTVVNLQSSDPVVSFSHSDALNAQA 270
Query: 278 SISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTF 337
I +P KY+YEP++ L K+GAF V+ + KLH +SHLYTSE S F GR F
Sbjct: 271 VIDIP-----QKYLYEPNSTLMKSGAFNWVSTYFNVSKLHEHSHLYTSEELIS-FAGRRF 324
Query: 338 VLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGK 397
V+E+I+P+S + K+LG + +A+I+ RNFP+ +R++ K+ DGG+ L TT+ DGK
Sbjct: 325 VIEKIVPYSKRIAKELG--ITKANITTRNFPMPVDVIRKKLKIKDGGDVYLFFTTLDDGK 382
Query: 398 KVLLLLRK 405
KV++ +K
Sbjct: 383 KVVISTKK 390
>gi|88712929|ref|ZP_01107014.1| hypothetical protein FB2170_09831 [Flavobacteriales bacterium
HTCC2170]
gi|88708827|gb|EAR01062.1| hypothetical protein FB2170_09831 [Flavobacteriales bacterium
HTCC2170]
Length = 374
Score = 214 bits (544), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 135/379 (35%), Positives = 212/379 (55%), Gaps = 31/379 (8%)
Query: 38 RAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDL 97
+ + QIE + + KLP W ++Y P++L++EQ S +T+ YKSR I+ G +VDL
Sbjct: 15 QKELTEQIEAKLKSQKKLPTWFNNKNIYYPNKLNIEQCSSEITAEYKSRIIK-GKSLVDL 73
Query: 98 TGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTG-----DFKEY 152
TGG G+D K Q + E N + HN +L E NILT DF E
Sbjct: 74 TGGFGVDSYFFSKKIEQVFHCEINRNLSEIVTHNYAILGVE----NILTKTANGIDFLEQ 129
Query: 153 LPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFC----SSILAKLSP 208
++IY+DP+RRS + +V+ ++DC P++I E LP C +IL K SP
Sbjct: 130 ----SNASFEWIYLDPSRRSESKGKVFLLSDCTPNII----EHLPLCFEKSKNILIKTSP 181
Query: 209 MIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIP 268
++D+ + L+ L +V+E+H+VA + EVKELL + + +V IH +NL E
Sbjct: 182 LLDISNGLKQLKNVKEVHIVALNNEVKELLWVLQHDYLG----EVRIHTINLKKE-LEER 236
Query: 269 FIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAY 328
+ F EE++ S+ ++ Y+YEP+ A+ K+GAFK ++ + KLH +SHLYTS
Sbjct: 237 YDFNWSEEKNTSLSTEKPLN-YLYEPNVAILKSGAFKLLSNDFKISKLHEHSHLYTSNKL 295
Query: 329 ESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTL 388
FPGRTF + EI+ + L+ G +A+I+CRNFP S ++R++SK+ DGG+ L
Sbjct: 296 -IQFPGRTFKILEILKYHKKNLR--GFSGAKANITCRNFPESVAQIRRKSKIKDGGDSFL 352
Query: 389 MGTTMADGKKVLLLLRKAE 407
T K++++ +K +
Sbjct: 353 FFTKDYKESKIVIVCKKQK 371
>gi|163788239|ref|ZP_02182685.1| hypothetical protein FBALC1_07658 [Flavobacteriales bacterium
ALC-1]
gi|159876559|gb|EDP70617.1| hypothetical protein FBALC1_07658 [Flavobacteriales bacterium
ALC-1]
Length = 392
Score = 213 bits (543), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 124/368 (33%), Positives = 204/368 (55%), Gaps = 17/368 (4%)
Query: 40 AVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTG 99
+ QIE R KLP W ++Y P++L++EQ+S VT+ YK+ + G ++DLTG
Sbjct: 39 TIIEQIEAKKRCEKKLPTWYTTKNIYYPNKLNIEQTSSEVTAKYKASLV-SGKSLIDLTG 97
Query: 100 GLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTF 159
G GID + E N E + + N L +++ + + + ++K
Sbjct: 98 GFGIDAYYFSKHIESITHCEINTELSDIVKQNYKTL-----NVSNIACKNESGIDVLKQL 152
Query: 160 HS--DYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQ 217
D+IY+DP+RR + ++V+ ++DC P++ L + ++ K SP++DL TL
Sbjct: 153 DELFDWIYIDPSRRDDSKQKVFLLSDCIPNVKTFQNLFLKYAQQVMVKTSPLLDLTATLS 212
Query: 218 SLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEER 277
L +V+E+H+VA + EVKELL + E + + I VNL E+ + F F + E
Sbjct: 213 DLKYVKEIHIVAVNNEVKELLWIL---ERDYDGDAI-IKTVNLQKEN-IQNFEFNFKNEP 267
Query: 278 SISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTF 337
S Y++ + Y+YEP+ A+ KAGAF T++ L + KLH +SHLYTS+ FPGR F
Sbjct: 268 SSQAIYSEPLS-YLYEPNVAILKAGAFNTISELLNINKLHKHSHLYTSDKI-IDFPGRRF 325
Query: 338 VLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGK 397
+E +PF+ V + + +A+I+ RNFPLS ++R++ K+ DGG L TT + +
Sbjct: 326 KIENRLPFNKKVF--FKEKITKANITTRNFPLSVNDIRKKLKVKDGGNLYLFFTTNLNNE 383
Query: 398 KVLLLLRK 405
K++L+ K
Sbjct: 384 KIILVCSK 391
>gi|154486382|ref|ZP_02027789.1| hypothetical protein BIFADO_00194 [Bifidobacterium adolescentis
L2-32]
gi|154084245|gb|EDN83290.1| hypothetical protein BIFADO_00194 [Bifidobacterium adolescentis
L2-32]
Length = 424
Score = 212 bits (540), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 130/391 (33%), Positives = 192/391 (49%), Gaps = 42/391 (10%)
Query: 40 AVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIR---------- 89
A QI W RNKLPQW + + P+ +S+EQ S T+ YK+ R
Sbjct: 40 AALDQIAGWQIARNKLPQWAACADIVYPAHISMEQCSSQFTAQYKAEIARRLLRSLPQSA 99
Query: 90 ----EGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLL-LNEGKDLNI 144
+ DLTGG G+DF L Y+ER A HN+ L L + + +
Sbjct: 100 GQTANDATMTDLTGGFGVDFSYLARGFGHATYVERQSHLCELAAHNMAALGLTQAQ---V 156
Query: 145 LTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILA 204
+ GD EYL ++ IY+DPARR R YAI DC PD++ L LL ++
Sbjct: 157 VCGDGVEYLRAMEPVQ--LIYIDPARRDEHGARTYAIEDCTPDVLALRDLLLAKARYVMI 214
Query: 205 KLSPMIDLWDTLQSLPH-VQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSE 263
KLSPM+D + V E+H+V+ E KELL+ + A I + +
Sbjct: 215 KLSPMLDWRKAVDDFAGTVAEVHIVSTGNECKELLLVLDGKVAGITSDAA--------AA 266
Query: 264 DTVIPFIFTMEEERSI---SVPYADSID----------KYVYEPHTALFKAGAFKTVAYR 310
DT P ++ + +++ + + Y + +Y+YEP+ ++ KAG F V R
Sbjct: 267 DTRAPHVYCVNDDQRLDYDAAAYTRGLRIGDAPLPHELRYLYEPNASIMKAGCFDVVEAR 326
Query: 311 LGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLS 370
G ++ P+SHL+ S+ FPGR F +E I LK+L + RA+I+ RNFPL+
Sbjct: 327 FGAVQIGPSSHLFVSDEPVDGFPGRGFAIETIGGMGKKELKRLLSGLDRANIAVRNFPLT 386
Query: 371 PIELRQRSKMADGGEKTLMGTTMADGKKVLL 401
+LR++ K+ADGG+ L GTTM G VL+
Sbjct: 387 APQLRKKLKLADGGDAYLFGTTMQGGDHVLI 417
>gi|124005819|ref|ZP_01690657.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123988502|gb|EAY28143.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length = 398
Score = 211 bits (536), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/394 (33%), Positives = 212/394 (53%), Gaps = 17/394 (4%)
Query: 15 AKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQ 74
A LY ++P ++L ++ + QI+ +++ KLP W ++ P LS+EQ
Sbjct: 20 ANLY--KTPADLILAASPYSKPEMTHIVGQIQARRKIQAKLPTWFDTPNIIYPPLLSIEQ 77
Query: 75 SSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPL 134
S + + +K++ I G +VDLTGG G+D Q Y+E++ E A +HN
Sbjct: 78 CSSEIAALHKAQ-ILSGNTLVDLTGGFGVDSYHFAQTFDQVYYVEQHQELAKLVQHNFEA 136
Query: 135 LLNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAE 194
++ I + +L + D +Y+DPARR A +V+ + +C P+L+ L
Sbjct: 137 F--GVTNIQIKAQSAESFLKEVDKV--DAVYIDPARRDDAKNKVFRLEECTPNLLELLPV 192
Query: 195 LLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVP 254
L +L KLSPM+D+ ++ L V ++ VVA + E KELL + N TIP
Sbjct: 193 LWQKTDQLLIKLSPMLDIDLGIRQLEQVAKVVVVAINNECKELLFVLKNNHQTIPH---- 248
Query: 255 IHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLR 314
I A+NL + F FT EER+ V YA+ + +Y+YEP+ A+ KAGAFK +A GL
Sbjct: 249 IEALNLNAHKPAQYFSFTRNEERTSQVVYAEPM-QYLYEPNVAILKAGAFKQIAQCFGLA 307
Query: 315 KLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVP--RASISCRNFPLSPI 372
KLHP+SHLYTS + FPGR+F ++ + ++ K++ K P +A+I+ RNFP S
Sbjct: 308 KLHPHSHLYTSNTWLKDFPGRSFKIKGVCRYTK---KEVAKYCPAKKANITARNFPDSVA 364
Query: 373 ELRQRSKMADGGEKTLMGTTMADGKKVLLLLRKA 406
+R++ ++ DGG L T + ++++ KA
Sbjct: 365 LVRKKLQLKDGGNDYLFVTQTQAHRALVIVTEKA 398
>gi|150025671|ref|YP_001296497.1| hypothetical protein FP1621 [Flavobacterium psychrophilum JIP02/86]
gi|149772212|emb|CAL43688.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 392
Score = 211 bits (536), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/400 (32%), Positives = 213/400 (53%), Gaps = 19/400 (4%)
Query: 9 LAITRWAKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPS 68
I + L + + ++ L N P + TQI + + KLP W ++ P+
Sbjct: 8 FEIQEFIALNIDANISKLALQKNPFPEIPWVEILTQISAKSKAKEKLPTWYAHQNIMYPN 67
Query: 69 RLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAA 128
++S+EQ+S T+ YKS + G ++DLTGG G+D K +Q + E N E +
Sbjct: 68 KISVEQTSSEKTAYYKSNLVS-GESLIDLTGGFGVDDYYFSKKINQVYHCELNRELSEIV 126
Query: 129 RHNIPLLLNEGKDLNILTGD-FKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPD 187
HN +L K++ L GD F+ L + F D+IY+DP+RRS +V+ + DC P+
Sbjct: 127 VHNFEVL--NQKNITCLQGDSFETLKKLNQKF--DWIYIDPSRRSDTKGKVFMLKDCLPN 182
Query: 188 LIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEAT 247
+ L + F + IL K +P++D+ L L +V+ +H++A + EVKELL + N
Sbjct: 183 VPKLLEDYFKFSNKILIKTAPILDITSGLNELQNVKTIHIIAINNEVKELLWEIEQN--- 239
Query: 248 IPPEKVPIHAVNLLSE-DTVIPFIFTME-EERSISVPYADSIDKYVYEPHTALFKAGAFK 305
+ + I +N E + V FI E E + ++P KY+YEP+ AL K+GAF+
Sbjct: 240 -YKDAITIKTINFNKEKEDVFNFILNTEIENKKYTLP-----KKYIYEPNAALLKSGAFE 293
Query: 306 TVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCR 365
+A + KLH +SHLYTS FPGR F ++ ++ + +K+ + +A+I+ R
Sbjct: 294 LIANHFKIEKLHQHSHLYTSNEI-IDFPGRIFEIKNRFEYNKNEMKEFLENT-KANITTR 351
Query: 366 NFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLRK 405
NFP + +R++ K++DGG + TT A+ K++L+ K
Sbjct: 352 NFPETVENIRKKWKISDGGNSYVFFTTDANNYKIVLICAK 391
>gi|23335244|ref|ZP_00120482.1| COG0500: SAM-dependent methyltransferases [Bifidobacterium longum
DJO10A]
Length = 431
Score = 204 bits (520), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 129/391 (32%), Positives = 197/391 (50%), Gaps = 34/391 (8%)
Query: 44 QIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIRE-----------GT 92
QI W R KLP W L P ++ +EQ S T+ YK+R + T
Sbjct: 40 QIAGWQRASLKLPDWASHDGLIFPPQVPMEQCSSQFTAQYKARLAQRLLAEEAVDDDSPT 99
Query: 93 KVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLL-LNEGKDLNILTGDFKE 151
+VDLTGG G+DF + ++ IY+E+ ARHN P+L L+ + +N + +
Sbjct: 100 SLVDLTGGFGVDFSYMSRVFNRAIYVEQQSILCDIARHNFPILGLDHAEVINDDSTAVLD 159
Query: 152 YLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMID 211
L + I++DPARR R YAIADC PD++ L LL +++ KLSPM+D
Sbjct: 160 TLGRVS-----MIFLDPARRDDHGSRTYAIADCMPDVLTLKDMLLAKAPTVMVKLSPMLD 214
Query: 212 LWDTLQSLPH-VQELHVVAAHGEVKELLVRM-----------SLNEATIPPEKVPIHAVN 259
T+ V E+H+V+ E KELL+ + N+ + K ++ N
Sbjct: 215 WHKTVADFAGAVHEVHIVSTGNECKELLLVLGRGRYASPLVVCANDEQVLSYKAGDNSDN 274
Query: 260 LLS-EDTVIPFIFTMEEERSISVPYADSID----KYVYEPHTALFKAGAFKTVAYRLGLR 314
+ D+ + T E S+S A+ D KY+YEP+ ++ KAG F + R +
Sbjct: 275 HTTISDSALAARNTCNTEDSLSEESANDFDSSHWKYLYEPNASIMKAGCFDVLEQRFAVH 334
Query: 315 KLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIEL 374
+ PNSHL+ + + FPGR+F +E I + LKQL + A+I+ RNFPL+ +L
Sbjct: 335 HISPNSHLFVAAEPIADFPGRSFAIESIATMNKKELKQLLAGLTHANIAVRNFPLTVAQL 394
Query: 375 RQRSKMADGGEKTLMGTTMADGKKVLLLLRK 405
R++ K+ DGG L TT A G+ ++L +K
Sbjct: 395 RKKLKLKDGGSTYLFATTDAQGRHLVLRTKK 425
>gi|119025051|ref|YP_908896.1| N6-adenine-specific methylase [Bifidobacterium adolescentis ATCC
15703]
gi|118764635|dbj|BAF38814.1| N6-adenine-specific methylase [Bifidobacterium adolescentis ATCC
15703]
Length = 424
Score = 204 bits (518), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 129/391 (32%), Positives = 192/391 (49%), Gaps = 42/391 (10%)
Query: 40 AVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIR---------- 89
A QI W RNKLPQW + + P+ +S+EQ S T+ YK+ R
Sbjct: 40 AALDQIAGWQIARNKLPQWAACADIVYPAHISMEQCSSQFTAQYKAEIARRLLRSLPQSA 99
Query: 90 ----EGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLL-LNEGKDLNI 144
+ DLTGG G+DF L Y+ER A HN+ L L + + +
Sbjct: 100 GQTANDATMTDLTGGFGVDFSYLARGFGHATYVERQSHLCELAAHNMAALGLTQAQ---V 156
Query: 145 LTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILA 204
+ GD EYL ++ + IY+DPARR R YAI DC PD++ L LL ++
Sbjct: 157 VCGDGVEYLRAMEP--AQLIYIDPARRDEHGARTYAIEDCTPDVLALRDLLLAKARYVMI 214
Query: 205 KLSPMIDLWDTLQSLPH-VQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSE 263
KLSPM+D + V E+H+V+ E KELL+ + K + +
Sbjct: 215 KLSPMLDWRKAVDDFAGTVAEVHIVSTGNECKELLLVLD--------GKAAGATSDAAAA 266
Query: 264 DTVIPFIFTMEEERSI---SVPYADSID----------KYVYEPHTALFKAGAFKTVAYR 310
DT P ++ + +++ + + Y + +Y+YEP+ ++ KAG F V R
Sbjct: 267 DTRAPHVYCVNDDQRLDYDAAAYTRGLRIGDAPLPHELRYLYEPNASIMKAGCFDVVEAR 326
Query: 311 LGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLS 370
G ++ P+SHL+ S+ FPGR F +E I LK+L + RA+I+ RNFPL+
Sbjct: 327 FGAVQIGPSSHLFVSDEPVDGFPGRGFAIETIGGMGKKELKRLLSGLDRANIAVRNFPLT 386
Query: 371 PIELRQRSKMADGGEKTLMGTTMADGKKVLL 401
+LR++ K+ADGG+ L GTTM G VL+
Sbjct: 387 APQLRKKLKLADGGDAYLFGTTMQGGDHVLI 417
>gi|126663636|ref|ZP_01734632.1| hypothetical protein FBBAL38_11219 [Flavobacteria bacterium BAL38]
gi|126624219|gb|EAZ94911.1| hypothetical protein FBBAL38_11219 [Flavobacteria bacterium BAL38]
Length = 398
Score = 203 bits (517), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 131/396 (33%), Positives = 210/396 (53%), Gaps = 28/396 (7%)
Query: 20 NQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAV 79
N ++ L N + + QI + ++KLP W ++ P ++S+EQ+S
Sbjct: 20 NSDSSKLALKKNPFSDVNYSIIINQIIAKKKAKDKLPTWFSTENIIYPEKISIEQTSSET 79
Query: 80 TSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEG 139
T+ YKS + G ++D TGG GID + I+ E N + + +HN +L
Sbjct: 80 TAKYKSSLVS-GEIIIDCTGGFGIDDYYFSKQFKTVIHCELNSDLSKIVKHNFKVL--NA 136
Query: 140 KDLNILTGDFKEYLPLIKTFHS--------DYIYVDPARRSGADKRVYAIADCEPDLIPL 191
++ GD E+L K +H+ D IY+DP+RR+ +V+ +ADC P+++ L
Sbjct: 137 TNIKCYQGDSTEFL---KNYHAEPVETQKFDCIYIDPSRRNDTKGKVFMLADCLPNVVEL 193
Query: 192 AAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPE 251
F +++L K +P++DL L L +V E+HVVA EVKELL ++ + T PE
Sbjct: 194 QDFYYQFTNTLLIKTAPILDLHAGLLELKNVAEIHVVAVDNEVKELLWKIE-KDFTESPE 252
Query: 252 KVPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRL 311
I AVN+ E I I E ++ S Y+ KYVYEP+ +L K+GAF+ V+
Sbjct: 253 ---IIAVNIEKEKQTITRI---ESSKNYSARYSLP-KKYVYEPNASLMKSGAFEAVSELF 305
Query: 312 GLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVV--PRASISCRNFPL 369
+ KLH +SHLYTS+ FPGR F ++ I+PF K++ + + ++S RNFP+
Sbjct: 306 LVNKLHQHSHLYTSDEI-VEFPGRKFSIDAIVPFQK---KEISAYIQGKKMNVSTRNFPI 361
Query: 370 SPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLRK 405
P E++++ K+ DGG TT + +K++LL K
Sbjct: 362 KPEEIKKKYKIQDGGTIFAFFTTNMNNEKIILLCTK 397
>gi|23466002|ref|NP_696605.1| hypothetical protein BL1446 [Bifidobacterium longum NCC2705]
gi|23326720|gb|AAN25241.1| hypothetical protein BL1446 [Bifidobacterium longum NCC2705]
Length = 400
Score = 201 bits (510), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 127/391 (32%), Positives = 196/391 (50%), Gaps = 34/391 (8%)
Query: 44 QIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIRE-----------GT 92
QI W R KLP W L P ++ +EQ S T+ YK+R +
Sbjct: 9 QIAGWQRASLKLPDWASHDGLIFPPQVPMEQCSSQFTAQYKARLAQRLLAEEAVDDDSPA 68
Query: 93 KVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLL-LNEGKDLNILTGDFKE 151
+VDLTGG G+DF + ++ IY+E+ ARHN P+L L+ + +N + +
Sbjct: 69 SLVDLTGGFGVDFSYMSRVFNRAIYVEQQSILCDIARHNFPILGLDHAEVINDDSTAVLD 128
Query: 152 YLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMID 211
L + I++DPARR R YAIADC PD++ L LL +++ KLSPM+D
Sbjct: 129 TLGRVS-----MIFLDPARRDDHGSRTYAIADCMPDVLTLKDMLLAKAPTVMVKLSPMLD 183
Query: 212 LWDTLQSLPH-VQELHVVAAHGEVKELLVRM-----------SLNEATIPPEKVPIHAVN 259
T+ V E+H+V+ E KELL+ + N+ + K ++ N
Sbjct: 184 WHKTVADFAGAVHEVHIVSTGNECKELLLVLGRGRYASPLVVCANDEQVLSYKAGDNSDN 243
Query: 260 LLS-EDTVIPFIFTMEEERSISVPYADSID----KYVYEPHTALFKAGAFKTVAYRLGLR 314
+ D+ + T + S+S A+ D KY+YEP+ ++ KAG F + R +
Sbjct: 244 HTTISDSALAARNTCNTKDSLSEESANDFDSSHWKYLYEPNASIMKAGCFDVLEQRFAVH 303
Query: 315 KLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIEL 374
+ PNSHL+ + + FPGR+F +E I + LKQL + A+I+ RNFPL+ +L
Sbjct: 304 HISPNSHLFVAAEPIADFPGRSFAIESIATMNKKELKQLLAGLTHANIAVRNFPLTVAQL 363
Query: 375 RQRSKMADGGEKTLMGTTMADGKKVLLLLRK 405
R++ K+ DGG L TT A G+ ++L +K
Sbjct: 364 RKKLKLKDGGNTYLFATTDAQGRHLVLRTKK 394
>gi|183602204|ref|ZP_02963571.1| N6-adenine-specific methylase [Bifidobacterium animalis subsp.
lactis HN019]
gi|183218418|gb|EDT89062.1| N6-adenine-specific methylase [Bifidobacterium animalis subsp.
lactis HN019]
Length = 468
Score = 198 bits (503), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 124/368 (33%), Positives = 197/368 (53%), Gaps = 27/368 (7%)
Query: 52 RNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSR----FIREGTK--VVDLTGGLGIDF 105
R KLPQW + P L++EQ S T+ YK++ + E ++ +VDLTGG G+DF
Sbjct: 112 RTKLPQWAACEGIVYPPHLAMEQCSSQFTAEYKAQVAAQLVEEESEPTLVDLTGGFGVDF 171
Query: 106 IALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDYIY 165
A++ + G Y+ER + ARHN+ L G +++ GD ++L + + IY
Sbjct: 172 SAMVRGFAHGTYVERQERLCEVARHNLREL-GLGGRADVVCGDGVDHLAQMPP--ATLIY 228
Query: 166 VDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQS-LPHVQE 224
+DPARR R YAI DC P+ + L +LL ++ KLSPM+D T+ + V +
Sbjct: 229 LDPARRDEHGARTYAIEDCTPNALALRGQLLDKAPFVMIKLSPMLDWRKTIADFVGSVAQ 288
Query: 225 LHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSISVPYA 284
+H+ A E KE++V ++ E E+V + VN F + + + S A
Sbjct: 289 VHITATGNECKEIVVVLARGE----HERVRVVCVNDAQR-----LEFEVGADSAGSWQRA 339
Query: 285 DSID-------KYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTF 337
D +D +Y+YEP+ A+ KAGAF + +R++ NSHL+ SE + FPGR F
Sbjct: 340 DVLDACELGAQRYLYEPNAAIMKAGAFGEITRNYDVRQIGANSHLFVSEHPVADFPGRAF 399
Query: 338 VLEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGK 397
+ + + ++Q+ + A++S RNFPL+ +LR++ K+ DGG+ L TT+A G
Sbjct: 400 EITALGTMNKRDVRQVLGGISAANVSVRNFPLTVQQLRRKLKLRDGGDVYLFATTVAGG- 458
Query: 398 KVLLLLRK 405
+LL RK
Sbjct: 459 HMLLRTRK 466
>gi|149372807|ref|ZP_01891828.1| hypothetical protein SCB49_12629 [unidentified eubacterium SCB49]
gi|149354504|gb|EDM43069.1| hypothetical protein SCB49_12629 [unidentified eubacterium SCB49]
Length = 391
Score = 196 bits (498), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 197/364 (54%), Gaps = 19/364 (5%)
Query: 44 QIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGI 103
QIE + KL + +Y P +++LEQ+S T+ YK+ I G ++ D+TGGLGI
Sbjct: 42 QIEGRLKANKKLDYLLEVDDIYFPPKINLEQTSSQQTALYKASLII-GEELADVTGGLGI 100
Query: 104 DFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDY 163
D Y E N++ A+HN L + +++++ G+ L K + D
Sbjct: 101 DTFHFSKNTKNVDYFELNEDLVTIAKHNFDKL--DAPNISVVNGNGLALLNTEKKY--DT 156
Query: 164 IYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQ 223
IY+DP+RR+ + ++V+ + DC P++ ELL C+ ++ K SPM+D+ LQ L +V
Sbjct: 157 IYIDPSRRTESKQKVFFLNDCLPNVPKHIDELLNHCTLLMIKTSPMLDIQVGLQELKNVS 216
Query: 224 ELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLL-SEDTVIPFIFTMEEERSISVP 282
E+HVVA + EVKELL N V I +N+ SE+ F + + S P
Sbjct: 217 EIHVVAVNNEVKELLWLCKPNAVA----HVQIKTINIKNSENECFNFQLNTVSQSTYSSP 272
Query: 283 YADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEI 342
Y+YEP+ A+ KAG F ++ + GL KLH ++HL+TSE FPGR F + E+
Sbjct: 273 ST-----YLYEPNAAILKAGGFSHISDKKGLFKLHQHTHLFTSEMLMD-FPGRRFKIVEV 326
Query: 343 IPFSTSVLK-QLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLL 401
IP+S +K + + +A+++ RNF + +R++ K+ DGG+ L +T+ D KV++
Sbjct: 327 IPYSKKEMKANIAGI--KANVTTRNFTETVAAIRKKWKLKDGGDTYLFFSTLHDNSKVVI 384
Query: 402 LLRK 405
K
Sbjct: 385 KCEK 388
>gi|126646426|ref|ZP_01718943.1| hypothetical protein ALPR1_03515 [Algoriphagus sp. PR1]
gi|126578058|gb|EAZ82278.1| hypothetical protein ALPR1_03515 [Algoriphagus sp. PR1]
Length = 394
Score = 195 bits (495), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 136/404 (33%), Positives = 212/404 (52%), Gaps = 19/404 (4%)
Query: 1 MLFDEKEILAITRWAKLYANQSPDRILLG-SNDIPPEYRAAVATQIELWPRLRNKLPQWP 59
M F E + ++ + + + P +LL + + + + AV QI + KLP+W
Sbjct: 1 MDFSELHSAELKKFVQDHLREDPALLLLKFAGKVDFDLKTAV-QQISARQKASKKLPEWT 59
Query: 60 GISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIE 119
+L+ P LSLEQSS T+ +K++ + G +VDLTGG G+D L QGIY E
Sbjct: 60 QNPALFFPPSLSLEQSSSEETARHKAQGL-SGNLIVDLTGGFGVDAYYLSKNFKQGIYCE 118
Query: 120 RNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVY 179
E + N+ +L GK GD +L + D IY DPARR ++++Y
Sbjct: 119 YQPELFKLTKQNLEILA-PGK-FQFYQGDGLTFLKEQNQYF-DLIYADPARRGKGNQKLY 175
Query: 180 AIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLV 239
I DCEPDL+ L S I+ K SPM+D+ + +Q++ +++ EVKELL+
Sbjct: 176 KIQDCEPDLVSEWEVLKEKSSQIILKYSPMLDISQAWNEIQDIQKITILSVKNEVKELLI 235
Query: 240 RMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALF 299
+ NEAT P ++V + + L D F F+++EE+ Y++ +++ EPH+ +
Sbjct: 236 HWNKNEAT-PNQQVLVQDIGLGYPD----FSFSLQEEKLAQTLYSEP-KRFLIEPHSGIL 289
Query: 300 KAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTF-VLEEIIPFSTSVLKQLGKVVP 358
KAGAFK RLGL+KL NSHLYTSE Y PG+ F V+ E+ P K++ K+ P
Sbjct: 290 KAGAFKLFGQRLGLQKLETNSHLYTSEEYPVKIPGKVFEVIREVSPKK----KEIKKLFP 345
Query: 359 RASISC--RNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVL 400
++ RN+ L+++ + DGG+ L+GT G K+
Sbjct: 346 SGKVNVITRNYASGSEALKKKLGLKDGGDNYLIGTKTQSGFKIF 389
>gi|88802617|ref|ZP_01118144.1| hypothetical protein PI23P_08505 [Polaribacter irgensii 23-P]
gi|88781475|gb|EAR12653.1| hypothetical protein PI23P_08505 [Polaribacter irgensii 23-P]
Length = 391
Score = 194 bits (494), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 125/370 (33%), Positives = 198/370 (53%), Gaps = 24/370 (6%)
Query: 41 VATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGG 100
+A QI + KLP W ++Y P ++S+EQ+S +T+ YKS+ ++ G +VD+TGG
Sbjct: 40 LANQIVAKQKSEKKLPTWFSSENIYYPPKVSIEQTSSEITADYKSKLLK-GDVIVDITGG 98
Query: 101 LGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILT--GDFKEYLPLIKT 158
GID + + I+ E + + HN L K I+T GD +YL K
Sbjct: 99 FGIDCYYFSKQFKEVIHCEIDATLSTIVAHNYQQL----KQHTIVTHFGDGIQYLK-NKQ 153
Query: 159 FHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQS 218
+ D IYVDP+RR+ ++V+ + DC P++ L +L K SP++D+ +
Sbjct: 154 ENFDCIYVDPSRRNDRKEKVFLLKDCVPNVPDHLDFLFSKSKQVLLKNSPILDITSAINE 213
Query: 219 LPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTV-IPFIFTMEEER 277
L V+E+HVVA EVKE+L + +P+ +N+L T F++ E
Sbjct: 214 LKFVKEIHVVAVDNEVKEVLFLLEKGFVGT----LPVKTINILKNKTQHFNFLWNSPAES 269
Query: 278 SISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTF 337
S P Y+YEP+ AL K+G F V+ +L + KLH +SHLYTSE AFPGR F
Sbjct: 270 EYSEPLL-----YLYEPNAALLKSGGFHQVSAQLSIFKLHQHSHLYTSENL-IAFPGRIF 323
Query: 338 VLEEIIPFSTSVLKQLGKVV--PRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMAD 395
+E ++ + KQ+GK+V +A+I+ RNFP + ++RQ +K+ +GG L TT +
Sbjct: 324 KIETVLSYDK---KQIGKLVSHKKANITTRNFPKTVAQIRQETKIKEGGTLFLFFTTSKN 380
Query: 396 GKKVLLLLRK 405
+ +++ K
Sbjct: 381 NQLIVIFCSK 390
>gi|86133982|ref|ZP_01052564.1| hypothetical protein MED152_04720 [Tenacibaculum sp. MED152]
gi|85820845|gb|EAQ41992.1| hypothetical protein MED152_04720 [Polaribacter dokdonensis MED152]
Length = 391
Score = 194 bits (493), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 120/367 (32%), Positives = 204/367 (55%), Gaps = 18/367 (4%)
Query: 41 VATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGG 100
+A Q+ + KLP W S +Y P +LS+EQ+S +T+ YKS I EG+ ++D+TGG
Sbjct: 40 LANQVIAKQKSEKKLPTWFNTSKIYYPPKLSIEQTSSEITAKYKSAII-EGSSIIDITGG 98
Query: 101 LGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFH 160
G+D ++ + E N+E + HN L + ++ + G+ E+L K H
Sbjct: 99 FGVDCFYFAKTFNKVTHCEINEELSKIVSHNYKEL--DVSNITTIAGNSFEFLKNTKQ-H 155
Query: 161 SDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLP 220
D IY+DP+RRS +V+ + DC P + P ++IL K+SP++D+ +T+ L
Sbjct: 156 YDCIYIDPSRRSDVKGKVFLLKDCLPYVPPKIDFFFTKANNILIKVSPILDITNTINDLK 215
Query: 221 HVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSIS 280
+V+E+HVVA + EVKELL + + + I +N +D + + F ++E +
Sbjct: 216 NVKEVHVVAVNNEVKELLFLLEKDYNNT----IKIKTIN-FQKDQLQSYDFAYKDE--VQ 268
Query: 281 VPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLE 340
Y++ +D Y+YEP++A+ KAGAF+ +A KLH +SHL+TS+ + FPGR F ++
Sbjct: 269 ASYSEPLD-YLYEPNSAILKAGAFQQIAKHTNSFKLHQHSHLFTSKMLIT-FPGRAFKID 326
Query: 341 EIIPFSTSVLKQLGKVVP--RASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKK 398
I K+L K++P +A+ + RNFP + +LR+ +K+ DGG + T +
Sbjct: 327 TI---LKYDKKKLKKLLPENKANFTTRNFPKTVAQLRKETKIKDGGAIYVFFTKINKSDL 383
Query: 399 VLLLLRK 405
+ ++ K
Sbjct: 384 ITIICSK 390
>gi|120436004|ref|YP_861690.1| hypothetical protein GFO_1650 [Gramella forsetii KT0803]
gi|117578154|emb|CAL66623.1| conserved hypothetical protein [Gramella forsetii KT0803]
Length = 393
Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 127/366 (34%), Positives = 201/366 (54%), Gaps = 15/366 (4%)
Query: 41 VATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGG 100
+A QI+ K P++ ++ P +L+LEQ+S +T+ YK+ I+ G +DLTGG
Sbjct: 41 LAIQIKGLNVAEKKFPEFYQNPNILYPPKLNLEQTSSEITAKYKASLIK-GNTGIDLTGG 99
Query: 101 LGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFH 160
LGID + + +Y E N + A A HN L + ++++ TGD +L K
Sbjct: 100 LGIDTYFISKNFQEFMYCEINTDLAEIAEHNFKAL--KADNISVNTGDGLIFLSKFKG-G 156
Query: 161 SDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELL-PFCSSILAKLSPMIDLWDTLQSL 219
D+IY DPARR +V+ DC+P+ IP ELL ++++ K SP++D+ L L
Sbjct: 157 LDWIYADPARRDDRGGKVFKFEDCDPN-IPQNLELLFNHTNTMMIKSSPILDISAGLSEL 215
Query: 220 PHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSI 279
V+E+H+VA EVKELL + + P I A+N +D+V F E E I
Sbjct: 216 KFVKEVHIVAVRNEVKELLWILEKDFEDTP----EIKAIN-FEKDSVQKFSSESEPEHQI 270
Query: 280 SVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVL 339
+ A+ + Y+YEP+ A+ K+G F +A + +KLH +SHLYTS+A FPGR F +
Sbjct: 271 A-ELAEP-ETYLYEPNAAIMKSGLFDLLAVKTKTKKLHQHSHLYTSKAL-VGFPGRKFQI 327
Query: 340 EEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKV 399
I + S LK+ K +A+I+ RNFP S ++R++ K+ DGG + T+ +K+
Sbjct: 328 ASIKEYKPSGLKKHFK-SKKANITTRNFPESVEDIRKKFKIKDGGSDYIFFTSNMRDEKI 386
Query: 400 LLLLRK 405
++ +K
Sbjct: 387 VIYCKK 392
>gi|146302326|ref|YP_001196917.1| hypothetical protein Fjoh_4599 [Flavobacterium johnsoniae UW101]
gi|146156744|gb|ABQ07598.1| hypothetical protein Fjoh_4599 [Flavobacterium johnsoniae UW101]
Length = 393
Score = 185 bits (470), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 116/383 (30%), Positives = 202/383 (52%), Gaps = 17/383 (4%)
Query: 25 RILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYK 84
++ L N P ++ QIE + ++KLP W + PS++S+EQ+S T++YK
Sbjct: 24 KLALQKNPFPDVEWISILNQIEARTKAKDKLPTWFSAKDIIYPSKISVEQTSSEKTATYK 83
Query: 85 SRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNI 144
+ I EG ++DLTGG G+D + + E N++ + HN L K+ +
Sbjct: 84 ASLI-EGETLIDLTGGFGVDDYYFSQRFISIAHCEINEDLSAIVSHNFEQL--HVKNSHF 140
Query: 145 LTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILA 204
D L + D+IY+DP+RR+ A +V+ + DC P++ SIL
Sbjct: 141 YADDSANVLNNLNQ-KWDWIYIDPSRRNDAKGKVFMLKDCLPNVPESLDFYFEKSDSILI 199
Query: 205 KLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSED 264
K +P++D+ L L V+ +H++A EVKELL + N + ++ + N++ +D
Sbjct: 200 KTAPLLDISAGLSELKSVKNIHIIALENEVKELLFEIHKNYSG----EITLKTANIV-KD 254
Query: 265 TVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYT 324
+ F F + + P + KY+YEP++A+ K+G F V+ + KLH +SHLYT
Sbjct: 255 KIETFEFVLGAKGQF--PSYNLPQKYLYEPNSAIMKSGGFDEVSTSFKINKLHKHSHLYT 312
Query: 325 SEAYESAFPGRTFVLEEIIPFSTSVLKQ--LGKVVPRASISCRNFPLSPIELRQRSKMAD 382
S+ FPGR+F +E++I ++ + +K L K +A+++ RNFP + +R++ K+ +
Sbjct: 313 SDDL-IDFPGRSFEIEKVISYNKNDMKNELLNK---QANVTTRNFPETVENIRKKWKIKN 368
Query: 383 GGEKTLMGTTMADGKKVLLLLRK 405
GG TT + K++L+ RK
Sbjct: 369 GGNFYCFFTTDKNDNKIVLICRK 391
>gi|171741703|ref|ZP_02917510.1| hypothetical protein BIFDEN_00791 [Bifidobacterium dentium ATCC
27678]
gi|171277317|gb|EDT44978.1| hypothetical protein BIFDEN_00791 [Bifidobacterium dentium ATCC
27678]
Length = 455
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 125/420 (29%), Positives = 194/420 (46%), Gaps = 77/420 (18%)
Query: 43 TQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIR------EGT---- 92
QI W RNKL +W + P +S+EQ S T+ YK+ + +G
Sbjct: 47 NQIAGWQIARNKLSEWADCDDIIYPPHISMEQCSSQFTAQYKAEIVNRLLCTDDGADNAR 106
Query: 93 -------------------------------------KVVDLTGGLGIDFIALMSKASQG 115
+VDLTGG G+DF L ++
Sbjct: 107 DSAHSDDIGKTDIAGITEAEHAEEWDSVTTSAGNADLSMVDLTGGFGVDFSYLARGFTRA 166
Query: 116 IYIERNDETAVAARHNIPLL-LNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARRSGA 174
Y+ER A HN+ +L L++ + I+ GD EYL ++ IY+DPARR
Sbjct: 167 AYVERQPHLCDLAAHNMIVLGLHQTE---IICGDGVEYLRSMQPV--SLIYIDPARRDEH 221
Query: 175 DKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPH-VQELHVVAAHGE 233
R YAI DC P+++ L LL + KLSPM+D + V E+H+V+ E
Sbjct: 222 GTRTYAIEDCMPNVLALRDLLLAKARFAMIKLSPMLDWRKAIADFGGAVSEVHIVSTGNE 281
Query: 234 VKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSI---SVPYADSID-- 288
KELL+ + T P ++D P ++ + + + I S YA +
Sbjct: 282 CKELLMVLDGAVETRHP-----------TDDVRAPHVYCVNDGQRIDYDSAVYARGLRIG 330
Query: 289 -------KYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEE 341
+Y+YEP+ ++ KAG F + R G+ ++ PNSHL+ + + FPGR F +E
Sbjct: 331 TAPLPEMEYLYEPNASIMKAGCFDLLEERYGVTQIGPNSHLFVAAEPVTDFPGRGFAIEA 390
Query: 342 IIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLL 401
+ +++L V RA+++ RNFPL+ +LR++ K+ADGG+ L GTT+ G +LL
Sbjct: 391 VGGMGKKDVRRLLSGVGRANVAVRNFPLTAPQLRKKLKLADGGDTYLFGTTIQGGGHILL 450
>gi|149280236|ref|ZP_01886359.1| hypothetical protein PBAL39_19300 [Pedobacter sp. BAL39]
gi|149229073|gb|EDM34469.1| hypothetical protein PBAL39_19300 [Pedobacter sp. BAL39]
Length = 393
Score = 180 bits (457), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 127/394 (32%), Positives = 196/394 (49%), Gaps = 21/394 (5%)
Query: 11 ITRWAKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRL 70
+ + + + ++ +I + + P +A QI + KLP W S LY P L
Sbjct: 10 VQHYIRQHLHEDAYKIAMAKSPFPTVSGQELAGQIVAKQKCLKKLPLWYERSFLYFPPVL 69
Query: 71 SLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARH 130
S+EQ S T+ YKSR +GT ++DLTGG G+D S + E N E + A H
Sbjct: 70 SIEQCSSERTAIYKSRLALKGT-LIDLTGGFGVDSFYFAKSCSSVTHCELNQELSEIAAH 128
Query: 131 NIPLLLNEGKD-LNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLI 189
N L G+D L GD L T H D IY+DPARR G +V+ + DC P+++
Sbjct: 129 NAATL---GQDNLTFFAGDGIAQLQ-NNTTHYDNIYIDPARR-GTMGKVFMLKDCTPNVV 183
Query: 190 PLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIP 249
+ LL ++ K +P++DL L+ L HV E+H+V+ E KEL+ + +
Sbjct: 184 ENLSLLLSRAHRVILKTAPLLDLSSGLKELHHVAEIHIVSVRNECKELVWILEKKQ---- 239
Query: 250 PEKVPIHAVNLLSEDTVIPFIFTMEE--ERSISVPYADSIDKYVYEPHTALFKAGAFKTV 307
PE + I + L E+ + FI EE R + P + +Y+YEP AL K+GAF +
Sbjct: 240 PEALIISCITLNEEEKIFSFIKGEEELEARLLQGP----VQEYLYEPDAALLKSGAFNLI 295
Query: 308 AYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNF 367
R GL KL + LYTS + FPGR F + + S + LK+ + +A++ RN+
Sbjct: 296 GERYGLFKLQHQTQLYTSASAIPEFPGRRFKVRGFL--SATTLKKEKNL--KANVISRNY 351
Query: 368 PLSPIELRQRSKMADGGEKTLMGTTMADGKKVLL 401
P +L ++ K+ + L+ T DG V++
Sbjct: 352 PDKAEQLVKKYKIKPDQLRFLIFTQTKDGGNVII 385
>gi|83857756|ref|ZP_00951284.1| hypothetical protein CA2559_13168 [Croceibacter atlanticus
HTCC2559]
gi|83849123|gb|EAP86992.1| hypothetical protein CA2559_13168 [Croceibacter atlanticus
HTCC2559]
Length = 396
Score = 179 bits (454), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 119/354 (33%), Positives = 184/354 (51%), Gaps = 12/354 (3%)
Query: 52 RNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSK 111
++KL W + P ++++EQ+S + T+ YK+ I G ++D+TGGLGID
Sbjct: 54 KDKLSLWHSTKGILFPPKVNIEQTSSSKTARYKASLI-SGKTIIDITGGLGIDDYYFSKV 112
Query: 112 ASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARR 171
+ E N + A HN +L ++ GD L T D++Y DP+RR
Sbjct: 113 FDTVTHCELNVSLSALAAHNSNVL--GAHNITFKVGDGISILKQEST-KFDWVYSDPSRR 169
Query: 172 SGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAH 231
+V+ ++DCEP++ LL IL K SP++D+ L+ L V E+H+VA +
Sbjct: 170 DDTGGKVFKLSDCEPNIPKHLDVLLEKGKRILLKTSPLLDITAGLRELKSVSEIHIVAIN 229
Query: 232 GEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYV 291
+VKELL + E P E + ++ D F + E V Y+ ++ Y+
Sbjct: 230 NDVKELLWIID-QEHNSPTEIITVN----FKADYKEEFKAKLTYESLAQVNYSKALS-YL 283
Query: 292 YEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLK 351
YEP++AL K+G F T++ + L KLH NSHLYTSE F GR F + E +PF LK
Sbjct: 284 YEPNSALMKSGLFNTISEQYSLFKLHQNSHLYTSERLID-FSGRRFKILETLPFHKKSLK 342
Query: 352 QLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLRK 405
++ K +A+I+ RNFPL+ +L+ ++ DGG L TT KV+L+ K
Sbjct: 343 RVFKNT-KANITTRNFPLTVSQLKSALQIKDGGTTYLFFTTQNQSDKVVLVCEK 395
>gi|88805298|ref|ZP_01120817.1| hypothetical protein RB2501_13199 [Robiginitalea biformata
HTCC2501]
gi|88784116|gb|EAR15286.1| hypothetical protein RB2501_13199 [Robiginitalea biformata
HTCC2501]
Length = 369
Score = 177 bits (449), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 128/357 (35%), Positives = 178/357 (49%), Gaps = 18/357 (5%)
Query: 41 VATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGG 100
+A Q+ + KLP W +++Y P L+LEQ+S T+ YK+ I G +VDLTGG
Sbjct: 18 LAQQVAGRKKTLEKLPSWHRANAIYYPPGLNLEQASSEATARYKASLI-SGNWLVDLTGG 76
Query: 101 LGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFH 160
G+D ++ + Y E + A A HN L ++ + GD +YL ++
Sbjct: 77 FGVDAYFFARQSQRVDYFEIDTGLAEIAAHNFRQL--GATNIRVHPGDGLDYLRGLQPGS 134
Query: 161 S--DYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQS 218
S D+IY DP+RR RV + D PD+ LL ++L K SPM+DL L++
Sbjct: 135 SLPDWIYADPSRRHAKKGRVIRLEDYAPDIPGNLDLLLGVSDNLLVKTSPMLDLSAGLKA 194
Query: 219 LPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERS 278
L V E+HVVA EVKELL + + P IHA +L D PF F +EE S
Sbjct: 195 LRQVAEIHVVAVKNEVKELLWVIRSTASDGP----RIHASDL--SDGHPPFRFHPDEEAS 248
Query: 279 ISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFV 338
A +Y+YEP AL KAGAFK R GL KLH ++HLYTSE +PGR F
Sbjct: 249 ADSQLALPA-RYLYEPGAALLKAGAFKLAGTRYGLGKLHRHTHLYTSEKL-IPYPGRRFR 306
Query: 339 LEEIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMAD 395
+ E P+ L ++ RNFP +R+R+++ GG L AD
Sbjct: 307 ILENHPYKPGRLPYR-----EGHVNSRNFPEDVARIRKRNRIKSGGNTYLFFVRAAD 358
>gi|86142666|ref|ZP_01061105.1| hypothetical protein MED217_07121 [Flavobacterium sp. MED217]
gi|85830698|gb|EAQ49156.1| hypothetical protein MED217_07121 [Leeuwenhoekiella blandensis
MED217]
Length = 395
Score = 173 bits (438), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 142/407 (34%), Positives = 209/407 (51%), Gaps = 27/407 (6%)
Query: 5 EKEILAITRWAKLYANQSPDRILLGSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSL 64
++E++ TR A A+ S +ILL + +A QI + KLP W +
Sbjct: 9 KQEVITYTR-AHYKADVS--KILLSKSPFEDVSAPELAQQIIGLQKAERKLPSWFQKPEI 65
Query: 65 YIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDET 124
P +L+LEQ+S T+ YK+ G+ +DLTGGLGID L + + + E N E
Sbjct: 66 LYPPKLNLEQTSSEETALYKASLFA-GSSAIDLTGGLGIDTYFLSTVFDEVTHCEMNAEL 124
Query: 125 AVAARHNIPLL-LNEGKDLNILTGDFKEYLPLIKTF--HSDYIYVDPARRSGADKRVYAI 181
+ A+HN +L K +N + DF IK + D IY DP+RR+ +V+ +
Sbjct: 125 SELAQHNFEVLGATNIKTINHNSIDF------IKNTAGNFDLIYCDPSRRNEDKGKVFML 178
Query: 182 ADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRM 241
DCEP++ LL ++ K SP++D+ L+ L +V ++H VA EVKELL +
Sbjct: 179 KDCEPNIPEHLDFLLDKAKHLVIKTSPLLDIAAGLRELKNVIQIHCVAVKNEVKELLWVL 238
Query: 242 SLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKA 301
S NE T P + + AVNL S +P + ++ Y DS Y+YEP+ AL K
Sbjct: 239 S-NE-TQP--DIELIAVNLTSAFD-LPVSIEGFDLQTAEATY-DSPSNYLYEPNAALLKL 292
Query: 302 GAFKTVAYRLGLRKLHPNSHLYTS-EAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPR- 359
G F ++ L KL PN+HLYTS EA + FPGR F + + P+S K + K + R
Sbjct: 293 GCFNWISSHYKLDKLAPNTHLYTSTEAIQ--FPGRHFKINAVHPYSK---KSIAKFLKRK 347
Query: 360 -ASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLRK 405
A+I+ RNF S LR + K+ GG+ L TT+ D V+L K
Sbjct: 348 KANITTRNFKESVASLRAKFKVKSGGDLYLFFTTLEDSTTVMLECEK 394
>gi|21672941|ref|NP_661006.1| hypothetical protein CT0100 [Chlorobium tepidum TLS]
gi|21645998|gb|AAM71348.1| conserved hypothetical protein [Chlorobium tepidum TLS]
Length = 399
Score = 150 bits (380), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 128/384 (33%), Positives = 187/384 (48%), Gaps = 27/384 (7%)
Query: 29 GSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFI 88
G D+P A+A QI + KLP LY +RL LEQ+SG + +K+ +
Sbjct: 35 GRGDLPVR---AIAEQIACRKKAAAKLPSLSRFPMLY--TRLGLEQASGERAAEWKASLM 89
Query: 89 REGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGD 148
R G + +DLTGGLGID + L + + ERN+ A A N ++ ++ L GD
Sbjct: 90 R-GWRAIDLTGGLGIDTLFLAQRFDSVVSCERNEALARLAEANRRMM--GVTNVETLIGD 146
Query: 149 FKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSP 208
+E L D++ VDPARR R ++ PD++ L LL + K SP
Sbjct: 147 SEELLAGYADDSFDWVLVDPARREHGG-RSAGLSASSPDVVRLHDMLLRKARRVCIKASP 205
Query: 209 MIDLWDTLQSLPHVQELHVVAAHGEVKELLVRM-SLNEATIPPEKVPIHAVNLLSEDTVI 267
+++ LP + E+ V+ GE KE+L+ + EA + PE I AV L SE
Sbjct: 206 ALEISGLETQLPTLSEVIAVSVDGECKEVLLLLYREREAGLTPE---IRAVCLGSE---- 258
Query: 268 PFIFTMEEERSISVP----YADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLY 323
T E S VP A++ ++YEP TA+ KA +A + L L+
Sbjct: 259 ----TFEIVSSGGVPPARVVAEAPGTWLYEPDTAIIKARLTGELARQFHLEFLNRTVDYL 314
Query: 324 TSEAYESAFPGRTFVLEEIIPF-STSVLKQLGKV-VPRASISCRNFPLSPIELRQRSKMA 381
TS+ FPGR+F +EE PF S K+L ++ + A+I R+FPLS ELR+R K+
Sbjct: 315 TSDRLIEPFPGRSFRIEECRPFRQKSFRKELAELEITNAAIQRRDFPLSVEELRKRYKIG 374
Query: 382 DGGEKTLMGTTMADGKKVLLLLRK 405
+ E+ L T A G + L RK
Sbjct: 375 ESSERYLFFTKNATGSLIWLSCRK 398
>gi|89891579|ref|ZP_01203083.1| putative methyltransferase [Flavobacteria bacterium BBFL7]
gi|89516126|gb|EAS18789.1| putative methyltransferase [Flavobacteria bacterium BBFL7]
Length = 387
Score = 147 bits (372), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 110/365 (30%), Positives = 185/365 (50%), Gaps = 18/365 (4%)
Query: 41 VATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGG 100
+A Q++ + R KL + S + P +++LEQ+S T+SYK+ G K++DLTGG
Sbjct: 40 LAQQLKGLQKARIKLEPYFNNSQIIYPPKVNLEQTSSWSTASYKANLFN-GEKMIDLTGG 98
Query: 101 LGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFH 160
GID A + +IE + + A + +G D +YL + +
Sbjct: 99 FGIDISAFAKAYNNTTHIELHTDLQKLAEQSFKA---QGLTTKSYASDGMQYLAKSTSIY 155
Query: 161 SDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLP 220
IY+DP+R++ + + D EP++I LL ++ K SPM+D+ L+ L
Sbjct: 156 G-LIYIDPSRKTATTSKAIQLEDYEPNVIENLDLLLSKGKIVMIKTSPMLDITAGLKQLK 214
Query: 221 HVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSIS 280
+V +H+VA EVKELL + NEAT V +H +NL S + F + +S
Sbjct: 215 NVCAIHIVAVKNEVKELLWILE-NEAT----DVVVHCINLESSQPDLCFKHDHKANIDLS 269
Query: 281 VPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLE 340
P Y+YEP+ A+ K+ AF + + G+ K+ ++HL+TS+ FPGRTF++E
Sbjct: 270 EPLT-----YLYEPNAAIMKSQAFGHLCEKYGVSKIDQDAHLFTSKK-NIDFPGRTFIIE 323
Query: 341 EIIPFSTSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVL 400
EI + +K+ RA ++ RNF S +LR + ++ + E + T + GK ++
Sbjct: 324 EIKAYKPKDIKRTYAKSHRAVVT-RNFRESVHQLRTKFQLKE-HETDYLFFTSSMGKPIV 381
Query: 401 LLLRK 405
+ +K
Sbjct: 382 IQAKK 386
>gi|78188005|ref|YP_378343.1| hypothetical protein Cag_0021 [Chlorobium chlorochromatii CaD3]
gi|78170204|gb|ABB27300.1| conserved hypothetical protein [Chlorobium chlorochromatii CaD3]
Length = 399
Score = 136 bits (343), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 183/367 (49%), Gaps = 20/367 (5%)
Query: 40 AVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTG 99
A+A Q+ + KLP + LY + LSLEQ+S T+ +K F+ +G + +DL+G
Sbjct: 43 ALAEQLACRRKAERKLPTLSRHNLLY--TTLSLEQASSERTARFKCTFM-QGKRCIDLSG 99
Query: 100 GLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEG-KDLNILTGDFKEYLPLIKT 158
GLGID I L + + +Y ERN+ RHN ++ G ++ + GD +L
Sbjct: 100 GLGIDAIFLAAHFEELLYCERNELLCNVVRHN---MVRCGIGNVRLQQGDSLSFLASQPD 156
Query: 159 FHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQS 218
D+I VDPARR KR + P+++ LL I K SP +++ +
Sbjct: 157 NAFDWIMVDPARREEG-KRSIGLEAASPNVVASQELLLAKAPHICIKASPALEISNLKML 215
Query: 219 LPHVQELHVVAAHGEVKELLVRMSLN-EATIPPEK-VPIHAVNLLSEDTVIPFIFTMEEE 276
LP + + VV+ GE KE+L+ + EA P K + + A N + V+ + T E+
Sbjct: 216 LPALHTILVVSVSGECKEILLLLKRGAEAEHPITKAICLQADN----NAVVEIVGTHEQH 271
Query: 277 RSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRT 336
RS+ A+S+ Y+YEP A+ KA VA + GL L+ + TS ++F G+
Sbjct: 272 RSL----AESLQCYLYEPDAAIIKARLSGVVAKQEGLEFLNKSVDYLTSNHVVASFAGKV 327
Query: 337 FVLEEIIPFSTSVLKQL--GKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMA 394
F + E +P+ ++ + ASI R+FPLS ELR++ ++ + + L+ T
Sbjct: 328 FQVIESVPYKPKEFRKFLDRHAISAASIQRRDFPLSADELRKKFRLREDEKHFLIFTRNR 387
Query: 395 DGKKVLL 401
+ + + +
Sbjct: 388 NAEPICI 394
>gi|68549338|ref|ZP_00588803.1| conserved hypothetical protein [Pelodictyon phaeoclathratiforme
BU-1]
gi|68243751|gb|EAN25947.1| conserved hypothetical protein [Pelodictyon phaeoclathratiforme
BU-1]
Length = 420
Score = 134 bits (337), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 176/370 (47%), Gaps = 15/370 (4%)
Query: 29 GSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFI 88
G +D+P A+A Q+ + KLP LY P L+LEQSSG T++YK+ +
Sbjct: 35 GRSDLPVR---AMAEQLACQQKAVKKLPILSKHKLLYTP--LALEQSSGERTAAYKASLM 89
Query: 89 REGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGD 148
G + +DL+GGLG+D + L +Y ER+ + HN L ++ ++ + G+
Sbjct: 90 -SGKRAIDLSGGLGVDAMFLARTFQDVVYCERDSQLCAVVEHN--LKVSGIANVQVRNGE 146
Query: 149 FKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSP 208
L D+I+VDPARR +R A+ PD++ LL + K SP
Sbjct: 147 SISLLAEYPDNSFDWIFVDPARREEG-RRSIALDAASPDVVASHDLLLRHAQRVCIKASP 205
Query: 209 MIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIP 268
+++ + LP +Q + VV+ E KE+L+ + P +V +N SE+
Sbjct: 206 ALEISGLKKLLPALQTIVVVSVDRECKEILLLLERAHPADGPVQVKAVCLNRDSEEITEV 265
Query: 269 FIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAY 328
F E R + + ++ +Y+YEP A+ KA +A GL+ ++ + T++
Sbjct: 266 FGGNGEAPRVVGM----AVKEYLYEPDPAIIKARLSAVLARDSGLQFVNRSVDYLTADRK 321
Query: 329 ESAFPGRTFVLEEIIPFSTSVLKQL--GKVVPRASISCRNFPLSPIELRQRSKMADGGEK 386
FPGRTF + + +P+ + + ASI R+FPLS ELR++ ++ +
Sbjct: 322 IEDFPGRTFRVVDCVPYKPKSFRAFLERHAITGASIQRRDFPLSAEELRKKYRLLESERA 381
Query: 387 TLMGTTMADG 396
L T A G
Sbjct: 382 FLFFTKDATG 391
>gi|91214828|ref|ZP_01251801.1| hypothetical protein P700755_18224 [Psychroflexus torquis ATCC
700755]
gi|91187255|gb|EAS73625.1| hypothetical protein P700755_18224 [Psychroflexus torquis ATCC
700755]
Length = 402
Score = 131 bits (329), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 194/404 (48%), Gaps = 30/404 (7%)
Query: 11 ITRWAKLYANQS-PDRILLGSNDIPPEYR--AAVATQIELWPRLRNKLPQWPGISSLYIP 67
+ ++ + N+S PD IL GS P ++ + QI + + KLP W + P
Sbjct: 16 VNKFIEKNVNKSIPDLILKGS---PFKHIDIKDIINQIIGKEKAKKKLPHWYKNEKVIYP 72
Query: 68 SRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVA 127
S+L+LEQ+S +T+ +KS+ + T ++D+TGG GID K +Y E N E
Sbjct: 73 SKLNLEQTSSEITALHKSKLVSCQT-LLDMTGGFGIDSYYFSKKVISLLYTELNSELCRV 131
Query: 128 ARHNIPLLLNEGKD-LNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEP 186
+HNI G D I D ++L + ++IY+DP+RR+ K V+ + D P
Sbjct: 132 VKHNIKAF---GIDNFEIKNEDSIDFLQKNSQIY-NWIYIDPSRRNEKTK-VFQLKDSLP 186
Query: 187 DLIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEA 246
++I + + K SPM D+ + L V+E+H+++ EVKELL
Sbjct: 187 NIIEHLEIIERKSERFMLKTSPMYDIDMGYKELKGVKEIHIISVKNEVKELL-------W 239
Query: 247 TIPPEKVPIHAVNLLSEDTVIPFIFTM--EEERSISVPYADSIDKYVYEPHTALFKAGAF 304
I K + + + T + +T E E + + +Y+YE + + K+G
Sbjct: 240 IIDWRKQNSKTIKIYNYQTKKRYSYTSIDENEEQLVRIKLTTCCQYLYELDSGIMKSGLN 299
Query: 305 KTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISC 364
+ R L+KL +++LYTS+ + PG+ + ++ + P + +K+ K IS
Sbjct: 300 DLIGIRYDLKKLEQHTNLYTSDKKILSIPGKIYKVKSVEPINYKKIKKAIKGYQINLIS- 358
Query: 365 RNFPLSPIELRQRSKMADGGE-------KTLMGTTMADGKKVLL 401
+NF L+ EL+++ K GG+ KT+ G + + +++L
Sbjct: 359 KNFQLNTDELQKKLKCTIGGKSDYLIFAKTIEGNRVIEATRIVL 402
>gi|119358355|ref|YP_912999.1| hypothetical protein Cpha266_2587 [Chlorobium phaeobacteroides DSM
266]
gi|119355704|gb|ABL66575.1| conserved hypothetical protein [Chlorobium phaeobacteroides DSM
266]
Length = 444
Score = 129 bits (325), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 108/372 (29%), Positives = 173/372 (46%), Gaps = 20/372 (5%)
Query: 29 GSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFI 88
G D+P +A Q+ R KLP + LY P L+LEQSSG ++YK+ F+
Sbjct: 60 GRGDLPVR---VMAEQLGCRQRAVKKLPILSAFNLLYTP--LALEQSSGERAAAYKASFM 114
Query: 89 REGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGD 148
G +V+DL+GGLG+D + L + + +Y ER+ HN L + ++ I D
Sbjct: 115 -SGNRVIDLSGGLGVDTVFLAGRFREVVYCERDPLLCAVVEHN--LQASGVANVAIKNDD 171
Query: 149 FKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSP 208
L D+I+VDPARR +R A+ PD++ LL + K SP
Sbjct: 172 SISLLATYPDDFFDWIFVDPARREEG-RRSVALEAASPDVVASHDLLLRHAQRVCIKASP 230
Query: 209 MIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDT--V 266
+++ + LP ++ + VV+ E KE+L+ L E P + + + LS D+ +
Sbjct: 231 ALEISGLKKLLPALRSIVVVSVDRECKEILL---LLERGFPSDGIVLVKAVCLSADSEEL 287
Query: 267 IPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSE 326
+ E R + A + Y+YEP A+ KA +A GL ++ + T++
Sbjct: 288 TDVVGGGEAPRVV----ASGVKGYLYEPDPAIIKARLSAVLARVSGLEFVNGSVDYLTAD 343
Query: 327 AYESAFPGRTFVLEEIIPFSTSVLKQL--GKVVPRASISCRNFPLSPIELRQRSKMADGG 384
FPGR F + E IP+ + + ASI R+FPLS ELR++ ++ +
Sbjct: 344 VVIDGFPGRMFRVVECIPYKPKSFRAFLERNGITGASIQRRDFPLSAEELRKKYRLVESE 403
Query: 385 EKTLMGTTMADG 396
+ T A G
Sbjct: 404 RVFIFFTRDAAG 415
>gi|67918008|ref|ZP_00511610.1| conserved hypothetical protein [Chlorobium limicola DSM 245]
gi|67784304|gb|EAM43681.1| conserved hypothetical protein [Chlorobium limicola DSM 245]
Length = 419
Score = 128 bits (321), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 114/380 (30%), Positives = 180/380 (47%), Gaps = 26/380 (6%)
Query: 29 GSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFI 88
G DIP A+A Q++ R KLP LY +LEQSSG ++YK+ +
Sbjct: 35 GRADIPAR---AIAEQLDCRRRALKKLPVLSQKGLLYTAP--ALEQSSGEAAATYKASLM 89
Query: 89 REGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGD 148
G +++DLTGGLGID + +Y ER+ A A +N L ++ I GD
Sbjct: 90 -GGERLIDLTGGLGIDAVFFSRVFRNVVYCERDPLLAELAAYNFRRL--GVSNVEICQGD 146
Query: 149 FKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSP 208
L H D++YVDP+RR G +R ++ PD+ LL + ++ K SP
Sbjct: 147 SLAMLHSFSDDHFDWMYVDPSRREGG-RRSVGLSAASPDVAASHDLLLRKAAKVMVKASP 205
Query: 209 MIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIP 268
++L + LP + +HVV+ GE +E L+ L E T P + PI +L + T
Sbjct: 206 ALELSGIERQLPSISAIHVVSVGGECRETLL---LLERT-PARQGPIARKAVLLDTTAGE 261
Query: 269 F---IFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTS 325
+ T E RS +VP + + YEP A+ K+ ++ G + TS
Sbjct: 262 WREIAGTGWESRSPAVP----VQAFFYEPGPAIIKSALTARLSEVYGFDYVSNTVDYLTS 317
Query: 326 EAYESAFPGRTFVL---EEIIPFS-TSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMA 381
+ FPGRTF + + P + S L++ G + A+I R+FPLSP E+R++ ++
Sbjct: 318 NRFVEGFPGRTFRVVASDRYKPKTFRSFLQRHG--IRGAAIQRRDFPLSPEEIRRKFRLR 375
Query: 382 DGGEKTLMGTTMADGKKVLL 401
+ L T + G+ + +
Sbjct: 376 ESHCDYLFFTKDSAGESICV 395
>gi|67937956|ref|ZP_00530486.1| conserved hypothetical protein [Chlorobium phaeobacteroides BS1]
gi|67915817|gb|EAM65135.1| conserved hypothetical protein [Chlorobium phaeobacteroides BS1]
Length = 402
Score = 127 bits (318), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 180/369 (48%), Gaps = 19/369 (5%)
Query: 40 AVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFI-REGTKVVDLT 98
A+A QI + NKLP+ + Y + L+LEQ+SG + YKS GT+++D++
Sbjct: 43 AIAEQIACRRKAHNKLPELSKHNLFY--TSLALEQASGERAACYKSGLQGMSGTRLLDMS 100
Query: 99 GGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKT 158
GGLGID I + + +Y ER+ A A HN L ++ ++ GD + L
Sbjct: 101 GGLGIDTIFFAERFDEVVYCERDAVVAKIAEHNFREL--GISNVTVIHGDSVKTLETFPD 158
Query: 159 FHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQS 218
D I++DPARR +R A+ PD++ L LL K SP +++
Sbjct: 159 ESFDLIFIDPARREKG-RRSAALEAGSPDVVSLHDLLLRKAHRFCVKASPALEIDGIEAK 217
Query: 219 LPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSE-DTVIPFIFTMEEER 277
LP + ++ V++ E KE+L+ P + + AV+L + + VI + ++
Sbjct: 218 LPSLFQVIVLSVERECKEILLFCRREHDAGKP--LVVKAVSLKGDKENVIVSEAGCQPQK 275
Query: 278 SISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTF 337
I A + +Y+YEP A+ KA A + GL+ ++ TS+ F GR+F
Sbjct: 276 HI----APTPGRYLYEPDPAIIKARLTAFAAAQYGLQFINSRIDYLTSDREIGDFHGRSF 331
Query: 338 VLEEIIPFS----TSVLKQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTM 393
+ I F T+ LK+ G + ASI R+FPLSP E+R+R ++ + E L+ T
Sbjct: 332 RIITTIFFKPGRFTTFLKEQG--IESASILRRDFPLSPDEIRRRYRLRENKETFLIFTKD 389
Query: 394 ADGKKVLLL 402
G+ + LL
Sbjct: 390 ISGRLLCLL 398
>gi|67940206|ref|ZP_00532663.1| conserved hypothetical protein [Chlorobium phaeobacteroides BS1]
gi|67913581|gb|EAM62972.1| conserved hypothetical protein [Chlorobium phaeobacteroides BS1]
Length = 415
Score = 125 bits (314), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 109/344 (31%), Positives = 172/344 (50%), Gaps = 21/344 (6%)
Query: 40 AVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFI-REGTKVVDLT 98
A+A QI + NKLP+ + Y + L+LEQ+SG + YKS GT+++D++
Sbjct: 43 AIAEQIACRRKAHNKLPELSKRNLFY--TSLALEQASGERAARYKSGLQGMSGTRLLDMS 100
Query: 99 GGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKT 158
GGLGID I + + +Y ER+ A A HN L ++ ++ GD E L
Sbjct: 101 GGLGIDTIFFAERFDEVVYCERDAVVAKIAEHNFREL--GISNVTVIHGDSVETLETFPD 158
Query: 159 FHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQS 218
D I++DPARR +R A+ PD++ L LL K SP +++
Sbjct: 159 ESFDLIFIDPARREKG-RRSAALEAGSPDVVSLHDMLLRKAHRFCVKASPALEIDGLDAK 217
Query: 219 LPHVQELHVVAAHGEVKE-LLVRMSLNEATIPPEKVPIHAVNLLSE-DTVIPFIFTMEEE 276
LP + ++ V++ E KE LL +++A P + AV+L + + VI +++
Sbjct: 218 LPSLFQVIVLSVERECKEILLFCRRVHDAGKP---FVVKAVSLKGDKENVIVSEAWCQQQ 274
Query: 277 RSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRT 336
+SI A + +Y+YEP A+ KA A + GL+ ++ TS+ FPGR+
Sbjct: 275 KSI----APTPGRYLYEPDPAIIKAHLTAFAAAQYGLQFINSRVDYLTSDRKIGDFPGRS 330
Query: 337 FVLEEIIPFS----TSVLKQLGKVVPRASISCRNFPLSPIELRQ 376
F + I F T+ LK+ G + ASI R+FPLSP E+R+
Sbjct: 331 FRIITTIFFKPGRFTTFLKEQG--IESASILRRDFPLSPDEIRR 372
>gi|145220508|ref|YP_001131217.1| hypothetical protein Cvib_1705 [Prosthecochloris vibrioformis DSM
265]
gi|145206672|gb|ABP37715.1| conserved hypothetical protein [Prosthecochloris vibrioformis DSM
265]
Length = 408
Score = 120 bits (302), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 176/376 (46%), Gaps = 20/376 (5%)
Query: 29 GSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFI 88
G DIP A+A QI + KLP+ LY SR +LEQ+S + Y++ +
Sbjct: 35 GRTDIPVR---AIAEQIACRKKAAKKLPRLSTRPLLYT-SR-ALEQASAERVAGYRASQM 89
Query: 89 REGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGD 148
G + +DL+GGLGID L ++ ++ER+ A H + +L E K++ I GD
Sbjct: 90 -SGQRAIDLSGGLGIDSSFLAGAFARIDFVERDPLLCRLASHTMQVL--ERKNVAIQCGD 146
Query: 149 FKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSP 208
L I + D+I+VDP RR +RV +++ PD++ EL+ + K SP
Sbjct: 147 AFAVLDSIPSGILDWIFVDPDRRESGRRRV-GLSESSPDVVSGHNELIHKAPGVCIKASP 205
Query: 209 MIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIP 268
++L + LP + + VV+ + +E ++ + A EK P L +
Sbjct: 206 ALELSGLKRELPSLSSIIVVSLDRQCRESMLILLKGTAQ---EKEPERKAVCLHSSGIGE 262
Query: 269 FIFTMEE--ERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSE 326
+ + E ER ++ +EP A+ KAG VA GL L+ + T
Sbjct: 263 YEVSGREGVERRVTAEPGPCF----FEPDPAIIKAGLSAEVAEEFGLEFLNHSVDYLTGG 318
Query: 327 AYESAFPGRTFVLEEIIPFSTSVLKQLGK--VVPRASISCRNFPLSPIELRQRSKMADGG 384
E+ FPGR F L + P+ + ++ K + ASI R+FPLSP ELR+ ++ +
Sbjct: 319 PSEAVFPGRAFRLVALTPYKPKLFRRFLKEHQIFGASIQRRDFPLSPEELRRLYRLKESS 378
Query: 385 EKTLMGTTMADGKKVL 400
+ L + A G+ +
Sbjct: 379 SRFLFFSRNAAGEATV 394
>gi|110597296|ref|ZP_01385584.1| conserved hypothetical protein [Chlorobium ferrooxidans DSM 13031]
gi|110341132|gb|EAT59600.1| conserved hypothetical protein [Chlorobium ferrooxidans DSM 13031]
Length = 418
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 176/379 (46%), Gaps = 25/379 (6%)
Query: 29 GSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFI 88
G D+P A+A QI + KLP + LY P LSLEQ+SG +++K+ +
Sbjct: 35 GRKDLPVR---AMAEQIGCRRKAAKKLPSLSTYNLLYTP--LSLEQASGERAAAFKASML 89
Query: 89 REGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGD 148
G +++DL+GGLGID I L +Y ER+ + HN+ + + +
Sbjct: 90 -SGKRLIDLSGGLGIDSIFLSRTFDDVLYCERDPLLSTLMEHNL-----KQCGIGNVQVQ 143
Query: 149 FKEYLPLIKTFHSD---YIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAK 205
+ L L+ + D +IYVDPARR + V + PD++ LL + + K
Sbjct: 144 HADSLTLLAGYPDDSFEWIYVDPARREEGQRSV-TLESASPDVVLNHDLLLRKAAKVCIK 202
Query: 206 LSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNL-LSED 264
SP ++L LP + E+ VV+ E KE L+ + P V I AV L D
Sbjct: 203 ASPALELSTLKAILPALSEIVVVSVDRECKESLLLLDRGARDRP---VTIRAVALSTGSD 259
Query: 265 TVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYT 324
V+ + +R++ A S+ +++YEP A+ KA +A GL L+ + T
Sbjct: 260 AVVEVCGDPDADRAV----ASSLKQFLYEPDPAIIKARLSAVLARNCGLEFLNGSVDYLT 315
Query: 325 SEAYESAFPGRTFVLEEIIPFSTSVLKQL--GKVVPRASISCRNFPLSPIELRQRSKMAD 382
E + FPGR F + + +P+ + + ASI R+FPLS +LR R + +
Sbjct: 316 GEQLIADFPGRLFRVLDAVPWKPKTFRAFLARHHITGASIQRRDFPLSVEKLRTRFGIRE 375
Query: 383 GGEKTLMGTTMADGKKVLL 401
L T A+ + V++
Sbjct: 376 SERVFLFFTRNAEREPVVI 394
>gi|68552180|ref|ZP_00591572.1| conserved hypothetical protein [Prosthecochloris aestuarii DSM 271]
gi|68240995|gb|EAN23264.1| conserved hypothetical protein [Prosthecochloris aestuarii DSM 271]
Length = 401
Score = 120 bits (300), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/379 (28%), Positives = 169/379 (44%), Gaps = 16/379 (4%)
Query: 29 GSNDIPPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFI 88
G D+P A+A QI + KLP +Y + LSLEQ+SG + YK+ +
Sbjct: 35 GRRDLPVR---AIAEQIACRRKAARKLPSLSQHPLIY--TTLSLEQASGEKAARYKAGLM 89
Query: 89 REGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGD 148
G +V+DL+GGLGID I L S+ + +Y ER+ A A N L +++++TGD
Sbjct: 90 -GGDRVIDLSGGLGIDAIFLASRFREVMYCERDAMLAAIAETNFRTL--GITNISVVTGD 146
Query: 149 FKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSP 208
L + D+IYVDPARR + R + PD+ L LL K SP
Sbjct: 147 SLSLLQSVADDGFDWIYVDPARREHS-GRSAGLRSASPDVTQLHDLLLRKARKFCVKASP 205
Query: 209 MIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIP 268
I++ + L + + V++ E KE+L+ + + V + L E
Sbjct: 206 AIEISALGRELRSLVSVTVLSVDRECKEVLLFCDREGSGVRLPSVRSVCLGLSGE----- 260
Query: 269 FIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAY 328
F+ E S P A +Y++EP A+ KA +A L+ L+ + T
Sbjct: 261 FVLNETGEDSSVRPVAVDPGRYLFEPDPAIIKARQSHLLALHYDLQFLNSSVDYLTGSVP 320
Query: 329 ESAFPGRTFVLEEIIPFSTSVLKQL--GKVVPRASISCRNFPLSPIELRQRSKMADGGEK 386
FPGR F + + + LK L + + ASI R+FPLSP +R++ ++ +
Sbjct: 321 VEGFPGRVFSIIGSFAYKPAALKALLNKRGITAASIQRRDFPLSPDVIRKKFRLKESDHT 380
Query: 387 TLMGTTMADGKKVLLLLRK 405
L T G L +
Sbjct: 381 FLFFTRDRSGNLFCLCCER 399
>gi|78187905|ref|YP_375948.1| hypothetical protein Plut_2063 [Pelodictyon luteolum DSM 273]
gi|78167807|gb|ABB24905.1| conserved hypothetical protein [Pelodictyon luteolum DSM 273]
Length = 417
Score = 118 bits (296), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 163/333 (48%), Gaps = 21/333 (6%)
Query: 71 SLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARH 130
+LEQ+S +SY + ++ G + +DL+GGLGID +L + +++ER+ + A H
Sbjct: 72 ALEQASEERAASYMASLLK-GRRAIDLSGGLGIDAFSLAGSFEEVVHVERDPVLSAIACH 130
Query: 131 NIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIP 190
N+ +L +++ L GD +E L + D+IY+DP RR GA + + + + PD++
Sbjct: 131 NMGVLGR--RNVECLRGDGRERLASFPDGYFDWIYLDPDRREGAHRNIR-LEEGSPDVVS 187
Query: 191 LAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPP 250
L LL + K SP ++ LP + + V++ G+ + L+ + P
Sbjct: 188 LHDFLLQKAPGVCIKASPALETSRLKDRLPSLASVTVLSVDGQCRATLLILRRQ----AP 243
Query: 251 EKVPIHAVNLL-SEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAY 309
+V HAV + + + F + ER +S D++ +++ EP A+ KAG VA
Sbjct: 244 LEVERHAVCIRGAGEEARRFTWRDTGERRVS----DAVMEWLLEPDPAILKAGLEGVVAE 299
Query: 310 RLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPR-----ASISC 364
GL ++ + T+ FPGR F + E +S K GK + R A+I
Sbjct: 300 TFGLLFINRSVGYLTAAHRPEGFPGRVFRVVEAALYSG---KGFGKFLKRHGIDGAAIQR 356
Query: 365 RNFPLSPIELRQRSKMADGGEKTLMGTTMADGK 397
R+FPLS +R R ++ + + L T A+ +
Sbjct: 357 RDFPLSADAIRARYRLKEDDRRFLFFTRDAEAR 389
>gi|167752881|ref|ZP_02425008.1| hypothetical protein ALIPUT_01143 [Alistipes putredinis DSM 17216]
gi|167659950|gb|EDS04080.1| hypothetical protein ALIPUT_01143 [Alistipes putredinis DSM 17216]
Length = 383
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 170/362 (46%), Gaps = 33/362 (9%)
Query: 39 AAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLT 98
A VATQ++ R R KLP + + P L+ EQ+S ++ KS G V+DLT
Sbjct: 40 ALVATQVKYLARARTKLPSYYEARCILPP--LAFEQASSEACAARKSC---SGNTVLDLT 94
Query: 99 GGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKT 158
GLG+D + L + + I +ER+ A A N L ++ ++ + +L
Sbjct: 95 CGLGVDALYLSKRFREVITLERDATLAAVAEENFRRL--GATNIRVINSSAEAFLASTLD 152
Query: 159 FHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAELLPFCSSILAKLSPMIDLWDTLQS 218
H D+IY DP RRS +++ + DC PD+ L EL ++ K SP+ D+ +T +
Sbjct: 153 -HFDWIYADPDRRSAGGRKLVRLEDCSPDIPKLLPELRRIAPNLCVKNSPLFDVGETFRL 211
Query: 219 LPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNL-LSEDTVIPFIFTMEEER 277
+ VV+ H E KE++V + T P + AV L L E F T E R
Sbjct: 212 FGPCR-TEVVSLHDECKEVVV---YTDGTGP----LLTAVALELGE-----FSCTPAEAR 258
Query: 278 SISV--PYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKL-HPNSHLYTSEAYESAFPG 334
+ P+ + +++ P AL KA T Y G + PN + + +E E G
Sbjct: 259 ATPCDKPFDPAAYRWLLIPDVALQKARL--TPTYLQGHADIWSPNGYGFAAEKSEDTL-G 315
Query: 335 RTFVLEEIIPFSTSVLKQLGKVV--PRASISCRNFPLSPIELRQRSKMADGGEKTLMGTT 392
R +E I P++ KQL + + R +I ++FPL+ E+ R + +GGE+ + T
Sbjct: 316 RWLEIERIEPYNP---KQLKRELSGSRLTILKQDFPLTAAEIAARLGIREGGERRIAFTK 372
Query: 393 MA 394
+
Sbjct: 373 LG 374
>gi|159896610|ref|YP_001542857.1| hypothetical protein Haur_0077 [Herpetosiphon aurantiacus ATCC
23779]
gi|159889649|gb|ABX02729.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC
23779]
Length = 392
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 176/358 (49%), Gaps = 31/358 (8%)
Query: 52 RNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSK 111
R ++P S LY +R +LEQ++ + +SY+ R G+++VDL +G D +AL
Sbjct: 59 RAAQAKFPQASQLYF-TREALEQATPWLVASYRQRHFATGSRLVDLGCSVGGDALALAQS 117
Query: 112 ASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARR 171
S + I+R D +A L + ++I DF ++ +++DPARR
Sbjct: 118 CSV-LAIDR-DPLRLAMLEANAQALGLSQQISIQEADFTT----LEFAGYAGLFIDPARR 171
Query: 172 SGADKRVYAIADCEPDLIPLAA--ELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVA 229
S KR++ + +P L L +P AK++P I ++P +L ++
Sbjct: 172 SNG-KRIWDVEHYQPPLSTLERWRGQVPIHG---AKVAPGI----PDDAVPAGYDLEFIS 223
Query: 230 AHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDK 289
G+++E + + + V + + +E ++I + + ++S P A
Sbjct: 224 LDGDLREACLWWQAGQVGGQRKAVVLTSAG--AEHSLIAD--STQAAAALSEPLA----- 274
Query: 290 YVYEPHTALFKAGAFKTVAYRLGLRKLHPN-SHLYTSEAYESAFPGRTFVLEEIIPFSTS 348
Y+YEP A+ +A A +A +L L + + ++L + +S F R + +E+ +PF+
Sbjct: 275 YLYEPDPAVIRAHAVADIANQLDLAQFDASIAYLTSDRLVQSPFL-RAWQIEQWLPFNLK 333
Query: 349 VLKQL--GKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLMGTTMADGKKVLLLLR 404
+L+Q+ + + R ++ R P++P EL ++ ++ E+TL+ T + G+ V+LL++
Sbjct: 334 LLRQILQAREIGRVTVKKRGSPITPEELSKQLRLKGRYEQTLVLTKL-QGQPVVLLVK 390
>gi|163847010|ref|YP_001635054.1| hypothetical protein Caur_1437 [Chloroflexus aurantiacus J-10-fl]
gi|187602018|ref|ZP_02988244.1| conserved hypothetical protein [Chloroflexus sp. Y-400-fl]
gi|163668299|gb|ABY34665.1| conserved hypothetical protein [Chloroflexus aurantiacus J-10-fl]
gi|187485703|gb|EDU23678.1| conserved hypothetical protein [Chloroflexus sp. Y-400-fl]
Length = 396
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 171/371 (46%), Gaps = 41/371 (11%)
Query: 35 PEYRAAVAT-QIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTK 93
P RA A Q L R +K PQ + + +R +L Q+S A +++++R + +
Sbjct: 43 PAARARAAIEQALLRRRAISKFPQ----ADRMLFTREALAQASAAPVAAHRARRLAQAGN 98
Query: 94 VVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGD-FKEY 152
V DL G+G D IAL +Q I +ER+ AR N+ +L G ++ L D +E
Sbjct: 99 VADLGCGIGGDTIALADAGAQVIAVERDPIRLALARFNVE-VLGLGSRVSFLERDLLREP 157
Query: 153 LPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLAAEL--LPFCSSILAKLSPMI 210
LPL ++ DPARRSG ++R++ D +P PL+ L ++ KL+P I
Sbjct: 158 LPLAAA-----LFCDPARRSG-ERRLFDPDDFQP---PLSHVLSWRQHTPALAVKLAPGI 208
Query: 211 DLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFI 270
+P E+ V+ G++KE ++ P A L S+ ++ I
Sbjct: 209 Q----RSVVPEDAEVEFVSLDGDLKEAVIWCG------PCATTSRRATVLGSDGSMSTMI 258
Query: 271 FTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPN-SHLYTSEAYE 329
S+S P A +YEP A+ +AG +A++LG +L P+ ++L A
Sbjct: 259 ADATTAPSLSPPLA-----VLYEPDPAVIRAGLVAELAHQLGAYQLAPDIAYLTAHTALP 313
Query: 330 SAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISC--RNFPLSPIELRQRSKMADGGEKT 387
+ F R + + IPF L+ L + + ++ R PL L + +++ G++
Sbjct: 314 TPF-ARAWPIITWIPFQLKRLRALVRELDAGVVTVKKRGSPLDTDSLAR--QLSGSGQRH 370
Query: 388 LMG--TTMADG 396
L+ T M DG
Sbjct: 371 LVVVLTQMPDG 381
>gi|116671434|ref|YP_832367.1| hypothetical protein Arth_2888 [Arthrobacter sp. FB24]
gi|116611543|gb|ABK04267.1| conserved hypothetical protein [Arthrobacter sp. FB24]
Length = 414
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 148/361 (40%), Gaps = 51/361 (14%)
Query: 35 PEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSG-AVTSSYKSRFIREGTK 93
P +AV TQ L + K ++ ++ P+ LEQ++ V + + RF G +
Sbjct: 46 PALVSAVLTQSRLRTKAEAKFGEF-ARQMIFTPA--GLEQATRLNVAARHAERFASAGIE 102
Query: 94 -VVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHN-IPLLLNEGKDLNILTGDFKE 151
V DL GLG D +AL S + +E ++ TA A N IP
Sbjct: 103 HVADLGCGLGADSMALASMDIRVTAVEMDETTAACATMNLIPF----------------- 145
Query: 152 YLPLIKTFHSDY----------IYVDPARRSGADKRVYAIADCEPDLIPLA--AELLPFC 199
P HSD +++DPARR+ + I D E PL+ L
Sbjct: 146 --PHASVVHSDATSVPLEGVGGVWLDPARRTTSTSGTRRIWDPEEFSPPLSFVKSLAESG 203
Query: 200 SSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVN 259
++ K+ P I S+P E V+ G+V E V + N + P I
Sbjct: 204 RAVGVKMGPGI----PHDSVPAGCEAQWVSVGGDVTE--VTLWFNAVSRPG----IRRAA 253
Query: 260 LLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPN 319
LL I + EE V ++ Y+YEP A+ +AG VA RLG L +
Sbjct: 254 LLLGPQGAAEITSAEEFDGGPVAPVGPVEGYLYEPDGAVIRAGLVADVAARLGGHLLDEH 313
Query: 320 -SHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKV--VPRASISCRNFPLSPIELRQ 376
+++ E ++ F R + + E++PF+ LK K + I R ++P ELR+
Sbjct: 314 IAYICAPELVDTPF-ARAYKVLEVMPFNVKALKAWVKQNNIGVLDIKKRGTSVTPEELRK 372
Query: 377 R 377
+
Sbjct: 373 Q 373
>gi|118046552|ref|ZP_01515201.1| conserved hypothetical protein [Chloroflexus aggregans DSM 9485]
gi|117996944|gb|EAV11136.1| conserved hypothetical protein [Chloroflexus aggregans DSM 9485]
Length = 396
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 143/336 (42%), Gaps = 34/336 (10%)
Query: 68 SRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMSKASQGIYIERNDETAVA 127
+R +LEQ+S A +++++ + +V DL G+G D IA+ Q I +ER D +A
Sbjct: 73 TRDALEQASAAPVAAHRAARLACQRRVADLGCGIGGDTIAMALAGIQVIAVER-DPIRLA 131
Query: 128 ARHNIPLLLNEGKDLNILTGDFKEYLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPD 187
L G+ + L D P H+D ++ DPARR G D+R++ A +P
Sbjct: 132 LAQANLAALGLGERVLWLKRDLLHEPP----PHADALFCDPARRVG-DQRIFDPAAFQPP 186
Query: 188 LIPLAAELLPFCSSILAKLSPMIDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEAT 247
L + + +++ KL+P ID +P EL V+ GE+KE ++ T
Sbjct: 187 LTHVLG-WQRYNPALVVKLAPGID----RNRIPAEAELEFVSFDGELKEAVLWCGSLATT 241
Query: 248 IPPEKVPIHAVNLLSEDTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTV 307
V A N +S + S P + +YEP + +AG +
Sbjct: 242 ERRATVLNGAGNAVSLTA-----------GAASPPPLSTPQIVLYEPDPTIIRAGLIAEL 290
Query: 308 AYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISCRNF 367
A +LG +L P+ T+ Y RT+ + +PF L+ L R+
Sbjct: 291 AAQLGAAQLSPDIAYLTATTYHPTPFARTWPIITWLPFQLKRLRAL----------LRDL 340
Query: 368 PLSPIELRQRSKMADGGEKTLMGTTMADGKKVLLLL 403
P+ +++R D TL +G + L+++
Sbjct: 341 NAGPVTVKKRGSPLD--TTTLAHQLSGNGNRRLVVV 374
>gi|119964237|ref|YP_948586.1| hypothetical protein AAur_2877 [Arthrobacter aurescens TC1]
gi|119951096|gb|ABM10007.1| conserved hypothetical protein [Arthrobacter aurescens TC1]
Length = 409
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 151/350 (43%), Gaps = 29/350 (8%)
Query: 35 PEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSG-AVTSSYKSRFIREG-T 92
PE AAV TQ L R K ++ + + ++ LEQ++ V + + RF + G +
Sbjct: 46 PELVAAVLTQSRLRTRAEAKFGEF---ARQMLFTQAGLEQATRLNVAARHAERFAKAGIS 102
Query: 93 KVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEY 152
V DL GLG D +A+ S +E ++ TA A N+ + ++ D
Sbjct: 103 HVADLGCGLGADSMAMASMDISVTAVEMDETTAACATINLMPFPHA----TVVHADATS- 157
Query: 153 LPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLA--AELLPFCSSILAKLSPMI 210
++ D +++DPARR+ + I D E PL+ L +I K+ P I
Sbjct: 158 ---VELDGMDGVWLDPARRTTSTSGTKRIWDPEAFSPPLSFVERLAATGRAIGVKMGPGI 214
Query: 211 DLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPFI 270
S+P E V+ G+V E V + N P + L+ + I
Sbjct: 215 ----PHDSVPPGCEAQWVSVGGDVTE--VTLWFNAVARPG----VRRAALVIGNQGAAEI 264
Query: 271 FTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPN-SHLYTSEAYE 329
+ + V ++ ++YEP A+ +AG VA RLG + + +++ E ++
Sbjct: 265 TSGADFDGGPVAAVGPVEGFLYEPDGAVIRAGLVADVAGRLGGHLVDEHIAYICAPELHD 324
Query: 330 SAFPGRTFVLEEIIPFSTSVLKQLGK--VVPRASISCRNFPLSPIELRQR 377
+ F R F + E++P++ LK K + I R ++P ELR++
Sbjct: 325 TPF-ARAFKVLEVMPYNVKALKAWVKNQGITVLDIKKRGTAVTPEELRKQ 373
>gi|163840422|ref|YP_001624827.1| methyltransferase [Renibacterium salmoninarum ATCC 33209]
gi|162953898|gb|ABY23413.1| methyltransferase [Renibacterium salmoninarum ATCC 33209]
Length = 406
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/351 (23%), Positives = 150/351 (42%), Gaps = 28/351 (7%)
Query: 34 PPEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSG-AVTSSYKSRFIREGT 92
P + +AV TQ L + + K ++ + I ++ LEQ++ +V + + RF G
Sbjct: 42 PADLVSAVLTQSRLRAKAQAKFGEF---AQQMIFTQAGLEQATRLSVAALHAQRFSNAGV 98
Query: 93 -KVVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKE 151
++ DL G+G D +A + + +E ++ TA A N+ N ++ D
Sbjct: 99 QRIADLGCGIGADSLAFATLDLEVTAVELDETTAACAMMNLMPFPNA----KVVQADAAS 154
Query: 152 YLPLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDLIPLA--AELLPFCSSILAKLSPM 209
+ P D +++DPARR I D E PL+ L + KL P
Sbjct: 155 FDPKA----VDGVWLDPARRDTTTSGTARIFDPEASSPPLSFVEGLAETGMPVGVKLGPG 210
Query: 210 IDLWDTLQSLPHVQELHVVAAHGEVKELLVRMSLNEATIPPEKVPIHAVNLLSEDTVIPF 269
I +S+P E V+ G++ E V + N+ P + A LL
Sbjct: 211 I----PHESIPANCEAQWVSVDGDLVE--VALWFNKLARPEIR---RAALLLGTGQGAEL 261
Query: 270 IFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPN-SHLYTSEAY 328
+++ + + + Y+YEP A+ +AG +A LG + P+ ++L +
Sbjct: 262 TSSIDFDAAAQGAEVGPLQGYLYEPDAAVIRAGLVADLANHLGAHLIDPHIAYLCADDLV 321
Query: 329 ESAFPGRTFVLEEIIPFSTSVLKQLGKV--VPRASISCRNFPLSPIELRQR 377
E+ F R + + E+ P++ VL+ + + I R ++P ELR++
Sbjct: 322 ETPF-ARAYRVIEVKPYNVKVLRAWVRESDIGTLEIKKRGTSVTPEELRKQ 371
>gi|168704435|ref|ZP_02736712.1| hypothetical protein GobsU_33174 [Gemmata obscuriglobus UQM 2246]
Length = 391
Score = 59.7 bits (143), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 85/353 (24%), Positives = 144/353 (40%), Gaps = 48/353 (13%)
Query: 51 LRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTKVVDLTGGLGIDFIALMS 110
LR K + ++L +R +LEQS+ + +++R E V DL G+G D IAL +
Sbjct: 56 LRTKAREKFADAALMYFTREALEQSTSEIVGRHRARRFAEFGNVADLCCGIGADAIAL-A 114
Query: 111 KASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGD-FKEYLPLIKTFHSDYIYVDPA 169
+A + D VA L + + GD LP + + DP
Sbjct: 115 RAGLTVAAVDLDPLRVAMCGANAAALGVTDRVRGIAGDALAAPLPDARA-----AFADPD 169
Query: 170 RRSGADKRVYAIADCEPDLIPLAAELLPFCSS--ILAKLSPMIDLWDTLQSLPHVQELHV 227
RR+ +R D P L L F + + K++P + D +P E
Sbjct: 170 RRANG-RRFLDPEDYSPSLGALRGR---FGADFPLGVKIAPGVAKSD----IPSAAEAEF 221
Query: 228 VAAHGEVKELL-----VRMSLNEATIPPEKVPIHAVNLLSEDTVIPFIFTMEEERSISVP 282
V+ GE+KE + +R ++ AT+ P S +T+ T E E S P
Sbjct: 222 VSLRGELKECILWFGPLRQAVRRATVLP-----------SGETL-----TGEGEASPPPP 265
Query: 283 YADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLYTSEAYESAFPGRTFVLEEI 342
A+ + + +Y+P A+ +AG +A RL L T A+ + +E
Sbjct: 266 LAERVGEVLYDPDPAVTRAGLVPQLAERLSAEATDFEVQLLTGGAHAPTAFATAYRVEHA 325
Query: 343 IPFSTSVL------KQLGKVVPRASISCRNFPLSPIELRQRSKMADGGEKTLM 389
PF + L +Q+G+V ++ R P P E+ ++ K+ G + ++
Sbjct: 326 APFHPNHLRDYLRERQIGRV----TVINRGSPADPAEVVKKLKLKGPGHRGVL 374
>gi|152964703|ref|YP_001360487.1| hypothetical protein Krad_0734 [Kineococcus radiotolerans SRS30216]
gi|151359220|gb|ABS02223.1| conserved hypothetical protein [Kineococcus radiotolerans SRS30216]
Length = 396
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 86/381 (22%), Positives = 152/381 (39%), Gaps = 45/381 (11%)
Query: 35 PEYRAAVATQIELWPRLRNKLPQWPGISSLYIPSRLSLEQSSGAVTSSYKSRFIREGTK- 93
P A TQ L R +K P L + + + + + V + + RF G +
Sbjct: 43 PALAALATTQSRLRARAASKF--GPRAQDLLLTAAGAEQATRAVVAAEHARRFAAAGVRR 100
Query: 94 VVDLTGGLGIDFIALMSKASQGIYIERNDETAVAARHNIPLLLNEGKDLNILTGDFKEYL 153
V DL G+G D +A + + ++ + TA A HN L + ++ G E
Sbjct: 101 VADLGCGVGADSLAFLDAGLDVLAVDADPVTAAVAAHN---LGRPVRCADVTAGVVDELG 157
Query: 154 PLIKTFHSDYIYVDPARRSGADKRVYAIADCEPDL--IPLAAELLPFCSSILAKLSPMI- 210
P D + DPARR+ +R++ P L + A +P AKL+P +
Sbjct: 158 P------GDGAWFDPARRTSGGRRLFDPEATSPPLSFVLATAARVPATG---AKLAPGLP 208
Query: 211 -DLWDTLQSLPHVQELHVVAAHGEVKEL------LVRMSLNEATIPPEKVPIHAVNLLSE 263
DL +P E + GEV E L R + P A + +
Sbjct: 209 HDL------VPAGAEAQWTSVDGEVVECALWSGPLARPGTGRSARVVRGGPSGAAVVEVD 262
Query: 264 DTVIPFIFTMEEERSISVPYADSIDKYVYEPHTALFKAGAFKTVAYRLGLRKLHPNSHLY 323
DT +P V + ++++EP A+ +AG VA LG R +
Sbjct: 263 DTDLP------------VADVGPVGEFLHEPDGAVIRAGLVARVAADLGGRLVDETIAYV 310
Query: 324 TSEAYESAFPGRTFVLEEIIPFSTSVLKQLGKVVPRASISC--RNFPLSPIELRQRSKMA 381
T++ ++ R F + E++PF L+ + + +++ R + P +LR++ +
Sbjct: 311 TTDTGLTSPLTRAFRVLEVLPFGLKPLRARLRALDVGTLTVKKRGTAVDPDQLRRQLALK 370
Query: 382 DGGEKTLMGTTMADGKKVLLL 402
T++ T +A + VL++
Sbjct: 371 GSRPATIVLTRVAGRQSVLVV 391
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.319 0.136 0.394
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,717,771,996
Number of Sequences: 6515104
Number of extensions: 71640026
Number of successful extensions: 201087
Number of sequences better than 1.0e-04: 62
Number of HSP's better than 0.0 without gapping: 34
Number of HSP's successfully gapped in prelim test: 28
Number of HSP's that attempted gapping in prelim test: 200836
Number of HSP's gapped (non-prelim): 63
length of query: 407
length of database: 2,222,278,849
effective HSP length: 136
effective length of query: 271
effective length of database: 1,336,224,705
effective search space: 362116895055
effective search space used: 362116895055
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 123 (52.0 bits)