BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PG1774
(150 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34541619|ref|NP_906098.1| hypothetical protein PG2031 [P... 288 7e-77
gi|150010075|ref|YP_001304818.1| hypothetical protein BDI_3... 104 1e-21
gi|154491976|ref|ZP_02031602.1| hypothetical protein PARMER... 103 2e-21
gi|156109355|gb|EDO11100.1| hypothetical protein BACOVA_030... 87 2e-16
gi|153808345|ref|ZP_01961013.1| hypothetical protein BACCAC... 86 5e-16
gi|29347828|ref|NP_811331.1| hypothetical protein BT_2418 [... 85 1e-15
gi|150002747|ref|YP_001297491.1| hypothetical protein BVU_0... 84 2e-15
gi|53711883|ref|YP_097875.1| hypothetical protein BF0592 [B... 84 3e-15
gi|86131885|ref|ZP_01050482.1| hypothetical protein MED134_... 80 3e-14
gi|116246348|ref|XP_001230367.1| ENSANGP00000030003 [Anophe... 80 4e-14
gi|149372189|ref|ZP_01891459.1| hypothetical protein SCB49_... 77 2e-13
gi|156862767|gb|EDO56198.1| hypothetical protein BACUNI_001... 75 9e-13
gi|89891492|ref|ZP_01202997.1| conserved hypothetical prote... 75 1e-12
gi|120435794|ref|YP_861480.1| hypothetical protein GFO_1440... 72 7e-12
gi|146297908|ref|YP_001192499.1| hypothetical protein Fjoh_... 72 7e-12
gi|150026044|ref|YP_001296870.1| hypothetical protein FP200... 72 8e-12
gi|88714202|ref|ZP_01108278.1| hypothetical protein FB2170_... 69 1e-10
gi|126663446|ref|ZP_01734443.1| hypothetical protein FBBAL3... 68 2e-10
gi|86141621|ref|ZP_01060167.1| hypothetical protein MED217_... 67 5e-10
gi|91214899|ref|ZP_01251872.1| hypothetical protein P700755... 65 1e-09
gi|88803471|ref|ZP_01118997.1| hypothetical protein PI23P_1... 65 1e-09
gi|86133081|ref|ZP_01051663.1| hypothetical protein MED152_... 65 1e-09
gi|88805384|ref|ZP_01120903.1| hypothetical protein RB2501_... 65 1e-09
gi|83857050|ref|ZP_00950578.1| hypothetical protein CA2559_... 63 6e-09
gi|126645441|ref|ZP_01717985.1| hypothetical protein ALPR1_... 60 3e-08
gi|124006212|ref|ZP_01691047.1| conserved hypothetical prot... 58 2e-07
gi|94502325|ref|ZP_01308799.1| Hypothetical protein SMU_153... 57 4e-07
>gi|34541619|ref|NP_906098.1| hypothetical protein PG2031 [Porphyromonas gingivalis W83]
gi|34397937|gb|AAQ66997.1| hypothetical protein PG_2031 [Porphyromonas gingivalis W83]
Length = 150
Score = 288 bits (737), Expect = 7e-77, Method: Composition-based stats.
Identities = 149/150 (99%), Positives = 150/150 (100%)
Query: 1 LPLRQKRLYWPLFFRDDLIMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLE 60
+PLRQKRLYWPLFFRDDLIMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLE
Sbjct: 1 MPLRQKRLYWPLFFRDDLIMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLE 60
Query: 61 DNGDFYFQWDKNSGEKASVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVI 120
DNGDFYFQWDKNSGEKASVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVI
Sbjct: 61 DNGDFYFQWDKNSGEKASVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVI 120
Query: 121 DYIAPEEVADMEHFWNNLISRLRLSLGAPE 150
DYIAPEEVADMEHFWNNLISRLRLSLGAPE
Sbjct: 121 DYIAPEEVADMEHFWNNLISRLRLSLGAPE 150
>gi|150010075|ref|YP_001304818.1| hypothetical protein BDI_3494 [Parabacteroides distasonis ATCC
8503]
gi|149938499|gb|ABR45196.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 127
Score = 104 bits (260), Expect = 1e-21, Method: Composition-based stats.
Identities = 56/128 (43%), Positives = 85/128 (66%), Gaps = 2/128 (1%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
M+K K +EY+FD+VS RSLW +L+T GLS WFA +V + DN + F+W K + E A+V
Sbjct: 1 MKKEKFHIEYIFDKVSRRSLWNHLTTALGLSAWFADEVIINDNL-YTFKWSKEAQE-ATV 58
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
++ + IR++W E+D+NA FEF+IH ELTG +LE+ D+ P E D + W++ +
Sbjct: 59 IDSKPENFIRYRWVDEEDENAYFEFIIHTIELTGSTALEITDFSEPGEKKDSINLWDSQV 118
Query: 140 SRLRLSLG 147
L+ +LG
Sbjct: 119 EDLKRTLG 126
>gi|154491976|ref|ZP_02031602.1| hypothetical protein PARMER_01607 [Parabacteroides merdae ATCC
43184]
gi|154088217|gb|EDN87262.1| hypothetical protein PARMER_01607 [Parabacteroides merdae ATCC
43184]
Length = 127
Score = 103 bits (258), Expect = 2e-21, Method: Composition-based stats.
Identities = 58/129 (44%), Positives = 84/129 (65%), Gaps = 4/129 (3%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKAS 78
M+K K +EY+FD+VS RSLW +L+TP GLS WFA DV + NG+ Y F+W+K + ++A
Sbjct: 1 MKKEKFHIEYIFDKVSRRSLWNHLTTPPGLSAWFADDVII--NGNIYVFKWNK-AEQEAE 57
Query: 79 VLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNL 138
VL + IR++W E+D+NA FEF IH ELTG SL++ D+ +E D W+
Sbjct: 58 VLSIKPEMSIRYRWMDEEDENAYFEFQIHSHELTGSTSLQITDFAEQDEKKDSIDLWDTQ 117
Query: 139 ISRLRLSLG 147
+ L+ +LG
Sbjct: 118 VEELKRTLG 126
>gi|156109355|gb|EDO11100.1| hypothetical protein BACOVA_03002 [Bacteroides ovatus ATCC 8483]
Length = 128
Score = 87.4 bits (215), Expect = 2e-16, Method: Composition-based stats.
Identities = 46/128 (35%), Positives = 73/128 (57%), Gaps = 1/128 (0%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
M+K K+ LEY+ + S LW +STP GL +WFA D + D+ F W K KA +
Sbjct: 1 MKKEKIHLEYLLNATSKNILWGAISTPTGLEDWFA-DKVISDDKIVEFHWGKTEQRKAEI 59
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
+ R IRF+W ++++ FE + ++ELT LE+ D+ P+EV DM+ W + +
Sbjct: 60 IAIRSFSFIRFRWQDDENERDYFEIKMTYNELTSDYVLEITDFAEPDEVDDMKELWESQV 119
Query: 140 SRLRLSLG 147
++LR + G
Sbjct: 120 AKLRRTCG 127
>gi|153808345|ref|ZP_01961013.1| hypothetical protein BACCAC_02639 [Bacteroides caccae ATCC 43185]
gi|149129248|gb|EDM20464.1| hypothetical protein BACCAC_02639 [Bacteroides caccae ATCC 43185]
Length = 128
Score = 86.3 bits (212), Expect = 5e-16, Method: Composition-based stats.
Identities = 46/128 (35%), Positives = 72/128 (56%), Gaps = 1/128 (0%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
M+K K+ LEY+ + S LW +STP GL +WFA D + D+ F W K KA +
Sbjct: 1 MKKEKIHLEYLLNATSKNILWAAISTPTGLEDWFA-DKVISDDKIVEFHWGKTEQRKAEI 59
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
R IRF+W ++++ FE + ++ELT LE+ D+ P+EV DM+ W + +
Sbjct: 60 TAIRSFSFIRFRWEDDENERDYFEIKMTYNELTSDYVLEITDFAEPDEVDDMKELWESQV 119
Query: 140 SRLRLSLG 147
++LR + G
Sbjct: 120 AKLRRTCG 127
>gi|29347828|ref|NP_811331.1| hypothetical protein BT_2418 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29339730|gb|AAO77525.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 128
Score = 85.1 bits (209), Expect = 1e-15, Method: Composition-based stats.
Identities = 45/128 (35%), Positives = 72/128 (56%), Gaps = 1/128 (0%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
M+K K+ LEY+ + S LW +STP GL +WFA D + D+ F W K A +
Sbjct: 1 MKKEKIHLEYLLNATSKNILWAAISTPTGLEDWFA-DKVVSDDKIVEFHWGKTEQRNAEI 59
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
+ R IRF+W ++++ FE + ++ELT LE+ D+ +EVADM+ W + +
Sbjct: 60 IAIRSFSFIRFRWQDDENERDYFEIKMTYNELTSDYVLEITDFAEADEVADMKELWESQV 119
Query: 140 SRLRLSLG 147
++LR + G
Sbjct: 120 AKLRRTCG 127
>gi|150002747|ref|YP_001297491.1| hypothetical protein BVU_0141 [Bacteroides vulgatus ATCC 8482]
gi|149931171|gb|ABR37869.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 128
Score = 84.3 bits (207), Expect = 2e-15, Method: Composition-based stats.
Identities = 49/124 (39%), Positives = 69/124 (55%), Gaps = 1/124 (0%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
M K K++LEY+ S +W +STP GL WFA V +D F F W K +A V
Sbjct: 1 MRKEKIRLEYMLKAGSGNIVWSIISTPSGLETWFADKVIFKDKV-FTFYWGKTETRQAEV 59
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
R IRF+W ++D A FE + ++ELT LEVID+ AP+EV D + W++ I
Sbjct: 60 TNFRVNSFIRFRWLDDEDPKAYFELKMVYNELTSDYMLEVIDWAAPDEVEDTKELWDSEI 119
Query: 140 SRLR 143
+L+
Sbjct: 120 EKLK 123
>gi|53711883|ref|YP_097875.1| hypothetical protein BF0592 [Bacteroides fragilis YCH46]
gi|60680111|ref|YP_210255.1| hypothetical protein BF0542 [Bacteroides fragilis NCTC 9343]
gi|52214748|dbj|BAD47341.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60491545|emb|CAH06297.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 128
Score = 84.0 bits (206), Expect = 3e-15, Method: Composition-based stats.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 1/128 (0%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
M+K K+ LEY+ + S LW +STP GL +WFA D + D+ F W K +A +
Sbjct: 1 MKKEKIHLEYLLNATSKNILWSAISTPTGLEDWFA-DKVVSDDKTVTFCWGKTEQRQAGI 59
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
+ R IRF W ++++ FE + ++ELTG LE+ D+ +E D++ W++ +
Sbjct: 60 VAIRAYSFIRFHWLDDENERDYFEIKMSYNELTGDYVLEITDFSEADEADDLKELWDSQV 119
Query: 140 SRLRLSLG 147
S+LR + G
Sbjct: 120 SKLRRTCG 127
>gi|86131885|ref|ZP_01050482.1| hypothetical protein MED134_02760 [Cellulophaga sp. MED134]
gi|85817707|gb|EAQ38881.1| hypothetical protein MED134_02760 [Dokdonia donghaensis MED134]
Length = 128
Score = 80.5 bits (197), Expect = 3e-14, Method: Composition-based stats.
Identities = 50/129 (38%), Positives = 77/129 (59%), Gaps = 5/129 (3%)
Query: 21 EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASV 79
+K K +E+V R S L++Y++TP G+SEWFA +V G+FY F WD S EKA +
Sbjct: 3 DKKKYVIEFVV-RASPSLLYQYMATPSGMSEWFADNV--NSRGEFYTFIWD-GSEEKAKL 58
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
L ++ GE I+FQW ++ + F+ I E+T SL + DY +E+ + + FW N I
Sbjct: 59 LSKKSGEYIKFQWLDDEGEEYFFQLRIQVDEITKDVSLMITDYAEEDEIDEGKMFWENQI 118
Query: 140 SRLRLSLGA 148
S L+ +G+
Sbjct: 119 SELKQVIGS 127
>gi|116246348|ref|XP_001230367.1| ENSANGP00000030003 [Anopheles gambiae str. PEST]
Length = 128
Score = 79.7 bits (195), Expect = 4e-14, Method: Composition-based stats.
Identities = 44/129 (34%), Positives = 72/129 (55%), Gaps = 2/129 (1%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
M K K+ EY + L+ YL++ +GL+EWFA DV +E DFYF W+ EKA++
Sbjct: 1 MAKTKVHFEYPM-HCQSEILYEYLASAEGLAEWFADDV-VEKGDDFYFSWNGGEPEKATM 58
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
+ + +R++W ++ FE I E+T SL V D+ + +++ +W+NLI
Sbjct: 59 IRYKPESFVRYRWEADEGTKNFFELTIVIDEITNDLSLNVTDFADEGDEEEVQQYWDNLI 118
Query: 140 SRLRLSLGA 148
L++ LGA
Sbjct: 119 ENLQIKLGA 127
>gi|149372189|ref|ZP_01891459.1| hypothetical protein SCB49_00675 [unidentified eubacterium SCB49]
gi|149354956|gb|EDM43518.1| hypothetical protein SCB49_00675 [unidentified eubacterium SCB49]
Length = 128
Score = 77.4 bits (189), Expect = 2e-13, Method: Composition-based stats.
Identities = 50/128 (39%), Positives = 75/128 (58%), Gaps = 5/128 (3%)
Query: 22 KNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASVL 80
K K ++E+V VS L+ Y+STP GLSEW+A +V G+F+ F W+ S EKA +L
Sbjct: 4 KIKYEMEFVI-YVSPAMLYNYISTPSGLSEWYADNV--NSRGEFFTFIWE-GSEEKAKLL 59
Query: 81 EQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLIS 140
++ RI+FQW ++D FE I E+T SL +ID+ +EV + + W N+I
Sbjct: 60 SKKSPHRIKFQWMEDEDTEYFFELRIQVDEITKDVSLMIIDFAEEDEVEEGKLLWENMIG 119
Query: 141 RLRLSLGA 148
L+ LG+
Sbjct: 120 NLKQILGS 127
>gi|156862767|gb|EDO56198.1| hypothetical protein BACUNI_00138 [Bacteroides uniformis ATCC 8492]
Length = 128
Score = 75.5 bits (184), Expect = 9e-13, Method: Composition-based stats.
Identities = 43/128 (33%), Positives = 67/128 (52%), Gaps = 1/128 (0%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
ME+ K+ LEY+ + S LW +STP GL WFA V+ +D +F W K A +
Sbjct: 1 MERKKIHLEYLLNATSKSILWAAISTPTGLEGWFADRVQSDDKTVTFF-WGKTEKRDAEI 59
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
+ R IRF+W ++++ FE + ++ELT LE+ D+ +EV D W + +
Sbjct: 60 IAVRAYSFIRFRWLDDENEREYFELKMTNNELTNDFVLEITDFADIDEVGDSRELWESQV 119
Query: 140 SRLRLSLG 147
LR + G
Sbjct: 120 DTLRRTCG 127
>gi|89891492|ref|ZP_01202997.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
gi|89516266|gb|EAS18928.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
Length = 129
Score = 74.7 bits (182), Expect = 1e-12, Method: Composition-based stats.
Identities = 52/132 (39%), Positives = 75/132 (56%), Gaps = 6/132 (4%)
Query: 19 IMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKA 77
+ E K LE+ VS L++Y++TP G+SEWFA +V G+++ F WD S EKA
Sbjct: 1 MQEPIKFNLEFPI-HVSPALLYQYIATPSGMSEWFADNV--NSRGEYFRFIWD-GSEEKA 56
Query: 78 SVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPE-EVADMEHFWN 136
++++ GER RFQW ++D FE I E+T SL + D+ E EV D + FW
Sbjct: 57 KIVKRVSGERARFQWDYDEDTKKYFEMSIQVDEITKDVSLMISDFGDDEDEVEDQKMFWE 116
Query: 137 NLISRLRLSLGA 148
N I L+ LG+
Sbjct: 117 NQIGELKKVLGS 128
>gi|120435794|ref|YP_861480.1| hypothetical protein GFO_1440 [Gramella forsetii KT0803]
gi|117577944|emb|CAL66413.1| conserved hypothetical protein [Gramella forsetii KT0803]
Length = 128
Score = 72.4 bits (176), Expect = 7e-12, Method: Composition-based stats.
Identities = 46/129 (35%), Positives = 77/129 (59%), Gaps = 5/129 (3%)
Query: 21 EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASV 79
EK K ++E+ + S L++Y+STP GLSEW+A +V G+F+ F W+ S EKA +
Sbjct: 3 EKIKYEMEFPI-QASPSLLYQYISTPSGLSEWYADNV--NSRGEFFTFIWE-GSEEKAKL 58
Query: 80 LEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLI 139
+ ++ ERI+F+W ++D FE I ++T SL + D+ +EV + + W N++
Sbjct: 59 VSKKSDERIKFKWTDDEDTPYFFELRIQVDDITKDVSLMITDFAEDDEVDEGKMLWENMV 118
Query: 140 SRLRLSLGA 148
S L+ LG+
Sbjct: 119 SDLKQILGS 127
>gi|146297908|ref|YP_001192499.1| hypothetical protein Fjoh_0142 [Flavobacterium johnsoniae UW101]
gi|146152326|gb|ABQ03180.1| hypothetical protein Fjoh_0142 [Flavobacterium johnsoniae UW101]
Length = 130
Score = 72.4 bits (176), Expect = 7e-12, Method: Composition-based stats.
Identities = 46/117 (39%), Positives = 71/117 (60%), Gaps = 6/117 (5%)
Query: 35 STRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASVLEQREGERIRFQWH 93
S + L++Y+STP GLSEWFA +V G+F+ F W+ +S EKA + ++ GE+++F+W
Sbjct: 16 SPQLLYQYISTPSGLSEWFADNV--NSRGEFFTFIWN-DSQEKARLASKKSGEKVKFKWV 72
Query: 94 RE--QDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLISRLRLSLGA 148
E +D FE I ELT SL V+D+ EE+ + + W N IS L+ +G+
Sbjct: 73 DESSKDTEYFFELHILVDELTKDVSLMVVDFAEKEEIGEAKQLWENQISDLKHLIGS 129
>gi|150026044|ref|YP_001296870.1| hypothetical protein FP2005 [Flavobacterium psychrophilum JIP02/86]
gi|149772585|emb|CAL44068.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 130
Score = 72.4 bits (176), Expect = 8e-12, Method: Composition-based stats.
Identities = 48/131 (36%), Positives = 77/131 (58%), Gaps = 7/131 (5%)
Query: 22 KNKLQLEYVFD-RVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASV 79
+NK++ E F S + L++Y+STP GL EWFA +V G+FY F W+ +S EKA +
Sbjct: 2 QNKIRYELEFPINSSPQLLYQYISTPSGLQEWFADNV--NSRGEFYTFIWN-DSEEKARL 58
Query: 80 LEQREGERIRFQW--HREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
++ GE+I+F+W + +D FE I E+T SL ++DY P ++ + + W N
Sbjct: 59 YSKKTGEKIKFKWMDNDNKDTEYYFELKILVDEITKDVSLMIVDYAEPNDIQESKLLWEN 118
Query: 138 LISRLRLSLGA 148
IS L+ +G+
Sbjct: 119 QISDLKHVIGS 129
>gi|88714202|ref|ZP_01108278.1| hypothetical protein FB2170_10364 [Flavobacteriales bacterium
HTCC2170]
gi|88707465|gb|EAQ99709.1| hypothetical protein FB2170_10364 [Flavobacteriales bacterium
HTCC2170]
Length = 173
Score = 68.6 bits (166), Expect = 1e-10, Method: Composition-based stats.
Identities = 46/131 (35%), Positives = 76/131 (58%), Gaps = 4/131 (3%)
Query: 18 LIMEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKA 77
++ +K K ++E+V + S + L++YLSTP GLSEWFA +V F F WD + E A
Sbjct: 46 VMSDKIKFEIEFVI-QSSPQLLYQYLSTPSGLSEWFADNVNSRGE-KFSFIWD-GTEEDA 102
Query: 78 SVLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
+L+++ E ++F W +D +A FE I E+T SL + D+ +E+ + + W N
Sbjct: 103 ILLKKKSDEFVKFAWEEVED-DAYFEMKIIVDEITKDVSLFITDFAEEDELDEAKMLWEN 161
Query: 138 LISRLRLSLGA 148
I+ L+ LG+
Sbjct: 162 QITDLKHVLGS 172
>gi|126663446|ref|ZP_01734443.1| hypothetical protein FBBAL38_08859 [Flavobacteria bacterium BAL38]
gi|126624394|gb|EAZ95085.1| hypothetical protein FBBAL38_08859 [Flavobacteria bacterium BAL38]
Length = 130
Score = 67.8 bits (164), Expect = 2e-10, Method: Composition-based stats.
Identities = 48/130 (36%), Positives = 77/130 (59%), Gaps = 7/130 (5%)
Query: 22 KNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASVL 80
K K +LE+ + S + L++Y+STP GLSEWFA +V G+F+ F W+ +S EKA +
Sbjct: 4 KVKYELEFPI-QSSPQLLYQYISTPSGLSEWFADNV--NSRGEFFTFIWN-DSEEKAKLA 59
Query: 81 EQREGERIRFQWHREQ--DQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNL 138
++ GERI+F+W E + + FE I E+T S+ + D+ +E+ + + W N
Sbjct: 60 SKKSGERIKFRWLEEDNTETDYFFEIKIMEDEITKDVSIVISDFAHEDELDESKLLWENQ 119
Query: 139 ISRLRLSLGA 148
IS L+ LG+
Sbjct: 120 ISDLKHVLGS 129
>gi|86141621|ref|ZP_01060167.1| hypothetical protein MED217_06367 [Flavobacterium sp. MED217]
gi|85832180|gb|EAQ50635.1| hypothetical protein MED217_06367 [Leeuwenhoekiella blandensis
MED217]
Length = 130
Score = 66.6 bits (161), Expect = 5e-10, Method: Composition-based stats.
Identities = 48/131 (36%), Positives = 79/131 (60%), Gaps = 7/131 (5%)
Query: 21 EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKASV 79
EK K ++E+ + S L++Y+S+P G+SEWFA +V G++Y F W+ S EKA +
Sbjct: 3 EKVKFEIEFPI-QASPSLLYQYMSSPSGMSEWFADNV--NSRGEYYTFIWN-GSEEKAKL 58
Query: 80 LEQREGERIRFQWHRE-QDQNA-CFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
L ++ ERI+F+W + +D+N FE I E+T SL V D+ +E+ + + FW N
Sbjct: 59 LGKKSDERIKFRWMDDVEDENLYFFELRIVVDEITKDVSLIVTDFAEEDEIDEGKMFWEN 118
Query: 138 LISRLRLSLGA 148
I+ L+ +G+
Sbjct: 119 QIAELKHIIGS 129
>gi|91214899|ref|ZP_01251872.1| hypothetical protein P700755_18579 [Psychroflexus torquis ATCC
700755]
gi|91187326|gb|EAS73696.1| hypothetical protein P700755_18579 [Psychroflexus torquis ATCC
700755]
Length = 139
Score = 65.5 bits (158), Expect = 1e-09, Method: Composition-based stats.
Identities = 45/131 (34%), Positives = 77/131 (58%), Gaps = 7/131 (5%)
Query: 21 EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGD-FYFQWDKNSGEKASV 79
+K K ++E+ +VS L++Y+STP GLSEWFA +V G+ F F W+ S E+A +
Sbjct: 12 DKVKYEMEFPI-QVSPSLLYQYISTPSGLSEWFADNV--NSRGELFTFMWE-GSEEEAKL 67
Query: 80 LEQREGERIRFQWHREQDQ--NACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
L ++ +RI+ +W ++++ N FE I E+T SL + D+ E + + W+N
Sbjct: 68 LSKKADDRIKLRWTEDEEESLNYFFEMKIQVDEITKDVSLMITDFAEEGEEDEGKMLWDN 127
Query: 138 LISRLRLSLGA 148
++S L+ LG+
Sbjct: 128 MVSDLKQILGS 138
>gi|88803471|ref|ZP_01118997.1| hypothetical protein PI23P_12802 [Polaribacter irgensii 23-P]
gi|88781037|gb|EAR12216.1| hypothetical protein PI23P_12802 [Polaribacter irgensii 23-P]
Length = 127
Score = 65.1 bits (157), Expect = 1e-09, Method: Composition-based stats.
Identities = 50/131 (38%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKAS 78
MEK K +LE V S L++Y+STP L EWFA V G Y F W+ E A
Sbjct: 1 MEKVKFELE-VPIHASPNMLYQYISTPSNLQEWFADKV--NSRGKIYSFVWEGEE-ELAE 56
Query: 79 VLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIA-PEEVADMEHFWNN 137
++ ++ GERIR++W +D + FE I LT +L V DY +EV + + W N
Sbjct: 57 LITKKAGERIRWKWLESEDDESYFEIRIDLDALTKDVTLVVTDYAEDADEVEESKQLWEN 116
Query: 138 LISRLRLSLGA 148
I LR ++GA
Sbjct: 117 QIEELRHTIGA 127
>gi|86133081|ref|ZP_01051663.1| hypothetical protein MED152_00215 [Tenacibaculum sp. MED152]
gi|85819944|gb|EAQ41091.1| hypothetical protein MED152_00215 [Polaribacter dokdonensis MED152]
Length = 127
Score = 64.7 bits (156), Expect = 1e-09, Method: Composition-based stats.
Identities = 46/131 (35%), Positives = 71/131 (54%), Gaps = 6/131 (4%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFY-FQWDKNSGEKAS 78
M+K K ++E + S L++Y+S+P L EWFA V G Y F WD + E+A
Sbjct: 1 MDKVKYEIE-IPVHASPNMLYQYISSPSNLQEWFADTV--NSRGKIYTFSWD-GTEEQAE 56
Query: 79 VLEQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPE-EVADMEHFWNN 137
++ ++ ERIRF+W +D ++ FE I LT SL + D+ E EV + + W N
Sbjct: 57 LVTKKSEERIRFKWLESEDDDSFFEIKIQVDPLTKDVSLIITDFADDEDEVEEAKQLWEN 116
Query: 138 LISRLRLSLGA 148
I L+ ++GA
Sbjct: 117 QIDELKHTIGA 127
>gi|88805384|ref|ZP_01120903.1| hypothetical protein RB2501_13629 [Robiginitalea biformata
HTCC2501]
gi|88784202|gb|EAR15372.1| hypothetical protein RB2501_13629 [Robiginitalea biformata
HTCC2501]
Length = 127
Score = 64.7 bits (156), Expect = 1e-09, Method: Composition-based stats.
Identities = 45/128 (35%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 21 EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASVL 80
EK K ++E+V + S + L++YL+TP GLSEWFA +V F F W+ + E A +L
Sbjct: 3 EKTKFEIEFVI-QASPQLLFQYLATPSGLSEWFADNVNSRGE-RFTFIWE-GTEEVARLL 59
Query: 81 EQREGERIRFQWHREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLIS 140
+++ E +RF W D ++ FE I E+T SL + D+ +E+ + + W N +S
Sbjct: 60 KRKTDEFVRFAWEY-NDDDSYFEMRIIVDEITKDVSLFITDFAEEDELEEAKMLWTNQVS 118
Query: 141 RLRLSLGA 148
L+ LG+
Sbjct: 119 DLKQILGS 126
>gi|83857050|ref|ZP_00950578.1| hypothetical protein CA2559_09638 [Croceibacter atlanticus
HTCC2559]
gi|83848417|gb|EAP86286.1| hypothetical protein CA2559_09638 [Croceibacter atlanticus
HTCC2559]
Length = 131
Score = 62.8 bits (151), Expect = 6e-09, Method: Composition-based stats.
Identities = 47/132 (35%), Positives = 77/132 (58%), Gaps = 8/132 (6%)
Query: 21 EKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGD-FYFQWDKNSGEKASV 79
++ K ++E+V + S L+ YLSTP GLSEW+A +V G+ F F W+ S E+A +
Sbjct: 3 DRIKYEMEFVI-QSSPSLLYNYLSTPSGLSEWYADNV--NSRGELFTFIWE-GSEEQAKL 58
Query: 80 LEQREGERIRFQWHREQD--QNACFEFVIHHSELTGGKSLEVIDYIAPE-EVADMEHFWN 136
L ++ ER++F+W + D ++ FE I E+T SL + D+ E EV + + W
Sbjct: 59 LSKKTDERVKFRWMDDVDNEESYFFELRIQVDEITKDVSLMITDFADDEDEVEEGKMLWE 118
Query: 137 NLISRLRLSLGA 148
N++S L+ LG+
Sbjct: 119 NMVSNLKQVLGS 130
>gi|126645441|ref|ZP_01717985.1| hypothetical protein ALPR1_12385 [Algoriphagus sp. PR1]
gi|126578852|gb|EAZ83016.1| hypothetical protein ALPR1_12385 [Algoriphagus sp. PR1]
Length = 131
Score = 60.5 bits (145), Expect = 3e-08, Method: Composition-based stats.
Identities = 40/133 (30%), Positives = 70/133 (52%), Gaps = 6/133 (4%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASV 79
M KNK +Y + S + +++YLST GL EWFA +VR+ ++ +F F +D N A +
Sbjct: 1 MVKNKFVADYQIN-ASKKIVFQYLSTASGLEEWFADEVRINEDKNFIFNFD-NEDHYARL 58
Query: 80 LEQREGERIRFQWHREQD----QNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFW 135
R ++F+++ ++ N+ EF + +ELT L+VIDY + ++ W
Sbjct: 59 ASIRTNSHVKFEFYDPKNPDASDNSYIEFKLEENELTQTLFLKVIDYSDGYDDEELIAIW 118
Query: 136 NNLISRLRLSLGA 148
L+ L+ +G
Sbjct: 119 GGLVGSLKEIIGG 131
>gi|124006212|ref|ZP_01691047.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123988136|gb|EAY27794.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length = 129
Score = 57.8 bits (138), Expect = 2e-07, Method: Composition-based stats.
Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 4/131 (3%)
Query: 20 MEKNKLQLEYVFDRVSTRSLWRYLSTPDGLSEWFAYDVRLEDNGD-FYFQWDKNSGEKAS 78
M K K EY S + ++ Y+ T +GL +WFA V ++ F F WD N A
Sbjct: 1 MTKYKFSSEYEI-HASPKRIYSYVHTANGLEQWFATKVEVKGKEKVFNFVWD-NEDHFAR 58
Query: 79 VLEQREGERIRFQW-HREQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNN 137
++ R + ++F++ E+ ++ EF + H+E+T L+V DY E+ D+ W+
Sbjct: 59 IVAHRTNKLVKFEFISDEEGKSNYIEFRLEHNEMTDSTFLKVTDYSEMEDEDDLNGLWDG 118
Query: 138 LISRLRLSLGA 148
L++ LR +G
Sbjct: 119 LVADLREVVGG 129
>gi|94502325|ref|ZP_01308799.1| Hypothetical protein SMU_153 [Candidatus Sulcia muelleri str. Hc
(Homalodisca coagulata)]
gi|94451121|gb|EAT14072.1| Hypothetical protein SMU_153 [Candidatus Sulcia muelleri str. Hc
(Homalodisca coagulata)]
Length = 125
Score = 56.6 bits (135), Expect = 4e-07, Method: Composition-based stats.
Identities = 32/109 (29%), Positives = 58/109 (53%), Gaps = 2/109 (1%)
Query: 35 STRSLWRYLSTPDGLSEWFAYDVRLEDNGDFYFQWDKNSGEKASVLEQREGERIRFQWHR 94
ST L+ Y++ P LSEWFA +V + + F W N E +++++ + +R++W +
Sbjct: 14 STELLYYYITAPVKLSEWFADNV-ISIGNRYIFTW-YNYDETCFLIKKKPYQYVRYKWEQ 71
Query: 95 EQDQNACFEFVIHHSELTGGKSLEVIDYIAPEEVADMEHFWNNLISRLR 143
+ D FEF I + LT L++ D+ E+ + +W N I +L+
Sbjct: 72 DVDNKYYFEFFIKKNNLTKNVYLKITDFAIKTEIKKSKMWWKNRIKKLK 120
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.322 0.139 0.442
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 600,300,352
Number of Sequences: 5470121
Number of extensions: 24457775
Number of successful extensions: 48766
Number of sequences better than 1.0e-05: 28
Number of HSP's better than 0.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 48717
Number of HSP's gapped (non-prelim): 28
length of query: 150
length of database: 1,894,087,724
effective HSP length: 113
effective length of query: 37
effective length of database: 1,275,964,051
effective search space: 47210669887
effective search space used: 47210669887
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 124 (52.4 bits)