BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_0907 hypothetical protein
(146 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34540799|ref|NP_905278.1| transcriptional regulator, put... 303 3e-81
gi|150007756|ref|YP_001302499.1| hypothetical protein BDI_1... 93 5e-18
gi|154492157|ref|ZP_02031783.1| hypothetical protein PARMER... 90 4e-17
gi|150005815|ref|YP_001300559.1| hypothetical protein BVU_3... 82 1e-14
gi|53713689|ref|YP_099681.1| hypothetical protein BF2398 [B... 81 2e-14
gi|120435838|ref|YP_861524.1| HTH_3 family transcriptional ... 81 2e-14
gi|160884623|ref|ZP_02065626.1| hypothetical protein BACOVA... 80 5e-14
gi|29346300|ref|NP_809803.1| hypothetical protein BT_0890 [... 80 6e-14
gi|86133482|ref|ZP_01052064.1| transcriptional regulator, p... 77 4e-13
gi|146300553|ref|YP_001195144.1| bacteriophage CI repressor... 75 1e-12
gi|153806310|ref|ZP_01958978.1| hypothetical protein BACCAC... 75 1e-12
gi|83857210|ref|ZP_00950738.1| transcriptional regulator, p... 75 1e-12
gi|86143376|ref|ZP_01061778.1| transcriptional regulator, p... 74 4e-12
gi|150024931|ref|YP_001295757.1| Putative transcriptional r... 74 4e-12
gi|126664248|ref|ZP_01735240.1| putative DNA-binding protei... 72 1e-11
gi|88712430|ref|ZP_01106516.1| hypothetical protein FB2170_... 72 1e-11
gi|110638108|ref|YP_678317.1| conserved hypothetical protei... 72 1e-11
gi|163755564|ref|ZP_02162683.1| hypothetical protein KAOT1_... 72 1e-11
gi|91217684|ref|ZP_01254641.1| transcriptional regulator, p... 71 2e-11
gi|163789015|ref|ZP_02183459.1| hypothetical protein FBALC1... 71 3e-11
gi|88802945|ref|ZP_01118472.1| putative DNA-binding protein... 70 5e-11
gi|160889582|ref|ZP_02070585.1| hypothetical protein BACUNI... 70 5e-11
gi|167763094|ref|ZP_02435221.1| hypothetical protein BACSTE... 68 2e-10
gi|88805716|ref|ZP_01121235.1| putative DNA-binding protein... 66 7e-10
gi|89891243|ref|ZP_01202750.1| conserved hypothetical prote... 64 2e-09
gi|149370471|ref|ZP_01890160.1| putative DNA-binding protei... 64 3e-09
gi|86132990|ref|ZP_01051580.1| putative DNA-binding protein... 56 8e-07
>gi|34540799|ref|NP_905278.1| transcriptional regulator, putative [Porphyromonas gingivalis W83]
gi|34397113|gb|AAQ66177.1| transcriptional regulator, putative [Porphyromonas gingivalis W83]
Length = 146
Score = 303 bits (775), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 145/146 (99%), Positives = 145/146 (99%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE
Sbjct: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
Query: 61 WSAEWIIFGKGPQQSNKTAFENDLFSYSSVPEQELHTRTKDNSGIQSPSGRLEPEIKIVQ 120
WSAEWIIFGKGPQQSNKTAFENDLFSYSSVPEQELHTRTKDN GIQSPSGRLEPEIKIVQ
Sbjct: 61 WSAEWIIFGKGPQQSNKTAFENDLFSYSSVPEQELHTRTKDNCGIQSPSGRLEPEIKIVQ 120
Query: 121 MPAKQIERITVFYTDGSYQEFRSNKE 146
MPAKQIERITVFYTDGSYQEFRSNKE
Sbjct: 121 MPAKQIERITVFYTDGSYQEFRSNKE 146
>gi|150007756|ref|YP_001302499.1| hypothetical protein BDI_1114 [Parabacteroides distasonis ATCC
8503]
gi|149936180|gb|ABR42877.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 151
Score = 93.2 bits (230), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 84/142 (59%), Gaps = 6/142 (4%)
Query: 10 RILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAEWIIFG 69
RIL +ME+ G+TPS FA+ GI RSA++HI SGRN +LD++ KI++ F ++W++FG
Sbjct: 6 RILKIMEREGLTPSKFAESIGIQRSAMSHIISGRNNPSLDVLLKILERFTYVDSDWLLFG 65
Query: 70 KGPQQSNKTAFENDLFS--YSSVPEQELHTRTKDNSGIQSPSG-RLEPEIKIV---QMPA 123
KG E +LF+ + P ++ + G+++P + +P ++ V + P+
Sbjct: 66 KGEMIREHVLTEPNLFTNMLENRPNVQVVAENRKEIGVETPVNIQKQPVVEQVICQEKPS 125
Query: 124 KQIERITVFYTDGSYQEFRSNK 145
K + +I +FY+D ++ F K
Sbjct: 126 KNVSKIMIFYSDNTFDTFVPEK 147
>gi|154492157|ref|ZP_02031783.1| hypothetical protein PARMER_01789 [Parabacteroides merdae ATCC
43184]
gi|154087382|gb|EDN86427.1| hypothetical protein PARMER_01789 [Parabacteroides merdae ATCC
43184]
Length = 149
Score = 90.1 bits (222), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 84/142 (59%), Gaps = 6/142 (4%)
Query: 10 RILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAEWIIFG 69
RI +ME+ G+TPS FA+ GI RSA++HI +GRN V+LD++ KI+ F ++W++FG
Sbjct: 4 RITQIMEREGLTPSKFAEAIGIQRSAMSHILNGRNNVSLDVLIKILDRFTYVDSDWLLFG 63
Query: 70 KGPQQSNKTAFENDLFSYSSVPEQ--ELHTRTKDNSGIQSPSGRLEPEI--KIVQ--MPA 123
KG + + + DLFS +++ + + + +P ++P + +I+Q
Sbjct: 64 KGEMVRDHMSTQPDLFSNTAINPSGGQAALEYRKEMRVDTPVNTVKPPVVEQIIQQETTT 123
Query: 124 KQIERITVFYTDGSYQEFRSNK 145
+++ +I VFY+D ++ F S K
Sbjct: 124 RKVSKIMVFYSDNTFDTFVSEK 145
>gi|150005815|ref|YP_001300559.1| hypothetical protein BVU_3311 [Bacteroides vulgatus ATCC 8482]
gi|149934239|gb|ABR40937.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 137
Score = 82.0 bits (201), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 76/134 (56%), Gaps = 11/134 (8%)
Query: 20 MTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAEWIIFGKGPQQS---- 75
MT + FA++ GIS S+L+HI SGRN +L+++ KI + D S +W+++G+G ++
Sbjct: 1 MTAAQFAEKIGISPSSLSHILSGRNNPSLEVVMKIHKACDYISLDWLLYGEGQMETDVDS 60
Query: 76 -NKTAF------ENDLFSYSSVPEQELHTRTKDNSGIQSPSGRLEPEIKIVQMPAKQIER 128
N + EN LF+ E + + + SP + EIK V++PAK+I
Sbjct: 61 DNNIHYTPSLFDENSLFTPERPASPEYRKENEVKTPLYSPKEIVREEIKYVEVPAKKITE 120
Query: 129 ITVFYTDGSYQEFR 142
I +F+ +G+Y+ F+
Sbjct: 121 IRIFFDNGTYETFK 134
>gi|53713689|ref|YP_099681.1| hypothetical protein BF2398 [Bacteroides fragilis YCH46]
gi|60681961|ref|YP_212105.1| putative DNA-binding protein [Bacteroides fragilis NCTC 9343]
gi|52216554|dbj|BAD49147.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60493395|emb|CAH08181.1| putative DNA-binding protein [Bacteroides fragilis NCTC 9343]
Length = 151
Score = 81.3 bits (199), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 89/153 (58%), Gaps = 18/153 (11%)
Query: 7 IKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAEWI 66
IK RI +ME+ M AFA+ GI +S L+HI +GRN +LD+I K+ Q ++ EW+
Sbjct: 3 IKDRIKIIMEKENMASGAFAESIGIQQSTLSHILNGRNNPSLDVIMKVHQKYNYVKLEWL 62
Query: 67 IFGKG--PQQSNKTA--FENDLFSYSSV--------PE--QELHTRTKDNSGIQSPSGRL 112
++G+G ++S ++A F+ LF+ +++ PE +E+ + N +P +
Sbjct: 63 LYGQGNISEESIQSASDFQPSLFAENAIIPPNGTVTPENRREMPLESSQN----TPKEIV 118
Query: 113 EPEIKIVQMPAKQIERITVFYTDGSYQEFRSNK 145
+ EI+ ++ P+++I I +F+ D +Y+ FR K
Sbjct: 119 KQEIRYIEKPSRKITEIRIFFDDNTYETFRGEK 151
>gi|120435838|ref|YP_861524.1| HTH_3 family transcriptional regulator protein [Gramella forsetii
KT0803]
gi|117577988|emb|CAL66457.1| HTH_3 family transcriptional regulator protein [Gramella forsetii
KT0803]
Length = 140
Score = 80.9 bits (198), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 80/148 (54%), Gaps = 17/148 (11%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M++ + R+ ++E ++ ++FAD+ + RS+++HI SGRNK +LD + K+++ F E
Sbjct: 1 MVNTEKFSSRLNKILEYYDISAASFADKIEVGRSSISHILSGRNKPSLDFVMKVVKNFPE 60
Query: 61 WSAEWIIFGKG-----PQQSNKTAFEN-DLFSYSSVPEQELH-TRTKDNSGIQSPSGRLE 113
W++ GKG P+ +KT EN + SSVP +E + T P R
Sbjct: 61 VELYWLLNGKGKFPSTPESKSKTQSENREQEKTSSVPTRENQISETIKEVPTSFPPNR-- 118
Query: 114 PEIKIVQMPAKQIERITVFYTDGSYQEF 141
KQI++I +FYTDGS++ F
Sbjct: 119 --------SGKQIQKIVIFYTDGSFEAF 138
>gi|160884623|ref|ZP_02065626.1| hypothetical protein BACOVA_02612 [Bacteroides ovatus ATCC 8483]
gi|156110362|gb|EDO12107.1| hypothetical protein BACOVA_02612 [Bacteroides ovatus ATCC 8483]
Length = 156
Score = 79.7 bits (195), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 83/156 (53%), Gaps = 19/156 (12%)
Query: 7 IKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAEWI 66
IK RI +ME+ + P FA+ G+ +S L+HI + RNK +L+++ K+ Q +D + EW+
Sbjct: 3 IKDRIRMIMEREKVPPRVFAETIGVQQSTLSHILNDRNKPSLEVVMKVHQKYDYVNLEWL 62
Query: 67 IFGKGPQQSNK--TAFENDLFSY---------------SSVPEQELHTRTKDNSGIQSPS 109
++GKG ++ T+F + Y ++PE T ++ +P
Sbjct: 63 LYGKGEMMVSEEGTSFSSSNHDYLPSLFDENPVNPSKEPTLPENRKETPLRNAEN--APK 120
Query: 110 GRLEPEIKIVQMPAKQIERITVFYTDGSYQEFRSNK 145
++ EI+ ++ PA++I I +F+ D +Y+ FR K
Sbjct: 121 EIVKQEIRYIEKPARKITEIRIFFDDNTYETFRPEK 156
>gi|29346300|ref|NP_809803.1| hypothetical protein BT_0890 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29338195|gb|AAO75997.1| putative DNA-binding protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 156
Score = 79.7 bits (195), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 85/158 (53%), Gaps = 23/158 (14%)
Query: 7 IKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAEWI 66
IK RI +ME+ + P FA+ G+ +S L+HI + RNK +L+++ K+ Q + + EW+
Sbjct: 3 IKDRIRMIMEREKVPPRVFAETIGVQQSTLSHILNDRNKPSLEVVMKVHQTYSYVNLEWL 62
Query: 67 IFGKGPQ---------QSNKTAFENDLFSYSSV--------PE--QELHTRTKDNSGIQS 107
++GKG S+ ++ LF + V PE +E+ RT +N +
Sbjct: 63 LYGKGEMITSAEDASTVSSNGDYQPSLFDENPVNPSKETINPENRKEMALRTAEN----A 118
Query: 108 PSGRLEPEIKIVQMPAKQIERITVFYTDGSYQEFRSNK 145
P ++ EI+ ++ PA++I I +F+ D +Y+ FR K
Sbjct: 119 PKEIVKQEIRYIEKPARKITEIRIFFDDNTYETFRPEK 156
>gi|86133482|ref|ZP_01052064.1| transcriptional regulator, putative [Tenacibaculum sp. MED152]
gi|85820345|gb|EAQ41492.1| transcriptional regulator, putative [Polaribacter dokdonensis
MED152]
Length = 114
Score = 77.0 bits (188), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 69/141 (48%), Gaps = 29/141 (20%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M++ A R+ +ME + S+FAD+ G+ RS+++HI SGRNK +LD + KI F++
Sbjct: 1 MLNTVAFINRLKQIMEHHQFSASSFADKVGVQRSSISHILSGRNKPSLDFVLKINTVFND 60
Query: 61 WSAEWIIFGKGPQQSNKTAFENDLFSYSSVPEQELHTRTKDNSGIQSPSGRLEPEIKIVQ 120
EW++ G G ++ F N SG IK
Sbjct: 61 LDLEWLLNGTGSYPKSQDEFSN--------------------------SG---TTIKNTS 91
Query: 121 MPAKQIERITVFYTDGSYQEF 141
K+I+RI FYTDG+++EF
Sbjct: 92 KSKKEIQRIVTFYTDGTFEEF 112
>gi|146300553|ref|YP_001195144.1| bacteriophage CI repressor [Flavobacterium johnsoniae UW101]
gi|146154971|gb|ABQ05825.1| bacteriophage CI repressor [Flavobacterium johnsoniae UW101]
Length = 155
Score = 75.5 bits (184), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 79/165 (47%), Gaps = 36/165 (21%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M++ D R+ +++ G+ S+FAD+ G+ RS+++H+ SGRNK +LD + KI+ F E
Sbjct: 1 MVNIDDFVKRLEIILDYYGLNASSFADKIGVQRSSMSHLLSGRNKPSLDFVMKILDVFPE 60
Query: 61 WSAEWIIFGKG-----------------------PQQSNKTAFENDLFSYSSVPEQELHT 97
W++ GKG P SN+ E DLFS +LH
Sbjct: 61 VDLYWMLLGKGNFPKSENETLKFEPKSDIKSEDSPASSNENHSEIDLFS-------QLHY 113
Query: 98 RTKDNSGIQSP-SGRLEPEIKIVQMPAKQIERITVFYTDGSYQEF 141
+ ++P EP+ + +IE+I FY +G+++ +
Sbjct: 114 EEE-----KTPVKNYAEPKRSNISFEEDEIEKIVFFYKNGTFKAY 153
>gi|153806310|ref|ZP_01958978.1| hypothetical protein BACCAC_00569 [Bacteroides caccae ATCC 43185]
gi|149130987|gb|EDM22193.1| hypothetical protein BACCAC_00569 [Bacteroides caccae ATCC 43185]
Length = 166
Score = 75.1 bits (183), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 85/156 (54%), Gaps = 19/156 (12%)
Query: 7 IKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAEWI 66
IK RI +ME+ + P FA+ G+ +S L+HI + RNK +L+++ K+ Q ++ + EW+
Sbjct: 13 IKDRIRMIMEREKVPPRVFAETIGVQQSTLSHILNDRNKPSLEVVMKVHQTYNYVNLEWL 72
Query: 67 IFGKGPQQSNKTA---------FENDLFSYSSV-PEQELHTRTKDN-------SGIQSPS 109
++G+G ++ ++ LF + V P +E T T +N S +P
Sbjct: 73 LYGRGEMLASAEESSLASSNGDYQPSLFDENPVNPSKE--TITLENRKEMALRSTGNTPK 130
Query: 110 GRLEPEIKIVQMPAKQIERITVFYTDGSYQEFRSNK 145
++ EI+ ++ PA++I I +F+ D +Y+ FR K
Sbjct: 131 EIVKQEIRYIEKPARKITEIRIFFDDNTYETFRPEK 166
>gi|83857210|ref|ZP_00950738.1| transcriptional regulator, putative [Croceibacter atlanticus
HTCC2559]
gi|83848577|gb|EAP86446.1| transcriptional regulator, putative [Croceibacter atlanticus
HTCC2559]
Length = 147
Score = 75.1 bits (183), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 82/151 (54%), Gaps = 12/151 (7%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M+++ R+ + E ++ S+FAD+ + RS+++HI SGRNK +LD + K++ FDE
Sbjct: 1 MVNSTEFTKRLEKIFEYYDLSASSFADKIEVGRSSISHIVSGRNKPSLDFVMKVVSTFDE 60
Query: 61 WSAEWIIFGKGP-QQSNKTAFENDLFS-------YSSVPEQELHTRTKDNSGIQSPSGRL 112
W++ GKG +SN T + + + P Q+L + + +S ++ +
Sbjct: 61 VDLYWLLNGKGTFPKSNTTVQSSSPPRPQSSFSDFDTSPTQDLFSDAEHHS--KAIENHI 118
Query: 113 EPEIKIVQMPAKQIERITVFYTDGSYQEFRS 143
P+ + V+ A I +I V YTDGS++ F +
Sbjct: 119 TPKPRNVKSNA--IAKIIVLYTDGSFEAFEN 147
>gi|86143376|ref|ZP_01061778.1| transcriptional regulator, putative [Flavobacterium sp. MED217]
gi|85830281|gb|EAQ48741.1| transcriptional regulator, putative [Leeuwenhoekiella blandensis
MED217]
Length = 142
Score = 73.6 bits (179), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 77/146 (52%), Gaps = 11/146 (7%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M++++ R+ +M++ + ++FAD + RS+++HI SGRNK +L+ + KI++ F E
Sbjct: 1 MVNSELFAKRLQKVMDEYDLNATSFADAIDVGRSSISHIISGRNKPSLEFVMKIIEAFPE 60
Query: 61 WSAEWIIFGKG--PQQSNKTAFENDLFSYSSVPEQELHTRTKDNSGIQ---SPSGRLEPE 115
W++ GKG P+++ T +Y Q K ++ Q + S +
Sbjct: 61 VELYWLLNGKGSFPKKARPTP------NYKPASSQLAANFDKPSTPAQEDLNSSSEFTQK 114
Query: 116 IKIVQMPAKQIERITVFYTDGSYQEF 141
+ K+IERI +FY DG+++ F
Sbjct: 115 KPAINSDIKEIERIVIFYNDGTFKSF 140
>gi|150024931|ref|YP_001295757.1| Putative transcriptional regulator, XRE family [Flavobacterium
psychrophilum JIP02/86]
gi|149771472|emb|CAL42941.1| Putative transcriptional regulator, XRE family [Flavobacterium
psychrophilum JIP02/86]
Length = 159
Score = 73.6 bits (179), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 80/157 (50%), Gaps = 17/157 (10%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M++ D R+ +++ G++ S FAD+ G+ RS+L+H+ SGRNK +LD+I KI + F E
Sbjct: 1 MVNIDDFIKRLEIILDYYGLSASGFADKVGVQRSSLSHLLSGRNKPSLDLILKINENFPE 60
Query: 61 WSAEWIIFGKG----------PQQSNKTAFENDLFSYSSVPEQELHTRTKDNSGIQSPS- 109
WI+ GKG P N T N ++PE + + ++ +++P
Sbjct: 61 VDLYWILNGKGNFPELEIKTEPNIQNTTPILNSNIE-ENMPEDFPNLFSDEDQNVKNPVF 119
Query: 110 GRLEPEIKIVQMPAK-----QIERITVFYTDGSYQEF 141
++ + +IERI VFY +GS++ +
Sbjct: 120 ENIKNNFSNTGNTSNAKHNSEIERIVVFYKNGSFKNY 156
>gi|126664248|ref|ZP_01735240.1| putative DNA-binding protein [Flavobacteria bacterium BAL38]
gi|126623780|gb|EAZ94476.1| putative DNA-binding protein [Flavobacteria bacterium BAL38]
Length = 153
Score = 72.0 bits (175), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 81/164 (49%), Gaps = 34/164 (20%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M++ D R+ +++ ++ SAFAD+ + RS+L+H+ SGRNK +LD I K+++ F E
Sbjct: 1 MVNIDDFIKRLEIILDYYNLSASAFADKINVQRSSLSHLLSGRNKPSLDFIIKVIEVFPE 60
Query: 61 WSAEWIIFGKG----------------------PQQSNKTAFENDLFSYSSVPEQELHTR 98
WI+ GKG P N + E DLFS + E+
Sbjct: 61 VDLYWILNGKGNFPKSEIVAHSVLEIEKSTSSLPIVENPNS-ELDLFSTA---EKVTTPT 116
Query: 99 TKDNSGIQSPSGRLEPEIKIVQMPAKQIERITVFYTDGSYQEFR 142
+N+ ++S IK ++IERI VF+ +G+++ ++
Sbjct: 117 LSENNAVES--------IKTENNAEEEIERIVVFFKNGTFKNYK 152
>gi|88712430|ref|ZP_01106516.1| hypothetical protein FB2170_10796 [Flavobacteriales bacterium
HTCC2170]
gi|88708968|gb|EAR01202.1| hypothetical protein FB2170_10796 [Flavobacteriales bacterium
HTCC2170]
Length = 125
Score = 72.0 bits (175), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 76/143 (53%), Gaps = 20/143 (13%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M++ + R+ ++ ++ S FADR + RS+++H+ SGRNK +L+ + K+++ F E
Sbjct: 1 MVNTEDFIKRLEKILRYYDLSASVFADRISVQRSSISHLLSGRNKPSLEFVLKVIKNFPE 60
Query: 61 WSAEWIIFGKGPQQSNKTAFENDLF-SYSSVPEQELHTRTKDNSGIQSPSGRLEPEIKIV 119
+ W++ GKG S+ N+L + + +P++E +T EP+
Sbjct: 61 VNLYWLLNGKGSFPSDPNTKANNLTPTQNIIPQKEAPLKT-------------EPK---- 103
Query: 120 QMPAKQIERITVFYTDGSYQEFR 142
K IE+I +FY DGS++ +
Sbjct: 104 --KEKTIEKIIIFYDDGSFKSYH 124
>gi|110638108|ref|YP_678317.1| conserved hypothetical protein, possible transcriptional
regulator [Cytophaga hutchinsonii ATCC 33406]
gi|110280789|gb|ABG58975.1| conserved hypothetical protein, possible transcriptional
regulator [Cytophaga hutchinsonii ATCC 33406]
Length = 231
Score = 71.6 bits (174), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 58/96 (60%), Gaps = 1/96 (1%)
Query: 5 DAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAE 64
++I R+L +++ G+T S FAD+ + RS+++H+ SGRNK +LD I KI+ F + +
Sbjct: 3 ESIVERVLVLIKDYGLTASEFADKIDVQRSSMSHLVSGRNKPSLDFIQKILNNFSDINPT 62
Query: 65 WIIFGKGP-QQSNKTAFENDLFSYSSVPEQELHTRT 99
W+I G GP +Q + + D+ +P ++ + T
Sbjct: 63 WLIMGTGPMKQLDLFDIKGDVAPTEPIPAEKFKSET 98
>gi|163755564|ref|ZP_02162683.1| hypothetical protein KAOT1_05367 [Kordia algicida OT-1]
gi|161324477|gb|EDP95807.1| hypothetical protein KAOT1_05367 [Kordia algicida OT-1]
Length = 140
Score = 71.6 bits (174), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 78/149 (52%), Gaps = 17/149 (11%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
MI+ R+ ++E ++ SAF+D+ G+ RS+++HI SGRNK +L+ + KI+ F E
Sbjct: 1 MINTADFTKRLKKILEYYDLSASAFSDKLGVQRSSISHILSGRNKPSLEFVMKILHNFPE 60
Query: 61 WSAEWIIFGKGPQQSNKTAFENDLFSYSSVPEQELHTRTKDNSGIQSPSGRLEPEIKI-- 118
W++ GKG TA EN + + + + ++ T S L E K+
Sbjct: 61 VEVYWLLNGKG-SFPKLTASEN-VSAVTPIATPKIAQTTNQQS--------LPLEEKVTS 110
Query: 119 -----VQMPAKQIERITVFYTDGSYQEFR 142
V K I++I +FYTDG+++ ++
Sbjct: 111 TIPTQVSSKGKTIDKIVIFYTDGTFEAYQ 139
>gi|91217684|ref|ZP_01254641.1| transcriptional regulator, putative [Psychroflexus torquis ATCC
700755]
gi|91184188|gb|EAS70574.1| transcriptional regulator, putative [Psychroflexus torquis ATCC
700755]
Length = 136
Score = 71.2 bits (173), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 77/154 (50%), Gaps = 29/154 (18%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M+++ R+ + E ++ S+FADR + R++++HI SGRNK +LD + K++ F E
Sbjct: 1 MVNSLDFTKRLKEIFEYYDLSASSFADRIDVGRASISHIISGRNKPSLDFVMKVVSNFKE 60
Query: 61 WSAEWIIFGKG--PQQSN-------KTAFENDLFSYSSVPEQ--ELHTRTKDNSGIQSPS 109
W++ GKG P + K F +D+ PE+ + DNS
Sbjct: 61 VELYWLLNGKGYFPSSKDNRNVINAKEEFVDDI-----KPEEPKTIEPLLNDNSN----- 110
Query: 110 GRLEPEIKIVQMPAKQIERITVFYTDGSYQEFRS 143
V P K+I+++ + YTDGS++ F++
Sbjct: 111 --------QVSHPKKEIQKVIICYTDGSFESFQN 136
>gi|163789015|ref|ZP_02183459.1| hypothetical protein FBALC1_09417 [Flavobacteriales bacterium
ALC-1]
gi|159875679|gb|EDP69739.1| hypothetical protein FBALC1_09417 [Flavobacteriales bacterium
ALC-1]
Length = 122
Score = 70.9 bits (172), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 74/144 (51%), Gaps = 25/144 (17%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
MI++ R+ +++ G T S FA++ G+ RS+++HI SGRNK +LD + K++ + E
Sbjct: 1 MINSIKFTERLQKVIDFYGETASGFAEKIGVQRSSISHILSGRNKPSLDFVMKVLHSYPE 60
Query: 61 WSAEWIIFGKG--PQQSNKTAFENDLFSYSSVPEQELHTRTKDNSGIQSPSGRLEPEIKI 118
W++ GKG P Q + N S++ + EL +T Q P E
Sbjct: 61 VELYWLMNGKGEFPSQPKISESPN-----SNLTQTELKPKT------QIPESNSE----- 104
Query: 119 VQMPAKQIERITVFYTDGSYQEFR 142
IE+I +FY DG+++ ++
Sbjct: 105 -------IEKIVIFYKDGTFKSYK 121
>gi|88802945|ref|ZP_01118472.1| putative DNA-binding protein [Polaribacter irgensii 23-P]
gi|88781803|gb|EAR12981.1| putative DNA-binding protein [Polaribacter irgensii 23-P]
Length = 119
Score = 70.1 bits (170), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 72/144 (50%), Gaps = 28/144 (19%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
MI++ R+ ++E + S+FAD+ G+ RS+++HI SGRNK +LD I KI E
Sbjct: 1 MINSADFTNRLKEVIEYYQFSASSFADKVGVQRSSISHILSGRNKPSLDFILKITTELQE 60
Query: 61 WSAEWIIFGKGPQQSNKTAFENDLFSYSSVPEQELHTRTKDNSGIQSPSGRLEPEI--KI 118
W++ GKG F SV +Q+ + ++ P +
Sbjct: 61 VDINWLLKGKGS------------FPKVSVSKQK--------------NAQMNPTLINSE 94
Query: 119 VQMPAKQIERITVFYTDGSYQEFR 142
+ P K+I+ I +FY DG+++E++
Sbjct: 95 IAEPGKKIKNIVIFYEDGTFEEYQ 118
>gi|160889582|ref|ZP_02070585.1| hypothetical protein BACUNI_02008 [Bacteroides uniformis ATCC 8492]
gi|156861099|gb|EDO54530.1| hypothetical protein BACUNI_02008 [Bacteroides uniformis ATCC 8492]
Length = 153
Score = 70.1 bits (170), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 83/149 (55%), Gaps = 12/149 (8%)
Query: 6 AIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKV-TLDMINKIMQGFDEWSAE 64
+IK R +M++ +T AFA+ G++++ ++HI RNK + ++I ++ Q +++ + E
Sbjct: 2 SIKDRFKMVMDREKLTAGAFAESIGVAQATISHILGPRNKYPSTEVILRLHQRYNDINLE 61
Query: 65 WIIFGKGPQQSNKTAFENDLFSY----------SSVPEQELHTR-TKDNSGIQSPSGRLE 113
W++ GKG +N + END F Y S+ PE+ + + + ++SP ++
Sbjct: 62 WLLTGKGNMSNNPDSPENDGFDYPLFAENPENLSNGPEESKNRKEIALETALKSPKEIVK 121
Query: 114 PEIKIVQMPAKQIERITVFYTDGSYQEFR 142
EI + P ++I I +F+ D +Y+ F+
Sbjct: 122 QEIIYKERPPRKITEIRIFFDDNTYETFK 150
>gi|167763094|ref|ZP_02435221.1| hypothetical protein BACSTE_01461 [Bacteroides stercoris ATCC
43183]
gi|167699434|gb|EDS16013.1| hypothetical protein BACSTE_01461 [Bacteroides stercoris ATCC
43183]
Length = 152
Score = 67.8 bits (164), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 78/150 (52%), Gaps = 15/150 (10%)
Query: 6 AIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAEW 65
+IK R +M++ +T AFA+ ++++ ++HI + RN + ++I K+ + +++ + EW
Sbjct: 2 SIKDRFKMIMDREQLTAGAFAESIEVAQATISHILASRNNPSTEVILKLHKRYNDINLEW 61
Query: 66 IIFGKGPQQSNKTAFENDLFSY-------------SSVPEQELHTRTKDNSGIQSPSGRL 112
++ GKG +N + EN+ F Y S PE + + + +P +
Sbjct: 62 LLTGKGNMSNNSPSVENNGFDYPLFADNPENPSNEPSTPENRKEIALE--APVNTPKEIV 119
Query: 113 EPEIKIVQMPAKQIERITVFYTDGSYQEFR 142
+ EI + P K+I I +F+ D +Y+ F+
Sbjct: 120 KQEIIYKERPPKKITEIRIFFDDNTYETFK 149
>gi|88805716|ref|ZP_01121235.1| putative DNA-binding protein [Robiginitalea biformata HTCC2501]
gi|88784534|gb|EAR15704.1| putative DNA-binding protein [Robiginitalea biformata HTCC2501]
Length = 118
Score = 66.2 bits (160), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 66/132 (50%), Gaps = 31/132 (23%)
Query: 10 RILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDEWSAEWIIFG 69
RIL+ EQ T S FAD G+ RS ++H+ SGRNK +LD + K+++ + E W++ G
Sbjct: 13 RILAYYEQ---TASGFADAIGVPRSTISHLLSGRNKPSLDFVMKVIRKYPEVDLYWLLNG 69
Query: 70 KGPQQSNKTAFENDLFSYSSVPEQELHTRTKDNSGIQSPSGRLEPEIKIVQMPAKQIERI 129
KG ++ S P + R EP + ++ + I++I
Sbjct: 70 KG--------------TFPSTPSRPTTIRR-------------EPAVPTIET-NRAIDKI 101
Query: 130 TVFYTDGSYQEF 141
+FY+DG+++ +
Sbjct: 102 VIFYSDGTFESY 113
>gi|89891243|ref|ZP_01202750.1| conserved hypothetical protein, helix-turn-helix domain-containing
[Flavobacteria bacterium BBFL7]
gi|89516555|gb|EAS19215.1| conserved hypothetical protein, helix-turn-helix domain-containing
[Flavobacteria bacterium BBFL7]
Length = 152
Score = 64.3 bits (155), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 78/166 (46%), Gaps = 42/166 (25%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M+++ RI +ME+ + S+FA+ + RS+++HI SGRNK +LD++ I+ F E
Sbjct: 1 MVNSADFTKRIHKIMEKNDLNASSFAEAIHVGRSSISHILSGRNKPSLDLVMNIVDQFPE 60
Query: 61 WSAEWIIFGKGPQQSNKTAFENDLFSYSSVPEQELHT---------RTKDNSGIQSPSGR 111
W++ GKG S P++E+ T K N ++S S
Sbjct: 61 VDLYWLLNGKG-----------------SYPKKEIATIPIAPPVSPTVKKNEPVESKSES 103
Query: 112 LE-----PEIKIVQMP-----------AKQIERITVFYTDGSYQEF 141
++ P++ +P K I +I V Y+DGS++++
Sbjct: 104 IDSTIDSPDLFSSTIPKNNKENSISGKGKNITKIIVLYSDGSFKDY 149
>gi|149370471|ref|ZP_01890160.1| putative DNA-binding protein [unidentified eubacterium SCB49]
gi|149356022|gb|EDM44579.1| putative DNA-binding protein [unidentified eubacterium SCB49]
Length = 139
Score = 63.9 bits (154), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 71/148 (47%), Gaps = 18/148 (12%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
M+++ RI ++ G++ +AFA +RS ++HI SGRNK +L+ + KI++ + E
Sbjct: 1 MVNSVEFAKRIEKILSYYGISATAFAVAISFNRSTISHILSGRNKPSLEFVLKIVETYPE 60
Query: 61 WSAEWIIFGKG---PQQ-SNKTAFE---NDLFSYSSVPEQELHTRTKDNSGIQSPSGRLE 113
W++ GKG P + SNK E L + P QE SP
Sbjct: 61 VELYWLLNGKGSFPPSEISNKNISEVKKEPLVKNITSPIQEETIVNTTQQTDTSPRAN-- 118
Query: 114 PEIKIVQMPAKQIERITVFYTDGSYQEF 141
I+RI +FY+DGS++E+
Sbjct: 119 ---------NSNIDRIVIFYSDGSFKEY 137
>gi|86132990|ref|ZP_01051580.1| putative DNA-binding protein [Cellulophaga sp. MED134]
gi|85816508|gb|EAQ37696.1| putative DNA-binding protein [Dokdonia donghaensis MED134]
Length = 149
Score = 55.8 bits (133), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/157 (21%), Positives = 73/157 (46%), Gaps = 29/157 (18%)
Query: 1 MIDADAIKMRILSMMEQLGMTPSAFADRAGISRSALTHITSGRNKVTLDMINKIMQGFDE 60
++D R+ +ME ++ ++FA++ + R+ ++H+ +GRNK +LD + K++ F
Sbjct: 4 ILDNTKFAKRLQKIMESHELSATSFAEKLSVGRATISHLMAGRNKPSLDFVMKVVDTFPN 63
Query: 61 WSAEWIIFGKG----------------PQQSNKTAFENDLFSYSSVPEQELHTRTKDNSG 104
EW+++GK PQ++++ F N L + + ++ +N G
Sbjct: 64 IELEWLLYGKKTTPKPLPPTPQKTIIEPQENSQKNFSNSLQKTTEDKTPKTAQKSMENIG 123
Query: 105 IQSPSGRLEPEIKIVQMPAKQIERITVFYTDGSYQEF 141
S S ++R+ +F DG+++ +
Sbjct: 124 DFSSS-------------KSAVKRVIIFLEDGTFESY 147
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.315 0.130 0.367
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 628,591,426
Number of Sequences: 6515104
Number of extensions: 24948492
Number of successful extensions: 58631
Number of sequences better than 1.0e-04: 27
Number of HSP's better than 0.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 58582
Number of HSP's gapped (non-prelim): 29
length of query: 146
length of database: 2,222,278,849
effective HSP length: 110
effective length of query: 36
effective length of database: 1,505,617,409
effective search space: 54202226724
effective search space used: 54202226724
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 116 (49.3 bits)