BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI0314
(349 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|154489924|ref|ZP_02030185.1| hypothetical protein PARMER... 206 1e-51
gi|34540132|ref|NP_904611.1| hypothetical protein PG0291 [P... 192 3e-47
gi|150009904|ref|YP_001304647.1| hypothetical protein BDI_3... 186 2e-45
gi|67940395|ref|ZP_00532822.1| hypothetical protein Cphamn1... 80 2e-13
gi|120437338|ref|YP_863024.1| gliding motility protein GldO... 69 6e-10
gi|88804984|ref|ZP_01120504.1| hypothetical protein RB2501_... 68 7e-10
gi|86143894|ref|ZP_01062262.1| hypothetical protein MED217_... 67 2e-09
gi|149369927|ref|ZP_01889778.1| hypothetical protein SCB49_... 65 6e-09
gi|83595998|gb|ABC25357.1| conserved hypothetical protein [... 62 4e-08
gi|89891813|ref|ZP_01203315.1| gliding motility gene GldO [... 60 2e-07
gi|88713423|ref|ZP_01107506.1| hypothetical protein FB2170_... 59 6e-07
>gi|154489924|ref|ZP_02030185.1| hypothetical protein PARMER_00153 [Parabacteroides merdae ATCC
43184]
gi|154089366|gb|EDN88410.1| hypothetical protein PARMER_00153 [Parabacteroides merdae ATCC
43184]
Length = 363
Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats.
Identities = 118/333 (35%), Positives = 187/333 (56%), Gaps = 16/333 (4%)
Query: 22 ARRRAEQQKGTRSNANTITTRAQISFPTSAPMEEDVVWRRDIYRELNLVEDANAGLYYPT 81
A R A +++ + +T RAQ + W R IYR+++L ++ NA LYYPT
Sbjct: 41 ASRGAREEEKKENGMPELTVRAQDMNERMTQHIGNARWMRVIYRQIDLTKEQNAPLYYPT 100
Query: 82 EPVGSQMNLFCYIFKLMMMGPRNGGISAYEYRMDGNEVFTDSARIKPLQFLDNYHVYYER 141
+P+ QMNLF IF+L+ G +SAYEY +DG E F+D+ ++ LD ++YE
Sbjct: 101 QPMNGQMNLFSIIFQLVC----EGKLSAYEY-LDGYEDFSDNRKLDLKVMLDRCRIFYEE 155
Query: 142 TD------RGIRLDNSDIPSAEVKGYYLKESAYYDQGTSTFHRKVIALCPIMYRADDFGD 195
T G ++ SDIPS +V+ YY+KE+ Y+DQ S F K +A+CPI+ DD G
Sbjct: 156 TPGKDNEPAGFVVNESDIPSGDVRSYYIKEAWYFDQNNSVFDVKTLAICPILTIVDDMGQ 215
Query: 196 GEVKYPLFWVKYDDLAPFLSKQTIMTSNLNNAATMSVEDYFTMNRYQGKIYKTNNMLGRT 255
+ P+FW+ Y++L P+++ IMTSN+NNA T +++DYF +QG I+KT N++ +
Sbjct: 216 NTM--PMFWIPYENLRPYINTAYIMTSNINNAMTFTLDDYFRRRMFQGDIFKTQNLMNQP 273
Query: 256 LSQYCTTDSAMTKEQRKIENELLAFEKNIFGDAQKRDSVNSIAASKEDAKAVKKTNRRTR 315
L YC T +M +EQ +IEN+L+ FEK+++ + D+ A +K + R R
Sbjct: 274 LQAYCPTPDSMKREQERIENQLITFEKSLY---LQPDTAQLAADTKGKKSKNATVSARGR 330
Query: 316 RSAETTTARPSRRSSSSSGTSNNAARVSVRRQR 348
++ ++ + +A SVRR+R
Sbjct: 331 KTEAAKESKQKEVKIKAPKAQKSAPVRSVRRRR 363
>gi|34540132|ref|NP_904611.1| hypothetical protein PG0291 [Porphyromonas gingivalis W83]
gi|34396444|gb|AAQ65510.1| hypothetical protein PG_0291 [Porphyromonas gingivalis W83]
Length = 361
Score = 192 bits (487), Expect = 3e-47, Method: Composition-based stats.
Identities = 121/365 (33%), Positives = 192/365 (52%), Gaps = 28/365 (7%)
Query: 2 KRIFLIVCVAAVAISTFAQPARRRAEQQKGTR------SNANTITTRAQISFPTSAPMEE 55
K + + A V+IS+ AQ Q G T++ RAQ +
Sbjct: 7 KAVIGAILAATVSISSVAQENTNNRSPQVGRAPRNTEVEQTTTLSNRAQEFNRRLTQKTD 66
Query: 56 DVVWRRDIYRELNLVEDANAGLYYPTEPVGSQMNLFCYIFKLMMMGPRNGGISAYEYRMD 115
+ WRR +YR ++L+E++NA LYYP P+G + NLF IF L+ + + YEY +D
Sbjct: 67 NAPWRRVVYRRVDLMEESNAVLYYPPRPIGDRKNLFSTIFGLI----NSNSLDVYEY-LD 121
Query: 116 GNEVFTDSARIKPLQFLDNYHVYYE----RTDRGIRLDNSDIPSAEVKGYYLKESAYYDQ 171
G E FTD +IK +FLD + +YY+ + ++ +SDIPSAEVK YY+KE Y+
Sbjct: 122 GFEAFTDQYKIKFQEFLDRFGIYYQPSTNKNAELFKVADSDIPSAEVKAYYVKEEWYFTP 181
Query: 172 GTSTFHRKVIALCPIMYRADDFGDGEVKYPLFWVKYDDLAPFLSKQTIMTSNLNNAATMS 231
S K+ A+CPIM D+FG+ + PLFW+ Y+++ P+++++ +M S+LNN +
Sbjct: 182 TNSDVDIKIQAICPIMTGQDEFGEVRNQ-PLFWIPYENIRPYIARERVMLSSLNNTRNST 240
Query: 232 VEDYFTMNRYQGKIYKTNNMLGRTLSQYCTTDSAMTKEQRKIENELLAFEKNIF------ 285
++D+F +N Y+G I KT N+ R L++YC T +M E ++I+ EL F +F
Sbjct: 241 IDDFFRLNLYKGDIVKTENLHNRALAEYCPTPDSMKMESKRIDKELQGFRDGLFVTQDTT 300
Query: 286 --GDAQKRDSVNSIAASKEDAKAVKKTNRRTRRSAETTTARPSRRSSSSSGTSNNAARVS 343
A+ + S +T + +AET P ++ +S N AA S
Sbjct: 301 WMKQAETKKSKGKKLEKARGKNITSRTRGQGEGAAETEAVEPKKQKAS----KNKAATRS 356
Query: 344 VRRQR 348
VRR++
Sbjct: 357 VRRRK 361
>gi|150009904|ref|YP_001304647.1| hypothetical protein BDI_3321 [Parabacteroides distasonis ATCC
8503]
gi|149938328|gb|ABR45025.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 363
Score = 186 bits (473), Expect = 2e-45, Method: Composition-based stats.
Identities = 106/282 (37%), Positives = 164/282 (58%), Gaps = 19/282 (6%)
Query: 16 STFAQPARRRAEQ-QKGTRSNAN-----TITTRAQISFPTSAPMEEDVVWRRDIYRELNL 69
+T Q RRR+ Q +G R++ +T RAQ + W R IYR+++L
Sbjct: 30 ATQQQQTRRRSPQASRGERNDKKDTGLPELTVRAQDMNERLTQEIGNARWMRIIYRQVDL 89
Query: 70 VEDANAGLYYPTEPVGSQMNLFCYIFKLMMMGPRNGGISAYEYRMDGNEVFTDSARIKPL 129
+++ NA LYYPT P+ QMNLF IF+L+ I AYEY +DG E F ++ I
Sbjct: 90 MKEQNAPLYYPTRPMNGQMNLFSVIFQLL----GENKIKAYEY-LDGYEEFDEAHLINFK 144
Query: 130 QFLDNYHVYYERT------DRGIRLDNSDIPSAEVKGYYLKESAYYDQGTSTFHRKVIAL 183
LD +++ YE + ++ SDIP+A+V+ YY+KE+ Y+DQ S F K++A+
Sbjct: 145 DLLDRFYILYEEIPGRAGEEPTFVINESDIPAADVRSYYVKEAWYFDQNNSAFDVKILAI 204
Query: 184 CPIMYRADDFGDGEVKYPLFWVKYDDLAPFLSKQTIMTSNLNNAATMSVEDYFTMNRYQG 243
CPI+ D G E P+FW+ Y+++ P++S IMTSN+NNA T +++DYF ++G
Sbjct: 205 CPILTSTGDMG--ETTMPMFWLPYENIRPYISNSYIMTSNMNNAMTFTMDDYFRRRMFEG 262
Query: 244 KIYKTNNMLGRTLSQYCTTDSAMTKEQRKIENELLAFEKNIF 285
I KT N++ L YC T ++ EQ +IE++L FEK+++
Sbjct: 263 DIIKTQNLMNLPLQAYCPTPDSLKNEQARIEDQLTGFEKSLW 304
>gi|67940395|ref|ZP_00532822.1| hypothetical protein Cphamn1DRAFT_1025 [Chlorobium phaeobacteroides
BS1]
gi|67913392|gb|EAM62813.1| hypothetical protein Cphamn1DRAFT_1025 [Chlorobium phaeobacteroides
BS1]
Length = 282
Score = 80.1 bits (196), Expect = 2e-13, Method: Composition-based stats.
Identities = 66/255 (25%), Positives = 118/255 (46%), Gaps = 33/255 (12%)
Query: 45 ISFPTSAPMEEDVVWRRDIYRELNLVEDANAGLYYPTEPVGSQMNLFCYIFKLMMMGPRN 104
I FPT + DV+W + I+RE++ + N YYP + Q N +I ++ +
Sbjct: 45 IPFPTI--RKADVMWSKRIWREIDFRQKFNQKFYYPLD---VQQNWKSFIMVILDALKES 99
Query: 105 GGISAYEYRMDGNEVFTDSARIKPLQFLDNYHVYYERTDRGIRLDNSDIPSAE------- 157
G +AY+ TD + P+ F + V +TD ++ P+ E
Sbjct: 100 GHFTAYDL------TXTDELLV-PITFNE---VVGRQTDTLHQMGRRPYPAYEEYDTVII 149
Query: 158 -------VKGYYLKESAYYDQGTSTFHRKVIALCPIMYRADDFGDGEVKYPLFWVKYDDL 210
V +KE Y+D+ S ++IALCP+M + D EV P+FW+ Y
Sbjct: 150 TSFDPTKVMRLRVKEDWYFDKQRSQMMVRIIALCPVMIKERD--GQEVTEPMFWISYAQA 207
Query: 211 APFLSKQTIMTSNLNNAATMSVEDYFTMNRYQGKIYKTNNMLGRTLSQYCTTDSAMTKEQ 270
L+ Q ++ + N+A ++ ++ F + IYK N+ R ++ Y T A+ + +
Sbjct: 208 REVLA-QALVFNEFNSAMRLTYDELFWKRLFDSYIYKEQNVYDRRINSYATGVDALLESE 266
Query: 271 RKIENELLAFEKNIF 285
R I+ E+ FE++++
Sbjct: 267 R-IKREMFRFEEDLW 280
>gi|120437338|ref|YP_863024.1| gliding motility protein GldO [Gramella forsetii KT0803]
gi|117579488|emb|CAL67957.1| gliding motility protein GldO [Gramella forsetii KT0803]
Length = 297
Score = 68.6 bits (166), Expect = 6e-10, Method: Composition-based stats.
Identities = 65/306 (21%), Positives = 132/306 (43%), Gaps = 34/306 (11%)
Query: 1 MKRIFLIVCVAAVAISTFAQPARRRAEQ--QKGTRSNANTITTRAQISFPTSAPMEEDVV 58
+K IF+ V + +S+F Q A++ + G +S A + + D++
Sbjct: 3 LKEIFVYGFVMMMGVSSFGQANILNADKPDEIGKKSEAQALVDEEDKPLEYGYVGDRDIL 62
Query: 59 WRRDIYRELNLVEDANAGLYYP--TEPVGSQMNLFCYIFKLMMMGPRNGGISAYEYRMDG 116
W + + ++L E N LYYP T +GS ++ +++ + G I
Sbjct: 63 WSKGTWEIIDLDERVNFPLYYPIDTNNIGSSRR---SLYDVLVRAIKQGEIE-------- 111
Query: 117 NEVFTDS--ARIKPLQFLDNYHVYYERTDRGIR------------LDNSDIPSAEVKGYY 162
V+ DS + L+ + + TD GI +D D+ +A++ Y+
Sbjct: 112 -NVYADSYFKEKRTLKDISATLSKVDTTDLGIEQYNAGEEVDAQFVDRRDLGAADIAEYH 170
Query: 163 LKESAYYDQGTSTFHRKVIALCPIMYRADDFGDGEVKY-PLFWVKYDDLAPFLSKQTIMT 221
++ Y+D+ + +++ + P+ + G+ LFWV Y D L + T
Sbjct: 171 IRGMWYFDKRLAELKYRLLGIAPVAPDVNFIDSGQTDLVELFWVWYPDARKVLHDAKVFT 230
Query: 222 SNLNNAATMSVEDYFTMNRYQGKIYKTNNMLG-RTLSQYCTTDSAMT-KEQRKIENELLA 279
N + +S + R+ G IYK +N+ G R + +Y ++ M E ++I+ ++
Sbjct: 231 GG-NTSQVISFDHLLNSRRFDGIIYKEDNVQGDREIDEYIADNAFMQLMESQRIKEQIRN 289
Query: 280 FEKNIF 285
FE++++
Sbjct: 290 FEQDMW 295
>gi|88804984|ref|ZP_01120504.1| hypothetical protein RB2501_09020 [Robiginitalea biformata
HTCC2501]
gi|88785863|gb|EAR17032.1| hypothetical protein RB2501_09020 [Robiginitalea biformata
HTCC2501]
Length = 300
Score = 68.2 bits (165), Expect = 7e-10, Method: Composition-based stats.
Identities = 70/308 (22%), Positives = 140/308 (45%), Gaps = 35/308 (11%)
Query: 2 KRIFLIVCVAAVAISTFAQPARRRAE--QQKGTRSNANTITTRAQISFPTSAPME----- 54
K+ ++ VAA+ +S +AQ A+ ++ G R T AQ + AP++
Sbjct: 4 KKALVLGAVAALPVSGWAQANILNAKKPEEIGMR-------TEAQKAMDNDAPLDYGYVD 56
Query: 55 -EDVVWRRDIYRELNLVEDANAGLYYPTEPV--GSQMNLFCYIFKLMMMGPRNGGISAYE 111
D++W + ++ ++L E N LYYPT+ V GS ++ +++ +NG ++
Sbjct: 57 DRDILWSKTVWEVIDLDERVNFPLYYPTDTVDIGSDRR---SLYDVLIKNIKNGKLTDVY 113
Query: 112 YRMDGNEVFTDSARIKPLQFLDNYHVYYERTDRGIR-----LDNSDIPSAEVKGYYLKES 166
E S LQ +D + YE+ + G + ++ +I +A+++ Y +K
Sbjct: 114 VDSYFTEKRNFSDLQATLQKVDTTDLGYEQVNAGEQVSAEYINRRNITAADIEEYRIKGM 173
Query: 167 AYYDQGTSTFHRKVIALCPI-----MYRADDFGDGEVKYPLFWVKYDDLAPFLSKQTIMT 221
Y+D+ +++ + P+ D + PLFWV Y L + +
Sbjct: 174 WYFDKRQGELKYRLLGIAPVAPDVNFIDDDSMAPEDALVPLFWVWYPSARQILHEAKVFN 233
Query: 222 SNLNNAATMSVEDYFTMNRYQGKIYKTNNMLG-RTLSQYCTTDSAMTK--EQRKIENELL 278
N+A +S + R+ G IYK +N+ G R + Y D+A+ + E ++I+ ++
Sbjct: 234 QG-NSAQPLSFDMLLNARRFNGMIYKEDNVHGDREIRDY-IADNALFQLLESQRIKEQIR 291
Query: 279 AFEKNIFG 286
E++++
Sbjct: 292 NREQDMWA 299
>gi|86143894|ref|ZP_01062262.1| hypothetical protein MED217_03570 [Flavobacterium sp. MED217]
gi|85829601|gb|EAQ48064.1| hypothetical protein MED217_03570 [Leeuwenhoekiella blandensis
MED217]
Length = 296
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 58/239 (24%), Positives = 120/239 (50%), Gaps = 13/239 (5%)
Query: 56 DVVWRRDIYRELNLVEDANAGLYYP--TEPVGS-QMNLFCYIFKLMMMGPRNGGISAY-E 111
DV+W ++ + ++L E N LYYP T +GS + +L+ + K + G + +Y
Sbjct: 60 DVLWGKNTWEIVDLDERVNFPLYYPIDTNNIGSDRRSLYDVLVKNVKAGKLDIYADSYFN 119
Query: 112 YRMDGNEVFTDSARIKPLQF-LDNYHVYYERTDRGIRLDNSDIPSAEVKGYYLKESAYYD 170
+++ +++ +R+ ++ + YE ++ I D DI +A+++ Y+++ Y+D
Sbjct: 120 QKIELSDIAAAMSRVDTTDLGIEQINAGYEVDEQYI--DRRDISAADIEQYWIRGYWYFD 177
Query: 171 QGTSTFHRKVIALCPIMYRADDFGDGE-VKYPLFWVKYDDLAPFLSKQTIMTSNLNNAAT 229
+ ++I L P+ + D + V PLFWV Y L + + N+A
Sbjct: 178 KRQGELKYRLIGLAPVAPDVNFIDDEDPVMVPLFWVWYPTARDILHEAKVFNPQ-NSAQP 236
Query: 230 MSVEDYFTMNRYQGKIYKTNNMLG-RTLSQYCTTDSAMTK--EQRKIENELLAFEKNIF 285
+S + R+ G IY+ +N+ G R + +Y D+AM + E ++I+ ++ FE +++
Sbjct: 237 ISFDHILNSRRFNGVIYQVDNIQGDREVKEY-IADNAMMQLLESQRIKEQIRNFEIDMW 294
>gi|149369927|ref|ZP_01889778.1| hypothetical protein SCB49_02599 [unidentified eubacterium SCB49]
gi|149356418|gb|EDM44974.1| hypothetical protein SCB49_02599 [unidentified eubacterium SCB49]
Length = 301
Score = 65.1 bits (157), Expect = 6e-09, Method: Composition-based stats.
Identities = 68/254 (26%), Positives = 110/254 (43%), Gaps = 41/254 (16%)
Query: 41 TRAQISFPTSAPM------EEDVVWRRDIYRELNLVEDANAGLYYP--TEPVGSQMNLFC 92
T AQI + P+ E DV+W + + ++L E N LYYP T +GS+
Sbjct: 39 TLAQIQYDNDEPLPYGYVDERDVLWSKSTWEFIDLDERVNFPLYYPIDTNNIGSERRSLY 98
Query: 93 YIFKLMMMGPRNGGISAYEYRMDGNEVFTDS--ARIKPLQFLDNYHVYYERTDRGI---- 146
+ MM +G I+ EV++DS + L LD VY + TD GI
Sbjct: 99 DVLTGAMM---SGEIT---------EVYSDSYFTEKRTLSDLDASLVYRDTTDYGIDQFN 146
Query: 147 ---RLDNSDI-----PSAEVKGYYLKESAYYDQGTSTFHRKVIALCPIMYRA-----DDF 193
R+D+ I +A+V+G+ ++ Y D+ +++ +CP+ A DD
Sbjct: 147 AEGRVDDEYIREFTLDAADVEGWKIRGLWYIDKRQGELKYRLLGICPVAGEARSKAFDDG 206
Query: 194 GDGEVKYPLFWVKYDDLAPFLSKQTIMTSNLNNAATMSVEDYFTMNRYQGKIYKTNNMLG 253
G + LFWV + L N + +S + R+ G IYK +N+ G
Sbjct: 207 GGVDDTVDLFWVFFPAARETLHNAKAFNRK-NTSQPISYDHLLNSRRFSGVIYKEDNVQG 265
Query: 254 -RTLSQYCTTDSAM 266
R +++Y +S M
Sbjct: 266 DREVNEYIVDNSLM 279
>gi|83595998|gb|ABC25357.1| conserved hypothetical protein [uncultured marine bacterium
Ant29B7]
Length = 274
Score = 62.4 bits (150), Expect = 4e-08, Method: Composition-based stats.
Identities = 57/249 (22%), Positives = 108/249 (43%), Gaps = 30/249 (12%)
Query: 48 PTSAPMEEDVVWRRDIYRELNLVEDANAGLYYPTEPVGSQMNLFCYIFKLMMMGPRNGGI 107
P S ++D++W + + + E N LYYP + + + +LF + ++ G I
Sbjct: 43 PYSFIRQDDIMWSTRQWERIQVQEKLNHPLYYPVKALPDRKSLFDVLKDAILT---EGTI 99
Query: 108 SAYEYRMDGNEVFTDSARIKPLQFLDNYHVYY------ERTDRGIRL--DNSDIPSAEVK 159
S EVF D PL + + + + D I L D+ +I + +++
Sbjct: 100 S---------EVFVDDRFEMPLTTQEVADILFRVDTILDPDDPSIVLGIDSIEIKAPDLR 150
Query: 160 GYYLKESAYYDQGTSTFHRKVIALCPIMYRADDFGDGEVKYPLFWVKYDDLAPFLSKQTI 219
+ +K Y+D+ ++I + P++ +D E+ YP FW+ + D +S
Sbjct: 151 YWEIKSDWYFDKKRGEMKNRIIGISPVV---EDPSTREI-YPTFWIWFPDARQAMSTHVA 206
Query: 220 MTSNLNNAATMSVEDYFTMNRYQGKIYKTNNMLGRTLSQYCT---TDSAMTKEQRKIENE 276
+ NN M+ + F M + +YK NN+ R + Y D + E R+I E
Sbjct: 207 YNPS-NNTRRMTFDQIFQMRYFNAVVYKENNVYDRGIKDYKNGRPLDQLL--ESRRIREE 263
Query: 277 LLAFEKNIF 285
L +E +++
Sbjct: 264 LRNYEHDLW 272
>gi|89891813|ref|ZP_01203315.1| gliding motility gene GldO [Flavobacteria bacterium BBFL7]
gi|89515968|gb|EAS18633.1| gliding motility gene GldO [Flavobacteria bacterium BBFL7]
Length = 291
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 66/276 (23%), Positives = 110/276 (39%), Gaps = 17/276 (6%)
Query: 1 MKRIFLIVCVAAVAISTFAQPARRRAEQQKGT-RSNANTITTRAQISFPTSAPMEEDVVW 59
M ++F+ V + S FAQ A+ + A+ P + DV+W
Sbjct: 1 MNKLFVFVLGLMLTSSAFAQNNILNAKDPNEIGKKTASQTENDNDTPLPYGFIGDRDVLW 60
Query: 60 RRDIYRELNLVEDANAGLYYP--TEPVGSQMNLFCYIFKLMMMGPRNGGISAY-EYRMDG 116
R + +++L E N LYYP T VG + +F ++ RNG I Y + +
Sbjct: 61 SRTTWEKIDLDERVNFPLYYPIDTNYVGKERRA---LFHVIAAAARNGEIDLYSDSYFNE 117
Query: 117 NEVFTDSARIKPLQFLDNYHVYYERTDRGIRLD-----NSDIPSAEVKGYYLKESAYYDQ 171
F D L +D E+ + R+D + I + V Y ++ Y D
Sbjct: 118 KREFADIE--TSLSRVDTSDYGIEQFNAEGRVDAEYITKTSIGADNVSAYNVRGLWYLDT 175
Query: 172 GTSTFHRKVIALCPIMYRADDFGDGEVKYPLFWVKYDDLAPFLSKQTIMTSNLNNAATMS 231
+++ + P+ +F D LFW+ Y D+ L + LN+A +S
Sbjct: 176 RQGEMRYRILGIAPVTPDV-NFMDDPKGIELFWIYYPDVRELLHNAKAFNT-LNSAKPIS 233
Query: 232 VEDYFTMNRYQGKIYKTNNMLG-RTLSQYCTTDSAM 266
+ R+ G IYK +N+ G R + Y +S M
Sbjct: 234 FDHILNSRRFSGVIYKMDNVHGDRKIEAYIADNSMM 269
>gi|88713423|ref|ZP_01107506.1| hypothetical protein FB2170_08694 [Flavobacteriales bacterium
HTCC2170]
gi|88708333|gb|EAR00570.1| hypothetical protein FB2170_08694 [Flavobacteriales bacterium
HTCC2170]
Length = 300
Score = 58.5 bits (140), Expect = 6e-07, Method: Composition-based stats.
Identities = 68/315 (21%), Positives = 136/315 (43%), Gaps = 49/315 (15%)
Query: 2 KRIFLIVCVAAVAISTFAQPARRRAE--QQKGTRSNANTITTRAQISFPTSAPME----- 54
K + LI V+ + S AQ A+ Q+ G R T AQ + AP++
Sbjct: 4 KNVLLIGAVSLLPASILAQANILNAKKPQEIGVR-------TEAQKALDNDAPLDYGYVD 56
Query: 55 -EDVVWRRDIYRELNLVEDANAGLYYPTEPVGSQMNLFCYIFKLMMMGPRNGGISAYEYR 113
D++W + I+ ++L E N LYYP + V + ++ +++ ++G +
Sbjct: 57 DRDILWSKTIWETIDLDERVNFPLYYPIDTVDIGADRRS-LYDVLLKNIKSGKLE----- 110
Query: 114 MDGNEVFTDSARIKPLQF---------LDNYHVYYERTDRGIRL-----DNSDIPSAEVK 159
+V+ DS + +F D + YE+ + G L + D+ +A+++
Sbjct: 111 ----DVYVDSYFTEKRKFSELAATLSKKDTTDMGYEQINAGEELSAEFINERDLTAADIE 166
Query: 160 GYYLKESAYYDQGTSTFHRKVIALCPIMYRADDFGD-----GEVKYPLFWVKYDDLAPFL 214
Y +K Y+D+ +++ + P+ + D + K LFWV Y L
Sbjct: 167 EYRIKGMWYFDKRQGELKYRLLGIAPVAPDVNFIDDESMDPADAKVELFWVWYPGARQLL 226
Query: 215 SKQTIMTSNLNNAATMSVEDYFTMNRYQGKIYKTNNMLG-RTLSQYCTTDSAMTK--EQR 271
+ N+A +S + R+ G IY+ +N+ G R ++ Y D+A+ + E +
Sbjct: 227 HDAKVFNQR-NSAQPISFDMLLNARRFNGVIYREDNVHGDRKVNDY-IADNALFQLLEAK 284
Query: 272 KIENELLAFEKNIFG 286
+I+ + E++++
Sbjct: 285 RIKEVIRDREQDMWA 299
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.318 0.130 0.373
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,186,189,431
Number of Sequences: 5470121
Number of extensions: 48359515
Number of successful extensions: 153988
Number of sequences better than 1.0e-05: 14
Number of HSP's better than 0.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 153960
Number of HSP's gapped (non-prelim): 14
length of query: 349
length of database: 1,894,087,724
effective HSP length: 134
effective length of query: 215
effective length of database: 1,161,091,510
effective search space: 249634674650
effective search space used: 249634674650
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 130 (54.7 bits)