BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI0002
(273 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|29348751|ref|NP_812254.1| hypothetical protein BT_3342 [... 247 5e-64
gi|153807256|ref|ZP_01959924.1| hypothetical protein BACCAC... 244 4e-63
gi|156110736|gb|EDO12481.1| hypothetical protein BACOVA_019... 241 3e-62
gi|156861904|gb|EDO55335.1| hypothetical protein BACUNI_010... 239 9e-62
gi|53711485|ref|YP_097477.1| hypothetical protein BF0194 [B... 239 2e-61
gi|150003608|ref|YP_001298352.1| hypothetical protein BVU_1... 230 7e-59
gi|60679755|ref|YP_209899.1| hypothetical protein BF0159 [B... 214 3e-54
gi|154489933|ref|ZP_02030194.1| hypothetical protein PARMER... 112 3e-23
gi|150009431|ref|YP_001304174.1| hypothetical protein BDI_2... 109 2e-22
gi|34540819|ref|NP_905298.1| hypothetical protein PG1083 [P... 83 1e-14
gi|149277610|ref|ZP_01883751.1| hypothetical protein PBAL39... 73 2e-11
gi|120437494|ref|YP_863180.1| conserved hypothetical protei... 62 4e-08
gi|88801958|ref|ZP_01117486.1| hypothetical protein PI23P_0... 59 2e-07
gi|83855942|ref|ZP_00949471.1| hypothetical protein CA2559_... 59 2e-07
gi|126646801|ref|ZP_01719311.1| hypothetical protein ALPR1_... 58 8e-07
gi|88804750|ref|ZP_01120270.1| hypothetical protein RB2501_... 57 1e-06
gi|91215524|ref|ZP_01252495.1| hypothetical protein P700755... 56 2e-06
gi|86143741|ref|ZP_01062117.1| hypothetical protein MED217_... 55 5e-06
>gi|29348751|ref|NP_812254.1| hypothetical protein BT_3342 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340657|gb|AAO78448.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 259
Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats.
Identities = 125/256 (48%), Positives = 168/256 (65%), Gaps = 3/256 (1%)
Query: 19 LSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYEGTPAYRASLT 78
++ A+AQC +N AF GE++ Y+LYFNWKFVWVK G AS T ++ Y PAYR +L
Sbjct: 4 FALPANAQCEAKNDAFQSGEHVMYDLYFNWKFVWVKAGIASLTTNATTYHSEPAYRINLL 63
Query: 79 TRGNGKLDNYFVMRDTLLCYNTKDLAPLYYRKGAKEGKRYTVDEVFYSYPNGKVQTKQHR 138
G+ + D +F MRDTL C + L P Y+RKGA+EGKRYTVDE ++SY +G KQ R
Sbjct: 64 ALGSKRADFFFKMRDTLTCVMGEKLEPRYFRKGAEEGKRYTVDEAWFSYKDGLCFAKQKR 123
Query: 139 IDNDGEQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYVVDFPLIGGKTLLPARIIYN 198
DGE S C+YDM++I +ARS++PA +K G + FP+ G+ + +IY
Sbjct: 124 TFRDGEVQESEESDSRCIYDMLTILAQARSYDPADYKVGDKIKFPMATGRKVEEQTLIYR 183
Query: 199 GKKTIKADNDKKYRCL--ELAYYEKEDGKWRNLANFFVTDDENHIPVRLDMNLKFGSAKA 256
GK+ +KA+N YRCL L Y+K+ GK + + FFVTDD+NH+PVRLDM L FGSAKA
Sbjct: 184 GKENVKAENGVTYRCLIFSLVEYDKK-GKEKEVITFFVTDDKNHLPVRLDMFLNFGSAKA 242
Query: 257 FLISMKGIRHKIASQV 272
FL ++G RH + S V
Sbjct: 243 FLNDVRGHRHPLTSIV 258
>gi|153807256|ref|ZP_01959924.1| hypothetical protein BACCAC_01534 [Bacteroides caccae ATCC 43185]
gi|149130376|gb|EDM21586.1| hypothetical protein BACCAC_01534 [Bacteroides caccae ATCC 43185]
Length = 290
Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats.
Identities = 128/275 (46%), Positives = 176/275 (64%), Gaps = 4/275 (1%)
Query: 1 MGNKMKKLKFYIASVLL-LLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTAS 59
+ N +K+ +A++L+ + + ASAQC +N AF GE++ Y+LYFNWKFVWVK G AS
Sbjct: 16 VANFRRKIIIGLATLLMGIFVLPASAQCEAKNDAFKSGEHVMYDLYFNWKFVWVKAGLAS 75
Query: 60 WYTVSSVYEGTPAYRASLTTRGNGKLDNYFVMRDTLLCYNTKDLAPLYYRKGAKEGKRYT 119
T ++ Y PAYR +L G+ + D +F MRDTL C + L P Y+RKGA+EGKRYT
Sbjct: 76 LTTNATTYHSEPAYRINLLALGSKRADFFFKMRDTLTCVIGEKLEPHYFRKGAEEGKRYT 135
Query: 120 VDEVFYSYPNGKVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYV 179
VDE ++SY +G Q R DG S C+YDM+SI +ARS++P+ +K G
Sbjct: 136 VDEAWFSYKDGLCFANQKRTYRDGSVTESEESDSRCIYDMLSILAQARSYDPSDYKVGDK 195
Query: 180 VDFPLIGGKTLLPARIIYNGKKTIKADNDKKYRCL--ELAYYEKEDGKWRNLANFFVTDD 237
+ FP+ G+ + +IY GK+ +KA+N YRCL L Y+K+ GK + + FFVTDD
Sbjct: 196 IKFPMATGRKVEEQTLIYRGKENVKAENGVTYRCLIFSLVEYDKK-GKEKEVITFFVTDD 254
Query: 238 ENHIPVRLDMNLKFGSAKAFLISMKGIRHKIASQV 272
NH+PVRLD+ L FGSAKAFL S+ G RH + S V
Sbjct: 255 LNHLPVRLDLFLNFGSAKAFLNSVTGNRHPLTSIV 289
>gi|156110736|gb|EDO12481.1| hypothetical protein BACOVA_01980 [Bacteroides ovatus ATCC 8483]
Length = 259
Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats.
Identities = 124/256 (48%), Positives = 166/256 (64%), Gaps = 3/256 (1%)
Query: 19 LSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYEGTPAYRASLT 78
++ ASAQC +N AF GE++ Y+LYFNWKFVWVK G AS T ++ Y PAYR +L
Sbjct: 4 FALPASAQCEAKNDAFQSGEHVMYDLYFNWKFVWVKAGLASLTTNATTYHSQPAYRINLL 63
Query: 79 TRGNGKLDNYFVMRDTLLCYNTKDLAPLYYRKGAKEGKRYTVDEVFYSYPNGKVQTKQHR 138
G+ + D +F MRDTL C + L P Y+RKGA+EGKRYTVDE ++SY +G Q R
Sbjct: 64 ALGSKRADFFFKMRDTLTCVIGEKLEPRYFRKGAEEGKRYTVDEAWFSYKDGLCLVNQKR 123
Query: 139 IDNDGEQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYVVDFPLIGGKTLLPARIIYN 198
DG +S C+YDM+SI +ARS++PA +K G + FP+ G+ + +IY
Sbjct: 124 TYRDGAFDESEASDSRCIYDMLSILAQARSYDPADYKVGDKIKFPMATGRKVEEQTLIYR 183
Query: 199 GKKTIKADNDKKYRCL--ELAYYEKEDGKWRNLANFFVTDDENHIPVRLDMNLKFGSAKA 256
GK+ +KA+N YRCL L Y+K+ GK + + FFVTDD NH+PVRLD+ L FGSAKA
Sbjct: 184 GKENVKAENGVTYRCLIFSLVEYDKK-GKEKEVITFFVTDDLNHLPVRLDLFLNFGSAKA 242
Query: 257 FLISMKGIRHKIASQV 272
FL ++ G RH + S V
Sbjct: 243 FLNNVTGNRHPLTSIV 258
>gi|156861904|gb|EDO55335.1| hypothetical protein BACUNI_01007 [Bacteroides uniformis ATCC 8492]
Length = 283
Score = 239 bits (611), Expect = 9e-62, Method: Composition-based stats.
Identities = 119/263 (45%), Positives = 170/263 (64%), Gaps = 2/263 (0%)
Query: 12 IASVLLLLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYEGTP 71
+ +V L S S AQC+ +N AF GE++ Y+LYFNWKF+W KVG AS T ++ Y P
Sbjct: 18 VGAVCLGTSQSVQAQCTAKNEAFQSGEHVMYDLYFNWKFIWKKVGLASLTTNATTYHSEP 77
Query: 72 AYRASLTTRGNGKLDNYFVMRDTLLCYNTKDLAPLYYRKGAKEGKRYTVDEVFYSYPNGK 131
A+R +L G+ K D +F MRDTL CY + L PLY+RK A+EG R+TVDE ++SY +G
Sbjct: 78 AFRFNLLCVGSKKTDFFFKMRDTLTCYVSDRLEPLYFRKAAEEGSRHTVDEAWFSYSDGL 137
Query: 132 VQTKQHRIDNDG--EQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYVVDFPLIGGKT 189
KQ R ++ E S C++DM+SI +ARS++P +K G + FP+ G+
Sbjct: 138 ANVKQRRTWHNPVREAQEMEYSDSRCIFDMLSILAQARSYDPKDYKVGQKILFPMATGRR 197
Query: 190 LLPARIIYNGKKTIKADNDKKYRCLELAYYEKEDGKWRNLANFFVTDDENHIPVRLDMNL 249
+ +IY GK+ +KA+ND YRCL ++ E + GK + + FFV+DD+NH+P+RLDM L
Sbjct: 198 VEEQTLIYRGKEEVKANNDTVYRCLVFSFVEYKKGKEKEVITFFVSDDKNHLPIRLDMYL 257
Query: 250 KFGSAKAFLISMKGIRHKIASQV 272
FGSAKAF S++G R+ + S V
Sbjct: 258 NFGSAKAFFKSVRGNRYPMTSVV 280
>gi|53711485|ref|YP_097477.1| hypothetical protein BF0194 [Bacteroides fragilis YCH46]
gi|52214350|dbj|BAD46943.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 292
Score = 239 bits (609), Expect = 2e-61, Method: Composition-based stats.
Identities = 122/262 (46%), Positives = 160/262 (61%), Gaps = 1/262 (0%)
Query: 12 IASVLLLLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYEGTP 71
I ++L S +AQC +N AF GE++ Y LYFNWKF+W KVG AS T S+ Y P
Sbjct: 29 IIALLFAFPSSGNAQCEAKNDAFKSGEHVMYELYFNWKFIWKKVGLASLTTNSTTYHSEP 88
Query: 72 AYRASLTTRGNGKLDNYFVMRDTLLCYNTKDLAPLYYRKGAKEGKRYTVDEVFYSYPNGK 131
AYR +L + + D +F MRDTL T+ L P Y+RKGA+EGKRYTVDE +S+ NG
Sbjct: 89 AYRVNLLAISSKEADFFFKMRDTLTSVMTEKLEPRYFRKGAEEGKRYTVDEARFSFRNGM 148
Query: 132 VQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYVVDFPLIGGKTLL 191
Q R+ DG S C+YDM++I +ARSF+P + G + FP+ G+ +
Sbjct: 149 CYVNQKRVRKDGSITETEQSDNRCIYDMLTILAQARSFDPKEYTIGQRIQFPMATGRRVE 208
Query: 192 PARIIYNGKKTIKADNDKKYRCLELAYYE-KEDGKWRNLANFFVTDDENHIPVRLDMNLK 250
+IY G K I A+ND YRCL + E + GK + + F+VTDD NH+PVRLDM+L
Sbjct: 209 EQTLIYRGIKKITAENDTTYRCLIFSLVEYNKKGKEKEVITFYVTDDRNHLPVRLDMHLN 268
Query: 251 FGSAKAFLISMKGIRHKIASQV 272
FGSAKAFL S+ G RH S V
Sbjct: 269 FGSAKAFLKSVSGYRHPQTSIV 290
>gi|150003608|ref|YP_001298352.1| hypothetical protein BVU_1037 [Bacteroides vulgatus ATCC 8482]
gi|149932032|gb|ABR38730.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 269
Score = 230 bits (587), Expect = 7e-59, Method: Composition-based stats.
Identities = 115/262 (43%), Positives = 165/262 (62%), Gaps = 5/262 (1%)
Query: 14 SVLLLLS-----VSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYE 68
++L+LL ++ AQC+ +N A GE L Y+L FNWKF+WV G A + Y+
Sbjct: 4 TILILLCGWFAIMATRAQCAAQNEAIQAGEELVYDLKFNWKFIWVAAGQAKMDMQAITYQ 63
Query: 69 GTPAYRASLTTRGNGKLDNYFVMRDTLLCYNTKDLAPLYYRKGAKEGKRYTVDEVFYSYP 128
G P +R++L + N ++D +F MRDTL C + L P+Y+RKGA+EG RYTVDEV++SY
Sbjct: 64 GKPCFRSNLISVSNRQVDFFFKMRDTLTCITSSRLEPVYFRKGAEEGDRYTVDEVWFSYK 123
Query: 129 NGKVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYVVDFPLIGGK 188
NGK Q R+ + + EC++DM+SI +RARSF+ + +K G + F + G
Sbjct: 124 NGKCIADQRRMRRERDTVKSKDQSDECIFDMLSILMRARSFDVSDYKVGDKILFDMATGT 183
Query: 189 TLLPARIIYNGKKTIKADNDKKYRCLELAYYEKEDGKWRNLANFFVTDDENHIPVRLDMN 248
+ +IY G+K KA+N KYRCL + E + GK + + F+VTDD+NH+PVRLD+
Sbjct: 184 KVEQQTLIYRGRKNFKAENGVKYRCLVFSLVEYKKGKEKEVITFYVTDDKNHLPVRLDLY 243
Query: 249 LKFGSAKAFLISMKGIRHKIAS 270
L FGSAKAFL +KG RH + S
Sbjct: 244 LNFGSAKAFLREIKGNRHPLTS 265
>gi|60679755|ref|YP_209899.1| hypothetical protein BF0159 [Bacteroides fragilis NCTC 9343]
gi|60491189|emb|CAH05937.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 235
Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats.
Identities = 111/232 (47%), Positives = 143/232 (61%), Gaps = 1/232 (0%)
Query: 42 YNLYFNWKFVWVKVGTASWYTVSSVYEGTPAYRASLTTRGNGKLDNYFVMRDTLLCYNTK 101
Y LYFNWKF+W KVG AS T S+ Y PAYR +L + + D +F MRDTL T+
Sbjct: 2 YELYFNWKFIWKKVGLASLTTNSTTYHSEPAYRVNLLAISSKEADFFFKMRDTLTSVMTE 61
Query: 102 DLAPLYYRKGAKEGKRYTVDEVFYSYPNGKVQTKQHRIDNDGEQHWKTSSQKECVYDMMS 161
L P Y+RKGA+EGKRYTVDE +S+ NG Q R+ DG S C+YDM++
Sbjct: 62 KLEPRYFRKGAEEGKRYTVDEARFSFRNGMCYVNQKRVRKDGSITETEQSDNRCIYDMLT 121
Query: 162 IFLRARSFNPASWKKGYVVDFPLIGGKTLLPARIIYNGKKTIKADNDKKYRCLELAYYE- 220
I +ARSF+P + G + FP+ G+ + +IY G K I A+ND YRCL + E
Sbjct: 122 ILAQARSFDPKEYTIGQRIQFPMATGRRVEEQTLIYRGIKKITAENDTTYRCLIFSLVEY 181
Query: 221 KEDGKWRNLANFFVTDDENHIPVRLDMNLKFGSAKAFLISMKGIRHKIASQV 272
+ GK + + F+VTDD NH+PVRLDM+L FGSAKAFL S+ G RH S V
Sbjct: 182 NKKGKEKEVITFYVTDDRNHLPVRLDMHLNFGSAKAFLKSVSGYRHPQTSIV 233
>gi|154489933|ref|ZP_02030194.1| hypothetical protein PARMER_00162 [Parabacteroides merdae ATCC
43184]
gi|154089375|gb|EDN88419.1| hypothetical protein PARMER_00162 [Parabacteroides merdae ATCC
43184]
Length = 260
Score = 112 bits (279), Expect = 3e-23, Method: Composition-based stats.
Identities = 67/242 (27%), Positives = 117/242 (48%), Gaps = 3/242 (1%)
Query: 34 FNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYEGTPAYRASLTTRGNGKLDNYFVMRD 93
F+ GE + Y LYF W + + G A+ + YEG P+Y L R +G ++ + MRD
Sbjct: 14 FSPGEEVQYELYFKWGLLMPRAGHATLSIRDAEYEGEPSYHYRLIFRTSGIIEKVYKMRD 73
Query: 94 TLLCYNTKDLAPLYYRKGAKEGKRYTVDEVFYSYPNGKVQTKQHRIDNDGEQ-HWKTSSQ 152
T+ C+ T D+ L K E Y +D++ +SY K+ HR + ++
Sbjct: 74 TIDCHFTPDMLLLRSEKRVNENDYYLIDDIRFSYDRKKILAHSHRYTPTRTKIDTMLVTE 133
Query: 153 KECVYDMMSIFLRARSFNPASWKKGYVVDFPLIGGKTLLPARIIYNGKKTIKADNDKKYR 212
++DM+ + RS + K G F + G+ + Y G++ ++ KYR
Sbjct: 134 DPHMFDMLGATMYLRSLDWDKMKSGESFPFQVAIGRERINISFRYTGQQIVERSETLKYR 193
Query: 213 C--LELAYYEKEDGKWRNLANFFVTDDENHIPVRLDMNLKFGSAKAFLISMKGIRHKIAS 270
+ Y+ + + A ++ DDENHIP+++ LK G+A+ + S KG+R+ + S
Sbjct: 194 TRHFYIDIYDDAFTQSKEAAEIWIGDDENHIPIKIRAKLKIGAAEVYYKSSKGLRYPLTS 253
Query: 271 QV 272
+V
Sbjct: 254 RV 255
>gi|150009431|ref|YP_001304174.1| hypothetical protein BDI_2839 [Parabacteroides distasonis ATCC
8503]
gi|149937855|gb|ABR44552.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 288
Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats.
Identities = 71/266 (26%), Positives = 127/266 (47%), Gaps = 4/266 (1%)
Query: 9 KFYIASVLLLLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYE 68
K YI + L V++ + + GE ++Y+LYF W + K G A+ S Y+
Sbjct: 20 KLYIC--VFFLCVASLTSLRAQTLPLSHGERVDYDLYFKWGLIMSKAGLATLSVKESEYQ 77
Query: 69 GTPAYRASLTTRGNGKLDNYFVMRDTLLCYNTKDLAPLYYRKGAKEGKRYTVDEVFYSYP 128
G P++ +L R G ++ F MRDT+ C+ +K+ L+ K EG Y VD + + Y
Sbjct: 78 GAPSWHYNLLFRSAGVIEKVFRMRDTMDCHYSKEPRLLFSSKRTNEGDYYLVDNLQFEYQ 137
Query: 129 -NGKVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYVVDFPLIGG 187
+G++ HR + K V+DM+ + RS + + G F + G
Sbjct: 138 GSGRIDIHSHRHTLKETKIDTMLMAKGRVFDMLGATMYLRSLDWRTMSYGAEFPFMIAIG 197
Query: 188 KTLLPARIIYNGKKTIKADNDK-KYRCLELAYYEKEDGKWRNLANFFVTDDENHIPVRLD 246
+ L+ AR Y G++ ++ K + R + Y++ + + A ++ DDENHIPV++
Sbjct: 198 RELVNARFRYTGQQIVEHKEAKFRTRHFYIDIYDEAFSQAKEAAEVWIGDDENHIPVKIR 257
Query: 247 MNLKFGSAKAFLISMKGIRHKIASQV 272
LK G+A+ + +R + ++
Sbjct: 258 AKLKIGAAEVYYKDSYNLRAPLTCRI 283
>gi|34540819|ref|NP_905298.1| hypothetical protein PG1083 [Porphyromonas gingivalis W83]
gi|34397133|gb|AAQ66197.1| hypothetical protein PG_1083 [Porphyromonas gingivalis W83]
Length = 313
Score = 83.2 bits (204), Expect = 1e-14, Method: Composition-based stats.
Identities = 71/265 (26%), Positives = 117/265 (44%), Gaps = 18/265 (6%)
Query: 15 VLLLLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYEGTPAYR 74
++ + S+S AQ N GE L+Y +Y+ W + + G AS S + +R
Sbjct: 57 IISITSISLQAQQPISNDFSRTGECLSYTIYYKWGALMPRAGDASL----SFEKQDRGFR 112
Query: 75 ASLTTRGNGKLDNYFVMRDTLLCYNTKDLAPLYYRKGAKEGKRYTVDEVFYSYPN--GKV 132
+ L R D F MRDTL C + + +K EG YT+D + +S + ++
Sbjct: 113 SRLLFRTAPFFDAIFSMRDTLDCNLDHKMRIVDGQKHVMEGGDYTMDLIHFSRQDDRNRI 172
Query: 133 QTKQHRIDNDGEQHWKTSS-QKECVYDMMSIFLRARSFNPASWKKGYVVDFP--LIGGKT 189
TK+ R +GE T + DM+ L RS W K P + GK
Sbjct: 173 HTKRFR---NGESRIDTVEITNQISLDMIGAILYLRS---TDWSKSTKERIPVRIFAGKK 226
Query: 190 LLPARIIYNGKKTIKADNDKKYRCLELAYYEKEDGKWRN---LANFFVTDDENHIPVRLD 246
+ Y + K Y ++ E ++N ++T+D N IPVR+
Sbjct: 227 GIDCFFQYESSEVQKTKKGINYHTHRVSMLINESETFKNPKKAITIWLTNDANRIPVRIR 286
Query: 247 MNLKFGSAKAFLISMKGIRHKIASQ 271
M L+ G+A+ +L S++G+RH ++S+
Sbjct: 287 MELRIGAAEVYLKSVEGLRHPLSSR 311
>gi|149277610|ref|ZP_01883751.1| hypothetical protein PBAL39_05463 [Pedobacter sp. BAL39]
gi|149231843|gb|EDM37221.1| hypothetical protein PBAL39_05463 [Pedobacter sp. BAL39]
Length = 258
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 68/268 (25%), Positives = 124/268 (46%), Gaps = 19/268 (7%)
Query: 7 KLKFYIASVLLLLSVSASAQCSF-RNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSS 65
K F I +++LL + + F ++ F GE L Y L + + GT +
Sbjct: 2 KRSFLIITIVLLSFRGWTQELPFVKDPVFRVGEVLQYKLRYG--IITAAEGTLKVLSSDL 59
Query: 66 VYEGTPAYRASLTTRGNGKLDNYFVMRDTLLCY-NTKDLAPLYYRKGAKEGKRYTVDEV- 123
++G P YR S+ +G D ++ +RD Y + KDL P +Y++ +EG D+
Sbjct: 60 KFDGQPTYRLSVDGNTSGTFDVFYKIRDHYDSYIDQKDLKPYFYQENIREGSYRRQDKAR 119
Query: 124 FYSYPNGKVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYVVDFP 183
FY V TK +KT +++ +D++S + ARS + + K G +V+
Sbjct: 120 FYQDDQKVVATKGT---------YKTPNKQ--TFDLVSTYYFARSLDVSKLKTGDMVNLN 168
Query: 184 LIGGKTLLPARIIYNGKKTIKADNDKKYRCLELAYYEKEDGKWRNLANFF--VTDDENHI 241
+ +I + G++TIK K +CL+ + K +R + + VTDD N +
Sbjct: 169 YFLSDEVNQLKIEFMGRETIKT-KLGKIKCLKFSPSIKPGRIFRKDSRLYLWVTDDGNRV 227
Query: 242 PVRLDMNLKFGSAKAFLISMKGIRHKIA 269
PV+ + + G+ + S G+++ +A
Sbjct: 228 PVKAQVEILVGAVTMEIKSADGLKYPLA 255
>gi|120437494|ref|YP_863180.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
gi|117579644|emb|CAL68113.1| conserved hypothetical protein, secreted [Gramella forsetii KT0803]
Length = 258
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 68/272 (25%), Positives = 119/272 (43%), Gaps = 31/272 (11%)
Query: 14 SVLLLLSVSASAQCSFRNTAFNDGEYLNYNLY---FNWKFVWVKVGTASWYTVSSVYEGT 70
++ LL ++AS Q F AF+ GE+ + ++ FN + ++V A + T
Sbjct: 6 AITLLFCLTASTQL-FSQKAFDSGEWFKFRIHYGMFNASYATLEVDDA-------IINNT 57
Query: 71 PAYRASLTTRGNGKLDNYFVMRDTLLCYNTKDLA-PLYYRKGAKEGKRYTVDEVFYSYPN 129
P Y + G L +F + D Y K P + + EG E+ + + N
Sbjct: 58 PVYHIKGRGKSTGFLGLFFKVDDDYQTYIDKRTGKPYKFIRNINEGGYTKNLEIDFDHSN 117
Query: 130 GKVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRARS-FNPASWKKGYVVDFPLIGGK 188
K + N K+ S V+DM+S F R+ N K G + +
Sbjct: 118 NKAH-----VLNKKNSEKKSYSVPNNVHDMLSSFYYIRNQINGEELKPGDEMKVNMFIDD 172
Query: 189 TLLPARIIYNGKKTIKADNDK----KYRCLELA---YYEKEDGKWRNLANFFVTDDENHI 241
L ++++ G++ +K K K+R LA + EKE F+++DD+N I
Sbjct: 173 ENLDFKLVFLGREIVKTKFGKVATLKFRPYVLAGRVFKEKES------LTFWISDDKNKI 226
Query: 242 PVRLDMNLKFGSAKAFLISMKGIRHKIASQVN 273
PV+++ +L GS A L + KG++H+ + +N
Sbjct: 227 PVKIEADLAVGSLDADLEAYKGLKHQFSIIMN 258
>gi|88801958|ref|ZP_01117486.1| hypothetical protein PI23P_04827 [Polaribacter irgensii 23-P]
gi|88782616|gb|EAR13793.1| hypothetical protein PI23P_04827 [Polaribacter irgensii 23-P]
Length = 256
Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats.
Identities = 62/263 (23%), Positives = 118/263 (44%), Gaps = 17/263 (6%)
Query: 14 SVLLLLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYEGTPAY 73
+V+LLL +S ++ +AF GE+L Y + ++ +++ GTA + G +
Sbjct: 6 AVVLLLFLSFTSFSQKEKSAFKGGEWLRYKMNYSG---FLRAGTAILKIEETSLGGKKVF 62
Query: 74 RASLTTRGNGKLDNYFVMRDTLLCYNTKD-LAPLYYRKGAKEG--KRYTVDEVFYSYPNG 130
T +G + +F + D Y KD + P +++ EG K++ V + Y+
Sbjct: 63 HTIGTGWTSGMIKWFFKVDDVYESYFDKDTIKPYLFKRKIDEGGYKKHRVTKFDYASNKV 122
Query: 131 KVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYVVDFPLIGGKTL 190
+Q +++ D T+ V D++S F R+ N +KKG + L +
Sbjct: 123 YIQDIKNQTD--------TTMVFSKVQDILSSFYYLRNQNVKGFKKGDEISIDLFIDSQV 174
Query: 191 LPARIIYNGKKTIKADNDKKYRCLELA--YYEKEDGKWRNLANFFVTDDENHIPVRLDMN 248
P ++++ GK+ + K CL + K + ++TDD N IP+++ +
Sbjct: 175 YPFKLLFLGKEVLNTKFGK-VNCLVIRPLVQSGRTFKAQESVTIWITDDANKIPIKMQAD 233
Query: 249 LKFGSAKAFLISMKGIRHKIASQ 271
L GS +A L + KG+ + Q
Sbjct: 234 LAVGSLRAELENYKGLANAFNKQ 256
>gi|83855942|ref|ZP_00949471.1| hypothetical protein CA2559_02605 [Croceibacter atlanticus
HTCC2559]
gi|83849742|gb|EAP87610.1| hypothetical protein CA2559_02605 [Croceibacter atlanticus
HTCC2559]
Length = 307
Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats.
Identities = 68/275 (24%), Positives = 119/275 (43%), Gaps = 21/275 (7%)
Query: 3 NKMKKLKFYIASVLLLLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYT 62
N MK L IA LLL + + AQ AF+ E+L + +++ W A+
Sbjct: 50 NHMKTL---IAFTLLLTTTVSIAQ----QKAFDTNEWLQFRIHYGW----FNASEATLEV 98
Query: 63 VSSVYEGTPAYRASLTTRGNGKLDNYFVMRDTLLCYNTK-DLAPLYYRKGAKEGKRYTVD 121
Y+GTP + + G L +F + D Y K + P + + EG
Sbjct: 99 KEDTYKGTPVHHIVGKGKSTGLLHLFFEVDDNYETYVDKTNGQPYKFIRKINEGGHTKDI 158
Query: 122 EVFYSYPNGKVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRARS-FNPASWKKGYVV 180
E+ + + + N KT + KE V+DM+S F R+ +P + G
Sbjct: 159 EINFDHSKNTAL-----VHNKKYNTKKTFTTKEKVHDMLSAFYYLRNNLDPTTLTSGEEQ 213
Query: 181 DFPLIGGKTLLPARIIYNGKKTIKADNDKKYRCLELAYYEKEDGKWRNLAN--FFVTDDE 238
L + ++ + ++TIK K +CL+ Y + ++ + F+V+DD
Sbjct: 214 TLNLFFDEENFKFKLKFLERETIKTKFGK-IKCLKFRPYVQAGRVFKEEESLTFWVSDDP 272
Query: 239 NHIPVRLDMNLKFGSAKAFLISMKGIRHKIASQVN 273
N +P++++ +L GS +A L + KG++H VN
Sbjct: 273 NMLPIKIEADLAVGSLEADLSAFKGLKHPFQIVVN 307
>gi|126646801|ref|ZP_01719311.1| hypothetical protein ALPR1_18713 [Algoriphagus sp. PR1]
gi|126576849|gb|EAZ81097.1| hypothetical protein ALPR1_18713 [Algoriphagus sp. PR1]
Length = 313
Score = 57.8 bits (138), Expect = 8e-07, Method: Composition-based stats.
Identities = 65/272 (23%), Positives = 116/272 (42%), Gaps = 16/272 (5%)
Query: 3 NKMKKLKFYIASVLLLLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYT 62
N M+K + VLLL S A +Q + +N AF GE LN+ + + W + + A
Sbjct: 49 NFMRKHFSLVILVLLLGSSRAFSQQNPQNDAFTFGEELNFEVSYGW----LNLADAKLQI 104
Query: 63 VSSVY--EGTPAYRASLTTRGNGKLDNYFVMRDTLLCY-NTKDLAPLYYRKGAKEGKRYT 119
+ P Y+ + + G + + D Y +T D+ P + +EGK
Sbjct: 105 NKKPHTQNNRPHYKIDVYGKTKGAATIFGKVNDNWGTYLDTTDIYPSLSYRHIEEGKYRK 164
Query: 120 VDEVFYSYPNGKVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRARSFNPASWKKGYV 179
++V++ N + DN + K V D++S F R+ + K G +
Sbjct: 165 HEKVYFDQVNYTALVELFEKDNKTLKSVKEYKLVSKVQDIVSGFYYLRTLDLEKLKPGDI 224
Query: 180 VDFPLIGGKTLLPARIIYNGKKTIKADNDKKYRCLELAYYEKEDGKWRNL-----ANFFV 234
V P K ++IY G ++++ + +K + + E K + +V
Sbjct: 225 VFIPGFFDKERYNIKLIYEGTESLETEIGEK----DTYIFSPEVPKNKLFRGDYPVKVWV 280
Query: 235 TDDENHIPVRLDMNLKFGSAKAFLISMKGIRH 266
T D+N IPV++ NL GS +++ KG+R+
Sbjct: 281 TQDQNKIPVKIKANLFLGSLNFDIVNAKGLRN 312
>gi|88804750|ref|ZP_01120270.1| hypothetical protein RB2501_07850 [Robiginitalea biformata
HTCC2501]
gi|88785629|gb|EAR16798.1| hypothetical protein RB2501_07850 [Robiginitalea biformata
HTCC2501]
Length = 288
Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats.
Identities = 59/241 (24%), Positives = 107/241 (44%), Gaps = 17/241 (7%)
Query: 33 AFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYEGTPAYRASLTTRGNGKLDNYFVMR 92
AF GE+L + +++ + + A+ + + G P Y R G +F +
Sbjct: 53 AFKAGEWLKFRIHYGF----LNASYATLHITTDTIRGIPVYHVVGRGRTTGFASIFFKVD 108
Query: 93 DTLLCY-NTKDLAPLYYRKGAKEGKRYTVD-EVFYSYPNGKVQTKQHRIDNDGEQHWKTS 150
DT Y +D P + + EG YT D E+ + Y G + + + TS
Sbjct: 109 DTYESYFGQEDGRPYRFIRKVDEGG-YTKDMEINFDYRKGTALLHDKKNEKKFKFDIGTS 167
Query: 151 SQKECVYDMMSIFLRARS-FNPASWKKGYVVDFPLI-GGKTLLPARIIYNGKKTIKADND 208
Q D++S F R+ ++ S G ++ ++ + P R+ + G++T+K
Sbjct: 168 IQ-----DLISAFYYLRNNYDSKSLVVGESIELNMLYDDDGIFPFRLKFLGRETVKTKFG 222
Query: 209 KKYRCLELAYYEKEDG--KWRNLANFFVTDDENHIPVRLDMNLKFGSAKAFLISMKGIRH 266
K RCL+ Y + K + + +V+DD N IP+R++ +L GS KA L +R+
Sbjct: 223 K-VRCLKFRPYVQSGRVFKEQESLSLWVSDDRNKIPIRIEADLAVGSIKADLDGYNALRN 281
Query: 267 K 267
+
Sbjct: 282 Q 282
>gi|91215524|ref|ZP_01252495.1| hypothetical protein P700755_10428 [Psychroflexus torquis ATCC
700755]
gi|91186476|gb|EAS72848.1| hypothetical protein P700755_10428 [Psychroflexus torquis ATCC
700755]
Length = 255
Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats.
Identities = 58/265 (21%), Positives = 116/265 (43%), Gaps = 23/265 (8%)
Query: 8 LKFYIASVLLLLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVY 67
+KF+++ V+ SV C + +A+ DGE+L++ + K+ W A+ +
Sbjct: 1 MKFFLSLVIAFSSVF----CGYSQSAYEDGEWLSFKI----KYGWFNTSKATLEIKKTKL 52
Query: 68 EGTPAYRASLTTRGNGKLDNYFVMRDTLLCYNTKDLAPLYYRKGAKEGKRYTVDEVFYSY 127
Y + G LD +F +RD Y +D P+ + + EG E+++ +
Sbjct: 53 YNEDVYHIIGNGKSVGLLDVFFKVRDRYETYVNQDGLPVKFIRDINEGGYKKHKELYFDH 112
Query: 128 PNGKVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRAR-SFNPASWKKGYVVDFPLIG 186
+ +V+ ++ +K ++Q DM+S F + R S + + + G +
Sbjct: 113 DSQRVKVVDYKRGTTESFDFKLNTQ-----DMVSAFYKLRNSIDIETLQIGQEFRLNMFF 167
Query: 187 GKTLLPARIIYNGKKTIKADNDKKY---RCLELAYYEKEDGKW--RNLANFFVTDDENHI 241
+ + G + + D K+ CL+ Y K + + + F++T D+N I
Sbjct: 168 DNENYDFKTKFLGYEVL----DSKFGRVACLKFRPYVKAERVFEAQESLTFWITADKNKI 223
Query: 242 PVRLDMNLKFGSAKAFLISMKGIRH 266
P++++ L GS A L + KG+ H
Sbjct: 224 PLKIEAELSVGSLIAELDAFKGLSH 248
>gi|86143741|ref|ZP_01062117.1| hypothetical protein MED217_00570 [Flavobacterium sp. MED217]
gi|85829784|gb|EAQ48246.1| hypothetical protein MED217_00570 [Leeuwenhoekiella blandensis
MED217]
Length = 258
Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats.
Identities = 58/259 (22%), Positives = 114/259 (44%), Gaps = 15/259 (5%)
Query: 11 YIASVLLLLSVSASAQCSFRNTAFNDGEYLNYNLYFNWKFVWVKVGTASWYTVSSVYEGT 70
Y+ ++LL++ SA AQ AF DGE+ + + ++ W K G A+ + +G
Sbjct: 5 YLILLMLLIAGSAFAQ----EAAFKDGEWFRFRISYSG---WWKAGEATLSVNNETLKGK 57
Query: 71 PAYRASLTTRGNGKLDNYFVMRDTLLCYNTKDLA-PLYYRKGAKEGKRYTVDEVFYSYPN 129
P Y G +F + D Y K P + + EG +T D++
Sbjct: 58 PVYHVKGKGVTTGMTKLFFGVEDYYETYIDKQTTLPYRFIRKIDEGG-HTKDKIIDFDQQ 116
Query: 130 GKVQTKQHRIDNDGEQHWKTSSQKECVYDMMSIFLRAR-SFNPASWKKGYVVDFPLIGGK 188
+V T +++ KT + ++DM+S F R + + ++ K+G + +
Sbjct: 117 ARVAT----VNDKKHNEIKTFQTEPNIHDMVSSFYYLRNAIDASTLKEGDETVINMFFDQ 172
Query: 189 TLLPARIIYNGKKTIKADNDKKYRCLELAYYEK-EDGKWRNLANFFVTDDENHIPVRLDM 247
++ + G++ +K K + Y + K + +++DD+N IP+++
Sbjct: 173 ENFKFKLKFLGREVVKTKFGKVKALIFRPYVQAGRVFKEKESLTVWISDDQNKIPLQIKA 232
Query: 248 NLKFGSAKAFLISMKGIRH 266
+L GS KA + + KG++H
Sbjct: 233 DLAVGSLKADIDAYKGLKH 251
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.320 0.135 0.416
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,050,087,750
Number of Sequences: 5470121
Number of extensions: 44965190
Number of successful extensions: 85766
Number of sequences better than 1.0e-05: 20
Number of HSP's better than 0.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 85730
Number of HSP's gapped (non-prelim): 20
length of query: 273
length of database: 1,894,087,724
effective HSP length: 131
effective length of query: 142
effective length of database: 1,177,501,873
effective search space: 167205265966
effective search space used: 167205265966
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 128 (53.9 bits)