BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= TF3154
(397 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|150025188|ref|YP_001296014.1| hypothetical protein FP111... 117 1e-24
gi|110638142|ref|YP_678351.1| hypothetical protein CHU_1742... 108 8e-22
gi|126662372|ref|ZP_01733371.1| hypothetical protein FBBAL3... 74 1e-11
gi|89891315|ref|ZP_01202821.1| hypothetical protein BBFL7_0... 64 2e-08
>gi|150025188|ref|YP_001296014.1| hypothetical protein FP1116 [Flavobacterium psychrophilum JIP02/86]
gi|149771729|emb|CAL43203.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 407
Score = 117 bits (293), Expect = 1e-24, Method: Composition-based stats.
Identities = 100/408 (24%), Positives = 178/408 (43%), Gaps = 41/408 (10%)
Query: 16 LLLLLLGSGMMAQTYRATLPPVEADS---FYAIDLPAPLIGAARDDLADLRIRDEHGREV 72
+LL + + AQ AT ++ D F+ + LP L + DDL+D RI D EV
Sbjct: 5 FILLFIANLSFAQNRIAT-AKIKYDGKEGFHKLVLPTELRSFSNDDLSDFRIFDSKKNEV 63
Query: 73 AYFVREATSVSRGRDFEPYLIELIRRTFQTD----IRIQTDGERISSFMVRVKHADTDKK 128
Y+ + S F Y ++I +T T+ + + IS + + +++ K
Sbjct: 64 PYYFETKNNQSYNNLFVAY--KIISKTAITNKNTVLIFENPKTTISKATLEIANSNITKS 121
Query: 129 AILKGSDDGESWYAVRDCIRLSEQAEVHGTEALLTVSFPVSDYRYYLLSINDSLSAPLNI 188
+ GS D E W+ + + LS + T+ P Y+Y + ND + P+NI
Sbjct: 122 YSISGSMDNEKWFGLVNKDSLSPIYSEETSSGFYTIPIPTCFYKYLKIDFNDKKTLPINI 181
Query: 189 LSVGRNGAGDTHIRH------LMKIPATHYALRTDNDRKQTELTLSFPYPYRFEDVVFYI 242
L++G I H L+ +P + D+K+T++ ++F P + F I
Sbjct: 182 LNIGS-------INHKMSKTTLLTVPYKSKLVTEIKDKKKTKIQINFNNPQIINQISFTI 234
Query: 243 SAPEDYDR--AVWLAGTKIGRLRSEAGSAQALPSLPHSMDT-------------LRFRIA 287
S P+ Y R +V+ T+ + + E Q + + + DT L I
Sbjct: 235 SNPKFYKRNISVYKKATRKVKHKFENYEEQ-IANFELNSDTNNTFNIAQIFEKELFIAIE 293
Query: 288 NGDDQPLRIDSVAARIVRRYAVAQLKRGR-YTLTYGDAAAHAPQYDL-TFRKRVSDDLPH 345
N D+QPL I + + Y + LK YT+T G+ PQYDL F+ ++ +LP
Sbjct: 294 NKDNQPLTISKIELFQIPFYIITDLKNNENYTITTGNKDLDTPQYDLENFKNSITTNLPE 353
Query: 346 LSVVSIEHIGDSDEPPHPWIVFLKTYGVWIVIALVTIQLLYMVWRMMR 393
+ + I +S + + +++ +W+ I+L I + Y V +++
Sbjct: 354 AQITETKQIKNSCKTAANKTFWQQSWFMWLCISLAGIAITYFVISLVK 401
>gi|110638142|ref|YP_678351.1| hypothetical protein CHU_1742 [Cytophaga hutchinsonii ATCC 33406]
gi|110280823|gb|ABG59009.1| hypothetical protein CHU_1742 [Cytophaga hutchinsonii ATCC 33406]
Length = 429
Score = 108 bits (269), Expect = 8e-22, Method: Composition-based stats.
Identities = 116/433 (26%), Positives = 198/433 (45%), Gaps = 58/433 (13%)
Query: 10 KAALTGLLLLLLGSGMMAQT---YRATLP-PVEADSFYAIDLPAPLIGAA-RDDLADLRI 64
K ALT L+ + ++AQ ++ATL V AD FY I + P IGA +D AD+RI
Sbjct: 3 KIALTVFLIAAVSGMLLAQQNYRWKATLADSVTADGFYNI-VVTPEIGAKLQDGYADIRI 61
Query: 65 RDEHGREVAYFVREATSVSRGRDFEPYLI---ELIRRTFQTDIRIQTDGER-ISSFMVRV 120
D +EV + + + F+ Y I E+I + T + I+ ++ I++ +
Sbjct: 62 VDASNQEVPFIQKTEEANFSDTRFKEYPIVSNEIIDKCC-TKLTIKNPAKKPINNICFVL 120
Query: 121 KHADTDKKAILKGSDDGESWYAVRDCIRLSEQAEVHGTEALLTVSFPVSDYRYYLLSI-- 178
K++D K + GSDD +W+AV+ Q T+ + V+FP++DY YY +
Sbjct: 121 KNSDAVKVYQILGSDDSINWFAVKQFDVFYNQYSETDTKVVSYVNFPLTDYLYYRFDLTD 180
Query: 179 -------------NDSLSAPLNILSVGRNGAGDTHIRH--LMKIPATHYALRTDNDRKQT 223
D S P+N+L G +T+++ +PA A + K +
Sbjct: 181 RGYWDETRWNFWHQDRWSYPVNVLKAGYY---ETYLKEGIYQSVPAPAIAQKDSAAVKTS 237
Query: 224 ELTLSFPYPYRFEDVVFYISAPEDYDRAVWLA--GTKIGRLRSEAGSAQALPSLPHSMDT 281
+ +SF Y + F+++ P Y R+V +A T + R + + +S++
Sbjct: 238 FIKISFADNYIINRIRFHVTGPRYYKRSVTIARKETDVQRNKIYYNPIGTIELNSYSLNE 297
Query: 282 LRFR----------IANGDDQPLRIDSVAARIVRRYAVAQLKRG-RYTLTYGDAAAHAPQ 330
F I N D++PL+++ A + RY L++G RY L Y D+A AP
Sbjct: 298 FDFSFFREKEFYLIIENNDNRPLKVEQAEAWQLTRYLKTYLEKGQRYALIYSDSADTAPV 357
Query: 331 YDLTFRKRVSDDLPH-LSVVSIEHIGDSDEPPHPWIV-------FLKTYGVWI-VIALVT 381
YDL + +D +P L +++E D P I+ + VW+ +I L+
Sbjct: 358 YDLQY---FADSIPVILPTLAVEKT-DRHVPALKEIMIDNHTSWYESKLVVWLSIIGLIA 413
Query: 382 IQLLYMVWRMMRK 394
I L +M W+M+ +
Sbjct: 414 I-LSFMSWKMLNE 425
>gi|126662372|ref|ZP_01733371.1| hypothetical protein FBBAL38_03435 [Flavobacteria bacterium BAL38]
gi|126625751|gb|EAZ96440.1| hypothetical protein FBBAL38_03435 [Flavobacteria bacterium BAL38]
Length = 406
Score = 74.3 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 75/349 (21%), Positives = 141/349 (40%), Gaps = 34/349 (9%)
Query: 27 AQTYRATLPPVEADSFYAIDLPAPLIGAARDDLADLRIRDEHGREVAYFVR--------- 77
Q+++ + VE F+ I + + A+ ++ LRI ++ +E+ + V
Sbjct: 18 GQSHKGNIERVEEKGFHRILIAPEVRSASNENFDFLRIYNDEKKEIPFVVDFNKDYFFSN 77
Query: 78 --EATSVSRGRDFEPYLIELIRRTFQTDIRIQTDGERISSFMVRVKHADTDKKAILKGSD 135
++ +S + F+ + I I + + + S +++ + K+ + GSD
Sbjct: 78 RYQSLRISDQKQFKDSVSYYI-------INVVQNMKYCSELSLKIANTGLTKEYNISGSD 130
Query: 136 DGESWYAVRDCIRLSEQAEVHGTEALLTVSFPVSDYRYYLLSINDSLSAPLNILSVGRNG 195
DG W+ + L + + T TVSFP + Y++ + D S P+N+L VG
Sbjct: 131 DGIHWFGLVMNAMLYDLNDFQNTYVRKTVSFPNNSYKFLKIEFIDKNSMPINLLEVGYF- 189
Query: 196 AGDTHIRHLMKIPATHYALRTDNDRKQTELTLSFPYPYRFEDVVFYISAPEDYDRA-VWL 254
GD I + + L D K+T + S Y + + F + +A V++
Sbjct: 190 IGDEKIEPTTVLEVFKHKLIEDKTNKKTIIKFSADNLYTVDGIAFNFKNSRFFRKASVFI 249
Query: 255 AGTKIGRLRSE------------AGSAQALPSLPHSMDTLRFRIANGDDQPLRIDSVAAR 302
T+ + +SE + SA + I N D++PL I +
Sbjct: 250 KETRSVKKKSEIYRKSVATFNLDSNSANSFKWESFQAKDFEIEIENLDNEPLEIKGIKLL 309
Query: 303 IVRRYAVAQLKRGRYTLTYGDAAAHAPQYDLTFRKRVSDDLPHLSVVSI 351
+ Y +A L + D+ PQYDL K + +DL L ++ I
Sbjct: 310 QNQFYLIADLDVSKNYEIVVDSTLSKPQYDLV--KFLPNDLTKLGLIRI 356
>gi|89891315|ref|ZP_01202821.1| hypothetical protein BBFL7_00456 [Flavobacteria bacterium BBFL7]
gi|89516346|gb|EAS19007.1| hypothetical protein BBFL7_00456 [Flavobacteria bacterium BBFL7]
Length = 415
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 79/383 (20%), Positives = 159/383 (41%), Gaps = 44/383 (11%)
Query: 10 KAALTGLLLLLLGSGMMAQ----TYRATLPPVEADSFYAIDLPAPLIGAARDDLADLRI- 64
K + LL+G + AQ Y ++L + D +++I +P + G L D+RI
Sbjct: 5 KLNFCATIFLLIGGYVTAQISDYQYESSLDEIVND-WHSITIPDGVYGKVDPSLKDIRIY 63
Query: 65 ---RDEHGREVAYF--VREATSVSRGRDFEPYLIELIRRTFQTDIRIQTDGERISSFMVR 119
+ + EV Y +++ ++ + + + + I++ I +
Sbjct: 64 GVTKSQDTIEVPYLWNIKKDNVKTKKQSGKIINASKTDQGYYYTIKLNRKST-ICHMKLL 122
Query: 120 VKHADTDKKAILKGSDDGESWYAVRDCIRLSEQAEVHGTEALLTVSFPVSDYRYYLLSIN 179
+++ D K L+ S D + W+ + + R+ + + + FP SDY+YY + +
Sbjct: 123 FSNSNYDWKVDLEASQDQKEWFTILENSRILDIENAVTDYSYTDLHFPTSDYQYYRILVK 182
Query: 180 DSLSAPLNI--LSVGRNGAGDTHIRHLMKIPATHYALRTDNDRKQTELTLSFPYPYRFED 237
+ L +++ AG+ ++ + + K TE+ L
Sbjct: 183 SNEQPKLTSVDMAMEEEVAGE-----IINYEVKKMMVSENKPMKYTEVELELENAVPVSS 237
Query: 238 VVFYISAPEDYDRAV---WLAGTKIGRLRSEAGSAQALPSLPH----SMDTLRFR----- 285
+ I+A DY R + +L+ + +++E G ++ S+D +F+
Sbjct: 238 LQININADYDYFRPMNIRFLSDS----VKTEKGYLYNYTTIKRVTLTSLDNNQFKFKSTV 293
Query: 286 -------IANGDDQPLRIDSVAARIVRRYAVAQLKR-GRYTLTYGDAAAHAPQYDLT-FR 336
I+N D+QPL I S + +A+ Y L YG+ +AH PQYD+T F+
Sbjct: 294 AKKFKIVISNSDNQPLNIQSAMINGFKHELIARFTEPASYKLVYGNPSAHRPQYDITNFK 353
Query: 337 KRVSDDLPHLSVVSIEHIGDSDE 359
+ +L L + S I ++D+
Sbjct: 354 DHIPANLKDLEIGSQNQILNNDD 376
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.322 0.137 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,479,438,997
Number of Sequences: 5470121
Number of extensions: 61432869
Number of successful extensions: 133357
Number of sequences better than 1.0e-05: 4
Number of HSP's better than 0.0 without gapping: 0
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 133346
Number of HSP's gapped (non-prelim): 4
length of query: 397
length of database: 1,894,087,724
effective HSP length: 135
effective length of query: 262
effective length of database: 1,155,621,389
effective search space: 302772803918
effective search space used: 302772803918
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 131 (55.1 bits)