BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= FNP_1248
(211 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|148323783|gb|EDK89033.1| possible outer membrane protein... 370 e-101
gi|15603408|ref|NP_246482.1| hypothetical protein PM1543 [P... 112 1e-23
gi|46156599|ref|ZP_00132423.2| hypothetical protein Haso020... 84 5e-15
gi|46156747|ref|ZP_00132169.2| COG0642: Signal transduction... 83 1e-14
gi|46156746|ref|ZP_00132168.2| hypothetical protein Haso020... 82 2e-14
gi|33151963|ref|NP_873316.1| hypothetical protein HD0805 [H... 80 1e-13
gi|32030859|ref|ZP_00133602.1| hypothetical protein Haso020... 64 7e-09
gi|33152999|ref|NP_874352.1| hypothetical protein HD2037 [H... 61 5e-08
gi|113460483|ref|YP_718545.1| hypothetical protein HS_0337 ... 60 8e-08
gi|53728720|ref|ZP_00131823.2| hypothetical protein Haso020... 56 2e-06
>gi|148323783|gb|EDK89033.1| possible outer membrane protein OmpW [Fusobacterium nucleatum
subsp. polymorphum ATCC 10953]
Length = 211
Score = 370 bits (951), Expect = e-101, Method: Composition-based stats.
Identities = 211/211 (100%), Positives = 211/211 (100%)
Query: 1 MKKILLGLFALTSISAMAAEGVNFYGRVGLDVFSHYNKVSEKDADGEATLKSRGKVAPSV 60
MKKILLGLFALTSISAMAAEGVNFYGRVGLDVFSHYNKVSEKDADGEATLKSRGKVAPSV
Sbjct: 1 MKKILLGLFALTSISAMAAEGVNFYGRVGLDVFSHYNKVSEKDADGEATLKSRGKVAPSV 60
Query: 61 ALEVTKDLDSNLELGLGLAYVSHGKRDFKIVDIDAHGNKEITEGKYPSVNSVPLYVTGKY 120
ALEVTKDLDSNLELGLGLAYVSHGKRDFKIVDIDAHGNKEITEGKYPSVNSVPLYVTGKY
Sbjct: 61 ALEVTKDLDSNLELGLGLAYVSHGKRDFKIVDIDAHGNKEITEGKYPSVNSVPLYVTGKY 120
Query: 121 KFSNSDVKPYVKADLGYSFNKMKKSSDVEGAKAKNGLYTAVGVGVEYMNVTADLAYVFTG 180
KFSNSDVKPYVKADLGYSFNKMKKSSDVEGAKAKNGLYTAVGVGVEYMNVTADLAYVFTG
Sbjct: 121 KFSNSDVKPYVKADLGYSFNKMKKSSDVEGAKAKNGLYTAVGVGVEYMNVTADLAYVFTG 180
Query: 181 AKVKISDEGTSESVKANNSAIRLTVGYKFSI 211
AKVKISDEGTSESVKANNSAIRLTVGYKFSI
Sbjct: 181 AKVKISDEGTSESVKANNSAIRLTVGYKFSI 211
>gi|15603408|ref|NP_246482.1| hypothetical protein PM1543 [Pasteurella multocida subsp. multocida
str. Pm70]
gi|34223152|sp|Q9CKR3|Y1543_PASMU Uncharacterized protein PM1543 precursor
gi|12721933|gb|AAK03627.1| unknown [Pasteurella multocida subsp. multocida str. Pm70]
Length = 236
Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats.
Identities = 77/233 (33%), Positives = 125/233 (53%), Gaps = 25/233 (10%)
Query: 1 MKKILLGLFALTSISAMAAEG-VNFYGRVGLDVFSHYNKVSEKDADGEATLKSRGKVAPS 59
M+KI+LG+ +T+ +AMA N YG+VG+D+ S ++ ++ D +A K+ + +PS
Sbjct: 5 MQKIILGMLVVTASNAMALNNNFNVYGKVGVDLVSRFDTIAITRHDTQAP-KNTARFSPS 63
Query: 60 VALEVTKDLDSNLELGLGLAYVSHGKRDFKIVDIDAHGNKEITE--GKYP--SVNSVPLY 115
+ E+T + E+GLGL Y+ I +GN+ + + +YP NS+PLY
Sbjct: 64 LFAEITYNATPKTEVGLGLGYIHRKSSHHIIKQKTDNGNRALIDVIEEYPVNRYNSIPLY 123
Query: 116 VTGKYKF-SNSDVKPYVKADLGYSFNKMK------KSSDVEG------------AKAKNG 156
K+ F S SD++ Y+K D GY+FNK K +S+D G +NG
Sbjct: 124 FLVKHNFYSQSDLRSYLKVDFGYAFNKTKSTLNLTESTDAAGNNQYYIEKHNISLNVENG 183
Query: 157 LYTAVGVGVEYMNVTADLAYVFTGAKVKISDEGTSESVKANNSAIRLTVGYKF 209
Y +G GVEY N+ A++AY T +++ S + + N A+R +VGY+F
Sbjct: 184 NYYGIGFGVEYKNILAEIAYTHTDSRLTYSSKLIRGESEYKNDALRFSVGYRF 236
>gi|46156599|ref|ZP_00132423.2| hypothetical protein Haso02000779 [Haemophilus somnus 2336]
Length = 198
Score = 84.0 bits (206), Expect = 5e-15, Method: Composition-based stats.
Identities = 65/197 (32%), Positives = 94/197 (47%), Gaps = 36/197 (18%)
Query: 42 KDADGEATLKSRGKVAPSVALEVTKDLDSNLELGLGLAYVSHGKRDFKIVDIDAHGNKEI 101
KD T + +PS+ LE T ++ E+GLGL Y+ ++ F D H +I
Sbjct: 9 KDVTNLPTSSRQNTFSPSLFLEATYNILPQTEIGLGLGYIK--RKGF-----DHHATWQI 61
Query: 102 T-EGKY--------------PSVNSVPLYVTGKYKFS-NSDVKPYVKADLGYSFNKM--- 142
T EG Y +S+P+Y T K ++ N++ K Y+K D+GYS NK+
Sbjct: 62 TPEGNYGIGGDVNVIEKYRINRYSSLPIYFTLKQNYAFNANSKFYLKGDIGYSINKIDNT 121
Query: 143 ----------KKSSDVEGAKAKNGLYTAVGVGVEYMNVTADLAYVFTGAKVKISDEGTSE 192
K ++ KA+NGLY + +GVEY + AD+ Y AKV G
Sbjct: 122 KYIYYTDLDHKTTTTNRNMKARNGLYLGLSIGVEYKSFLADIGYYHAHAKVNYESNGYYS 181
Query: 193 SVKANNSAIRLTVGYKF 209
NN AIRLT+G+KF
Sbjct: 182 WHPYNNDAIRLTLGFKF 198
>gi|46156747|ref|ZP_00132169.2| COG0642: Signal transduction histidine kinase [Haemophilus somnus
2336]
Length = 248
Score = 82.8 bits (203), Expect = 1e-14, Method: Composition-based stats.
Identities = 74/247 (29%), Positives = 111/247 (44%), Gaps = 40/247 (16%)
Query: 3 KILLGLFALTSISAMAAEG---VNFYGRVGLDVFSHYNKVS-----EKDADGEATLKSRG 54
K L L L+ + A++ G NFYG+ G+D+ S + + D +
Sbjct: 2 KYKLSLVLLSILPALSHAGGGNFNFYGKAGIDLTSRFETMQLRTYLNNDVPSIPASSRQN 61
Query: 55 KVAPSVALEVTKDLDSNLELGLGLAYVSHGKRDFKI---VDIDAH-----GNKEITEGKY 106
+PS+ E T + E+GLGL Y+ D V D N + K
Sbjct: 62 TFSPSIFFETTYTILPQTEIGLGLGYIKRKGFDHSASWHVHTDGEWVIRADNTAVEYYKI 121
Query: 107 PSVNSVPLYVTGKYKFS-NSDVKPYVKADLGYSFNKM-----KKSSDVEGA--------- 151
+S+P+Y T K ++ N++ K Y+K D+GYS NK+ KK +D+
Sbjct: 122 NRYSSLPIYFTLKQNYAFNANSKFYLKGDIGYSINKIADTKYKKYTDITNKFELQGENLR 181
Query: 152 --KAKNGLYTAVGVGVEYMNVTADLAYVFTGAKV-------KISDEGTSESVKANNSAIR 202
KA+NGLY + +G+EY + A++ Y T A V K EG NN AIR
Sbjct: 182 KMKARNGLYLGLAIGLEYKSFLAEIGYYHTHAAVNYKTKPYKSKAEGYYAWHSYNNDAIR 241
Query: 203 LTVGYKF 209
LT+G+KF
Sbjct: 242 LTLGFKF 248
>gi|46156746|ref|ZP_00132168.2| hypothetical protein Haso02000500 [Haemophilus somnus 2336]
Length = 248
Score = 82.0 bits (201), Expect = 2e-14, Method: Composition-based stats.
Identities = 68/239 (28%), Positives = 108/239 (45%), Gaps = 37/239 (15%)
Query: 8 LFALTSISAMAAEGVNFYGRVGLDVFSHYN----KVSEKDADGEATLKSR-GKVAPSVAL 62
L L ++S NFYG+ G+D+ S + + + + SR +PS+
Sbjct: 10 LSILPALSHAGGGNFNFYGKAGIDLTSRFETMVLRTEQNNYVPSIPASSRQNTFSPSIFF 69
Query: 63 EVTKDLDSNLELGLGLAYVS----------HGKRDF-KIVDIDAHGNKEITEGKYPSVNS 111
E T + E+GLGL Y+ H +D K G +I + + +S
Sbjct: 70 ETTYTILPQTEIGLGLGYIKRKGFDHSASWHFNKDIQKKTWAQTSGFTDIEQYRIYRYSS 129
Query: 112 VPLYVTGKYKFS-NSDVKPYVKADLGYSFNKMKKSS-------------------DVEGA 151
+P+Y+ K ++ N++ K Y+K ++GYS NKM + +
Sbjct: 130 LPIYLILKQNYAFNANSKFYLKGNIGYSINKMPNTKYAHYSNETSLKIQQTQIGDPIYKM 189
Query: 152 KAKNGLYTAVGVGVEYMNVTADLAYVFTGAKVKISDEGTS-ESVKANNSAIRLTVGYKF 209
KAKNGLY +G+G+EY + A++ Y T A V ++ T NN AIRLT+G KF
Sbjct: 190 KAKNGLYLGLGIGIEYKSFLAEIGYYHTHATVSYKNKNTQYPQYSYNNDAIRLTLGVKF 248
>gi|33151963|ref|NP_873316.1| hypothetical protein HD0805 [Haemophilus ducreyi 35000HP]
gi|33148185|gb|AAP95705.1| conserved hypothetical protein [Haemophilus ducreyi 35000HP]
Length = 267
Score = 79.7 bits (195), Expect = 1e-13, Method: Composition-based stats.
Identities = 69/265 (26%), Positives = 118/265 (44%), Gaps = 56/265 (21%)
Query: 1 MKKILLGLFALTSISAMAAEGVNFYGRVGLDVFSHYNKVSEKDADGEATLKSRGKVAPSV 60
+K ++L + A+ ++ + +N Y +VG D+ S + A K+ K + ++
Sbjct: 3 IKPLVLAISAMPVLATASVGDINLYTKVGTDISSRFESFKISHLAKSAMPKNHSKSSINL 62
Query: 61 ALEVTKDLDSNLELGLGLAYVSH------GKRDF-----KIVDIDAHGNKEI-------- 101
LE T L N E+GLG+ YV G+++F ++++ D +I
Sbjct: 63 FLESTYSLTDNTEVGLGIGYVKRKGQNHVGQQEFLRNAWEVLNRDQSNGADIHKIDKVIV 122
Query: 102 -TEGKYPSVNSVPLYVTGKYKFS-NSDVKPYVKADLGYSFNKMKKSS----------DVE 149
E K +S+PLYVT K F + Y+K D+GY+FNK K ++ D++
Sbjct: 123 KEEYKVNRYSSIPLYVTLKQNFPLIQGIDLYLKGDMGYAFNKSKSTAYHFLSTLIGKDIQ 182
Query: 150 GAKAK------------NGLYTAVGVGVEYMNVTADLAYVFTGAKVKISDEGTS------ 191
K +GLY + +GV+Y N+ A++ Y T AK+ + S
Sbjct: 183 SPNGKERFDALVKTKLNHGLYYGIALGVQYKNLLAEVGYYHTNAKIAYNLNKLSLLTKTL 242
Query: 192 -------ESVKANNSAIRLTVGYKF 209
+ N AIRL+VG++F
Sbjct: 243 ELLPPLPQKTSYKNDAIRLSVGFRF 267
>gi|32030859|ref|ZP_00133602.1| hypothetical protein Haso02002117 [Haemophilus somnus 2336]
Length = 208
Score = 63.5 bits (153), Expect = 7e-09, Method: Composition-based stats.
Identities = 54/184 (29%), Positives = 92/184 (50%), Gaps = 33/184 (17%)
Query: 57 APSVALEVTKDLDSNLELGLGLAYVSHGKRDFKIVDI---------DAHGNKEITEG-KY 106
+PS+ E T ++ E+GLGL Y+ ++ F + I + G + E +
Sbjct: 27 SPSIFFETTYNILPQTEIGLGLGYIK--RKGFDHIAIWPVEGSSLYNPQGQFHLKETYRV 84
Query: 107 PSVNSVPLYVTGKYKFS-NSDVKPYVKADLGYSFNKMKKS-------------------- 145
S+P+Y K ++ N++ K Y+K DLGYS NK++K+
Sbjct: 85 NRYASLPIYFILKQNYALNANTKFYLKGDLGYSINKIRKTVYTLYEDLSLGSSTGYKKDR 144
Query: 146 SDVEGAKAKNGLYTAVGVGVEYMNVTADLAYVFTGAKVKISDEGTSESVKANNSAIRLTV 205
+ K+++GLY +G+G+EY + D+ Y T +K+ ++G+ S NN AIRLT+
Sbjct: 145 DFISNNKSRSGLYLGLGLGIEYKSFLVDIGYYHTHSKITYKEQGSYVSTPYNNDAIRLTL 204
Query: 206 GYKF 209
G+KF
Sbjct: 205 GFKF 208
>gi|33152999|ref|NP_874352.1| hypothetical protein HD2037 [Haemophilus ducreyi 35000HP]
gi|33149224|gb|AAP96741.1| hypothetical protein HD_2037 [Haemophilus ducreyi 35000HP]
Length = 165
Score = 60.8 bits (146), Expect = 5e-08, Method: Composition-based stats.
Identities = 51/149 (34%), Positives = 70/149 (46%), Gaps = 39/149 (26%)
Query: 97 GNKEITEGKYPSVNSVPLYVTGKYKFS---NSDVKPYVKADLGYSFNKMK--------KS 145
GN++ +Y +S+P+Y+T K F + + YVK DLGYSFNK+K K
Sbjct: 20 GNEQYKVNRY---SSLPIYLTLKQNFKLNGHKKINLYVKGDLGYSFNKIKNTHYSLDTKF 76
Query: 146 SDVEGA----------KAKNGLYTAVGVGVEYMNVTADLAYVFTGAKVKISDEGT----- 190
++ G KAKNGLY + VG EY N A++ Y T A + S G
Sbjct: 77 TNNAGIPELTSFPVEIKAKNGLYYGLAVGAEYKNFLAEIGYYRTHANIVYSGTGKFKYPI 136
Query: 191 ----------SESVKANNSAIRLTVGYKF 209
+ NN AIRL+VG+KF
Sbjct: 137 DYINTHKHTINGKSSYNNDAIRLSVGFKF 165
>gi|113460483|ref|YP_718545.1| hypothetical protein HS_0337 [Haemophilus somnus 129PT]
gi|112822526|gb|ABI24615.1| conserved hypothetical protein [Haemophilus somnus 129PT]
Length = 255
Score = 60.1 bits (144), Expect = 8e-08, Method: Composition-based stats.
Identities = 67/249 (26%), Positives = 107/249 (42%), Gaps = 44/249 (17%)
Query: 5 LLGLFALTSISAMAAEGVNFYGRVGLDV---FSHYNKVSEKDADGEATLKSRGK---VAP 58
L+ L L ++S NFYG+ G+D+ F + + D + + + + +P
Sbjct: 7 LVLLSMLPALSHAGGGNFNFYGKAGIDLTSRFETFQLTANNDGPPISAIPASSRKNTFSP 66
Query: 59 SVALEVTKDLDSNLELGLGLAYVSHGKRDFK------------IVDIDAHGNKEITEGKY 106
SV LE T ++ E+GLGL Y+ D D + I K
Sbjct: 67 SVFLEATYNVLPQTEIGLGLGYIKRKGFDHSASWFIKPAGGDGTYSPDPSKHSAIETYKI 126
Query: 107 PSVNSVPLYVTGKYKFS-NSDVKPYVKADLGYSFNKMKKS-------------------- 145
+S+P+Y+ K N + K Y +LGYS NK+ +
Sbjct: 127 NRYSSLPIYMVLKQNIVLNQNTKLYFTGNLGYSLNKIDDTKYNFYRDIDSPGTFKKDEEL 186
Query: 146 SDVEGAKAKNGLYTAVGVGVEYMNVTADLAYVFTGAKVK----ISDEG-TSESVKANNSA 200
S KAKNGLY +GVG+EY + A++ Y T +KV + +G ++ NN A
Sbjct: 187 SKTRKMKAKNGLYLGLGVGLEYHSFLAEIGYYHTKSKVTRPNFVEGQGLVDQTTSYNNDA 246
Query: 201 IRLTVGYKF 209
+R ++G+KF
Sbjct: 247 VRFSIGFKF 255
>gi|53728720|ref|ZP_00131823.2| hypothetical protein Haso02000106 [Haemophilus somnus 2336]
Length = 210
Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats.
Identities = 61/211 (28%), Positives = 91/211 (43%), Gaps = 44/211 (20%)
Query: 39 VSEKDADGEATLKSR-GKVAPSVALEVTKDLDSNLELGLGLAYVS------------HGK 85
++ D + + SR +PS+ E T ++ E+GLGL Y+ HG
Sbjct: 4 IANHDGNPPISASSRQNTFSPSIFFETTYNILPQTEIGLGLGYIKRKGFEHSASWGIHGG 63
Query: 86 RDFKIVDIDAHGNKEITEGKYPSVNSVPLYVTGKYKFSNS-DVKPYVKADLGYSFNKMKK 144
++ + N I K P S+P+Y K ++ S + Y KA+LGYS NK+
Sbjct: 64 GEYAPIS----DNTGIETYKIPRYASLPIYFILKQDYALSRNTNLYFKANLGYSINKISN 119
Query: 145 S--------------------SDVEGAKAKNGLYTAVGVGVEYMNVTADLAYVFTGAKVK 184
+ S + K KNGLY +G+G+EY A++ Y T +KV
Sbjct: 120 TQYKYYTDLEQENVFKLSEFQSGLRPMKVKNGLYLGLGLGIEYKAFLAEIGYYHTKSKVT 179
Query: 185 ISD------EGTSESVKANNSAIRLTVGYKF 209
+ ES NN AIRLT+G KF
Sbjct: 180 RKNYNYEIRNSIYESTSYNNDAIRLTLGVKF 210
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.312 0.131 0.358
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 718,610,932
Number of Sequences: 5470121
Number of extensions: 29693597
Number of successful extensions: 58658
Number of sequences better than 1.0e-05: 10
Number of HSP's better than 0.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 58626
Number of HSP's gapped (non-prelim): 11
length of query: 211
length of database: 1,894,087,724
effective HSP length: 127
effective length of query: 84
effective length of database: 1,199,382,357
effective search space: 100748117988
effective search space used: 100748117988
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 127 (53.5 bits)