BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= AA01299
(159 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|53733263|ref|ZP_00349675.1| hypothetical protein Hflu203... 86 8e-16
gi|53733012|ref|ZP_00349629.1| COG1196: Chromosome segregat... 86 1e-15
gi|68249037|ref|YP_248149.1| competence protein C [Haemophi... 85 1e-15
gi|16272385|ref|NP_438598.1| competence protein C [Haemophi... 82 7e-15
gi|148827634|ref|YP_001292387.1| competence protein C [Haem... 80 3e-14
gi|145640402|ref|ZP_01795986.1| competence protein C [Haemo... 80 3e-14
gi|145632682|ref|ZP_01788416.1| competence protein C [Haemo... 80 3e-14
gi|152978369|ref|YP_001343998.1| competence C protein, puta... 80 3e-14
gi|145630386|ref|ZP_01786167.1| competence protein C [Haemo... 80 3e-14
gi|15603092|ref|NP_246164.1| ComC [Pasteurella multocida su... 64 4e-09
gi|52426027|ref|YP_089164.1| hypothetical protein MS1972 [M... 62 1e-08
>gi|53733263|ref|ZP_00349675.1| hypothetical protein Hflu203001727 [Haemophilus influenzae R2866]
gi|145628807|ref|ZP_01784607.1| competence protein C [Haemophilus influenzae 22.1-21]
gi|145638653|ref|ZP_01794262.1| competence protein C [Haemophilus influenzae PittII]
gi|144979277|gb|EDJ88963.1| competence protein C [Haemophilus influenzae 22.1-21]
gi|145272248|gb|EDK12156.1| competence protein C [Haemophilus influenzae PittII]
Length = 173
Score = 85.5 bits (210), Expect = 8e-16, Method: Composition-based stats.
Identities = 54/153 (35%), Positives = 95/153 (62%), Gaps = 7/153 (4%)
Query: 10 LIFVCLLA--LYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQLLTP 67
L F+ LL+ ++ P DY + S+ I +L +++ +L HQQKIL +L++++ + L+P
Sbjct: 25 LTFLLLLSTVIFRPVLDYIEGSSRFHEIENELAVKRSELLHQQKILTSLQQQSESRKLSP 84
Query: 68 ELAGKLPPINQQIQQFASKLHIA-HSQWDFHQKPLLKLQLHGQFTDLREFLTALFSANDE 126
ELA ++ P+N+QIQ+ A++ ++ H +W+ QKP+L LQL G F + FLTAL + + +
Sbjct: 85 ELAAQIIPLNKQIQRLAARNGLSQHLRWEMGQKPILHLQLTGHFEKTKTFLTALLANSSQ 144
Query: 127 LELLEWHIIKNADTNDGHSIHSELLFQLHTKEK 159
L + IK D+ + +E++FQL + K
Sbjct: 145 LSVSRLQFIKPEDS----PLQTEIIFQLDKETK 173
>gi|53733012|ref|ZP_00349629.1| COG1196: Chromosome segregation ATPases [Haemophilus influenzae
R2846]
gi|145636294|ref|ZP_01791963.1| competence protein C [Haemophilus influenzae PittHH]
gi|145270459|gb|EDK10393.1| competence protein C [Haemophilus influenzae PittHH]
Length = 173
Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats.
Identities = 54/153 (35%), Positives = 95/153 (62%), Gaps = 7/153 (4%)
Query: 10 LIFVCLLA--LYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQLLTP 67
L F+ LL+ ++ P DY + S+ I +L +++ +L HQQKIL +L++++ + L+P
Sbjct: 25 LTFLLLLSAVIFRPVLDYIEGSSRFHEIENELAVKRSELLHQQKILTSLQQQSESRKLSP 84
Query: 68 ELAGKLPPINQQIQQFASKLHIA-HSQWDFHQKPLLKLQLHGQFTDLREFLTALFSANDE 126
ELA ++ P+N+QIQ+ A++ ++ H +W+ QKP+L LQL G F + FLTAL + + +
Sbjct: 85 ELAAQIIPLNKQIQRLAARNGLSQHLRWEMGQKPILHLQLTGHFEKTKTFLTALLANSSQ 144
Query: 127 LELLEWHIIKNADTNDGHSIHSELLFQLHTKEK 159
L + IK D+ + +E++FQL + K
Sbjct: 145 LSVSRLQFIKPEDS----PLQTEIIFQLDRETK 173
>gi|68249037|ref|YP_248149.1| competence protein C [Haemophilus influenzae 86-028NP]
gi|148825283|ref|YP_001290036.1| competence protein C [Haemophilus influenzae PittEE]
gi|59939225|gb|AAX12383.1| ComC [Haemophilus influenzae]
gi|68057236|gb|AAX87489.1| competence protein C [Haemophilus influenzae 86-028NP]
gi|148715443|gb|ABQ97653.1| competence protein C [Haemophilus influenzae PittEE]
Length = 173
Score = 85.1 bits (209), Expect = 1e-15, Method: Composition-based stats.
Identities = 54/153 (35%), Positives = 95/153 (62%), Gaps = 7/153 (4%)
Query: 10 LIFVCLLA--LYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQLLTP 67
L F+ LL+ ++ P DY + S+ I +L +++ +L HQQKIL +L++++ + L+P
Sbjct: 25 LTFLLLLSAVIFRPVLDYIEGSSRFHEIENELAVKRSELLHQQKILTSLQQQSESRKLSP 84
Query: 68 ELAGKLPPINQQIQQFASKLHIA-HSQWDFHQKPLLKLQLHGQFTDLREFLTALFSANDE 126
ELA ++ P+N+QIQ+ A++ ++ H +W+ QKP+L LQL G F + FLTAL + + +
Sbjct: 85 ELAAQIIPLNKQIQRLAARNGLSQHLRWEMGQKPILHLQLTGHFEKTKTFLTALLANSSQ 144
Query: 127 LELLEWHIIKNADTNDGHSIHSELLFQLHTKEK 159
L + IK D + + +E++FQL + K
Sbjct: 145 LSVSRLQFIKPED----NPLQTEIIFQLDKETK 173
>gi|16272385|ref|NP_438598.1| competence protein C [Haemophilus influenzae Rd KW20]
gi|401667|sp|P31770|COMC_HAEIN Competence protein C (DNA transformation protein comC)
gi|148995|gb|AAA25010.1| A Haemophilus strain carrying a mini-Tn10kan insertion in ORF C
causes a deficiency in transformation. The predicted
molecular weight and pI of ORF C are 19.9 Kd and 10.2
respectively; ORF C; putative
gi|1573412|gb|AAC22096.1| competence protein C (comC) [Haemophilus influenzae Rd KW20]
Length = 173
Score = 82.4 bits (202), Expect = 7e-15, Method: Composition-based stats.
Identities = 52/153 (33%), Positives = 94/153 (61%), Gaps = 7/153 (4%)
Query: 10 LIFVCLLA--LYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQLLTP 67
L F+ LL+ ++ P DY + S+ I +L +++ +L HQQKIL +L++++ + L+P
Sbjct: 25 LTFLLLLSAVIFRPVLDYIEGSSRFHEIENELAVKRSELLHQQKILTSLQQQSESRKLSP 84
Query: 68 ELAGKLPPINQQIQQFASKLHIA-HSQWDFHQKPLLKLQLHGQFTDLREFLTALFSANDE 126
ELA ++ P+N+QIQ+ A++ ++ H +W+ QKP+L LQL G F + FL+AL + + +
Sbjct: 85 ELAAQIIPLNKQIQRLAARNGLSQHLRWEMGQKPILHLQLTGHFEKTKTFLSALLANSSQ 144
Query: 127 LELLEWHIIKNADTNDGHSIHSELLFQLHTKEK 159
L + +K D + +E++FQL + K
Sbjct: 145 LSVSRLQFMKPEDG----PLQTEIIFQLDKETK 173
>gi|148827634|ref|YP_001292387.1| competence protein C [Haemophilus influenzae PittGG]
gi|148718876|gb|ABR00004.1| competence protein C [Haemophilus influenzae PittGG]
Length = 173
Score = 80.5 bits (197), Expect = 3e-14, Method: Composition-based stats.
Identities = 52/153 (33%), Positives = 93/153 (60%), Gaps = 7/153 (4%)
Query: 10 LIFVCLLA--LYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQLLTP 67
L F+ LL+ ++ P DY + ++ + +L + +L HQQKIL +L++++ + L+P
Sbjct: 25 LTFLLLLSAVIFSPVLDYIEASTRFDETEYELAKKSSELLHQQKILTSLQQQSESRKLSP 84
Query: 68 ELAGKLPPINQQIQQFASKLHIA-HSQWDFHQKPLLKLQLHGQFTDLREFLTALFSANDE 126
ELA ++ P+N+QIQ+ A++ ++ H +W+ QKP+L LQL G F + FLTAL + + +
Sbjct: 85 ELAAQIIPLNKQIQRLAARNGLSQHLRWEMGQKPILHLQLTGHFEKTKTFLTALLANSSQ 144
Query: 127 LELLEWHIIKNADTNDGHSIHSELLFQLHTKEK 159
L + IK D+ + +E++FQL + K
Sbjct: 145 LSVSRLQFIKPEDS----PLQTEIIFQLDKETK 173
>gi|145640402|ref|ZP_01795986.1| competence protein C [Haemophilus influenzae R3021]
gi|145274988|gb|EDK14850.1| competence protein C [Haemophilus influenzae 22.4-21]
Length = 173
Score = 80.5 bits (197), Expect = 3e-14, Method: Composition-based stats.
Identities = 51/153 (33%), Positives = 92/153 (60%), Gaps = 7/153 (4%)
Query: 10 LIFVCLLA--LYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQLLTP 67
L F+ LL+ ++ P DY + S+ I +L +++ +L HQQKIL +L++++ + L+P
Sbjct: 25 LTFLLLLSTVIFRPVLDYIEGSSRFHEIENELAVKRSELLHQQKILTSLQQQSESRKLSP 84
Query: 68 ELAGKLPPINQQIQQFASKLHIA-HSQWDFHQKPLLKLQLHGQFTDLREFLTALFSANDE 126
ELA ++ P+N+QIQ+ + ++ H +W+ Q+P+L LQL G F + FLTAL + +
Sbjct: 85 ELAAQIIPLNKQIQRLVVRNGLSQHLRWEMGQQPILHLQLTGHFEKTKTFLTALLANTSQ 144
Query: 127 LELLEWHIIKNADTNDGHSIHSELLFQLHTKEK 159
L + +K D + + +E++FQL + K
Sbjct: 145 LSVSRLQFMKPED----NPLQTEIIFQLDRETK 173
>gi|145632682|ref|ZP_01788416.1| competence protein C [Haemophilus influenzae 3655]
gi|145634566|ref|ZP_01790275.1| competence protein C [Haemophilus influenzae PittAA]
gi|144986877|gb|EDJ93429.1| competence protein C [Haemophilus influenzae 3655]
gi|145268111|gb|EDK08106.1| competence protein C [Haemophilus influenzae PittAA]
Length = 173
Score = 80.5 bits (197), Expect = 3e-14, Method: Composition-based stats.
Identities = 51/153 (33%), Positives = 92/153 (60%), Gaps = 7/153 (4%)
Query: 10 LIFVCLLA--LYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQLLTP 67
L F+ LL+ ++ P DY + S+ I +L +++ +L HQQKIL +L++++ + L+P
Sbjct: 25 LTFLLLLSAVIFRPVLDYIEGSSRFHEIENELAVKRSELLHQQKILTSLQQQSESRKLSP 84
Query: 68 ELAGKLPPINQQIQQFASKLHIA-HSQWDFHQKPLLKLQLHGQFTDLREFLTALFSANDE 126
EL ++ P+N+QIQ+ A++ ++ H +W+ Q+P+L LQL G F + FLTAL + +
Sbjct: 85 ELVAQIIPLNKQIQRLAARYGLSQHLRWEMGQQPILHLQLVGHFEKTKTFLTALLANASQ 144
Query: 127 LELLEWHIIKNADTNDGHSIHSELLFQLHTKEK 159
L + +K D + +E++FQL + K
Sbjct: 145 LSVSRLQFMKPEDG----PLQTEIIFQLDRETK 173
>gi|152978369|ref|YP_001343998.1| competence C protein, putative [Actinobacillus succinogenes 130Z]
gi|150840092|gb|ABR74063.1| competence C protein, putative [Actinobacillus succinogenes 130Z]
Length = 197
Score = 80.5 bits (197), Expect = 3e-14, Method: Composition-based stats.
Identities = 54/147 (36%), Positives = 79/147 (53%), Gaps = 9/147 (6%)
Query: 21 PTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQLLTPELAGKLPPINQQI 80
P + YW+ +LQ +QE+ HQ ++LAAL++K + L + KL +Q+I
Sbjct: 52 PAYQYWKNCRSLTENFIELQQKQEQFTHQSRLLAALQQKNQRHL-NKAVTAKLATFHQKI 110
Query: 81 QQFASKLHIAHSQWDFHQKPLLKLQLHGQFTDLREFLTALFSANDELELLEWHIIKNADT 140
Q FA L I HSQW P L L++ G F ++++FLTAL EL LL WHI K +
Sbjct: 111 QSFAVGLSIIHSQWQVQNAPGLDLKVQGHFAEIQQFLTALLQQIPELALLSWHIQKIEEN 170
Query: 141 NDGH--------SIHSELLFQLHTKEK 159
++ H SI S L F+LH ++
Sbjct: 171 DENHRRETGGGSSIESSLSFRLHLSQE 197
>gi|145630386|ref|ZP_01786167.1| competence protein C [Haemophilus influenzae 22.4-21]
gi|144984121|gb|EDJ91558.1| competence protein C [Haemophilus influenzae R3021]
Length = 173
Score = 80.1 bits (196), Expect = 3e-14, Method: Composition-based stats.
Identities = 51/153 (33%), Positives = 92/153 (60%), Gaps = 7/153 (4%)
Query: 10 LIFVCLLA--LYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQLLTP 67
L F+ LL+ ++ P DY + ++ + +L + +L HQQKIL +L++++ + L+P
Sbjct: 25 LTFLLLLSAVIFRPVLDYIEASTRFDETEYELAKKSSELLHQQKILTSLQQQSESRKLSP 84
Query: 68 ELAGKLPPINQQIQQFASKLHIA-HSQWDFHQKPLLKLQLHGQFTDLREFLTALFSANDE 126
ELA ++ P+N+QIQ+ A++ ++ H +W+ QKP+L LQL G F + FLTAL + +
Sbjct: 85 ELAAQIIPLNKQIQRLATRYGLSQHLRWEMGQKPILHLQLTGHFEKTKTFLTALLANTSQ 144
Query: 127 LELLEWHIIKNADTNDGHSIHSELLFQLHTKEK 159
L + +K D + + +E++FQL + K
Sbjct: 145 LSVSRLQFMKPED----NPLQTEIIFQLDRETK 173
>gi|15603092|ref|NP_246164.1| ComC [Pasteurella multocida subsp. multocida str. Pm70]
gi|12721582|gb|AAK03311.1| ComC [Pasteurella multocida subsp. multocida str. Pm70]
Length = 174
Score = 63.5 bits (153), Expect = 4e-09, Method: Composition-based stats.
Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 1/155 (0%)
Query: 4 WQQRALLIFVCLLALYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQ 63
+QQ L V L L FP +YWQ R + + Q++ L HQ+ ILA+L+ +Q
Sbjct: 19 FQQIGLFFVVFGLLLMFPLSNYWQVRQTLNQFVLEEKAQKQALQHQKHILASLERNNLRQ 78
Query: 64 LLTPELAGKLPPINQQIQQFASKLHIAHSQWDFHQKPLLKLQLHGQFTDLREFLTALFSA 123
L+ ++A +L INQ ++ + L +QW F P L L F + F T L +
Sbjct: 79 LMPLDVAHRLTEINQFLEVQLAHLPEQATQWTFQPFPQLVLHFKSDFEQCQAFFTQLLTQ 138
Query: 124 NDELELLEWHIIKNADTNDGHSIHSELLFQLHTKE 158
+L LL I+KN + + +I +E+ FQL K+
Sbjct: 139 YPQLFLLSLQIMKNEEGEES-TIQTEVSFQLQLKQ 172
>gi|52426027|ref|YP_089164.1| hypothetical protein MS1972 [Mannheimia succiniciproducens MBEL55E]
gi|52308079|gb|AAU38579.1| unknown [Mannheimia succiniciproducens MBEL55E]
Length = 177
Score = 61.6 bits (148), Expect = 1e-08, Method: Composition-based stats.
Identities = 38/132 (28%), Positives = 67/132 (50%)
Query: 5 QQRALLIFVCLLALYFPTWDYWQQRSQAEAISQQLQIQQEKLAHQQKILAALKEKAAKQL 64
+Q L+ + L L+ P + Q + + QQ + +QQ++L +L++KA L
Sbjct: 28 KQNTGLLILALFGLFLPLNRLYSSWEQLIRLENNINEQQRQTIYQQRLLQSLEKKAKNDL 87
Query: 65 LTPELAGKLPPINQQIQQFASKLHIAHSQWDFHQKPLLKLQLHGQFTDLREFLTALFSAN 124
LTP+ A L INQ +Q + + I ++QW F +L+L++ G F L +F+T +
Sbjct: 88 LTPQSAALLSQINQYVQSSSVNVKIQNAQWHFSSSAVLQLRMEGDFLSLNQFITDILQKF 147
Query: 125 DELELLEWHIIK 136
+ L L + K
Sbjct: 148 ETLRLSSLKLFK 159
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.321 0.132 0.400
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 573,727,501
Number of Sequences: 5470121
Number of extensions: 21076250
Number of successful extensions: 110496
Number of sequences better than 1.0e-05: 12
Number of HSP's better than 0.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 110475
Number of HSP's gapped (non-prelim): 12
length of query: 159
length of database: 1,894,087,724
effective HSP length: 121
effective length of query: 38
effective length of database: 1,232,203,083
effective search space: 46823717154
effective search space used: 46823717154
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 124 (52.4 bits)