BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= TF0648
(168 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|154495347|ref|ZP_02034352.1| hypothetical protein PARMER... 211 2e-53
gi|150007681|ref|YP_001302424.1| hypothetical protein BDI_1... 193 3e-48
gi|150008172|ref|YP_001302915.1| hypothetical protein BDI_1... 78 1e-13
gi|150005662|ref|YP_001300406.1| hypothetical protein BVU_3... 70 6e-11
gi|156862193|gb|EDO55624.1| hypothetical protein BACUNI_006... 69 8e-11
gi|154493598|ref|ZP_02032918.1| hypothetical protein PARMER... 69 1e-10
gi|60681863|ref|YP_212007.1| hypothetical protein BF2383 [B... 65 2e-09
gi|34540671|ref|NP_905150.1| FHA domain protein [Porphyromo... 63 5e-09
gi|53713583|ref|YP_099575.1| hypothetical protein BF2294 [B... 61 2e-08
gi|156110313|gb|EDO12058.1| hypothetical protein BACOVA_025... 61 2e-08
gi|29346236|ref|NP_809739.1| hypothetical protein BT_0826 [... 60 7e-08
gi|153806364|ref|ZP_01959032.1| hypothetical protein BACCAC... 60 7e-08
>gi|154495347|ref|ZP_02034352.1| hypothetical protein PARMER_04404 [Parabacteroides merdae ATCC
43184]
gi|154085271|gb|EDN84316.1| hypothetical protein PARMER_04404 [Parabacteroides merdae ATCC
43184]
Length = 171
Score = 211 bits (536), Expect = 2e-53, Method: Composition-based stats.
Identities = 109/171 (63%), Positives = 125/171 (73%), Gaps = 3/171 (1%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEGIASFKCPKCRHDIPVSYLKHEQ---SADSETVLIRPV 57
MI VKCPHC VGLKVDE KIP I SFKCPKCR IPVS L E+ +++SETVLI+P
Sbjct: 1 MISVKCPHCHVGLKVDEGKIPLDITSFKCPKCRQPIPVSLLSMEKGSVASESETVLIQPF 60
Query: 58 RKECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNHLRIEVKK 117
R G L V+AN DTPEQ FPL EG++IIGRK+ S ATI I T DK MSR H+RIEVKK
Sbjct: 61 RMTTGHLSVIANADTPEQTFPLQEGVYIIGRKSKASTATIGIVTADKSMSREHIRIEVKK 120
Query: 118 DSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHTHVRFKE 168
D++G Y H LSD SKNHT YN YL+ E+VVL +NDEI IG T +RF E
Sbjct: 121 DAKGGYKHYLSDNNSKNHTLYNSNYLENGEIVVLNNNDEIIIGRTVLRFNE 171
>gi|150007681|ref|YP_001302424.1| hypothetical protein BDI_1036 [Parabacteroides distasonis ATCC
8503]
gi|149936105|gb|ABR42802.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 171
Score = 193 bits (490), Expect = 3e-48, Method: Composition-based stats.
Identities = 101/171 (59%), Positives = 121/171 (70%), Gaps = 3/171 (1%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEGIASFKCPKCRHDIPVSYLKHE---QSADSETVLIRPV 57
MI VKCPHC VGLKV E KIP GI SFKCPKC+ IP+S L + Q DS+T+L+ PV
Sbjct: 1 MIAVKCPHCHVGLKVAEGKIPLGITSFKCPKCKEPIPLSLLSEKSNHQEVDSDTILVTPV 60
Query: 58 RKECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNHLRIEVKK 117
+ G L V+ + DTPEQ FPL EG +IGRK++ S+AT+ I T D+ MSR H+ IEVKK
Sbjct: 61 KTGIGRLTVLPDADTPEQAFPLHEGKVVIGRKSNASQATMPIITADRTMSREHICIEVKK 120
Query: 118 DSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHTHVRFKE 168
DS+G Y H L+D SKNHT YN YL+ EVVVL DNDEI IG T +RF E
Sbjct: 121 DSKGGYKHFLTDNNSKNHTLYNNSYLENGEVVVLNDNDEIIIGRTVLRFNE 171
>gi|150008172|ref|YP_001302915.1| hypothetical protein BDI_1538 [Parabacteroides distasonis ATCC
8503]
gi|149936596|gb|ABR43293.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 179
Score = 78.2 bits (191), Expect = 1e-13, Method: Composition-based stats.
Identities = 53/165 (32%), Positives = 80/165 (48%), Gaps = 3/165 (1%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEG-IASFKCPKCRHDIPVSY-LKHEQSADSETVLIRPVR 58
M V CP C L DETK PEG + +F CP+C + K ++A E ++
Sbjct: 1 MKRVFCPKCDNQLSFDETKYPEGKVLAFVCPQCGAQFKIKLGRKVVKTATGEEKEVKEPD 60
Query: 59 KECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEAT-ICIETEDKEMSRNHLRIEVKK 117
CG++ ++ N +Q PL G +IGR+ +E + I T D M R H I VKK
Sbjct: 61 FSCGYITIIENGFAFKQDLPLVMGDNVIGRRNKDTEGVDVPIITSDPSMGRKHCVINVKK 120
Query: 118 DSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
D+ G +++M+ D+ S T + K E V + + + IG T
Sbjct: 121 DTNGKFVYMVRDFPSLTGTFVKNVLVGKKEQVRIGEGTIVTIGAT 165
>gi|150005662|ref|YP_001300406.1| hypothetical protein BVU_3152 [Bacteroides vulgatus ATCC 8482]
gi|149934086|gb|ABR40784.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 174
Score = 69.7 bits (169), Expect = 6e-11, Method: Composition-based stats.
Identities = 53/165 (32%), Positives = 79/165 (47%), Gaps = 3/165 (1%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEGIA-SFKCPKCRHDIPVSYLKHE-QSADSETVLIRPVR 58
M V CP C + DETK EG + F C C+ + K + ++ E +L
Sbjct: 1 MKRVLCPKCDNYITFDETKYTEGQSLVFVCDHCKKQFSIRIGKSKLKATRKEEILDEQAN 60
Query: 59 KE-CGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNHLRIEVKK 117
KE G ++VV N +Q+ PL EG +IGR++ +E I IET D M R H I VK+
Sbjct: 61 KEDFGSIVVVENVFGYKQVLPLMEGDNLIGRRSKGTEVDIPIETSDPSMDRRHCIINVKR 120
Query: 118 DSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
+ +G I+ L D S T L + + +++ I +G T
Sbjct: 121 NKQGEVIYTLRDNDSITGTFLMNEILGNKDRIRIEEGAIITLGAT 165
>gi|156862193|gb|EDO55624.1| hypothetical protein BACUNI_00656 [Bacteroides uniformis ATCC 8492]
Length = 177
Score = 69.3 bits (168), Expect = 8e-11, Method: Composition-based stats.
Identities = 52/167 (31%), Positives = 79/167 (47%), Gaps = 7/167 (4%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEGIA-SFKCPKCRHDIPVSYLKHEQSADSETVLIRP--- 56
M ++CP C+ L DETK EG + F C +C+ + K + A + RP
Sbjct: 1 MKRIRCPKCENYLTFDETKYTEGQSLVFVCDQCKKQFSIRLGKSKMKAPQKEE--RPDEA 58
Query: 57 -VRKECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNHLRIEV 115
++ G + V+ N +Q+ PL EG IIGR+ + + IET D M R H I V
Sbjct: 59 EFKEAFGSITVIENVFGFKQVLPLQEGDNIIGRRCVGNVIDVPIETSDMSMDRRHCIINV 118
Query: 116 KKDSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
K++ +G I+ L D S T N L + + ++D + IG T
Sbjct: 119 KRNKQGVLIYTLRDAPSLTGTFLNNEILGDKDRIRIEDGAIVTIGAT 165
>gi|154493598|ref|ZP_02032918.1| hypothetical protein PARMER_02938 [Parabacteroides merdae ATCC
43184]
gi|154086808|gb|EDN85853.1| hypothetical protein PARMER_02938 [Parabacteroides merdae ATCC
43184]
Length = 177
Score = 68.6 bits (166), Expect = 1e-10, Method: Composition-based stats.
Identities = 52/168 (30%), Positives = 77/168 (45%), Gaps = 9/168 (5%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEG-IASFKCPKCRHDIPVSY----LKHEQSADSETVLIR 55
M V CP C L DETK PEG + +F CP+C + ++ E + E ++
Sbjct: 1 MKRVFCPKCDNQLAFDETKYPEGKVLAFVCPQCGSQFKIKLGRKIVRTESGQEKE---VK 57
Query: 56 PVRKECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEAT-ICIETEDKEMSRNHLRIE 114
CG++ V+ N +Q PL G +IGR+ ++ + I T D M R H I
Sbjct: 58 EPDFSCGYITVIENVFGFKQELPLVMGDNVIGRRNKDTDGVDVPIITSDPSMGRKHCVIN 117
Query: 115 VKKDSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
VK+D G I+ L D+ S T + K E V + + + IG T
Sbjct: 118 VKEDKEGKLIYTLRDFPSLTGTFLCSVLVGKKEQVRVGEGAIVTIGAT 165
>gi|60681863|ref|YP_212007.1| hypothetical protein BF2383 [Bacteroides fragilis NCTC 9343]
gi|60493297|emb|CAH08081.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 177
Score = 64.7 bits (156), Expect = 2e-09, Method: Composition-based stats.
Identities = 53/165 (32%), Positives = 76/165 (46%), Gaps = 3/165 (1%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEGIA-SFKCPKCRHDIPVSYLKHEQSAD-SETVLIRPV- 57
M + CP C+ L DETK EG + F C C + K + A E L V
Sbjct: 1 MKRILCPKCENYLSFDETKYSEGQSLVFVCEHCGKQFSIRLGKSKMKAPRKEEKLDEDVY 60
Query: 58 RKECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNHLRIEVKK 117
++E G ++V+ N +Q+ PL EG IIGR+ ++ IET D M R H I VK+
Sbjct: 61 KEEFGCIVVIENVFGFKQVLPLQEGDNIIGRRCVGTDINTPIETGDMSMDRRHCIINVKR 120
Query: 118 DSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
+ +G ++ L D S T L + + + D I IG T
Sbjct: 121 NRQGELVYTLRDAPSLTGTFLMNEILGDKDRIRIDDGAIITIGAT 165
>gi|34540671|ref|NP_905150.1| FHA domain protein [Porphyromonas gingivalis W83]
gi|34396985|gb|AAQ66049.1| FHA domain protein [Porphyromonas gingivalis W83]
Length = 190
Score = 63.2 bits (152), Expect = 5e-09, Method: Composition-based stats.
Identities = 55/174 (31%), Positives = 79/174 (45%), Gaps = 14/174 (8%)
Query: 1 MIPVKCPHCKVGLKVDETKI----PEGIASFKCPKCRHDIPVSYL-KHEQSADSETVLIR 55
M V CP C + + + P G S CP C H + + + K + S + +T
Sbjct: 1 MKRVFCPKCDGAITISRETLSQASPNGSLSLLCPHCCHQLRIRLVSKSDNSHNHKTEQEE 60
Query: 56 PVR-----KECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNH 110
+ G+++V+ N Q F L EG IGR+ S I + T D M R+H
Sbjct: 61 QELAQELDRSRGYIVVLENIFGYRQEFALREGDNGIGRRNKDSVVDIAVLTGDPSMGRHH 120
Query: 111 --LRIEVKKDSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
LR+ KKD GS+++ ++D S T GR LDK E L D D I +G T
Sbjct: 121 CILRVTPKKD--GSFLYSIADDDSLVGTFVGGRLLDKKEWCRLNDGDVITMGAT 172
>gi|53713583|ref|YP_099575.1| hypothetical protein BF2294 [Bacteroides fragilis YCH46]
gi|52216448|dbj|BAD49041.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 177
Score = 61.2 bits (147), Expect = 2e-08, Method: Composition-based stats.
Identities = 52/165 (31%), Positives = 75/165 (45%), Gaps = 3/165 (1%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEGIA-SFKCPKCRHDIPVSYLKHEQSAD-SETVLIRPV- 57
M + CP C+ L DETK EG + F C C + K + A E L V
Sbjct: 1 MKRILCPKCENYLSFDETKYSEGQSLVFVCEHCGKQFSIRLGKSKMKAPRKEEKLDEDVY 60
Query: 58 RKECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNHLRIEVKK 117
++E G ++V+ N +Q+ PL EG IIGR+ ++ IET D M R H I VK+
Sbjct: 61 KEEFGCIVVIENVFGFKQVLPLQEGDNIIGRRCVGTDINTPIETGDMSMDRRHCIINVKR 120
Query: 118 DSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
+ + ++ L D S T L + + + D I IG T
Sbjct: 121 NRQRELVYTLRDAPSLTGTFLMNEILGDKDRIRIDDGAIITIGAT 165
>gi|156110313|gb|EDO12058.1| hypothetical protein BACOVA_02561 [Bacteroides ovatus ATCC 8483]
Length = 177
Score = 61.2 bits (147), Expect = 2e-08, Method: Composition-based stats.
Identities = 49/167 (29%), Positives = 77/167 (46%), Gaps = 5/167 (2%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEGIA-SFKCPKCRHDIPV----SYLKHEQSADSETVLIR 55
M V CP C+ L DETK EG + F+C C + S +K + ++
Sbjct: 1 MKRVLCPKCENYLYFDETKYSEGQSLVFECEHCGKQFSIRLGKSKVKALRKEENLEEEAE 60
Query: 56 PVRKECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNHLRIEV 115
++E G++ V+ N +QL PL EG +IGR+ + IE+ D M R H I +
Sbjct: 61 SHKEEFGYITVIENVFGFKQLLPLQEGDNVIGRRCVGTNINTPIESGDMSMDRRHCIINI 120
Query: 116 KKDSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
K++ +G ++ L D S T L + V ++D + IG T
Sbjct: 121 KRNKQGKLVYTLRDAPSLTGTFLMNEILGDKDRVRIEDGAIVTIGAT 167
>gi|29346236|ref|NP_809739.1| hypothetical protein BT_0826 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29338131|gb|AAO75933.1| putative FHA domain protein [Bacteroides thetaiotaomicron VPI-5482]
Length = 177
Score = 59.7 bits (143), Expect = 7e-08, Method: Composition-based stats.
Identities = 49/167 (29%), Positives = 77/167 (46%), Gaps = 5/167 (2%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEGIA-SFKCPKCRHDIPV----SYLKHEQSADSETVLIR 55
M V CP C+ L DETK EG + F+C C + S +K + ++
Sbjct: 1 MKRVLCPKCENYLFFDETKYSEGQSLVFECEHCGKQFSIRLGKSKVKALRKEENLEEEAE 60
Query: 56 PVRKECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNHLRIEV 115
++E G++ V+ N +Q+ PL EG +IGR+ + IE+ D M R H I V
Sbjct: 61 VHKEEFGYITVIENVFGFKQVLPLQEGDNVIGRRCVGTVINTPIESGDMSMDRRHCIINV 120
Query: 116 KKDSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
K++ +G ++ L D S T L + V ++D + IG T
Sbjct: 121 KRNKQGKLVYTLRDAPSLTGTFLMNEILGDKDRVRMEDGAIVTIGAT 167
>gi|153806364|ref|ZP_01959032.1| hypothetical protein BACCAC_00625 [Bacteroides caccae ATCC 43185]
gi|149131041|gb|EDM22247.1| hypothetical protein BACCAC_00625 [Bacteroides caccae ATCC 43185]
Length = 176
Score = 59.7 bits (143), Expect = 7e-08, Method: Composition-based stats.
Identities = 48/167 (28%), Positives = 77/167 (46%), Gaps = 5/167 (2%)
Query: 1 MIPVKCPHCKVGLKVDETKIPEGIA-SFKCPKCRHDIPV----SYLKHEQSADSETVLIR 55
M V CP C+ L DETK EG + F+C C + S +K + ++
Sbjct: 1 MKRVLCPKCENYLYFDETKYSEGQSLVFECEHCGKQFSIRLGKSKVKALRKEENLEEEAE 60
Query: 56 PVRKECGWLIVVANPDTPEQLFPLCEGIFIIGRKASVSEATICIETEDKEMSRNHLRIEV 115
++E G++ V+ N +Q+ PL EG +IGR+ + IE+ D M R H I +
Sbjct: 61 VHKEEFGYITVIENVFGFKQILPLQEGDNVIGRRCVGTVINTPIESGDMSMDRRHCIINI 120
Query: 116 KKDSRGSYIHMLSDYRSKNHTQYNGRYLDKDEVVVLKDNDEIKIGHT 162
K++ +G I+ L D S T L + + ++D + IG T
Sbjct: 121 KRNKQGKLIYTLRDAPSLTGTFLMNEILGDKDRIRIEDGAIVTIGAT 167
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.319 0.137 0.417
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 641,672,957
Number of Sequences: 5470121
Number of extensions: 24800907
Number of successful extensions: 59715
Number of sequences better than 1.0e-05: 12
Number of HSP's better than 0.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 59694
Number of HSP's gapped (non-prelim): 12
length of query: 168
length of database: 1,894,087,724
effective HSP length: 123
effective length of query: 45
effective length of database: 1,221,262,841
effective search space: 54956827845
effective search space used: 54956827845
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 124 (52.4 bits)