WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= A08F10_CONSENSUS (605 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 2 Sequences : less than 2 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 301 44 |====================== 6310 257 29 |============== 3980 228 76 |====================================== 2510 152 57 |============================ 1580 95 31 |=============== 1000 64 15 |======= 631 49 15 |======= 398 34 12 |====== 251 22 2 |= 158 20 4 |== 100 16 2 |= 63.1 14 3 |= 39.8 11 0 | 25.1 11 1 |: 15.8 10 0 | >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 10 <<<<<<<<<<<<<<<<< 10.0 10 0 | 6.31 10 3 |= 3.98 7 1 |: 2.51 6 0 | 1.58 6 0 | 1.00 6 0 | 0.63 6 0 | 0.40 6 0 | 0.25 6 0 | 0.16 6 0 | 0.10 6 0 | 0.063 6 0 | 0.040 6 0 | 0.025 6 0 | 0.016 6 0 | 0.010 6 2 |= 0.0063 4 0 | 0.0040 4 0 | 0.0025 4 0 | 0.0016 4 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|4689454|gb|AAD27902.1|AC006267_7(AC006267) putativ... -3 101 4.6e-05 2 gi|4325354|gb|AAD17351.1|(AF128395) contains similari... -3 100 0.00015 2 gi|8778340|gb|AAF79348.1|AC007887_7(AC007887) F15O4.1... -3 100 0.00068 2 gi|5764470|gb|AAD51320.1|(AF169187) polymerase [Triti... -3 89 0.0010 2 gi|6598452|gb|AAF18613.1|AC005171_1(AC005171) putativ... -3 96 0.0068 2 gi|6705984|dbj|BAA89466.1|(AB033235) gag-pol polyprot... -3 93 0.0086 2 gi|1030731|emb|CAA32198.1|(X14037) polyprotein [Droso... -3 90 0.97 2 gi|36070|emb|CAA41389.1|(X58480) rlf fusion protein [... +2 68 0.98 1 gi|9791037ref|NP_063764.1| D ORF C; putative [Vaccini... -1 63 0.991 1 gi|85056|pir||S02021micropia polyprotein - fruit fly ... -3 90 0.996 2 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 2 Hits gi|36070 | __________ __________________________________________________ Query sequence: | | | | || 202 0 50 100 150 200 Locus_ID Frame -1 Hits gi|9791037 | ________ __________________________________________________ Query sequence: | | | | || 202 0 50 100 150 200 Locus_ID Frame -2 Hits gi|4689454 | ______________ gi|4325354 | _ gi|8778340 | _ gi|5764470 | ________ gi|6598452 | ____________ gi|6705984 | ______________ gi|1030731 | _____________ gi|85056 | _____________ Prosite Hits: ____ __________________________________________________ Query sequence: | | | | || 202 0 50 100 150 200 __________________ Prosite hits: PROKAR_LIPOPROTEIN Prokaryotic membrane lipoprotein lipid a 149..159 __________________ Locus_ID Frame -3 Hits gi|4689454 | gi|4325354 | _ gi|8778340 | _ gi|5764470 | _ gi|6598452 | _ gi|6705984 | _ gi|1030731 | gi|85056 | __________________________________________________ Query sequence: | | | | || 202 0 50 100 150 200
Use the and icons to retrieve links to Entrez:
>gi|4689454|gb|AAD27902.1|AC006267_7 (AC006267) putative polyprotein [Arabidopsis thaliana] >gi|7267450|emb|CAB81146.1| (AL161510) putative polyprotein [Arabidopsis thaliana] Length = 1054 Frame -2 hits (HSPs): ____ Frame -3 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | | | || 1054 0 150 300 450 600 750 900 1050 Minus Strand HSPs: Score = 101 (35.6 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05 Identities = 22/64 (34%), Positives = 39/64 (60%), Frame = -3 Query: 276 ELYALFQAFKH*HHYLFHREFVLFTDHDAL*YXNW*PEKSFCMS*ASWHAYLQ*FTFVIE 97 ELYAL +A + HYL+ ++FV+ TDH++L + ++ A W +++ F +VI+ Sbjct: 780 ELYALVRALQTWQHYLWPKKFVIHTDHESLKHLKG--QQKLNKRHARWVEFIETFPYVIK 837 Query: 96 HKSR 85 +K R Sbjct: 838 YKKR 841 Score = 75 (26.4 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05 Identities = 21/57 (36%), Positives = 27/57 (47%), Frame = -2 Query: 412 FXSHLELHYXAXKFGIGAVLSQSNRPIAX*QETFW-AKLXYSTYACGTLCPISSIQT 245 F E+ A GIGAVL Q ++PIA E A L Y TY + ++QT Sbjct: 734 FMKSFEIECDASGLGIGAVLMQDHKPIAYFSEKLGGATLNYPTYDKELYALVRALQT 790 >gi|4325354|gb|AAD17351.1| (AF128395) contains similarity to retrovirus-related polyproteins and to CCHC zinc finger protein (Pfam: PF00098, Score=16.3, E=0.051, E= 1) [Arabidopsis thaliana] >gi|7267432|emb|CAB77944.1| (AL161508) putative polyprotein [Arabidopsis thaliana] Length = 1138 Frame -2 hits (HSPs): ___ Frame -3 hits (HSPs): ___ __________________________________________________ Database sequence: | | | | | | | | | 1138 0 150 300 450 600 750 900 1050 Minus Strand HSPs: Score = 100 (35.2 bits), Expect = 0.00015, Sum P(2) = 0.00015 Identities = 22/62 (35%), Positives = 38/62 (61%), Frame = -3 Query: 276 ELYALFQAFKH*HHYLFHREFVLFTDHDAL*YXNW*PEKSFCMS*ASWHAYLQ*FTFVIE 97 ELYAL +A + HYL+ +EFV+ TDH++L + ++ A W +++ F +VI+ Sbjct: 667 ELYALVRALQTWQHYLWPKEFVIHTDHESLKHLKG--QQKLNKRHARWVEFIETFPYVIK 724 Query: 96 HK 91 +K Sbjct: 725 YK 726 Score = 72 (25.3 bits), Expect = 0.00015, Sum P(2) = 0.00015 Identities = 22/61 (36%), Positives = 28/61 (45%), Frame = -2 Query: 424 SMHRFXSHLELHYXAXKFGIGAVLSQSNRPIAX*QETFW-AKLXYSTYACGTLCPISSIQ 248 S+ F E+ A GIGAVL Q +PIA E A L Y TY + ++Q Sbjct: 617 SLPDFLKTFEIECDAPGVGIGAVLMQDKKPIAYFSEKLGGATLNYPTYDKELYALVRALQ 676 Query: 247 T 245 T Sbjct: 677 T 677 >gi|8778340|gb|AAF79348.1|AC007887_7 (AC007887) F15O4.13 [Arabidopsis thaliana] Length = 1887 Frame -2 hits (HSPs): ___ Frame -3 hits (HSPs): __ __________________________________________________ Database sequence: | | | | | 1887 0 500 1000 1500 Minus Strand HSPs: Score = 100 (35.2 bits), Expect = 0.00068, Sum P(2) = 0.00068 Identities = 22/62 (35%), Positives = 38/62 (61%), Frame = -3 Query: 276 ELYALFQAFKH*HHYLFHREFVLFTDHDAL*YXNW*PEKSFCMS*ASWHAYLQ*FTFVIE 97 ELYAL +A + HYL+ +EFV+ TDH++L + ++ A W +++ F +VI+ Sbjct: 1248 ELYALVRALQTGQHYLWPKEFVIHTDHESLKHLKG--QQKLNKRHARWVEFIETFPYVIK 1305 Query: 96 HK 91 +K Sbjct: 1306 YK 1307 Score = 70 (24.6 bits), Expect = 0.00068, Sum P(2) = 0.00068 Identities = 21/61 (34%), Positives = 27/61 (44%), Frame = -2 Query: 424 SMHRFXSHLELHYXAXKFGIGAVLSQSNRPIAX*QETFW-AKLXYSTYACGTLCPISSIQ 248 S+ F E+ A GIG VL Q +PIA E A L Y TY + ++Q Sbjct: 1198 SLPDFLKTFEIECDASGVGIGVVLMQDKKPIAYFSEKLGGATLNYPTYDKELYALVRALQ 1257 Query: 247 T 245 T Sbjct: 1258 T 1258 >gi|5764470|gb|AAD51320.1| (AF169187) polymerase [Triticum aestivum] Length = 235 Frame -2 hits (HSPs): _______ Frame -3 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | | | 235 0 50 100 150 200 Minus Strand HSPs: Score = 89 (31.3 bits), Expect = 0.0010, Sum P(2) = 0.0010 Identities = 20/61 (32%), Positives = 31/61 (50%), Frame = -3 Query: 276 ELYALFQAFKH*HHYLFHREFVLFTDHDAL*YXNW*PEKSFCMS*ASWHAYLQ*FTFVIE 97 ELYAL + HYL EF++ TDH+ L Y + A W +++ F +VI+ Sbjct: 39 ELYALVRVLHEWEHYLRPHEFIIHTDHETLKYLKG--QTKLNKRHAKWSEFIESFPYVIK 96 Query: 96 H 94 + Sbjct: 97 Y 97 Score = 59 (20.8 bits), Expect = 0.0010, Sum P(2) = 0.0010 Identities = 15/30 (50%), Positives = 18/30 (60%), Frame = -2 Query: 370 GIGAVLSQSNRPIAX*QETF-WAKLXYSTY 284 GIG VL+Q RPIA +E A+L Y Y Sbjct: 7 GIGGVLTQEKRPIAYFREKLSGAQLNYPIY 36 >gi|6598452|gb|AAF18613.1|AC005171_1 (AC005171) putative retroelement integrase [Arabidopsis thaliana] Length = 1215 Frame -2 hits (HSPs): __ Frame -3 hits (HSPs): ___ __________________________________________________ Database sequence: | | | | | | | | || 1215 0 150 300 450 600 750 900 1050 1200 Minus Strand HSPs: Score = 96 (33.8 bits), Expect = 0.0068, Sum P(2) = 0.0068 Identities = 21/62 (33%), Positives = 37/62 (59%), Frame = -3 Query: 276 ELYALFQAFKH*HHYLFHREFVLFTDHDAL*YXNW*PEKSFCMS*ASWHAYLQ*FTFVIE 97 ELYAL +A + HYL+ + FV+ TDH++L + ++ A W +++ F +VI+ Sbjct: 757 ELYALVRALQRWQHYLWPKVFVIHTDHESLKHLKG--QQKLNKRHARWVEFIETFAYVIK 814 Query: 96 HK 91 +K Sbjct: 815 YK 816 Score = 61 (21.5 bits), Expect = 0.0068, Sum P(2) = 0.0068 Identities = 18/47 (38%), Positives = 21/47 (44%), Frame = -2 Query: 421 MHRFXSHLELHYXAXKFGIGAVLSQSNRPIAX*QETFW-AKLXYSTY 284 + F E+ A GIGAVL Q + IA E A L Y TY Sbjct: 708 LSEFLKTFEIECDASGIGIGAVLMQDQKLIAFFSEKLGGATLNYPTY 754 >gi|6705984|dbj|BAA89466.1| (AB033235) gag-pol polyprotein [Oryza sativa] Length = 1587 Frame -2 hits (HSPs): ___ Frame -3 hits (HSPs): __ __________________________________________________ Database sequence: | | | | | 1587 0 500 1000 1500 Minus Strand HSPs: Score = 93 (32.7 bits), Expect = 0.0086, Sum P(2) = 0.0086 Identities = 20/62 (32%), Positives = 36/62 (58%), Frame = -3 Query: 276 ELYALFQAFKH*HHYLFHREFVLFTDHDAL*YXNW*PEKSFCMS*ASWHAYLQ*FTFVIE 97 EL+AL + + HYL+ +EFV+ +DH++L + + A W +++ F +VI+ Sbjct: 1084 ELFALVRTLETWQHYLWPKEFVIHSDHESLKHIR--SQAKHNRRHAKWVEFIESFPYVIK 1141 Query: 96 HK 91 HK Sbjct: 1142 HK 1143 Score = 66 (23.2 bits), Expect = 0.0086, Sum P(2) = 0.0086 Identities = 18/57 (31%), Positives = 25/57 (43%), Frame = -2 Query: 412 FXSHLELHYXAXKFGIGAVLSQSNRPIAX*QETFWA-KLXYSTYACGTLCPISSIQT 245 F EL A G+G VL Q +P+A E L YSTY + +++T Sbjct: 1038 FNKTFELECDASGIGLGGVLLQEGKPVAYFSEKLSGPSLNYSTYDKELFALVRTLET 1094 >gi|1030731|emb|CAA32198.1| (X14037) polyprotein [Drosophila melanogaster] Length = 1053 Frame -2 hits (HSPs): ___ Frame -3 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | | | || 1053 0 150 300 450 600 750 900 1050 Minus Strand HSPs: Score = 90 (31.7 bits), Expect = 3.6, Sum P(2) = 0.97 Identities = 23/63 (36%), Positives = 34/63 (53%), Frame = -3 Query: 279 VELYALFQAFKH*HHYLFHREFVLFTDHDAL*YXNW*PEKSFCMS*ASWHAYLQ*FTFVI 100 +E A+ +A KH HYL REFV++TD ++L + W AYLQ F F I Sbjct: 787 LETLAVVKAVKHFRHYLIGREFVVYTDCNSLKASR--TKIDLTPRVHRWWAYLQSFNFEI 844 Query: 99 EHK 91 +++ Sbjct: 845 QYR 847 Score = 40 (14.1 bits), Expect = 3.6, Sum P(2) = 0.97 Identities = 15/56 (26%), Positives = 27/56 (48%), Frame = -2 Query: 400 LELHYXAXKFGIGAVLSQ--SNRP--IAX*QETFWA-KLXYSTYACGTLCPISSIQ 248 +ELH A G GA+L ++P I +T + + Y +Y TL + +++ Sbjct: 742 IELHTDASACGYGAILLHRIESKPHVIEYFSKTTTSVESRYHSYELETLAVVKAVK 797 >gi|36070|emb|CAA41389.1| (X58480) rlf fusion protein [Homo sapiens] Length = 96 Frame 2 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | | | 96 0 20 40 60 80 Plus Strand HSPs: Score = 68 (23.9 bits), Expect = 4.0, P = 0.98 Identities = 14/37 (37%), Positives = 23/37 (62%), Frame = +2 Query: 77 IRSLDLCSMTNVNYCRYACQEAHDMQKDFSGYQLXYY 187 +R ++ ++++NYCR CQ A DM D+ YQ +Y Sbjct: 60 LREQEVSEVSSLNYCRSFCQGA-DM--DYDSYQHYFY 93 >gi|9791037 ref|NP_063764.1| D ORF C; putative [Vaccinia virus] >gi|141527|sp|P20552|YVDC_VACCC HYPOTHETICAL 7.3 KD PROTEIN >gi|93364|pir||F42516 D-ORF-C protein - vaccinia virus (strain Copenhagen) >gi|335449|gb|AAA48101.1| (M35027) D ORF C; putative [Vaccinia virus] Length = 69 Frame -1 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | | 69 0 20 40 60 Minus Strand HSPs: Score = 63 (22.2 bits), Expect = 4.7, P = 0.99 Identities = 14/30 (46%), Positives = 22/30 (73%), Frame = -1 Query: 170 SLKNLSACHE-LLGMHIYNSSHLS*SINLG 84 SL+ ++ H LL ++++NSSHL+ S NLG Sbjct: 12 SLRLSNSVHSGLLSIYLFNSSHLNASTNLG 41 >gi|85056|pir||S02021 micropia polyprotein - fruit fly (Drosophila melanogaster) (fragment) Length = 1291 Frame -2 hits (HSPs): ___ Frame -3 hits (HSPs): ___ __________________________________________________ Database sequence: | | | | 1291 0 500 1000 Minus Strand HSPs: Score = 90 (31.7 bits), Expect = 5.5, Sum P(2) = 1.0 Identities = 23/63 (36%), Positives = 34/63 (53%), Frame = -3 Query: 279 VELYALFQAFKH*HHYLFHREFVLFTDHDAL*YXNW*PEKSFCMS*ASWHAYLQ*FTFVI 100 +E A+ +A KH HYL REFV++TD ++L + W AYLQ F F I Sbjct: 787 LETLAVVKAVKHFRHYLIGREFVVYTDCNSLKASR--TKIDLTPRVHRWWAYLQSFNFEI 844 Query: 99 EHK 91 +++ Sbjct: 845 QYR 847 Score = 40 (14.1 bits), Expect = 5.5, Sum P(2) = 1.0 Identities = 15/56 (26%), Positives = 27/56 (48%), Frame = -2 Query: 400 LELHYXAXKFGIGAVLSQ--SNRP--IAX*QETFWA-KLXYSTYACGTLCPISSIQ 248 +ELH A G GA+L ++P I +T + + Y +Y TL + +++ Sbjct: 742 IELHTDASACGYGAILLHRIESKPHVIEYFSKTTTSVESRYHSYELETLAVVKAVK 797 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.94 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.348 0.152 0.527 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.348 0.151 0.559 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.348 0.159 0.527 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.352 0.157 0.586 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.364 0.163 0.611 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.356 0.159 0.613 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 201 187 10. 76 3 12 22 0.12 34 31 0.12 37 +2 0 201 189 10. 76 3 12 22 0.12 34 31 0.12 37 +1 0 201 186 10. 76 3 12 22 0.12 34 31 0.12 37 -1 0 201 187 10. 76 3 12 22 0.12 34 31 0.12 37 -2 0 201 187 10. 76 3 12 22 0.12 34 31 0.12 37 -3 0 201 187 10. 76 3 12 22 0.12 34 31 0.12 37 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 10 No. of states in DFA: 598 (59 KB) Total size of DFA: 216 KB (256 KB) Time to generate neighborhood: 0.01u 0.01s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 172.14u 1.10s 173.24t Elapsed: 00:00:31 Total cpu time: 172.16u 1.15s 173.31t Elapsed: 00:00:31 Start: Mon Oct 1 20:59:43 2001 End: Mon Oct 1 21:00:14 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000