WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'D07A06_A18_01.ab1' (465 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 4 Sequences : less than 4 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 872 207 |=================================================== 6310 665 131 |================================ 3980 534 129 |================================ 2510 405 132 |================================= 1580 273 86 |===================== 1000 187 65 |================ 631 122 32 |======== 398 90 22 |===== 251 68 21 |===== 158 47 21 |===== 100 26 10 |== 63.1 16 1 |: 39.8 15 3 |: 25.1 12 0 | 15.8 12 2 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 10 <<<<<<<<<<<<<<<<< 10.0 10 0 | 6.31 10 2 |: 3.98 8 0 | 2.51 8 2 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|1173351|sp|P42552|S1FA_SPIOLDNA BINDING PROTEIN S1... +2 179 8.0e-13 1 gi|4371289|gb|AAD18147.1|(AC006260) unknown protein [... +2 168 1.2e-11 1 gi|1173348|sp|P42551|S1FA_ARATHDNA BINDING PROTEIN S1FA +2 165 2.4e-11 1 gi|11357802|pir||T45877hypothetical protein F4P12.70 ... +2 165 2.4e-11 1 gi|1173349|sp|P42554|S1FA_MAIZEDNA BINDING PROTEIN S1FA +2 150 9.5e-10 1 gi|1173350|sp|P42553|S1FA_ORYSADNA BINDING PROTEIN S1FA +2 149 1.2e-09 1 gi|131035|sp|P08433|PRT1_SCYCAPROTAMINE Z1 (SCYLLIORH... +1 41 0.86 2 gi|225668|prf||1310226Ascyliorhinine Z1 [Scyliorhinus... +1 41 0.86 2 gi|1899127|gb|AAC57008.1|(U86779) pol protein [Human ... -1 61 0.99 1 gi|1041721|gb|AAC49087.1|(U32307) Ras1p [Saccharomyce... -1 60 0.997 1
Use the and icons to retrieve links to Entrez:
>gi|1173351|sp|P42552|S1FA_SPIOL DNA BINDING PROTEIN S1FA >gi|629495|pir||S47063 s1Fa protein - spinach >gi|1361972|pir||S54730 s1Fa protein - spinach >gi|498705|emb|CAA56077.1| (X79543) s1Fa [Spinacia oleracea] Length = 70 Frame 3 hits (HSPs): __________________ Frame 2 hits (HSPs): _________________________________________ __________________________________________________ Database sequence: | | | | | 70 0 20 40 60 Plus Strand HSPs: Score = 179 (63.0 bits), Expect = 8.0e-13, P = 8.0e-13 Identities = 33/57 (57%), Positives = 45/57 (78%), Frame = +2 Query: 122 SKGFNPALIVLLLVGGLLLTFLIGNYVLYTYAQKTLPPXKKEASLKEEDEKGETEAG 292 +KG NP LIVLL++GGLLLTFL+GN++LYTYAQK LPP KK+ K++ ++ + G Sbjct: 8 AKGLNPGLIVLLVIGGLLLTFLVGNFILYTYAQKNLPPKKKKPISKKKMKRERLKQG 64 Score = 108 (38.0 bits), Expect = 2.7e-05, P = 2.7e-05 Identities = 20/24 (83%), Positives = 23/24 (95%), Frame = +3 Query: 240 KKKPVSKKKMKKERLKQGVSAPGE 311 KKKP+SKKKMK+ERLKQGV+ PGE Sbjct: 47 KKKPISKKKMKRERLKQGVAPPGE 70 >gi|4371289|gb|AAD18147.1| (AC006260) unknown protein [Arabidopsis thaliana] Length = 76 Frame 3 hits (HSPs): _____________________________________ Frame 2 hits (HSPs): ______________________________________ __________________________________________________ Database sequence: | | | | | 76 0 20 40 60 Plus Strand HSPs: Score = 168 (59.1 bits), Expect = 1.2e-11, P = 1.2e-11 Identities = 32/57 (56%), Positives = 44/57 (77%), Frame = +2 Query: 122 SKGFNPALIVLLLVGGLLLTFLIGNYVLYTYAQKTLPPXKKEASLKEEDEKGETEAG 292 +KG NP LIVLL++GGLL+TFLI NYV+Y YAQK LPP KK+ K++ ++ + + G Sbjct: 14 AKGLNPGLIVLLVIGGLLVTFLIANYVMYMYAQKNLPPRKKKPLSKKKLKREKLKQG 70 Score = 105 (37.0 bits), Expect = 5.6e-05, P = 5.6e-05 Identities = 25/56 (44%), Positives = 36/56 (64%), Frame = +3 Query: 147 LFSCLLVGCC*HSSLETMYSTHMHRRPSLL-XKKKPVSKKKMKKERLKQGVSAPGE 311 L L++G + L Y +M+ + +L KKKP+SKKK+K+E+LKQGV PGE Sbjct: 21 LIVLLVIGGLLVTFLIANYVMYMYAQKNLPPRKKKPLSKKKLKREKLKQGVPVPGE 76 >gi|1173348|sp|P42551|S1FA_ARATH DNA BINDING PROTEIN S1FA Length = 76 Frame 3 hits (HSPs): ________________ Frame 2 hits (HSPs): _______________________________________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | | 76 0 20 40 60 __________________ Annotated Domains: DOMO DM04705: 1..75 PRODOM PD019013: S1FA(2) Q42337(1) Q9ZQC9(1) 8..48 PRODOM PD026675: S1FA(2) 50..75 __________________ Plus Strand HSPs: Score = 165 (58.1 bits), Expect = 2.4e-11, P = 2.4e-11 Identities = 34/59 (57%), Positives = 43/59 (72%), Frame = +2 Query: 116 AGSKGFNPALIVLLLVGGLLLTFLIGNYVLYTYAQKTLPPXKKEASLKEEDEKGETEAG 292 A +KG NP LIVLL+VGG LL FLI NYVLY YAQK LPP KK+ K++ ++ + + G Sbjct: 12 AEAKGLNPGLIVLLVVGGPLLVFLIANYVLYVYAQKNLPPRKKKPVSKKKLKREKLKQG 70 Score = 102 (35.9 bits), Expect = 0.00012, P = 0.00012 Identities = 19/24 (79%), Positives = 22/24 (91%), Frame = +3 Query: 240 KKKPVSKKKMKKERLKQGVSAPGE 311 KKKPVSKKK+K+E+LKQGV PGE Sbjct: 53 KKKPVSKKKLKREKLKQGVPVPGE 76 >gi|11357802|pir||T45877 hypothetical protein F4P12.70 - Arabidopsis thaliana >gi|6729488|emb|CAB67644.1| (AL132966) hypothetical protein [Arabidopsis thaliana] Length = 76 Frame 3 hits (HSPs): ________________ Frame 2 hits (HSPs): _______________________________________ __________________________________________________ Database sequence: | | | | | 76 0 20 40 60 Plus Strand HSPs: Score = 165 (58.1 bits), Expect = 2.4e-11, P = 2.4e-11 Identities = 34/59 (57%), Positives = 43/59 (72%), Frame = +2 Query: 116 AGSKGFNPALIVLLLVGGLLLTFLIGNYVLYTYAQKTLPPXKKEASLKEEDEKGETEAG 292 A +KG NP LIVLL+VGG LL FLI NYVLY YAQK LPP KK+ K++ ++ + + G Sbjct: 12 AEAKGLNPGLIVLLVVGGPLLVFLIANYVLYVYAQKNLPPRKKKPVSKKKLKREKLKQG 70 Score = 102 (35.9 bits), Expect = 0.00012, P = 0.00012 Identities = 19/24 (79%), Positives = 22/24 (91%), Frame = +3 Query: 240 KKKPVSKKKMKKERLKQGVSAPGE 311 KKKPVSKKK+K+E+LKQGV PGE Sbjct: 53 KKKPVSKKKLKREKLKQGVPVPGE 76 >gi|1173349|sp|P42554|S1FA_MAIZE DNA BINDING PROTEIN S1FA Length = 63 Frame 3 hits (HSPs): _______________________________ Frame 2 hits (HSPs): _____________________________________________ __________________________________________________ Database sequence: | | | | | 63 0 20 40 60 Plus Strand HSPs: Score = 150 (52.8 bits), Expect = 9.5e-10, P = 9.5e-10 Identities = 28/57 (49%), Positives = 39/57 (68%), Frame = +2 Query: 122 SKGFNPALIVLLLVGGLLLTFLIGNYVLYTYAQKTLPPXKKEASLKEEDEKGETEAG 292 +KG NP ++V L+V LL F +GNY LY YAQKTLPP KK+ K++ +K + + G Sbjct: 1 NKGLNPGMVVXLVVASFLLIFFVGNYALYXYAQKTLPPKKKKPVSKKKLKKEKLKQG 57 Score = 115 (40.5 bits), Expect = 4.8e-06, P = 4.8e-06 Identities = 24/38 (63%), Positives = 31/38 (81%), Frame = +3 Query: 201 YSTHMHRRPSLL-XKKKPVSKKKMKKERLKQGVSAPGE 311 Y+ + + + +L KKKPVSKKK+KKE+LKQGVSAPGE Sbjct: 26 YALYXYAQKTLPPKKKKPVSKKKLKKEKLKQGVSAPGE 63 >gi|1173350|sp|P42553|S1FA_ORYSA DNA BINDING PROTEIN S1FA Length = 76 Frame 3 hits (HSPs): _________________________ Frame 2 hits (HSPs): ______________________________________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | | 76 0 20 40 60 __________________ Annotated Domains: DOMO DM04705: 1..75 PRODOM PD019013: S1FA(2) Q42337(1) Q9ZQC9(1) 15..48 PRODOM PD026675: S1FA(2) 50..75 __________________ Plus Strand HSPs: Score = 149 (52.5 bits), Expect = 1.2e-09, P = 1.2e-09 Identities = 29/57 (50%), Positives = 40/57 (70%), Frame = +2 Query: 122 SKGFNPALIVLLLVGGLLLTFLIGNYVLYTYAQKTLPPXKKEASLKEEDEKGETEAG 292 +KG NP IVLL+V LL+ F +GNY LY YAQKTLPP KK+ K++ ++ + + G Sbjct: 14 NKGLNPGTIVLLVVATLLILFFVGNYALYMYAQKTLPPRKKKPVSKKKLKREKLKQG 70 Score = 118 (41.5 bits), Expect = 2.3e-06, P = 2.3e-06 Identities = 24/38 (63%), Positives = 32/38 (84%), Frame = +3 Query: 201 YSTHMHRRPSLL-XKKKPVSKKKMKKERLKQGVSAPGE 311 Y+ +M+ + +L KKKPVSKKK+K+E+LKQGVSAPGE Sbjct: 39 YALYMYAQKTLPPRKKKPVSKKKLKREKLKQGVSAPGE 76 >gi|131035|sp|P08433|PRT1_SCYCA PROTAMINE Z1 (SCYLLIORHININE Z1) >gi|85457|pir||S00016 protamine Z1 - smaller spotted catshark >gi|64303|emb|CAA29099.1| (X05611) protamine Z1 (AA 1-51) [Scyliorhinus canicula] Length = 51 Frame 1 hits (HSPs): ________________________________________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | 51 0 20 40 __________________ Annotated Domains: PRODOM PD050534: PRT1_SCYCA 1..49 __________________ Plus Strand HSPs: Score = 41 (14.4 bits), Expect = 2.0, Sum P(2) = 0.86 Identities = 7/16 (43%), Positives = 12/16 (75%), Frame = +1 Query: 85 RRQSPSFLRSRGFKRV 132 ++Q+P FLR R +R+ Sbjct: 8 KKQAPCFLRRRHLRRL 23 Score = 38 (13.4 bits), Expect = 2.0, Sum P(2) = 0.86 Identities = 8/25 (32%), Positives = 13/25 (52%), Frame = +1 Query: 205 LHICTEDPPSXKKRSQSQRRR*KRR 279 L++C D +R + RR K+R Sbjct: 23 LNVCKRDTSKTYRRRRHVRRLPKKR 47 >gi|225668|prf||1310226A scyliorhinine Z1 [Scyliorhinus canicula] Length = 50 Frame 1 hits (HSPs): ________________________________________ __________________________________________________ Database sequence: | | | | 50 0 20 40 Plus Strand HSPs: Score = 41 (14.4 bits), Expect = 2.0, Sum P(2) = 0.86 Identities = 7/16 (43%), Positives = 12/16 (75%), Frame = +1 Query: 85 RRQSPSFLRSRGFKRV 132 ++Q+P FLR R +R+ Sbjct: 7 KKQAPCFLRRRHLRRL 22 Score = 38 (13.4 bits), Expect = 2.0, Sum P(2) = 0.86 Identities = 8/25 (32%), Positives = 13/25 (52%), Frame = +1 Query: 205 LHICTEDPPSXKKRSQSQRRR*KRR 279 L++C D +R + RR K+R Sbjct: 22 LNVCKRDTSKTYRRRRHVRRLPKKR 46 >gi|1899127|gb|AAC57008.1| (U86779) pol protein [Human immunodeficiency virus type 1] Length = 69 Frame -1 hits (HSPs): ___________________________ __________________________________________________ Database sequence: | | | | | 69 0 20 40 60 Minus Strand HSPs: Score = 61 (21.5 bits), Expect = 4.6, P = 0.99 Identities = 14/36 (38%), Positives = 19/36 (52%), Frame = -1 Query: 195 FPMRNVNSNPPTSRRTIKAGLNPFEPARSKEGGTLS 88 FP +N PTSR+ G NP A ++ GTL+ Sbjct: 16 FPTEQARANSPTSRKLQVRGDNPRSEAGAEGQGTLN 51 >gi|1041721|gb|AAC49087.1| (U32307) Ras1p [Saccharomyces cerevisiae] Length = 65 Frame -1 hits (HSPs): ______________________________________________ __________________________________________________ Database sequence: | | | | | 65 0 20 40 60 Minus Strand HSPs: Score = 60 (21.1 bits), Expect = 6.0, P = 1.0 Identities = 14/59 (23%), Positives = 27/59 (45%), Frame = -1 Query: 177 NSNPPTSRRTIKAGLNPFEPARSKEGGTLSANSKSSAMDERRXGTDLRXEESR-CSFLC 4 N+N ++ + N + +R + L++ SK SA ++ + R E S C +C Sbjct: 7 NNNEGNTKYSSNGNGNRSDISRGNQNNALNSRSKQSAEPQKNSSANARKESSGGCCIIC 65 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.95 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.336 0.146 0.467 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.331 0.146 0.470 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.360 0.157 0.582 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.338 0.143 0.453 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.347 0.152 0.514 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.351 0.159 0.554 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 154 141 10. 73 3 12 22 0.12 33 30 0.11 36 +2 0 154 143 10. 73 3 12 22 0.12 33 30 0.11 36 +1 0 155 144 10. 73 3 12 22 0.12 33 30 0.11 36 -1 0 155 143 10. 73 3 12 22 0.12 33 30 0.11 36 -2 0 154 143 10. 73 3 12 22 0.12 33 30 0.11 36 -3 0 154 143 10. 73 3 12 22 0.12 33 30 0.11 36 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 10 No. of states in DFA: 589 (58 KB) Total size of DFA: 165 KB (192 KB) Time to generate neighborhood: 0.01u 0.01s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 138.00u 1.33s 139.33t Elapsed: 00:00:39 Total cpu time: 138.02u 1.36s 139.38t Elapsed: 00:00:39 Start: Wed Jan 16 18:26:25 2002 End: Wed Jan 16 18:27:04 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000