WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker Server unavailable.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= B08G12.seq(1>516) (487 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 4 Sequences : less than 4 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 996 220 |======================================================= 6310 776 169 |========================================== 3980 607 133 |================================= 2510 474 136 |================================== 1580 338 105 |========================== 1000 233 60 |=============== 631 173 53 |============= 398 120 38 |========= 251 82 24 |====== 158 58 14 |=== 100 44 10 |== 63.1 34 8 |== 39.8 26 7 |= 25.1 19 5 |= 15.8 14 5 |= >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 9 <<<<<<<<<<<<<<<<< 10.0 9 3 |: 6.31 6 3 |: 3.98 3 1 |: 2.51 2 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|9758555|dbj|BAB09056.1|(AB006703) gb|AAD20086.1~ge... +2 128 1.7e-06 1 gi|10697011|emb|CAC12722.1|(AL390738) bA438F9.2 (nove... +2 66 0.80 1 gi|10177321|dbj|BAB10647.1|(AB008269) contains simila... +2 85 0.97 1 gi|1174321|gb|AAD13517.1|pregastric lipase, LPGL {N-t... +3 62 0.99 1 gi|2136879|pir||S59904lipase, pregastric - sheep (fra... +3 62 0.99 1 gi|3859669|emb|CAA22007.1|(AL033502) hypothetical pro... +2 81 0.99 1 gi|8778814|gb|AAF79819.1|AC007396_20(AC007396) T4O12.... +2 81 0.9996 1 gi|912722|gb|AAA82849.1|(U30776) envelope glycoprotei... +3 56 0.99993 2 gi|912725|gb|AAA82851.1|(U30777) envelope glycoprotei... +3 56 0.99993 2
Use the and icons to retrieve links to Entrez:
>gi|9758555|dbj|BAB09056.1| (AB006703) gb|AAD20086.1~gene_id:MRH10.6~similar to unknown protein [Arabidopsis thaliana] Length = 450 Frame 2 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | 450 0 150 300 Plus Strand HSPs: Score = 128 (45.1 bits), Expect = 1.7e-06, P = 1.7e-06 Identities = 26/42 (61%), Positives = 31/42 (73%), Frame = +2 Query: 5 GRGSYQSDAPRGRFNSRNFGRGNGQDGGDRDYNKSKGNGFYR 130 GRG Y ++APRGRF R GRGN QDGGD Y + +GNG+YR Sbjct: 409 GRGGYPTEAPRGRFGGRGSGRGN-QDGGD--Y-RPRGNGYYR 446 >gi|10697011|emb|CAC12722.1| (AL390738) bA438F9.2 (novel protein similar to heterogeneous nuclear ribonucleoprotein A1 (HNRPA1)) [Homo sapiens] Length = 70 Frame 2 hits (HSPs): ______________________________________ __________________________________________________ Database sequence: | | | | | 70 0 20 40 60 Plus Strand HSPs: Score = 66 (23.2 bits), Expect = 1.6, P = 0.80 Identities = 20/53 (37%), Positives = 25/53 (47%), Frame = +2 Query: 5 GRGSYQSDAPRGRFN--SRNFGRGNGQDGGDRDYNKSKGNGFYRPSPHQERGHSG 163 G GSY +D G +N S NFG G + G R G G Y P + G+ G Sbjct: 6 GSGSY-NDF--GNYNNQSSNFGPMKGGNFGGRSSGPCGGGGQYFAKPRNQGGYGG 57 >gi|10177321|dbj|BAB10647.1| (AB008269) contains similarity to RNA-binding protein~gene_id:MSL3.12 [Arabidopsis thaliana] Length = 460 Frame 2 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | 460 0 150 300 450 Plus Strand HSPs: Score = 85 (29.9 bits), Expect = 3.4, P = 0.97 Identities = 25/60 (41%), Positives = 32/60 (53%), Frame = +2 Query: 8 RGSYQSDAPRGRFNSRNF--GRGNGQDGGDRDYNKSKGNGFYRP-SPHQERGHSGHHQVP 178 RG Y S RG F + +F GRG G GG Y + G RP S + G G+ +VP Sbjct: 382 RGRYFSG--RGSFRNESFKGGRGGGGRGG---YGRGGGEFSGRPKSSNPRNGGEGYQRVP 436 Query: 179 RNG 187 +NG Sbjct: 437 QNG 439 >gi|1174321|gb|AAD13517.1| pregastric lipase, LPGL {N-terminal} [lambs, pharyngeal organs, Peptide Partial, 35 aa, segment 1 of 2] Length = 35 Frame 3 hits (HSPs): ______________________________ __________________________________________________ Database sequence: | | | 35 0 20 Plus Strand HSPs: Score = 62 (21.8 bits), Expect = 4.5, P = 0.99 Identities = 10/21 (47%), Positives = 16/21 (76%), Frame = +3 Query: 183 MGRILLSPESTKNISQRIQFW 245 +G+I +PE++ N+SQ I FW Sbjct: 2 LGKIAENPEASMNVSQMISFW 22 >gi|2136879|pir||S59904 lipase, pregastric - sheep (fragments) Length = 62 Frame 3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | 62 0 20 40 60 Plus Strand HSPs: Score = 62 (21.8 bits), Expect = 4.5, P = 0.99 Identities = 10/21 (47%), Positives = 16/21 (76%), Frame = +3 Query: 183 MGRILLSPESTKNISQRIQFW 245 +G+I +PE++ N+SQ I FW Sbjct: 2 LGKIAENPEASMNVSQMISFW 22 >gi|3859669|emb|CAA22007.1| (AL033502) hypothetical protein [Candida albicans] Length = 263 Frame 2 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | | | | 263 0 50 100 150 200 250 Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 4.6, P = 0.99 Identities = 22/55 (40%), Positives = 31/55 (56%), Frame = +2 Query: 5 GRGSYQSDAPRGRFNS-RNFGRGNGQDGGDRDYNKSKG--NGFYRPSPHQERGHSGH 166 GRG Y+ D RG + R+ GRG G GG RD + G +G YR +++ G G+ Sbjct: 147 GRGGYR-DGGRGGYGGYRDGGRGGGY-GGYRDGGRGGGYRDGGYRDGGYRDGGRGGY 201 >gi|8778814|gb|AAF79819.1|AC007396_20 (AC007396) T4O12.22 [Arabidopsis thaliana] Length = 369 Frame 2 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | 369 0 150 300 Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 7.8, P = 1.0 Identities = 21/61 (34%), Positives = 32/61 (52%), Frame = +2 Query: 14 SYQSDAPRGRF-NSRNFGRGNGQDGGDRDYNKSKGNGFYRPSPHQERGHS--GHHQVPRN 184 SY RGR +SR GRG G +G +Y+ + G+ +PH+ RG+ G + P Sbjct: 206 SYGRGRGRGRGRSSRGRGRG-GYNGPPNEYDAPQDGGYGYDAPHEHRGYDDRGGYDAPPQ 264 Query: 185 GQ 190 G+ Sbjct: 265 GR 266 >gi|912722|gb|AAA82849.1| (U30776) envelope glycoprotein gp41 [Human immunodeficiency virus type 1] Length = 91 Frame 3 hits (HSPs): _____________________ Frame 2 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | 91 0 20 40 60 80 Plus Strand HSPs: Score = 56 (19.7 bits), Expect = 9.6, Sum P(2) = 1.0 Identities = 11/37 (29%), Positives = 19/37 (51%), Frame = +3 Query: 240 FWVQIFQL*IVNFSRLDQILIFFMS*IKLFREYGWDV 350 FWV + L + + RL +L+ ++L GWD+ Sbjct: 38 FWVDLRNLCLFLYHRLRDLLLIVTRIVELLGRRGWDL 74 Score = 38 (13.4 bits), Expect = 9.6, Sum P(2) = 1.0 Identities = 6/15 (40%), Positives = 12/15 (80%), Frame = +2 Query: 68 GNGQDGGDRDYNKSK 112 G ++GGD+D ++S+ Sbjct: 14 GIEEEGGDKDRDRSR 28 >gi|912725|gb|AAA82851.1| (U30777) envelope glycoprotein gp41 [Human immunodeficiency virus type 1] Length = 91 Frame 3 hits (HSPs): _____________________ Frame 2 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | 91 0 20 40 60 80 Plus Strand HSPs: Score = 56 (19.7 bits), Expect = 9.6, Sum P(2) = 1.0 Identities = 11/37 (29%), Positives = 19/37 (51%), Frame = +3 Query: 240 FWVQIFQL*IVNFSRLDQILIFFMS*IKLFREYGWDV 350 FWV + L + + RL +L+ ++L GWD+ Sbjct: 38 FWVDLRNLCLFLYHRLRDLLLIVTRIVELLGRRGWDL 74 Score = 38 (13.4 bits), Expect = 9.6, Sum P(2) = 1.0 Identities = 6/15 (40%), Positives = 12/15 (80%), Frame = +2 Query: 68 GNGQDGGDRDYNKSK 112 G ++GGD+D ++S+ Sbjct: 14 GIEEEGGDKDRDRSR 28 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.99 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.347 0.156 0.491 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.344 0.156 0.537 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.357 0.156 0.590 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.355 0.155 0.557 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.367 0.172 0.692 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.345 0.154 0.526 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 161 160 10. 75 3 12 22 0.10 34 30 0.099 37 +2 0 162 161 10. 75 3 12 22 0.10 34 30 0.10 37 +1 0 162 161 10. 75 3 12 22 0.10 34 30 0.10 37 -1 0 162 161 10. 75 3 12 22 0.10 34 30 0.10 37 -2 0 162 161 10. 75 3 12 22 0.10 34 30 0.10 37 -3 0 161 161 10. 75 3 12 22 0.10 34 30 0.10 37 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 9 No. of states in DFA: 591 (58 KB) Total size of DFA: 197 KB (256 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 157.85u 1.16s 159.01t Elapsed: 00:00:34 Total cpu time: 157.87u 1.20s 159.07t Elapsed: 00:00:34 Start: Mon Feb 4 23:01:54 2002 End: Mon Feb 4 23:02:28 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000