WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'E13F05_L05_11.ab1' (552 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 3 Sequences : less than 3 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 614 135 |============================================= 6310 479 73 |======================== 3980 406 119 |======================================= 2510 287 95 |=============================== 1580 192 67 |====================== 1000 125 21 |======= 631 104 21 |======= 398 83 20 |====== 251 63 19 |====== 158 44 3 |= 100 41 5 |= 63.1 36 15 |===== 39.8 21 4 |= 25.1 17 7 |== 15.8 10 1 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 9 <<<<<<<<<<<<<<<<< 10.0 9 1 |: 6.31 8 2 |: 3.98 6 2 |: 2.51 4 0 | 1.58 4 0 | 1.00 4 0 | 0.63 4 0 | 0.40 4 0 | 0.25 4 1 |: 0.16 3 1 |: 0.10 2 0 | 0.063 2 0 | 0.040 2 0 | 0.025 2 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|9994098|gb|AAG12126.1|AC079730_22(AC079730) hypoth... +1 115 0.00023 1 gi|8778403|gb|AAF79411.1|AC068197_21(AC068197) F16A14... +1 100 0.023 1 gi|7487150|pir||T02697hypothetical protein T18E12.4 -... +1 94 0.11 1 gi|3087719|emb|CAA76642.1|(Y17129) S31 protein [Angui... +1 76 0.15 1 gi|5852697|gb|AAD54172.1|AF134972_1(AF134972) MHC cla... +3 45 0.97 2 gi|5834873ref|NP_006942.1|ND4L_10020 NADH dehydrogena... +1 64 0.97 1 gi|11355301|pir||F82093hypothetical protein VC2317 [i... -3 63 0.99 1 gi|6630552|gb|AAF19571.1|AC011708_14(AC011708) putati... +1 81 0.990 1 gi|12597802|gb|AAG60114.1|AC073178_25(AC073178) unkno... +1 78 0.99991 1
Use the and icons to retrieve links to Entrez:
>gi|9994098|gb|AAG12126.1|AC079730_22 (AC079730) hypothetical protein; 33506-35224 [Arabidopsis thaliana] >gi|12321358|gb|AAG50754.1|AC079733_22 (AC079733) integral membrane protein, putative [Arabidopsis thaliana] Length = 212 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | 212 0 50 100 150 200 Plus Strand HSPs: Score = 115 (40.5 bits), Expect = 0.00023, P = 0.00023 Identities = 22/30 (73%), Positives = 27/30 (90%), Frame = +1 Query: 1 FSIFSLGICILVSGLQLWYLKRFFRKKKLI 90 +SI SLGICI+VSGLQ+ YLK++F KKKLI Sbjct: 183 YSIMSLGICIVVSGLQILYLKQYFEKKKLI 212 >gi|8778403|gb|AAF79411.1|AC068197_21 (AC068197) F16A14.23 [Arabidopsis thaliana] Length = 221 Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | 221 0 50 100 150 200 Plus Strand HSPs: Score = 100 (35.2 bits), Expect = 0.023, P = 0.023 Identities = 18/29 (62%), Positives = 23/29 (79%), Frame = +1 Query: 4 SIFSLGICILVSGLQLWYLKRFFRKKKLI 90 S SL +C+ V+GLQ W+LK FF+KKKLI Sbjct: 193 SFVSLAVCLSVAGLQFWHLKTFFQKKKLI 221 >gi|7487150|pir||T02697 hypothetical protein T18E12.4 - Arabidopsis thaliana >gi|4335768|gb|AAD17445.1| (AC006284) putative Golgi-associated membrane trafficking protein [Arabidopsis thaliana] Length = 213 Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | 213 0 50 100 150 200 Plus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.12, P = 0.11 Identities = 18/29 (62%), Positives = 22/29 (75%), Frame = +1 Query: 4 SIFSLGICILVSGLQLWYLKRFFRKKKLI 90 S SL +C+ V+GLQ W+LK FF KKKLI Sbjct: 185 SFGSLVVCLSVAGLQFWHLKTFFEKKKLI 213 >gi|3087719|emb|CAA76642.1| (Y17129) S31 protein [Anguilla anguilla] Length = 48 Frame 1 hits (HSPs): ________________________________ __________________________________________________ Database sequence: | | | | 48 0 20 40 Plus Strand HSPs: Score = 76 (26.8 bits), Expect = 0.16, P = 0.15 Identities = 19/30 (63%), Positives = 23/30 (76%), Frame = +1 Query: 1 FSIFSLGICILVSGLQLW---YLKRFFRKKKLI 90 FSIFS+ C L+ GL W YL+RFF+ KKLI Sbjct: 18 FSIFSM--CCLI-GLATWQVFYLRRFFKAKKLI 47 >gi|5852697|gb|AAD54172.1|AF134972_1 (AF134972) MHC class II antigen [Psettodes sp.] Length = 63 Frame 3 hits (HSPs): ______________________ Frame 1 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | | 63 0 20 40 60 Plus Strand HSPs: Score = 45 (15.8 bits), Expect = 3.4, Sum P(2) = 0.97 Identities = 13/33 (39%), Positives = 16/33 (48%), Frame = +3 Query: 234 QELESLVRSWR*WNDGDWF*LLSIYYIYEFLRH 332 QE+ S V S DGDW+ Y I+ L H Sbjct: 20 QEVTSDVTSTEELADGDWY-----YQIHSHLEH 47 Score = 40 (14.1 bits), Expect = 3.4, Sum P(2) = 0.97 Identities = 9/19 (47%), Positives = 10/19 (52%), Frame = +1 Query: 106 YETYPHQIVFSSFRDDSNI 162 YE YP QI S RD + Sbjct: 4 YEFYPKQIRVSWTRDGQEV 22 >gi|5834873 ref|NP_006942.1|ND4L_10020 NADH dehydrogenase subunit 4L [Ascaris suum] >gi|1171867|sp|P24874|NULM_ASCSU NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 4L >gi|102586|pir||S26015 NADH dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 4L - pig roundworm mitochondrion >gi|559489|emb|CAA38164.1| (X54253) ND4L protein [Ascaris suum] Length = 77 Frame 1 hits (HSPs): ___________________________________ __________________________________________________ Database sequence: | | | | | 77 0 20 40 60 Plus Strand HSPs: Score = 64 (22.5 bits), Expect = 3.6, P = 0.97 Identities = 18/54 (33%), Positives = 29/54 (53%), Frame = +1 Query: 55 YLKRFFRKKKLI*ILLSYETYPHQIVFSSFRDDSNIV*FFFLWILSCVIGVCGL 216 +L FF+ ++L+ IL+S E + +F F D N + FF+ S V V G+ Sbjct: 8 FLSLFFKWQRLMFILISLE-FIVMSLFILFSGDLNEMMFFYFMCFSVVSSVLGM 60 >gi|11355301|pir||F82093 hypothetical protein VC2317 [imported] - Vibrio cholerae (group O1 strain N16961) >gi|9656882|gb|AAF95461.1| (AE004301) hypothetical protein [Vibrio cholerae] Length = 54 Frame -3 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | 54 0 20 40 Minus Strand HSPs: Score = 63 (22.2 bits), Expect = 4.6, P = 0.99 Identities = 10/25 (40%), Positives = 19/25 (76%), Frame = -3 Query: 523 CTNIRLIHSSIHILFHLINSISVKH 449 C N+ L+ +S+H LF+L+N + ++H Sbjct: 28 CNNLHLLFTSVHTLFNLLN-LKLQH 51 >gi|6630552|gb|AAF19571.1|AC011708_14 (AC011708) putative membrane protein [Arabidopsis thaliana] Length = 217 Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | 217 0 50 100 150 200 Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 4.6, P = 0.99 Identities = 17/28 (60%), Positives = 22/28 (78%), Frame = +1 Query: 7 IFSLGICILVSGLQLWYLKRFFRKKKLI 90 + SLG+ I+VS Q+ YLKR+F KKKLI Sbjct: 190 LMSLGVAIVVSISQVLYLKRYFLKKKLI 217 >gi|12597802|gb|AAG60114.1|AC073178_25 (AC073178) unknown protein [Arabidopsis thaliana] Length = 204 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | 204 0 50 100 150 200 Plus Strand HSPs: Score = 78 (27.5 bits), Expect = 9.3, P = 1.0 Identities = 15/29 (51%), Positives = 21/29 (72%), Frame = +1 Query: 4 SIFSLGICILVSGLQLWYLKRFFRKKKLI 90 S+ S +CI V+G+Q +LK FF KKK+I Sbjct: 176 SVLSFFVCIGVAGMQFLHLKTFFEKKKVI 204 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.98 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.355 0.159 0.620 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.354 0.158 0.570 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.361 0.165 0.617 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.347 0.153 0.501 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.345 0.150 0.499 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.361 0.157 0.561 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 183 183 10. 76 3 12 22 0.12 34 31 0.12 37 +2 0 183 183 10. 76 3 12 22 0.12 34 31 0.12 37 +1 0 184 184 10. 76 3 12 22 0.12 34 31 0.12 37 -1 0 184 184 10. 76 3 12 22 0.12 34 31 0.12 37 -2 0 183 183 10. 76 3 12 22 0.12 34 31 0.12 37 -3 0 183 183 10. 76 3 12 22 0.12 34 31 0.12 37 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 9 No. of states in DFA: 594 (59 KB) Total size of DFA: 219 KB (256 KB) Time to generate neighborhood: 0.02u 0.00s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 182.88u 1.06s 183.94t Elapsed: 00:00:31 Total cpu time: 182.90u 1.09s 183.99t Elapsed: 00:00:31 Start: Wed Jan 23 18:25:21 2002 End: Wed Jan 23 18:25:52 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000