WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker Server unavailable.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= B05H01.seq(1>482) (446 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 4 Sequences : less than 4 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1103 239 |=========================================================== 6310 864 137 |================================== 3980 727 194 |================================================ 2510 533 110 |=========================== 1580 423 118 |============================= 1000 305 73 |================== 631 232 60 |=============== 398 172 28 |======= 251 144 35 |======== 158 109 40 |========== 100 69 18 |==== 63.1 51 10 |== 39.8 41 8 |== 25.1 33 21 |===== 15.8 12 5 |= >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 7 <<<<<<<<<<<<<<<<< 10.0 7 4 |= 6.31 3 1 |: 3.98 2 0 | 2.51 2 2 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|139588|sp|P28087|VSH_MUMPRSMALL HYDROPHOBIC PROTEI... +2 64 0.89 1 gi|102069|pir||B35349H+-transporting ATP synthase (EC... +3 64 0.89 1 gi|6425501|emb|CAB60603.1|(AL132865) contains similar... +3 80 0.997 1 gi|1840457|gb|AAC56591.1|(U50293) SH protein [Mumps v... +2 60 0.998 1 gi|808679|gb|AAA66559.1|(M19004) unknown protein [Sai... +1 59 0.9998 1 gi|1840429|gb|AAC56577.1|(U50181) SH protein [Mumps v... +2 59 0.9998 1 gi|1840439|gb|AAC56582.1|(U50284) SH protein [Mumps v... +2 59 0.9998 1
Use the and icons to retrieve links to Entrez:
>gi|139588|sp|P28087|VSH_MUMPR SMALL HYDROPHOBIC PROTEIN >gi|94037|pir||S24827 small hydrophobic protein - mumps virus >gi|60595|emb|CAA45241.1| (X63708) SH gene [Mumps virus] Length = 57 Frame 2 hits (HSPs): __________________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 57 0 20 40 __________________ Annotated Domains: Entrez Transmembrane region: POTENTIAL. 8..32 PFAM SH: Viral small hydrophobic protein 1..57 PRODOM PD001504: VSH(13) 27..56 __________________ Plus Strand HSPs: Score = 64 (22.5 bits), Expect = 2.2, P = 0.89 Identities = 13/30 (43%), Positives = 20/30 (66%), Frame = +2 Query: 161 LLLLFRGLCLVKIVIVLLFYQTVVRHCVLH 250 L+LL+R + L +I+ + Y+T VRH LH Sbjct: 15 LILLYRIITLYVWIILTITYKTSVRHAALH 44 >gi|102069|pir||B35349 H+-transporting ATP synthase (EC 3.6.1.34) protein 6 - Sauroleishmania tarentolae mitochondrion (SGC6) (fragment) Length = 67 Frame 3 hits (HSPs): _________________________________ Annotated Domains: ____________ __________________________________________________ Database sequence: | | | | | 67 0 20 40 60 __________________ Annotated Domains: PROSITE CRYSTALLIN_BETAGAMMA: Crystallins beta a 43..58 __________________ Plus Strand HSPs: Score = 64 (22.5 bits), Expect = 2.2, P = 0.89 Identities = 19/45 (42%), Positives = 25/45 (55%), Frame = +3 Query: 153 MFDYCYCLEDSV*LK*LLCFCF------IRQL*GTVFYMCLQLSACI 275 MF + C D V ++ LLCFC+ I L VFY+C +L CI Sbjct: 1 MFVFFVC--DLVIMRILLCFCYSVWSRIIFVLPVNVFYICTELMFCI 45 >gi|6425501|emb|CAB60603.1| (AL132865) contains similarity to Pfam domain: PF00001 (7 transmembrane receptor (rhodopsin family)), Score=10.9, E-value=0.0031, N=1 [Caenorhabditis elegans] Length = 310 Frame 3 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | | | | 310 0 50 100 150 200 250 300 Plus Strand HSPs: Score = 80 (28.2 bits), Expect = 5.7, P = 1.0 Identities = 26/89 (29%), Positives = 42/89 (47%), Frame = +3 Query: 123 CSLQIHKEV*MFDYCYCLEDSV*LK*LLCFCFIRQL*GTVFYMCLQLSACIVIFKVRFIK 302 C+ IH V MF Y L + CF+ G +F + +Q S C++I R Sbjct: 59 CTYLIHLRVLMFLDMYMLTSTQ--------CFLLSAYG-LFALNMQSSLCLIIGLDRLYN 109 Query: 303 LTSQINYSLSPSSGGIDLFLYLIEVFIIV 389 +T + YS P+S I LFL+ + +++ Sbjct: 110 VTYPLRYSQLPNSFYIALFLFCVGFSVLI 138 >gi|1840457|gb|AAC56591.1| (U50293) SH protein [Mumps virus] Length = 57 Frame 2 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | 57 0 20 40 Plus Strand HSPs: Score = 60 (21.1 bits), Expect = 6.4, P = 1.0 Identities = 13/30 (43%), Positives = 19/30 (63%), Frame = +2 Query: 161 LLLLFRGLCLVKIVIVLLFYQTVVRHCVLH 250 L LL+ + L +I+ + Y+T VRH VLH Sbjct: 15 LTLLYLIITLYIWIILTITYETTVRHAVLH 44 >gi|808679|gb|AAA66559.1| (M19004) unknown protein [Saimiriine herpesvirus 2] Length = 53 Frame 1 hits (HSPs): ___________________________ __________________________________________________ Database sequence: | | | | 53 0 20 40 Plus Strand HSPs: Score = 59 (20.8 bits), Expect = 8.4, P = 1.0 Identities = 12/29 (41%), Positives = 15/29 (51%), Frame = +1 Query: 199 SYC-AFVLSDSCKALCSTCVCSCLHALLF 282 S C A SD C+ TC SC+H L + Sbjct: 1 SICRAAAASDLCELCQGTCPASCIHTLFY 29 >gi|1840429|gb|AAC56577.1| (U50181) SH protein [Mumps virus] Length = 57 Frame 2 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | 57 0 20 40 Plus Strand HSPs: Score = 59 (20.8 bits), Expect = 8.4, P = 1.0 Identities = 12/30 (40%), Positives = 19/30 (63%), Frame = +2 Query: 161 LLLLFRGLCLVKIVIVLLFYQTVVRHCVLH 250 L+LL+ + L +I+ + Y+T VRH LH Sbjct: 15 LILLYLIITLYVWIILTITYKTAVRHAALH 44 >gi|1840439|gb|AAC56582.1| (U50284) SH protein [Mumps virus] >gi|1840447|gb|AAC56586.1| (U50288) SH protein [Mumps virus] >gi|1840449|gb|AAC56587.1| (U50289) SH protein [Mumps virus] >gi|1840453|gb|AAC56589.1| (U50291) SH protein [Mumps virus] >gi|9581809|emb|CAC00540.1| (AJ272363) small hydrophobic protein [Mumps virus] Length = 57 Frame 2 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | 57 0 20 40 Plus Strand HSPs: Score = 59 (20.8 bits), Expect = 8.4, P = 1.0 Identities = 12/30 (40%), Positives = 19/30 (63%), Frame = +2 Query: 161 LLLLFRGLCLVKIVIVLLFYQTVVRHCVLH 250 L+LL+ + L +I+ + Y+T VRH LH Sbjct: 15 LILLYLIITLYVWIILTITYKTAVRHAALH 44 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.97 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.354 0.161 0.561 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.358 0.163 0.609 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.356 0.154 0.543 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.352 0.158 0.527 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.348 0.152 0.535 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.375 0.172 0.697 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 148 147 10. 74 3 12 22 0.12 33 30 0.11 36 +2 0 148 147 10. 74 3 12 22 0.12 33 30 0.11 36 +1 0 148 147 10. 74 3 12 22 0.12 33 30 0.11 36 -1 0 148 148 10. 74 3 12 22 0.091 34 30 0.11 36 -2 0 148 147 10. 74 3 12 22 0.12 33 30 0.11 36 -3 0 148 147 10. 74 3 12 22 0.12 33 30 0.11 36 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 7 No. of states in DFA: 591 (58 KB) Total size of DFA: 186 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 147.96u 1.05s 149.01t Elapsed: 00:00:26 Total cpu time: 147.98u 1.07s 149.05t Elapsed: 00:00:26 Start: Sat Feb 2 04:53:56 2002 End: Sat Feb 2 04:54:22 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000