WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= SSH4C06.SEQ(1>505) (474 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 505,245 sequences; 158,518,215 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 6 Sequences : less than 6 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1318 334 |======================================================= 6310 984 211 |=================================== 3980 773 175 |============================= 2510 598 189 |=============================== 1580 409 142 |======================= 1000 267 98 |================ 631 169 47 |======= 398 122 43 |======= 251 79 22 |=== 158 57 11 |= 100 46 14 |== 63.1 32 10 |= 39.8 22 14 |== 25.1 8 1 |: 15.8 7 2 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 5 <<<<<<<<<<<<<<<<< 10.0 5 1 |: 6.31 4 1 |: 3.98 3 2 |: 2.51 1 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|1174318|gb|AAB35482.1|Ca2+/calmodulin-dependent pr... -1 65 0.80 1 gi|7189444|gb|AAF38354.1|(AE002212) hypothetical prot... -3 41 0.93 2 gi|7482503|pir||F69124hypothetical protein MTH201 - M... -1 76 0.98 1 gi|3915439|sp|O13695|YEN1_SCHPOHYPOTHETICAL 52.9 KD S... -1 83 0.996 1 gi|1174316|gb|AAB35480.1|Ca2+/calmodulin-dependent pr... -1 67 0.999 1 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame -1 Hits gi|1174318 | gi|7482503 | gi|3915439 | gi|1174316 | __________________________________________________ Query sequence: | | | | | 158 0 50 100 150 Locus_ID Frame -3 Hits gi|7189444 | ___ ____ Prosite Hits: __ __________________________________________________ Query sequence: | | | | | 158 0 50 100 150 __________________ Prosite hits: CYTOCHROME_C Cytochrome c family heme-binding site si 43..48 __________________
Use the and icons to retrieve links to Entrez:
>gi|1174318|gb|AAB35482.1| Ca2+/calmodulin-dependent protein kinase beta e isoform, beta e CaM kinase {alternatively spliced} [rats, Sprague-Dawley, Peptide Partial, 74 aa] Length = 74 Frame -1 hits (HSPs): ______________________________________________ __________________________________________________ Database sequence: | | | | | 74 0 20 40 60 Minus Strand HSPs: Score = 65 (22.9 bits), Expect = 1.6, P = 0.80 Identities = 21/69 (30%), Positives = 29/69 (42%), Frame = -1 Query: 234 ASLSSVLAEINSRLRNVVGNMQGDNTVPSGQMESNSRDLSSGSESRPPTVNKQQDTVDVN 55 A L+++LA N + N + D P NS ++S S PP + Q TV N Sbjct: 1 AILTTMLATRNFSAAKSLLNKKADGVKPQTNSTKNSSAITSPKGSLPPAALEPQTTVIHN 60 Query: 54 GFGAISASS 28 I SS Sbjct: 61 PVDGIKESS 69 >gi|7189444|gb|AAF38354.1| (AE002212) hypothetical protein [Chlamydophila pneumoniae AR39] Length = 39 Frame -3 hits (HSPs): _______________ ___________ __________________________________________________ Database sequence: | | | 39 0 20 Minus Strand HSPs: Score = 41 (14.4 bits), Expect = 2.6, Sum P(2) = 0.93 Identities = 5/11 (45%), Positives = 8/11 (72%), Frame = -3 Query: 274 WVWYFYLTATF 242 W W+FYL + + Sbjct: 4 WQWFFYLYSQY 14 Score = 37 (13.0 bits), Expect = 2.6, Sum P(2) = 0.93 Identities = 6/8 (75%), Positives = 7/8 (87%), Frame = -3 Query: 133 KQQGLVFW 110 K+QGL FW Sbjct: 25 KKQGLCFW 32 >gi|7482503|pir||F69124 hypothetical protein MTH201 - Methanobacterium thermoautotrophicum (strain Delta H) >gi|2621246|gb|AAB84707.1| (AE000807) unknown [Methanobacterium thermoautotrophicum] Length = 147 Frame -1 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | 147 0 50 100 Minus Strand HSPs: Score = 76 (26.8 bits), Expect = 3.7, P = 0.98 Identities = 19/75 (25%), Positives = 35/75 (46%), Frame = -1 Query: 324 TIPSHPPGVGVSSSTQTGFGISTSQPPS---DSASLSSVLAEINSRLRNVVGNMQGDNTV 154 T S P S + TG G TS+ PS ++ S A+++++ + + G+ + Sbjct: 49 TTTSPAPTKKTSDRSSTGSGTKTSKSPSGGVETGSAQGEAADVSAQNKTDIQGQDGNEST 108 Query: 153 PSGQMESNSRDLSSG 109 PSG + + +SG Sbjct: 109 PSGNITGKEKQGTSG 123 >gi|3915439|sp|O13695|YEN1_SCHPO HYPOTHETICAL 52.9 KD SERINE-RICH PROTEIN C11G7.01 IN CHROMOSOME I >gi|7492035|pir||T37544 hypothetical serine-rich protein - fission yeast (Schizosaccharomyces pombe) >gi|2408006|emb|CAB16206.1| (Z99161) hypothetical serine-rich protein. [Schizosaccharomyces pombe] Length = 536 Frame -1 hits (HSPs): ___________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | 536 0 150 300 450 __________________ Annotated Domains: Entrez Domain: SER-RICH. 25..238 Entrez Transmembrane region: POTENTIAL. 247..267 Entrez Transmembrane region: POTENTIAL. 351..371 PRODOM PD000651: VL2(74) O18758(34) MUC2(10) 3..228 PRODOM PD126628: YEN1_SCHPO 230..535 __________________ Minus Strand HSPs: Score = 83 (29.2 bits), Expect = 5.5, P = 1.0 Identities = 29/111 (26%), Positives = 53/111 (47%), Frame = -1 Query: 339 MS*QQTIPSHPPGVGVSSSTQTGFGISTSQPPSDSASLSSVLAEINSRLRNVVGNMQGDN 160 +S IPS + +SSS+ T S+S P S+S +S + +S + ++V + + Sbjct: 35 LSSSSVIPSSSSSM-LSSSSATAISSSSSSSPLSSSSFTSPAS--SSFITSLVSSSSQQS 91 Query: 159 TVPSGQMESNSRDLSSGSESRPPTVNKQQDTVDVNGFGAI-SASSVGCTSES 7 + S + S+S + S S PT + + + + S+SS G +S S Sbjct: 92 SSSSASLTSSSSATLTSSSSASPTSSSSSHALSSSSSSLVASSSSSGMSSSS 143 >gi|1174316|gb|AAB35480.1| Ca2+/calmodulin-dependent protein kinase beta isoform, beta CaM kinase {alternatively spliced} [rats, Sprague-Dawley, Peptide Partial, 98 aa] Length = 98 Frame -1 hits (HSPs): ________________________________________ __________________________________________________ Database sequence: | | | | | | 98 0 20 40 60 80 Minus Strand HSPs: Score = 67 (23.6 bits), Expect = 6.8, P = 1.0 Identities = 21/80 (26%), Positives = 34/80 (42%), Frame = -1 Query: 267 GISTSQPPSDSASLSSVLAEINSRLRNVVGNMQGDNTVPSGQMESNSRDLSSGSESRPPT 88 G T+ P + S + S + + ++++ N + D P NS ++S S PP Sbjct: 15 GRQTTAPATMSTAASGTTMGLVEQAKSLL-NKKADGVKPQTNSTKNSSAITSPKGSLPPA 73 Query: 87 VNKQQDTVDVNGFGAISASS 28 + Q TV N I SS Sbjct: 74 ALEPQTTVIHNPVDGIKESS 93 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.98 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.363 0.164 0.612 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.329 0.145 0.473 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.365 0.160 0.617 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.317 0.131 0.387 same same same Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.346 0.152 0.530 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.364 0.161 0.701 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 157 157 10. 74 3 12 22 0.098 34 30 0.12 36 +2 0 157 157 10. 74 3 12 22 0.098 34 30 0.12 36 +1 0 158 157 10. 74 3 12 22 0.098 34 30 0.12 36 -1 0 158 158 10. 74 3 12 22 0.095 34 30 0.097 37 -2 0 157 157 10. 74 3 12 22 0.098 34 30 0.12 36 -3 0 157 157 10. 74 3 12 22 0.098 34 30 0.12 36 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 8:50 PM CDT May 27, 2000 Format: BLAST # of letters in database: 158,518,215 # of sequences in database: 505,245 # of database sequences satisfying E: 5 No. of states in DFA: 593 (58 KB) Total size of DFA: 195 KB (256 KB) Time to generate neighborhood: 0.02u 0.00s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 4 Search cpu time: 216.28u 1.26s 217.54t Elapsed: 00:01:35 Total cpu time: 216.33u 1.28s 217.61t Elapsed: 00:01:36 Start: Wed Feb 14 17:55:17 2001 End: Wed Feb 14 17:56:53 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000