WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= SSH3A09.SEQ(1>281) (256 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 505,245 sequences; 158,518,215 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 7 Sequences : less than 7 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1876 413 |=========================================================== 6310 1463 371 |===================================================== 3980 1092 208 |============================= 2510 884 293 |========================================= 1580 591 139 |=================== 1000 452 174 |======================== 631 278 116 |================ 398 162 56 |======== 251 106 47 |====== 158 59 13 |= 100 46 11 |= 63.1 35 15 |== 39.8 20 2 |: 25.1 18 6 |: 15.8 12 2 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 10 <<<<<<<<<<<<<<<<< 10.0 10 2 |: 6.31 8 0 | 3.98 8 1 |: 2.51 7 1 |: 1.58 6 0 | 1.00 6 0 | 0.63 6 0 | 0.40 6 0 | 0.25 6 1 |: 0.16 5 0 | 0.10 5 0 | 0.063 5 0 | 0.040 5 0 | 0.025 5 0 | 0.016 5 0 | 0.010 5 0 | 0.0063 5 0 | 0.0040 5 0 | 0.0025 5 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|4850385|gb|AAD31055.1|AC007357_4(AC007357) EST gb|... -1 116 3.6e-05 1 gi|6739522|gb|AAF27287.1|AF136007_1(AF136007) eukaryo... -1 116 3.6e-05 1 gi|6739539|gb|AAF27294.1|(AF145232) eukaryotic initia... -1 96 0.00040 1 gi|6224611|gb|AAF05869.1|AF021805_1(AF021805) eIF4B [... -1 104 0.00067 1 gi|6739518|gb|AAF27285.1|(AF136005) eukaryotic initia... -1 96 0.0023 1 gi|6739520|gb|AAF27286.1|AF136006_1(AF136006) eukaryo... -1 71 0.19 1 gi|6739515|gb|AAF27284.1|(AF136004) eukaryotic initia... -1 71 0.86 1 gi|2127792|pir||B64432capsular polysaccharide biosynt... +3 68 0.97 1 gi|7450263|pir||H69530conserved hypothetical protein ... +3 56 0.9991 1 gi|1176643|sp|Q09202|YP23_CAEELHYPOTHETICAL 25.9 KD P... +1 61 0.9999 1 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 3 Hits gi|2127792 | _____________________________________ gi|7450263 | ___________________________ __________________________________________________ Query sequence: | | | | | | 86 0 20 40 60 80 Locus_ID Frame 1 Hits gi|1176643 | ___________________ __________________________________________________ Query sequence: | | | | | | 86 0 20 40 60 80 Locus_ID Frame -1 Hits gi|4850385 | ________________________ gi|6739522 | ________________________ gi|6739539 | _______________________ gi|6224611 | _______________________ gi|6739518 | _______________________ gi|6739520 | _______________________ gi|6739515 | _______________________ __________________________________________________ Query sequence: | | | | | | 86 0 20 40 60 80
Use the and icons to retrieve links to Entrez:
>gi|4850385|gb|AAD31055.1|AC007357_4 (AC007357) EST gb|T22808 comes from this gene. [Arabidopsis thaliana] Length = 549 Frame -1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | 549 0 150 300 450 Minus Strand HSPs: Score = 116 (40.8 bits), Expect = 3.6e-05, P = 3.6e-05 Identities = 24/39 (61%), Positives = 31/39 (79%), Frame = -1 Query: 256 TGDMWMRPSDDRRQFQGGSRERGWFSSGSRNRSTS-RERW 140 TGD W RP DDRR FQG S+ERG+F++ + +RS+S RE W Sbjct: 511 TGDNWPRPVDDRRNFQG-SKERGFFNNRNFDRSSSAREGW 549 >gi|6739522|gb|AAF27287.1|AF136007_1 (AF136007) eukaryotic initiation factor 4B [Arabidopsis thaliana] Length = 549 Frame -1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | 549 0 150 300 450 Minus Strand HSPs: Score = 116 (40.8 bits), Expect = 3.6e-05, P = 3.6e-05 Identities = 24/39 (61%), Positives = 31/39 (79%), Frame = -1 Query: 256 TGDMWMRPSDDRRQFQGGSRERGWFSSGSRNRSTS-RERW 140 TGD W RP DDRR FQG S+ERG+F++ + +RS+S RE W Sbjct: 511 TGDNWPRPVDDRRNFQG-SKERGFFNNRNFDRSSSAREGW 549 >gi|6739539|gb|AAF27294.1| (AF145232) eukaryotic initiation factor 4B [Arabidopsis thaliana] Length = 93 Frame -1 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | | | 93 0 20 40 60 80 Minus Strand HSPs: Score = 96 (33.8 bits), Expect = 0.00040, P = 0.00040 Identities = 21/37 (56%), Positives = 29/37 (78%), Frame = -1 Query: 250 DMWMRPSDD-RRQFQGGSRERGWFSSGSRNRSTSRERW 140 D W+RP+++ RR FQG ++ERG+FS NRS+SRE W Sbjct: 61 DAWVRPANEQRRNFQG-TKERGFFS----NRSSSREGW 93 >gi|6224611|gb|AAF05869.1|AF021805_1 (AF021805) eIF4B [Arabidopsis thaliana] Length = 531 Frame -1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | 531 0 150 300 450 Minus Strand HSPs: Score = 104 (36.6 bits), Expect = 0.00067, P = 0.00067 Identities = 23/37 (62%), Positives = 29/37 (78%), Frame = -1 Query: 250 DMWMRPSDD-RRQFQGGSRERGWFSSGSRNRSTSRERW 140 D W+RP+D+ RR FQG S+ERG+FS NRS+SRE W Sbjct: 499 DAWVRPADEQRRNFQG-SKERGFFS----NRSSSREGW 531 >gi|6739518|gb|AAF27285.1| (AF136005) eukaryotic initiation factor 4B [Arabidopsis thaliana] Length = 325 Frame -1 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | | | | | 325 0 50 100 150 200 250 300 Minus Strand HSPs: Score = 96 (33.8 bits), Expect = 0.0023, P = 0.0023 Identities = 21/37 (56%), Positives = 29/37 (78%), Frame = -1 Query: 250 DMWMRPSDD-RRQFQGGSRERGWFSSGSRNRSTSRERW 140 D W+RP+++ RR FQG ++ERG+FS NRS+SRE W Sbjct: 293 DAWVRPANEQRRNFQG-TKERGFFS----NRSSSREGW 325 >gi|6739520|gb|AAF27286.1|AF136006_1 (AF136006) eukaryotic initiation factor 4B [Triticum aestivum] Length = 135 Frame -1 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | 135 0 50 100 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.20, P = 0.19 Identities = 16/37 (43%), Positives = 25/37 (67%), Frame = -1 Query: 250 DMWMRPSDDRRQFQGGSRERGWFSSG-SRNRSTSRERW 140 + + +P ++R F G SRERG F G S +RS++R+ W Sbjct: 99 ETYPKPVEERWGFHG-SRERGSFGGGGSSDRSSTRQGW 135 >gi|6739515|gb|AAF27284.1| (AF136004) eukaryotic initiation factor 4B [Triticum aestivum] Length = 462 Frame -1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | 462 0 150 300 450 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 2.0, P = 0.86 Identities = 16/37 (43%), Positives = 25/37 (67%), Frame = -1 Query: 250 DMWMRPSDDRRQFQGGSRERGWFSSG-SRNRSTSRERW 140 + + +P ++R F G SRERG F G S +RS++R+ W Sbjct: 426 ETYPKPVEERWGFHG-SRERGSFGGGGSSDRSSTRQGW 462 >gi|2127792|pir||B64432 capsular polysaccharide biosynthsis protein M homolog - Methanococcus jannaschii >gi|1591712|gb|AAB99063.1| (U67549) capsular polysaccharide biosynthsis protein M [Methanococcus jannaschii] Length = 406 Frame 3 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | 406 0 150 300 Plus Strand HSPs: Score = 68 (23.9 bits), Expect = 3.5, P = 0.97 Identities = 22/63 (34%), Positives = 37/63 (58%), Frame = +3 Query: 45 KRL-EKTSN-KIVVLGSGHQKLKITLNPHKTAVHHLSLDVDLFLLPEENHPLSLLPP*NC 218 KR+ EK N K+++LG G K K+ + ++ L+L +++LL + +P L NC Sbjct: 225 KRVTEKYPNAKLIILGDGELKNKL-----QELINKLNLQNNVYLLGMQKNPFKFLKHSNC 279 Query: 219 LL-SSL 233 + SSL Sbjct: 280 FVFSSL 285 >gi|7450263|pir||H69530 conserved hypothetical protein AF2248 - Archaeoglobus fulgidus >gi|2648278|gb|AAB89009.1| (AE000949) conserved hypothetical protein [Archaeoglobus fulgidus] Length = 77 Frame 3 hits (HSPs): _____________________________ __________________________________________________ Database sequence: | | | | | 77 0 20 40 60 Plus Strand HSPs: Score = 56 (19.7 bits), Expect = 7.0, P = 1.0 Identities = 15/44 (34%), Positives = 22/44 (50%), Frame = +3 Query: 69 KIVVLGSGHQKLKITLNPHKTAVHHLSLDVDLFLLPEENHPLSL 200 KI V+G G + K T + K V LDV+L + + N + L Sbjct: 2 KIKVVGPGCARCKATFDVVKKVVEKEGLDVELEYVTDMNEAIEL 45 >gi|1176643|sp|Q09202|YP23_CAEEL HYPOTHETICAL 25.9 KD PROTEIN AH6.3 IN CHROMOSOME II >gi|7494683|pir||T18613 hypothetical protein AH6.3 - Caenorhabditis elegans >gi|3873631|emb|CAA88077.1| (Z48009) AH6.3 [Caenorhabditis elegans] Length = 230 Frame 1 hits (HSPs): ________ Annotated Domains: ____________________ ______________ __________________________________________________ Database sequence: | | | | | | 230 0 50 100 150 200 __________________ Annotated Domains: Entrez Transmembrane region: POTENTIAL. 7..23 PRODOM PD064152: YP23_CAEEL 1..90 PRODOM PD053296: YP23_CAEEL 169..229 __________________ Plus Strand HSPs: Score = 61 (21.5 bits), Expect = 9.1, P = 1.0 Identities = 15/35 (42%), Positives = 21/35 (60%), Frame = +1 Query: 121 HTKLLFTISLLTLTY--FCCQKKTIPFPCYHLETV 219 + ++LFT S+L+L Y F C KK P P E+V Sbjct: 3 YIQMLFTASILSLGYLVFICGKKKKPKPTASTESV 37 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.94 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.338 0.152 0.475 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.349 0.145 0.509 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.378 0.172 0.685 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.347 0.151 0.570 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.335 0.151 0.442 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.373 0.161 0.626 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 84 84 10. 61 3 12 22 0.12 31 28 0.096 33 +2 0 85 84 10. 61 3 12 22 0.12 31 28 0.096 33 +1 0 85 84 10. 61 3 12 22 0.12 31 28 0.096 33 -1 0 85 85 10. 61 3 12 22 0.12 31 28 0.098 33 -2 0 85 84 10. 61 3 12 22 0.12 31 28 0.096 33 -3 0 84 84 10. 61 3 12 22 0.12 31 28 0.096 33 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 8:50 PM CDT May 27, 2000 Format: BLAST # of letters in database: 158,518,215 # of sequences in database: 505,245 # of database sequences satisfying E: 10 No. of states in DFA: 583 (57 KB) Total size of DFA: 135 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 4 Search cpu time: 116.47u 1.15s 117.62t Elapsed: 00:00:45 Total cpu time: 116.51u 1.17s 117.68t Elapsed: 00:00:45 Start: Wed Feb 14 15:31:46 2001 End: Wed Feb 14 15:32:31 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000