WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= SSH3E08.SEQ(1>207) (184 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 505,245 sequences; 158,518,215 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 7 Sequences : less than 7 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 2808 244 |================================== 6310 2564 407 |========================================================== 3980 2157 363 |=================================================== 2510 1794 378 |====================================================== 1580 1416 380 |====================================================== 1000 1036 297 |========================================== 631 739 224 |================================ 398 515 177 |========================= 251 338 104 |============== 158 234 72 |========== 100 162 44 |====== 63.1 118 45 |====== 39.8 73 35 |===== 25.1 38 20 |== 15.8 18 6 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 12 <<<<<<<<<<<<<<<<< 10.0 12 6 |: 6.31 6 0 | 3.98 6 3 |: 2.51 3 1 |: 1.58 2 1 |: 1.00 1 0 | 0.63 1 0 | 0.40 1 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|465541|sp|P34780|YCX5_ASTLOHYPOTHETICAL 13.3 KD PR... +3 69 0.25 1 gi|6714454|gb|AAF26141.1|AC011620_17(AC011620) putati... +2 63 0.76 1 gi|7505458|pir||T32045hypothetical protein K07E8.5 - ... +3 45 0.82 2 gi|5734551|emb|CAB52777.1|(AJ243656) ORF3 [Methanobac... +3 65 0.95 1 gi|7504981|pir||T33244hypothetical protein H27D07.4 -... +2 56 0.97 2 gi|7497661|pir||T31992hypothetical protein C49D10.3 -... +2 50 0.98 2 gi|7332059|gb|AAF60746.1|(AC024804) Hypothetical prot... +3 44 0.999 2 gi|77241|pir||S02769gag 75K protein precursor - Molon... +3 56 0.9990 1 gi|1171810|sp|P24883|NU3M_ASCSUNADH-UBIQUINONE OXIDOR... +3 56 0.9993 1 gi|3769651|gb|AAC64600.1|(AF091580) olfactory recepto... +3 46 0.9995 2 gi|3769653|gb|AAC64601.1|(AF091581) olfactory recepto... +3 46 0.9995 2 gi|122782|sp|P20864|HBP_VICFAPOTENTIAL HEME-BINDING P... +3 55 0.9999 1 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 3 Hits gi|465541 | _______________________________________ gi|7505458 | _______________ gi|5734551 |______________________________________________ gi|7504981 | __________ gi|7332059 | __________________ gi|77241 |__________________________ gi|1171810 | _________________________________ gi|3769651 |___________________________ gi|3769653 |___________________________ gi|122782 | ________________________________ __________________________________________________ Query sequence: | | | | | 62 0 20 40 60 Locus_ID Frame 2 Hits gi|6714454 |___________ gi|7504981 | ________________ gi|7497661 | _____________ gi|3769651 | _____________________________ gi|3769653 | _____________________________ __________________________________________________ Query sequence: | | | | | 62 0 20 40 60 Locus_ID Frame 1 Hits gi|7505458 | _________________ gi|7497661 | _____________ gi|7332059 | _________________ __________________________________________________ Query sequence: | | | | | 62 0 20 40 60
Use the and icons to retrieve links to Entrez:
>gi|465541|sp|P34780|YCX5_ASTLO HYPOTHETICAL 13.3 KD PROTEIN IN RPL23-RPL5 INTERGENIC REGION (ORF105) >gi|481417|pir||S38605 hypothetical protein 105 (rpl23 3' region) - euglenid (Astasia longa) plastid >gi|414871|emb|CAA53329.1| (X75653) orf105 [Astasia longa] Length = 105 Frame 3 hits (HSPs): ______________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | | | 105 0 20 40 60 80 100 __________________ Annotated Domains: PRODOM PD064585: YCX5_ASTLO 1..104 __________________ Plus Strand HSPs: Score = 69 (24.3 bits), Expect = 0.29, P = 0.25 Identities = 19/47 (40%), Positives = 28/47 (59%), Frame = +3 Query: 24 RERKKISCLIILFACVLFTLFLYL---AFFLANIA*DTMLCFYQLQFNGF 164 R+RKKI +II C L +++ L ++F+ NI + FYQ NGF Sbjct: 36 RKRKKIIYIIIYIFCFLILMYILLIMDSYFIVNI-----IEFYQKYENGF 80 >gi|6714454|gb|AAF26141.1|AC011620_17 (AC011620) putative 60S ribosomal protein L22 [Arabidopsis thaliana] Length = 124 Frame 2 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | 124 0 50 100 Plus Strand HSPs: Score = 63 (22.2 bits), Expect = 1.4, P = 0.76 Identities = 12/12 (100%), Positives = 12/12 (100%), Frame = +2 Query: 5 FNIAENEGEEED 40 FNIAENEGEEED Sbjct: 113 FNIAENEGEEED 124 >gi|7505458|pir||T32045 hypothetical protein K07E8.5 - Caenorhabditis elegans >gi|2315723|gb|AAB66150.1| (AF016678) contains similarity to seven-transmembrane receptors [Caenorhabditis elegans] Length = 305 Frame 3 hits (HSPs): ___ Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | | || 305 0 50 100 150 200 250 300 Plus Strand HSPs: Score = 45 (15.8 bits), Expect = 1.7, Sum P(2) = 0.82 Identities = 5/18 (27%), Positives = 13/18 (72%), Frame = +3 Query: 30 RKKISCLIILFACVLFTL 83 R KI+C++++ C ++ + Sbjct: 112 RAKITCVVLMLICFIYNI 129 Score = 42 (14.8 bits), Expect = 1.7, Sum P(2) = 0.82 Identities = 9/20 (45%), Positives = 12/20 (60%), Frame = +1 Query: 112 ILLEIRCFVFISCNLTVLII 171 I L + F FI CN T L++ Sbjct: 209 ISLILVVFFFIFCNFTALMV 228 >gi|5734551|emb|CAB52777.1| (AJ243656) ORF3 [Methanobacterium thermoautotrophicum] Length = 221 Frame 3 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | | 221 0 50 100 150 200 Plus Strand HSPs: Score = 65 (22.9 bits), Expect = 3.1, P = 0.95 Identities = 21/59 (35%), Positives = 28/59 (47%), Frame = +3 Query: 6 STLPRMRERKKISCL--IILFACVLFTLFLYLAF-FLANIA*-DTMLCFYQLQFNGFDH 170 S +P M K + + LF V+F +F L +LA IA D L FY + GF H Sbjct: 32 SMIPDMDHEVKSENVSTVFLFGLVIFLVFYILGLPYLAGIALMDLALIFYLSRHRGFTH 90 >gi|7504981|pir||T33244 hypothetical protein H27D07.4 - Caenorhabditis elegans >gi|3171258|gb|AAC18402.1| (AF067950) H27D07.4 gene product [Caenorhabditis elegans] Length = 663 Frame 3 hits (HSPs): __ Frame 2 hits (HSPs): __ __________________________________________________ Database sequence: | | | | | | 663 0 150 300 450 600 Plus Strand HSPs: Score = 56 (19.7 bits), Expect = 3.6, Sum P(2) = 0.97 Identities = 9/19 (47%), Positives = 13/19 (68%), Frame = +2 Query: 71 FVYPFPIPSFLFGKYCLRY 127 F+ F +P +LFG YC+ Y Sbjct: 34 FLTAFSLPVYLFGGYCILY 52 Score = 35 (12.3 bits), Expect = 3.6, Sum P(2) = 0.97 Identities = 5/11 (45%), Positives = 8/11 (72%), Frame = +3 Query: 138 FYQLQFNGFDH 170 + Q+ F GF+H Sbjct: 353 YLQIDFPGFEH 363 >gi|7497661|pir||T31992 hypothetical protein C49D10.3 - Caenorhabditis elegans >gi|2315614|gb|AAC71178.1| (AF016665) C49D10.3 gene product [Caenorhabditis elegans] Length = 324 Frame 2 hits (HSPs): ___ Frame 1 hits (HSPs): ___ __________________________________________________ Database sequence: | | | | | | | | 324 0 50 100 150 200 250 300 Plus Strand HSPs: Score = 50 (17.6 bits), Expect = 3.8, Sum P(2) = 0.98 Identities = 7/15 (46%), Positives = 12/15 (80%), Frame = +2 Query: 83 FPIPSFLFGKYCLRY 127 F IP ++FG YC+++ Sbjct: 27 FEIPIWIFGAYCIQF 41 Score = 34 (12.0 bits), Expect = 3.8, Sum P(2) = 0.98 Identities = 7/16 (43%), Positives = 10/16 (62%), Frame = +1 Query: 124 IRCFVFISCNLTVLII 171 I F+FI C L + I+ Sbjct: 248 IMTFMFIPCVLLLYIV 263 >gi|7332059|gb|AAF60746.1| (AC024804) Hypothetical protein Y51H7BR.3 [Caenorhabditis elegans] Length = 162 Frame 3 hits (HSPs): _______ Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | 162 0 50 100 150 Plus Strand HSPs: Score = 44 (15.5 bits), Expect = 6.7, Sum P(2) = 1.0 Identities = 10/20 (50%), Positives = 13/20 (65%), Frame = +3 Query: 36 KISCLIILFACVLFTLFLYL 95 KI CLI L +L +LF+ L Sbjct: 13 KICCLIFLTLQLLISLFILL 32 Score = 30 (10.6 bits), Expect = 6.7, Sum P(2) = 1.0 Identities = 7/20 (35%), Positives = 11/20 (55%), Frame = +1 Query: 112 ILLEIRCFVFISCNLTVLII 171 +LL I F FIS ++ + Sbjct: 81 LLLTITLFWFISSIFALIFV 100 >gi|77241|pir||S02769 gag 75K protein precursor - Moloney murine leukemia virus (fragment) Length = 91 Frame 3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | | 91 0 20 40 60 80 Plus Strand HSPs: Score = 56 (19.7 bits), Expect = 7.0, P = 1.0 Identities = 12/31 (38%), Positives = 18/31 (58%), Frame = +3 Query: 6 STLPRMRERKKISCLIILFACVLFTLFLYLA 98 S R R + + C I+L C+ T+FLYL+ Sbjct: 57 SVWDRSRAARLVCCSIVL-CCLCLTVFLYLS 86 >gi|1171810|sp|P24883|NU3M_ASCSU NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 3 >gi|102584|pir||S26024 NADH dehydrogenase (ubiquinone) (EC 1.6.5.3) chain 3 - pig roundworm mitochondrion >gi|559498|emb|CAA38173.1| (X54253) ND3 protein [Ascaris suum] >gi|5834882|gnl|NCBI_MITO|ND3_10020 NADH dehydrogenase subunit 3 Length = 111 Frame 3 hits (HSPs): __________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 111 0 50 100 __________________ Annotated Domains: DOMO DM00232: NADHDEHYDROGENASE(UBIQUINONE)CH 1..108 PFAM oxidored_q4: NADH-ubiquinone/plastoquino 33..109 PRODOM PD019171: NU3M(2) 1..33 PRODOM PD007184: NU3M(3) 35..75 PRODOM PD003416: NU3M(2) 77..110 __________________ Plus Strand HSPs: Score = 56 (19.7 bits), Expect = 7.3, P = 1.0 Identities = 13/40 (32%), Positives = 23/40 (57%), Frame = +3 Query: 48 LIILFACVLFTLFLYLAFFLANIA*DTMLC--FYQLQFNGFD 167 +++L VLFTL L F++ N + C FY+ + + F+ Sbjct: 1 MLVLVMVVLFTLVLLFVFYIGNFV---LSCKDFYKNKISSFE 39 >gi|3769651|gb|AAC64600.1| (AF091580) olfactory receptor [Rattus norvegicus] Length = 221 Frame 3 hits (HSPs): ________ Frame 2 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 221 0 50 100 150 200 Plus Strand HSPs: Score = 46 (16.2 bits), Expect = 7.6, Sum P(2) = 1.0 Identities = 10/33 (30%), Positives = 19/33 (57%), Frame = +3 Query: 3 TSTLPRMRERKKISCLIILFACVLFTLFLYLAF 101 ++T+P+M + +I FA L +F ++AF Sbjct: 12 STTVPKMLVNIQTQSKMITFAGCLTQIFFFIAF 44 Score = 31 (10.9 bits), Expect = 7.6, Sum P(2) = 1.0 Identities = 10/34 (29%), Positives = 15/34 (44%), Frame = +2 Query: 83 FPIPSFLFGKYCLRYDALFLSVAI*RF*SFKNCG 184 FP+ LF + L +S A + +F CG Sbjct: 146 FPLCGILFSYSQIFSSVLRVSSARGQHKAFSTCG 179 >gi|3769653|gb|AAC64601.1| (AF091581) olfactory receptor [Rattus norvegicus] Length = 221 Frame 3 hits (HSPs): ________ Frame 2 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 221 0 50 100 150 200 Plus Strand HSPs: Score = 46 (16.2 bits), Expect = 7.6, Sum P(2) = 1.0 Identities = 10/33 (30%), Positives = 19/33 (57%), Frame = +3 Query: 3 TSTLPRMRERKKISCLIILFACVLFTLFLYLAF 101 ++T+P+M + +I FA L +F ++AF Sbjct: 12 STTVPKMLVNIQTQSKMITFAGCLTQIFFFIAF 44 Score = 31 (10.9 bits), Expect = 7.6, Sum P(2) = 1.0 Identities = 10/34 (29%), Positives = 15/34 (44%), Frame = +2 Query: 83 FPIPSFLFGKYCLRYDALFLSVAI*RF*SFKNCG 184 FP+ LF + L +S A + +F CG Sbjct: 146 FPLCGILFSYSQIFSSVLRVSSARGQHKAFSTCG 179 >gi|122782|sp|P20864|HBP_VICFA POTENTIAL HEME-BINDING PROTEIN Length = 76 Frame 3 hits (HSPs): __________________________ Annotated Domains: ___________ __________________________________________________ Database sequence: | | | | | 76 0 20 40 60 __________________ Annotated Domains: Entrez Transmembrane region: POTENTIAL. 1..17 Entrez metal-binding site: IRON (HEME AXIAL LIG 12 __________________ Plus Strand HSPs: Score = 55 (19.4 bits), Expect = 8.9, P = 1.0 Identities = 10/39 (25%), Positives = 24/39 (61%), Frame = +3 Query: 39 ISCLIILFACVLFTLFLYLAF-FLANIA*DTMLCFYQLQ 152 +SCL+ +F +L T+F Y F +L ++ ++ ++ ++ Sbjct: 37 LSCLVSIFPVILDTIFKYSIFRYLNRVSPSLVVIYHSMK 75 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.92 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.352 0.157 0.509 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.355 0.166 0.599 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.382 0.178 0.704 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.355 0.157 0.459 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.341 0.148 0.450 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.362 0.157 0.522 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 60 60 10. 57 3 12 22 0.098 30 26 0.10 30 +2 0 61 60 10. 57 3 12 22 0.098 30 26 0.10 30 +1 0 61 60 10. 57 3 12 22 0.098 30 26 0.10 30 -1 0 61 61 10. 57 3 12 22 0.10 30 26 0.10 30 -2 0 61 60 10. 57 3 12 22 0.098 30 26 0.10 30 -3 0 60 60 10. 57 3 12 22 0.098 30 26 0.10 30 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 8:50 PM CDT May 27, 2000 Format: BLAST # of letters in database: 158,518,215 # of sequences in database: 505,245 # of database sequences satisfying E: 12 No. of states in DFA: 565 (56 KB) Total size of DFA: 98 KB (128 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 4 Search cpu time: 74.24u 1.06s 75.30t Elapsed: 00:00:25 Total cpu time: 74.27u 1.10s 75.37t Elapsed: 00:00:26 Start: Wed Feb 14 16:26:00 2001 End: Wed Feb 14 16:26:26 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000