BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1.

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84.



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= 'D17H01_O13_15.ab1' (585 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 5 Sequences     : less than 5 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 1059 250 |==================================================
   6310  809 188 |=====================================
   3980  621 130 |==========================
   2510  491 156 |===============================
   1580  335 105 |=====================
   1000  230  63 |============
    631  167  27 |=====
    398  140  48 |=========
    251   92  28 |=====
    158   64  11 |==
    100   53  10 |==
   63.1   43   8 |=
   39.8   35   6 |=
   25.1   29   6 |=
   15.8   23   1 |:
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 22  <<<<<<<<<<<<<<<<<
   10.0   22   1 |:
   6.31   21   1 |:
   3.98   20   0 |
   2.51   20   2 |:
   1.58   18   1 |:
   1.00   17   2 |:
   0.63   15   0 |
   0.40   15   3 |:
   0.25   12   1 |:
   0.16   11   1 |:
   0.10   10   0 |
  0.063   10   2 |:
  0.040    8   1 |:
  0.025    7   0 |
  0.016    7   1 |:
  0.010    6   1 |:
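
The three columns of the histogram above read as: the E threshold, the cumulative number of database sequences observed at or below that threshold, and the number falling in that bin alone (the cumulative count minus the cumulative count on the next row). The following Python sketch reproduces the first rows under the assumption that a bar is one '=' per 5 sequences, rounded down, with ':' standing for 1 to 4 sequences. The function names are illustrative only and are not part of WU-BLAST.

```python
def bin_counts(cumulative):
    """Per-bin counts from cumulative counts listed from largest E to smallest."""
    return [a - b for a, b in zip(cumulative, cumulative[1:])]


def bar(count, unit=5):
    """Render one bar: '=' per `unit` sequences, ':' for fewer than `unit`."""
    if count >= unit:
        return "=" * (count // unit)
    return ":" if count > 0 else ""


# Cumulative counts copied from the first six rows of the histogram.
cumulative = [1059, 809, 621, 491, 335, 230]
for n in bin_counts(cumulative):
    print(f"{n:4d} |{bar(n)}")
```

The last row of any such list has no following row to subtract, so its bin count cannot be recovered from the cumulative column alone.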


                                                                      Smallest
                                                                        Sum
                                                      Reading  High  Probability
Sequences producing High-scoring Segment Pairs:         Frame Score  P(N)      N
gi|11283525|pir||T47912 hypothetical protein T20K12.10... +3   333  3.9e-29   1
gi|1903364|gb|AAB70447.1| (AC000104) EST gb|T45093 com... +3   177  1.3e-12   1
gi|4210351|emb|CAA21139.1| (AL031775) dJ30M3.1 (novel ... +3   111  1.7e-05   1
gi|8923812|ref|NP_060943.1| uncharacterized hypothalam... +3   111  1.7e-05   1
gi|465883|sp|P34419|YLZ6_CAEEL HYPOTHETICAL 20.1 KD PR... +3   115  0.00016   1
gi|7484139|pir||S74052 hypothetical protein c0116 - Su... +3    96  0.0085    1
gi|7496519|pir||T15630 hypothetical protein C25H3.3 - ... +3   105  0.011     1
gi|7630243|dbj|BAA94776.1| (AP001859) hypothetical pro... +3    97  0.025     1
gi|7630244|dbj|BAA94777.1| (AP001859) Similar to Arabi... +3    96  0.040     1
gi|7491588|pir||T40205 hypothetical protein SPBC31F10.... +3    94  0.052     1
gi|11350388|pir||E82995 hypothetical protein PA5202 [i... +3    87  0.11      1
gi|11499845|ref|NP_071089.1| conserved hypothetical pr... +3    89  0.18      1
gi|5733877|gb|AAD49765.1|AC007932_13 (AC007932) F11A17... +3    88  0.25      1
gi|7292257|gb|AAF47666.1| (AE003475) CG16985 gene prod... +3    87  0.27      1
gi|11350055|pir||A83149 hypothetical protein PA3971 [i... +3    86  0.29      1
gi|7473619|pir||E75289 probable phenylacetic acid degr... +3    84  0.51      1
gi|7292258|gb|AAF47667.1| (AE003475) CG16986 gene prod... +3    83  0.58      1
gi|141254|sp|P20378|YPHR_HALHA HYPOTHETICAL 15.6 KD PR... +3    82  0.77      1
gi|7471259|pir||E75467 ComA-related protein - Deinococ... +3    77  0.86      1
gi|3980377|gb|AAC95180.1| (AC004561) unknown protein [... +3    81  0.90      1
gi|11350298|pir||B83042 hypothetical protein PA4830 [i... +3    79  0.997     1
gi|10177184|dbj|BAB10318.1| (AB017061) gb|AAD49765.1~g... +3    77  0.9992    1



>gi|11283525|pir||T47912  hypothetical protein T20K12.100 - Arabidopsis thaliana
            >gi|6850887|emb|CAB71050.1| (AL137898) putative protein
            [Arabidopsis thaliana]
            Length = 188

Frame  3 hits (HSPs):          ___________________________________________
                        __________________________________________________
Database sequence:     |             |            |            |          | 188
                       0            50          100          150

  Plus Strand HSPs:

 Score = 333 (117.2 bits), Expect = 3.9e-29, P = 3.9e-29
 Identities = 68/161 (42%), Positives = 107/161 (66%), Frame = +3

Query:     6 ISKEVDPSHASETLRIVNAMGAATPIPANCNARGFYDAFLRSF---IKVDHIQRGRISCT 176
             +SK +DP++    L + +   A +P   +CN    +D+F   F    +   I RGR+SC+
Sbjct:    29 VSKVIDPNYV---LMVADFFKAISP-DESCNDFTSFDSFSVLFQNNTRALSIARGRVSCS 84

Query:   177 VVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSATPANEEV 356
             V   P I N +  LHGG+V S+ E ++ AC +TVV++DK LF+GE+S+SYLS+ P + E+
Sbjct:    85 VTVTPGISNFFKGLHGGAVASIAERVAMACVKTVVSEDKHLFIGELSMSYLSSAPISSEL 144

Query:   357 LANASVVKTGRNLTVVAVEFKLKKAGNLLYITHSTFYNMPVASL 488
             L   +VV+TGRNL+VV VEFK+K+   + Y++ +TFY+ P++ L
Sbjct:   145 LVEGTVVRTGRNLSVVTVEFKIKETMKVTYLSRATFYHSPISKL 188
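
The Expect and P values printed for each HSP are related by Poisson statistics, P = 1 - exp(-E). For a strong hit such as the one above the two are indistinguishable (3.9e-29 in both fields), while for the weaker hits later in this report they separate (for example Expect = 0.20 with P = 0.18). The Python sketch below illustrates the conversion; the function name is illustrative, every hit in this report has N = 1, and the report's own figures are rounded, so agreement is only to the printed precision.

```python
import math


def p_from_expect(e):
    """Probability of at least one chance hit given an expected count e (Poisson)."""
    # expm1 keeps precision for very small e, where 1 - exp(-e) would round to 0.
    return -math.expm1(-e)


# Expect values taken from HSPs in this report.
for e in (3.9e-29, 0.20, 0.72, 5.8):
    print(f"Expect = {e:g}  ->  P = {p_from_expect(e):.3g}")
```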


>gi|1903364|gb|AAB70447.1|  (AC000104) EST gb|T45093 comes from this gene.
            [Arabidopsis thaliana]
            Length = 155

Frame  3 hits (HSPs):        _____________________________________________
                        __________________________________________________
Database sequence:     |               |               |                | | 155
                       0              50             100              150

  Plus Strand HSPs:

 Score = 177 (62.3 bits), Expect = 1.3e-12, P = 1.3e-12
 Identities = 44/140 (31%), Positives = 76/140 (54%), Frame = +3

Query:    69 AATPIPANCNARGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVE 248
             A  P+ A    R F + F+ + +KVD I+ GRI C++   P + N    LHGG+  +LV+
Sbjct:    18 AKEPMVAKLPHR-FLERFVTNGLKVDLIEPGRIVCSMKIPPHLLNAGKFLHGGATATLVD 76

Query:   249 ILSNACARTVVAKDKELFLGEISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKK 428
             ++ +A   T  A    + + EI++SYL A   +EE+   +  ++ G+ + VV+VE + K 
Sbjct:    77 LIGSAVIYTAGASHSGVSV-EINVSYLDAAFLDEEIEIESKALRVGKAVAVVSVELRKKT 135

Query:   429 AGNLLYITHSTFYNMPVASL 488
              G ++     T Y  P ++L
Sbjct:   136 TGKIIAQGRHTKYFAPRSNL 155


>gi|4210351|emb|CAA21139.1|  (AL031775) dJ30M3.1 (novel protein similar to
            (predicted) plant, worm, yeast and archaea bacterial proteins)
            [Homo sapiens]
            Length = 113

Frame  3 hits (HSPs):      __________________________________________     
                        __________________________________________________
Database sequence:     |                     |                     |      | 113
                       0                    50                   100

  Plus Strand HSPs:

 Score = 111 (39.1 bits), Expect = 1.7e-05, P = 1.7e-05
 Identities = 26/95 (27%), Positives = 47/95 (49%), Frame = +3

Query:   159 GRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSAT 338
             G++ C +  +    N  GTLHGG   +LV+ +S   A     +       +++I+Y+S  
Sbjct:     9 GKVICEMKVEEEHTNAIGTLHGGLTATLVDNISTM-ALLCTERGAPGVSVDMNITYMSPA 67

Query:   339 PANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLL 443
                E+++  A V+K G+ L   +V+   K  G L+
Sbjct:    68 KLGEDIVITAHVLKQGKTLAFTSVDLTNKATGKLI 102


>gi|8923812|ref|NP_060943.1| uncharacterized hypothalamus protein HT012 [Homo
            sapiens] >gi|11418468|ref|XP_004262.1| uncharacterized hypothalamus
            protein HT012 [Homo sapiens] >gi|7020647|dbj|BAA91215.1| (AK000508)
            unnamed protein product [Homo sapiens]
            >gi|7677052|gb|AAF67006.1|AF155649_1 (AF155649) hypothetical 15 kDa
            protein [Homo sapiens] >gi|7689023|gb|AAF67651.1|AF220186_1
            (AF220186) uncharacterized hypothalamus protein HT012 [Homo
            sapiens] >gi|12654153|gb|AAH00894.1|AAH00894 (BC000894)
            uncharacterized hypothalamus protein HT012 [Homo sapiens]
            Length = 140

Frame  3 hits (HSPs):               __________________________________    
                        __________________________________________________
Database sequence:     |                 |                 |              | 140
                       0                50               100

  Plus Strand HSPs:

 Score = 111 (39.1 bits), Expect = 1.7e-05, P = 1.7e-05
 Identities = 26/95 (27%), Positives = 47/95 (49%), Frame = +3

Query:   159 GRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSAT 338
             G++ C +  +    N  GTLHGG   +LV+ +S   A     +       +++I+Y+S  
Sbjct:    36 GKVICEMKVEEEHTNAIGTLHGGLTATLVDNISTM-ALLCTERGAPGVSVDMNITYMSPA 94

Query:   339 PANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLL 443
                E+++  A V+K G+ L   +V+   K  G L+
Sbjct:    95 KLGEDIVITAHVLKQGKTLAFTSVDLTNKATGKLI 129


>gi|465883|sp|P34419|YLZ6_CAEEL  HYPOTHETICAL 20.1 KD PROTEIN F42H10.6 IN
            CHROMOSOME III >gi|1078863|pir||S44652 f42h10.6 protein -
            Caenorhabditis elegans >gi|289680|gb|AAA28024.1| (L08403) putative
            [Caenorhabditis elegans]
            Length = 184

Frame  3 hits (HSPs):                         _____________________       
Annotated Domains:         __________________________________________     
                        __________________________________________________
Database sequence:     |             |            |             |         | 184
                       0            50          100           150
__________________

Annotated Domains:
   PRODOM               PD006741: Q18187(2)                      14..165
__________________


  Plus Strand HSPs:

 Score = 115 (40.5 bits), Expect = 0.00016, P = 0.00016
 Identities = 26/77 (33%), Positives = 41/77 (53%), Frame = +3

Query:   210 GTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSATPANEEVLANASVVKTGR 389
             GTLHGG   +L ++++ A A  V  KDK +   E+++SYL      + +   A V+K GR
Sbjct:    84 GTLHGGQTATLTDVIT-ARAVGVTVKDKGMASVELAVSYLLPVKVGDVLEITAHVLKVGR 142

Query:   390 NLTVVAVEFKLKKAGNL 440
              +     EF+ K  G +
Sbjct:   143 TMAFTDCEFRRKSDGKM 159


>gi|7484139|pir||S74052  hypothetical protein c0116 - Sulfolobus solfataricus
            >gi|1707746|emb|CAA69466.1| (Y08256) orf c01016 [Sulfolobus
            solfataricus] >gi|12313068|emb|CAC23784.1| (AL512975) ORF-c01_016
            [Sulfolobus solfataricus]
            Length = 140

Frame  3 hits (HSPs):           __________________________________        
                        __________________________________________________
Database sequence:     |                 |                 |              | 140
                       0                50               100

  Plus Strand HSPs:

 Score = 96 (33.8 bits), Expect = 0.0085, P = 0.0085
 Identities = 28/95 (29%), Positives = 46/95 (48%), Frame = +3

Query:   135 IKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVV-AKDKELFLGE 311
             +KV ++++GR    +  K     R G LHGG + S ++I     A TV  A D+     E
Sbjct:    24 VKVINLEKGRAVVEIPYKEEFTRRGGVLHGGIIMSAIDITGGLAALTVNDAMDQ--VTQE 81

Query:   312 ISISYLSATPANEEVLANASVVKTGRNLTVVAVEFK 419
             + I++L         +    V++ G  + VV +EFK
Sbjct:    82 LKINFLEPMYKGPFTI-EGKVLRKGSTVIVVEIEFK 116


>gi|7496519|pir||T15630  hypothetical protein C25H3.3 - Caenorhabditis elegans
            >gi|868253|gb|AAA68782.1| (U29535) C25H3.3 gene product
            [Caenorhabditis elegans]
            Length = 273

Frame  3 hits (HSPs):        __________________     _____________________ 
                        __________________________________________________
Database sequence:     |        |         |        |        |        |    | 273
                       0       50       100      150      200      250

  Plus Strand HSPs:

 Score = 105 (37.0 bits), Expect = 0.011, P = 0.011
 Identities = 28/114 (24%), Positives = 56/114 (49%), Frame = +3

Query:   111 YDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKD 290
             Y A  R+ ++  H + G +      +    N++ TLHGG   +L++  +      ++ K+
Sbjct:   154 YAAGARN-VRAVHAEEGNLRVEFEVEKDQTNQFETLHGGCTAALIDCFTTGAL--LLTKE 210

Query:   291 KELFLG-EISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLLYIT 452
                 +  ++ I+YL+A    E ++ N++V+K GR+L     E   +K  N +  T
Sbjct:   211 ARPGVSVDLHITYLTAANIGETLVLNSTVIKQGRSLGFTKAEL-YRKRDNAMIAT 264

 Score = 96 (33.8 bits), Expect = 0.13, P = 0.13
 Identities = 21/97 (21%), Positives = 45/97 (46%), Frame = +3

Query:   135 IKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEI 314
             ++  H + G +      +    N + TLHGG   +L++I +   A  +    +     ++
Sbjct:    31 VRAVHAEEGNLRVEFEVEKDQSNHFNTLHGGCTSTLIDIFTTG-ALLLTKPARPGVSVDL 89

Query:   315 SISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLK 425
              ++YL+A    E ++ +++V+K G+ L     E   K
Sbjct:    90 HVTYLTAAKIGETLVLDSTVIKQGKTLAFTKAELYRK 126


>gi|7630243|dbj|BAA94776.1|  (AP001859) hypothetical protein [Oryza sativa]
            Length = 167

Frame  3 hits (HSPs):             ___________________________________     
                        __________________________________________________
Database sequence:     |              |              |              |     | 167
                       0             50            100            150

  Plus Strand HSPs:

 Score = 97 (34.1 bits), Expect = 0.026, P = 0.025
 Identities = 28/116 (24%), Positives = 55/116 (47%), Frame = +3

Query:   102 RGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVV 281
             R  ++A   +  +V   + GR  C++     + +  G  H G++ +  +   + CA  ++
Sbjct:    35 RRAFNALPLAGARVSLAEAGRAVCSLRVTAELTDAEGNWHPGAIAAAAD---DVCAAAIM 91

Query:   282 AKDKELFLG-EISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLLYI 449
             + +  + +     ISY S    +EEV  +  VV+    +T V VE + K +G L+ I
Sbjct:    92 SVEGIIKVSVHYDISYFSPAKLHEEVELDGRVVEQKGKMTAVTVEIRKKDSGELVAI 148


>gi|7630244|dbj|BAA94777.1|  (AP001859) Similar to Arabidopsis thaliana
            chromosome 1 BAC F19P19; unknown protein (AC000104) [Oryza sativa]
            Length = 174

Frame  3 hits (HSPs):    ___________________________________________      
                        __________________________________________________
Database sequence:     |              |             |             |       | 174
                       0             50           100           150

  Plus Strand HSPs:

 Score = 96 (33.8 bits), Expect = 0.041, P = 0.040
 Identities = 39/147 (26%), Positives = 61/147 (41%), Frame = +3

Query:    24 PSHASETLRIVNAMGAATPIPANCNAR----GFYDAF---LRSFIKVDHIQRGRISCTVV 182
             P+ A+  LR+          P +  AR    G  DAF   +    +V   + GR+ C+  
Sbjct:     6 PAAAAAALRLAAVARRWLENPRDSLARSREEGCGDAFNTVVMPGFRVSLAEPGRLVCSFC 65

Query:   183 AKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDK-ELFLGEISISYLSATPANEEVL 359
                 + +  G  H G++ + V+   N CA  V   D    F    ++S+ S     EEV 
Sbjct:    66 VPAAVADADGRWHAGAMAAAVD---NLCAAVVYTADGVHRFTISQAMSFFSPAAHGEEVE 122

Query:   360 ANASVVKTGRNLTVVAVEFKLKKAGNLLYI 449
              +  V      LT   VE + K +G L+ I
Sbjct:   123 MDGRVAHRKGKLTAAVVEVRRKASGELVAI 152


>gi|7491588|pir||T40205  hypothetical protein SPBC31F10.02 - fission yeast
            (Schizosaccharomyces pombe) >gi|2226413|emb|CAB10079.1| (Z97204)
            hypothetical protein [Schizosaccharomyces pombe]
            Length = 161

Frame  3 hits (HSPs):         __________________________________          
                        __________________________________________________
Database sequence:     |               |              |               |   | 161
                       0              50            100             150

  Plus Strand HSPs:

 Score = 94 (33.1 bits), Expect = 0.053, P = 0.052
 Identities = 29/107 (27%), Positives = 53/107 (49%), Frame = +3

Query:    96 NARGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACART 275
             N  GF DA + S I++     G + C++  +    NR G LHGG + +L ++       +
Sbjct:    23 NTNGF-DAHVVSDIQIISAVPGFVECSLKLQKHHLNRMGNLHGGCIAALTDL-----GGS 76

Query:   276 VVAKDKELFLGEISI----SYL-SATPANEEVLANASVVKTGRNLTVVAVEF 416
             +    + LF+  +SI    ++L S       +L +A   + G N+   +V+F
Sbjct:    77 LALASRGLFISGVSIDMNQTFLQSGGTLGSSILLHAKCDRLGSNIAFTSVDF 128


>gi|11350388|pir||E82995  hypothetical protein PA5202 [imported] - Pseudomonas
            aeruginosa (strain PAO1) >gi|9951508|gb|AAG08587.1|AE004933_3
            (AE004933) hypothetical protein [Pseudomonas aeruginosa]
            Length = 129

Frame  3 hits (HSPs):                  _____________________________      
                        __________________________________________________
Database sequence:     |                  |                   |           | 129
                       0                 50                 100

  Plus Strand HSPs:

 Score = 87 (30.6 bits), Expect = 0.11, P = 0.11
 Identities = 23/73 (31%), Positives = 37/73 (50%), Frame = +3

Query:   201 NRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSATPANEEVLANASVVK 380
             NR G +HGG++ SL+++       +    D++    E  I+Y+ A  A+ EV   A V+ 
Sbjct:    42 NRGGVMHGGALFSLMDVTMGLACSSSHGFDRQSVTLECKINYIRAV-ADGEVRCVARVLH 100

Query:   381 TGRNLTVVAVEFK 419
              GR   VV  E +
Sbjct:   101 AGRRSLVVEAEVR 113


>gi|11499845|ref|NP_071089.1| conserved hypothetical protein [Archaeoglobus
            fulgidus] >gi|3334444|sp|O28020|YM64_ARCFU HYPOTHETICAL PROTEIN
            AF2264 >gi|7430238|pir||H69532 conserved hypothetical protein
            AF2264 - Archaeoglobus fulgidus >gi|2648253|gb|AAB88986.1|
            (AE000948) conserved hypothetical protein [Archaeoglobus fulgidus]
            Length = 154

Frame  3 hits (HSPs):               ____________________________________  
                        __________________________________________________
Database sequence:     |               |                |               | | 154
                       0              50              100             150

  Plus Strand HSPs:

 Score = 89 (31.3 bits), Expect = 0.20, P = 0.18
 Identities = 28/112 (25%), Positives = 47/112 (41%), Frame = +3

Query:   138 KVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEIS 317
             ++  ++ G     +V K    N     HGG + SL ++   A A    +  K     E+S
Sbjct:    39 RILEMKEGYAKVEMVVKKEHLNAANVCHGGIIFSLADL---AFALASNSHGKLALAIEVS 95

Query:   318 ISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLLYITHSTFYNM 473
             I+Y+ A    E+++A A  V  G       +E K   A  L+ +   T Y +
Sbjct:    96 ITYMKAAYEGEKLVAEAKEVNLGNKTATYLMEVK-NSANKLIALAKGTVYRV 146


>gi|5733877|gb|AAD49765.1|AC007932_13  (AC007932) F11A17.13 [Arabidopsis
            thaliana]
            Length = 156

Frame  3 hits (HSPs):          ______________________________________     
                        __________________________________________________
Database sequence:     |               |               |               |  | 156
                       0              50             100             150

  Plus Strand HSPs:

 Score = 88 (31.0 bits), Expect = 0.28, P = 0.25
 Identities = 32/117 (27%), Positives = 56/117 (47%), Frame = +3

Query:   144 DHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLG-EISI 320
             D +   RI+  +   P  C  +  LHGG    + E L++  A   +A   +   G ++SI
Sbjct:    23 DELSPTRITGRLPVSPVCCQPFKVLHGGVSALIAESLASMGAH--MASGFKRVAGIQLSI 80

Query:   321 SYLSATPANEEVLANASVVKTGRNLTVVAVE-FKL--KKAGNLLYITHST---FYNMPV 479
             ++L +    + V A A+ V TG+ + V  V+ +K   K   N + I+ S      N+P+
Sbjct:    81 NHLKSADLGDLVFAEATPVSTGKTIQVWEVKLWKTTQKDKANKILISSSRVTLICNLPI 139


>gi|7292257|gb|AAF47666.1|  (AE003475) CG16985 gene product [Drosophila
            melanogaster]
            Length = 149

Frame  3 hits (HSPs):         ______________________________________      
                        __________________________________________________
Database sequence:     |                |                |                | 149
                       0               50              100

  Plus Strand HSPs:

 Score = 87 (30.6 bits), Expect = 0.32, P = 0.27
 Identities = 30/115 (26%), Positives = 56/115 (48%), Frame = +3

Query:    99 ARGFYDAFLRSFIKVDHIQRGR-ISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACART 275
             + GF D  L+  IK+     GR I    VA   + NR GTLHGG   ++V+   N     
Sbjct:    21 SNGF-DRVLK-MIKITGGGDGRAIGEFTVANEHL-NRQGTLHGGLTATIVD---NCTTYA 74

Query:   276 VVAKDKELFL-GEISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLL 443
             +++K     +   +++SY++A    E +  + + V+ G+ +  +    + K  G ++
Sbjct:    75 LMSKGSHPGVTANLNVSYIAAAKPGELIEIDCNTVRAGKKMAYLDCILRRKSDGKII 131


>gi|11350055|pir||A83149  hypothetical protein PA3971 [imported] - Pseudomonas
            aeruginosa (strain PAO1) >gi|9950161|gb|AAG07358.1|AE004815_2
            (AE004815) hypothetical protein [Pseudomonas aeruginosa]
            Length = 143

Frame  3 hits (HSPs):                  __________________________________ 
                        __________________________________________________
Database sequence:     |                 |                |               | 143
                       0                50              100

  Plus Strand HSPs:

 Score = 86 (30.3 bits), Expect = 0.35, P = 0.29
 Identities = 28/100 (28%), Positives = 48/100 (48%), Frame = +3

Query:   189 PPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSATPAN-EEVLAN 365
             P    + G +H G   +L +  + A A T+V + + +   E  ++ L   PA  + +L  
Sbjct:    44 PRHAQQNGFIHAGVQATLADHAAGAAAATLVEEGQTVLTLEFKLNLLR--PALCQRLLCR 101

Query:   366 ASVVKTGRNLTVVAVEFKLKKAGNLLYITHSTFYNMPVASL 488
             A V+K GR +TVV  E   ++ G     + +T   M V +L
Sbjct:   102 AEVLKAGRQVTVVEAEVFAERDGRRHLFSKATV-TMAVVAL 141


>gi|7473619|pir||E75289  probable phenylacetic acid degradation protein PaaI -
            Deinococcus radiodurans (strain R1)
            >gi|6460126|gb|AAF11862.1|AE002063_5 (AE002063) phenylacetic acid
            degradation protein PaaI, putative [Deinococcus radiodurans]
            Length = 146

Frame  3 hits (HSPs):       _________________________________________     
                        __________________________________________________
Database sequence:     |                |                |                | 146
                       0               50              100

  Plus Strand HSPs:

 Score = 84 (29.6 bits), Expect = 0.72, P = 0.51
 Identities = 35/130 (26%), Positives = 58/130 (44%), Frame = +3

Query:    54 VNAMGAATPIPANCNARGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSV 233
             V+A  A TP P    A  + +    + +        R++ TV       N +GT HGG +
Sbjct:    13 VSAPPARTPYP---EAMSYAEVLGMTILDASP-DLTRVALTVTEAG--LNMHGTAHGGLI 66

Query:   234 GSLVE----ILSNACARTVVAKDKELFLGEISISYLSATPANEEVLANASVVKTGRNLTV 401
              SL +    ++SN  A+ V A        E  +S+  A    E ++A A+  + GR L  
Sbjct:    67 FSLADEAFAVISNLDAQAVAA--------ETHMSFFRAAREGERLVAVATPERVGRTLAT 118

Query:   402 VAVEFKLKKAGNLL 443
               +E +  + G +L
Sbjct:   119 YRIEVRRGEEGEVL 132


>gi|7292258|gb|AAF47667.1|  (AE003475) CG16986 gene product [Drosophila
            melanogaster]
            Length = 143

Frame  3 hits (HSPs):             __________________________________      
                        __________________________________________________
Database sequence:     |                 |                |               | 143
                       0                50              100

  Plus Strand HSPs:

 Score = 83 (29.2 bits), Expect = 0.87, P = 0.58
 Identities = 27/97 (27%), Positives = 48/97 (49%), Frame = +3

Query:   138 KVDHIQRGRISCTVVAK--PPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLG- 308
             KV  +  G  +CT   K      N Y  LHGG + +LV++++     T     K    G 
Sbjct:    30 KVKIVDGGDGACTAELKVDQDHVNLYKFLHGGYIMTLVDLIT-----TYALMSKPCHPGV 84

Query:   309 --EISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKK 428
               ++S+++L+     ++V+  A++ K G+ L  +    K KK
Sbjct:    85 SVDLSVNFLNGAKLGDDVVIQANLSKVGKYLAFIDCTLKHKK 126


>gi|141254|sp|P20378|YPHR_HALHA  HYPOTHETICAL 15.6 KD PROTEIN IN PHR 5'REGION
            >gi|99200|pir||A32580 hypothetical 15K protein
            (deoxyribodipyrimidine photo-lyase 5' region) - Halobacterium
            salinarum >gi|148792|gb|AAA72748.1| (M24544) ORF151 [Halobacterium
            halobium] >gi|10580850|gb|AAG19672.1| (AE005055) Vng1336c
            [Halobacterium sp. NRC-1]
            Length = 151

Frame  3 hits (HSPs):                     _______________________         
Annotated Domains:      ________________________________________________  
                        __________________________________________________
Database sequence:     |                |               |                || 151
                       0               50             100              150
__________________

Annotated Domains:
   PRODOM               PD175497: YPHR_HALHA                     1..23
   PRODOM               PD006741: Q18187(2)                      25..144
__________________


  Plus Strand HSPs:

 Score = 82 (28.9 bits), Expect = 1.5, P = 0.77
 Identities = 19/67 (28%), Positives = 37/67 (55%), Frame = +3

Query:   210 GTLHGGSVGSLVEILSNACARTVVAKDKELFLG--EISISYLSATPANEEVLANASVVKT 383
             G +HGG   +L++       R+ + K     +   ++++SYL   PA  +++A+ASVV+ 
Sbjct:    56 GDVHGGIAATLIDTAGGLAVRSALPKPVAANVATIDLNVSYLR--PARGDLIADASVVRV 113

Query:   384 GRNLTVVAV 410
             G  + V  +
Sbjct:   114 GSTVGVAEI 122


>gi|7471259|pir||E75467  ComA-related protein - Deinococcus radiodurans (strain
            R1) >gi|6458566|gb|AAF10426.1|AE001939_3 (AE001939) ComA-related
            protein [Deinococcus radiodurans]
            Length = 119

Frame  3 hits (HSPs):    ______________________________________________   
                        __________________________________________________
Database sequence:     |                    |                    |        | 119
                       0                   50                  100

  Plus Strand HSPs:

 Score = 77 (27.1 bits), Expect = 1.9, P = 0.86
 Identities = 27/109 (24%), Positives = 46/109 (42%), Frame = +3

Query:   135 IKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLG-E 311
             I+  HI+RG +   +  +P +    G LH  SV +L +       R ++  +   F   E
Sbjct:     4 IRFTHIERGLLRSELTVRPELFAPNGYLHAASVVALADTTCGYGTRVLLPDEATGFTTIE 63

Query:   312 ISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLLYITHST 461
             +  ++L  T     V   A  V  GR   V   E +  + GN++ +   T
Sbjct:    64 LKSNHLG-TSRQGVVTCEARAVHAGRTTQVWDAEVR-NEQGNVMALFRCT 111


>gi|3980377|gb|AAC95180.1|  (AC004561) unknown protein [Arabidopsis thaliana]
            Length = 157

Frame  3 hits (HSPs):            _______________________________________  
                        __________________________________________________
Database sequence:     |               |               |               |  | 157
                       0              50             100             150

  Plus Strand HSPs:

 Score = 81 (28.5 bits), Expect = 2.3, P = 0.90
 Identities = 31/121 (25%), Positives = 61/121 (50%), Frame = +3

Query:   102 RGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVV 281
             + FY+ F    I+V+ ++ G ISC+      + +R   L  G++ +LV+ +  A    V 
Sbjct:    31 KSFYENFSLRGIRVNRVEPGFISCSFKVPLRLTDRDKNLANGAIANLVDEVGGAL---VH 87

Query:   282 AKDKELFLG-EISISYLSATPANEEVLANASVV--KTGRNLTVVAVEFKLKKAGNLLYI- 449
              +   + +  ++SI++LS     EE+   + ++  + G   T+V V  K+   G ++   
Sbjct:    88 GEGLPMSVSVDMSIAFLSKAKLGEELEITSRLLGERGGYKGTIVVVRNKM--TGEIIAEG 145

Query:   450 THSTF 464
              HS F
Sbjct:   146 RHSMF 150


>gi|11350298|pir||B83042  hypothetical protein PA4830 [imported] - Pseudomonas
            aeruginosa (strain PAO1) >gi|9951099|gb|AAG08215.1|AE004896_5
            (AE004896) hypothetical protein [Pseudomonas aeruginosa]
            Length = 179

Frame  3 hits (HSPs):                          _________________________  
                        __________________________________________________
Database sequence:     |             |             |             |        | 179
                       0            50           100           150

  Plus Strand HSPs:

 Score = 79 (27.8 bits), Expect = 5.8, P = 1.0
 Identities = 29/87 (33%), Positives = 40/87 (45%), Frame = +3

Query:   201 NRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSA-TPANEEVLANASVV 377
             N  G++HGG   SL++        T +   +     ++ ISYL A T     V A   VV
Sbjct:    85 NPLGSVHGGYAASLLDSCMGCAIHTRLQAGQGYTTTDLRISYLRALTDKVGPVRAEGRVV 144

Query:   378 KTGRNLTVVAVEFKLKKAGNLLYITHST 461
               GR+ T VA E +L    + LY   ST
Sbjct:   145 HLGRS-TAVA-EGRLYDVDDRLYAVGST 170


>gi|10177184|dbj|BAB10318.1|  (AB017061) gb|AAD49765.1~gene_id:K19E20.6~similar
            to unknown protein [Arabidopsis thaliana]
            Length = 157

Frame  3 hits (HSPs):         _______________________________             
                        __________________________________________________
Database sequence:     |               |               |               |  | 157
                       0              50             100             150

  Plus Strand HSPs:

 Score = 77 (27.1 bits), Expect = 7.2, P = 1.0
 Identities = 26/95 (27%), Positives = 42/95 (44%), Frame = +3

Query:   144 DHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISIS 323
             D +   R+S  +      C  +  LHGG    + E L++  A  + +  K +    +SI 
Sbjct:    22 DELSATRVSGHLTLTEKCCQPFKVLHGGVSALIAEALASLGAG-IASGFKRVAGIHLSIH 80

Query:   324 YLSATPANEEVLANASVVKTGRNLTVVAVE-FKLKK 428
             +L      E V A +  V  G+N+ V  V  +K KK
Sbjct:    81 HLRPAALGEIVFAESFPVSVGKNIQVWEVRLWKAKK 116


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.98

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.325   0.139   0.405  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.359   0.159   0.702  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.342   0.149   0.509  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.358   0.157   0.654  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.328   0.140   0.431  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.348   0.149   0.474  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
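
The bit scores printed beside each raw score in the alignments (for example "Score = 333 (117.2 bits)") are numerically consistent with bits = Lambda * S / ln 2, using the "As Used" Lambda of 0.244 from the Q=9,R=2 rows above. This is an observation drawn from the figures in this report, not a statement of how WU-BLAST computes the value internally; the function name below is illustrative. A small Python check:

```python
import math

GAPPED_LAMBDA = 0.244  # "As Used" Lambda on the Q=9,R=2 rows of this report


def bits(raw_score, lam=GAPPED_LAMBDA):
    """Convert a raw alignment score to bits, assuming bits = lambda * S / ln 2."""
    return lam * raw_score / math.log(2)


# Raw scores taken from HSPs in this report.
for s in (333, 177, 111, 96):
    print(f"Score = {s} ({bits(s):.1f} bits)")
```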

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      194       194       10.  76 3  12 22  0.091   35
                                                    31  0.10    38
   +2      0      194       194       10.  76 3  12 22  0.091   35
                                                    31  0.10    38
   +1      0      195       195       10.  76 3  12 22  0.091   35
                                                    31  0.10    38
   -1      0      195       195       10.  76 3  12 22  0.091   35
                                                    31  0.10    38
   -2      0      194       194       10.  76 3  12 22  0.091   35
                                                    31  0.10    38
   -3      0      194       194       10.  76 3  12 22  0.091   35
                                                    31  0.10    38
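
The Length column above follows from the 585-letter query: a reading frame starts at offset 0, 1 or 2 on either strand and yields one residue per complete codon. A minimal Python sketch of that arithmetic (the helper name is illustrative, and it assumes trailing partial codons are simply dropped):

```python
def frame_lengths(query_len):
    """Translated length in residues for each of the six reading frames."""
    lengths = {}
    for offset in range(3):
        n = (query_len - offset) // 3  # complete codons from this offset
        lengths[f"+{offset + 1}"] = n  # forward strand
        lengths[f"-{offset + 1}"] = n  # reverse-complement strand, same arithmetic
    return lengths


print(frame_lengths(585))
```

For a 585-letter query this gives 195 residues for frames +1 and -1 and 194 for the other four, matching the table.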


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  22
  No. of states in DFA:  594 (59 KB)
  Total size of DFA:  236 KB (256 KB)
  Time to generate neighborhood:  0.02u 0.00s 0.02t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  208.11u 1.22s 209.33t  Elapsed: 00:00:36
  Total cpu time:  208.18u 1.22s 209.40t  Elapsed: 00:00:36
  Start:  Thu Jan 17 17:27:24 2002   End:  Thu Jan 17 17:28:00 2002

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000