BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract

RepeatMasker Server unavailable.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= B04G02.seq(1>488); (450 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 5 Sequences     : less than 5 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 1154 269 |=====================================================
   6310  885 155 |===============================
   3980  730 233 |==============================================
   2510  497 136 |===========================
   1580  361 114 |======================
   1000  247  92 |==================
    631  155  53 |==========
    398  102  37 |=======
    251   65  10 |==
    158   55  15 |===
    100   40   5 |=
   63.1   35   5 |=
   39.8   30   2 |:
   25.1   28   1 |:
   15.8   27   2 |:
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 25  <<<<<<<<<<<<<<<<<
   10.0   25   3 |:
   6.31   22   0 |
   3.98   22   0 |
   2.51   22   0 |
   1.58   22   2 |:
   1.00   20   0 |
   0.63   20   0 |
   0.40   20   0 |
   0.25   20   0 |
   0.16   20   0 |
   0.10   20   0 |
  0.063   20   0 |
  0.040   20   0 |
  0.025   20   0 |
  0.016   20   0 |
  0.010   20   0 |
 0.0063   20   0 |
 0.0040   20   0 |
 0.0025   20   0 |
 0.0016   20   1 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|9758410|dbj|BAB08952.1|(AB006700) lysine decarboxy... +2   361  4.2e-32   1
gi|11357791|pir||T45885hypothetical protein F4P12.150... +2   345  2.1e-30   1
gi|10140743|gb|AAG13575.1|AC037425_6(AC037425) unknow... +2   325  2.7e-28   1
gi|11358493|pir||T48554lysine decarboxylase-like prot... +2   284  6.0e-24   1
gi|11358492|pir||T48348lysine decarboxylase-like prot... +2   267  3.8e-22   1
gi|4371280|gb|AAD18138.1|(AC006260) hypothetical prot... +2   261  1.6e-21   1
gi|4510370|gb|AAD21458.1|(AC007017) unknown protein [... +2   254  9.1e-21   1
gi|9757778|dbj|BAB08387.1|(AB005240) lysine decarboxy... +2   253  1.2e-20   1
gi|12231051|sp|P48636|YDC3_PSEAEHYPOTHETICAL PROTEIN ... +2   207  8.7e-16   1
gi|7486909|pir||T04966hypothetical protein T12J5.60 -... +2   202  2.9e-15   1
gi|77664|pir||PQ0114hypothetical protein (azu region)... +2   197  1.0e-14   1
gi|7451099|pir||D70033conserved hypothetical protein ... +2   167  1.5e-11   1
gi|10175706|dbj|BAB06803.1|(AP001517) BH3084~unknown ... +2   165  2.4e-11   1
gi|11280345|pir||T45176conserved hypothetical protein... +2   144  4.1e-09   1
gi|10175367|dbj|BAB06465.1|(AP001516) lysine decarbox... +2   144  4.1e-09   1
gi|1169649|sp|P46378|FAS6_RHOFAHYPOTHETICAL 21.1 KD P... +2   132  7.7e-08   1
gi|4337446|gb|AAD18125.1|(U89166) ECORLD_ORF1; simila... +2   122  8.8e-07   1
gi|6322406ref|NP_012480.1| Yjl055wp [Saccharomyces ce... +2   124  1.2e-06   1
gi|10954698ref|NP_066633.1| similar to orf6 gene(unkn... +2   106  0.00013   1
gi|7451100|pir||C70609hypothetical protein Rv1205 - M... +2   103  0.0010    1
gi|7462102|pir||A72302conserved hypothetical protein ... +2    81  0.74      1
gi|12697620|dbj|BAB21615.1|(AB037974) cytochrome oxid... +2    41  0.75      2
gi|1196510|gb|AAA88231.1|(M15467) unknown protein [My... -1    76  0.998     1
gi|7479785|pir||T35807hypothetical protein SC8D9.03 S... +2    76  0.999     1
gi|7451672|pir||H70312hypothetical protein aq_134 - A... +2    74  0.9995    1

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.


to_Entrezto_Relatedto_Related >gi|9758410|dbj|BAB08952.1|  (AB006700) lysine decarboxylase-like protein
            [Arabidopsis thaliana]
            Length = 217

Frame  2 hits (HSPs):   ______________________                            
                        __________________________________________________
Database sequence:     |           |          |           |          |    | 217
                       0          50        100         150        200

  Plus Strand HSPs:

 Score = 361 (127.1 bits), Expect = 4.2e-32, P = 4.2e-32
 Identities = 68/93 (73%), Positives = 80/93 (86%), Frame = +2

Query:   170 MMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVV 349
             M ++ SRF+RICVFCG+S GK PSYQ AAIQL  +LVER IDLVYGGGS+GLMGL+SQ V
Sbjct:     1 MEETKSRFKRICVFCGSSSGKKPSYQEAAIQLGNELVERRIDLVYGGGSVGLMGLVSQAV 60

Query:   350 FDGGRHVLGVIPTTLMPREITGESVGEVESVGE 448
               GGRHVLGVIP TLMPREITGE++GEV++V +
Sbjct:    61 HHGGRHVLGVIPKTLMPREITGETIGEVKAVAD 93


to_Entrezto_Relatedto_Related >gi|11357791|pir||T45885  hypothetical protein F4P12.150 - Arabidopsis thaliana
            >gi|6729496|emb|CAB67652.1| (AL132966) putative protein
            [Arabidopsis thaliana]
            Length = 215

Frame  2 hits (HSPs):   _______________________                           
                        __________________________________________________
Database sequence:     |           |           |          |           |   | 215
                       0          50         100        150         200

  Plus Strand HSPs:

 Score = 345 (121.4 bits), Expect = 2.1e-30, P = 2.1e-30
 Identities = 68/103 (66%), Positives = 81/103 (78%), Frame = +2

Query:   140 MEIEEQTMKMMMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSI 319
             ME+  +TM+      S+F RICVFCG+S GK  SYQ AA+ L  +LV RNIDLVYGGGSI
Sbjct:     1 MEVNNETMQ-----KSKFGRICVFCGSSQGKKSSYQDAAVDLGNELVLRNIDLVYGGGSI 55

Query:   320 GLMGLISQVVFDGGRHVLGVIPTTLMPREITGESVGEVESVGE 448
             GLMGL+SQ V DGGRHV+GVIP TLMPRE+TGE+VGEV +V +
Sbjct:    56 GLMGLVSQAVHDGGRHVIGVIPKTLMPRELTGETVGEVRAVAD 98


to_Entrezto_Relatedto_Related >gi|10140743|gb|AAG13575.1|AC037425_6  (AC037425) unknown protein [Oryza sativa]
            Length = 204

Frame  2 hits (HSPs):   _______________________                           
                        __________________________________________________
Database sequence:     |            |           |           |           | | 204
                       0           50         100         150         200

  Plus Strand HSPs:

 Score = 325 (114.4 bits), Expect = 2.7e-28, P = 2.7e-28
 Identities = 63/88 (71%), Positives = 73/88 (82%), Frame = +2

Query:   185 SRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGR 364
             SRF+RICVFCG+S GK  SY  AAI+L  +LV R+IDLVYGGGSIGLMGL+SQ VFDGGR
Sbjct:     4 SRFKRICVFCGSSQGKKRSYHDAAIELGNELVARSIDLVYGGGSIGLMGLVSQAVFDGGR 63

Query:   365 HVLGVIPTTLMPREITGESVGEVESVGE 448
             HV+GVIP TLM  EI+GE+VGEV  V +
Sbjct:    64 HVIGVIPKTLMTPEISGETVGEVRPVAD 91


to_Entrezto_Relatedto_Related >gi|11358493|pir||T48554  lysine decarboxylase-like protein - Arabidopsis
            thaliana >gi|7573362|emb|CAB87668.1| (AL163812) lysine
            decarboxylase-like protein [Arabidopsis thaliana]
            Length = 215

Frame  2 hits (HSPs):    _____________________                            
                        __________________________________________________
Database sequence:     |           |           |          |           |   | 215
                       0          50         100        150         200

  Plus Strand HSPs:

 Score = 284 (100.0 bits), Expect = 6.0e-24, P = 6.0e-24
 Identities = 54/88 (61%), Positives = 69/88 (78%), Frame = +2

Query:   185 SRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGR 364
             SRFR+ICVFCG+  G    +  AAI+L  +LV+R IDLVYGGGS+GLMGLIS+ V++GG 
Sbjct:     7 SRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRRVYEGGL 66

Query:   365 HVLGVIPTTLMPREITGESVGEVESVGE 448
             HVLG+IP  LMP EI+GE+VG+V  V +
Sbjct:    67 HVLGIIPKALMPIEISGETVGDVRVVAD 94


to_Entrezto_Relatedto_Related >gi|11358492|pir||T48348  lysine decarboxylase-like protein - Arabidopsis
            thaliana >gi|7413604|emb|CAB86094.1| (AL163002) lysine
            decarboxylase-like protein [Arabidopsis thaliana]
            Length = 229

Frame  2 hits (HSPs):   _________________________                         
                        __________________________________________________
Database sequence:     |          |          |          |          |      | 229
                       0         50        100        150        200

  Plus Strand HSPs:

 Score = 267 (94.0 bits), Expect = 3.8e-22, P = 3.8e-22
 Identities = 61/114 (53%), Positives = 73/114 (64%), Frame = +2

Query:   140 MEIEEQTMKMMMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSI 319
             ME EE   +M  K SSRF+ ICVFCG+S G   SYQ AAI LAK+LV R IDLVYGGGSI
Sbjct:     1 MENEEGKREMTKKQSSRFKSICVFCGSSNGNKASYQDAAIDLAKELVMRKIDLVYGGGSI 60

Query:   320 GLMGLISQVVFDGGRH-----------VLGVIPTTLMPREITGESVGEVESVGE 448
             GLMGL+SQ V DGGRH               +  +    ++TGE+VGEV+ V +
Sbjct:    61 GLMGLVSQAVHDGGRHNNNNNGNDDALFCHSVNVSQTNSKLTGETVGEVKEVAD 114


to_Entrezto_Relatedto_Related >gi|4371280|gb|AAD18138.1|  (AC006260) hypothetical protein [Arabidopsis
            thaliana]
            Length = 178

Frame  2 hits (HSPs):   ________________________                          
                        __________________________________________________
Database sequence:     |             |             |             |        | 178
                       0            50           100           150

  Plus Strand HSPs:

 Score = 261 (91.9 bits), Expect = 1.6e-21, P = 1.6e-21
 Identities = 58/103 (56%), Positives = 71/103 (68%), Frame = +2

Query:   140 MEIEEQTMKMMMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSI 319
             MEI+ ++M+      S+FRRICVFCG+S GK  SYQ AA+ L  +LV RNIDLVYGGGSI
Sbjct:     1 MEIKGESMQ-----KSKFRRICVFCGSSQGKKSSYQDAAVDLGNELVSRNIDLVYGGGSI 55

Query:   320 GLMGLISQVVFDGGRHVLGVIPTTLMPREITGESVGEVESVGE 448
             GLMGL+SQ V DGGRH             +TGE+VGEV +V +
Sbjct:    56 GLMGLVSQAVHDGGRH-------------LTGETVGEVRAVAD 85


to_Entrezto_Relatedto_Related >gi|4510370|gb|AAD21458.1|  (AC007017) unknown protein [Arabidopsis thaliana]
            Length = 181

Frame  2 hits (HSPs):   ____________________                              
                        __________________________________________________
Database sequence:     |             |             |             |        | 181
                       0            50           100           150

  Plus Strand HSPs:

 Score = 254 (89.4 bits), Expect = 9.1e-21, P = 9.1e-21
 Identities = 50/70 (71%), Positives = 57/70 (81%), Frame = +2

Query:   170 MMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVV 349
             M ++ SRFRRICVFCG+S G   +Y  AA+QLA QLVERNIDLVYGGGS+GLMGLISQ V
Sbjct:     1 MEETKSRFRRICVFCGSSSGNKTTYHDAALQLAHQLVERNIDLVYGGGSVGLMGLISQAV 60

Query:   350 FDGGRHVLGV 379
              DGGR V+ V
Sbjct:    61 HDGGREVITV 70


to_Entrezto_Relatedto_Related >gi|9757778|dbj|BAB08387.1|  (AB005240) lysine decarboxylase-like protein
            [Arabidopsis thaliana]
            Length = 220

Frame  2 hits (HSPs):   ________________________                          
                        __________________________________________________
Database sequence:     |           |          |          |           |    | 220
                       0          50        100        150         200

  Plus Strand HSPs:

 Score = 253 (89.1 bits), Expect = 1.2e-20, P = 1.2e-20
 Identities = 57/105 (54%), Positives = 68/105 (64%), Frame = +2

Query:   167 MMMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQV 346
             M  K SSRF+ ICVFCG+S G   SYQ AAI LAK+LV R IDLVYGGGSIGLMGL+SQ 
Sbjct:     1 MTKKQSSRFKSICVFCGSSNGNKASYQDAAIDLAKELVMRKIDLVYGGGSIGLMGLVSQA 60

Query:   347 VFDGGRH-----------VLGVIPTTLMPREITGESVGEVESVGE 448
             V DGGRH               +  +    ++TGE+VGEV+ V +
Sbjct:    61 VHDGGRHNNNNNGNDDALFCHSVNVSQTNSKLTGETVGEVKEVAD 105


to_Entrezto_Relatedto_Related >gi|12231051|sp|P48636|YDC3_PSEAE  HYPOTHETICAL PROTEIN PA4923
            >gi|11348257|pir||A83031 conserved hypothetical protein PA4923
            [imported] - Pseudomonas aeruginosa (strain PAO1)
            >gi|9951201|gb|AAG08308.1|AE004905_6 (AE004905) conserved
            hypothetical protein [Pseudomonas aeruginosa]
            Length = 195

Frame  2 hits (HSPs):   ______________________                            
                        __________________________________________________
Database sequence:     |            |            |            |           | 195
                       0           50          100          150

  Plus Strand HSPs:

 Score = 207 (72.9 bits), Expect = 8.7e-16, P = 8.7e-16
 Identities = 38/83 (45%), Positives = 53/83 (63%), Frame = +2

Query:   194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373
             R +CVFCG SPG +P YQ AA+ L + L ER + LVYGGG++GLMG ++      G  V+
Sbjct:     4 RSVCVFCGASPGASPVYQEAAVALGRHLAERGLTLVYGGGAVGLMGTVADAALAAGGEVI 63

Query:   374 GVIPTTLMPREITGESVGEVESV 442
             G+IP +L   EI  + +  +E V
Sbjct:    64 GIIPQSLQEAEIGHKGLTRLEVV 86


to_Entrezto_Relatedto_Related >gi|7486909|pir||T04966  hypothetical protein T12J5.60 - Arabidopsis thaliana
            >gi|4455345|emb|CAB36726.1| (AL035522) putative protein
            [Arabidopsis thaliana] >gi|7270471|emb|CAB80236.1| (AL161587)
            putative protein [Arabidopsis thaliana]
            Length = 268

Frame  2 hits (HSPs):   _____________________                             
                        __________________________________________________
Database sequence:     |         |        |        |         |        |   | 268
                       0        50      100      150       200      250

  Plus Strand HSPs:

 Score = 202 (71.1 bits), Expect = 2.9e-15, P = 2.9e-15
 Identities = 49/104 (47%), Positives = 62/104 (59%), Frame = +2

Query:   185 SRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVE----------------RNIDLVYGGGS 316
             SRF+R+CVFCG+S GK   Y  AA  LA++LV                 R ++LVYGGGS
Sbjct:     6 SRFKRVCVFCGSSSGKRECYSDAATDLAQELVRLCLNLNESLENLKWVTRRLNLVYGGGS 65

Query:   317 IGLMGLISQVVFDGGRHVLGVIPTTLMPREITGESVGEVESVGE 448
             IGLMGL+SQ V + G HVLG      +   ITGE+ GEV +V +
Sbjct:    66 IGLMGLVSQAVHEAGGHVLGYAQIYDLFTLITGETYGEVIAVAD 109


to_Entrezto_Relatedto_Related >gi|77664|pir||PQ0114  hypothetical protein (azu region) - Pseudomonas
            aeruginosa  (fragment) >gi|7251673|gb|AAA25729.2| (M30389) ORF1
            [Pseudomonas aeruginosa]
            Length = 71

Frame  2 hits (HSPs):     _______________________________________________ 
                        __________________________________________________
Database sequence:     |             |             |             |        | 71
                       0            20            40            60

  Plus Strand HSPs:

 Score = 197 (69.3 bits), Expect = 1.0e-14, P = 1.0e-14
 Identities = 34/67 (50%), Positives = 46/67 (68%), Frame = +2

Query:   194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373
             R +CVFCG SPG +P YQ AA+ L + L ER + LVYGGG++GLMG ++      G  V+
Sbjct:     4 RSVCVFCGASPGASPVYQEAAVALGRHLAERGLTLVYGGGAVGLMGTVADAALAAGSEVI 63

Query:   374 GVIPTTL 394
             G+IP +L
Sbjct:    64 GIIPQSL 70


to_Entrezto_Relatedto_Related >gi|7451099|pir||D70033  conserved hypothetical protein yvdD - Bacillus subtilis
            >gi|1945663|emb|CAB08033.1| (Z94043) hypothetical protein [Bacillus
            subtilis] >gi|2635977|emb|CAB15469.1| (Z99121) similar to
            hypothetical proteins [Bacillus subtilis]
            Length = 191

Frame  2 hits (HSPs):   ______________________                            
                        __________________________________________________
Database sequence:     |            |            |             |          | 191
                       0           50          100           150

  Plus Strand HSPs:

 Score = 167 (58.8 bits), Expect = 1.5e-11, P = 1.5e-11
 Identities = 31/83 (37%), Positives = 51/83 (61%), Frame = +2

Query:   194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373
             + ICVF G++PG N +Y+  A +L   + E+ I LVYGG  +GLMG I+  + + G   +
Sbjct:     2 KTICVFAGSNPGGNEAYKRKAAELGVYMAEQGIGLVYGGSRVGLMGTIADAIMENGGTAI 61

Query:   374 GVIPTTLMPREITGESVGEVESV 442
             GV+P+ L   E+  +++ E+  V
Sbjct:    62 GVMPSGLFSGEVVHQNLTELIEV 84


to_Entrezto_Relatedto_Related >gi|10175706|dbj|BAB06803.1|  (AP001517) BH3084~unknown conserved protein
            [Bacillus halodurans]
            Length = 187

Frame  2 hits (HSPs):   ____________________                              
                        __________________________________________________
Database sequence:     |             |            |            |          | 187
                       0            50          100          150

  Plus Strand HSPs:

 Score = 165 (58.1 bits), Expect = 2.4e-11, P = 2.4e-11
 Identities = 33/72 (45%), Positives = 46/72 (63%), Frame = +2

Query:   197 RICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLG 376
             +I VFCG+S G +  Y+  A QL K+L  R I LVYGG S+G+MG ++  V + G  V+G
Sbjct:     2 KIAVFCGSSNGASDVYKEGARQLGKELARRGITLVYGGASVGIMGAVADSVLEAGGEVIG 61

Query:   377 VIPTTLMPREIT 412
             V+P  L   EI+
Sbjct:    62 VMPRFLEEPEIS 73


to_Entrezto_Relatedto_Related >gi|11280345|pir||T45176  conserved hypothetical protein ctf [imported] -
            Mycobacterium leprae >gi|699154|gb|AAA62920.1| (U15180) ctf
            [Mycobacterium leprae]
            Length = 187

Frame  2 hits (HSPs):      _____________________                          
                        __________________________________________________
Database sequence:     |             |            |            |          | 187
                       0            50          100          150

  Plus Strand HSPs:

 Score = 144 (50.7 bits), Expect = 4.1e-09, P = 4.1e-09
 Identities = 31/78 (39%), Positives = 43/78 (55%), Frame = +2

Query:   200 ICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGV 379
             ICVFC   P      +LAA +L + + ER   LV+GGG +  MG ++   +  G  ++GV
Sbjct:    13 ICVFCAAGPMHPELLELAA-ELGEAIAERGWTLVWGGGRVSAMGAVASAAWTRGGRIVGV 71

Query:   380 IPTTLMPREITGESVGEV 433
             IP  L  REI    VGE+
Sbjct:    72 IPEMLQRREIADTYVGEL 89


to_Entrezto_Relatedto_Related >gi|10175367|dbj|BAB06465.1|  (AP001516) lysine decarboxylase [Bacillus
            halodurans]
            Length = 190

Frame  2 hits (HSPs):   _________________                                 
                        __________________________________________________
Database sequence:     |            |             |            |          | 190
                       0           50           100          150

  Plus Strand HSPs:

 Score = 144 (50.7 bits), Expect = 4.1e-09, P = 4.1e-09
 Identities = 26/63 (41%), Positives = 43/63 (68%), Frame = +2

Query:   197 RICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLG 376
             +IC+F G+S G++P Y      L +Q+ ++  +++YGGG+ GLMG+++Q V D G  V G
Sbjct:     2 KICLFSGSSLGQHPIYAEQVRALGEQIGKQGWEVIYGGGNAGLMGVLAQSVLDNGGRVTG 61

Query:   377 VIP 385
             +IP
Sbjct:    62 IIP 64


to_Entrezto_Relatedto_Related >gi|1169649|sp|P46378|FAS6_RHOFA  HYPOTHETICAL 21.1 KD PROTEIN IN FASCIATION
            LOCUS (ORF6) >gi|1076047|pir||F55578 hypothetical protein 2 (ipt 3'
            region) - Rhodococcus fascians plasmid pFiD188
            >gi|455006|emb|CAA82746.1| (Z29635) orf6 [Rhodococcus fascians]
            Length = 198

Frame  2 hits (HSPs):       _________________                             
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |            |           |            |            | 198
                       0           50         100          150
__________________

Annotated Domains:
   DOMO                 DM06442:                                 1..197
   PRODOM               PD005712: YJF5(2)                        20..160
   PRODOM               PD090879: FAS6_RHOFA                     162..197
__________________


  Plus Strand HSPs:

 Score = 132 (46.5 bits), Expect = 7.7e-08, P = 7.7e-08
 Identities = 24/64 (37%), Positives = 35/64 (54%), Frame = +2

Query:   194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373
             + + VFCG  PG+   Y   A  + + +    + LVYGG  +GLMG ++    D G  V+
Sbjct:    20 KSVTVFCGAMPGRGTKYGQLAEGMGRAIARSKLRLVYGGARVGLMGTLANAALDSGGTVV 79

Query:   374 GVIP 385
             GVIP
Sbjct:    80 GVIP 83


to_Entrezto_Relatedto_Related >gi|4337446|gb|AAD18125.1|  (U89166) ECORLD_ORF1; similar to the Pseudomonas
            aeruginosa ORF upstream of the Azu gene and to the Rhodococcus
            fascians fas operon ORF6 protein, encoded by GenBank Accession
            Numbers M30388 and Z29635, respectively [Eikenella corrodens]
            Length = 183

Frame  2 hits (HSPs):   ________________                                  
                        __________________________________________________
Database sequence:     |             |             |            |         | 183
                       0            50           100          150

  Plus Strand HSPs:

 Score = 122 (42.9 bits), Expect = 8.8e-07, P = 8.8e-07
 Identities = 26/57 (45%), Positives = 33/57 (57%), Frame = +2

Query:   239 SYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGVIPTTLMPREI 409
             SY  AA +L + + ER   LVYGGG IGLMG ++      G  V G+IPT L   E+
Sbjct:     1 SYSQAARELGRAIAERGSRLVYGGGGIGLMGEVASAALAAGGKVTGIIPTFLRHEEM 57


to_Entrezto_Related >gi|6322406  ref|NP_012480.1| Yjl055wp [Saccharomyces cerevisiae]
            >gi|1352984|sp|P47044|YJF5_YEAST HYPOTHETICAL 26.9 KD PROTEIN IN
            BTN1-PEP8 INTERGENIC REGION >gi|1077814|pir||S56827 conserved
            hypothetical protein YJL055w - yeast  (Saccharomyces cerevisiae)
            >gi|1008195|emb|CAA89346.1| (Z49330) ORF YJL055w [Saccharomyces
            cerevisiae]
            Length = 245

Frame  2 hits (HSPs):      __________________                             
                        __________________________________________________
Database sequence:     |          |         |         |         |         | 245
                       0         50       100       150       200

  Plus Strand HSPs:

 Score = 124 (43.7 bits), Expect = 1.2e-06, P = 1.2e-06
 Identities = 30/82 (36%), Positives = 44/82 (53%), Frame = +2

Query:   194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVF--DGGRH 367
             + +CV+CG+S G    Y  +A +L     +    LVYGGG+ GLMG I++     D    
Sbjct:    19 KSVCVYCGSSFGAKALYSESAEELGALFHKLGWKLVYGGGTTGLMGKIARSTMGPDLSGQ 78

Query:   368 VLGVIPTTLMPREITGESVGEV 433
             V G+IP  L+ +E T E   +V
Sbjct:    79 VHGIIPNALVSKERTDEDKEDV 100


to_Entrezto_Related >gi|10954698  ref|NP_066633.1| similar to orf6 gene(unknown,in P450 operon) in
            Rhodococcus fascians [Agrobacterium rhizogenes]
            >gi|8918698|dbj|BAA97763.1| (AB039932) similar to orf6 gene in
            Rhodococcus fascians [Agrobacterium rhizogenes]
            >gi|10567362|dbj|BAB16171.1| (AP002086) similar to orf6
            gene(unknown,in P450 operon) in Rhodococcus fascians [Agrobacterium
            rhizogenes]
            Length = 169

Frame  2 hits (HSPs):   _________________                                 
                        __________________________________________________
Database sequence:     |              |              |              |     | 169
                       0             50            100            150

  Plus Strand HSPs:

 Score = 106 (37.3 bits), Expect = 0.00013, P = 0.00013
 Identities = 23/56 (41%), Positives = 30/56 (53%), Frame = +2

Query:   275 LVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGVIPTTLMPREITGESVGEVESV 442
             +    I LVYGG SIGLMG I+      G  V+GVIP  L  +EI    + ++  V
Sbjct:     1 MARSGIGLVYGGASIGLMGAIADAARSDGGEVIGVIPRALAEKEIAHTDLADLRVV 56


to_Entrezto_Relatedto_Related >gi|7451100|pir||C70609  hypothetical protein Rv1205 - Mycobacterium
            tuberculosis  (strain H37RV) >gi|1929079|emb|CAB07828.1| (Z93777)
            hypothetical protein Rv1205 [Mycobacterium tuberculosis]
            Length = 187

Frame  2 hits (HSPs):      _____________________                          
                        __________________________________________________
Database sequence:     |             |            |            |          | 187
                       0            50          100          150

  Plus Strand HSPs:

 Score = 103 (36.3 bits), Expect = 0.0010, P = 0.0010
 Identities = 24/78 (30%), Positives = 38/78 (48%), Frame = +2

Query:   200 ICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGV 379
             + V+C  SP      +LAA ++   +  R   LV+GGG +  MG ++      G   +GV
Sbjct:    13 VAVYCAASPTHAELLELAA-EVGAAIAGRGWTLVWGGGHVSAMGAVASAARACGGWTVGV 71

Query:   380 IPTTLMPREITGESVGEV 433
             IP  L+ RE+      E+
Sbjct:    72 IPKMLVYRELADHDADEL 89


to_Entrezto_Relatedto_Related >gi|7462102|pir||A72302  conserved hypothetical protein - Thermotoga maritima
            (strain MSB8) >gi|4981597|gb|AAD36132.1|AE001765_11 (AE001765)
            conserved hypothetical protein [Thermotoga maritima]
            Length = 171

Frame  2 hits (HSPs):   ____________________                              
                        __________________________________________________
Database sequence:     |              |             |              |      | 171
                       0             50           100            150

  Plus Strand HSPs:

 Score = 81 (28.5 bits), Expect = 1.4, P = 0.74
 Identities = 22/66 (33%), Positives = 39/66 (59%), Frame = +2

Query:   194 RRICVFCGTSP-GKNPSYQLAAI--QLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGR 364
             +++ V   + P  K+P  +L  I  +L + L ++   LV+ GG  G+M L+SQ V + G 
Sbjct:     2 KKVVVVGYSGPVNKSPVSELRDICLELGRTLAKKGY-LVFNGGRDGVMELVSQGVREAGG 60

Query:   365 HVLGVIP 385
              V+G++P
Sbjct:    61 TVVGILP 67


to_Entrezto_Relatedto_Related >gi|12697620|dbj|BAB21615.1|  (AB037974) cytochrome oxidase subunit I
            [Thalassiosira nordenskioeldii]
            Length = 57

Frame  2 hits (HSPs):   ___________        ________________________       
                        __________________________________________________
Database sequence:     |                |                 |               | 57
                       0               20                40

  Plus Strand HSPs:

 Score = 41 (14.4 bits), Expect = 1.4, Sum P(2) = 0.75
 Identities = 7/13 (53%), Positives = 9/13 (69%), Frame = +2

Query:    47 FFPHPSLYVHIHP 85
             FF HP +Y+ I P
Sbjct:     1 FFGHPEVYILILP 13

 Score = 40 (14.1 bits), Expect = 1.4, Sum P(2) = 0.75
 Identities = 9/27 (33%), Positives = 16/27 (59%), Frame = +2

Query:   257 IQLAKQLVERNIDLVYGGGSIGLMGLI 337
             +  AK+ +   + +VY   SIG++G I
Sbjct:    23 VSTAKKPIFGYLGMVYAMFSIGVLGFI 49


to_Entrezto_Relatedto_Related >gi|1196510|gb|AAA88231.1|  (M15467) unknown protein [Mycobacterium
            tuberculosis]
            Length = 175

Frame -1 hits (HSPs):              ___________________                    
                        __________________________________________________
Database sequence:     |              |             |             |       | 175
                       0             50           100           150

  Minus Strand HSPs:

 Score = 76 (26.8 bits), Expect = 6.3, P = 1.0
 Identities = 22/64 (34%), Positives = 32/64 (50%), Frame = -1

Query:   447 SPTLSTSPTLSPVISLG-IRVVGITPN-TWRPPSNTTCEIKPINPMLPPP*TKSMFLSTS 274
             +P    S T S   S+  +R  G+ P  T R PS T      +  ++P P T S+FL+TS
Sbjct:    40 APLSRVSVTFSTAFSMPRLRPSGLAPAATLRRPSRTNAWASTVAVVVPSPATSSVFLATS 99

Query:   273 CLAS 262
               +S
Sbjct:   100 LTSS 103


to_Entrezto_Relatedto_Related >gi|7479785|pir||T35807  hypothetical protein SC8D9.03 SC8D9.03 - Streptomyces
            coelicolor >gi|4467242|emb|CAB37567.1| (AL035569) SC8D9.03,
            unknown, len: 182aa; similar to many of undefined function eg.
            TR:Q49952 (EMBL:U15180) hypothetical protein from Mycobacterium
            leprae (187 aa) fasta scores; opt: 331, z-score: 394.0, E():
            1.2e-14, (36.1% identity in 166 aa overlap)
            Length = 182

Frame  2 hits (HSPs):    _________________                                
                        __________________________________________________
Database sequence:     |             |             |            |         | 182
                       0            50           100          150

  Plus Strand HSPs:

 Score = 76 (26.8 bits), Expect = 6.8, P = 1.0
 Identities = 20/60 (33%), Positives = 32/60 (53%), Frame = +2

Query:   200 ICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGV 379
             ICVF   +   +  Y   A + A+ L +    LV+GG  +GLM +++  V + G  +LGV
Sbjct:     6 ICVFLSAAD-LDEHYTRPAKEFAELLGKGGHTLVWGGSDVGLMKVVADGVQESGGKLLGV 64


to_Entrezto_Relatedto_Related >gi|7451672|pir||H70312  hypothetical protein aq_134 - Aquifex aeolicus
            >gi|2982880|gb|AAC06500.1| (AE000675) putative protein [Aquifex
            aeolicus]
            Length = 151

Frame  2 hits (HSPs):   _____________________                             
                        __________________________________________________
Database sequence:     |                |               |                || 151
                       0               50             100              150

  Plus Strand HSPs:

 Score = 74 (26.0 bits), Expect = 7.6, P = 1.0
 Identities = 19/65 (29%), Positives = 37/65 (56%), Frame = +2

Query:   194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373
             R++ V  G+S      Y+ A  +L K+L +RN+ +V GG + G+M  + +   + G   +
Sbjct:     2 RQVSVI-GSSKASEEEYEFA-YRLGKELAKRNLVVVCGGRT-GVMEAVCKGAKEEGGLTI 58

Query:   374 GVIPT 388
             G++P+
Sbjct:    59 GIMPS 63


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.98

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.358   0.155   0.599  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.335   0.148   0.458  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.340   0.148   0.550  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.341   0.150   0.525  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.347   0.151   0.548  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.351   0.154   0.571  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      149       148       10.  74 3  12 22  0.091   34
                                                    30  0.11    36
   +2      0      149       148       10.  74 3  12 22  0.091   34
                                                    30  0.11    36
   +1      0      150       149       10.  74 3  12 22  0.092   34
                                                    30  0.12    36
   -1      0      150       149       10.  74 3  12 22  0.092   34
                                                    30  0.12    36
   -2      0      149       149       10.  74 3  12 22  0.092   34
                                                    30  0.12    36
   -3      0      149       148       10.  74 3  12 22  0.091   34
                                                    30  0.11    36


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  25
  No. of states in DFA:  591 (58 KB)
  Total size of DFA:  203 KB (256 KB)
  Time to generate neighborhood:  0.02u 0.00s 0.02t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  164.72u 1.14s 165.86t  Elapsed: 00:00:28
  Total cpu time:  164.75u 1.20s 165.95t  Elapsed: 00:00:28
  Start:  Fri Feb  1 22:11:31 2002   End:  Fri Feb  1 22:11:59 2002

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000