Proc Nut1 Ad Sei USA Vol 77. No 2, pp 799-803, February 1980 Biochemistry An Escherichia coli replication protein that recognizes a unique sequence within a hairpin region in 6x174 DNA (DNAdependent ATPase/sequence specificity/protein n'/origin of &X DNA complementary strand replication) JOSEPH SHLOMAI AND ARTHUR KORNBERG Department of Biochemistry. Stanford University School of Medicine, Stanford, California 94305 Contributed by Arthur Kornberg, Nouember 5,1979 ABSTRACT Protein n', a prepriming DNA re lication en- zyme of Escherichia coli, is a 4x174 DNAdepencfent ATPase. Restriction endonuclease fragmentation and exonuclease VI1 digestion of 9x174 DNA have led to the identification of a Snucleotide fra ent that carries the protein n' recognition sequence. Molecg hybridization and sequence analysis have located this sequence within the untranslated region between genes F and G, a map location analogous to that of the unique complementa strand origin of phage G4 DNA. Within the 55nucleotidexagment is a sequence of 44 nucleotides that forms a stable haiqiin structure. This duplex ma be the signal for protein n' to initiate the prepriming events tiat lead to the start of 6x174 complementary DNA strand replication. Conversion of the single-stranded chromosomes of phages M13, G4, and 4x174 (4X) to their duplex replicative forms are model systems for studies of the mechanisms of initiation of DNA synthesis in Escherichia colt. The more complex $X system appears to be particularly pertinent to the discontinuous phase of E. colt chromosomal replication (1,2). The M13, G4, and @X templates differ primarily in their enzymatic requirements for the initiation of primer synthesis. Coated with single-stranded-DNA binding protein (SB), M13 DNA can be primed directly by RNA polymerase (3) and G4 DNA, by primase (4,5). However, SSB-coated #JX DNA, al- though primed by the same E. colt primase, must first be acti- vated in a prepriming stage. In this prepriming reaction, the E. coli proteins n', n and n", i, dnaB, and dnaC form an acti- vated complex with 4X DNA (2.6-8). The distinctions in primer synthesis on these three phage templates are probably due to structural differences in "pro- moter"-like sites recognized by the distinctive priming systems. Although the origin of complementary DNA strand replication is unique and well characterized for M13 (9) and G4 (10-14), it does not appear to be at a unique site for 4X. as judged by in oioo and tn oitro studies (7,9, 11, 15). We will describe elsewhere the purification of protein n' to near homogeneity, its 4X DNA-dependent ATPase activity, and its capacity to destabilize an SWX DNA complex. In this paper we report the recognition by protein n' of a specific se- quence in 4X DNA located at an intergenic region and suggest that it may be the signal that leads to the initiation of 4X complementary DNA strand replication. MATERIALS AND METHODS Nucleic Acids, Enzymes, Resins, and Nucleotides. @X, qiX replicative form (RF)I, G4, and M13 DNAs were prepared as described (16). E. coli SSB was 4 X l(r units/mg (17); Hae 111 and HinfI restriction endonucleases were from Bethesda Re- search Laboratories, Rockville, MD, and Mbo XI was from New England BioLabs; exonuclease VI1 preparations were gifts from S. P. Goff (Massachusetts Institute of Technology) and from J. W. Chase (Albert Einstein Medical School); TCpolynucleotide kinase was from P-L Biochemicals. Hydroxyapatite was ob tained from Bio-Rad, agarose was from Bethwja Research Table 1. Localization of the n' recognition site to a particular nuclease restriction fragment ATP hydrolyzed, DNA effector pmol None +X174 G4 Hoe I11 fragments of t$X 21 22 23 24 25-11* 45%) even after prolonged incubation at 37"C, but at 45°C the activity of Z1 was destroyed (>95%) (Fig. 1). Resistance of the activity of the Z1 fragment at 37°C and susceptibility at 45°C suggested that the effector activity of this single-stranded DNA fragment is in a duplex denatured at 45°C. Isolation of Duplex DNA Region that Carries Protein n'- Recognition Site. The limit-digestion products of exonuclease VI1 hydrolysis of fragment Z1 separated by electrophoresis were mainly oligomers 8-12 nucleotides long (Fig. 2). However, 7.5% of the digested DNA was found in longer chains: fragment 1, 65-70 residues (2.5%); fragment 2,55-60 residues (3.6%); and fragment 3, 18-22 residues (1.4%). Only fragment 2, repre- senting about 1% of the original $X chromosome, was active as an effector for n'-ATPase (Fig. 3); the longest fragment (fragment 1) was inactive. ATP hydrolysis rate with fragment 2 (Fig. 3) was 52% of that with intact 4X DNA. ATPase activity 8 ORIGIN -53U6- -1353- -812- -413- - 250- - - . n FIG. 5L Sequence homology of exonuclease VII-resistant fragment 2 as judged by hybridization to various regions of gX DNA. gX RFI DNA (0.8 pg) was incubated at 37OC for 1 hr with restriction endonucleases as follows: Hae I11 (3 units) in 50 mM Tris.HC1, pH 7.5/5 mM MgC12/0.5 mM dithiothreitok HinfI (3 units) or Hoe I11 and HinfI simultaneously (each 3 units) in 6 mM Tris-HCI. pH 7.5/6 mM MgCln/6 mM 2-mercap- toethanoU100 mM NaCI; or Mbo I1 (3 units) or HinfI and Mbo I1 simultaneously (each 3 units) in 10 mM Tris.HC1, pH 7.9/6 mM KCI/10 mM MgCIJ1 mM dithiothreitol. Reactions were stopped by addition of EDTA, Sarkosyl. glycerol, and bromphenol blue to final concentrations of 50 mM, 296, 10% and 0.0196, respectively. (A) DNA fragments were electrophoresed in 2% agarose gels and stained with ethidium bromide (1 pg/ml). DNA was denatured in the gel and transferred to DBM paper (22). (R) 5'-:v2P-labeled exonuclease VII-resistant DNA fragment 2 (Fig. 2) was hybridized (23) in the presence of 10% dextran sulfate-500 at 42'C for 40 hr to the DNA fragments bound to DBM paper; the paper was washed (23) and subjected to autoradiography at -80°C by using an intensifying screen (Cronex, Du Pont). In the far right lane, :"P-labeled @X DNA was used as a size and transfer marker. 802 Biochemistry: Shlomai and Kornberg PRDTflN n Proc. Nut[. Acad. Sci. USA 77 (1980) 2394 2465 RECOGNITION LOCIJS \, ---~ 22x5, ,3731 T-T 2330 /\ TA \/ T*A II A*T II 2325 A-T II A'T 2335 11 A-T II A*T I1 2320 G :P *C II j* 4949 `4877 5ia Location of protein n' recognition locus on @X chromo- some. Heavy lines represent restriction fragments of @X RFI DNA, to which exonuclease VII-resistant DNA fragment 2 was hybridized (Fig. 5B). Broken lines indicate products of a partial digestion of an M60 I1 fragment (1653-2465) by HinfI, to which the radioactive DNA probe was hybridized (Fig. 58). The magnified region (2264-2465) closest to the center of the diagram is the hybridized sequence com- mon to all three restriction digests. The n' recognition site is located within this 201-nucleotide region. It contains an untranslated se- quence of 110 nucleotides (22s2394) between structural genes F and G; within this intergenic region is the hairpin sequence of 44 nucleo- tides (see Fig. 7) in the exonuclease-resistant DNA fragment 2 FIG. 6. (230a235i). is a function of the concentration of the DNA effector, but the dependence is more nonlinear at low concentrations (Fig. 4). Location of Isolated Protein n` Recognition Sequence in 4X Chromosome. Molecular hybridization and sequence analysis were used to map the location of the exonuclease VII- resistant fragment 2 (above). Products of digestion of @X RFI DNA by the restriction endonucleases Hoe HI, HinfI, and Mbo I1 were separated on 2% agarose gels (Fig. 5A), denatured, and transferred to DBM paper (22). Fragment 2, labeled with 32P at its 5' end by polynucleotide kinase, was hybridized to the denatured, paper-bound @X DNA fragments (Fig. 5B) (23). with the Hoe 111 digest separated into nine bands (Fig. 5A), the fragment 2 probe hybridized, as expected, only to fragment Zl (24). 1353 residues at position 1777-3129 (Fig. 6). Of the ten bands produced by HinfI digestion (Fig. 5A), the probe hybridized mainly to fragment F5b (24), 413 nucleotides long (Fig. 5B) at position 2265-2677 (Fig. 6). F5b was part of a triplet band with fragments F5a and F5c (24), 417 and 427 nucleotides long, respectively; but these two fragments (at positions 3732-4148 and 4702-5128) are well outside the Hue 111 Zl sequence. Fragment F5a is also excluded by the results of the double digestion by Hue I11 and Hinfl. Although the F5a sequence contains three Hue 111 sites (at positions 4759,4877, and 4949) (Fig. 6), hybridization of the probe to a fragment about 413 residues long was not affected. Slight hybridization to a HfnfI (or HinfI plus Hoe 111) digestion product about 250 nucleotides long (Fig. 5B) will be considered below. With the Mbo I1 digest, which yielded six bands (Fig. 5A), the fragment 2 probe hybridized to fragment M4 (24), 812 A*+ 2350 2354 2301 2305 11 5'-A-G-G -T- T-A -T- A o T-G -A-C - C -3 FIG. 7. Possible secondary structure in recognition locus of protein n'. The proposed structure of base-paired regions separated by two internal loops and a hairpin loop is based on a calculation of free-energy contributions of base-paired regions and loops of the se- quence at position 230a2351 (Fig. 6) within the 55-nucleotide frag- ment possessing the protein n' recognition site. nucleotides long (Fig. 5B) at position 1654-2465 (Fig. 6). M4 migrated in the agarose gel as part of a doublet band with M3, 837 nucleotides long (at position 817-1653) (Fig. 6). Not only is M3 outside the Hoe 111 Z1 sequence, but it is also eliminated by the results of a double digestion (completely with Mbo I1 and partially with HinfI). Fragment 2 hybridized to another set of fragments generated by the incomplete digestion of the Mbo I1 fragment M4 by HinfI (Figs. 5B and 6). Hybridization was observed with the 812-nucleotide fragment when no cuts were introduced in this region by HinfI, and with fragments 201, 450, and 568 nucleotides long, produced by HinfI cuts within M4 at positions 2264,2015, and 1897. In summary, the hybridization experiments limit the location of the exonuclease VII-resistant fragment 2 to a 201-residue region at position 2265-2465. It remained to be determined whether the sequence of fragment 2 contains the potential for secondary structure. A Hairpin Structure in Recognition Site. Exonuclease VII-resistant fragment 2 upon electrophoresis in 20% polyac- rylamidej7 M urea contained a major component 55 nucleot- ides long and a minor one, one or two nucleotides shorter. To map this fragment precisely, the DNA sequence was deter- mined. Analysis of a 5%nucleotide region in this fragment (data not shown) and comparison of the sequence with the Sanger map (24) locates it at position 2304-2355. Within this fragment is a 44-nucleotide-long sequence at position 2308-2351 with a strong potential for secondary structure (Fig. 7). Calculations of the free energy contributions of the base-paired regions and loops, based on studies with polyribonucleotides (E), yield a value of -14 kcal/mol at 25°C and 1 M NaCI. This value is consistent with a duplex form for this sequence (Fig. 7) under the experimental conditions in which it resists destabilization by SSB and digestion by exonuclease VI1 (Fig. 2). The HinfI (or HinfI plus Hue 111) fragment about 2-50 nucleotides long that showed some homology to fragment 2 (see Biochemistry: Shlomai and Kornberg Proc. Natl. Acad. Sci. USA 77 (1980) 803 above) is the 249 nucleotide-long HinfI fragment F7 (24) at position 2016-2264, as judged by its size and its location within a sequence that overlaps both the Hae 111 Z1 and the Mbo II M4 DNA fragments that were shown to be the only Hae 111 and Mbo I1 4X DNA fragments with homology to the DNA probe (Fig. 5B). The F7 DNA fragment contains (in its viral strand) a sequence of eight nucleotides, GCT-GA-GGG (at position 2039-2046), identical to one on the exonuclease VII-resistant DNA fragment (position 2341-2348); this may explain its hy- bridization to the DNA probe. DISCUSSION A prepriming system reconstituted from six E. colt proteins is needed to prepare the single-stranded, binding-protein-covered $X DNA circle for primer synthesis by primase to initiate DNA replication (26, 27). This system is specific for @X DNA and does not operate on the SSs of phages G4 and M13 (26, 27). Unlike phages G4 and M13, which posses unique origins for complementary strand starts, both in utvo and in uttro studies indicate multiple starts on $X DNA (7,9, 11, 14). However, the strict specificity of the prepriming system for 4X DNA suggests that a particular locus is recognized by one or more of the proteins of the prepriming system. Protein n` is a candidate for this role. Protein n', now available in homogeneous form (unpublished observations), is an ATPase dependent on the presence of DNA. Unlike most other such ATPases, protein n' is strikingly specific in its DNA dependence. Among a large number of binding- protein-coated single strands and duplexes, only 4X DNA is active (unpublished observations). Because recognition of coated G4 DNA by primase and M13 DNA by RNA polymer- ase depends in each instance on a unique hairpinlike structure, it seemed that protein n' might also depend on such a helical structure that resists destabilization by SSB. Exonuclease VII, an enzyme that acts on single-stranded DNA from both 3' and 5' termini (18, 19). leaves a resistant .%-nucleotide stretch that retains the full effector capacity of the #X circle to support protein n' ATPase activity (Figs. 1-4). Within the exonuclease VII-resistant, 55-nucleotide fragment ,there is a 44-nucleotide sequence with a potential for forming a relatively stable hairpin structure (Fig. 7). Because its size distinguishes this sequence neither from other duplex regions in $X that also resist exonuclease VI1 action (Fig. 2) nor from the duplex origins in G4 and M13 DNAs, the basis for recog- nition by protein n' must depend on other features, such as a particular sequence within the hairpin, mismatched regions, the terminal loop, or neighboring regions of the hairpin. Available methods for chemical synthesis of polydeoxynu- cleotides (28) offer an approach for settling this question. The 55-nucleotide recognition sequence for protein n` is contained within a 107-nucleotide, untranslated region of the $X chromosome between genes F and G, a location analogous to that of the complementary strand origin of phage G4 (12). This is remarkable in view of the low degree of overall sequence homology between these phages and the lack of resemblance between the hairpins recognized by protein n' in $X and by primase in G4. It seems likely that the series of reactions that culminates in the priming of 4X complementary strand syn- thesis is initiated by protein n' at a specific locus, and that subsequent events, such as movement of dnaB protein around the $X circle, may obscure this unique start. We thank Profesor 1. Tinoco for a discussion of stability of secondary structures in DNA. This work was supported by grants from the Na- tional Science Foundation and National Institutes of Health. J.S. was a Fellow of the Fogarty International Center of the National Institutes of Health. 1. 2. Schekman, R.. Weiner. A. & Kornberg, A. (1974) Science 186, Wickner. S. & Hunvitz, I. (1974) Proc. Natl. Ad. Sei. USA 71, 987-993. 4120-4124. Geider, K. & Komberg. A. (1974) J. Bfof. Chem. 249, 3999- 4005. Bouch6, J.-P., Zechel. K. & Kornberg, A. (1975) J. Bfol. Chem. 250,5995-6001. Zechel. K.. BouchC. l.-P. & Kornberg, A. (1975) J. Bfol. Chem. 3. 4. 5. 6 7 8 9. 10. 11. 12. 13. 14 15 16 17. 18. 19. 20 21 22 23 24 25. 26. 27 28. 250,4&-4689. - - McMacken, R., BouchC, J,-P,, Rowen, S. L., Weiner, J. H.. Ueda, K., Thelander, L., McHenry, C. & Kornberg, A. (197'7) in Nudeic Acid-Protefn Recognftion, ed. Vogel, H. J. (Academic, New York), pp. 15-29. McMacken, R., Ueda. K. & Kornberg, A. (1977) Proc. Natf. Ad. Scf. USA 74,419&4194. Weiner, J. H., McMacken, R. & Kornberg, A. (1976) Proc. Natl. Acad. Scf. USA 73,752-756. Tabak, H. F., Griffith, J., Geider. K.. Schaller, H. & Kornberg, A. (1974) J. Bioi. Chem. 249,30493054. BouchC, J,-P., Rowen, L. & Kornberg, A. (1978) J. Biof. Chem. 253,765-769. Sims, J. & Dressler, D. (1978) Proc. Natl. Acad. Sei. USA 75, 30943098. Fiddes. J. C., Barrell, B. G. & Godson. G. N. (1978) Proc. Natf. Godson, G. N. (1975) in DNA Synthesis and Its Regulatia, eds. Goulian. M., Hanawalt. P. & Fox, C. F. (W. A. Benjamin, Menlo Park, CA), pp. 387-397. Martin, D. M. & Godson. G. N. (1977) J. Mol. Bwf. 117,321- 335. McMacken, R. & Kornberg. A. (1978) J. Bfol. Chem. 253, 3313-3319. Eisenberg. S.. Harbers, B., Hours. C. & Denhardt. D. T. (1975) J. Mol. Bfol. 99,107-123. Weiner, J. H., Bertsch, L. L. & Komberg, A. (1975) J. Biol. Chem. 250,1972-1980. Chase, J. W. & Richardson, C. C. (1974) J. Bfof. Chem. 249, 4545-4552. Chase, J. W. & Richardson. C. C. (1974) J. Bfol. Chem. 249, 4553-4561. Maniatis, T., Jeffrey, A. & van de Sande, H. (1975) Biochemistry 14,3787-3794. Maxam. A. M. & Gilbert. W. (1977) Proc. Natf. Acad. Scf. USA 74,560-564. Alwine, J. C., Kemp, D. J. & Stark. G. R. (1977) Proc. Natl. Ad. Scf. USA 74,5350-5354. Wahl, G. M., Stern, M. & Stark, G. R. (1979) Proc. Natf. Acad. Scf. USA 76,36833687, Sanger, F.. Coulson. A. R.. Friedmann, T.. Air, G. M.. Barrell. B. G., Brown, N. L., Fiddes. J. C., Hutchison, C. A,. Slocombe, P. M. & Smith, M. (1978) J. Mol. Bfof. 125,225-246. Tinoco, I., Boner, D. N., Dengler, B.. Levine, M. D., Uhlenbeck. 0. L., Crothen, D. M., & Giralla, J. (1973) Nature (London) New Biof. 246,40-41. Kornberg. A. (1978) Cold Spring Harbor Symp. @ant. Bwf. 43, 1-9. Meyer, R. R., Shlomai, J., Kobori, J., Bates, D L., Rowen, L.. McMacken, R., Ueda, K. & Kornberg, A. (1978) Cold Sprfng Harbm Symp. Qwnt. Bfof. 43,289-293. Crea. R., Hirose. T., Kraszewski, A. & Itakura, K. (1978) Proc. Natl. Acad. Sei. USA 75,5765-5769. Ad. Scf. USA 75,1081-1085.