pmc logo imageJournal ListSearchpmc logo image
Logo of narJournal URL: redirect3.cgi?&&auth=0Pkl78328HpvpzTPbtnyfil_RHJyHF28DerMffQn4&reftype=publisher&artid=309913&article-id=309913&iid=9059&issue-id=9059&jid=4&journal-id=4&FROM=Article|Banner&TO=Publisher|Other|N%2FA&rendering-type=normal&&http://nar.oupjournals.org
Nucleic Acids Res. 1993 August 11; 21(16): 3875–3884.
PMCID: PMC309913
Comparative DNA sequence features in two long Escherichia coli contigs.
L R Cardon, C Burge, G A Schachtel, B E Blaisdell, and S Karlin
Department of Mathematics, Stanford University, CA 94035.
Abstract
The recent sequencing of two relatively long (approximately 100 kb) contigs of E.coli presents unique opportunities for investigating heterogeneity and genomic organization of the E.coli chromosome. We have evaluated a number of common and contrasting sequence features in the two new contigs with comparisons to all available E.coli sequences (> 1.6 Mb). Our analyses include assessments of: (i) counts and distributions of restriction sites, special oligonucleotides (e.g., Chi sites, Dam and Dcm methylase targets), and other marker arrays; (ii) significant distant and close direct and inverted repeat sequences; (iii) sequence similarities between the long contigs and other E.coli sequences; (iv) characterization and identification of rare and frequent oligonucleotides; (v) compositional biases in short oligonucleotides; and (vi) position-dependent fluctuations in sequence composition. The two contigs reveal a number of distinctive features, including: a cluster of five repeat/dyad elements with very regular spacings resembling a transcription attenuator in one of the contigs; REP elements, ERICs, and other long repeats; distinction of the Chi sequence as the most frequent oligonucleotide; regions of clustering, overdispersion, and regularity of certain restriction sites and short palindromes; and comparative domains of inhomogeneities in the two long contigs. These and other features are discussed in relation to the organization of the E.coli chromosome.
Full text
Full text is available as a scanned copy of the original print version. Get a printable copy (PDF file) of the complete article (1.8M), or click on a page image below to browse page by page. Links to PubMed are also available for Selected References.
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
  • Churchill, GA; Daniels, DL; Waterman, MS. The distribution of restriction enzyme sites in Escherichia coli. Nucleic Acids Res. 1990 Feb 11;18(3):589–597. [PubMed]
  • Daniels, DL; Plunkett, G, 3rd; Burland, V; Blattner, FR. Analysis of the Escherichia coli genome: DNA sequence of the region from 84.5 to 86.5 minutes. Science. 1992 Aug 7;257(5071):771–778. [PubMed]
  • Dimri, GP; Rudd, KE; Morgan, MK; Bayat, H; Ames, GF. Physical mapping of repetitive extragenic palindromic sequences in Escherichia coli and phylogenetic distribution among Escherichia coli strains and other enteric bacteria. J Bacteriol. 1992 Jul;174(14):4583–4593. [PubMed]
  • Fickett, JW. Recognition of protein coding regions in DNA sequences. Nucleic Acids Res. 1982 Sep 11;10(17):5303–5318. [PubMed]
  • Gilson, E; Saurin, W; Perrin, D; Bachellier, S; Hofnung, M. Palindromic units are part of a new bacterial interspersed mosaic element (BIME). Nucleic Acids Res. 1991 Apr 11;19(7):1375–1383. [PubMed]
  • Higgins, CF; Ames, GF; Barnes, WM; Clement, JM; Hofnung, M. A novel intercistronic regulatory element of prokaryotic operons. Nature. 1982 Aug 19;298(5876):760–762. [PubMed]
  • Karlin, S; Blaisdell, BE; Sapolsky, RJ; Cardon, L; Burge, C. Assessments of DNA inhomogeneities in yeast chromosome III. Nucleic Acids Res. 1993 Feb 11;21(3):703–711. [PubMed]
  • Karlin, S; Brendel, V. Chance and statistical significance in protein and DNA sequence analysis. Science. 1992 Jul 3;257(5066):39–49. [PubMed]
  • Karlin, S; Macken, C. Assessment of inhomogeneities in an E. coli physical map. Nucleic Acids Res. 1991 Aug 11;19(15):4241–4246. [PubMed]
  • Kohara, Y; Akiyama, K; Isono, K. The physical map of the whole E. coli chromosome: application of a new strategy for rapid analysis and sorting of a large genomic library. Cell. 1987 Jul 31;50(3):495–508. [PubMed]
  • Kozhukhin, CG; Pevzner, PA. Genome inhomogeneity is determined mainly by WW and SS dinucleotides. Comput Appl Biosci. 1991 Jan;7(1):39–49. [PubMed]
  • Krawiec, S; Riley, M. Organization of the bacterial chromosome. Microbiol Rev. 1990 Dec;54(4):502–539. [PubMed]
  • Kröger, M; Wahl, R; Schachtel, G; Rice, P. Compilation of DNA sequences of Escherichia coli (update 1992). Nucleic Acids Res. 1992 May 11;20 Suppl:2119–2144. [PubMed]
  • Kunisawa, T; Nakamura, M. Identification of regulatory building blocks in Escherichia coli genome. Protein Seq Data Anal. 1991 Jul;4(1):43–47. [PubMed]
  • Lawther, RP; Calhoun, DH; Adams, CW; Hauser, CA; Gray, J; Hatfield, GW. Molecular basis of valine resistance in Escherichia coli K-12. Proc Natl Acad Sci U S A. 1981 Feb;78(2):922–925. [PubMed]
  • Leung, MY; Blaisdell, BE; Burge, C; Karlin, S. An efficient algorithm for identifying matches with errors in multiple long molecular sequences. J Mol Biol. 1991 Oct 20;221(4):1367–1378. [PubMed]
  • Masters, M. The Escherichia coli chromosome and its replication. Curr Opin Cell Biol. 1989 Apr;1(2):241–249. [PubMed]
  • McClelland, M; Jones, R; Patel, Y; Nelson, M. Restriction endonucleases for pulsed field mapping of bacterial genomes. Nucleic Acids Res. 1987 Aug 11;15(15):5985–6005. [PubMed]
  • Merkl, R; Kröger, M; Rice, P; Fritz, HJ. Statistical evaluation and biological interpretation of non-random abundance in the E. coli K-12 genome of tetra- and pentanucleotide sequences related to VSP DNA mismatch repair. Nucleic Acids Res. 1992 Apr 11;20(7):1657–1662. [PubMed]
  • Milkman, R; Stoltzfus, A. Molecular evolution of the Escherichia coli chromosome. II. Clonal segments. Genetics. 1988 Oct;120(2):359–366. [PubMed]
  • Nussinov, R. Nearest neighbor nucleotide patterns. Structural and biological implications. J Biol Chem. 1981 Aug 25;256(16):8458–8462. [PubMed]
  • O'Day, K; Lopilato, J; Wright, A. Physical locations of bglA and serA on the Escherichia coli K-12 chromosome. J Bacteriol. 1991 Mar;173(5):1571. [PubMed]
  • Platt, T. Transcription termination and the regulation of gene expression. Annu Rev Biochem. 1986;55:339–372. [PubMed]
  • Richardson, JP. Transcription termination. Crit Rev Biochem Mol Biol. 1993;28(1):1–30. [PubMed]
  • Rosenberg, M; Court, D. Regulatory sequences involved in the promotion and termination of RNA transcription. Annu Rev Genet. 1979;13:319–353. [PubMed]
  • Rudd, KE; Miller, W; Werner, C; Ostell, J; Tolstoshev, C; Satterfield, SG. Mapping sequenced E.coli genes by computer: software, strategies and examples. Nucleic Acids Res. 1991 Feb 11;19(3):637–647. [PubMed]
  • Sengstag, C; Iida, S; Hiestand-Nauer, R; Arber, W. Terminal inverted repeats of prokaryotic transposable element IS186 which can generate duplications of variable length at an identical target sequence. Gene. 1986;49(1):153–156. [PubMed]
  • Sharples, GJ; Lloyd, RG. A novel repeated DNA sequence located in the intergenic regions of bacterial chromosomes. Nucleic Acids Res. 1990 Nov 25;18(22):6503–6508. [PubMed]
  • Stahl, FW; Thomason, LC; Siddiqi, I; Stahl, MM. Further tests of a recombination model in which chi removes the RecD subunit from the RecBCD enzyme of Escherichia coli. Genetics. 1990 Nov;126(3):519–533. [PubMed]
  • Turnbough, CL, Jr; Hicks, KL; Donahue, JP. Attenuation control of pyrBI operon expression in Escherichia coli K-12. Proc Natl Acad Sci U S A. 1983 Jan;80(2):368–372. [PubMed]
  • Yanofsky, C. Transcription attenuation. J Biol Chem. 1988 Jan 15;263(2):609–612. [PubMed]
  • Yura, T; Mori, H; Nagai, H; Nagata, T; Ishihama, A; Fujita, N; Isono, K; Mizobuchi, K; Nakata, A. Systematic sequencing of the Escherichia coli genome: analysis of the 0-2.4 min region. Nucleic Acids Res. 1992 Jul 11;20(13):3305–3308. [PubMed]
  • Bhagwat, AS; McClelland, M. DNA mismatch correction by Very Short Patch repair may have altered the abundance of oligonucleotides in the E. coli genome. Nucleic Acids Res. 1992 Apr 11;20(7):1663–1668. [PubMed]
  • Birkenbihl, RP; Vielmetter, W. Completion of the IS map in E. coli: IS186 positions on the E. coli K12 chromosome. Mol Gen Genet. 1991 Apr;226(1-2):318–320. [PubMed]
  • Blaisdell, BE; Rudd, KE; Matin, A; Karlin, S. Significant dispersed recurrent DNA sequences in the Escherichia coli genome. Several new groups. J Mol Biol. 1993 Feb 20;229(4):833–848. [PubMed]
  • Burge, C; Campbell, AM; Karlin, S. Over- and under-representation of short oligonucleotides in DNA sequences. Proc Natl Acad Sci U S A. 1992 Feb 15;89(4):1358–1362. [PubMed]
  • Campbell, JL; Kleckner, N. E. coli oriC and the dnaA gene promoter are sequestered from dam methyltransferase following the passage of the chromosomal replication fork. Cell. 1990 Sep 7;62(5):967–979. [PubMed]