pmc logo imageJournal ListSearchpmc logo image
Logo of molcellbMol Cell Biol SubscriptionsMol Cell Biol Web Site
Mol Cell Biol. 1988 April; 8(4): 1385–1397.
PMCID: PMC363295
Unit-length line-1 transcripts in human teratocarcinoma cells.
J Skowronski, T G Fanning, and M F Singer
Laboratory of Biochemistry, National Cancer Institute, Bethesda, Maryland 20892.
Abstract
We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons.
Full text
Full text is available as a scanned copy of the original print version. Get a printable copy (PDF file) of the complete article (2.8M), or click on a page image below to browse page by page. Links to PubMed are also available for Selected References.
Images in this article
Click on the image to see a larger version.
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
  • Adams, JW; Kaufman, RE; Kretschmer, PJ; Harrison, M; Nienhuis, AW. A family of long reiterated DNA sequences, one copy of which is next to the human beta globin gene. Nucleic Acids Res. 1980 Dec 20;8(24):6113–6128. [PubMed]
  • Andrews, PW; Damjanov, I; Simon, D; Banting, GS; Carlin, C; Dracopoli, NC; Føgh, J. Pluripotent embryonal carcinoma clones derived from the human teratocarcinoma cell line Tera-2. Differentiation in vivo and in vitro. Lab Invest. 1984 Feb;50(2):147–162. [PubMed]
  • Bennett, KL; Hastie, ND. Looking for relationships between the most repeated dispersed DNA sequences in the mouse: small R elements are found associated consistently with long MIF repeats. EMBO J. 1984 Feb;3(2):467–472. [PubMed]
  • Benton, WD; Davis, RW. Screening lambdagt recombinant clones by hybridization to single plaques in situ. Science. 1977 Apr 8;196(4286):180–182. [PubMed]
  • Bernstein, LB; Manser, T; Weiner, AM. Human U1 small nuclear RNA genes: extensive conservation of flanking sequences suggests cycles of gene amplification and transposition. Mol Cell Biol. 1985 Sep;5(9):2159–2171. [PubMed]
  • Bird, AP. CpG-rich islands and the function of DNA methylation. Nature. 1986 May 15;321(6067):209–213. [PubMed]
  • Birnstiel, ML; Busslinger, M; Strub, K. Transcription termination and 3' processing: the end is in site! Cell. 1985 Jun;41(2):349–359. [PubMed]
  • Burke, WD; Calalang, CC; Eickbush, TH. The site-specific ribosomal insertion element type II of Bombyx mori (R2Bm) contains the coding sequence for a reverse transcriptase-like enzyme. Mol Cell Biol. 1987 Jun;7(6):2221–2230. [PubMed]
  • Burton, FH; Loeb, DD; Chao, SF; Hutchison, CA, 3rd; Edgell, MH. Transposition of a long member of the L1 major interspersed DNA family into the mouse beta globin gene locus. Nucleic Acids Res. 1985 Jul 25;13(14):5071–5084. [PubMed]
  • Burton, FH; Loeb, DD; Voliva, CF; Martin, SL; Edgell, MH; Hutchison, CA., 3rd Conservation throughout mammalia and extensive protein-encoding capacity of the highly repeated DNA long interspersed sequence one. J Mol Biol. 1986 Jan 20;187(2):291–304. [PubMed]
  • D'Ambrosio, E; Waitzkin, SD; Witney, FR; Salemme, A; Furano, AV. Structure of the highly repeated, long interspersed DNA family (LINE or L1Rn) of the rat. Mol Cell Biol. 1986 Feb;6(2):411–424. [PubMed]
  • Demers, GW; Brech, K; Hardison, RC. Long interspersed L1 repeats in rabbit DNA are homologous to L1 repeats of rodents and primates in an open-reading-frame region. Mol Biol Evol. 1986 May;3(3):179–190. [PubMed]
  • DiGiovanni, L; Haynes, SR; Misra, R; Jelinek, WR. Kpn I family of long-dispersed repeated DNA sequences of man: evidence for entry into genomic DNA of DNA copies of poly(A)-terminated Kpn I RNAs. Proc Natl Acad Sci U S A. 1983 Nov;80(21):6533–6537. [PubMed]
  • Di Nocera, PP; Digan, ME; Dawid, IB. A family of oligo-adenylate-terminated transposable sequences in Drosophila melanogaster. J Mol Biol. 1983 Aug 25;168(4):715–727. [PubMed]
  • Doerfler, W. DNA methylation and gene activity. Annu Rev Biochem. 1983;52:93–124. [PubMed]
  • Dudley, JP. Discrete high molecular weight RNA transcribed from the long interspersed repetitive element L1Md. Nucleic Acids Res. 1987 Mar 25;15(6):2581–2592. [PubMed]
  • Enders, GH; Ganem, D; Varmus, H. Mapping the major transcripts of ground squirrel hepatitis virus: the presumptive template for reverse transcriptase is terminally redundant. Cell. 1985 Aug;42(1):297–308. [PubMed]
  • Fanning, T; Singer, M. The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins. Nucleic Acids Res. 1987 Mar 11;15(5):2251–2260. [PubMed]
  • Fanning, TG. Size and structure of the highly repetitive BAM HI element in mice. Nucleic Acids Res. 1983 Aug 11;11(15):5073–5091. [PubMed]
  • Fawcett, DH; Lister, CK; Kellett, E; Finnegan, DJ. Transposable elements controlling I-R hybrid dysgenesis in D. melanogaster are similar to mammalian LINEs. Cell. 1986 Dec 26;47(6):1007–1015. [PubMed]
  • Grimaldi, G; Skowronski, J; Singer, MF. Defining the beginning and end of KpnI family segments. EMBO J. 1984 Aug;3(8):1753–1759. [PubMed]
  • Hattori, M; Hidaka, S; Sakaki, Y. Sequence analysis of a KpnI family member near the 3' end of human beta-globin gene. Nucleic Acids Res. 1985 Nov 11;13(21):7813–7827. [PubMed]
  • Hattori, M; Kuhara, S; Takenaka, O; Sakaki, Y. L1 family of repetitive DNA sequences in primates may be derived from a sequence encoding a reverse transcriptase-related protein. Nature. 1986 Jun 5;321(6070):625–628. [PubMed]
  • Hattori, M; Sakaki, Y. Dideoxy sequencing method using denatured plasmid templates. Anal Biochem. 1986 Feb 1;152(2):232–238. [PubMed]
  • Hollis, GF; Hieter, PA; McBride, OW; Swan, D; Leder, P. Processed genes: a dispersed human immunoglobulin gene bearing evidence of RNA-type processing. Nature. 1982 Mar 25;296(5855):321–325. [PubMed]
  • Jacks, T; Varmus, HE. Expression of the Rous sarcoma virus pol gene by ribosomal frameshifting. Science. 1985 Dec 13;230(4731):1237–1242. [PubMed]
  • Jackson, M; Heller, D; Leinwand, L. Transcriptional measurements of mouse repeated DNA sequences. Nucleic Acids Res. 1985 May 10;13(9):3389–3403. [PubMed]
  • Kimmel, BE; ole-MoiYoi, OK; Young, JR. Ingi, a 5.2-kb dispersed sequence element from Trypanosoma brucei that carries half of a smaller mobile element at either end and has homology with mammalian LINEs. Mol Cell Biol. 1987 Apr;7(4):1465–1475. [PubMed]
  • Kole, LB; Haynes, SR; Jelinek, WR. Discrete and heterogeneous high molecular weight RNAs complementary to a long dispersed repeat family (a possible transposon) of human DNA. J Mol Biol. 1983 Apr 5;165(2):257–286. [PubMed]
  • Kozak, M. Bifunctional messenger RNAs in eukaryotes. Cell. 1986 Nov 21;47(4):481–483. [PubMed]
  • Lakshmikumaran, MS; D'Ambrosio, E; Laimins, LA; Lin, DT; Furano, AV. Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus. Mol Cell Biol. 1985 Sep;5(9):2197–2203. [PubMed]
  • Lee, TN; Singer, MF. Analysis of LINE-1 family sequences on a single monkey chromosome. Nucleic Acids Res. 1986 May 12;14(9):3859–3870. [PubMed]
  • Lerman, MI; Thayer, RE; Singer, MF. Kpn I family of long interspersed repeated DNA sequences in primates: polymorphism of family members and evidence for transcription. Proc Natl Acad Sci U S A. 1983 Jul;80(13):3966–3970. [PubMed]
  • Loeb, DD; Padgett, RW; Hardies, SC; Shehee, WR; Comer, MB; Edgell, MH; Hutchison, CA., 3rd The sequence of a large L1Md element reveals a tandemly repeated 5' end and several features found in retrotransposons. Mol Cell Biol. 1986 Jan;6(1):168–182. [PubMed]
  • Martin, SL; Voliva, CF; Burton, FH; Edgell, MH; Hutchison, CA., 3rd A large interspersed repeat found in mouse DNA contains a long open reading frame that evolves as if it encodes a protein. Proc Natl Acad Sci U S A. 1984 Apr;81(8):2308–2312. [PubMed]
  • Maxam, AM; Gilbert, W. Sequencing end-labeled DNA with base-specific chemical cleavages. Methods Enzymol. 1980;65(1):499–560. [PubMed]
  • Miyake, T; Migita, K; Sakaki, Y. Some KpnI family members are associated with the Alu family in the human genome. Nucleic Acids Res. 1983 Oct 11;11(19):6837–6846. [PubMed]
  • Morzycka-Wroblewska, E; Selker, EU; Stevens, JN; Metzenberg, RL. Concerted evolution of dispersed Neurospora crassa 5S RNA genes: pattern of sequence conservation between allelic and nonallelic genes. Mol Cell Biol. 1985 Jan;5(1):46–51. [PubMed]
  • Mottez, E; Rogan, PK; Manuelidis, L. Conservation in the 5' region of the long interspersed mouse L1 repeat: implications of comparative sequence analysis. Nucleic Acids Res. 1986 Apr 11;14(7):3119–3136. [PubMed]
  • Nomiyama, H; Obaru, K; Jinno, Y; Matsuda, I; Shimada, K; Miyata, T. Amplification of human argininosuccinate synthetase pseudogenes. J Mol Biol. 1986 Nov 20;192(2):221–233. [PubMed]
  • Norrander, J; Kempe, T; Messing, J. Construction of improved M13 vectors using oligodeoxynucleotide-directed mutagenesis. Gene. 1983 Dec;26(1):101–106. [PubMed]
  • Paterson, BM; Eldridge, JD. alpha-Cardiac actin is the major sarcomeric isoform expressed in embryonic avian skeletal muscle. Science. 1984 Jun 29;224(4656):1436–1438. [PubMed]
  • Peabody, DS; Berg, P. Termination-reinitiation occurs in the translation of mammalian cell mRNAs. Mol Cell Biol. 1986 Jul;6(7):2695–2703. [PubMed]
  • Peabody, DS; Subramani, S; Berg, P. Effect of upstream reading frames on translation efficiency in simian virus 40 recombinants. Mol Cell Biol. 1986 Jul;6(7):2704–2711. [PubMed]
  • Persico, MG; Viglietto, G; Martini, G; Toniolo, D; Paonessa, G; Moscatelli, C; Dono, R; Vulliamy, T; Luzzatto, L; D'Urso, M. Isolation of human glucose-6-phosphate dehydrogenase (G6PD) cDNA clones: primary structure of the protein and unusual 5' non-coding region. Nucleic Acids Res. 1986 Mar 25;14(6):2511–2522. [PubMed]
  • Potter, SS. Rearranged sequences of a human Kpn I element. Proc Natl Acad Sci U S A. 1984 Feb;81(4):1012–1016. [PubMed]
  • Rogers, JH. The origin and evolution of retroposons. Int Rev Cytol. 1985;93:187–279. [PubMed]
  • Sakaki, Y; Hattori, M; Fujita, A; Yoshioka, K; Kuhara, S; Takenaka, O. The LINE-1 family of primates may encode a reverse transcriptase-like protein. Cold Spring Harb Symp Quant Biol. 1986;51 Pt 1:465–469. [PubMed]
  • Sanger, F; Nicklen, S; Coulson, AR. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977 Dec;74(12):5463–5467. [PubMed]
  • Schmeckpeper, BJ; Scott, AF; Smith, KD. Transcripts homologous to a long repeated DNA element in the human genome. J Biol Chem. 1984 Jan 25;259(2):1218–1225. [PubMed]
  • SenGupta, DN; Zmudzka, BZ; Kumar, P; Cobianchi, F; Skowronski, J; Wilson, SH. Sequence of human DNA polymerase beta mRNA obtained through cDNA cloning. Biochem Biophys Res Commun. 1986 Apr 14;136(1):341–347. [PubMed]
  • Shafit-Zagardo, B; Brown, FL; Zavodny, PJ; Maio, JJ. Transcription of the KpnI families of long interspersed DNAs in human cells. Nature. 1983 Jul 21;304(5923):277–280. [PubMed]
  • Skowronski, J; Singer, MF. Expression of a cytoplasmic LINE-1 transcript is regulated in a human teratocarcinoma cell line. Proc Natl Acad Sci U S A. 1985 Sep;82(18):6050–6054. [PubMed]
  • Skowronski, J; Singer, MF. The abundant LINE-1 family of repeated DNA sequences in mammals: genes and pseudogenes. Cold Spring Harb Symp Quant Biol. 1986;51 Pt 1:457–464. [PubMed]
  • Soares, MB; Schon, E; Efstratiadis, A. Rat LINE1: the origin and evolution of a family of long interspersed middle repetitive DNA elements. J Mol Evol. 1985;22(2):117–133. [PubMed]
  • Stark, GR; Wahl, GM. Gene amplification. Annu Rev Biochem. 1984;53:447–491. [PubMed]
  • Sun, L; Paulson, KE; Schmid, CW; Kadyk, L; Leinwand, L. Non-Alu family interspersed repeats in human DNA and their transcriptional activity. Nucleic Acids Res. 1984 Mar 26;12(6):2669–2690. [PubMed]
  • Temin, HM. Reverse transcription in the eukaryotic genome: retroviruses, pararetroviruses, retrotransposons, and retrotranscripts. Mol Biol Evol. 1985 Nov;2(6):455–468. [PubMed]
  • Tuttleman, JS; Pourcel, C; Summers, J. Formation of the pool of covalently closed circular viral DNA in hepadnavirus-infected cells. Cell. 1986 Nov 7;47(3):451–460. [PubMed]
  • Voliva, CF; Martin, SL; Hutchison, CA, 3rd; Edgell, MH. Dispersal process associated with the L1 family of interspersed repetitive DNA sequences. J Mol Biol. 1984 Oct 5;178(4):795–813. [PubMed]
  • Wiedemann, LM; Perry, RP. Characterization of the expressed gene and several processed pseudogenes for the mouse ribosomal protein L30 gene family. Mol Cell Biol. 1984 Nov;4(11):2518–2528. [PubMed]
  • Weiner, AM; Deininger, PL; Efstratiadis, A. Nonviral retroposons: genes, pseudogenes, and transposable elements generated by the reverse flow of genetic information. Annu Rev Biochem. 1986;55:631–661. [PubMed]
  • Wilson, MC; Sawicki, SG; White, PA; Darnell, JE., Jr A correlation between the rate of poly(A) shortening and half-life of messenger RNA in adenovirus transformed cells. J Mol Biol. 1978 Nov 25;126(1):23–36. [PubMed]
  • Witney, FR; Furano, AV. Highly repeated DNA families in the rat. J Biol Chem. 1984 Aug 25;259(16):10481–10492. [PubMed]
  • Wolf, SF; Migeon, BR. Clusters of CpG dinucleotides implicated by nuclease hypersensitivity as control elements of housekeeping genes. Nature. 314(6010):467–469. [PubMed]
  • Yanisch-Perron, C; Vieira, J; Messing, J. Improved M13 phage cloning vectors and host strains: nucleotide sequences of the M13mp18 and pUC19 vectors. Gene. 1985;33(1):103–119. [PubMed]
  • Yoshinaka, Y; Katoh, I; Copeland, TD; Oroszlan, S. Murine leukemia virus protease is encoded by the gag-pol gene and is synthesized through suppression of an amber termination codon. Proc Natl Acad Sci U S A. 1985 Mar;82(6):1618–1622. [PubMed]