Bacterial Genome Submission Examples
Figure 1: Sample FASTA-formatted sequence
>HTE831 [organism=Oceanobacillus iheyensis] [strain=HTE831] actttcaaaaaaatcagcgtaaaaaacatactaatttgggcaaattcccacctgttttta gggacatttttctttgaattagagcctcagcagctcgtcattgctgaattttcttgaagt [etc.]
Figure 2: Sequin table format
>Feature HTE831 1830 2966 gene gene dnaN locus_tag OBB_0002 1830 2966 CDS product DNA-directed DNA polymerase III beta chain EC_number 2.7.7.7 protein_id gnl|ncbi|OBB_0002 3219 3440 gene locus_tag OBB_0003 3219 3440 CDS product hypothetical protein protein_id gnl|ncbi|OBB_0003 3443 4552 gene gene recF locus_tag OBB_0004 3443 4552 CDS product RecF function DNA repair and genetic recombination protein_id gnl|ncbi|OBB_0004 5109 7034 gene gene gyrB locus_tag OBB_0006 5109 7034 CDS product DNA gyrase subunit B EC_number 5.99.1.3 protein_id gnl|ncbi|OBB_0006 45081 44806 gene gene abrB locus_tag OBB_0045 45081 44806 CDS product AbrB protein_id gnl|ncbi|OBB_0045 function transcriptional pleiotropic regulator 64225 64758 gene locus_tag OBB_0064 64225 64758 CDS product stage V sporulation protein T function transcriptional regulator protein_id gnl|ncbi|OBB_0064 84524 85393 gene locus_tag OBB_0082 84524 85393 CDS product chaperonin product heat shock protein 33 protein_id gnl|ncbi|OBB_0082 89569 91050 gene locus_tag OBB_0088 89569 91050 CDS product lysine-tRNA ligase EC_number 6.1.1.6 protein_id gnl|ncbi|OBB_0088 91493 96462 operon operon rrnA 91493 93058 gene gene rrsA locus_tag OBB_0089 91493 93058 rRNA product 16S ribosomal RNA 93292 96213 gene gene rrlA locus_tag OBB_0090 93292 96213 rRNA product 23S ribosomal RNA 96347 96462 gene gene rrfA locus_tag OBB_0091 96347 96462 rRNA product 5S ribosomal RNA 96468 96744 operon operon trnC 96468 96543 gene gene trnV locus_tag OBB_0092 96468 96543 tRNA product tRNA-Val 96545 96620 gene gene trnT locus_tag OBB_0093 96545 96620 tRNA product tRNA-Thr 96669 96744 gene gene trnK locus_tag OBB_0094 96669 96744 tRNA product tRNA-Lys 1914923 1914066 gene gene folD locus_tag OBB_1880 1914923 1914066 CDS product bifunctional methylenetetrahydrofolate dehydrogenase (NADP+)/methenyltetrahydrofolate cyclohydrolase EC_number 1.5.1.5 EC_number 3.5.4.9 protein_id gnl|ncbi|OB1880
Figure 3: GenBank flatfile
LOCUS OB_HTE831 3630528 bp DNA circular BCT 11-DEC-2002 DEFINITION Oceanobacillus iheyensis HTE831, complete genome. ACCESSION VERSION KEYWORDS . SOURCE Oceanobacillus iheyensis HTE831 ORGANISM Oceanobacillus iheyensis HTE831 Bacteria; Firmicutes; Bacillales; Oceanobacillus. REFERENCE 1 (bases 1 to 3630528) AUTHORS Takami,H., Takaki,Y. and Uchiyama,I. TITLE Genome sequence of Oceanobacillus iheyensis isolated from the Iheya Ridge and its unexpected adaptive capabilities to extreme environments JOURNAL Nucleic Acids Res. 30 (18), 3927-3935 (2002) PUBMED 12235376 REFERENCE 2 (bases 1 to 3630528) AUTHORS Takami,H., Takaki,Y. and Chee,G. TITLE Direct Submission JOURNAL Submitted (26-DEC-2001) Hideto Takami, Japan Marine Science and Technology Center, Deep-sea Microorganisms Research Group; 2-15 Natsushima-cho, Yokosuka, Kanagawa 237-0061, Japan FEATURES Location/Qualifiers source 1..3630528 /organism="Oceanobacillus iheyensis HTE831" /strain="HTE831" /db_xref="taxon:221109" gene 1830..2966 /gene="dnaN" /locus_tag="OBB_0002" CDS 1830..2966 /gene="dnaN" /locus_tag="OBB_0002" /EC_number="2.7.7.7" /codon_start=1 /transl_table=11 /product="DNA-directed DNA polymerase III beta chain" /translation="MRFTIQRDKLINGVSNVMKAISARTVIPILTGMKIEVKNHGVTL TGSDSDISIEYYIPIEEDGIVHVENIEEGTIILQAKYFPDIVRKLPESTVDIVVDDQL NVRITSGKAEFNLNGQSAEEYPQLPKVQTENSFELPIDLLKSMIKQTVFAVSTMETRP ILTGVNLKLVDNSLSFTATDSHRLARREIPVSNAPIEISQIVVPGKSLNELNKILGDS EETVEISVTNNQILFRTKHLNFLSRLLDGNYPETSRLIPEQSKTKIQLKTKELLGTID RASLLAKEERNNVVKFNAPGNSMIEISSNSPEVGNVVEEITADQMEGEDVKISFSSKY MIDALKAIEYDEVQIEFTGAMRPFIIRPVGDDSILQLILPVRTY" gene 3219..3440 /locus_tag="OBB_0003" CDS 3219..3440 /locus_tag="OBB_0003" /codon_start=1 /transl_table=11 /product="hypothetical protein" /translation="MHEQIQIDTEYITLGQLIKLLNFLESGGMVKTFLQEEGALVNGH LEQRRGRKLYPKDVVEIQGIGSYIVIKED" gene 3443..4552 /gene="recF" /locus_tag="OBB_0004" CDS 3443..4552 /gene="recF" /locus_tag="OBB_0004" /function="DNA repair and genetic recombination" /codon_start=1 /transl_table=11 /product="RecF" /translation="MHIEKLELTNYRNYDQLEIAFDDQINVIIGENAQGKTNLMEAIY VLSFARSHRTPREKELIQWDKDYAKIEGRITKRNQSIPLQISITSKGKKAKVNHLEQH RLSDYIGSVNVVMFAPEDLTIVKGAPQIRRRFMDMELGQIQPTYIYHLAQYQKVLKQR NHLLKQLQRKPNSDTTMLEVLTDQLIEHASILLERRFIYLELLRKWAQPIHRGISREL EQLEIQYSPSIEVSEDANKEKIGNIYQMKFAEVKQKEIERGTTLAGPHRDDLIFFVNG KDVQTYGSQGQQRTTALSIKLAEIELIYQEVGEYPILLLDDVLSELDDYRQSHLLNTI QGKVQTFVSTTSVEGIHHETLQQAELFRVTDGVVN" gene 5109..7034 /gene="gyrB" /locus_tag="OBB_0006" CDS 5109..7034 /gene="gyrB" /locus_tag="OBB_0006" /EC_number="5.99.1.3" /codon_start=1 /transl_table=11 /product="DNA gyrase subunit B" /translation="MSMEDKITENQEYGADQIQVLEGLEAVRKRPGMYIGSTSEKGLH HLVWEIVDNSIDEALAGYCDHIQVVVEEDNSITVKDNGRGIPVDIQQKTGRPALEVIM TVLHAGGKFGGGGYKVSGGLHGVGASVVNALSSELEVYVHRDGKVHFLSFKKGVPDGE IKVIGDTDITGTVTHFRPDTEIFTETTEYNFDTLEQRLRELAFLNKGLKISIEDKRTD REQVTYHYEGGISSYVEFINKNKEVLHEPFFAEGEDQGISVEVAIQYNDGFASNLYSF ANNIHTYEGGSHEVGFRSGLTRIINDYAKKNGLIKDGDSNLSGDDVREGMTTIVSIKH PDPQFEGQTKTKLGNSEVRAITDGVFSEAFSKFLYENPSTAKIIVEKGLMASRARLAA KKARELTRRKSNLEISNLPGKLADCSSRDAAISELYIVEGDSAGGSAKSGRDRHFQAI LPLRGKILNVEKARLDRILSNNEVRAMITALGSGVGEEFDISKARYHKIVIMTDADVD GAHIRTLLLTFFYRYMRPLIEQGYIYIAQPPLYQVKQGKTVNYAYNDKELDRILNEIP KAPKPNIQRYKGLGEMNADQLWDTTMDPDTRTLLQVELSDAIDADQVFDMLMGDKVEP RRIFIEENAQYVKNLDI" gene complement(44806..45081) /gene="abrB" /locus_tag="OBB_0045" CDS complement(44806..45081) /gene="abrB" /locus_tag="OBB_0045" /function="transcriptional pleiotropic regulator" /codon_start=1 /transl_table=11 /product="AbrB" /translation="MKSTGIVRKVDELGRVVIPIELRRTLDIHEKDTMEIYVDNDKIV LKKYKPNMTCQVTGEVSDENLSIANGNLVLSPAGAQILLEEIQSRFK" gene 64225..64758 /locus_tag="OBB_0064" CDS 64225..64758 /locus_tag="OBB_0064" /function="transcriptional regulator" /codon_start=1 /transl_table=11 /product="stage V sporulation protein T" /translation="MKATGIVRRIDDLGRVVVPKEIRRTLRIREGDPLEIFVDREGEV ILKKYSPINELGHFAKEYAEALFQSLQTPVMITDRDDVIAVAGESKKEYLNKPISNAI ADTIEGRSQVFEVDTKSMEIIDGQEQQLQSYCIHPVIANGDPIGCVLIFSKEEKLSKI EQKAAETASTFLAKQME" gene 84524..85393 /locus_tag="OBB_0082" CDS 84524..85393 /locus_tag="OBB_0082" /note="heat shock protein 33" /codon_start=1 /transl_table=11 /product="chaperonin" /translation="MKDYLIKATANNGKIRAYAVQSTNTIEEARRRQDTFATASAALG RTITITAMMGAMLKGDDSITTKVMGNGPLGAIVADADADGHVRGYVTNPHVDFDLNDK GKLDVARAVGTEGNISVIKDLGLKDFFTGETPIVSGEISEDFTYYYATSEQLPSAVGA GVLVNPDHTILAAGGFIVQVMPGAEEEVINELEDQIQAIPAISSLIREGKSPEEILTQ LFGEECLTIHEKMPIEFRCKCSKDRLAQAIIGLGNDEIQAMIEEDQGAEATCHFCNEK YHFTEEELEDLKQ" gene 89569..91050 /locus_tag="OBB_0088" CDS 89569..91050 /locus_tag="OBB_0088" /EC_number="6.1.1.6" /codon_start=1 /transl_table=11 /product="lysine-tRNA ligase" /translation="MSEELNEHMQVRRDKLAEHMEKGLDPFGGKFERSHQATDLIEKY DSYSKEELEETTDEVTIAGRLMTKRGKGKAGFAHIQDLSGQIQLYVRKDMIGDDAYEV FKSADLGDIVGVTGVMFKTNVGEISVKAKQFQLLTKSLRPLPEKYHGLKDIEQRYRQR YLDLITNPDSRGTFVSRSKIIQSMREYLNGQGFLEVETPMMHSIPGGASARPFITHHN ALDIELYMRIAIELHLKRLMVGGLEKVYEIGRVFRNEGVSTRHNPEFTMIELYEAYAD YHDIMELTENLVAHIAKQVHGSTTITYGEHEINLEPKWTRLHIVDAVKDATGVDFWKE VSDEEARALAKEHGVQVTESMSYGHVVNEFFEQKVEETLIQPTFIHGHPVEISPLAKK NKEDERFTDRFELFIVGREHANAFSELNDPIDQRARFEAQVKERAEGNDEAHYMDEDF LEALEYGMPPTGGLGIGVDRLVMLLTNSPSIRDVLLFPQMRTK" operon 91493..96462 /operon="rrnA" gene 91493..93058 /gene="rrsA" /locus_tag="OBB_0089" /operon="rrnA" rRNA 91493..93058 /gene="rrsA" /locus_tag="OBB_0089" /operon="rrnA" /product="16S ribosomal RNA" gene 93292..96213 /gene="rrlA" /locus_tag="OBB_0090" /operon="rrnA" rRNA 93292..96213 /gene="rrlA" /locus_tag="OBB_0090" /operon="rrn" /product="23S ribosomal RNA" gene 96347..96462 /gene="rrfA" /locus_tag="OBB_0091" /operon="rrnA" >rRNA 96347..96462 /gene="rrfA" /locus_tag="OBB_0091" /operon="rrnA" /product="5S ribosomal RNA" operon 96468..96744 /operon="trnC" gene 96468..96543 /gene="trnV" /locus_tag="OBB_0092" /operon="trnC" tRNA 96468..96543 /gene="trnV" /locus_tag="OBB_0092" /operon="trnC" /product="tRNA-Val" gene 96545..96620 /gene="trnT" /locus_tag="OBB_0093" /operon="trnC" tRNA 96545..96620 /gene="trnT" /locus_tag="OBB_0093" /operon="trnC" /product="tRNA-Thr" gene 96669..96744 /gene="trnK" /locus_tag="OBB_0094" /operon="trnC" tRNA 96669..96744 /gene="trnK" /locus_tag="OBB_0094" /operon="trnC" /product="tRNA-Lys" gene complement(1914066..1914923) /gene="folD" /locus_tag="OBB_1880" CDS complement(1914066..1914923) /gene="folD" /EC_number="1.5.1.5" /EC_number="3.5.4.9" /locus_tag="OBB_1880" /codon_start=1 /transl_table=11 /product="bifunctional methylenetetrahydrofolate dehydrogenase (NADP+)/ methenyltetrahydrofolate cyclohydrolase" /translation="MATLLNGKELSEELKQKMKIEVDELKEKGLTPHLTVILVGDNPA SKSYVKGKEKACAVTGISSNLIELPENISQDELLQIIDEQNNDDSVHGILVQLPLPDQ MDEQKIIHAISPAKDVDGFHPINVGKMMTGEDTFIPCTPYGILTMLRSKDISLEGKHA VIIGRSNIVGKPIGLLLLQENATVTYTHSRTKNLQEITKQADILIVAIGRAHAINADY IKEDAVVIDVGINRKDDGKLTGDVDFESAEQKASYITPVPRGVGPMTITMLLKNTIKA AKGLNDVER" BASE COUNT 1165552 a 648314 c 647106 g1169556 t ORIGIN 1 actttcaaaa aaatcagcgt aaaaaacata ctaatttggg caaattccca cctgttttta 61 gggacatttt tctttgaatt agagcctcag cagctcgtca ttgctgaatt ttcttgaagt
For an additional examples see GenBank Accession Number CP000141.
Revised November 17, 2008