U.S. flag

An official website of the United States government

Escherichia coli strain H2F9R NODE_451_length_282_cov_0.729032, whole genome shotgun sequence

GenBank: JACGFQ010000331.1

FASTA Graphics 

LOCUS       JACGFQ010000331          282 bp    DNA     linear   BCT 02-MAR-2021
DEFINITION  Escherichia coli strain H2F9R NODE_451_length_282_cov_0.729032,
            whole genome shotgun sequence.
ACCESSION   JACGFQ010000331 JACGFQ010000000
VERSION     JACGFQ010000331.1
DBLINK      BioProject: PRJNA649083
            BioSample: SAMN15659098
KEYWORDS    WGS.
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 282)
  AUTHORS   Cen,D., Fang,L. and Sun,R.
  TITLE     Escherichia coli H2F9R, complete genome
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 282)
  AUTHORS   Cen,D., Fang,L. and Sun,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-JUL-2020) College of Veterinary Medicine, South China
            Agricultural University, Wushan, Tianhe District, Guangzhou,
            Guangdong 510642, China
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            Bacteria and source DNA available from South China Agricultural
            University.
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.6.2
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 20.0x
            Sequencing Technology  :: Illumina MiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 07/30/2020 18:46:12
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.12
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 5,039
            CDSs (total)                      :: 4,914
            Genes (coding)                    :: 4,711
            CDSs (with protein)               :: 4,711
            Genes (RNA)                       :: 125
            rRNAs                             :: 6, 7, 13 (5S, 16S, 23S)
            complete rRNAs                    :: 6, 1 (5S, 16S)
            partial rRNAs                     :: 6, 13 (16S, 23S)
            tRNAs                             :: 84
            ncRNAs                            :: 15
            Pseudo Genes (total)              :: 203
            CDSs (without protein)            :: 203
            Pseudo Genes (ambiguous residues) :: 1 of 203
            Pseudo Genes (frameshifted)       :: 63 of 203
            Pseudo Genes (incomplete)         :: 129 of 203
            Pseudo Genes (internal stop)      :: 37 of 203
            Pseudo Genes (multiple problems)  :: 22 of 203
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..282
                     /organism="Escherichia coli"
                     /mol_type="genomic DNA"
                     /submitter_seqid="NODE_451_length_282_cov_0.729032"
                     /strain="H2F9R"
                     /host="goose"
                     /db_xref="taxon:562"
                     /geo_loc_name="China: Yangjiang"
                     /collection_date="2018-07-07T09:21:18Z"
     gene            <1..>282
                     /locus_tag="H4F31_24750"
     CDS             <1..>282
                     /locus_tag="H4F31_24750"
                     /inference="COORDINATES: protein motif:HMM:NF012710.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=3
                     /transl_table=11
                     /product="L1 family major capsid protein"
                     /protein_id="MBN4724591.1"
                     /translation="VTTGDCPPLALINTPIEDGDMIDTGFGAMDFKVLQESKAEVPLD
                     IVQSTCKYPDYLKMSADAYGDSMWFYLRREQLFARHYFSRAGKVGETIP"
ORIGIN      
        1 aggtcaccac cggcgactgc ccccctctgg ccctgatcaa cacccccatc gaggacggcg
       61 acatgatcga caccggcttc ggcgccatgg acttcaaggt gctgcaggaa agcaaggccg
      121 aggtccccct ggacatcgtg cagagcacct gcaagtaccc cgactacctg aagatgagcg
      181 ccgacgccta cggcgacagc atgtggttct acctgcggcg ggagcagctg ttcgcccggc
      241 actacttcag cagagccggc aaagtgggcg agacaatccc gg
//
Feature
Display: FASTA GenBank Help
Details

Supplemental Content

Change region shown

Customize view

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...
External link. Please review our privacy policy.