U.S. flag

An official website of the United States government

Escherichia coli strain H2F18W NODE_547_length_280_cov_0.143791, whole genome shotgun sequence

GenBank: JACGFS010000435.1

FASTA Graphics 

LOCUS       JACGFS010000435          280 bp    DNA     linear   BCT 02-MAR-2021
DEFINITION  Escherichia coli strain H2F18W NODE_547_length_280_cov_0.143791,
            whole genome shotgun sequence.
ACCESSION   JACGFS010000435 JACGFS010000000
VERSION     JACGFS010000435.1
DBLINK      BioProject: PRJNA649083
            BioSample: SAMN15659102
KEYWORDS    WGS.
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 280)
  AUTHORS   Cen,D., Fang,L. and Sun,R.
  TITLE     Escherichia coli H2F18W, complete genome
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 280)
  AUTHORS   Cen,D., Fang,L. and Sun,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-JUL-2020) College of Veterinary Medicine, South China
            Agricultural University, Wushan, Tianhe District, Guangzhou,
            Guangdong 510642, China
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            Bacteria and source DNA available from South China Agricultural
            University.
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: SPAdes v. 3.6.2
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 20.0x
            Sequencing Technology  :: Illumina MiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 07/30/2020 19:02:19
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.12
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 4,979
            CDSs (total)                      :: 4,854
            Genes (coding)                    :: 4,649
            CDSs (with protein)               :: 4,649
            Genes (RNA)                       :: 125
            rRNAs                             :: 7, 11, 10 (5S, 16S, 23S)
            complete rRNAs                    :: 6, 1 (5S, 23S)
            partial rRNAs                     :: 1, 11, 9 (5S, 16S, 23S)
            tRNAs                             :: 82
            ncRNAs                            :: 15
            Pseudo Genes (total)              :: 205
            CDSs (without protein)            :: 205
            Pseudo Genes (ambiguous residues) :: 0 of 205
            Pseudo Genes (frameshifted)       :: 60 of 205
            Pseudo Genes (incomplete)         :: 127 of 205
            Pseudo Genes (internal stop)      :: 41 of 205
            Pseudo Genes (multiple problems)  :: 21 of 205
            CRISPR Arrays                     :: 2
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..280
                     /organism="Escherichia coli"
                     /mol_type="genomic DNA"
                     /submitter_seqid="NODE_547_length_280_cov_0.143791"
                     /strain="H2F18W"
                     /host="goose"
                     /db_xref="taxon:562"
                     /geo_loc_name="China: Yangjiang"
                     /collection_date="2018-07-07T09:29:20Z"
     gene            complement(<1..>280)
                     /locus_tag="H4F36_24600"
     CDS             complement(<1..>280)
                     /locus_tag="H4F36_24600"
                     /inference="COORDINATES: protein motif:HMM:NF012710.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=3
                     /transl_table=11
                     /product="L1 family major capsid protein"
                     /protein_id="MBN4664002.1"
                     /translation="MWFYLRREQLFARHYFNRAGKVGETIPAELYLKGSNGREPPPSS
                     VYVATPSGSMITSEAQLFNKPYWLQRDQGHNNGICWGNQLFVTVVDTTR"
ORIGIN      
        1 cgggtagtgt ccacgacggt cacgaacagc tggttgcccc agcagatgcc gttgttgtgt
       61 ccctggtccc gctgcagcca gtagggcttg ttgaacagct gggcctcgct cgtgatcatg
      121 ctgccgctgg gggtggccac gtacacgctg ctgggaggag gctcccgtcc gttgctgccc
      181 ttcaggtaca gctcggcggg gattgtctcg cccactttgc cggctctgtt gaagtagtgc
      241 cgggcgaaca gctgctcccg ccgcaggtag aaccacatga
//
Feature
Display: FASTA GenBank Help
Details

Supplemental Content

Change region shown

Customize view

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...
External link. Please review our privacy policy.