LOCUS ON928946 29585 bp RNA linear VRL 06-JUL-2022
DEFINITION Severe acute respiratory syndrome coronavirus 2 isolate
SARS-CoV-2/human/PRI/PR-UPRRP-582/2020 ORF1ab polyprotein (ORF1ab),
ORF1a polyprotein (ORF1ab), surface glycoprotein (S), ORF3a protein
(ORF3a), envelope protein (E), membrane glycoprotein (M), and ORF6
protein (ORF6) genes, complete cds; ORF7a protein (ORF7a), ORF7b
(ORF7b), and ORF8 protein (ORF8) genes, partial cds; nucleocapsid
phosphoprotein (N) gene, complete cds; and ORF10 protein (ORF10)
gene, partial cds.
ACCESSION ON928946
VERSION ON928946.1
KEYWORDS .
SOURCE Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
ORGANISM Severe acute respiratory syndrome coronavirus 2
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
Betacoronavirus; Sarbecovirus; Betacoronavirus pandemicum.
REFERENCE 1 (bases 1 to 29585)
AUTHORS Van Belleghem,S., Papa,R., Planas,S., Ortiz,Y., Zenon,C., Cora
Huertas,L., Candelaria Velez,I., Cruz,M., Rodriguez Orengo,J.,
Godoy,F., Carlos Velez,J. and Sariol,C.
TITLE Genomic surveillance of SARS-CoV-2 in Puerto Rico
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 29585)
AUTHORS Van Belleghem,S., Papa,R., Planas,S., Ortiz,Y., Zenon,C., Cora
Huertas,L., Candelaria Velez,I., Cruz,M., Rodriguez Orengo,J.,
Godoy,F., Carlos Velez,J. and Sariol,C.
TITLE Direct Submission
JOURNAL Submitted (05-JUL-2022) Biology, University of Puerto RIco, Ave
Universidad, San Juan 00931, Puerto Rico
COMMENT ##Assembly-Data-START##
Assembly Method :: DRAGEN Genome Pipeline - Illumina v.
December-2022
Sequencing Technology :: Illumina
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..29585
/organism="Severe acute respiratory syndrome coronavirus
2"
/mol_type="genomic RNA"
/isolate="SARS-CoV-2/human/PRI/PR-UPRRP-582/2020"
/isolation_source="oral swab"
/host="Homo sapiens"
/db_xref="taxon:2697049"
/geo_loc_name="Puerto Rico"
/collection_date="2020-12-15"
gene 212..21489
/gene="ORF1ab"
CDS join(212..13402,13402..21489)
/gene="ORF1ab"
/ribosomal_slippage
/codon_start=1
/product="ORF1ab polyprotein"
/protein_id="UTB53416.1"
/translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQ
HLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGE
TLGVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQEN
WNTKHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQ
LDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFP
LNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTG
DFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESG
LKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNL
LEILQKEKVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGN
FKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAA
ITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKL
KPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLV
NKFLALCADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEII
FLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEK
YCALAPNMMVTNNTFTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNER
CSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEF
KLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPE
EEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSG
YLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESD
DYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLA
PLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIA
EIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGN
LHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKV
PTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSIISNEKQEILGTVSWNLREM
LAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLIN
TLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSS
KTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVIT
FDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPH
NSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIK
WADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELG
DVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQ
IPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSK
ETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDNYY
KKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVT
FFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWST
KPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVKTTEVVGD
IILKPANNIKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSV
PWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRSTNSRIK
ASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLIYSTAA
LGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETI
QITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNSWLM
WLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVEC
TTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPI
NPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPIN
VIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNT
FSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECL
KLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNITLIW
NVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWLK
QLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFAN
KHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLPR
VFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVAY
ESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSGR
WVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVAIVVTCLA
YYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLTFYLTND
VSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFEE
AALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLA
KALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGL
WLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLK
LKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSCG
SVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV
LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVL
DMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWL
LLTILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLL
PSLATVAYFNMVYMPASWVMRIMTWLDMVDTSFKLKDCVMYASAVVLLILMTARTVYD
DGARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGVVF
MCVEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQ
EFRYMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVL
QQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEM
LDNRATLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFD
RDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIIN
NARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKI
VQLSEISMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDD
NALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPK
VKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKD
YLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPK
GFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQS
FLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDN
LIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKY
TMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGE
RVRQALLKTVQFCDAMRNAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYY
SLLMPILTLTRALTAESHVDTDLTNPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHP
NCVNCLDDRCILHCANFNVLFSTVFPLTSFGPLVRKIFVDGVPFVVSTGYHFRELGVV
HNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKP
GNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLL
FVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALF
AYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGAT
VVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTT
CCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTA
NVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSD
DAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTM
LVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQE
YADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQAV
GACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDV
TQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDWTNAG
DYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPL
NRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSHTVMP
LSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFA
IGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNS
TLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAP
RTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKD
KSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASK
ILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKVGILCIMSDRDLY
DKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHPTQAPTHLSVDTKFKTEGLC
VDVPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHAT
REAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIPLMYK
GLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDRR
ATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVAS
CDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVKAALLADKFPVL
HDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDGVCLFWNC
NVDRYPANSIVCRFDTRVLSNFNLPGCDGGXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXSLWVYKQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPVSIINNTVYTK
VDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIWDYKRDA
PAHISTIGVCSMTDXAXXPXETICAPLTVFFDGRVDGQVDLFRNARNGVLITEGSVKG
LQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFKPRSQ
MEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPFELE
DFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTIDY
TEISFMLWCKDGHVETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSATL
PKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTL
LVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTY
ICGFIQQKLALGGSVAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQIN
DMILSLLSKGRLIIRENNRVVISSDVLVNN"
mat_peptide 212..751
/gene="ORF1ab"
/product="leader protein"
mat_peptide 752..2665
/gene="ORF1ab"
/product="nsp2"
mat_peptide 2666..8497
/gene="ORF1ab"
/product="nsp3"
mat_peptide 8498..9997
/gene="ORF1ab"
/product="nsp4"
mat_peptide 9998..10915
/gene="ORF1ab"
/product="3C-like proteinase"
mat_peptide 10916..11776
/gene="ORF1ab"
/product="nsp6"
mat_peptide 11777..12025
/gene="ORF1ab"
/product="nsp7"
mat_peptide 12026..12619
/gene="ORF1ab"
/product="nsp8"
mat_peptide 12620..12958
/gene="ORF1ab"
/product="nsp9"
mat_peptide 12959..13375
/gene="ORF1ab"
/product="nsp10"
mat_peptide join(13376..13402,13402..16170)
/gene="ORF1ab"
/product="RNA-dependent RNA polymerase"
mat_peptide 16171..17973
/gene="ORF1ab"
/product="helicase"
mat_peptide 17974..19554
/gene="ORF1ab"
/product="3'-to-5' exonuclease"
mat_peptide 19555..20592
/gene="ORF1ab"
/product="endoRNAse"
mat_peptide 20593..21486
/gene="ORF1ab"
/product="2'-O-ribose methyltransferase"
CDS 212..13417
/gene="ORF1ab"
/codon_start=1
/product="ORF1a polyprotein"
/protein_id="UTB53417.1"
/translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQ
HLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGE
TLGVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQEN
WNTKHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQ
LDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFP
LNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTG
DFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESG
LKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNL
LEILQKEKVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGN
FKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAA
ITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKL
KPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLV
NKFLALCADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEII
FLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEK
YCALAPNMMVTNNTFTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNER
CSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEF
KLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPE
EEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSG
YLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESD
DYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLA
PLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIA
EIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGN
LHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKV
PTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSIISNEKQEILGTVSWNLREM
LAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLIN
TLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSS
KTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVIT
FDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPH
NSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIK
WADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELG
DVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQ
IPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSK
ETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDNYY
KKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVT
FFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWST
KPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVKTTEVVGD
IILKPANNIKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSV
PWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRSTNSRIK
ASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLIYSTAA
LGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETI
QITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNSWLM
WLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVEC
TTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPI
NPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPIN
VIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNT
FSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECL
KLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNITLIW
NVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWLK
QLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFAN
KHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLPR
VFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVAY
ESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSGR
WVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVAIVVTCLA
YYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLTFYLTND
VSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFEE
AALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLA
KALNDFSNSGSDVLYQPPQISITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGL
WLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLK
LKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRHNFTIKGSFLNGSCG
SVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNV
LAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVL
DMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHWL
LLTILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLL
PSLATVAYFNMVYMPASWVMRIMTWLDMVDTSFKLKDCVMYASAVVLLILMTARTVYD
DGARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGVVF
MCVEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQ
EFRYMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVLLSVL
QQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEM
LDNRATLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFD
RDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIIN
NARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKI
VQLSEISMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDD
NALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPK
VKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAKAYKD
YLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPK
GFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSADAQS
FLNGFAV"
mat_peptide 212..751
/gene="ORF1ab"
/product="leader protein"
mat_peptide 752..2665
/gene="ORF1ab"
/product="nsp2"
mat_peptide 2666..8497
/gene="ORF1ab"
/product="nsp3"
mat_peptide 8498..9997
/gene="ORF1ab"
/product="nsp4"
mat_peptide 9998..10915
/gene="ORF1ab"
/product="3C-like proteinase"
mat_peptide 10916..11776
/gene="ORF1ab"
/product="nsp6"
mat_peptide 11777..12025
/gene="ORF1ab"
/product="nsp7"
mat_peptide 12026..12619
/gene="ORF1ab"
/product="nsp8"
mat_peptide 12620..12958
/gene="ORF1ab"
/product="nsp9"
mat_peptide 12959..13375
/gene="ORF1ab"
/product="nsp10"
mat_peptide 13376..13414
/gene="ORF1ab"
/product="nsp11"
stem_loop 13410..13437
/gene="ORF1ab"
/note="Coronavirus frameshifting stimulation element
stem-loop 1"
stem_loop 13422..13476
/gene="ORF1ab"
/note="Coronavirus frameshifting stimulation element
stem-loop 2"
gap 19225..19490
/estimated_length=266
gap 21097..21298
/estimated_length=202
gene 21497..25309
/gene="S"
CDS 21497..25309
/gene="S"
/codon_start=1
/product="surface glycoprotein"
/protein_id="UTB53418.1"
/translation="MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFR
SSVLHSTQDLFLPFFSNVTWFHVISGTNGTKRFDNPVLPFNDGVYFASIEKSNIIRGW
IFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLDHKNNKSWMESEFRVYSSANN
CTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPIIVREPEDLPQGFS
ALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKY
NENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCP
FDEVFNATRFASVYAWNRKRISNCVADYSVLYNFAPFFAFKCYGVSPTKLNDLCFTNV
YADSFVIRGNEVSQIAPGQTGNIADYNYKLPDDFTGCVIAWNSNKLDSKVGGNYNYLY
RLFRKSNLKPFERDISTEIYQAGNKPCNGVAGFNCYFPLRSYGFRPTYGVGHQPYRVV
VLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLKGTGVLTESNKKFLPFQQFGRDI
ADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHAD
QLTPTWRVYSTGSNVFQTRAGCLIGAEYVNNSYECDIPIGAGICASYQTQTKSHRRAR
SVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYIC
GDSTECSNLLLQYGSFCTQLKRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKYFGGFN
FSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFKGLT
VLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLY
ENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNHNAQALNTLVKQLSSKFGAISS
VLNDIFSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECV
LGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPR
EGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFK
EELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYE
QYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVL
KGVKLHYT"
gene 25318..26145
/gene="ORF3a"
CDS 25318..26145
/gene="ORF3a"
/codon_start=1
/product="ORF3a protein"
/protein_id="UTB53419.1"
/translation="MDLFMRIFTIGTVTLKQGEIKDATPSDFVRATATIPIQASLPFG
WLIVGVALLAVFQSASKIITLKKRWQLALSKGVHFVCNLLLLFVTVYSHLLLVAAGLE
APFLYLYALVYFLQSINFVRIIMRLLLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPY
NSVTSSIVITSGDGTTSPISEHDYQIGGYTEKWESGVKDCVVLHSYFTSDYYQLYSTQ
LSTDTGVEHVTFFIYNKIVDEPEEHVQIHTIDGSSGVVNPVMEPIYDEPTTTTSVPL"
gene 26170..26397
/gene="E"
CDS 26170..26397
/gene="E"
/codon_start=1
/product="envelope protein"
/protein_id="UTB53420.1"
/translation="MYSFVSEEIGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCC
NIVNVSLVKPSFYVYSRVKNLNSSRVPDLLV"
gene 26448..27116
/gene="M"
CDS 26448..27116
/gene="M"
/codon_start=1
/product="membrane glycoprotein"
/protein_id="UTB53421.1"
/translation="MAGSNGTITVEELKKLLEEWNLVIGFLFLTWICLLQFAYANRNR
FLYIIKLIFLWLLWPVTLTCFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRL
FARTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCD
IKDLPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIA
LLVQ"
gene 27127..27312
/gene="ORF6"
CDS 27127..27312
/gene="ORF6"
/codon_start=1
/product="ORF6 protein"
/protein_id="UTB53422.1"
/translation="MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSL
TENKYSQLDEEQPMEID"
gene 27319..>27642
/gene="ORF7a"
CDS 27319..>27642
/gene="ORF7a"
/codon_start=1
/product="ORF7a protein"
/protein_id="UTB53423.1"
/translation="MKIILFLALITLATCELYHYQECVRGTTVLLKEPCSSGTYEGNS
PFHPLADNXXXXTCFXTQXAXAXXXXXXXXXXXXXXXXXXXLFIRQEEVQXLYSPIFL
IVAAXX"
gene <27686..>27750
/gene="ORF7b"
CDS <27686..>27750
/gene="ORF7b"
/codon_start=2
/product="ORF7b"
/protein_id="UTB53424.1"
/translation="ELSLIDFYLCFLAFLLFLVLI"
gap 27751..28013
/estimated_length=263
gene <28014..28184
/gene="ORF8"
CDS <28014..28184
/gene="ORF8"
/codon_start=1
/product="ORF8 protein"
/protein_id="UTB53425.1"
/translation="GSKSPIQYIDIGNYTVSCLPFTINCQEPKLGSLVVRCSFYEDFL
EYHDVRVVLDFI"
gene 28199..29449
/gene="N"
CDS 28199..29449
/gene="N"
/codon_start=1
/product="nucleocapsid phosphoprotein"
/protein_id="UTB53426.1"
/translation="MSDNGPQNQRNALRITFGGPSDSTGSNQNGGARSKQRRPQGLPN
NTASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDLS
PRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQ
GTTLPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSKRTSPARMAGNGGDAALAL
LLLDRLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGP
EQTQGNFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAI
KLDDKDPNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQXVTLL
PAADLDDFSKQLQQSMSSADSTQA"
gene 29474..>29585
/gene="ORF10"
CDS 29474..>29585
/gene="ORF10"
/codon_start=1
/product="ORF10 protein"
/protein_id="UTB53427.1"
/translation="MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNL"
stem_loop 29525..29560
/gene="ORF10"
/note="Coronavirus 3' UTR pseudoknot stem-loop 1"
stem_loop 29545..29573
/gene="ORF10"
/note="Coronavirus 3' UTR pseudoknot stem-loop 2"
ORIGIN
1 agatctgttc tctaaacgaa ctttaaaatc tgtgtggctg tcactcggct gcatgcttag
61 tgcactcacg cagtataatt aataactaat tactgtcgtt gacaggacac gagtaactcg
121 tctatcttct gcaggctgct tacggtttcg tccgtgttgc agccgatcat cagcacatct
181 aggttttgtc cgggtgtgac cgaaaggtaa gatggagagc cttgtccctg gtttcaacga
241 gaaaacacac gtccaactca gtttgcctgt tttacaggtt cgcgacgtgc tcgtacgtgg
301 ctttggagac tccgtggagg aggtcttatc agaggcacgt caacatctta aagatggcac
361 ttgtggctta gtagaagttg aaaaaggcgt tttgcctcaa cttgaacagc cctatgtgtt
421 catcaaacgt tcggatgctc gaactgcacc tcatggtcat gttatggttg agctggtagc
481 agaactcgaa ggcattcagt acggtcgtag tggtgagaca cttggtgtcc ttgtccctca
541 tgtgggcgaa ataccagtgg cttaccgcaa ggttcttctt cgtaagaacg gtaataaagg
601 agctggtggc catagttacg gcgccgatct aaagtcattt gacttaggcg acgagcttgg
661 cactgatcct tatgaagatt ttcaagaaaa ctggaacact aaacatagca gtggtgttac
721 ccgtgaactc atgcgtgagc ttaacggagg ggcatacact cgctatgtcg ataacaactt
781 ctgtggccct gatggctacc ctcttgagtg cattaaagac cttctagcac gtgctggtaa
841 agcttcatgc actttgtccg aacaactgga ctttattgac actaagaggg gtgtatactg
901 ctgccgtgaa catgagcatg aaattgcttg gtacacggaa cgttctgaaa agagctatga
961 attgcagaca ccttttgaaa ttaaattggc aaagaaattt gacaccttca atggggaatg
1021 tccaaatttt gtatttccct taaattccat aatcaagact attcaaccaa gggttgaaaa
1081 gaaaaagctt gatggcttta tgggtagaat tcgatctgtc tatccagttg cgtcaccaaa
1141 tgaatgcaac caaatgtgcc tttcaactct catgaagtgt gatcattgtg gtgaaacttc
1201 atggcagacg ggcgattttg ttaaagccac ttgcgaattt tgtggcactg agaatttgac
1261 taaagaaggt gccactactt gtggttactt accccaaaat gctgttgtta aaatttattg
1321 tccagcatgt cacaattcag aagtaggacc tgagcatagt cttgccgaat accataatga
1381 atctggcttg aaaaccattc ttcgtaaggg tggtcgcact attgcctttg gaggctgtgt
1441 gttctcttat gttggttgcc ataacaagtg tgcctattgg gttccacgtg ctagcgctaa
1501 cataggttgt aaccatacag gtgttgttgg agaaggttcc gaaggtctta atgacaacct
1561 tcttgaaata ctccaaaaag agaaagtcaa catcaatatt gttggtgact ttaaacttaa
1621 tgaagagatc gccattattt tggcatcttt ttctgcttcc acaagtgctt ttgtggaaac
1681 tgtgaaaggt ttggattata aagcattcaa acaaattgtt gaatcctgtg gtaattttaa
1741 agttacaaaa ggaaaagcta aaaaaggtgc ctggaatatt ggtgaacaga aatcaatact
1801 gagtcctctt tatgcatttg catcagaggc tgctcgtgtt gtacgatcaa ttttctcccg
1861 cactcttgaa actgctcaaa attctgtgcg tgttttacag aaggccgcta taacaatact
1921 agatggaatt tcacagtatt cactgagact cattgatgct atgatgttca catctgattt
1981 ggctactaac aatctagttg taatggccta cattacaggt ggtgttgttc agttgacttc
2041 gcagtggcta actaacatct ttggcactgt ttatgaaaaa ctcaaacccg tccttgattg
2101 gcttgaagag aagtttaagg aaggtgtaga gtttcttaga gacggttggg aaattgttaa
2161 atttatctca acctgtgctt gtgaaattgt cggtggacaa attgtcacct gtgcaaagga
2221 aattaaggag agtgttcaga cattctttaa gcttgtaaat aaatttttgg ctttgtgtgc
2281 tgactctatc attattggtg gagctaaact taaagccttg aatttaggtg aaacatttgt
2341 cacgcactca aagggattgt acagaaagtg tgttaaatcc agagaagaaa ctggcctact
2401 catgcctcta aaagctccaa aagaaattat cttcttagag ggagaaacac ttcccacaga
2461 agtgttaaca gaggaagttg tcttgaaaac tggtgattta caaccattag aacaacctac
2521 tagtgaagct gttgaagctc cattggttgg tacaccagtt tgtattaacg ggcttatgtt
2581 gctcgaaatc aaagacacag aaaagtactg tgcccttgca cctaatatga tggtaacaaa
2641 caataccttc acactcaaag gcggtgcacc aacaaaggtt acttttggtg atgacactgt
2701 gatagaagtg caaggttaca agagtgtgaa tatcactttt gaacttgatg aaaggattga
2761 taaagtactt aatgagaggt gctctgccta tacagttgaa ctcggtacag aagtaaatga
2821 gttcgcctgt gttgtggcag atgctgtcat aaaaactttg caaccagtat ctgaattact
2881 tacaccactg ggcattgatt tagatgagtg gagtatggct acatactact tatttgatga
2941 gtctggtgag tttaaattgg cttcacatat gtattgttct ttttaccctc cagatgagga
3001 tgaagaagaa ggtgattgtg aagaagaaga gtttgagcca tcaactcaat atgagtatgg
3061 tactgaagat gattaccaag gtaaaccttt ggaatttggt gccacttctg ctgctcttca
3121 acctgaagaa gagcaagaag aagattggtt agatgatgat agtcaacaaa ctgttggtca
3181 acaagacggc agtgaggaca atcagacaac tactattcaa acaattgttg aggttcaacc
3241 tcaattagag atggaactta caccagttgt tcagactatt gaagtgaata gttttagtgg
3301 ttatttaaaa cttactgaca atgtatacat taaaaatgca gacattgtgg aagaagctaa
3361 aaaggtaaaa ccaacagtgg ttgttaatgc agccaatgtt taccttaaac atggaggagg
3421 tgttgcagga gccttaaata aggctactaa caatgccatg caagttgaat ctgatgatta
3481 catagctact aatggaccac ttaaagtggg tggtagttgt gttttaagcg gacacaatct
3541 tgctaaacac tgtcttcatg ttgtcggccc aaatgttaac aaaggtgaag acattcaact
3601 tcttaagagt gcttatgaaa attttaatca gcacgaagtt ctacttgcac cattattatc
3661 agctggtatt tttggtgctg accctataca ttctttaaga gtttgtgtag atactgttcg
3721 cacaaatgtc tacttagctg tctttgataa aaatctctat gacaaacttg tttcaagctt
3781 tttggaaatg aagagtgaaa agcaagttga acaaaagatc gctgagattc ctaaagagga
3841 agttaagcca tttataactg aaagtaaacc ttcagttgaa cagagaaaac aagatgataa
3901 gaaaatcaaa gcttgtgttg aagaagttac aacaactctg gaagaaacta agttcctcac
3961 agaaaacttg ttactttata ttgacattaa tggcaatctt catccagatt ctgccactct
4021 tgttagtgac attgacatca ctttcttaaa gaaagatgct ccatatatag tgggtgatgt
4081 tgttcaagag ggtgttttaa ctgctgtggt tatacctact aaaaaggctg gtggcactac
4141 tgaaatgcta gcgaaagctt tgagaaaagt gccaacagac aattatataa ccacttaccc
4201 gggtcagggt ttaaatggtt acactgtaga ggaggcaaag acagtgctta aaaagtgtaa
4261 aagtgccttt tacattctac catctattat ctctaatgag aagcaagaaa ttcttggaac
4321 tgtttcttgg aatttgcgag aaatgcttgc acatgcagaa gaaacacgca aattaatgcc
4381 tgtctgtgtg gaaactaaag ccatagtttc aactatacag cgtaaatata agggtattaa
4441 aatacaagag ggtgtggttg attatggtgc tagattttac ttttacacca gtaaaacaac
4501 tgtagcgtca cttatcaaca cacttaacga tctaaatgaa actcttgtta caatgccact
4561 tggctatgta acacatggct taaatttgga agaagctgct cggtatatga gatctctcaa
4621 agtgccagct acagtttctg tttcttcacc tgatgctgtt acagcgtata atggttatct
4681 tacttcttct tctaaaacac ctgaagaaca ttttattgaa accatctcac ttgctggttc
4741 ctataaagat tggtcctatt ctggacaatc tacacaacta ggtatagaat ttcttaagag
4801 aggtgataaa agtgtatatt acactagtaa tcctaccaca ttccacctag atggtgaagt
4861 tatcaccttt gacaatctta agacacttct ttctttgaga gaagtgagga ctattaaggt
4921 gtttacaaca gtagacaaca ttaacctcca cacgcaagtt gtggacatgt caatgacata
4981 tggacaacag tttggtccaa cttatttgga tggagctgat gttactaaaa taaaacctca
5041 taattcacat gaaggtaaaa cattttatgt tttacctaat gatgacactc tacgtgttga
5101 ggcttttgag tactaccaca caactgatcc tagttttctg ggtaggtaca tgtcagcatt
5161 aaatcacact aaaaagtgga aatacccaca agttaatggt ttaacttcta ttaaatgggc
5221 agataacaac tgttatcttg ccactgcatt gttaacactc caacaaatag agttgaagtt
5281 taatccacct gctctacaag atgcttatta cagagcaagg gctggtgaag cggctaactt
5341 ttgtgcactt atcttagcct actgtaataa gacagtaggt gagttaggtg atgttagaga
5401 aacaatgagt tacttgtttc aacatgccaa tttagattct tgcaaaagag tcttgaacgt
5461 ggtgtgtaaa acttgtggac aacagcagac aacccttaag ggtgtagaag ctgttatgta
5521 catgggcaca ctttcttatg aacaatttaa gaaaggtgtt cagatacctt gtacgtgtgg
5581 taaacaagct acaaaatatc tagtacaaca ggagtcacct tttgttatga tgtcagcacc
5641 acctgctcag tatgaactta agcatggtac atttacttgt gctagtgagt acactggtaa
5701 ttaccagtgt ggtcactata aacatataac ttctaaagaa actttgtatt gcatagacgg
5761 tgctttactt acaaagtcct cagaatacaa aggtcctatt acggatgttt tctacaaaga
5821 aaacagttac acaacaacca taaaaccagt tacttataaa ttggatggtg ttgtttgtac
5881 agaaattgac cctaagttgg acaattatta taagaaagac aattcttatt tcacagagca
5941 accaattgat cttgtaccaa accaaccata tccaaacgca agcttcgata attttaagtt
6001 tgtatgtgat aatatcaaat ttgctgatga tttaaaccag ttaactggtt ataagaaacc
6061 tgcttcaaga gagcttaaag ttacattttt ccctgactta aatggtgatg tggtggctat
6121 tgattataaa cactacacac cctcttttaa gaaaggagct aaattgttac ataaacctat
6181 tgtttggcat gttaacaatg caactaataa agccacgtat aaaccaaata cctggtgtat
6241 acgttgtctt tggagcacaa aaccagttga aacatcaaat tcgtttgatg tactgaagtc
6301 agaggacgcg cagggaatgg ataatcttgc ctgcgaagat ctaaaaccag tctctgaaga
6361 agtagtggaa aatcctacca tacagaaaga cgttcttgag tgtaatgtga aaactaccga
6421 agttgtagga gacattatac ttaaaccagc aaataatata aaaattacag aagaggttgg
6481 ccacacagat ctaatggctg cttatgtaga caattctagt cttactatta agaaacctaa
6541 tgaattatct agagtattag gtttgaaaac ccttgctact catggtttag ctgctgttaa
6601 tagtgtccct tgggatacta tagctaatta tgctaagcct tttcttaaca aagttgttag
6661 tacaactact aacatagtta cacggtgttt aaaccgtgtt tgtactaatt atatgcctta
6721 tttctttact ttattgctac aattgtgtac ttttactaga agtacaaatt ctagaattaa
6781 agcatctatg ccgactacta tagcaaagaa tactgttaag agtgtcggta aattttgtct
6841 agaggcttca tttaattatt tgaagtcacc taatttttct aaactgataa atattataat
6901 ttggttttta ctattaagtg tttgcctagg ttctttaatc tactcaaccg ctgctttagg
6961 tgttttaatg tctaatttag gcatgccttc ttactgtact ggttacagag aaggctattt
7021 gaactctact aatgtcacta ttgcaaccta ctgtactggt tctatacctt gtagtgtttg
7081 tcttagtggt ttagattctt tagacaccta tccttcttta gaaactatac aaattaccat
7141 ttcatctttt aaatgggatt taactgcttt tggcttagtt gcagagtggt ttttggcata
7201 tattcttttc actaggtttt tctatgtact tggattggct gcaatcatgc aattgttttt
7261 cagctatttt gcagtacatt ttattagtaa ttcttggctt atgtggttaa taattaatct
7321 tgtacaaatg gccccgattt cagctatggt tagaatgtac atcttctttg catcatttta
7381 ttatgtatgg aaaagttatg tgcatgttgt agacggttgt aattcatcaa cttgtatgat
7441 gtgttacaaa cgtaatagag caacaagagt cgaatgtaca actattgtta atggtgttag
7501 aaggtccttt tatgtctatg ctaatggagg taaaggcttt tgcaaactac acaattggaa
7561 ttgtgttaat tgtgatacat tctgtgctgg tagtacattt attagtgatg aagttgcgag
7621 agacttgtca ctacagttta aaagaccaat aaatcctact gaccagtctt cttacatcgt
7681 tgatagtgtt acagtgaaga atggttccat ccatctttac tttgataaag ctggtcaaaa
7741 gacttatgaa agacattctc tctctcattt tgttaactta gacaacctga gagctaataa
7801 cactaaaggt tcattgccta ttaatgttat agtttttgat ggtaaatcaa aatgtgaaga
7861 atcatctgca aaatcagcgt ctgtttacta cagtcagctt atgtgtcaac ctatactgtt
7921 actagatcag gcattagtgt ctgatgttgg tgatagtgcg gaagttgcag ttaaaatgtt
7981 tgatgcttac gttaatacgt tttcatcaac ttttaacgta ccaatggaaa aactcaaaac
8041 actagttgca actgcagaag ctgaacttgc aaagaatgtg tccttagaca atgtcttatc
8101 tacttttatt tcagcagctc ggcaagggtt tgttgattca gatgtagaaa ctaaagatgt
8161 tgttgaatgt cttaaattgt cacatcaatc tgacatagaa gttactggcg atagttgtaa
8221 taactatatg ctcacctata acaaagttga aaacatgaca ccccgtgacc ttggtgcttg
8281 tattgactgt agtgcgcgtc atattaatgc gcaggtagca aaaagtcaca acattacttt
8341 gatatggaac gttaaagatt tcatgtcatt gtctgaacaa ctacgaaaac aaatacgtag
8401 tgctgctaaa aagaataact taccttttaa gttgacatgt gcaactacta gacaagttgt
8461 taatgttgta acaacaaaga tagcacttaa gggtggtaaa attgttaata attggttgaa
8521 gcagttaatt aaagttacac ttgtgttcct ttttgttgct gctattttct atttaataac
8581 acctgttcat gtcatgtcta aacatactga cttttcaagt gaaatcatag gatacaaggc
8641 tattgatggt ggtgtcactc gtgacatagc atctacagat acttgttttg ctaacaaaca
8701 tgctgatttt gacacatggt ttagccagcg tggtggtagt tatactaatg acaaagcttg
8761 cccattgatt gctgcagtca taacaagaga agtgggtttt gtcgtgcctg gtttgcctgg
8821 cacgatatta cgcacaacta atggtgactt tttgcatttc ttacctagag tttttagtgc
8881 agttggtaac atctgttaca caccatcaaa acttatagag tacactgact ttgcaacatc
8941 agcttgtgtt ttggctgctg aatgtacaat ttttaaagat gcttctggta agccagtacc
9001 atattgttat gataccaatg tactagaagg ttctgttgct tatgaaagtt tacgccctga
9061 cacacgttat gtgctcatgg atggctctat tattcaattt cctaacacct accttgaagg
9121 ttctgttaga gtggtaacaa cttttgattc tgagtactgt aggcacggca cttgtgaaag
9181 atcagaagct ggtgtttgtg tatctactag tggtagatgg gtacttaaca atgattatta
9241 cagatcttta ccaggagttt tctgtggtgt agatgctgta aatttactta ctaatatgtt
9301 tacaccacta attcaaccta ttggtgcttt ggacatatca gcatctatag tagctggtgg
9361 tattgtagct atcgtagtaa catgccttgc ctactatttt atgaggttta gaagagcttt
9421 tggtgaatac agtcatgtag ttgcctttaa tactttacta ttccttatgt cattcactgt
9481 actctgttta acaccagttt actcattctt acctggtgtt tattctgtta tttacttgta
9541 cttgacattt tatcttacta atgatgtttc ttttttagca catattcagt ggatggttat
9601 gttcacacct ttagtacctt tctggataac aattgcttat atcatttgta tttccacaaa
9661 gcatttctat tggttcttta gtaattacct aaagagacgt gtagtcttta atggtgtttc
9721 ctttagtact tttgaagaag ctgcgctgtg cacctttttg ttaaataaag aaatgtatct
9781 aaagttgcgt agtgatgtgc tattacctct tacgcaatat aatagatact tagctcttta
9841 taataagtac aagtatttta gtggagcaat ggatacaact agctacagag aagctgcttg
9901 ttgtcatctc gcaaaggctc tcaatgactt cagtaactca ggttctgatg ttctttacca
9961 accaccacaa atctctatca cctcagctgt tttgcagagt ggttttagaa aaatggcatt
10021 cccatctggt aaagttgagg gttgtatggt acaagtaact tgtggtacaa ctacacttaa
10081 cggtctttgg cttgatgacg tagtttactg tccaagacat gtgatctgca cctctgaaga
10141 catgcttaac cctaattatg aagatttact cattcgtaag tctaatcata atttcttggt
10201 acaggctggt aatgttcaac tcagggttat tggacattct atgcaaaatt gtgtacttaa
10261 gcttaaggtt gatacagcca atcctaagac acctaagtat aagtttgttc gcattcaacc
10321 aggacagact ttttcagtgt tagcttgtta caatggttca ccatctggtg tttaccaatg
10381 tgctatgagg cacaatttca ctattaaggg ttcattcctt aatggttcat gtggtagtgt
10441 tggttttaac atagattatg actgtgtctc tttttgttac atgcaccata tggaattacc
10501 aactggagtt catgctggca cagacttaga aggtaacttt tatggacctt ttgttgacag
10561 gcaaacagca caagcagctg gtacggacac aactattaca gttaatgttt tagcttggtt
10621 gtacgctgct gttataaatg gagacaggtg gtttctcaat cgatttacca caactcttaa
10681 tgactttaac cttgtggcta tgaagtacaa ttatgaacct ctaacacaag accatgttga
10741 catactagga cctctttctg ctcaaactgg aattgccgtt ttagatatgt gtgcttcatt
10801 aaaagaatta ctgcaaaatg gtatgaatgg acgtaccata ttgggtagtg ctttattaga
10861 agatgaattt acaccttttg atgttgttag acaatgctca ggtgttactt tccaaagtgc
10921 agtgaaaaga acaatcaagg gtacacacca ctggttgtta ctcacaattt tgacttcact
10981 tttagtttta gtccagagta ctcaatggtc tttgttcttt tttttgtatg aaaatgcctt
11041 tttacctttt gctatgggta ttattgctat gtctgctttt gcaatgatgt ttgtcaaaca
11101 taagcatgca tttctctgtt tgtttttgtt accttctctt gccactgtag cttattttaa
11161 tatggtctat atgcctgcta gttgggtgat gcgtattatg acatggttgg atatggttga
11221 tactagtttt aagctaaaag actgtgttat gtatgcatca gctgtagtgt tactaatcct
11281 tatgacagca agaactgtgt atgatgatgg tgctaggaga gtgtggacac ttatgaatgt
11341 cttgacactc gtttataaag tttattatgg taatgcttta gatcaagcca tttccatgtg
11401 ggctcttata atctctgtta cttctaacta ctcaggtgta gttacaactg tcatgttttt
11461 ggccagaggt gttgttttta tgtgtgttga gtattgccct attttcttca taactggtaa
11521 tacacttcag tgtataatgc tagtttattg tttcttaggc tatttttgta cttgttactt
11581 tggcctcttt tgtttactca accgctactt tagactgact cttggtgttt atgattactt
11641 agtttctaca caggagttta gatatatgaa ttcacaggga ctactcccac ccaagaatag
11701 catagatgcc ttcaaactca acattaaatt gttgggtgtt ggtggcaaac cttgtatcaa
11761 agtagccact gtacagtcta aaatgtcaga tgtaaagtgc acatcagtag tcttactctc
11821 agttttgcaa caactcagag tagaatcatc atctaaattg tgggctcaat gtgtccagtt
11881 acacaatgac attctcttag ctaaagatac tactgaagcc tttgaaaaaa tggtttcact
11941 actttctgtt ttgctttcca tgcagggtgc tgtagacata aacaagcttt gtgaagaaat
12001 gctggacaac agggcaacct tacaagctat agcctcagag tttagttccc ttccatcata
12061 tgcagctttt gctactgctc aagaagctta tgagcaggct gttgctaatg gtgattctga
12121 agttgttctt aaaaagttga agaagtcttt gaatgtggct aaatctgaat ttgaccgtga
12181 tgcagccatg caacgtaagt tggaaaagat ggctgatcaa gctatgaccc aaatgtataa
12241 acaggctaga tctgaggaca agagggcaaa agttactagt gctatgcaga caatgctttt
12301 cactatgctt agaaagttgg ataatgatgc actcaacaac attatcaaca atgcaagaga
12361 tggttgtgtt cccttgaaca taatacctct tacaacagca gccaaactaa tggttgtcat
12421 accagactat aacacatata aaaatacgtg tgatggtaca acatttactt atgcatcagc
12481 attgtgggaa atccaacagg ttgtagatgc agatagtaaa attgttcaac ttagtgaaat
12541 tagtatggac aattcaccta atttagcatg gcctcttatt gtaacagctt taagggccaa
12601 ttctgctgtc aaattacaga ataatgagct tagtcctgtt gcactacgac agatgtcttg
12661 tgctgccggt actacacaaa ctgcttgcac tgatgacaat gcgttagctt actacaacac
12721 aacaaaggga ggtaggtttg tacttgcact gttatccgat ttacaggatt tgaaatgggc
12781 tagattccct aagagtgatg gaactggtac tatctataca gaactggaac caccttgtag
12841 gtttgttaca gacacaccta aaggtcctaa agtgaagtat ttatacttta ttaaaggatt
12901 aaacaaccta aatagaggta tggtacttgg tagtttagct gccacagtac gtctacaagc
12961 tggtaatgca acagaagtgc ctgccaattc aactgtatta tctttctgtg cttttgctgt
13021 agatgctgct aaagcttaca aagattatct agctagtggg ggacaaccaa tcactaattg
13081 tgttaagatg ttgtgtacac acactggtac tggtcaggca ataacagtca caccggaagc
13141 caatatggat caagaatcct ttggtggtgc atcgtgttgt ctgtactgcc gttgccacat
13201 agatcatcca aatcctaaag gattttgtga cttaaaaggt aagtatgtac aaatacctac
13261 aacttgtgct aatgaccctg tgggttttac acttaaaaac acagtctgta ccgtctgcgg
13321 tatgtggaaa ggttatggct gtagttgtga tcaactccgc gaacccatgc ttcagtcagc
13381 tgatgcacaa tcgtttttaa acgggtttgc ggtgtaagtg cagcccgtct tacaccgtgc
13441 ggcacaggca ctagtactga tgtcgtatac agggcttttg acatctacaa tgataaagta
13501 gctggttttg ctaaattcct aaaaactaat tgttgtcgct tccaagaaaa ggacgaagat
13561 gacaatttaa ttgattctta ctttgtagtt aagagacaca ctttctctaa ctaccaacat
13621 gaagaaacaa tttataattt acttaaggat tgtccagctg ttgctaaaca tgacttcttt
13681 aagtttagaa tagacggtga catggtacca catatatcac gtcaacgtct tactaaatac
13741 acaatggcag acctcgtcta tgctttaagg cattttgatg aaggtaattg tgacacatta
13801 aaagaaatac ttgtcacata caattgttgt gatgatgatt atttcaataa aaaggactgg
13861 tatgattttg tagaaaaccc agatatatta cgcgtatacg ccaacttagg tgaacgtgta
13921 cgccaagctt tgttaaaaac agtacaattc tgtgatgcca tgcgaaatgc tggtattgtt
13981 ggtgtactga cattagataa tcaagatctc aatggtaact ggtatgattt cggtgatttc
14041 atacaaacca cgccaggtag tggagttcct gttgtagatt cttattattc attgttaatg
14101 cctatattaa ccttgaccag ggctttaact gcagagtcac atgttgacac tgacttaaca
14161 aacccttaca ttaagtggga tttgttaaaa tatgacttca cggaagagag gttaaaactc
14221 tttgaccgtt attttaaata ttgggatcag acataccacc caaattgtgt taactgtttg
14281 gatgacagat gcattctgca ttgtgcaaac tttaatgttt tattctctac agtgttccca
14341 cttacaagtt ttggaccact agtgagaaaa atatttgttg atggtgttcc atttgtagtt
14401 tcaactggat accacttcag agagctaggt gttgtacata atcaggatgt aaacttacat
14461 agctctagac ttagttttaa ggaattactt gtgtatgctg ctgaccctgc tatgcacgct
14521 gcttctggta atctattact agataaacgc actacgtgct tttcagtagc tgcacttact
14581 aacaatgttg cttttcaaac tgtcaaaccc ggtaatttta acaaagactt ctatgacttt
14641 gctgtgtcta agggtttctt taaggaagga agttctgttg aattaaaaca cttcttcttt
14701 gctcaggatg gtaatgctgc tatcagcgat tatgactact atcgttataa tctaccaaca
14761 atgtgtgata tcagacaact actatttgta gttgaagttg ttgataagta ctttgattgt
14821 tacgatggtg gctgtattaa tgctaaccaa gtcatcgtca acaacctaga caaatcagct
14881 ggttttccat ttaataaatg gggtaaggct agactttatt atgattcaat gagttatgag
14941 gatcaagatg cacttttcgc atatacaaaa cgtaatgtca tccctactat aactcaaatg
15001 aatcttaagt atgccattag tgcaaagaat agagctcgca ccgtagctgg tgtctctatc
15061 tgtagtacta tgaccaatag acagtttcat caaaaattat tgaaatcaat agccgccact
15121 agaggagcta ctgtagtaat tggaacaagc aaattctatg gtggttggca caatatgtta
15181 aaaactgttt atagtgatgt agaaaaccct caccttatgg gttgggatta tcctaaatgt
15241 gatagagcca tgcctaacat gcttagaatt atggcctcac ttgttcttgc tcgcaaacat
15301 acaacgtgtt gtagcttgtc acaccgtttc tatagattag ctaatgagtg tgctcaagta
15361 ttgagtgaaa tggtcatgtg tggcggttca ctatatgtta aaccaggtgg aacctcatca
15421 ggagatgcca caactgctta tgctaatagt gtttttaaca tttgtcaagc tgtcacggcc
15481 aatgttaatg cacttttatc tactgatggt aacaaaattg ccgataagta tgtccgcaat
15541 ttacaacaca gactttatga gtgtctctat agaaatagag atgttgacac agactttgtg
15601 aatgagtttt acgcatattt gcgtaaacat ttctcaatga tgatactctc tgacgatgct
15661 gttgtgtgtt tcaatagcac ttatgcatct caaggtctag tggctagcat aaagaacttt
15721 aagtcagttc tttattatca aaacaatgtt tttatgtctg aagcaaaatg ttggactgag
15781 actgacctta ctaaaggacc tcatgaattt tgctctcaac atacaatgct agttaaacag
15841 ggtgatgatt atgtgtacct tccttaccca gatccatcaa gaatcctagg ggccggctgt
15901 tttgtagatg atatcgtaaa aacagatggt acacttatga ttgaacggtt cgtgtcttta
15961 gctatagatg cttacccact tactaaacat cctaatcagg agtatgctga tgtctttcat
16021 ttgtacttac aatacataag aaagctacat gatgagttaa caggacacat gttagacatg
16081 tattctgtta tgcttactaa tgataacact tcaaggtatt gggaacctga gttttatgag
16141 gctatgtaca caccgcatac agtcttacag gctgttgggg cttgtgttct ttgcaattca
16201 cagacttcat taagatgtgg tgcttgcata cgtagaccat tcttatgttg taaatgctgt
16261 tacgaccatg tcatatcaac atcacataaa ttagtcttgt ctgttaatcc gtatgtttgc
16321 aatgctccag gttgtgatgt cacagatgtg actcaacttt acttaggagg tatgagctat
16381 tattgtaaat cacataaacc acccattagt tttccattgt gtgctaatgg acaagttttt
16441 ggtttatata aaaatacatg tgttggtagc gataatgtta ctgactttaa tgcaattgca
16501 acatgtgact ggacaaatgc tggtgattac attttagcta acacctgtac tgaaagactc
16561 aagctttttg cagcagaaac gctcaaagct actgaggaga catttaaact gtcttatggt
16621 attgctactg tacgtgaagt gctgtctgac agagaattac atctttcatg ggaagttggt
16681 aaacctagac caccacttaa ccgaaattat gtctttactg gttatcgtgt aactaaaaac
16741 agtaaagtac aaataggaga gtacaccttt gaaaaaggtg actatggtga tgctgttgtt
16801 taccgaggta caacaactta caaattaaat gttggtgatt attttgtgct gacatcacat
16861 acagtaatgc cattaagtgc acctacacta gtgccacaag agcactatgt tagaattact
16921 ggcttatacc caacactcaa tatctcagat gagttttcta gcaatgttgc aaattatcaa
16981 aaggttggta tgcaaaagta ttctacactc cagggaccac ctggtactgg taagagtcat
17041 tttgctattg gcctagctct ctactaccct tctgctcgca tagtgtatac agcttgctct
17101 catgccgctg ttgatgcact atgtgagaag gcattaaaat atttgcctat agataaatgt
17161 agtagaatta tacctgcacg tgctcgtgta gagtgttttg ataaattcaa agtgaattca
17221 acattagaac agtatgtctt ttgtactgta aatgcattgc ctgagacgac agcagatata
17281 gttgtctttg atgaaatttc aatggccaca aattatgatt tgagtgttgt caatgccaga
17341 ttacgtgcta agcactatgt gtacattggc gaccctgctc aattacctgc accacgcaca
17401 ttgctaacta agggcacact agaaccagaa tatttcaatt cagtgtgtag acttatgaaa
17461 actataggtc cagacatgtt cctcggaact tgtcggcgtt gtcctgctga aattgttgac
17521 actgtgagtg ctttggttta tgataataag cttaaagcac ataaagacaa atcagctcaa
17581 tgctttaaaa tgttttataa gggtgttatc acgcatgatg tttcatctgc aattaacagg
17641 ccacaaatag gcgtggtaag agaattcctt acacgtaacc ctgcttggag aaaagctgtc
17701 tttatttcac cttataattc acagaatgct gtagcctcaa agattttggg actaccaact
17761 caaactgttg attcatcaca gggctcagaa tatgactatg tcatattcac tcaaaccact
17821 gaaacagctc actcttgtaa tgtaaacaga tttaatgttg ctattaccag agcaaaagta
17881 ggcatacttt gcataatgtc tgatagagac ctttatgaca agttgcaatt tacaagtctt
17941 gaaattccac gtaggaatgt ggcaacttta caagctgaaa atgtaacagg actctttaaa
18001 gattgtagta aggtaatcac tgggttacat cctacacagg cacctacaca cctcagtgtt
18061 gacactaaat tcaaaactga aggtttatgt gttgacgtac ctggcatacc taaggacatg
18121 acctatagaa gactcatctc tatgatgggt tttaaaatga attatcaagt taatggttac
18181 cctaacatgt ttatcacccg cgaagaagct ataagacatg tacgtgcatg gattggcttc
18241 gatgtcgagg ggtgtcatgc tactagagaa gctgttggta ccaatttacc tttacagcta
18301 ggtttttcta caggtgttaa cctagttgct gtacctacag gttatgttga tacacctaat
18361 aatacagatt tttccagagt tagtgctaaa ccaccgcctg gagatcaatt taaacacctc
18421 ataccactta tgtacaaagg acttccttgg aatgtagtgc gtataaagat tgtacaaatg
18481 ttaagtgaca cacttaaaaa tctctctgac agagtcgtat ttgtcttatg ggcacatggc
18541 tttgagttga catctatgaa gtattttgtg aaaataggac ctgagcgcac ctgttgtcta
18601 tgtgatagac gtgccacatg cttttccact gcttcagaca cttatgcctg ttggcatcat
18661 tctattggat ttgattacgt ctataatccg tttatgattg atgttcaaca atggggtttt
18721 acaggtaacc tacaaagcaa ccatgatctg tattgtcaag tccatggtaa tgcacatgta
18781 gctagttgtg atgcaatcat gactaggtgt ctagctgtcc acgagtgctt tgttaagcgt
18841 gttgactgga ctattgaata tcctataatt ggtgatgaac tgaagattaa tgcggcttgt
18901 agaaaggttc aacacatggt tgttaaagct gcattattag cagacaaatt cccagttctt
18961 cacgacattg gtaaccctaa agctattaag tgtgtacctc aagctgatgt agaatggaag
19021 ttctatgatg cacagccttg tagtgacaaa gcttataaaa tagaagaatt attctattct
19081 tatgccacac attctgacaa attcacagat ggtgtatgcc tattttggaa ttgcaatgtc
19141 gatagatatc ctgctaattc cattgtttgt agatttgaca ctagagtgct atctaacttt
19201 aacttgcctg gttgtgatgg tggc
[gap 266 bp] Expand Ns
19491 tagcttgtgg
19501 gtttacaaac aatttgatac ttataacctc tggaacactt ttacaagact tcagagttta
19561 gaaaatgtgg cttttaatgt tgtaaataag ggacactttg atggacaaca gggtgaagta
19621 ccagtttcta tcattaataa cactgtttac acaaaagttg atggtgttga tgtagaattg
19681 tttgaaaata aaacaacatt acctgttaat gtagcatttg agctttgggc taagcgcaac
19741 attaaaccag taccagaggt gaaaatactc aataatttgg gtgtggacat tgctgctaat
19801 actgtgatct gggactacaa aagagatgct ccagcacata tatctactat tggtgtttgt
19861 tctatgactg acntagccan gnaaccanct gaaacgattt gtgcaccact cactgtcttt
19921 tttgatggta gagttgatgg tcaagtagac ttatttagaa atgcccgtaa tggtgttctt
19981 attacagaag gtagtgttaa aggtttacaa ccatctgtag gtcccaaaca agctagtctt
20041 aatggagtca cattaattgg agaagccgta aaaacacagt tcaattatta taagaaagtt
20101 gatggtgttg tccaacaatt acctgaaact tactttactc agagtagaaa tttacaagaa
20161 tttaaaccca ggagtcaaat ggaaattgat ttcttagaat tagctatgga tgaattcatt
20221 gaacggtata aattagaagg ctatgccttc gaacatatcg tttatggaga ttttagtcat
20281 agtcagttag gtggtttaca tctactgatt ggactagcta aacgttttaa ggaatcacct
20341 tttgaattag aagattttat tcctatggac agtacagtta aaaactattt cataacagat
20401 gcgcaaacag gttcatctaa gtgtgtgtgt tctgttattg atttattact tgatgatttt
20461 gttgaaataa taaaatccca agatttatct gtagtttcta aggttgtcaa agtgactatt
20521 gactatacag aaatttcatt tatgctttgg tgtaaagatg gccatgtaga aacattttac
20581 ccaaaattac aatctagtca agcgtggcaa ccgggtgttg ctatgcctaa tctttacaaa
20641 atgcaaagaa tgctattaga aaagtgtgac cttcaaaatt atggtgatag tgcaacatta
20701 cctaaaggca taatgatgaa tgtcgcaaaa tatactcaac tgtgtcaata tttaaacaca
20761 ttaacattag ctgtacccta taatatgaga gttatacatt ttggtgctgg ttctgataaa
20821 ggagttgcac caggtacagc tgttttaaga cagtggttgc ctacgggtac gctgcttgtc
20881 gattcagatc ttaatgactt tgtctctgat gcagattcaa ctttgattgg tgattgtgca
20941 actgtacata cagctaataa atgggatctc attattagtg atatgtacga ccctaagact
21001 aaaaatgtta caaaagaaaa tgactctaaa gagggttttt tcacttacat ttgtgggttt
21061 atacaacaaa agctagctct tggaggttcc gtggct
[gap 202 bp] Expand Ns
21299 ca
21301 attcagttgt cttcctattc tttatttgac atgagtaaat ttccccttaa attaaggggt
21361 actgctgtta tgtctttaaa agaaggtcaa atcaatgata tgattttatc tcttcttagt
21421 aaaggtagac ttataattag agaaaacaac agagttgtta tttctagtga tgttcttgtt
21481 aacaactaaa cgaacaatgt ttgtttttct tgttttattg ccactagtct ctagtcagtg
21541 tgttaatctt acaaccagaa ctcaattacc ccctgcatac actaattctt tcacacgtgg
21601 tgtttattac cctgacaaag ttttcagatc ctcagtttta cattcaactc aggacttgtt
21661 cttacctttc ttttccaatg ttacttggtt ccatgttatc tctgggacca atggtactaa
21721 gaggtttgat aaccctgtcc taccatttaa tgatggtgtt tattttgctt ccattgagaa
21781 gtctaacata ataagaggct ggatttttgg tactacttta gattcgaaga cccagtccct
21841 acttattgtt aataacgcta ctaatgttgt tattaaagtc tgtgaatttc aattttgtaa
21901 tgatccattt ttggaccaca aaaacaacaa aagttggatg gaaagtgagt tcagagttta
21961 ttctagtgcg aataattgca cttttgaata tgtctctcag ccttttctta tggaccttga
22021 aggaaaacag ggtaatttca aaaatcttag ggaatttgtg tttaagaata ttgatggtta
22081 ttttaaaata tattctaagc acacgcctat tatagtgcgt gagccagaag atctccctca
22141 gggtttttcg gctttagaac cattggtaga tttgccaata ggtattaaca tcactaggtt
22201 tcaaacttta cttgctttac atagaagtta tttgactcct ggtgattctt cttcaggttg
22261 gacagctggt gctgcagctt attatgtggg ttatcttcaa cctaggactt ttctattaaa
22321 atataatgaa aatggaacca ttacagatgc tgtagactgt gcacttgacc ctctctcaga
22381 aacaaagtgt acgttgaaat ccttcactgt agaaaaagga atctatcaaa cttctaactt
22441 tagagtccaa ccaacagaat ctattgttag atttcctaat attacaaact tgtgcccttt
22501 tgatgaagtt tttaacgcca ccagatttgc atctgtttat gcttggaaca ggaagagaat
22561 cagcaactgt gttgctgatt attctgtcct atataatttc gcaccatttt tcgcttttaa
22621 gtgttatgga gtgtctccta ctaaattaaa tgatctctgc tttactaatg tctatgcaga
22681 ttcatttgta attagaggta atgaagtcag ccaaatcgct ccagggcaaa ctggaaatat
22741 tgctgattat aattataaat taccagatga ttttacaggc tgcgttatag cttggaattc
22801 taacaagctt gattctaagg ttggtggtaa ttataattac ctgtatagat tgtttaggaa
22861 gtctaatctc aaaccttttg agagagatat ttcaactgaa atctatcagg ccggtaacaa
22921 accttgtaat ggtgttgcag gttttaattg ttactttcct ttacgatcat atggtttccg
22981 acccacttat ggtgttggtc accaaccata cagagtagta gtactttctt ttgaacttct
23041 acatgcacca gcaactgttt gtggacctaa aaagtctact aatttggtta aaaacaaatg
23101 tgtcaatttc aacttcaatg gtttaaaagg cacaggtgtt cttactgagt ctaacaaaaa
23161 gtttctgcct ttccaacaat ttggcagaga cattgctgac actactgatg ctgtccgtga
23221 tccacagaca cttgagattc ttgacattac accatgttct tttggtggtg tcagtgttat
23281 aacaccagga acaaatactt ctaaccaggt tgctgttctt tatcagggtg ttaactgcac
23341 agaagtccct gttgctattc atgcagatca acttactcct acttggcgtg tttattctac
23401 aggttctaat gtttttcaaa cacgtgcagg ctgtttaata ggggctgaat atgtcaacaa
23461 ctcatatgag tgtgacatac ccattggtgc aggtatatgc gctagttatc agactcagac
23521 taagtctcat cggcgggcac gtagtgtagc tagtcaatcc atcattgcct acactatgtc
23581 acttggtgca gaaaattcag ttgcttactc taataactct attgccatac ccacaaattt
23641 tactattagt gttaccacag aaattctacc agtgtctatg accaagacat cagtagattg
23701 tacaatgtac atttgtggtg attcaactga atgcagcaat cttttgttgc aatatggcag
23761 tttttgtaca caattaaaac gtgctttaac tggaatagct gttgaacaag acaaaaacac
23821 ccaagaagtt tttgcacaag tcaaacaaat ttacaaaaca ccaccaatta aatattttgg
23881 tggttttaat ttttcacaaa tattaccaga tccatcaaaa ccaagcaaga ggtcatttat
23941 tgaagatcta cttttcaaca aagtgacact tgcagatgct ggcttcatca aacaatatgg
24001 tgattgcctt ggtgatattg ctgctagaga cctcatttgt gcacaaaagt ttaaaggcct
24061 tactgttttg ccacctttgc tcacagatga aatgattgct caatacactt ctgcactgtt
24121 agcgggtaca atcacttctg gttggacctt tggtgcaggt gctgcattac aaataccatt
24181 tgctatgcaa atggcttata ggtttaatgg tattggagtt acacagaatg ttctctatga
24241 gaaccaaaaa ttgattgcca accaatttaa tagtgctatt ggcaaaattc aagactcact
24301 ttcttccaca gcaagtgcac ttggaaaact tcaagatgtg gtcaaccata atgcacaagc
24361 tttaaacacg cttgttaaac aacttagctc caaatttggt gcaatttcaa gtgttttaaa
24421 tgatatcttt tcacgtcttg acaaagttga ggctgaagtg caaattgata ggttgatcac
24481 aggcagactt caaagtttgc agacatatgt gactcaacaa ttaattagag ctgcagaaat
24541 cagagcttct gctaatcttg ctgctactaa aatgtcagag tgtgtacttg gacaatcaaa
24601 aagagttgat ttttgtggaa agggctatca tcttatgtcc ttccctcagt cagcacctca
24661 tggtgtagtc ttcttgcatg tgacttatgt ccctgcacaa gaaaagaact tcacaactgc
24721 tcctgccatt tgtcatgatg gaaaagcaca ctttcctcgt gaaggtgtct ttgtttcaaa
24781 tggcacacac tggtttgtaa cacaaaggaa tttttatgaa ccacaaatca ttactacaga
24841 caacacattt gtgtctggta actgtgatgt tgtaatagga attgtcaaca acacagttta
24901 tgatcctttg caacctgaat tagattcatt caaggaggag ttagataaat attttaagaa
24961 tcatacatca ccagatgttg atttaggtga catctctggc attaatgctt cagttgtaaa
25021 cattcaaaaa gaaattgacc gcctcaatga ggttgccaag aatttaaatg aatctctcat
25081 cgatctccaa gaacttggaa agtatgagca gtatataaaa tggccatggt acatttggct
25141 aggttttata gctggcttga ttgccatagt aatggtgaca attatgcttt gctgtatgac
25201 cagttgctgt agttgtctca agggctgttg ttcttgtgga tcctgctgca aatttgatga
25261 agacgactct gagccagtgc tcaaaggagt caaattacat tacacataaa cgaacttatg
25321 gatttgttta tgagaatctt cacaattgga actgtaactt tgaagcaagg tgaaatcaag
25381 gatgctactc cttcagattt tgttcgcgct actgcaacga taccgataca agcctcactc
25441 cctttcggat ggcttattgt tggcgttgca cttcttgctg tttttcagag cgcttccaaa
25501 atcataactc tcaaaaagag atggcaacta gcactctcca agggtgttca ctttgtttgc
25561 aacttgctgt tgttgtttgt aacagtttac tcacaccttt tgctcgttgc tgctggcctt
25621 gaagcccctt ttctctatct ttatgcttta gtctacttct tgcagagtat aaactttgta
25681 agaataataa tgaggctttt gctttgctgg aaatgccgtt ccaaaaaccc attactttat
25741 gatgccaact attttctttg ctggcatact aattgttacg actattgtat accttacaat
25801 agtgtaactt cttcaattgt cattacttca ggtgatggca caacaagtcc tatttctgaa
25861 catgactacc agattggtgg ttatactgaa aaatgggaat ctggagtaaa agactgtgtt
25921 gtattacaca gttacttcac ttcagactat taccagctgt actcaactca attgagtaca
25981 gacactggtg ttgaacatgt taccttcttc atctacaata aaattgttga tgagcctgaa
26041 gaacatgtcc aaattcacac aatcgacggt tcatccggag ttgttaatcc agtaatggaa
26101 ccaatttatg atgaaccgac gacgactact agcgtgcctt tgtaagcaca agctgatgag
26161 tacgaactta tgtactcatt cgtttcggaa gagataggta cgttaatagt taatagcgta
26221 cttctttttc ttgctttcgt ggtattcttg ctagttacac tagccatcct tactgcgctt
26281 cgattgtgtg cgtactgctg caatattgtt aacgtgagtc ttgtaaaacc ttctttttac
26341 gtttactctc gtgttaaaaa tctgaattct tctagagttc ctgatcttct ggtctaaacg
26401 aactaaatat tatattagtt tttctgtttg gaactttaat tttagccatg gcaggttcca
26461 acggtactat taccgttgaa gagcttaaaa agctccttga agaatggaac ctagtaatag
26521 gtttcctatt ccttacatgg atttgtcttc tacaatttgc ctatgccaac aggaataggt
26581 ttttgtatat aattaagtta attttcctct ggctgttatg gccagtaact ttaacttgtt
26641 ttgtgcttgc tgctgtttac agaataaatt ggatcaccgg tggaattgct atcgcaatgg
26701 cttgtcttgt aggcttgatg tggctcagct acttcattgc ttctttcaga ctgtttgcgc
26761 gtacgcgttc catgtggtca ttcaatccag aaactaacat tcttctcaac gtgccactcc
26821 atggcactat tctgaccaga ccgcttctag aaagtgaact cgtaatcgga gctgtgatcc
26881 ttcgtggaca tcttcgtatt gctggacacc atctaggacg ctgtgacatc aaggacctgc
26941 ctaaagaaat cactgttgct acatcacgaa cgctttctta ttacaaattg ggagcttcgc
27001 agcgtgtagc aggtgactca ggttttgctg catacagtcg ctacaggatt ggcaactata
27061 aattaaacac agaccattcc agtagcagtg acaatattgc tttgcttgta cagtaagtga
27121 caacagatgt ttcatctcgt tgactttcag gttactatag cagagatatt actaattatt
27181 atgcggactt ttaaagtttc catttggaat cttgattaca tcataaacct cataattaaa
27241 aatttatcta agtcactaac tgagaataaa tattctcaat tagatgaaga gcaaccaatg
27301 gagattgatt aaacgaacat gaaaattatt cttttcttgg cactgataac actcgctact
27361 tgtgagcttt atcactacca agagtgtgtt agaggtacaa cagtactttt aaaagaacct
27421 tgctcttctg gaacatacga gggcaattca ccatttcatc ctctagctga taacaannnn
27481 gnnntgactt gctttngcac tcaanttgcn nttgcttnnn nnnnnnnnnn nnnnnnnnnn
27541 nnnnnnnnnn nnnnnnnnnn nnnnnnacnn aanctgttca tcagacaaga ggaagttcaa
27601 gnactttact ctccaatttt tcttattgtt gcggcaannn ngnnnnnnnn nnnnnnnnnn
27661 nnnnnnnnnn nnnnnnnnnn nnnnntgaac tttcattaat tgacttctat ttgtgctttt
27721 tagcctttct gttattcctt gtnttaatta
[gap 263 bp] Expand Ns
28014 ggttcta
28021 aatcacccat tcagtacatc gatatcggta attatacagt ttcctgttta ccttttacaa
28081 ttaattgcca ggaacctaaa ttgggtagtc ttgtagtgcg ttgttcgttc tatgaagact
28141 ttttagagta tcatgacgtt cgtgttgttt tagatttcat ctaaacgaac aaacttaaat
28201 gtctgataat ggaccccaaa atcagcgaaa tgcactccgc attacgtttg gtggaccctc
28261 agattcaact ggcagtaacc agaatggtgg ggcgcgatca aaacaacgtc ggccccaagg
28321 tttacccaat aatactgcgt cttggttcac cgctctcact caacatggca aggaagacct
28381 taaattccct cgaggacaag gcgttccaat taacaccaat agcagtccag atgaccaaat
28441 tggctactac cgaagagcta ccagacgaat tcgtggtggt gacggtaaaa tgaaagatct
28501 cagtccaaga tggtatttct actacctagg aactgggcca gaagctggac ttccctatgg
28561 tgctaacaaa gacggcatca tatgggttgc aactgaggga gccttgaata caccaaaaga
28621 tcacattggc acccgcaatc ctgctaacaa tgctgcaatc gtgctacaac ttcctcaagg
28681 aacaacattg ccaaaaggct tctacgcaga agggagcaga ggcggcagtc aagcctcttc
28741 tcgttcctca tcacgtagtc gcaacagttc aagaaattca actccaggca gcagtaaacg
28801 aacttctcct gctagaatgg ctggcaatgg cggtgatgct gctcttgctt tgctgctgct
28861 tgacagattg aaccagcttg agagcaaaat gtctggtaaa ggccaacaac aacaaggcca
28921 aactgtcact aagaaatctg ctgctgaggc ttctaagaag cctcggcaaa aacgtactgc
28981 cactaaagca tacaatgtaa cacaagcttt cggcagacgt ggtccagaac aaacccaagg
29041 aaattttggg gaccaggaac taatcagaca aggaactgat tacaaacatt ggccgcaaat
29101 tgcacaattt gcccccagcg cttcagcgtt cttcggaatg tcgcgcattg gcatggaagt
29161 cacaccttcg ggaacgtggt tgacctacac aggtgccatc aaattggatg acaaagatcc
29221 aaatttcaaa gatcaagtca ttttgctgaa taagcatatt gacgcataca aaacattccc
29281 accaacagag cctaaaaagg acaaaaagaa gaaggctgat gaaactcaag ccttaccgca
29341 gagacagaag aaacagcaan ctgtgactct tcttcctgct gcagatttgg atgatttctc
29401 caaacaattg caacaatcca tgagcagtgc tgactcaact caggcctaaa ctcatgcaga
29461 ccacacaagg cagatgggct atataaacgt tttcgctttt ccgtttacga tatatagtct
29521 actcttgtgc agaatgaatt ctcgtaacta catagcacaa gtagatgtag ttaactttaa
29581 tctca
//