Monarch geneset OGS2.0

DPOGS202106
Transcript: DPOGS202106-TA (2367 bp)
Protein: DPOGS202106-PA (788 aa)
Genomic position: DPSCF300150 - 335732-342796
RNAseq coverage: 1348x (Rank: top 9%)
Annotation
Heliconius       HMEL014600        0.0     85.89%
Bombyx           BGIBMGA006898-TA  0.0     80.02%
Drosophila       CG1962-PB         7e-120  39.23%
EBI UniRef50     UniRef50_D6X2L7   2e-168  48.42%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X2L7_TRICA
NCBI RefSeq      XP_969297.1       5e-169  48.42%  PREDICTED: similar to CG1962 CG1962-PA [Tribolium castaneum]
NCBI nr blastp   gi|91093748       9e-168  48.42%  PREDICTED: similar to CG1962 CG1962-PA [Tribolium castaneum]
NCBI nr blastx   gi|91093748       3e-168  47.98%  PREDICTED: similar to CG1962 CG1962-PA [Tribolium castaneum]
Group
KEGG pathway: (none listed)
Orthology group: MCL15053 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202106-TA
ATGACTGAGGAACATGAATGCGAGTCTTTCAATTCGTGGGAACTGAACGGACTGAATTCCCTGGATTTATGGGATTATACCGTGGAACTTGAGTGTCTGCAAGGCACCGAAGATCTGCAGCTCGCGGCTGAACTAGGGAAGACTTTGTTGGAACGAAATAAGGAGTTGGAGACGGCTCTTCGTCAACACCAAAATGTAATAGAAGATCAAACGCAGGAAATCGAATACCTTACAAAGCAAACTGTTGCGTTGCGCGAAGTAAACGACTCAAGACTGAGAATCTATGAACAACTTGAAGTTAGCATCCAAGATTTAGAACGAGCTAACCACAGGCTCGCCGTTGATCATGCTGCCGATAAAAAACACATTAAAACCTTATGCAACACCATCGAAACCTTGGAAACCAAGTGTGAAGAATTCCAAAAAACCGTGGACGATCTTAACGCGCAATTAGAAATCGTAAAACGGCGTGCAGAACGTAAGCCAGAAACCATTGAAAAACCCAAAGAAAAAGAACAAAAAACGGTTGAAAATGCAAAACAGAATACAACTCCGTCAAAACTTCCAACTGTTACCCCACAAAAAATTGCTCCGCTCCCAGAACTGACAAAAGAGGATGAGGATTTGTTGAGATTAAGTGATGAGTTAAGGGAAAATAAAGTGGCCTTTGCCCAAGAACACAGGAGGGTCACAGAGCTTGAGGAACAGCTGGCATCGGTTATACAAGAGAACAACAGGCTGCAGGAGCAGTTAAGAAACTGGAACCTTAAGGATGATGAGCCTAAATCAATGCATGAAGAATTCTCGATCCTTGAAGAAGTTAGACAAGGACAACTCTGCATCAGGTGTCTGAGAGGTATGGAGCGGGATGACATGTCGTCTATCCTGGATGGAGAAGACGATGATAGAAGCGCCATCAGTTCACTCAATCTTACTCCCGGTCAACAAAGCCCAAGAGACGACCATGTGTCCAGTAAACTCGTCAAGTTTGATAATGCTAAGGAGAAATTACTCCAAGGTATTTGGGCGAATAAGGAAGATGGCCACGACAATCCGTACAGAGACTTAGTACAGAAGTATGAGGCTCTGCTTGAGGTACAACTATCACAGGGCAAGAATATAAAGAAGAAACCAAACACATTGCCGAACGCACAAACTCAGTCTGGCCCCGTGTCTCTTCAAGACGAATTGCAAACTTCGGGCGATTTCAGCCAATTCAGTGTTAAAGATACCGACGAAGAAAGCGGGCATGGTGAGGAGGCCCAAAAAGATAATCAACAAAAGAAAGTTGAGCCAACATCTCGCAAGAAGATAATTCAAACCCCAGACTTTTCTGAAGCCGAAACATCAAGCTCAGGCTTCTCAGATGAAACTAGTAATAAAGGAACTCAAACTGAACGCGAAAGACCTGGTTCCTTTCTATGCACTATCGCGGACGGAGAAGATTATCGCTTTAGCATTTACGACGATGCTAGTCCGATGGATAGTCGCTTCCGAAACCGACCAGAATACCGTGTTCTTTTCAAAGAAATATTTACTATTCTGAAAAAGGCAGCCGAAAATAAAGACGATGGAGAACAGCTACCACTCCTAGATGATACAGTCAGCGGAAAAGTACCACCTGTGACGCCTGCCACTGAAGAACCCCCCGGCAACTTTACCGATGACACCCAAAGTGTATTGTCGTCTGTGATGTCCGAACAATCGATTCCAGTATCTGACATAACTGCTCCAGAAACACCAACTCTAAAGGAAAAGGAAGAGCCACTTCCGGAACCTAACAAAAAACAAGAGAAACAAAATCACGTAGAAGATGTAAAGGAGAACAAGCCCATGGATGATAGTACCAACGGAAATCAAAAAGAAACTAATCAAGGAAAAGAAAAGGAAAAGGAAAAGAAAGAAAAAGAACGAGTACTAACACCGCTGGTGCGTCAACCATTGGAGTACATAGCGGCTACTAGAAAGAAGTCCAGACATCGCAACCGCAAGCACAGTCA
AGATCGCCAGGGAGCTGACTCCCCCGTGTTTCCATCTCCACCTAAAATTATATACCAGAAATCGGCAAACAAGAAGAGGAGAGATTACAGGCCCATAGAAATCAGCCCACTCGTTAGAACTTCAGAAGCTGAATGGAATGGATCTACTCTTCAGTTCTACAACAAAAATATAAGTTCCCCAACTCCGAGCGTTAGCGGCCGGACGGGAAAAATATATCAAAGCTGGAATTCTGAAACTGACTCTTGGGATATAAAACAAAGCACTGCTTCTCAGGAGATACATAAGCTCCGAAAATTGGAGCTGTCCTATGCTGAAGTATTAAGAAATGCTGATAAGACTAAAAATAGGCGCAAGAAACATCAGTAA

Protein sequence:

>DPOGS202106-PA
MTEEHECESFNSWELNGLNSLDLWDYTVELECLQGTEDLQLAAELGKTLLERNKELETALRQHQNVIEDQTQEIEYLTKQTVALREVNDSRLRIYEQLEVSIQDLERANHRLAVDHAADKKHIKTLCNTIETLETKCEEFQKTVDDLNAQLEIVKRRAERKPETIEKPKEKEQKTVENAKQNTTPSKLPTVTPQKIAPLPELTKEDEDLLRLSDELRENKVAFAQEHRRVTELEEQLASVIQENNRLQEQLRNWNLKDDEPKSMHEEFSILEEVRQGQLCIRCLRGMERDDMSSILDGEDDDRSAISSLNLTPGQQSPRDDHVSSKLVKFDNAKEKLLQGIWANKEDGHDNPYRDLVQKYEALLEVQLSQGKNIKKKPNTLPNAQTQSGPVSLQDELQTSGDFSQFSVKDTDEESGHGEEAQKDNQQKKVEPTSRKKIIQTPDFSEAETSSSGFSDETSNKGTQTERERPGSFLCTIADGEDYRFSIYDDASPMDSRFRNRPEYRVLFKEIFTILKKAAENKDDGEQLPLLDDTVSGKVPPVTPATEEPPGNFTDDTQSVLSSVMSEQSIPVSDITAPETPTLKEKEEPLPEPNKKQEKQNHVEDVKENKPMDDSTNGNQKETNQGKEKEKEKKEKERVLTPLVRQPLEYIAATRKKSRHRNRKHSQDRQGADSPVFPSPPKIIYQKSANKKRRDYRPIEISPLVRTSEAEWNGSTLQFYNKNISSPTPSVSGRTGKIYQSWNSETDSWDIKQSTASQEIHKLRKLELSYAEVLRNADKTKNRRKKHQ-
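
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein lengths above are related by 2367 = 3 x (788 + 1), that is, 788 codons plus one stop codon. The Python sketch below shows one way to check that a coding sequence translates to the listed protein. It assumes the standard nuclear genetic code and writes the stop codon as "-", as this record does. The names CODON_TABLE and translate are hypothetical helpers introduced here for illustration; only the first and last few codons of DPOGS202106-TA are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (assumption: the standard nuclear code applies to this transcript).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" matches this record).
    Raises ValueError if the length is not a multiple of three.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last four codons of DPOGS202106-TA.
print(translate("ATGACTGAGGAACATGAATGCGAG"))  # MTEEHECE
print(translate("AAACATCAGTAA"))              # KHQ-

# Length relation stated in the record: 788 residues plus one stop codon.
print(3 * (788 + 1))                          # 2367
```

To check the full record, pass the complete nucleotide sequence (with line breaks removed) to translate and compare the result with the DPOGS202106-PA sequence.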