Monarch geneset OGS2.0

DPOGS202689
Transcript: DPOGS202689-TA (2103 bp)
Protein: DPOGS202689-PA (700 aa)
Genomic position: DPSCF300324 - 112552-118187
RNAseq coverage: 41x (Rank: top 72%)
Annotation
Source           Hit               E-value  Identity  Description
Heliconius       HMEL012365        0.0      78.97%
Bombyx           BGIBMGA004855-TA  4e-127   74.41%
Drosophila       CG34110-PC        9e-70    30.15%
EBI UniRef50     UniRef50_D6WPE1   1e-116   36.60%    Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WPE1_TRICA
NCBI RefSeq      XP_969446.1       2e-117   36.60%    PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastp   gi|91093985       4e-116   36.60%    PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastx   gi|260831294      1e-121   35.70%    hypothetical protein BRAFLDRAFT_275841 [Branchiostoma floridae]
Group
KEGG pathway:
Orthology group: MCL13497 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS202689-TA
ATGTCTCCGTACACGGCCATAATCAGACGCAGCGGTAATCCCTTTGAGTTGGCACACGTTCTGGTCTCCTGGCTGATTGGAGCTGGGTATGACGCGTATGTGGTGGTGGGGAACGCGTCCCGTGACGTTTGCATGGCGATACGATATCGCACGGTTTGTCCAGAACTACCCGATGAGACTGAGGTAATCGACGAACCTCCTCCTCCAGAGGAAGAACCTCGCTACCGCTTGGTGCCGCTGCCGGACCTCACTTCCAAATATTGCAAAGAAATGGATCGTAAGGAAGAGGAGAGGATACAGGGCGAACAAGACAAGATAGAAAAAGAAAGATTGAGAAAAATTGCGGAGCTGGAGAAACCTCCGCCTGATGATATTCATAGCTGGCGGACTCATGCCTGGGTCCTCGTGCTCGGTGGCTTTCGAGGAGTTGAGGAACCCTTCTTCATAGAGCCCAGTGATGGAAACCGCTTCCCGCTGGATGCTGAACAGTACCAATACATTGACAGCGTTTATAATCACGAGAATTATTACGTAAATCTGCAATCTTGTGATGAAGGTCTAGGTTCCTTAAACTATGATCTATCTGATCTCACGTGTTGGGAACATCTGCTGGCTGGTGAGCCTTATTACAGGAGGCAACTCGTCGGCATTGACTGTTCTGATAAAAGAACCGCCGTTGACACTGAGAAACACCTCGACGTGCCCACCAGTTGGGTTGAGAAGCTGGACATAACCGCTGATGAGTATGAACAAAGATATCCCGGGAGCCACAAAGTGATACATTATAAGAAAGTTTTGTTAGAAAAGTTCTCTCCATATTCACAGAAAGATGGCATCATCAAAAGAATAAAGATATTTGAAGATTACGCTCTAACTTCACCACTTTTGACATATGAATGGTACAAGAACAGAGCGGACAAAATGGACACGGTCAAAATAGATCACGTGAAGAAAGAGATTCGTGAAAATTTTCTGATCGGCAGAAAAGACCATCTGTTGAAGCACGTATACGCTATCGACGCGCCAGCCACGTCTGTGGAAGGCACTCGCATGATGCAGTTCAACTATTATGCAAGACTGGACCACCTCGAGCGTCTTGAGTGCGATGCCTTAAACTTTAACGAATATTACACGGATAGAGGGGACAGACTTGAAACAAGATTTATAACATACACAGAAGGCACAAAACTGGAGCCGACACGGCAGGTTAAGGATATAACCGAGACTTACAGCCGTAATCCTGATGTACCAGCCAAAGACGACATCTGGAAAAGAATCTTCCACATTCAAAACAACACCATAGAACTGCTCTACCACTACGCATATAATTTTGTGACGAACAACACACGAAGCTTCATTAAACCAAATCTAGCTGAGACCGGCGGGAAAATACTATTCTATCCCGACAAGACTTCCGGTTACATTGCTGATGCATGCGCAAACCAACCTCGTCCGCTTCACGTGTACTACGCTCTATGCGACAACATGGAGTCGGAACACCGGTCCCGTAAACACATCAGAGACCGCGAGTCAGATGTCACGGAATACTTAAAACAGAGACTAAAAGAACTCACTGAACCCGTGCTGTTTGTGTCGCTGTTTGATACGGAGAGAAATGACGCCGCTAAGAAAGGTTGGAGGGAACAGGAAACGCAAAAAGCGGAAGTAACGGAACGCGAGAAGGAAGCTGAGATAGATCCTCTAGCTCCATATCTTGCTCGGATGTTTGGATCGGGTCGAGGACGAGGGTTGTTGTCCATCAAGGAGGCAGCTCTGGTCAGGGACCAGTGCATCAACGACTTCCGGGCCAAGCAACTCGCGAGACAGAACCTCGTGCAAGAGAGATTTGATAAGTTGAACGCCGAATATAAGAATAAAAGACTTTGGTATTTGGCGAACCAGTTTATTCTGACACCTGAAAAGGAGAGCGCTTATTTCGCTGCGAGCGCGGAACTGGTATTCCGAGTTCATACTTTGGAAGTGCGTCTGACAAGACATAA
GGACCTATCGGCTCCGAGATTCAAAGTTCTTGAAGATTATTTAGATAAGCATCCCTTGCTCAAGGAGTACAATCGTGTGCGGCATCATTACAAAATTAAATAG

Protein sequence:

>DPOGS202689-PA
MSPYTAIIRRSGNPFELAHVLVSWLIGAGYDAYVVVGNASRDVCMAIRYRTVCPELPDETEVIDEPPPPEEEPRYRLVPLPDLTSKYCKEMDRKEEERIQGEQDKIEKERLRKIAELEKPPPDDIHSWRTHAWVLVLGGFRGVEEPFFIEPSDGNRFPLDAEQYQYIDSVYNHENYYVNLQSCDEGLGSLNYDLSDLTCWEHLLAGEPYYRRQLVGIDCSDKRTAVDTEKHLDVPTSWVEKLDITADEYEQRYPGSHKVIHYKKVLLEKFSPYSQKDGIIKRIKIFEDYALTSPLLTYEWYKNRADKMDTVKIDHVKKEIRENFLIGRKDHLLKHVYAIDAPATSVEGTRMMQFNYYARLDHLERLECDALNFNEYYTDRGDRLETRFITYTEGTKLEPTRQVKDITETYSRNPDVPAKDDIWKRIFHIQNNTIELLYHYAYNFVTNNTRSFIKPNLAETGGKILFYPDKTSGYIADACANQPRPLHVYYALCDNMESEHRSRKHIRDRESDVTEYLKQRLKELTEPVLFVSLFDTERNDAAKKGWREQETQKAEVTEREKEAEIDPLAPYLARMFGSGRGRGLLSIKEAALVRDQCINDFRAKQLARQNLVQERFDKLNAEYKNKRLWYLANQFILTPEKESAYFAASAELVFRVHTLEVRLTRHKDLSAPRFKVLEDYLDKHPLLKEYNRVRHHYKIK-
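
The two sequences above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the transcript is a complete coding sequence with no UTRs, that the standard genetic code applies, and that the trailing "-" in the protein marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stop codons are written as stop_symbol.

    Whitespace (including FASTA line breaks) is ignored. Ambiguous bases
    such as N are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First six and last four codons of DPOGS202689-TA as listed above.
    print(translate("ATGTCTCCGTACACGGCC"))   # MSPYTA
    print(translate("AAAATTAAATAG"))         # KIK-
    print(lengths_consistent(2103, 700))     # True
```

Under these assumptions, the reported lengths agree: 700 codons plus one stop codon give 2103 bp. Passing the full nucleotide sequence (both lines joined) to `translate` should reproduce the listed protein sequence if the assumptions hold.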