Monarch geneset OGS2.0

DPOGS207016
TranscriptDPOGS207016-TA5163 bp
ProteinDPOGS207016-PA1720 aa
Genomic positionDPSCF300001 + 1340728-1347917
RNAseq coverage110x (Rank: top 59%)
Annotation
HeliconiusHMEL0106230.055.55% 
BombyxBGIBMGA012940-TA0.055.26% 
Drosophila% 
EBI UniRef50UniRef50_UPI00020622441e-1035.71%UPI0002062244 related cluster n=1 Tax=unknown RepID=UPI0002062244
NCBI RefSeqXP_001948227.13e-1135.71%PREDICTED: similar to LD15043p [Acyrthosiphon pisum]
NCBI nr blastpgi|3287169484e-1035.71%PREDICTED: hypothetical protein LOC100168098 [Acyrthosiphon pisum]
NCBI nr blastxgi|1544187732e-1919.96%viral A-type inclusion protein [Trichomonas vaginalis G3]
Group
KEGG pathway 
Orthology groupMCL26021 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207016-TA
ATGGAAACAGATGAGACGGATACAAAAAGTGGATTTGATTACAACATTGAAGATTCATATTCTGCGATAAATAGTTCAATTTTGGAATATTACAAAAAGTTTGGTAGAAAAAGAGACTTAGAGCAATTTTTTTCCTTATCGACAACACAAAGTGATATAAAAGATCCTTCTAGTATATTTTGGCGGAGAATGAAGTCACAGTATGATTCCTCTGACTCGGGAGATAATAAGAAAAGTGAATCTTCGACGGAACTATGTAGGATTTCTATCAAGTGTTCAATTCCTGAACCTGGTTCTTCCAAGGAAGATAATGCGAGGACAAAGATTGATTCAGTGTCCCCTCCCATCATAAGTGAAGAAGCACCGACACATCAATGTCAGAGTTCTGACAATGACTCTGATAAATCAAATGATGTTCAGTCTCAGAAATCCCATGATTGCACCCTTGATACCTCAATGAATAAACCCATTTCACCGACAAGTAGTGTAACATCGCAGCGAAGGCTTGAATGGGACTCCTTAGCTGATGTTGGCTATGCTAATGAAAGTGACAGAAAAAATTCAGCTTCCAGTCTAAGTACTCTTGAGAGGATGGCTTTGCAACAGCAATACTCTAATGAAACAAAACACAATTCAGACTTAGGGCCACCAACTGCACAATCAACCCCAGTTGATGTTAGTGAGGTTAAAATGAAAAGTAAGAAGATGGATTCAAAAAAATCCACAAAGATATATAAAAAGGATGTTGATTTATTTGAACTGAACGTACCACAAATGACCGAATATATGAAGCCTATAAATGTTAATTTAACAAAACATATATCATTTAATGTTGAGAGAGATGGTGGTGTGTCTGTTGAAAATATGACAAAAAGTGTTAGTTTGAGCCCTGAAAAAGTTTCTGTTGAAACTGCCATAACACCACAGGTTAAAATGGATAAAGAAATCCAAACAACCTTGATAAAAACTAAGGACCACAATGCACAGCCTAAACAAAACGAACCTGTTAAAAAGATTCCAGTGTTAATCAGTGTAAATACTTTAAAGAAAAGAGTTAGGAGGAAAAAAGTAAGAAAACTAAGAAGGCAGAATAGTAGAAAACAAAATGTCATTGACAAAGAGAACATTCCAGAAAAACATGCTGAACCAGTTTCCGATGCAGAGAGTTTTGAATATATGCCAGGACATATTTATAATCAGAATCAGTTAAAAATTGACAAGAATACAGTAAATCCATCTGGTAATAAATCTAGCTTGGAGTCAAGTGCAGGTTTGACAACAGATTCTAGCAAAACATCAAAGTATTCATTTACCAAAGACTTAGAAAAGGGAATAGACATGCTTAAGAACACATTAGACTATAAATGTGACGACCCTAATTTAAAGAAGAAACTTATCAGAGATGTTGTTGAAAAATTGATTAAATCAAAATACAGAGATGATGAATCTTCCACTGAATTTCTGTCCGGACTAACTTTTGAAAGCAAAAAGTTAGATTTGTATAATCAACATCACACTACAACAAGTACATCTGAGAATGATACTATGAAAAGGAGTAAATTACTAAAACCCAAGAAGTCCATTTTGCGTATTGATAAATTCAATTCAGGACCTGTAGCTTCCACTTCTCAAAGTGTTCCTAATCTACATACCGTTATCAATCAAGAGAAACCTATTGCACCTAAGATTGGACAAATACAACTATCTCATACTGATTCAGATATTTCAAATAAAAACAAGATCACTTCTGACACAGGTATTGATAAAATATCTTCAGAGCAACTATATCAAAAATATCTTGAGGCTTTACAGAGAGAACAAGCTTACAAAAGACATCTAAAAGATAAGGAAATATTTCTAAAACAGAAGCTAGCAAGTTCTGACATTGCTTTTGATGTTGTGAGACATGCTGAAACTAAAACACAAAACAGAATAAAGGATCTCATGAAAGATTTGATAAGAAATAACTATGATGATGGATCAGGTGATGCAAGTAGATTGGAAGGTGGCTCAGGGTCTAATCTTAATATTGAACAATACAATAGTGCTAGACGGCAAAGAAGTCATTCTGTTTTCACACTTTCATCTGGTACTTCAGACAACCATTACAAGCAAGCAAAATTAACTAAATGCACCCAAGATGCTGATGCCCTAAAGACAAACTTTACTAAAACAGATAACCACTATTGTTGTTGTCCTTATCACAAAGCACATCCAAAAATTGGGGTCGTAGATAGTTCAGTTCAGGTTAACTTAAACTGTTGTAAAGATTATTCTCCTGTAAGAACACAAACCTGTTCCACGTCATGTGCGCCGGTCCATCAATCGGTTCCATACAAATGTGAAAAATGTAATACTATTTACAAAACTCATAAATTACCCAAATGCGTACCCGAAGACAGAGATGAAATAAAATATGTGTGTTTGTGCACAGAGGACACAGTCACAAAAAATGAGATGCCTGAAAACATATTAATTTACAAGTGTTCGCGGCTTAATAGTAAAGGATTAAAAGTGGATACAGTTTTAAAAGCATCTAATGCAGTTAGTGCCCCATCTAGCAATTCGACATCCCCTCGAAAAATATCTACATTCGGAAGTCAAGATTGTGAGAGGAAATGTTTGGTTATAAAGGACAGTAATAGCTCTAATATTCAAACGAAATCATCACAAACTGATCTCAATTTACGACTAGCGTTACGAAATCGAACAAGTGAACAATCAACACAATCATCGACTTCTGAAGACAGAAATGATAAGTTTTCTCTTCATGGTAAAAAACCTAGTATATTTGTCCATGAAGCTACAAGAGTTGTTCAAACTGAAATGAGCATAGATCCGAAGATATCTGATCCATCAATATCTGATATTAATATTGTTAATGACGAATGTTGCGTTCAGTTGATTAGCGAGAGATTTAAAGAAATTACACAAAATTCTTCACGTAATCAGGTCAATAAATGTTTGGAAGGAGCTATACGAATGGAAGAAAGAGAACCCGTAAACATAGTTTATGCAAATACTAAAGAGACTTCGACAAACACAGTTGAAAAACACGATAAAGAAATACAATCAAGCCCAGATGTACCGAACACACCTATAACTGAAGCTAACGACAGACAAGGTGAAAATAATTTCACTATACCTATACAAGGTACGAACATGATGCTTAAAGTGAGCTTGGGCTCTGAAAACAAAGGAATAAACAAGGAATTTGACAGGAACCTGTTGAAAAAGACTGAGGTAGAGTCTACCTCAAAAGGGACTGAAACGCTAAAGAACGACGTTGTTGAAAATTTCACATCATTAAGGGAAGACTGTTCGAAAGCCGTACAATGCAATGATACCAACATATTTAAAAACCTTTGTAAAAAACCTGAACCTTCATGTGGACATTGCAATACATATCAAGAAAAGACCCCAGCAGATATCGTTAGTTCTGAACTAAATAATAAGGCTAATTGTAAACTCAATACAAAACCTACTTGTGAAAAAAAAATTCTTACTGATAATAAATGCAATACTTTCCCGAGAAACGACCGATGCAACGTACAGAAACCACTTCTCCGTTCAAATACTGATACAGGGAAAATGGAAAGATGTTGTCGCGTTACATTCAACAAAAAATCCACCAAAGACGACGGGGTACAACCAGACCCAGAAATTTTAAAAGAGAAGCAACAACCAAAAGGTGTTTGTACAGAAAAATGTCAAGAAAAGTATGATTCCTGCTGTTCCGAAAAATCTTTAAAATCCATTGATGTGAGTTCGAAAAGCAAATTTAGTTCCGATATAGATGAACCGTCTCAATTAAAACCATCCACCAGTGAATCTGATGATAATATAAACAAATGCATATCCAGGGATCCCTTGATAGAAATGATTCAAGACATAACAAAACGATATTCAAAGAAAGATTACGAAAAAGGTAAAAGAAAAAAATGTTTTAAGGAAATTATCACGGTTCTCAACTATTTTCTCGATACGGAGGAGAGTACGGATAAGAGTACGGATCAAGATATAAATAAAACATCTTGTTCTACTGGTGACGAAAATAAAGGTCCTGAGAAAAAAAAGACTAACAATGAATGCTGTGGATCCATTCCAAGCAAGACTTTTGTAGATAAAAGTGTTCAGTTATCTTCAAAAAAATCTAAAACTAAACAGTGTACCGAATCATCTGATCTACCGCTCTCCAGTGATCTACCTAGCACTTCGTCCGATTCGGCTACTTGTAAAGTGTTAAATAAAATCAAAAGAGAGTGTGAAAAATACCATCAAAAACGATGCAAATCTTATAATGGAGCAAAAAAATGTGACGCATCCAGCAGTACCTCGGTAAACTGCAATCAATGCAGGAAGGTCCACCACTGTTCTTGTAGGGGGCACAAATGTAAAAATCACAACACTAAAACACTCGTTGAGAAGACAAAAAGAAATTGTATAGCTTACAACCTGATCATACAAACATCTGACAGCATGATCAGCGAGGAAGTCACTTACGGAAACAAAGAGCGACAGTTACAGAATATTATAGTAAAAGTACCTTCAAAACGAAAAGAAGAAGATATACCTTTTAAGGAAATGTCCAGAAAGATTGAAAAAAAGATGAAAAACTGTAGTCCTCGATGTGGCAAGAACTATCACCGATCGAAAAGTTGTCCAAATGAGAGTGAAATATCAAGTACAGACGAATTTTTAAAAAGGGCTCGTGAGTTTACCGTTAGGGAGTACTTGGAACAAAATCGACCAGACTTCGTTGAGAAGAGTTCAAATCGACAACATTGCTTGAAATTAATTAATGAATCGCGAGCCAACGAACGAATTACCAAACGCCAATTGCTGTCGTTGCAACTGGACAAGCAACAGACTCTAAACAGTCTCACAAATACAGAATTACAAAATCTGGCTAGAGAATTAGGCAATGAGCTGAGAAGGAAAAAAGTGGCTCCAAAATTCATTAATGAACGTGAAATGAAAAAACATTCTGAAAAAATTTACAAGTCTCTACCGGAAGTCATGCGGCAAAAAGAAGAGATGAAAAAGGAAAATATTAAAAAAACTAACTTATTAATGGCCAGTTTATTTAAAAAGAATCTACAAAAGAAAACACTCAGTGGTGATGTCAATTTATCCAACTATAACACAGTCATAAAGATATGA

Protein sequence:

>DPOGS207016-PA
METDETDTKSGFDYNIEDSYSAINSSILEYYKKFGRKRDLEQFFSLSTTQSDIKDPSSIFWRRMKSQYDSSDSGDNKKSESSTELCRISIKCSIPEPGSSKEDNARTKIDSVSPPIISEEAPTHQCQSSDNDSDKSNDVQSQKSHDCTLDTSMNKPISPTSSVTSQRRLEWDSLADVGYANESDRKNSASSLSTLERMALQQQYSNETKHNSDLGPPTAQSTPVDVSEVKMKSKKMDSKKSTKIYKKDVDLFELNVPQMTEYMKPINVNLTKHISFNVERDGGVSVENMTKSVSLSPEKVSVETAITPQVKMDKEIQTTLIKTKDHNAQPKQNEPVKKIPVLISVNTLKKRVRRKKVRKLRRQNSRKQNVIDKENIPEKHAEPVSDAESFEYMPGHIYNQNQLKIDKNTVNPSGNKSSLESSAGLTTDSSKTSKYSFTKDLEKGIDMLKNTLDYKCDDPNLKKKLIRDVVEKLIKSKYRDDESSTEFLSGLTFESKKLDLYNQHHTTTSTSENDTMKRSKLLKPKKSILRIDKFNSGPVASTSQSVPNLHTVINQEKPIAPKIGQIQLSHTDSDISNKNKITSDTGIDKISSEQLYQKYLEALQREQAYKRHLKDKEIFLKQKLASSDIAFDVVRHAETKTQNRIKDLMKDLIRNNYDDGSGDASRLEGGSGSNLNIEQYNSARRQRSHSVFTLSSGTSDNHYKQAKLTKCTQDADALKTNFTKTDNHYCCCPYHKAHPKIGVVDSSVQVNLNCCKDYSPVRTQTCSTSCAPVHQSVPYKCEKCNTIYKTHKLPKCVPEDRDEIKYVCLCTEDTVTKNEMPENILIYKCSRLNSKGLKVDTVLKASNAVSAPSSNSTSPRKISTFGSQDCERKCLVIKDSNSSNIQTKSSQTDLNLRLALRNRTSEQSTQSSTSEDRNDKFSLHGKKPSIFVHEATRVVQTEMSIDPKISDPSISDINIVNDECCVQLISERFKEITQNSSRNQVNKCLEGAIRMEEREPVNIVYANTKETSTNTVEKHDKEIQSSPDVPNTPITEANDRQGENNFTIPIQGTNMMLKVSLGSENKGINKEFDRNLLKKTEVESTSKGTETLKNDVVENFTSLREDCSKAVQCNDTNIFKNLCKKPEPSCGHCNTYQEKTPADIVSSELNNKANCKLNTKPTCEKKILTDNKCNTFPRNDRCNVQKPLLRSNTDTGKMERCCRVTFNKKSTKDDGVQPDPEILKEKQQPKGVCTEKCQEKYDSCCSEKSLKSIDVSSKSKFSSDIDEPSQLKPSTSESDDNINKCISRDPLIEMIQDITKRYSKKDYEKGKRKKCFKEIITVLNYFLDTEESTDKSTDQDINKTSCSTGDENKGPEKKKTNNECCGSIPSKTFVDKSVQLSSKKSKTKQCTESSDLPLSSDLPSTSSDSATCKVLNKIKRECEKYHQKRCKSYNGAKKCDASSSTSVNCNQCRKVHHCSCRGHKCKNHNTKTLVEKTKRNCIAYNLIIQTSDSMISEEVTYGNKERQLQNIIVKVPSKRKEEDIPFKEMSRKIEKKMKNCSPRCGKNYHRSKSCPNESEISSTDEFLKRAREFTVREYLEQNRPDFVEKSSNRQHCLKLINESRANERITKRQLLSLQLDKQQTLNSLTNTELQNLARELGNELRRKKVAPKFINEREMKKHSEKIYKSLPEVMRQKEEMKKENIKKTNLLMASLFKKNLQKKTLSGDVNLSNYNTVIKI-