Monarch geneset OGS2.0

DPOGS202081
Transcript: DPOGS202081-TA, 4230 bp
Protein: DPOGS202081-PA, 1409 aa
Genomic position: DPSCF300116 - 67122-80400
RNAseq coverage: 191x (Rank: top 48%)
Annotation
Source           Hit                E-value  Identity  Description
Heliconius       HMEL003383         0.0      84.96%
Bombyx           BGIBMGA011306-TA   0.0      88.24%
Drosophila       rols-PB            0.0      45.01%
EBI UniRef50     UniRef50_E0W098    0.0      46.36%    Rolling pebbles, putative n=7 Tax=Neoptera RepID=E0W098_PEDHC
NCBI RefSeq      XP_969896.2        0.0      56.00%    PREDICTED: similar to rolling pebbles [Tribolium castaneum]
NCBI nr blastp   gi|270003434       0.0      56.13%    hypothetical protein TcasGA2_TC002665 [Tribolium castaneum]
NCBI nr blastx   gi|270003434       0.0      56.23%    hypothetical protein TcasGA2_TC002665 [Tribolium castaneum]
Group
Gene Ontology:
GO:0005488   5.5e-12   binding
GO:0005515   2.2e-05   protein binding
KEGG pathway 
InterPro domain:
[892-1270]    IPR020683   2.4e-59   Ankyrin repeat-containing domain
[1248-1365]   IPR011990   5.5e-12   Tetratricopeptide-like helical
Orthology group: MCL11026, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202081-TA
ATGCCGAAAAATCGACGAATCCATCTCAAAAGGTCAGAAAAGGAAAGCGCGCCGCGGGCGAGAACGGACTTGGCCGCTTTACGACAACTTCTGGAGAGCGAAACGGGAGGCACAACCTGTCCGAGTTGTAACATGCCCTTTGATAAGGGTAAAAAGAGAAAGTTAATTGACACATGCGGACATGAGCGCTGCTATTCCTGTATGTTCAGAAACGAAGCCTGCCCAATTTGTGCAAGAAAAAGTCAGGGAAGACGTCCAGTTATGGAGAGATATACCCCTTCTCCACAGCGACAAGTGGATCATGAATGGCAATCACCGATGCGACTACCAAAACCACCGAAGCCTTCGAGTCTCGCTCAAAGCTGCCCCACACCCCCTCATACGAGAAGAAGATTCTTCCTTAGTCCTAAATCCTTGCGCAGTCCATTCGGCCAGCGAAGCCGCCATTCTCACGAAAACCACGTGCCTCTATCAGGGTTACCAGAAGAAGGTCCTAGGAACGCAGCCTGGACGAGCTTGGTGTTTAATAAGATAAGATCGTTGTGGTCAGCGCAGTCCTCAGTGCCTCAAGGACTCAACCAATTGACAGGCACGGAAACACAATATGACGAAGGAGGTCATATCAAACAAGGTTACGAGACAAGACGTCAAAATGACTTGTACATGCGGTTGGGATTACTTCTTGGAGAGCGACGTGGATCCAGAAACAAATCCCGGGACAGTTGCACATCTCTGGCCTCATTGGACGCTCATACTCTAGCCTCTCACAATACCAGTCCAGTGTCAACTCTAACTGGATCGTCAGAAGTGGATGCTGCGACACCACTTGGTAGGGATTCTTTAGGATCACTAGCCTCAATGTCACTCTCTGCCGCCAGCAATTGTTCATCATCAAGTCCAGGAAGCAGACGGCATTCTGTTAACACCTTGCAAAATGGACGAGAAGAATTGACACGGATGTCAAGTGGATTCTTTAAGAACAGAAAAACAGCAGCACGGAGATCAGCTCGTGTCAACAGCAAACAGTCGTCATCTTCGTCAGAAATAAAGAAAGTTCATCCAACTCCGCAACTGACATTGAGACCACTATTCTTCGAAGTGCCGGCAACCAATAACGACACTTGCTTTTCTGGACGACACTGGCTTATGAGAGACATGGAAAAAGCTTTGGAATCTTCTTCACCTGGTATAATGATATCAGGCTGTCCAGGTACAGGCAAAACTGCTTTAATACTTCAGCTGGTCGAATATTCATGTTTCGGCCGCAAAAGGAATTATCAGTATCAAGAATTAAGAGAGCAGTCAGATATTAGAGAAATGCTGCCAGAAGAAATAGCAGCAGGGATGATCACACAACTAGCATCACAAGTTGTTGCTTATCATTTCTGCCAAGCAGACAACAACAGCACTTGCCTTGTTGGCGAATTTGTACATTCCCTGGCGGCCCAACTATGTCAAGCACCAAGACTACAGGCATATAGAGAATACCTACTTAGCGAACCACATTTGCTATCCTGCCTTTCATTAAAAGAATGTATAGCCGATCCAGACTTAGCCTTTATGAGAGGCATTATAGAACCTCTTATAATATTAAGAAGAAATGGAAGTATAGATTCAAGTAATAGTATTATACTTGTTGATGGGCTCTGTGAAGCTGAATATCACCGACCCGATCACGGTTATACTGTTGCTTCATTTCTTATAAGACATGTACCAGAAATGCCAGCATGGCTTAAAGTTGTAGCCACCATAAGAAGTCAATTTCTGGAACTAACAAAGCAACTACCATACACAAGGTTCAGTCTAAATGAATGTGACAATGTCCAAAAAGATCTATTGGAATATTTTAATGCCAGGGTACAAGCAGCCCCAATTATAGAAACAAATATTAAAAGTTCCACGGGGAAATCCGAAGGAGTTCATAATTCTGTCATGAAGTTTGCCCAATATGTTATTCATCTCAGTCAAGGGTCATTCCTGTTTCTAAAATTAATTTTAGA
CCTTCTTGAACGCAGTCATATAGTCGTAAAGTCGACTAACTACAAAGTTGTGCCAATTTCGTTAGCTCAAATATTTTTGCTGCAATTCAATTTAAGATTCCCCACGGTACAATCTTTTGAAAAAGTAACCCACATTTTAAGTGTTTGTCTAGCAGCACTGTATCCTCTCACCTTGGTAGAGATTTATTACTCTGTAAATTCTCTTCTTGTCGACACTTACTTGCCGTGGGAAGAATTTTGTCACAGATTTGAAAGCCTATCCGATTTCTTGGTGAAAAGAATCGATAATACTTACATGTTCTTCCACCCATCATTCAGAGAATGGTTAATACGACGCGATGATAATGAGAGTCCAAAATTTCTATGTGACCTGCGGGCTGGTCACTGCGGTATTGCTTTTAGACTTGCTAGAGTGCAAGCGCCTCTAGACCCAGAAAAGTCTATGGAACTCGGACACCACATTTTAAAAGCTCATATGTACAGAAATATGGGACCAGCACAGTTAGGACTATGTCCGAGAGATTTACAAGCAATGATGGTAGCGTCGAGCTCTTCGAATGTAGGCGAAGCAGTAGCTAATTTACGTAACGTATATACTCCAAATGTAAAAGTATCGCGTCTCATGCTGCTGGCTGGTGGATCACCTAATCAAATTACTGATTGTCTTGGAAATGCTCCTCTATTATGTATGTATGCATATCAAGGAATTATATCAATGGTGGGATTACTGATTGAATTTGGAGCTGATTTAGAAATGACAAACTCGCAAGGATGTTCAGCTTTATCATTAGCTTGTCAGAGAGGTCACACCGATGTTGCGAGGATGTTGATAGCATCAGGCGCATCTTTAAGTCACACTGATACAGCCGAACAAACACCTCTCGTCCACGCAGCAAAGAATGGTCATAGAGATACAGTAATTTACCTGCTGGGTTGTCAAACTGGTAAAGACGATCGAAACTCAATAGAAATAGACGAAGGCAACATTGAACAACTAGTTCCCGGATCAAGACATGCTCTGATAGCGGCAGCTCAAAACGGTCATTTGGATATTGTCGAGTATCTTCTAGATACAGCTGAATTAATCCCCGACGGTATTTGTCCAGTAACAGGTGAGACAGCACTGACAGCTGCTTGCTCTACTGGTAACGCTGCCATCGCTGATGCTCTCCTAATTCGAGGAGCTACGCCATACTCATTAAATGCCAGACAAATGTCACCTTTGGCCCTAGCAGCTAAAAATGGCAGAACAGCATTAGTTTTACGACTCCTGGATTCTGGAGCTGATGTTATGGGGTCGAGTGGGAAAATACCATTAATTTTAGCAGCTGCGGAGGGTCATTCAGATGTTGTTGAAATGCTTTTAGGTCATGGAGCTGATCCCAATGCTGTGGATGGTGATGGCATATCTGCTTTAGGTTGGGCAAGTCTGAGATCTAGAATACCCACGGTAGTAATGCTTTTAGACAAAGGAGCAAATATAGAGCAAGCTGACAGTAGCGGCCGTACACCGTTAGGACTAGCTTGCGGTGGACCGGCGGAGCTAGCGGAACTTCTTTTAGAACGTGGCGCATCACTAGAACGTGGAGACCACAGCGGCTTACGACCATTAGATCGCGCCATCGGACAGAGGAATGTACCGATAGTAAATTGCTTTCTACGGAAAGGAGCGAAACTCGGTCCAACGACATGGGTAATGGCGTCAGGAAAACCAGAATTTATGCTCATCCTACTCAACAAACTTCTGGAAGACGGTAACATTTTATACCGCAAGAACAGGCCGTCTGAAGCTGCTCATAGATATCAATACGCCCTCAAGAAGATCTCTCCGCTCATCAGCGATGACGTCACCAACGCCCAGGAACACTTGAACGTTTTCGTGCAGCTTAAAACCAATCTGCTGCTAAATTTATCGAGATGCAAACGAAAACTTAATGAACCATCAGAGGCTTTGGATTTAGCCGCCCGCGCGTCCGTGTTACGTCCGAACGCTTTCGAAT
GTTCCTACGCCATGGCGAGAGCGATACTTGCTCTGAACAAACCATCAGATGCTCTTCCTCATGCTAGACGAGCTTTACTCCTCGCTCCACAGACAGATCTATCAGCCATGAGAACCTTGAAAGCCCTTCAACAAGAAATTCTGACGCGTATTAATGCCGGTACACAAAGTTTAAACGGTGACACACGATCTTTAAGAAATTTTGACAGCATTAGTCTAAACATGCCTTAA

Protein sequence:

>DPOGS202081-PA
MPKNRRIHLKRSEKESAPRARTDLAALRQLLESETGGTTCPSCNMPFDKGKKRKLIDTCGHERCYSCMFRNEACPICARKSQGRRPVMERYTPSPQRQVDHEWQSPMRLPKPPKPSSLAQSCPTPPHTRRRFFLSPKSLRSPFGQRSRHSHENHVPLSGLPEEGPRNAAWTSLVFNKIRSLWSAQSSVPQGLNQLTGTETQYDEGGHIKQGYETRRQNDLYMRLGLLLGERRGSRNKSRDSCTSLASLDAHTLASHNTSPVSTLTGSSEVDAATPLGRDSLGSLASMSLSAASNCSSSSPGSRRHSVNTLQNGREELTRMSSGFFKNRKTAARRSARVNSKQSSSSSEIKKVHPTPQLTLRPLFFEVPATNNDTCFSGRHWLMRDMEKALESSSPGIMISGCPGTGKTALILQLVEYSCFGRKRNYQYQELREQSDIREMLPEEIAAGMITQLASQVVAYHFCQADNNSTCLVGEFVHSLAAQLCQAPRLQAYREYLLSEPHLLSCLSLKECIADPDLAFMRGIIEPLIILRRNGSIDSSNSIILVDGLCEAEYHRPDHGYTVASFLIRHVPEMPAWLKVVATIRSQFLELTKQLPYTRFSLNECDNVQKDLLEYFNARVQAAPIIETNIKSSTGKSEGVHNSVMKFAQYVIHLSQGSFLFLKLILDLLERSHIVVKSTNYKVVPISLAQIFLLQFNLRFPTVQSFEKVTHILSVCLAALYPLTLVEIYYSVNSLLVDTYLPWEEFCHRFESLSDFLVKRIDNTYMFFHPSFREWLIRRDDNESPKFLCDLRAGHCGIAFRLARVQAPLDPEKSMELGHHILKAHMYRNMGPAQLGLCPRDLQAMMVASSSSNVGEAVANLRNVYTPNVKVSRLMLLAGGSPNQITDCLGNAPLLCMYAYQGIISMVGLLIEFGADLEMTNSQGCSALSLACQRGHTDVARMLIASGASLSHTDTAEQTPLVHAAKNGHRDTVIYLLGCQTGKDDRNSIEIDEGNIEQLVPGSRHALIAAAQNGHLDIVEYLLDTAELIPDGICPVTGETALTAACSTGNAAIADALLIRGATPYSLNARQMSPLALAAKNGRTALVLRLLDSGADVMGSSGKIPLILAAAEGHSDVVEMLLGHGADPNAVDGDGISALGWASLRSRIPTVVMLLDKGANIEQADSSGRTPLGLACGGPAELAELLLERGASLERGDHSGLRPLDRAIGQRNVPIVNCFLRKGAKLGPTTWVMASGKPEFMLILLNKLLEDGNILYRKNRPSEAAHRYQYALKKISPLISDDVTNAQEHLNVFVQLKTNLLLNLSRCKRKLNEPSEALDLAARASVLRPNAFECSYAMARAILALNKPSDALPHARRALLLAPQTDLSAMRTLKALQQEILTRINAGTQSLNGDTRSLRNFDSISLNMP-
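
The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the geneset. It assumes that the 4230 bp transcript is entirely coding sequence ending in a single stop codon, and that the InterPro ranges are 1-based and inclusive. The function names are hypothetical.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp bases.

    Assumes the whole sequence is coding; the terminal stop codon,
    if present, does not contribute a residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def overlap_length(a: tuple, b: tuple) -> int:
    """Number of residues shared by two 1-based, inclusive ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Values copied from the record above.
transcript_bp = 4230
protein_aa = 1409
ankyrin = (892, 1270)    # IPR020683
tpr_like = (1248, 1365)  # IPR011990

print(expected_protein_length(transcript_bp) == protein_aa)
print(span_length(*ankyrin), span_length(*tpr_like))
print(overlap_length(ankyrin, tpr_like))
```

Under these assumptions the 4230 bp transcript corresponds to 1410 codons, which matches the 1409 aa protein plus one stop codon, and the two InterPro ranges overlap near the C-terminal end of the ankyrin region.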