Monarch geneset OGS2.0

DPOGS205696
TranscriptDPOGS205696-TA3153 bp
ProteinDPOGS205696-PA1050 aa
Genomic positionDPSCF300250 - 56559-63379
RNAseq coverage295x (Rank: top 38%)
Annotation
HeliconiusHMEL0148041e-12653.09% 
BombyxBGIBMGA009832-TA7e-11153.93% 
Drosophilamio-PA4e-4426.89% 
EBI UniRef50UniRef50_D6X0506e-8133.75%Missing oocyte-like protein n=1 Tax=Tribolium castaneum RepID=D6X050_TRICA
NCBI RefSeqXP_001603665.19e-8536.64%PREDICTED: similar to LOC100049152 protein [Nasonia vitripennis]
NCBI nr blastpgi|3454873006e-8836.90%PREDICTED: WD repeat-containing protein mio-B-like [Nasonia vitripennis]
NCBI nr blastxgi|3454873001e-14836.85%PREDICTED: WD repeat-containing protein mio-B-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055151.7e-11protein binding
KEGG pathway 
InterPro domain[59-347] IPR0110461.7e-11WD40 repeat-like-containing domain
[248-339] IPR0159437.7e-08WD40/YVTN repeat-like-containing domain
Orthology groupMCL11224 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205696-TA
ATGGCAGGAAGTAAATTGGATGTCCTCTGGTCTCCCATTCACCATGATAAATTTATCGTATGGGGTCAAGACTTAACCTTATACGAAGTGTCTAATCTTCAAGATATCCCCAAAAACACTGCATATACCCAACTGTGTCCAACAAGAGGAGCCACAGTAGTAGCATCACAGAGTGCAAGCGGAGTGCGTTGTGTGGACATCAGTGCTGTTGTTGAACACCCCGACCCACTGCTGGCTCTAGGACATGGAAGTGGACGAGTGTTGCTCACAAGTTTCAAACAGGCATATGATTCACTTGGCCTTGTGGGTAAAGAATTCGTGCCCCGCTACCCCCGACAATGTAACTCGGTTTCATGGAACAGGTCGGAGGGTCACTTGCTGGTGGTGGGAATGGACAAACATCGCAGCGACAGCGCCGTGCTGCTATGGGACGTCCAAGCCGGCGCCGGAGATGACTTCACTGGTAAGAGCGGTCCGTTGAGCGCGGCCGGCGCGGGCGAGGCGGCGGGCTGCGTGTCGTGGTGCGGGTTCGCTCCCCGGACAGTGCTGGCGTCCATGACCTCCAAACACATTAAGATATTCGACATGAGAGAGAATCCGGGCAAGGCAACCAGCTCGGTGTCAACCCGTCAGTGGGCGGGTGCCACGTGTGCCGGCTGGCTGGTGGCGGCGAGGGGTGAAGGTGGGGAGGCGGCGATATGTGTGAGGGACGCGCGCATGCTGACCAGGTCGTTGGCCCTGCTGCCGCTACAGAGACCCGCCAGGAAAATACACTGGAGCCCCACCAGGCAAAATCTGCTGATATCCCTCCAGAGAGATTCCACGACCTTGCGACTCCACGACATCCAGCACATGCACGACCCCCGGCGGCAGTCTCTCGACGTACGATCGACTCGAGTCGCTGCATTGCCTTATCCAGTCGAGCGTGACGTCACGGTCAGCGGCGTGCCGGTGGCTTCGTTCGCCTGTCACCCTGCACACCGCGCGAGGCTTCTCACGCTCACTACTACAGGCTGTGTGGCGGAGTACATGGTGATGGAACGCGTGTGCGTGTCGTGGGGCGCGAGCGGCGCGCTGGCCTGGGGCGGCACTTCGCTGCGGGTGCTGCGGCCCGCGCACCTGCCGGCCTCCACGCCCGACCTCGACATCTCGTACAAGGCGCGCGCGAGGGCTCTCAACGACTACGGTCTCAAGCCCGACCTGTGGCAGAACGCGGAGCTGGCGGAAGACGAGGCGCTGAGCTCGTTGTGGCACTTCCTCGCTCTCAGCAAGTCACTCGTAGAGGACGGCTGCATCCGCAACAGCGGCTGGAAGCATCCCGGCGTGCGATCGGTGCTGCGATCCCCGGGGGAAGGATACCGCTCGGAAGCCGTGTCCGCTCTCCTGCCCGACCTGCCCTCGCGCAAAGTCACCATCTACCGGAGTGCCGAGCGCACGAGGGCGCTACAGCTGTGCGGCTGGGGTTGGGGCTGGGAGAACGCCGCGGCCGGCGTGGAGCGCGCGGAGGCCGAGGGCACGCCGTGCCGCGCCGCCGCCCTCGCCGCCTTTCACCTGCGCGTACGGGCGGCGCTCGACGTGCTGTCCCGTGCGCGCGCCCCGGCTTTGGCCCGAGGAGCGCCTTTGGCGGGACGCTCTAGCCGCCGCTGCGCCCGCCCTGCCCGATCCTTATCTGCGGGCGCTGCTACACTTCGTCGCCGCTGCGGCGCCTCCGCCCCAGCAGCCGCCCGCCCCGCACCATCAACCGGACTACTCGGACGTCCTCGTGAGTACACTGCCCAGGGACACGTTCATGACGGCTCGAACGATCCGACGATGACCTCGTTTTCCACAGAACGAGACGGGCATGCGGCTGGAGGACCGCGTTGCATTCGCATGCATCTTCCTCCCGGATGGGAAGCTGCACGAGTACCTGGCGAACACCTGGGGCTCGCTCCGCGCGGAGGGCTCTCTGTCGGCCCTGCTGTTGTCCGGAGCGGGGGCGGAGGGCGCGGGTGCTCTGCAGAGATGGTTGGAGCGCACGGGCGACGTGCAGAGCGCGGCGCTGGTGGCGGCCCGCGCCTGTCCACCGGAGCTAGTCCGGGAGGGGCGCGCGGCCTCGTGGCTGGCGGAGTACCGCGCGCTCCTGGACGCCTGGCGCCTGTGGTGGCCGCGCTGCCTTTTGGATACGTGGCTGGCGGCGGCGGGGGCGGGCGCGGAGGCGGGCGCGGGGGCTTCCGTGGCCTGCACATACTGCGGCAAGCCCGTGGCGGCGGCGGGGGGAGCGCGGCCTCGACCCGCATTCGCTCGCCTGCCGCCGCCGGCCGCCAAGATGAAGCAAGTGTCGTCGTGCCCCAACTGCCGCAAGCCTCTGCCTCGTTGTGGCGTGTGCTCTCTCCACCTGGGCACGGGCGCGGCGGGGTCGGCGGGCGCCATGGTGGCCGTGGGCGCCGTGGGCGCGGCGGCGGGCGGGGCGGCGGCCGGCGCGGCGTTCGCGGGCTGGTTCAGCTGGTGCGTTTCGTGCCGTCACGGCGGGCACGCAGCGCACCTGCTGCAGTGGTTCAGCGAGCACGCCGAGTGTCCCGTCAGCTCGTGCACGTGTCGCTGCAGCGAGCTGGACCCACCGGACGTGCCGCGCGCCTGACGTGCCTCGGCGGTCCGTCCGTTCCCGTGTTGGTGCCGGGTGCGAGTGGTCGGCGCGTCTTCTCCGCTCGGGTTCAGGGCTTGAAGCGCCTAGTCCTCGTCGCGAGGATCGCATCCCCGCCCCGAACAGCCGTACAAAAATATACACACGACGACCGACACCCGAGGGCCAAGGTCGAGGGTCGAGGGTCTGCGGCCGACGCCCTAACTAACTATCATAAATATTATCCAGAAAAAGTACGACCACAAGGAACAAAGAGACGAGACAACATACAACCGACGCGCCCCGGCGCGACTCGCGGGCTCGGAGCACGCGACACCACAGGAAAATACGAAGACATCTCGAAAACATCTCTCTATACAAAAATAACAATATTTGTAATGAAGTGGACTCCGCTAACAGACCTACCTATATATATCTATTATAGACTTTTCTATAAAACGACAACAACGCGGGGTCGCGGGGTCACGGGGTCGCGGGGTCGCGGGGCACGGGTCGCGCTTCGAGGACGGCGCCGTAACGAATACAATTAA

Protein sequence:

>DPOGS205696-PA
MAGSKLDVLWSPIHHDKFIVWGQDLTLYEVSNLQDIPKNTAYTQLCPTRGATVVASQSASGVRCVDISAVVEHPDPLLALGHGSGRVLLTSFKQAYDSLGLVGKEFVPRYPRQCNSVSWNRSEGHLLVVGMDKHRSDSAVLLWDVQAGAGDDFTGKSGPLSAAGAGEAAGCVSWCGFAPRTVLASMTSKHIKIFDMRENPGKATSSVSTRQWAGATCAGWLVAARGEGGEAAICVRDARMLTRSLALLPLQRPARKIHWSPTRQNLLISLQRDSTTLRLHDIQHMHDPRRQSLDVRSTRVAALPYPVERDVTVSGVPVASFACHPAHRARLLTLTTTGCVAEYMVMERVCVSWGASGALAWGGTSLRVLRPAHLPASTPDLDISYKARARALNDYGLKPDLWQNAELAEDEALSSLWHFLALSKSLVEDGCIRNSGWKHPGVRSVLRSPGEGYRSEAVSALLPDLPSRKVTIYRSAERTRALQLCGWGWGWENAAAGVERAEAEGTPCRAAALAAFHLRVRAALDVLSRARAPALARGAPLAGRSSRRCARPARSLSAGAATLRRRCGASAPAAARPAPSTGLLGRPREYTAQGHVHDGSNDPTMTSFSTERDGHAAGGPRCIRMHLPPGWEAARVPGEHLGLAPRGGLSVGPAVVRSGGGGRGCSAEMVGAHGRRAERGAGGGPRLSTGASPGGARGLVAGGVPRAPGRLAPVVAALPFGYVAGGGGGGRGGGRGGFRGLHILRQARGGGGGSAASTRIRSPAAAGRQDEASVVVPQLPQASASLWRVLSPPGHGRGGVGGRHGGRGRRGRGGGRGGGRRGVRGLVQLVRFVPSRRARSAPAAVVQRARRVSRQLVHVSLQRAGPTGRAARLTCLGGPSVPVLVPGASGRRVFSARVQGLKRLVLVARIASPPRTAVQKYTHDDRHPRAKVEGRGSAADALTNYHKYYPEKVRPQGTKRRDNIQPTRPGATRGLGARDTTGKYEDISKTSLYTKITIFVMKWTPLTDLPIYIYYRLFYKTTTTRGRGVTGSRGRGARVALRGRRRNEYN-