Monarch geneset OGS2.0

DPOGS206533
TranscriptDPOGS206533-TA3468 bp
ProteinDPOGS206533-PA1155 aa
Genomic positionDPSCF300190 - 235379-247967
RNAseq coverage59x (Rank: top 68%)
Annotation
HeliconiusHMEL0022910.073.90% 
BombyxBGIBMGA005920-TA2e-10774.60% 
DrosophilaOseg4-PA0.033.62% 
EBI UniRef50UniRef50_Q9W0970.033.62%LD29485p n=13 Tax=Diptera RepID=Q9W097_DROME
NCBI RefSeqXP_001352755.20.033.54%GA15220 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|1954905440.034.08%GE20921 [Drosophila yakuba]
NCBI nr blastxgi|1497277463e-16031.64%PREDICTED: WD repeat-containing protein 35-like isoform 1 [Equus caballus]
Group
Gene OntologyGO:00055153e-22protein binding
KEGG pathway 
InterPro domain[214-320] IPR0159433e-22WD40/YVTN repeat-like-containing domain
[86-533] IPR0110471.2e-15Quinonprotein alcohol dehydrogenase-like
Orthology groupMCL13592 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206533-TA
ATGTTTATATATATGAGTAAAAAGATTGCTATTCCAAAACAATCAAATGTTTCTTGCCTGGCATGGAATCATTCATCGGGATATATAGCTGTTGGAGGAGAGGATGGAATGCTAAAAGTTTTAAAATTAGAGTCAGGTGGAAGTGGTAACCTCTCTATGAATCTCAGCTTAGAAGGTCACACAGGAAGAGTTTGTTGTGCGATCTGGAATGAGGGATCATGGTATGAGGAAATGATAAATAACAGAAATAAGTCTACAGTATCAGGGATGGCATGGGGTTCTGATGGACAGAAAATTTGTATAGCTTATGAAGATGGTGCAGTAATTGTGGGTTCAGTAGATGGGTCACGAGTTTGGGGCAAGGATATAAAGGGTCCAGGTCTTAAAGCTGTCCAATGGTCCCCGGATAACTCATTACTACTTTTTGCCCTTTCTAATGGAGAACTTCATTTATATGATGATCAAGGAAATTTTATGATGCCAGTAGGAAATAATGAAGTGTCAGGATCAACTGATGTAATCTGTATGGACTGGTATTCCGGTAGAGCACCAGCCAATAGACCAGTTTTAGTTATATGTTACAAAAATGGACTGATGCTTCTTATGAAAAATATCATTGAAGAAGAGAGTGTAGTGGTTGATACAAACATGACAATAATAGACGGTCACTGGAATCACAATGGAACCATATTAGGTGTGGCCGGAAAAACACAAGATCAAGCCAATGTTGTACAGTTCTTTAGTGCTTATGGAGAGCATATCCGGTCGCTGCGCGTGTCCGGCGGCTGCATGCAGTCCCTCTGCTGGGAGCAGCGCTCACTCCGCCTGGCCGTCACTATAGACAGTTTCATATACTTCGCGAACGTCAAGCCGGACCACAAGTACGCCTTCTATGGGAACACGCTGGCATACGTCTCCGGAACTGATACTGTCACATTTTGGGATACTCTCACTCATCAGTCATGGGTAAACCACATTCCCGACGTAGTGGACATGTATGGCGTCGACGAGTACTGCATTATAGCAACTTTGACTGCCCTTATCATATCCAACCAACAAGGGATCCAGTGTGATGCCAAAGCAATTACTATGCCGATATCTTTTGTCACAATTAATAGCAAAGCGATAGCCGTTGCAGCCTCCAAAGAATCCTTCATGATATGGAAGTTTTCAATACCTTCAAGACCGCGTATAACTGAGCAAGTTTTCTATGCTGACGGTAGTCCTGTCACAAAGAGTGAGGCTGGCTTCCACGACGATACGATATGTTGTATCTCATGTTCAGACACACATTTACTTATAGGAAGAGATTCAGGCACGATTCTATTGTTCTCTATGTTGAACTTCAAGAAAATTACATCCATCAATATGAATACGAAACCTTACAAGCTTGGCCTTAACTCAAATTCAAGTAAATTTTTCGTAATCGACCAGCCTGGTTCCCTTTTCATTCTTGAGACGGAAATGGCACATAACGTCAGTGTTGGACAAGCGGTACGAAGGGAGGCGTGGGCCGCGCGCTGGGCTTCAGATAACTCATCCCTCTTGGCGCTGGCGGAGAAGAATCGTATATATGTACTGAGAGACGGCCAGCCCGAGGAACCTCTAACAGCTAACGGTTACCTCTGTGACTTCAAGGATCTAGAAATAACTTGCGCGCTGTTAGATAGCATAACGGACAAGTGTACGCCGCAGCATGTGGTCAGGATGGAAGTAAAGTCGTTGAGAGACACGAGACAGCTCATAGAGAAAGTGGGATTGAAAGAAGCCGAGGACTTCATCAAGGATAACCCTCATCCGAAGCTATGTAAATTTTTCGTAATCGACCAGCCTGGTTCCCTTTTCATTCTTGAGACGGAAATGGCACATAACGTCAGTGTTGGACAAGCGGTACGAAGGGAGGCGTGGGCCGCGCGCTGGGCTTCAGATAACTCATCCCTCTTGGCGCTGGCGGAGAAGAATCGTATATATGTACTGAGAGACGGCCAGCCCGAGGAACCTCTAACAGCTAACGGTTACCTCTGTGACTTCAAGGATTTAGAAATAACTTGCGCGCTGTTAGATAGCATAACGGACAAGTGTACGCCGCAGCATGTGGTCAGGATGGAAGTGAAGTCGTTGAGAGACACGAGACAGCTCATAGAGAAAGTGGGATTGAAAGAAGCCGAGGACTTCATCAAGGATAACCCTCATCCGAAGCTATGGTTACTATTAGCTGAGGCTGCATTGAAGAAGCTGGATACGTCGTCTTTGGAGACTGCGGAGGCTGCGTTCGTTAGACGAAACGACTACGCCGGGGTGAGAGCTGTCAGGCGACTCAACGGTCTACACTCCGCGGCTCTCAAGAAGGCGGACATACTGGCATACTTCAAAGACTTCGATGCCGCTGAACAGATATACCACGATGAAGACAGGCGTGATCTGGCTATCGCTTTACGAAAGCGTTTGGGACATTGGTTCCGTGTGGTGGAGCTCCTAAAGATGTCAGTCACAACGACGGAGGCTCAAGTTAAACAAGCTTACAGTAACATTGGCGATTATTATATTGACAGACAAAACTGGACAAGCGCTATTGAATATTACAATATGTCAAACAACATAGACGGCCTCAAGAAATGTTACATGGCGTTGGAAGACAATGAATCCCTGTCAAAACTAATATCAGGAAGCCCGAGACACGCTAAGGTGGAACGAGAAAATAGATCCGGCGTCGACAAAAAAATAGACAGCATTCAAGAGCCTTCCATACAAGCTATAGCGCAGTTAAAAGAATCCGGACGCACACTGCAGGCAGCGGCTATGGCGTTTCAAATGGCGAATGCGGAAGCGGCTAAAGCGACTTCACCGTTGCGGATCAAGAAGCTATATGTCCTAGCGGGACATCTGTACAGTCAACACGCAGTCGACGGTGGTAAGAGTCGTGAGGCTGGTTCGTGCTGGCGTGCAGCTCGAGCACAGCACTGGTTCTGTGCCGCGGCGTCGGCTGCGGGGGGCGGAGGACCCGCTGCCCTCCGCCTCGCCGCCCGTCTGATGGACGTCCTGCCAGCACTACACGAGACACAGCACTTACAGGCCAGTACAGTGTCTTTACTAACATTCTCCGCCTGGTGTACAATTGCAGCCATAGCACTACAAGCGAGGGCTTTCGACCTTTGCTCAAAAGCGTTGATAAAGCTTGAAGCATTAGACGTAGAGGTATTCGAGACAATCGCCATTGAAATATTCAGCAGATGTAAACCGAAGGATGCCAAGTCCAACAAAATAGAATGCCCTCATTGTCAGATGAATATACCGGATTGGGTGTCATCGTGTCCTGGATGTTCGAGCTCGTTCCCGGGTTGCGTCGTATCTGGTCGCCCATTGATATCTCACACCACGTGGTCGTGTAATCGGTGTGAATCTCAAGCCCAACAACACGAGCTCGTCCTGCGACACGCCTGCCCAATGTGTCACACTCAACTCGCATAA

Protein sequence:

>DPOGS206533-PA
MFIYMSKKIAIPKQSNVSCLAWNHSSGYIAVGGEDGMLKVLKLESGGSGNLSMNLSLEGHTGRVCCAIWNEGSWYEEMINNRNKSTVSGMAWGSDGQKICIAYEDGAVIVGSVDGSRVWGKDIKGPGLKAVQWSPDNSLLLFALSNGELHLYDDQGNFMMPVGNNEVSGSTDVICMDWYSGRAPANRPVLVICYKNGLMLLMKNIIEEESVVVDTNMTIIDGHWNHNGTILGVAGKTQDQANVVQFFSAYGEHIRSLRVSGGCMQSLCWEQRSLRLAVTIDSFIYFANVKPDHKYAFYGNTLAYVSGTDTVTFWDTLTHQSWVNHIPDVVDMYGVDEYCIIATLTALIISNQQGIQCDAKAITMPISFVTINSKAIAVAASKESFMIWKFSIPSRPRITEQVFYADGSPVTKSEAGFHDDTICCISCSDTHLLIGRDSGTILLFSMLNFKKITSINMNTKPYKLGLNSNSSKFFVIDQPGSLFILETEMAHNVSVGQAVRREAWAARWASDNSSLLALAEKNRIYVLRDGQPEEPLTANGYLCDFKDLEITCALLDSITDKCTPQHVVRMEVKSLRDTRQLIEKVGLKEAEDFIKDNPHPKLCKFFVIDQPGSLFILETEMAHNVSVGQAVRREAWAARWASDNSSLLALAEKNRIYVLRDGQPEEPLTANGYLCDFKDLEITCALLDSITDKCTPQHVVRMEVKSLRDTRQLIEKVGLKEAEDFIKDNPHPKLWLLLAEAALKKLDTSSLETAEAAFVRRNDYAGVRAVRRLNGLHSAALKKADILAYFKDFDAAEQIYHDEDRRDLAIALRKRLGHWFRVVELLKMSVTTTEAQVKQAYSNIGDYYIDRQNWTSAIEYYNMSNNIDGLKKCYMALEDNESLSKLISGSPRHAKVERENRSGVDKKIDSIQEPSIQAIAQLKESGRTLQAAAMAFQMANAEAAKATSPLRIKKLYVLAGHLYSQHAVDGGKSREAGSCWRAARAQHWFCAAASAAGGGGPAALRLAARLMDVLPALHETQHLQASTVSLLTFSAWCTIAAIALQARAFDLCSKALIKLEALDVEVFETIAIEIFSRCKPKDAKSNKIECPHCQMNIPDWVSSCPGCSSSFPGCVVSGRPLISHTTWSCNRCESQAQQHELVLRHACPMCHTQLA-