Monarch geneset OGS2.0

DPOGS215355
TranscriptDPOGS215355-TA3165 bp
ProteinDPOGS215355-PA1054 aa
Genomic positionDPSCF300351 - 41826-62670
RNAseq coverage672x (Rank: top 19%)
Annotation
HeliconiusHMEL0052220.075.72% 
BombyxBGIBMGA009563-TA0.067.92% 
Drosophilauex-PE0.054.45% 
EBI UniRef50UniRef50_E2BD060.054.69%Metal transporter CNNM2 n=19 Tax=Metazoa RepID=E2BD06_HARSA
NCBI RefSeqXP_001664301.10.056.38%ancient conserved domain protein 2 (cyclin m2) [Aedes aegypti]
NCBI nr blastpgi|1571387100.056.38%ancient conserved domain protein 2 (cyclin m2) [Aedes aegypti]
NCBI nr blastxgi|1571387100.056.39%ancient conserved domain protein 2 (cyclin m2) [Aedes aegypti]
Group
KEGG pathway 
InterPro domain[442-613] IPR0025509.1e-34Domain of unknown function DUF21
[872-952] IPR0184904.6e-09Cyclic nucleotide-binding-like
[901-953] IPR0147109.6e-06RmlC-like jelly roll fold
Orthology groupMCL10515 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215355-TA
ATGGCATCTGTCTGTGTTGTATTGCGGACTTGTGGGATATTATTATTATTATTAATTATGCAAGTGTCCGCTAAAAATAATTTATCGGTTGAACCATGGATAGTGTCTTTAAACGTAGTGGCGAAATCTTGTGTTGATGTTAAGTGTAAATATTTATTGTTAGTTAATGGCAGTGAGTTTTTGGGGCACAATTCTTGGAAATTAACGTCCAAAGAAGGGTCTAGAGGCAGTTATTGTGATACTATATATCCAAATTATGAATTACACGAAGTAGAAACAACACAATGGTTTTCAAAGATAAAAATATTAATACCGAATGTGAACGAAAAAATATATATTTGTCTGAGACATAACAAACAAAAGAATAATCCGGTGAACGGATTATGGATACATCAGGGCGTTGAATTATTTCTCAATCCCAGCGGCGACGATAATATATTACAAGAAAATAGAACCGAATCGTTAAATTTAATGAAAGATCAGATGGCGAAGGATATTGAGCTTCAGGAAACATCAACCTGGTCAGATCTTTCTAGAGACATTAGTGTAAACTATATCAATGCGGTTAGACCATCAGACGAGGAGGCCAAAGATATAGAGGTCCTGAGAGACAATAAATATGTACCTCTGAATGATATAGGAACAGAATTCATAGATAGAAACGATATTAATGATGGTATAGAAAGCTTCAATGATAGGAGGAGAGAAAATGATGATAAAGGAGATAGTAATGATATAAATGATAGATTGAAAAGAGATATCATAAAGGATCTCAATCATGAGATGTGGAAAATGAATGATGGCAAGGTACCGGAGAGGCCTCCCGAGATGTTTCAGAACGACGGTGTGGCTAACGTGGTCAATCCGCAAGTTGGTGATTTCACTGTGGTTAGATCTGATGCTGTACCGATATTTGTTGAGGGTCTGAGGGTGGAGGACGCAGCGAAAGAACCGAAGATCATAGAAGATGGTATACCAAGCGTTTTAGCTGATACGAAGGTTGTGCTCAGGTTATTCGGCCAAGGTTTCACTCCGAGGACGGTAATCGCATTCACGCAAGATCCCATGGACTACGGCCAGCCGTGCAAGTTTCTTGTTAAGGGCGAATATATGGCTATGGAGGGATCTGTAACAAAATCTTCAGTACTATTCGATATTATAGCTCCATCACCGATAGTAGGTTCGAAGTTATATATATGCGCAAAAAATTTAAAACCGGGCGTCAGTGATCCTAATCAGGACGAGGAGAAATACATTCACCAGGGTACTGAGAATTTTAAGATATTGGCTACCCACAACAAATTATTGCCGCTTTGGGTGTCACTAACACTGATTCTCGTCTGTCTGATGTTCTCCGCTCTGTTCTCCGGATTGAATCTCGGCCTGATGTCTCTGGATAGGACGGAACTGAAAATCATATCCAATACGGGAACAGAACAGGAGAGGAAATACGCCAGAGCGATAATGCCTGTCCGTGATCATGGCAATTATTTACTATGCAGCATTTTATTGGGCAACGTCGCAGTCAACTCCACATTCACGATACTCCTGGATGAATTGACTTCCGGTCTGTTTGCCGTTATATTCTCGACGCTGGCTATAGTACTCCTGGGTGAGATAACACCGCAGGCTATATGTTCGAGACACGGGCTCATGGTAGGGGCTAAAAGCATCGTCATCACCAAGGCGGTGATGGCGCTCACAGCGCCACTGGCGTTTCCGGTGAGCAAACTGCTGGATTACTTCCTGGGTGAGGAAATTGGCAGCGTTTATAACAGAGAGAGGCTCAAGGAACTCGTGAAGGTTACTACGGACGTCAACGACCTGGATAAGGACGAGGTGAACATCATCTCCGGGGCGCTGGAGCTTAGGAAGAAGAAGGTCTCGGACGTGATGACGAAGTTGGAAGACGTGTTCATGCTGCCTATAACGTCTGTGCTGGACTTCGAGACGATGTCCGAGATCGTGAAGTCTGGTTTCTCCCGTATCCCGGTATACGAGGGCACCCGCACCAACATCGTGACCGTGCTCTTCATCAAGGACCTGGCGTTCGTCGACCCTGATGACAACACTCCTCTGAGAACCCTCTGCCAGTATTACCAGAACCCCTGCAACTTCGTCTTCGAGGATGTCACGCTGGATGTCATGTTCAAACAGTTCAAAGAAGGTCACAAGGGTCATATGGCGTTCGTCCACCGCATCAACAACGAGGGCGAGGGCGATCCGTTCTACGAGACCGTGGGTCTGGTGACGTTGGAGGACGTCATCGAGGAGATGATACAGGCTGAGATCGTCGATGAGACGGATGTGTTCAGCCACAAAGGTCATATGGCGTTCGTCCAACGGATCGAGGAGGGCGACGGCGACCCGGTGTACGAGACCGTCGGTCTGGTGACGCTCGAGGACGTCATCGAGGAGATGATCCAGGCTGAGATCGTTGATGAGAGCGATGTTATAAGTGACAATCGCACCAAGAAGCGCCTCCTCCGCCCCATGAACAAGCTGCACGACATCGCAGCGTTCGCCGGCCACCAGCACCAGCGAGTGCACGTCTCCCCACAACTTATCCTCGCCACCTTCCAGTTTCTCAGCACCAGTGTTGATCCCTTCCGGGCTGATATGATATCTGAGAACGTCCTGCGTCGCCTGCTGAAGCAGGACGTCATCCAGCACGTGAAGCTGAGGGGTGATGAGGATAAGAACGACCCCAAGAGATACGTCTTCCAAGAGGGGAAACCGGTGGACTACTTCGTGCTGATCCTCGAGGGGCGAGTGGAGGTGACGGTCGGCCGAGAGAACCTCATGTTCGAGGCCGGACCCTTCACGTACTTCGGAGTGCAGGCGCTCACGCAGAACGTCGGAGTCGGTGAGAGACAGATGGAGATAGAGAGAGATGGATCCATGTATCTGGCTGCGAAACGCGCGACCCTCATGGAGAAGGGGGCCCTCAACAAGGGAGGAACCAACGAGCAGATAGAACCCGAAGTAGACAAGCTTCTGCGCGAAGGTGACGGCCACAAGCTGGAAGAAATAGTCGAGAACGAAAAAGAAAACTCTATAGTTAAACAGTTCAACCCTACATCGGCAAGCCCATTCACGAATTCCACCTTCAAGTCATACGACAAAGGGGATAATCCTGAAGAGGAGAAGCTTTTAAAGAAATGA

Protein sequence:

>DPOGS215355-PA
MASVCVVLRTCGILLLLLIMQVSAKNNLSVEPWIVSLNVVAKSCVDVKCKYLLLVNGSEFLGHNSWKLTSKEGSRGSYCDTIYPNYELHEVETTQWFSKIKILIPNVNEKIYICLRHNKQKNNPVNGLWIHQGVELFLNPSGDDNILQENRTESLNLMKDQMAKDIELQETSTWSDLSRDISVNYINAVRPSDEEAKDIEVLRDNKYVPLNDIGTEFIDRNDINDGIESFNDRRRENDDKGDSNDINDRLKRDIIKDLNHEMWKMNDGKVPERPPEMFQNDGVANVVNPQVGDFTVVRSDAVPIFVEGLRVEDAAKEPKIIEDGIPSVLADTKVVLRLFGQGFTPRTVIAFTQDPMDYGQPCKFLVKGEYMAMEGSVTKSSVLFDIIAPSPIVGSKLYICAKNLKPGVSDPNQDEEKYIHQGTENFKILATHNKLLPLWVSLTLILVCLMFSALFSGLNLGLMSLDRTELKIISNTGTEQERKYARAIMPVRDHGNYLLCSILLGNVAVNSTFTILLDELTSGLFAVIFSTLAIVLLGEITPQAICSRHGLMVGAKSIVITKAVMALTAPLAFPVSKLLDYFLGEEIGSVYNRERLKELVKVTTDVNDLDKDEVNIISGALELRKKKVSDVMTKLEDVFMLPITSVLDFETMSEIVKSGFSRIPVYEGTRTNIVTVLFIKDLAFVDPDDNTPLRTLCQYYQNPCNFVFEDVTLDVMFKQFKEGHKGHMAFVHRINNEGEGDPFYETVGLVTLEDVIEEMIQAEIVDETDVFSHKGHMAFVQRIEEGDGDPVYETVGLVTLEDVIEEMIQAEIVDESDVISDNRTKKRLLRPMNKLHDIAAFAGHQHQRVHVSPQLILATFQFLSTSVDPFRADMISENVLRRLLKQDVIQHVKLRGDEDKNDPKRYVFQEGKPVDYFVLILEGRVEVTVGRENLMFEAGPFTYFGVQALTQNVGVGERQMEIERDGSMYLAAKRATLMEKGALNKGGTNEQIEPEVDKLLREGDGHKLEEIVENEKENSIVKQFNPTSASPFTNSTFKSYDKGDNPEEEKLLKK-