Monarch geneset OGS2.0

DPOGS214633
Transcript: DPOGS214633-TA (3696 bp)
Protein: DPOGS214633-PA (1231 aa)
Genomic position: DPSCF300050 + 496397-511548
RNAseq coverage: 287x (Rank: top 38%)
Annotation
Database        Hit                E-value  Identity  Description
Heliconius      HMEL007663         0.0      77.32%
Bombyx          BGIBMGA005049-TA   0.0      77.72%
Drosophila      CG5521-PB          2e-89    46.96%
EBI UniRef50    UniRef50_D6WR55    0.0      45.57%    Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WR55_TRICA
NCBI RefSeq     XP_975486.1        0.0      45.53%    PREDICTED: similar to tuberin [Tribolium castaneum]
NCBI nr blastp  gi|91087223        0.0      45.53%    PREDICTED: similar to tuberin [Tribolium castaneum]
NCBI nr blastx  gi|91087223        0.0      45.19%    PREDICTED: similar to tuberin [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL11396 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214633-TA
ATGTTTACCAAAAAATCTCATCAGGATGTCAAAAAATCGACTGTTAAAATACAGGATTCCAAAAAGGATTCCACCACTAGACTTAAACATTTAAAAATCGTTTTAGAACATTATGATGTGGATGAAGCCAAAAATTATTTCGAAAATAATTTTAGCCATGTGTATTTCATTTTATATGACAACTTTATATTAGCAGAGAATAATTTAAGACAAAGAGAACTTCCGTTTCACCTAGTTCACAAAGCGGGCAGAGAAGAATTAGAAGGTGCTTTGTCCCTTCTAGAAAAGGTATTATGTCTTCTACCGGAACTGATTGGTAAAAGATGGCAGTTACACTCCCTGACACGAATATTTTCAAAACTGCTCCACAGTGGAAATTCACAGAAACTAAGAGCGGAAGCCATACGCTATTTCTTGCTATGGTATCAAGCTCTAGGCGACAATGCCACGGCAGAAGCTCACAGGATGTTCGCGACCCTCGTGCCAGGTTTGCCAGACACCTGTATACAATCCATAGGCTCGCCGTTTAAACCCAAACCTTTCGACGTCGTCACCAACATATCCAGCGACACGGGTGACTTTAAAATGGAGGACATAATGCATCATCCAAATTTCAACACAACAGCCGACAGCGTCAAGAAGGTCGTGGTGGAACAAACGCCAACTGGTATTATGGGGAAGTTGGCGAATGTATCGGCCTCCATCTTCCATGATACAAACGCTCTAAACCCTGTCCAAAGCGTCGACATCCAACCATTATTAACAGCGAACGCCAACGAAAAAATCGTGGACAATGAAACTAAGTACTTTTTCGAAATACTACTAGATGGGATGGCAACATCCGTCACTAAAATACACTGGAAGGACCGATCCCACGCGAAGGCTATGAGGTCGTTTGCTTTCCTATTGGAGAAATTTAAAATATACTACCTGCCTATTATTTGTCCTCAGTTCAACCACAAGAACTCACTGTACAAACCGAATCTAGAGCTCCCTGTCCAACACACTATACTAGAAGATGATTTCGTACATTGTCGTGTGATACTCATAAAATGGGTGGCCAGTTACGCTCACTATATGAAGAAGCCTGGCAGTGAAGGTGGCCAACCCTCTTCTATGACCAGTACCAGTTCCCATCAGCCTCACTCAGGGACCCCAACCCCCGCCGCGTCACTACATCACGAGGAAGAGACCTCTTTCACTGTGGGACACCCCTCAGACTCTAGCAGAGCGTCCTCTCACGATTCCACTTCTATAACACCACATCCGAGTCTGGAATCAGGTGGCGCTGAAGAGCTGGTCTCCGAGGCGGCAGTTCGTGATGTACTGTGTTGTTGGTCCCGTGAACATGTGGACTTCACCGTGGAGGTGCTCCGTCAAGGCTTCCTTCTGCCGTTCAGCCACGCACCAGCTATAAGGAGAGTCATCGCTCTTTATAAGGACTGGATTCAAATGAATGTGGCAGAAATTCCTCCGTTCCTAATGGAAAGTACACTAGAGCCGTCAGGAGAGCCCCTCGCTCCTCCGAGACGTCTTCGGAATGATTCTTATCTGGGAGCCGTCGGGAGAGACAACGTTATGGTCCGAGCTGGCATGCAGAACGTTCTGCAGGTGTTCATGACGCAAGCAGCGAACGTGTTCGTGGGCGGGTCCGGCGGCCCGGGGGCGGAGCCTCACCTGCTGGACGAACAGACGGACACCTGCAAGCGGGTCCTCAACATATACAGATACATGGTCATGCATATAGCCATGGATGCCAGCTCATGGGAACAGTTACTGCTGGTGTTGCTACAAATAACGTCCTTGGTGTTGACGAAGGTTCCACCAAAGAGTAAAGGTTACACGCTAGGTGGATCTTTGGCCCCGGCGTTGTTCCAGACTTTGATAGTGACCTGGATTAAAGCCAACCTGAACGTAGCTGTGAGGTCTGAACTGTGGCAGGACTTCATGGAGCTCCTCACGAGACTGACTCACTGGGAGGACCTCATTAAGGAGTGGGC
GAAAACTATGGAGACCCTGACGCGCGTGCTGGCGCGACACGTGTACTCGCTAGAGCTGTCAGAGGCGACGGCCGAGGGTCGAGGTCGCGGCCGGCGCCCTGTGCCCGCGCCGGCTCCCTGCAGGCCGCAGCCCCTGCGACCGCCCGCACGGAACGCAGGCAAGTCCCGGATGTCTGGCGAGTCGTCCCGAAAAGAAGTCGCCCGACGACTGGCGCGAGCAAGAAGCGATCCGCATCTCAACAGACATACAGACGACACCAAAGATTCATCAAGCGTCAAGCCGAGGCGCTGGTATTCGCTGGACTCGCTGAGGCAATCATCACAACAAGAGGAGAGAGATTCCGCCAGCGGTTCAAGGTCGCCCTCACCGGCGCCCTCCAGCGGCGTAGAGAGCAGCTCCATCAAAGATTCACCTCTACAGATAGATCTGGCATCAGACGGGGGAAATGTTGAATGGTCGGAGACCGACTCAAACACCGCGGTCTGTTCGTCCGGCGTGGGCGCTCCGTCGGGCGTGGGCGTGGTGCTGGGCGGCACGGCTCGCGGCTGGTTGCCGGACGTCGCCTGCGTGCTGTGGCGACGCATGTTGGCGGCGCTGGGAGACCCCAACGCTCTCAGAGACCCTCAAGCCCATCACAACCTGTTCAAGTACCTCGTACATTTGAATGCGACCTTGATCAAGATAAGTCAGAATCAAACTCTCAATGGCAACACGGAGATCCACGTGCCCTTCAACCTGGTGGCGGGCTGGTGTCTCGAGGCGGAGGCGCTTCCCCCATCACACAGGGCTGGTAAATTGGTGGCACTGCGTTTGCTCTGCGAGTCGACGGCTGCGCAGGCGCAGGGCCCGGGAGCTCCCACCAGCTGCTCCGGCAGACCACACCTCGCTCACCTACAACTGTACCAGCGAGCACTACATCACGGTCTAACAGGCGAAGACCGGTCTGTAGTGGATGTTCTCGTGGAGCACGCGGCTCCCCGCTACCTCTCGCTAGCCCTGGATGGATACTCGCTGCTGCTCCTGGATTTCGTTCACGCATCAACAGTCGTGCTGAACTCCTCTGACATGGGACCCTCGTGTCCTCGCACCGCTGCCGTCACGTTCCTGGGGTCCCTGCTAGCGTTACCCGACGAGCTGATGAACGCTCCCATGCTGCAGCCCTACCCCCATCAGTATAACACAGTGTCATGTCCTGACTTGAAGGAACACGTGCTGAACATAGTGGTTCGCGTGTGTCGCCGCGAGGGGTCGGCGGCGGCGCGGTGTGCCGGCGCATGCGCCCTGGCGGCCCACGTGGCGCACCTCCTGGCTGAGAGGGCGGAGGCACCGCGGCTACCCTCATACGTGACCTGCTTGCTACAGATGCTCATGATGAAAAATAAGACTATAGCTAAGGTTGTCAGCGATGCTGTGCTGGTGCTGGCTGATTACACCGACCGGATAGTTGAGTTATATCCGGGGCTAGTAGAGAAGATAATAAAGTGGATATGCGCGTGTCTGGCGCAAATCTCCAGTGTCAGTGGGCGGGAGTCGGTGAAGCCGTTGGCGGGCTCCCTGCTGCTGTGTCTGGCGGAGTACGCGGTGCGCTGCGGACCAGCCCGCCTGCTGGCGCATCGCGAGGACGGGGGCTCGCTGCTGCTGCTTGTGTTCAAGGTAATGGCAATGTTACTCCGGGACTTGTATGATGAAGACTAA

Protein sequence:

>DPOGS214633-PA
MFTKKSHQDVKKSTVKIQDSKKDSTTRLKHLKIVLEHYDVDEAKNYFENNFSHVYFILYDNFILAENNLRQRELPFHLVHKAGREELEGALSLLEKVLCLLPELIGKRWQLHSLTRIFSKLLHSGNSQKLRAEAIRYFLLWYQALGDNATAEAHRMFATLVPGLPDTCIQSIGSPFKPKPFDVVTNISSDTGDFKMEDIMHHPNFNTTADSVKKVVVEQTPTGIMGKLANVSASIFHDTNALNPVQSVDIQPLLTANANEKIVDNETKYFFEILLDGMATSVTKIHWKDRSHAKAMRSFAFLLEKFKIYYLPIICPQFNHKNSLYKPNLELPVQHTILEDDFVHCRVILIKWVASYAHYMKKPGSEGGQPSSMTSTSSHQPHSGTPTPAASLHHEEETSFTVGHPSDSSRASSHDSTSITPHPSLESGGAEELVSEAAVRDVLCCWSREHVDFTVEVLRQGFLLPFSHAPAIRRVIALYKDWIQMNVAEIPPFLMESTLEPSGEPLAPPRRLRNDSYLGAVGRDNVMVRAGMQNVLQVFMTQAANVFVGGSGGPGAEPHLLDEQTDTCKRVLNIYRYMVMHIAMDASSWEQLLLVLLQITSLVLTKVPPKSKGYTLGGSLAPALFQTLIVTWIKANLNVAVRSELWQDFMELLTRLTHWEDLIKEWAKTMETLTRVLARHVYSLELSEATAEGRGRGRRPVPAPAPCRPQPLRPPARNAGKSRMSGESSRKEVARRLARARSDPHLNRHTDDTKDSSSVKPRRWYSLDSLRQSSQQEERDSASGSRSPSPAPSSGVESSSIKDSPLQIDLASDGGNVEWSETDSNTAVCSSGVGAPSGVGVVLGGTARGWLPDVACVLWRRMLAALGDPNALRDPQAHHNLFKYLVHLNATLIKISQNQTLNGNTEIHVPFNLVAGWCLEAEALPPSHRAGKLVALRLLCESTAAQAQGPGAPTSCSGRPHLAHLQLYQRALHHGLTGEDRSVVDVLVEHAAPRYLSLALDGYSLLLLDFVHASTVVLNSSDMGPSCPRTAAVTFLGSLLALPDELMNAPMLQPYPHQYNTVSCPDLKEHVLNIVVRVCRREGSAAARCAGACALAAHVAHLLAERAEAPRLPSYVTCLLQMLMMKNKTIAKVVSDAVLVLADYTDRIVELYPGLVEKIIKWICACLAQISSVSGRESVKPLAGSLLLCLAEYAVRCGPARLLAHREDGGSLLLLVFKVMAMLLRDLYDED-
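
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The helper name translate and the short excerpts are illustrative only; the excerpts are the first 30 and last 18 bases of DPOGS214633-TA as listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# this gene uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[c].replace("*", stop_symbol) for c in codons)


# First 30 and last 18 bases of the transcript listed above.
head = translate("ATGTTTACCAAAAAATCTCATCAGGATGTC")
tail = translate("TTGTATGATGAAGACTAA")

# 3696 bp is 1232 codons; one of them is the stop codon.
expected_aa = 3696 // 3 - 1

print(head, tail, expected_aa)
```

Running the same function over the full nucleotide record (with line breaks removed) should reproduce the protein record, including the final "-".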