Monarch geneset OGS2.0

DPOGS204322
Transcript: DPOGS204322-TA, 1887 bp
Protein: DPOGS204322-PA, 628 aa
Genomic position: DPSCF300142 - 284151-288961
RNAseq coverage: 213x (Rank: top 46%)
Annotation
Heliconius: HMEL007014, 3e-158, 63.09%
Bombyx: BGIBMGA014091-TA, 8e-72, 47.50%
Drosophila: pst-PF, 2e-34, 29.72%
EBI UniRef50: UniRef50_D6WTN8, 2e-39, 31.49%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WTN8_TRICA
NCBI RefSeq: XP_001807608.1, 4e-40, 31.49%, PREDICTED: similar to GH06117p [Tribolium castaneum]
NCBI nr blastp: gi|189239726, 8e-39, 31.49%, PREDICTED: similar to GH06117p [Tribolium castaneum]
NCBI nr blastx: gi|198464461, 3e-40, 25.97%, GA21184 [Drosophila pseudoobscura pseudoobscura]
Group
KEGG pathway 
Orthology group: MCL16296, Insect specific

Nucleotide sequence:

>DPOGS204322-TA
ATGAGTCCAGTGCAAATACAACAAGAATATTATGAAAATCTTCCTGATTGTGATTGGATTAAATATGAGTATGCTGAAAGAAAATATTTATTACAAAATGTGGCATTTGCATTATTTGGAAGTCCAACAGCCGAAGGGGATATGGCTCTGAACTCTGGAAACATACAACTGGAAGGTTACACATCACGACAAAGGGAACAAGTTATGAAAGTTTATAAATCAATCTGCAAGCAAAGTAAATATGCACAAAATGATTCAGACATTTACATGTCAGTATTATTAATTGTATGTGGAAAACCAAAGGCTCTTAAGTTTTACGAACTTGAACCATCCAACTATTGGCTGGATTTGCACCAAAGAACAGATATTGACATTTGGTGCACAGCTGTATTCAAGATAAGGAAATGTGTGCCAACTGTAGTTGGAGCAAATTCTTGTAAGGTTTATGTTGATGAGAATGGCAGGGTTTACCAATCTTGGGAGTCATATTTAAAAGATAACACATTACCTAAATGTGTCATTATCGCTCCAGAAAATGGTGAATATAATGGAGTTGTTACTGAGGGGGATCACATGTTTGCAGTGAAGCTAACAGTGGCACCATCGCCTTCATTGGGGTTAAAGGCCAGAGTTCTAAGTTCAGTGGACACGGCTAGCACCGTTGCTACATTAGGTGCTGTAGGTGTACTCGGTGTTGCTGCATTCACACCTGTGGGGCCGGTAGTGCTTGCTGGTGCTGCTGTAGCAACTGTTGCTACGGGAGTCTACGGCCTTTTCAGATCATCACTGCACCTCCATGATAGAAAGATTCATGAACAGACTTTGAGTCCCACAGACCCCGAAGCCCGTGGTAGTTGGCTAAACATAGCTGCCTCTAGCGTGGGACTTGCCGCAGGTGCTGCTAGCACTTTACTGTCCAAATCAGCAGCTGCTGGCACCAATCTGACAAAGACGGGTCAAGCTTTGGCAGTGTCGGTTGAAGTTCTACGCCATGCGAATATAATAACAGGTGGAATGGGTGTTCTGAACAGCTTAGTACACATAATTGTCAAGATATTCGATAAAATCAGCGCCGAATCCCGTAGAGTCAGTGGCACGATCCAGGGTAACGTTGAAGTGATCCGAGGCATCAAGAATATTGCAAACAAGGACCAGTACTTTGCCGATGTTCTGAAGATAAACAAAGACGTCAATCAGCACAAACTAAGGATATCTATGACAGCAGACGGCCAGGTCAAATTGAACGCTGTGCACAAATTTAATCCTTCAGAGCTATACAATTTAGGGAATGAGGGTAGGTCGCAATTATTCTCATCGATAGGGCCAGCTACCGTCACCACTCCTAATGTTTCAACACGAGTGGTACCTTCAGTTAATGCAGTAACAGGATATATTGATGGTGAGGAAGAACCAAGCTTCCTGGTTGGAATCCATCCAGGGGAAATAATTCAAATCGGCTCTTTATTGATTCGTGTTAGTTCTTCGGGTGCTGAGAATATATCATTGATGTTAGAAAACCTCAGTCAGGAGGTGTACGCCAATCTGATGACGGTTGCATTCAACCTCCTATCGAAATTGTTACCAGAGGAAATAGCAAAGCTGAGGCTGCTGAGTCCCGATGAGGACTTGATTTTCCAGATTGTTAAATTTGTTTTCAATTATCTAAGACACCAACGCCCTCTAGGTGAGGCCACCGACAACGATAACGGCATAGTCGTTATTCTGAAAGAGTTCTTCCAAAACGGCGTCGTCCGTCAAGACACCATACTAAAATTAAAAGACAATCTCATAAATTGGATGAACGCCGAAATTGACGAGAGGCGTAGATTATACCCAAATAAGATACTCATCAGATGTCAAACATGTGATGGAGTGAGATATGCTTGA

Protein sequence:

>DPOGS204322-PA
MSPVQIQQEYYENLPDCDWIKYEYAERKYLLQNVAFALFGSPTAEGDMALNSGNIQLEGYTSRQREQVMKVYKSICKQSKYAQNDSDIYMSVLLIVCGKPKALKFYELEPSNYWLDLHQRTDIDIWCTAVFKIRKCVPTVVGANSCKVYVDENGRVYQSWESYLKDNTLPKCVIIAPENGEYNGVVTEGDHMFAVKLTVAPSPSLGLKARVLSSVDTASTVATLGAVGVLGVAAFTPVGPVVLAGAAVATVATGVYGLFRSSLHLHDRKIHEQTLSPTDPEARGSWLNIAASSVGLAAGAASTLLSKSAAAGTNLTKTGQALAVSVEVLRHANIITGGMGVLNSLVHIIVKIFDKISAESRRVSGTIQGNVEVIRGIKNIANKDQYFADVLKINKDVNQHKLRISMTADGQVKLNAVHKFNPSELYNLGNEGRSQLFSSIGPATVTTPNVSTRVVPSVNAVTGYIDGEEEPSFLVGIHPGEIIQIGSLLIRVSSSGAENISLMLENLSQEVYANLMTVAFNLLSKLLPEEIAKLRLLSPDEDLIFQIVKFVFNYLRHQRPLGEATDNDNGIVVILKEFFQNGVVRQDTILKLKDNLINWMNAEIDERRRLYPNKILIRCQTCDGVRYA-
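
Consistency check (illustrative sketch, not part of the original record): the transcript length (1887 bp) and the protein length (628 aa) should agree, since 1887 / 3 = 629 codons, that is, 628 residues plus one stop codon. The Python snippet below assumes the standard genetic code and assumes the trailing "-" in the protein sequence marks the stop codon. The helper name `translate` is hypothetical. Only the first and last codons of the record are translated here; the full sequences can be passed in the same way.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is assumed here to
    match the convention used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[c].replace("*", stop_symbol) for c in codons)


# First 13 codons and last 8 codons copied from DPOGS204322-TA.
head = "ATGAGTCCAGTGCAAATACAACAAGAATATTATGAAAAT"
tail = "TGTGATGGAGTGAGATATGCTTGA"

print(translate(head))  # start of the predicted protein
print(translate(tail))  # end of the predicted protein, including the stop

# Length check: codons in the transcript minus the stop codon.
transcript_bp = 1887
print(transcript_bp // 3 - 1)
```

The translated fragments can be compared against the start and end of DPOGS204322-PA above; a mismatch would suggest a frame shift or a copy error in one of the two sequences.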