Monarch geneset OGS2.0

DPOGS209804
Transcript: DPOGS209804-TA, 1575 bp
Protein: DPOGS209804-PA, 524 aa
Genomic position: DPSCF300117 - 220619-222276
RNAseq coverage: 11x (Rank: top 84%)
Annotation
Heliconius: HMEL011870, 1e-176, 56.52%
Bombyx: BGIBMGA008033-TA, 2e-152, 49.72%
Drosophila:
EBI UniRef50: UniRef50_D6W9D2, 5e-15, 25.30%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W9D2_TRICA
NCBI RefSeq: XP_970105.1, 9e-16, 25.30%, PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastp: gi|383862485, 1e-23, 24.53%, PREDICTED: MYCBP-associated protein-like [Megachile rotundata]
NCBI nr blastx: gi|383862485, 2e-29, 24.53%, PREDICTED: MYCBP-associated protein-like [Megachile rotundata]
Group
KEGG pathway:
Orthology group: MCL17825, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209804-TA
ATGAATATCTTTGAAAACAGGCGACAAGACAAAGAAAGAAAGATAACTTTAGAACATGCCCAAATTGAGAAGAAAGTTGGAATAAGAGGTTCCCTTTGGGAGCAACCTCAACGACTTAAACAAGGCTGCTATTGTGAACCTGCTTATGAATTGCAACGAACTAGAGCTGAAATGGGACGCCCACGTATAATTCAACATATTGGAGTACCGATGTATATTCAGGAAACTGAAAAGGGGATTAGTGGTGTGTCCAAAAGAAATCCTTGCACTAAATTAAATTCAGAATACGTGAAATATAGAGCTAAGAGGGAGAAGGAATTGGAAGAAAATATAAAAACTATAGATCCGTTCAGACCTGAAATAAAAGATCTCGTGGTTGTAGGGACCAAACCCAAAACACCTCCTAAAAAAATGCCTTCGGTTCCAGTCATATCAATTATTTTACAGAACGAACCTCTTGGAGATTTTTATCCTTCAATTTATGCCGTAAGAATAAATGATACGGTATTTTTTAAAAACACGCCGCCTCACGATCTTCACAGTTTGCATGAAATGCAAAAACAATCGTGGCACGTGAATTGCAATTCTTGGTCTTATTACTTTAATTGTCCGGCCAAAAGGGTTGGGAGAAGCAAGTTATTTTTACAAAATCTTGGTACAGTGACGTTGAAATATTGTTGGAAAAAATTAAAACCAGTTAGCGTAGAAGACTATCAAGAGCCGGTTTTCTTTTTTAATAGAAATGAGAATGTAATATGTCCGGGACAGACTCAATATATTTATTTTACTTTTATATCAGGTGAACCTGGGTCATACAGAGAAACTTGGGAATTGATCTTTTATAATGTTTCATTTTACGAATCCGTTGATAAGGCGTTCAATGTAAATCTTTACGCGGATTCGACAGAAAACTTCTACAAAATTAAGAAGAAGATTGAAAAACTTGAAAACTTTATCTATAACATTCTGCTTCGGAATATTGCCAGGGATTTGTTACATGAAATAATTACAAAATCAACAAGTGTTCAACCACAAGTTTATCCTTACAAAGAAATGTTTTTAGAGGCTGAAATGTTTGTTATGAAAAATCCTGTATGTTATTATCATCAAACTGAAGTTATGAAAATGAAAAATTTCTATGAAGAGATGGTTGTTCATCGTCAATGGGATTTATCTATCAGCAGTTGGCGGTTAGAAATGATGAAAAAGGATTATGAAGAAAGAATGAAATACTTTGATTTATTAAAGTCATCACATAAGGAATTACAAAAACCGTGGTACGAAAAGAATGATTTATTTGAACAAAAATATAAGGCTGTATACGAACTGCTTGGTCAATTTGCTAGTAAATTGGATTATGAATATACTCGTATACCAAGTATGTTTTTTATGAGTTTTCCTGAAGAATCACAGATACAATCTTCGACCACCATTCAAGAATCTCCTGCTGTTGTGACTCATATATTCTTTTTACGCGCTTATGAACACTTTTCTGTGACAATAGAATTATGTGCAGGCATCCTTAGTAGTTTAGATCTAAACAGATGGATACACTTTGATTTTTGTCGAACGTAA

Protein sequence:

>DPOGS209804-PA
MNIFENRRQDKERKITLEHAQIEKKVGIRGSLWEQPQRLKQGCYCEPAYELQRTRAEMGRPRIIQHIGVPMYIQETEKGISGVSKRNPCTKLNSEYVKYRAKREKELEENIKTIDPFRPEIKDLVVVGTKPKTPPKKMPSVPVISIILQNEPLGDFYPSIYAVRINDTVFFKNTPPHDLHSLHEMQKQSWHVNCNSWSYYFNCPAKRVGRSKLFLQNLGTVTLKYCWKKLKPVSVEDYQEPVFFFNRNENVICPGQTQYIYFTFISGEPGSYRETWELIFYNVSFYESVDKAFNVNLYADSTENFYKIKKKIEKLENFIYNILLRNIARDLLHEIITKSTSVQPQVYPYKEMFLEAEMFVMKNPVCYYHQTEVMKMKNFYEEMVVHRQWDLSISSWRLEMMKKDYEERMKYFDLLKSSHKELQKPWYEKNDLFEQKYKAVYELLGQFASKLDYEYTRIPSMFFMSFPEESQIQSSTTIQESPAVVTHIFFLRAYEHFSVTIELCAGILSSLDLNRWIHFDFCRT-
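
The record lists a 1575 bp transcript and a 524 aa protein whose sequence ends in `-`. The sketch below shows one way to cross-check the two. It assumes that the transcript is a complete coding sequence, that it is translated with the standard genetic code (NCBI translation table 1), and that `-` marks the stop codon; none of these is stated in the record. The helper names `translate` and `expected_protein_length` are hypothetical and chosen only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is an assumption that
    mirrors the trailing "-" in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residues encoded by a complete CDS, excluding the stop codon."""
    return transcript_bp // 3 - 1


# First six and last four codons of DPOGS209804-TA, copied from the record.
print(translate("ATGAATATCTTTGAAAAC"))  # MNIFEN
print(translate("TGTCGAACGTAA"))        # CRT-
print(expected_protein_length(1575))    # 524
```

Under these assumptions the translated ends match the first and last residues of DPOGS209804-PA, and the lengths agree: 1575 bp is 525 codons, that is, 524 residues plus one stop codon.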