Monarch geneset OGS2.0

DPOGS213539
TranscriptDPOGS213539-TA3252 bp
ProteinDPOGS213539-PA1083 aa
Genomic positionDPSCF300033 - 469550-472894
RNAseq coverage117x (Rank: top 58%)
Annotation
HeliconiusHMEL0054780.064.65% 
BombyxBGIBMGA011821-TA3e-13966.99% 
DrosophilaCep135-PB1e-1826.17% 
EBI UniRef50UniRef50_UPI00022469598e-5025.98%UPI0002246959 related cluster n=2 Tax=unknown RepID=UPI0002246959
NCBI RefSeqXP_975548.11e-3831.94%PREDICTED: similar to AGAP010985-PA [Tribolium castaneum]
NCBI nr blastpgi|3454845453e-4925.98%PREDICTED: centrosomal protein of 135 kDa-like [Nasonia vitripennis]
NCBI nr blastxgi|2420050754e-9225.40%hypothetical protein Phum_PHUM055770 [Pediculus humanus corporis]
Group
KEGG pathway 
Orthology groupMCL25890 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213539-TA
ATGGGTGACGTTTATTTTGATTTAAAAAGGAAACTTGAAGATCTAGGATATACTAATACATTGTCATTGGATTCGGTGCCTTTAGTTCAATGTATTGTTGCTGATTTACTACAAACTACCCGTAGCTTACAACATTATATGGATTTATCCAAAGAGGCCCTAATGCAACGAGATGCTCTTATGATTGAAGCCGAACCCTATAAATGTGATAATATTAAATTAATACAGGAAAACAATCAATTACATAAAGAAATTATGCTTGTTAAAGAGGAGCATCTAAAAATAACAAAAGAAAGCCGACGTAAAATCAAAAATCTTACTGATGAATTGTCAAAAAAGGAAGCATTGATAAGTAAATTACAACATGATATAAGAGATCTCAGTTTAAGAGGGTTATGTGCGGAAACACAGAGCAGCCGTAATAAGAGCAAAAGAAAAGATGGAGGTGACTGTTTAACATCCAAAGTTTGCATATGTCATGATACAAACACTTATGAAAAAGATGCCATGGAAAAGAATAGAACTATTCAGTCCCTGGAGGAAAAACTAGCTGAGTATAGTGACGAAATAACTCTTCTTCAAAACCAAGTTGAACAGAGAGATAATGAAATAGTTAGGCTGAGTATACTTTTAGATGGAGGCAGACCTGTTACAGCTGTGAGTAAAGACTTTTACAATGAACAACCTAACATTAAATTACAAAATCTAACAAAGCAAATGAAAGAATTGGAGAGAGCCAACGAATCACTGAAGAAAGAAGTAGCTAGTAGTCTTGAGAAACAACACGAAGCTATGCTCCGTGCTCTGTCTCTAGCAGATAAGAATAAAAAGCTTCAAGAGGAAGTGCAGCAAGTGGATAAATTAGCATTAAAACTTGAAGATGACTGTAATAAAAGATTAGCGTCCATGATGAATGAGATGAATTTCTTGCAAACAAGATTAGATGGTTTGAGTATGAAAAATTCAGAATTAGAAAAGGAGGCATCACAGAGATATTCAAAAGATAGTTCCACCCATACCCAAAAACTCCAAGAGAACTTAGCCGCTGCTTTGATGGAGAAAGAGGTGCTACATAAAGAAATTAAAGATCTAGTAGATCTCAATAAGAGCTTACAAGAAAAAATTGTGTCACTAACAGAGGTCAATAGAAACTTCAATAGCAATGTTACACCAGAAATTGTTGAAGACACTCCCCATCTAGTGAAGGAAGAATTAAAAGAATTATTACAAGAAGAAAGAAGGAAATATGAAACTTATATTGTAAGTCTTGAAGAAAAATTATCTGAAACCATAAATCTTTTCAACAAACATGCCTCTAGAGAAAAGGATTTAATTCCAGCATCGTCGAGCTTGTCTTGCGATAACAGCTTTATAAGAGATCTACATAACAAATTATGTAAAAGCGAGCAGCAGATTCTAATGTTGAAGAAGGAAAATGACGAATTACAGACAAAAATATATAACACAGAAGAGGGTAGCAAACATAATTATAAAGACATAATAAAACAATTGAATGACGAAAATACAGAGCTATCAAAAGAAAATATATCTCTCAGTAGACAAGTTAGTCAATACAAGTCTCTGAATACTAATGATAGGGGTGATTACTGTAGGAAAGATTTACAAAAACTTAATGAAAAAATTGACGATATGTCGAGAGAAATCCAGGTGTTAAAGAAAGATAAACAGGAATACCACATGAGGTACAAGGAAGCCATGGAGCTGGCTGATAAGTTAAAAAGAGATTTAGCATATAAGGTCAAAGAAATGGAACATTTAGAAGAAGAAAATTGTTCATACAAAATGAGCCATAGGACCGGACAAGCGTCTGCCGATCATTTAAAAGAAGAATGTAATTATTTAAGAGAGCAAATGAAAAAGATGCAATCTGATTACATCAAAGAGAAGACATTAGCAAACCAAATAAAAAATATACAACTCGAAACGGAAAGAAGCAGTGCGGAGGCACATAACGAATTATTGTCACTACAAAAGAAACTCAGTTTATTGAAAGACAGCAACGAAACTTTGGAAAATAAATGCAGAGATTTACAGTCTGAAATTATAAAACTGAGAAATGACAATATGAATTTAGTAGATAATATCAAATTAATAGATAAAGAAAGGGACAAACTAGTCATTGAGTTAGATCAGAAAACTGAAAATATAAGTGTTTATCTTGTCTTTGAATTAAGTGTCTCATATGAGTTAAGTAAACTCGAAAATGAATTGAGTGACGCAAAGAGAAAACTTAATATGAACAAAGTAAGCGAACACAAAGTAGTGGACTATGAATCACAAATAACTTTCCTTAATGGTGAAATATTAAGGCTGACACAGCAGCTGGATACATCGGTGATGGAAAATAAACATTTACAAAATAGTTTAGCCGATGCCAATGGACATTTGAAAATAACGAGAATTGAACTAGAAAAATCTAAAAAGGACGTCGACGGGCTCAAACAGCAATTGCAACATTACGTAGCTGAAATTAGAAGAATTGAGGAGCTTTTATCTCAAAAGGAAGCCGAAAGATCAGATATGTTAGAACACTTTGCTAGTTTATCTGTCGAAGCTAACATTTTAGAAAACACAAATCACTCATTGGAGAGCGAGTCCGCGTCTAAATCAATGCAACTTCAGTCATATGTTAGTAAAATTCAAAATTTAGAAGAAAAACTTGTGGACAAAGAACACATTATTGACAGTCAGTCAGCTAGAATAGCAGCTATGACCTGCAAGATAAGTTCATTAGAAAATGAAATAAAACTGATGACAGAAGAGAAGAATATCCTGGAGCAAAATGTTAGCTGCCTTAAACAAATGTGCAACAATCTACAAAGCAACAAGATGCCTAAAAGTGATGATAATTCAGAAATCAAGTTATATGAAAATAGAATACGAAATCTGTCTAGTGTTAAAACTCAATTGGAATCTGAGAAGGAGGATTTAAAGGAGAAGTTGCGGACAACCGAAAGATTACTATCCAACACGAGAAGGGAGTGCATAGAGTTGAAATTAGCTTTACAAGATGCTACGTCGGAAACAAAATCTCTTCAAGAACATGTCAGCAGACTTAGGACAGCGGATGACGAACAGAATGTATTAGCCACCGCCGAGTTGAATCTCAACCTGCCTTTAATGTTGGAAGAAACGATACACGAGCTCAGCCATGAAGACGAGTACAGTGATAGATGTAATTCAAACTTAAATAAAAGTTTCACAAAGTATACTCACAGCAGCACTTTATAG

Protein sequence:

>DPOGS213539-PA
MGDVYFDLKRKLEDLGYTNTLSLDSVPLVQCIVADLLQTTRSLQHYMDLSKEALMQRDALMIEAEPYKCDNIKLIQENNQLHKEIMLVKEEHLKITKESRRKIKNLTDELSKKEALISKLQHDIRDLSLRGLCAETQSSRNKSKRKDGGDCLTSKVCICHDTNTYEKDAMEKNRTIQSLEEKLAEYSDEITLLQNQVEQRDNEIVRLSILLDGGRPVTAVSKDFYNEQPNIKLQNLTKQMKELERANESLKKEVASSLEKQHEAMLRALSLADKNKKLQEEVQQVDKLALKLEDDCNKRLASMMNEMNFLQTRLDGLSMKNSELEKEASQRYSKDSSTHTQKLQENLAAALMEKEVLHKEIKDLVDLNKSLQEKIVSLTEVNRNFNSNVTPEIVEDTPHLVKEELKELLQEERRKYETYIVSLEEKLSETINLFNKHASREKDLIPASSSLSCDNSFIRDLHNKLCKSEQQILMLKKENDELQTKIYNTEEGSKHNYKDIIKQLNDENTELSKENISLSRQVSQYKSLNTNDRGDYCRKDLQKLNEKIDDMSREIQVLKKDKQEYHMRYKEAMELADKLKRDLAYKVKEMEHLEEENCSYKMSHRTGQASADHLKEECNYLREQMKKMQSDYIKEKTLANQIKNIQLETERSSAEAHNELLSLQKKLSLLKDSNETLENKCRDLQSEIIKLRNDNMNLVDNIKLIDKERDKLVIELDQKTENISVYLVFELSVSYELSKLENELSDAKRKLNMNKVSEHKVVDYESQITFLNGEILRLTQQLDTSVMENKHLQNSLADANGHLKITRIELEKSKKDVDGLKQQLQHYVAEIRRIEELLSQKEAERSDMLEHFASLSVEANILENTNHSLESESASKSMQLQSYVSKIQNLEEKLVDKEHIIDSQSARIAAMTCKISSLENEIKLMTEEKNILEQNVSCLKQMCNNLQSNKMPKSDDNSEIKLYENRIRNLSSVKTQLESEKEDLKEKLRTTERLLSNTRRECIELKLALQDATSETKSLQEHVSRLRTADDEQNVLATAELNLNLPLMLEETIHELSHEDEYSDRCNSNLNKSFTKYTHSSTL-