Monarch geneset OGS2.0

DPOGS203959
TranscriptDPOGS203959-TA1563 bp
ProteinDPOGS203959-PA520 aa
Genomic positionDPSCF300005 + 416999-421344
RNAseq coverage69x (Rank: top 66%)
Annotation
HeliconiusHMEL0135160.069.28% 
BombyxBGIBMGA002113-TA2e-6851.79% 
DrosophilaCG10508-PK5e-5136.41% 
EBI UniRef50UniRef50_UPI00017580913e-6542.14%UPI0001758091 related cluster n=1 Tax=unknown RepID=UPI0001758091
NCBI RefSeqXP_967972.25e-6642.14%PREDICTED: similar to CG10508 CG10508-PD [Tribolium castaneum]
NCBI nr blastpgi|1892358991e-6442.14%PREDICTED: similar to CG10508 CG10508-PD [Tribolium castaneum]
NCBI nr blastxgi|1892358991e-7142.37%PREDICTED: similar to CG10508 CG10508-PD [Tribolium castaneum]
Group
Gene OntologyGO:00055156.9e-06protein binding
KEGG pathway 
InterPro domain[230-288] IPR0090606.9e-06UBA-like
Orthology groupMCL16688 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203959-TA
ATGGAAGATTATAGAAGTAAACATAAGACCATACAAAGTGCAGCTCACCATATCGCTTTAGATTGTATACATCCGCAGTCTACACATGGCTCACCCTATCATGTACATCCTTACCCCTCTCCAGTCCGTCTGCAACATGCTTTTCAGGCACAACAAAGTCCACAGATGTTTCATCCAAAAGGATTGAAACCATGCATTGATCCCAAAGCATTTTTGACTCCAAGTGATAGTCCAATATTGGGACGGGCCCTTTGTCTCACTCCACGAGGCAGTCCTCTACCTGGTTGTGTATCACATCTCAATGAAAAGTTTCAAGATTCATTAACTCTTAATTCTGATACAGAAGCACTAGTTGCAAAAATCAGTGCCATGTTTCCAACAGTCAGTGAAACACATATTAAAATTTTATTAAAAAAGTATTATAATCGAGAAGCTGTAGTCATCAGTGCTTTGCAAGTAGAGAAACATCCAATTACAACACCTGGCCCGCTGACCATGTCTCCGTCTTCCACACGCTTAACTAAAGGTGCCATGGGTGTGTATTCAGCACTGCAACTCGCTAAAGGGGTATCTGGTTCATTGCACAGTGCTAATCACTTTACTCCACTCACTGGAACCCCCCAGGGTTCACCACAACTACTTCGGCCAGGCTCCTGTGCTTCTAGTTATTATGGAACAACAAAATCTGATCAAGCGCCAAGACATCAATCACCTAAAATGAAACTCAAGTATTTAAAAAGTGTTTTTCCTAAGGCTGAGGAAACTTTAATCCTTGATGTTCTTGCTAACAAAGATAATAATGTACAAAAGGCAAGTGAAGAACTTATTTCCATGGGTTTCTCTAAAAAAGAGACAGTTTTCATACAGCAAAAAAAGAAAGAAAAATCCACACCACAGCCACCTAAGAAAGTAGTCACAATAGTGAAATCAATAGAGGAGAAAAAAGAATTAAAGGAGAAACTCCAAAAAAAATACTACAATAAAGTAGCTGAGAAAGTTGTAACAATAGCATTAGAGAGTGTTGATTACAATGAGGAAAGGGCAGAACAAATCTTAGCAGCTGTTGTTCAAGAAGAGGAAACACCTAAAGTTGCCCAAAAAACCGAAGTGAAAGATATGAGATGTCCAAATAATGTTGTTGTTATTAAAGACATGGCCAATCTCGGTATCACATCACCCATTAGTTCGCCGAGCCCGCAGCGAGCTCCGAGTCCACCGCCACCACGCCGTCTCAAGATCTGGACTGAACATTTGAAACCAATTTTGAAGCCAAGCCAAAACAACGAGAGTAAATTTAAATCTCAATACACCATTACTACCAATGGACCAAATGCTGCTCTGCGACAGGGGCCGAAGGACAGTCTACTTCTGGAGGATTATATGACTTGGAATGGCCCTAACCCTGAACTGAGATGCGGACCGGGAGCGAAGCAAGACGGTCCAGATACAAAGAGCAGTGCTGCGTCTATGGCGAGAGGAGCGGCGGGACTAGCTAAGGGACCAGCGGGTATTGCTAAAGGATCCATATATCAAAAGATTAAGAATAAAACGGCCAAAGTGTAA

Protein sequence:

>DPOGS203959-PA
MEDYRSKHKTIQSAAHHIALDCIHPQSTHGSPYHVHPYPSPVRLQHAFQAQQSPQMFHPKGLKPCIDPKAFLTPSDSPILGRALCLTPRGSPLPGCVSHLNEKFQDSLTLNSDTEALVAKISAMFPTVSETHIKILLKKYYNREAVVISALQVEKHPITTPGPLTMSPSSTRLTKGAMGVYSALQLAKGVSGSLHSANHFTPLTGTPQGSPQLLRPGSCASSYYGTTKSDQAPRHQSPKMKLKYLKSVFPKAEETLILDVLANKDNNVQKASEELISMGFSKKETVFIQQKKKEKSTPQPPKKVVTIVKSIEEKKELKEKLQKKYYNKVAEKVVTIALESVDYNEERAEQILAAVVQEEETPKVAQKTEVKDMRCPNNVVVIKDMANLGITSPISSPSPQRAPSPPPPRRLKIWTEHLKPILKPSQNNESKFKSQYTITTNGPNAALRQGPKDSLLLEDYMTWNGPNPELRCGPGAKQDGPDTKSSAASMARGAAGLAKGPAGIAKGSIYQKIKNKTAKV-