Monarch geneset OGS2.0

DPOGS200565
TranscriptDPOGS200565-TA2367 bp
ProteinDPOGS200565-PA788 aa
Genomic positionDPSCF300119 + 392417-396163
RNAseq coverage271x (Rank: top 40%)
Annotation
HeliconiusHMEL0128860.081.56% 
BombyxBGIBMGA009353-TA0.079.94% 
DrosophilaCG4975-PC1e-2046.39% 
EBI UniRef50UniRef50_UPI00022C9AA03e-3148.41%UPI00022C9AA0 related cluster n=3 Tax=unknown RepID=UPI00022C9AA0
NCBI RefSeqXP_001122527.14e-3356.76%PREDICTED: similar to Ataxin-10 (Spinocerebellar ataxia type 10 protein homolog) (Brain protein E46) [Apis mellifera]
NCBI nr blastpgi|3287758681e-3156.76%PREDICTED: hypothetical protein LOC726806 [Apis mellifera]
NCBI nr blastxgi|3287758685e-3056.76%PREDICTED: hypothetical protein LOC726806 [Apis mellifera]
Group
Gene OntologyGO:00054886.2e-12binding
KEGG pathway 
InterPro domain[683-778] IPR0191564.6e-32Ataxin-10 domain
[127-750] IPR0160246.2e-12Armadillo-type fold
[685-753] IPR0119891.1e-07Armadillo-like helical
Orthology groupMCL19569 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200565-TA
ATGTCTCGAGACAAAAGTGTGTGGTTCGCTGAAGAAATTTTGTTAGATGCCAAACTTATAAATTTTCAATTAGAAACTGGAGAATGGGATGCAGTAGAAGAAATGTTGCGTGCTGAAGCGCAACTCTCTAGAGATGCGGAGACCGGACGATTGCGACCTAAAAAAGCATCGGAATACACTTTGCGACTCATAGCCGAAGTTTTGCACATCATGCGATTAGATGTGGAACAGGCGTCATATAATCGCAGTACCCTTGCCGTGGCTGAGCAATGCCTACGGCTCGTTCGCTCCTGCGCCGCCGCAGGAACAAAAATGCAAACATACATCTCCAAAGAACTGTCTATACTTGATTCAATGCTGATTCTTGCAACGGAATACTTACATCCAAGCATAGAGTTCTTGAATGAAGAATTGCAAAAAAAATATGAGTCCTGTATGACAGTGTTGGTGCAGACTTTAGGCAACTTAGTGGTCAACAATCCATTTAACCAAATAATGATATGGAATAAATTTGACACTATAATCCTAAGTAGTCTGGTCGGTCAAAATGAAAAAATTGCATCTGCTGCAGCTATGATTGTATACAACATACTTTTGGGACAACCAAATGTACTCCCTGATGATGTAATGCTTCTAAATTCTTTAGCTTACATGTATATAAATGGAAATACAGACTATCCACATTTGATCATTGAATACTTAATCGGTGTGGAGAACACTTATATAGAGAAACTGTACTCGAAATTGGATCCCGAATGCCGTATGTTAGTACTAAATCTGGCTTACAATATTCTTATGTCAAATGAATCTCAGGATATAAATGTATCTGTTAACTTTGTCAAATTCATGGCTCATGAGTTCAAAAGTAAATCGGATTGTATTTTGAAAACCGTTGATAAGTATGTGAATACTATAGAGCCACAAGAAGTTGTGTTTTTGCTTGAAATCATTGCCACAGCTTCTAGTATGGATGCTTATATGGACTGTTTGCAAAGTGATACCTCTTTGTTCATTAATTGTGCTTTTTTGCTACGAGCCATCCATAAGTTGGGCAAAGAGAGTAATAACTTTTTTTCATCTATTAACAAGCTGTATGAGGCTACCGGAAAACATTTGGTGAATGATGAAGACATGCCAGCTGTGAACATTCCTCCTGAGATTCCGCTATGCAAGAACGACTCGGCAGACAATATGCTGTCTTGCAACGAAACAACATCTGAATGTACTGAGTCAGAAAGTAGTGAGCAATATTTTGAATGCAACGAAACCCCGGGTTATATTGAAAAGCTAGATGCCTGCACATTAGAAAATCCACCGATGAAATTTGATATGGACAAAAACTCTAGAGCTTCCTCCGAAGGGAACTCAAATGTCAGAATTGATAGTGACTTGACTAAAGTTGCATTCAAAAGGAGTAGTTCCGTACCAAAACCAGACGATAGAATTCATCAGCTTCCAATAAGAAGGATAAGTCTGGAACATAAAATAGATATAGCTGATGATACTGAACCACCATTGATAATTGACACCCAATCGCCCACGGCCAGTATGAACACGATAGCTGAAGTTATACCGGTCGGGCTTCAGCCGAGGTTGGAGTTACAGGATTCGATGAATGACTCGCAGGACAGCTCCCCGAACGATCCGCCGACGGAAATCCATATGGTAGACACCAAAGACGTGAAAGCATCACCAAGCGAAATATCACCACCGTTATCAGGAGTGAAACAGGAAAGTGAATCACAGAAACAGTCACCCGAAACGCCCAAGGAAGTTAAAAAGAGCCAGAATCTATCGACGGACTTCGATCTCAGTGAACAGTTACAGAAGATTCTGGGTATAACTTCGGAGAAGACGAGAGTTGATGCCAGTGATAAAAAATCGGAACCCGCCAAGGAAATTAAAATGTCAGATGACACTAACAAGATACCCTTGGTGAAAATCGATTCACCTGAAACTCAGAAGACCAGATTAGGAGCTATAGATATGACAACTCCAAAGAAATCTATAACAAGCGTGGAAACTGTGGAGAGACATGTGGCTTTTGGATTCAAGGCATCTCTTGTGAGAACACTTGCCAACCTCTGTTGGAAGAACCAAGAAAATAAACGACAGATGAGAGAACTCGAGCTCATACCAGTTCTTTTGGATTGTTGCAACATCGATGCCAGAAATCCACTAATAATGCAGTGGGTTATTTTCGCTATTCGCAACTTATGTGAGAATTGTCCGGAAAATCAGGAAGTGATTTCGAAGTTGACACTTCAAGGTCCAGTTGACAACGAAGTCCTTCAGGAAATGGGCCTGACGTTGAATACGGACTCACAAGGAAATACAATTAAAATTGTGCCCATGACACGAAATTGA

Protein sequence:

>DPOGS200565-PA
MSRDKSVWFAEEILLDAKLINFQLETGEWDAVEEMLRAEAQLSRDAETGRLRPKKASEYTLRLIAEVLHIMRLDVEQASYNRSTLAVAEQCLRLVRSCAAAGTKMQTYISKELSILDSMLILATEYLHPSIEFLNEELQKKYESCMTVLVQTLGNLVVNNPFNQIMIWNKFDTIILSSLVGQNEKIASAAAMIVYNILLGQPNVLPDDVMLLNSLAYMYINGNTDYPHLIIEYLIGVENTYIEKLYSKLDPECRMLVLNLAYNILMSNESQDINVSVNFVKFMAHEFKSKSDCILKTVDKYVNTIEPQEVVFLLEIIATASSMDAYMDCLQSDTSLFINCAFLLRAIHKLGKESNNFFSSINKLYEATGKHLVNDEDMPAVNIPPEIPLCKNDSADNMLSCNETTSECTESESSEQYFECNETPGYIEKLDACTLENPPMKFDMDKNSRASSEGNSNVRIDSDLTKVAFKRSSSVPKPDDRIHQLPIRRISLEHKIDIADDTEPPLIIDTQSPTASMNTIAEVIPVGLQPRLELQDSMNDSQDSSPNDPPTEIHMVDTKDVKASPSEISPPLSGVKQESESQKQSPETPKEVKKSQNLSTDFDLSEQLQKILGITSEKTRVDASDKKSEPAKEIKMSDDTNKIPLVKIDSPETQKTRLGAIDMTTPKKSITSVETVERHVAFGFKASLVRTLANLCWKNQENKRQMRELELIPVLLDCCNIDARNPLIMQWVIFAIRNLCENCPENQEVISKLTLQGPVDNEVLQEMGLTLNTDSQGNTIKIVPMTRN-