Monarch geneset OGS2.0

DPOGS204503
TranscriptDPOGS204503-TA1983 bp
ProteinDPOGS204503-PA598 aa
Genomic positionDPSCF300002 + 1718415-1724813
RNAseq coverage214x (Rank: top 45%)
Annotation
HeliconiusHMEL0130778e-12895.12% 
BombyxBGIBMGA007847-TA9e-15785.31% 
Drosophilamtm-PA0.065.63% 
EBI UniRef50UniRef50_Q136140.062.01%Myotubularin-related protein 2 n=187 Tax=Eumetazoa RepID=MTMR2_HUMAN
NCBI RefSeqXP_001605144.10.073.72%PREDICTED: similar to ENSANGP00000020921 [Nasonia vitripennis]
NCBI nr blastpgi|3504163080.076.03%PREDICTED: myotubularin-related protein 2-like [Bombus impatiens]
NCBI nr blastxgi|3072113860.072.15%Myotubularin-related protein 2 [Harpegnathos saltator]
Group
Gene OntologyGO:00163114e-46dephosphorylation
GO:00167914e-46phosphatase activity
KEGG pathwayhsa:88980.0 
 K01112 (E3.1.3.-)maps-> Thiamine metabolism
    Riboflavin metabolism
    Fructose and mannose metabolism
InterPro domain[208-325] IPR0105694e-46Myotubularin-related
Orthology groupMCL10943 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204503-TA
ATGGACAAACATAACAGTTCAGAGTTGTTAAATTCTGATTTTCCACTATCTAAGAATGCTAGTTCCGATTCACTGGATTCTGATTCTAAATCGAGTTCATTGAATTCCAAACACGGACAGGATTCCAGTCATGTATTGGGAGATATACAATTATTGGATGGAGAGAAGGTTATGGGTGTTGCCAGAGATGTTACATACCTTTGCCCATATAGTGGGCCATCACGAGGTGTTTTAAAAGTAACAAACTATCAGATACATTTCCGACCTACAGAAGCTACTTCTTTTCAGACTACCTTAAGTGTTCCACTTGGGGTTGTTTCCCGAATTGAAAAAGTTGGCGGAGCATCATCAAAAGGTGAAAACTCATATGGCATTGAAGTGTTTTGCAAGGATATGCGCAACCTGAGGTTTGCTCACAAACAAGAGAATCATTCTCGGAGAGGTATATTTGAGAAATTGCAACAATTAGCCTTCCCATTGTCACACAGGCTGCCGATGTTCGCATTCAGTTATTCAGAAAGTTTTCCAGAAGATGGATGGAATGTTTATGAACCTATTGCTGAGCTCAGACGTATGGGTGTCAACAATGATATGTGGCGCATAACACGCATCAATGACAAGTATGAGATATGTGATAGCTATCCGTCAGTGTGGGCAGTTCCCGCTGCTGCGAATGATGATCTTTTGAGGTCTGTGGCAGCGTTCAGATCCAGGGGTCGCATACCAGTTCTGGCATGGATACATCCCAGCTCCCAGGCCACCATAACCAGATGTAGTCAACCTTTAGTTGGGGTAAGCGGTAAGCGTAGCCGTGAAGACGAGCGTTATATACAACTGATAATGGACGCCAATGCTCAGGCTCACAAGCTGTTCATCATGGATGCAAGACCCAGCGCTAACGCTATCGCCAACAAGGCTAAAGGCGGAGGCTATGAATCTGAAGATGCATATCAAAATGCAGAGTTGGTGTTCCTTGACATCCACAACATTCATGTGATGAGGGAAAGTTTGAGGAAACTCAAAGAGCTTTGCTTCCCTCAAATAGATCAGACGAGATGGTTCAGTGGTATAGAAGCTAGCTGTTGGTTGAAGCACATAAAATGCATTTTGGCTGGAGCTGTCAGAATTGTTGATAAGGTGGAGAACCATAAGACATCAGTACTGGTTCATTGTTCGGACGGCTGGGACAGAACAGCTCAGCTCACAGCACTGGCCATGCTTATGCTGGATCCATACTATCGCACCCTACGAGGCTTCCAAGTTCTTATAGAAAAGGAGTGGCTGTCTTTCGGACATAAGTTCCAACTTCGTATAGGTCACGGCGACGAGCGTCACTCGGACGCGGACCGGTCGCCCGTGTTCGTGCAGTGGGTGGACTGCGTGTGGCAGCTACAGCAGCAGTTCCCTACAGCCTTCGAGTTCACCGAGCGTCTGCTGATCACGGTGGTCGACCACCTGTACTCGTGTCGCTTCGGCACGTTCCTATTTAATACTGAACGGGAGAGAGTCAAGGAAGAATTAAAAACCAAAACGACGTCACTCTGGTCATATATAAACAGCAGGCAGAATTTGTACCTCAATCCATTGTACTGGGGTCCATCATCGTTCACCAACTCCCCACCATCACAACACACGAGACCACAGATGGTGCTGGTACCGGTCGCCTCACTGAGGATCATCAAGCTCTGGAAGGCTTTATACTGCAGATGGAACCCCACTATGAGACAGCAGGTGAAAATATTAACACGCAGTACAGTAATCCCCCCTATTACACGGTATATTTGTTCCGCGTTATAGGAAAACGCGTTGTACGAAAATTCTCGTAAAACGCGCTCGTATGACCTCGTATAACGCGTTATATACAAATGATTCACGTTTAACTTTTCTTGTTAAAGAGGGTTTAAATCGAATATTAAGGAAAAACTATGTTATGAACGAAATATATGCGTTTTATTAAAATCAAACCAAGAACATTTTCAATAA

Protein sequence:

>DPOGS204503-PA
MDKHNSSELLNSDFPLSKNASSDSLDSDSKSSSLNSKHGQDSSHVLGDIQLLDGEKVMGVARDVTYLCPYSGPSRGVLKVTNYQIHFRPTEATSFQTTLSVPLGVVSRIEKVGGASSKGENSYGIEVFCKDMRNLRFAHKQENHSRRGIFEKLQQLAFPLSHRLPMFAFSYSESFPEDGWNVYEPIAELRRMGVNNDMWRITRINDKYEICDSYPSVWAVPAAANDDLLRSVAAFRSRGRIPVLAWIHPSSQATITRCSQPLVGVSGKRSREDERYIQLIMDANAQAHKLFIMDARPSANAIANKAKGGGYESEDAYQNAELVFLDIHNIHVMRESLRKLKELCFPQIDQTRWFSGIEASCWLKHIKCILAGAVRIVDKVENHKTSVLVHCSDGWDRTAQLTALAMLMLDPYYRTLRGFQVLIEKEWLSFGHKFQLRIGHGDERHSDADRSPVFVQWVDCVWQLQQQFPTAFEFTERLLITVVDHLYSCRFGTFLFNTERERVKEELKTKTTSLWSYINSRQNLYLNPLYWGPSSFTNSPPSQHTRPQMVLVPVASLRIIKLWKALYCRWNPTMRQQVKILTRSTVIPPITRYICSAL-