Monarch geneset OGS2.0

DPOGS216123
TranscriptDPOGS216123-TA2094 bp
ProteinDPOGS216123-PA697 aa
Genomic positionDPSCF300182 + 295995-303250
RNAseq coverage154x (Rank: top 53%)
Annotation
HeliconiusHMEL0056220.078.79% 
BombyxBGIBMGA009288-TA0.082.74% 
DrosophilaCG3530-PA0.056.95% 
EBI UniRef50UniRef50_E2BFK60.054.80%Myotubularin-related protein 8 n=2 Tax=Harpegnathos saltator RepID=E2BFK6_HARSA
NCBI RefSeqXP_001959805.10.057.69%GF11853 [Drosophila ananassae]
NCBI nr blastpgi|1947550500.057.69%GF11853 [Drosophila ananassae]
NCBI nr blastxgi|1953842270.058.20%GJ19990 [Drosophila virilis]
Group
Gene OntologyGO:00163111.3e-39dephosphorylation
GO:00167911.3e-39phosphatase activity
GO:00468726.4e-09metal ion binding
KEGG pathwaydpo:Dpse_GA175040.0 
 K01112 (E3.1.3.-)maps-> Thiamine metabolism
    Riboflavin metabolism
    Fructose and mannose metabolism
InterPro domain[112-227] IPR0105691.3e-39Myotubularin-related
[623-688] IPR0110113.7e-12Zinc finger, FYVE/PHD-type
[621-679] IPR0003066.4e-09Zinc finger, FYVE-type
[630-680] IPR0130832.7e-08Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL10973 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216123-TA
ATGAGAAAGGAAACTTGGATACTTCTAATGCACATATCAACAGTGGAACGTTTGCCGATCACAACGACGGGCTCGCCTCTGTTAGTTCGCACTAAGACCTTCCAATCTGTGTTCTTTGTTATACCGAGAGAACGGGACTGTCACGAGATGCACCAGACCCTGCTGAGGTTAAGTCAACCGGTTAACCTGGAAGAGCTATACTGCTTCCATTACAAGTCTACTCCGGACGACCTTCCCAAGTCAGCTGGTTGGAATTTCTTCGACATACAGACAGAGTTCCAGAGAATGAATGTCCCCAATGAACAGTGGACCCTCTGTAACGCTAACAGAGATTATGAGCTTTGTGACACTTACCCAAGCGAGATTTACGTACCGGCTCGAGCCTCCAGTGCGGTGTTACTGGGCAGCGCCAGCTTCCGGTCCCGCGGGAGGTTGCCGGTGTTGGCTTACTTGCACCATAACAAAGCGGCCATCGCCAGATGCAGTCAGCCATTAAGCGGATTTTCTGCCAGATGCATGGAAGACGAACAAATGTTGGATCTCATCCGTCGCGCAAACCCCAACTGCGGGTACATGTATGTCGTAGACACACGACCGAGGATTAATGCGATGGTGAATCGTGCCGCTGGCAAGGGCTATGAGAACGAAGCCTTCTACGAGAACATCAAGTTTCAGTTCATGGGCATCGGTAACATACACGTGATGAGGAAGAGTCTTCAGAAACTGGTTGAAACGTGTGAACAAAACTCTCCCACAATGTCGTCCTTCTTGAATGGCTTGGAGTCATCAGGTTGGCTGAAACATATCAAATCTATCCTGGACACGTCCTGGTGGATAGCCAGCGCTATATCCGGAGGAGTGAGCGTCTGTGTTCACTGCAGCGACGGCTGGGACAGAACCGCTCAGGTGTGCTCGCTGGCCGCCCTGTGTATAGAACCTCACTACAGGACTATCAATGGCTACCAGGCGTTGATCGAAAAGGACTGGCTGTCATTTGGCCACAAGTTCACGGCTCGTTGTGGTCACGTGGCGTGTGACTCGCGAGAACGATCGCCCGTCTTCACACAGCTGCTGGACTGCACCTGGCAGCTGCTGCGGCAGGCCCCAGAGGCCTTCCAGTTCAACGAGAGGTTCCTACTAACATTACACGACCACGCCCACGCCTGCCAGTACGGGACGTTCATTGGTAACTGCCAGAAGGATAGGCGAGATCTCAGATTGTCGGAGCGAACGTTCTCTCTATGGGGCTATATGGCGAGTCACCTCAATGAGTACAAGAATCCACTGTACAATCCCAAAGCCTACCCCGACATATTGAAACCTGACTTGAGCGCTCAGAGTATCAGGTTCTGGCGCGGTATGTATTGTCGCCATGAGAGCGGTGTCCATCCTCGGGAGTCTCTGGCAGACCTGCTGCCGGCCGCCGTACACCACGCCACCGCCCTGACTCATCACATAGATTACCTGGCTAAGAGGATATCGACGTTCAAAAACCTTTTATCCGGACGGAAAGAGAGAAAAGATGACACCGTCGTGAACTATCAGAACGGATCCTCGGACACCGTGGACATTGAGACCAAGATGGAGGCTGCGGAGATTGATAATAAGCAATCCGCCATGATGTCTTGTACACCAATCAGAGCGCATGACGTCACTTCTAAGTCGAAGCATAGACAACACGCACGAACACTTCACTACGAGTCCGGCGGCAACCTATCAGAGCTGGAGTGCGCTAACCACGACCACCCACTGAAGGAAGGAGCTCCCAAAATAATACCGCTCATAACAGAGAAGGCAGACGCCATGATACCCGCGACCCTGAGCCTGCTAGAGGCTGAGATCAGCACAGTGGCACTCGACTGGAAGACTATCAAGAACGTCACCGAGTGTGGCTGCTCTACACCCTTGGATCACTTCAGCAGGAAGCATCACTGCTGGGGTTGCGGCCGTGTGGTTTGCACGCGGTGTGTGGCCGCTCGGGCAGCCCTGCCAGCCCTGCACGCCGCTAGAGCGGCACCACTGTGTGCAGCTTGCGCACCCTCCACACCCTCCACCCCTGTCACTCCACCTGATTTGTTGACGTCCGCAACCTAA

Protein sequence:

>DPOGS216123-PA
MRKETWILLMHISTVERLPITTTGSPLLVRTKTFQSVFFVIPRERDCHEMHQTLLRLSQPVNLEELYCFHYKSTPDDLPKSAGWNFFDIQTEFQRMNVPNEQWTLCNANRDYELCDTYPSEIYVPARASSAVLLGSASFRSRGRLPVLAYLHHNKAAIARCSQPLSGFSARCMEDEQMLDLIRRANPNCGYMYVVDTRPRINAMVNRAAGKGYENEAFYENIKFQFMGIGNIHVMRKSLQKLVETCEQNSPTMSSFLNGLESSGWLKHIKSILDTSWWIASAISGGVSVCVHCSDGWDRTAQVCSLAALCIEPHYRTINGYQALIEKDWLSFGHKFTARCGHVACDSRERSPVFTQLLDCTWQLLRQAPEAFQFNERFLLTLHDHAHACQYGTFIGNCQKDRRDLRLSERTFSLWGYMASHLNEYKNPLYNPKAYPDILKPDLSAQSIRFWRGMYCRHESGVHPRESLADLLPAAVHHATALTHHIDYLAKRISTFKNLLSGRKERKDDTVVNYQNGSSDTVDIETKMEAAEIDNKQSAMMSCTPIRAHDVTSKSKHRQHARTLHYESGGNLSELECANHDHPLKEGAPKIIPLITEKADAMIPATLSLLEAEISTVALDWKTIKNVTECGCSTPLDHFSRKHHCWGCGRVVCTRCVAARAALPALHAARAAPLCAACAPSTPSTPVTPPDLLTSAT-