Monarch geneset OGS2.0

DPOGS212740
Transcript: DPOGS212740-TA (3300 bp)
Protein: DPOGS212740-PA (1099 aa)
Genomic position: DPSCF300012 + 371473-384506
RNAseq coverage: 2137x (Rank: top 6%)
Annotation
Heliconius: HMEL003561, 8e-87, 85.08%
Bombyx: BGIBMGA013250-TA, 0.0, 72.76%
Drosophila: CG3632-PE, 2e-156, 42.88%
EBI UniRef50: UniRef50_Q17NT1, 8e-172, 48.45%, Myotubularin n=2 Tax=Aedes aegypti RepID=Q17NT1_AEDAE
NCBI RefSeq: XP_001848681.1, 1e-172, 48.01%, myotubularin [Culex quinquefasciatus]
NCBI nr blastp: gi|170041892, 3e-171, 48.01%, myotubularin [Culex quinquefasciatus]
NCBI nr blastx: gi|383851403, 0.0, 36.27%, PREDICTED: myotubularin-related protein 3-like [Megachile rotundata]
Group:
Gene Ontology:
  GO:0016311, 1.7e-42, dephosphorylation
  GO:0016791, 1.7e-42, phosphatase activity
  GO:0046872, 5.8e-15, metal ion binding
KEGG pathway:
InterPro domain:
  [197-321] IPR010569, 1.7e-42, Myotubularin-related
  [1039-1099] IPR011011, 5.2e-15, Zinc finger, FYVE/PHD-type
  [1040-1096] IPR000306, 5.8e-15, Zinc finger, FYVE-type
  [1036-1096] IPR013083, 4e-13, Zinc finger, RING/FYVE/PHD-type
Orthology group: MCL11428 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212740-TA
ATGAGTGAGCGTGATCGTTACCGCGCCCCTCCCCAGCCGGAGCCCCCGCCGCACCGCGTCCGGGCCTCGGACCTCTATCCGAGGACCAGGCCTCACCTGGACGAGCCCGGCCTGGAGGCTGGCTTCGCTACCATATGTGGCGAGTTCGTACAGTTCATCGGGCGTACGTCTGATGGCGGGACTATCGCCATGTCCAACTACCGCCTTCACCTCCAGCCTCGGCGACGCACCGGTAGCCTCGGGTCCTCCGTGCCGCTTCGCCTGATCGAGGCGCTGGAGATCAGAGACCTGCTGTGTCTCATAATACTCTGCAAGCACGGACGACAACTCAAATGTTCGTTCAACACTGGCGATCAGTGTGTGGAGTGGTGGAGGAGGCTGAACACGGCGCTGGTGCCAGTCAGTACATTACAGGAGACGTTCGCTGCGGCCTACGCTGCCTGGGCCAAGGAACAGCCTAACAACTCCGTCCACAGGGCGCTCATGAGGGCCTCCCAGCCGCCTCAGAGACACTGGTTCAAACCTGAGTTGGACAGGCTCGGATTCACAATGAAGGGTGCGTGGCGCGTGTCGGCCGCCAACGTGGAGTACAAGCTGTGCGGATCCTACCCCCCGCTGCTGGTGGTACCGGCCTCTGTAGGGGACGATGACCTTGAATCCGTCGCGCGTTTCCGCGCCATGCGTCGTATCCCGGCTGTGGTGTGGCGTCACCGCGTGTCCGGGGGTATCATCGCGAGGTCCAGTCAGCCCGAGGTCGGCTGGCTGGGCTGGCGCTCCGCCGAGGACGAACGACTGCTGGCCGCCTTCGTGCACGCCTGCAACCAGGACAGGCCCATACCCAATAAGCAACTGAAGCTGCTGATAGTGGACGCACGCTCGTACGCGTCCGCGGTGACGAACCGTGCTCGTGGTGGGGGGTGCGAGTGTGCTCAGTACTACCCCGCGGCTGATATACAGTTCATGTCGCTGCCCAACATCCATCACGTGAGGAGGAGCTTCCAGCAGCTGAGAGCTCTGGCTGCTGAACCACCAGATCAACCCAATTGGCACAGTTCCCTAGAGCGTACACTATGGCCTCAGTACGTGTCGGGGGTGCTGCGAGCGGCGGCGGCCGTGTCCCGGGCGGCCGCGGCCGGTCGACCTGTGCTCGTGCACTGCTCCGACGGCTGGGACCGTACACCGCAGCTGGTCGCCGCCGCCCAGATCATACTCGACCCTCACTACAGGACCATAGAGGGTTTCCGCACGTTGATCGAGCGCGAGTGGTTGGACTTCGGTCACAAGTTCGGTGATCGCTGCGGCCACTCGTTCGGTGGCGAGGATCCCAACGAGCGCTCGCCCGTGTTCCTCCAGTGGCTGCATTGCATCTACCAGCTGATGCTGCAGTATCCCTGCAGCTTCGAGTTCAACGAGGCCTACCTGATAAAGCTGGCTGTTCACGTGCATTCGTGTATGTTCGGTACGTTCCTGTGTAACTCGAGCCGTGAACGTGTCGAGTATCACACGGCTCACACCGCTCAAGTGTGGCGCCTGTTGTCCTCCCCCGCCTACAGGAACCATCTATACACACCGCACGAGGATCAGGTGATATGGCCGGAGTGTAGCGTGCGGTCCATGCAGGTGTGGTGGGGAGTGCTGCTGGGAGAGAGAGAGCGGGAGCCGCCGCGACACAACACACACGTCGACAACAACACAACAAACAACACAAACAACATACACAACGGTCTAATGACGAAGACGAGATCCTGCGATAATCTCCACGGAGGCGAGAAGAAGACGACCCAACGCCGGTGTAGCGACCCCAGTCTGGCGCCCGACATCATGAAATTATCGTTACTGAATGGAAGCGAAATACCAGACGCACAAACAGACACCGACACGGATCAGGTGGACGGTCTTCATCCTGATCACTTTGACAATCATCTCCGAGACATAACAAGCAACTCCTCGTCGCTGGAGAGGGAGCTGGTGTCGATGCCGCCCGTCACCCTGGACGCCAA
GGAACAGGACAACCTCACCAACAGCACAGACAACGACGAGCCGAACGAAGCTCTCACCATCACCACAATCACCACCATCGACACCATCAACCATGACGACCTAAAAATTAGCTCCTCCAACCACGATATTATGAACCGAGACGCTATGGAGCTCAGCACCTTGAACAACGACCCTGTCAACCACACCCCGGCGAGCCCCTGCGCCCGACTGGAGGTCGACTCGCCCGAAGAGCCCGCTGTATTTGTTTGTGAAACCTACACCGACGTCATCGGCATGGCGGAGGCCGCTCGCGAGTCCGCGCGCACCCGGAACATAAGTATAACGTGGCGGTCGATATCGGAGTCGAGCAACCAGTCCTCGACCGGCTTCGACATCGGAGACAACTCGCCGCAGACGCGCCTCGAGCCCGCGACTCGACTCGACCCGCAGGAGATCAACAACCACAACTCCACTGACGGGGATGTCGTCAACCATAACATTGTGACCAACATGACGAACCACAACAGGGGAGACGGACTGGAGGGGGTCAACGGGGTCGCTGACGTAACTAACAATTTCCTGGAGGTAGATCTTAATGCATGCGCGTCACCGTGTAGCTCGTCCGACTCGTGCTGCGAGGCGGTGCGCGGCTCCAGCCAGCTCACTCTGTGCCCGGCCACGCCGCCTCACACACACGGCGCGTGTTGCGCTTGCTCCAGCGAGGCGGAAGAGGCGGACGAGACGTTGGAGGTGACGACGGCCGCTCGGACGGGCTGGTCGTGTTCGTGCGGCGGAGCGAGAGCTGTGGAAGGATGCAGGGACTCGCTGGAGGGGGTGGACGGCTTGGACGGCCTACCTCTAGCCAGCGACCCCGTCCAGGCCAGGCTACATCAAATCATACTACAACATAAGAAAATGGTGGAAGATTTAAACGGACAGTTGCGGGAGGCGCGCGAGGCTTTGAGACGTGCGTCGGGGGTTCGGACCCCCGCGGCCGTGACACGACCCCCACACGCCCAGAGCCCTACCGGCGTGTCATCCTCGTGTGTGACGTGTGTGGGTCCCGGAGCTCCGGGCGGGTCCAGCTCGGGCAGCTCCAGCGCCTCGGAGTTGGAAGTGTGTGAGGAGGCGCGGGTCCGTTGGTTGCCGGACGCCGCCGCTCCTCGTTGCCAACACTGCCGGAACTCCTTCTGGCTGGCGAGGCGTCGACACCACTGCCGGAGGTGCGGTGGTATATTCTGTGGCTCCTGCTCCGAGATGTCTCCGTGGGGTGACATGGGTGCCGTCCGAGTGTGCCGCCGCTGTCGGGCGCTCCGGTGA

Protein sequence:

>DPOGS212740-PA
MSERDRYRAPPQPEPPPHRVRASDLYPRTRPHLDEPGLEAGFATICGEFVQFIGRTSDGGTIAMSNYRLHLQPRRRTGSLGSSVPLRLIEALEIRDLLCLIILCKHGRQLKCSFNTGDQCVEWWRRLNTALVPVSTLQETFAAAYAAWAKEQPNNSVHRALMRASQPPQRHWFKPELDRLGFTMKGAWRVSAANVEYKLCGSYPPLLVVPASVGDDDLESVARFRAMRRIPAVVWRHRVSGGIIARSSQPEVGWLGWRSAEDERLLAAFVHACNQDRPIPNKQLKLLIVDARSYASAVTNRARGGGCECAQYYPAADIQFMSLPNIHHVRRSFQQLRALAAEPPDQPNWHSSLERTLWPQYVSGVLRAAAAVSRAAAAGRPVLVHCSDGWDRTPQLVAAAQIILDPHYRTIEGFRTLIEREWLDFGHKFGDRCGHSFGGEDPNERSPVFLQWLHCIYQLMLQYPCSFEFNEAYLIKLAVHVHSCMFGTFLCNSSRERVEYHTAHTAQVWRLLSSPAYRNHLYTPHEDQVIWPECSVRSMQVWWGVLLGEREREPPRHNTHVDNNTTNNTNNIHNGLMTKTRSCDNLHGGEKKTTQRRCSDPSLAPDIMKLSLLNGSEIPDAQTDTDTDQVDGLHPDHFDNHLRDITSNSSSLERELVSMPPVTLDAKEQDNLTNSTDNDEPNEALTITTITTIDTINHDDLKISSSNHDIMNRDAMELSTLNNDPVNHTPASPCARLEVDSPEEPAVFVCETYTDVIGMAEAARESARTRNISITWRSISESSNQSSTGFDIGDNSPQTRLEPATRLDPQEINNHNSTDGDVVNHNIVTNMTNHNRGDGLEGVNGVADVTNNFLEVDLNACASPCSSSDSCCEAVRGSSQLTLCPATPPHTHGACCACSSEAEEADETLEVTTAARTGWSCSCGGARAVEGCRDSLEGVDGLDGLPLASDPVQARLHQIILQHKKMVEDLNGQLREAREALRRASGVRTPAAVTRPPHAQSPTGVSSSCVTCVGPGAPGGSSSGSSSASELEVCEEARVRWLPDAAAPRCQHCRNSFWLARRRHHCRRCGGIFCGSCSEMSPWGDMGAVRVCRRCRALR-