Monarch geneset OGS2.0

DPOGS216106
TranscriptDPOGS216106-TA1632 bp
ProteinDPOGS216106-PA543 aa
Genomic positionDPSCF300182 - 167581-178051
RNAseq coverage229x (Rank: top 44%)
Annotation
HeliconiusHMEL0048570.092.22% 
BombyxBGIBMGA009224-TA0.073.09% 
DrosophilaSbf-PA1e-14850.65% 
EBI UniRef50UniRef50_E2ADV60.060.00%Myotubularin-related protein 13 n=8 Tax=Formicidae RepID=E2ADV6_CAMFO
NCBI RefSeqXP_394363.30.060.37%PREDICTED: similar to SET domain binding factor CG6939-PB, isoform B isoform 1 [Apis mellifera]
NCBI nr blastpgi|3838508140.060.75%PREDICTED: myotubularin-related protein 13-like [Megachile rotundata]
NCBI nr blastxgi|3838508140.060.75%PREDICTED: myotubularin-related protein 13-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[37-220] IPR0011943.3e-65DENN
[463-543] IPR0220965.4e-22Myotubularin protein
[273-342] IPR0051123.5e-18dDENN
Orthology groupMCL10595 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216106-TA
ATGAGGTCAACGGCCTTGAGGCTGAGACTCGAGCGGAAATCAGCTCTGGACGAAGAATCTCTAGATTCAAATCGGCCCGTCGCAAACATCACACATCACAGTATAATGTACGCTCCAAAGTGTATGGTCATAGTTTCAAGACAGGACTACATCGACACATTCCGGAACTGCCTCGGAATTATTTACACGGTGTGGGTTGAAAACCTGGGTGTACCTCTGGAGACGTTGGTCGGTAACCTGTTGGGATGCGTGTCGGTGCCACCGGCTGGCGGTCCTCAGGTCCGATTCAGTATTGGTGCAGGGGACCGGCAAGCCTTGCAGCCGCCGGCCGCGCCCCCCATGCCGGTCACACACACCGCTGTGCATATGCTTCTGAGGTTGCTTGGTATCCATAATTCTATAACACTCTGGTGCGCTGTGATGAGTGAGCACAAAGTGCTACTGGTGTCTTTAGCTGCCGCTCGCTTGTCGGCGGCGTGTCGCGCACTCGCCGCTCTCATGTTTCCATTCCGATACGCCCACGTTTATATCCCGTTGCTACCCACGGGCCTGGCGGAGGTGTTGGCGACGCCTACTCCATTCCTCATCGGCGTACATGCTAGTTTGAAGGAGGAAGTTTCTGAGCTGCTCGATGTAATAGTAGCGGACCTGGATGTCGGTTCCCTTCATATACCTGCAAGTGTGAACATCCCTCGTCCGGAAGGCAAGCTCCTGTCGTCGCTGCAGGAGGCCCTTGCACTGGTGCTGCAACCCGAGCTCAAGTCAGCGGACTCGGCCTTCGCCCCCCCGCCGCCCTCAGCATCACCACCACATATGTTAGATAAGGAGATCAGAGCGGTGTTCATGCGCACCCTGGCCAAATTGTTACAGGGTTACAGACATTGTCTCACAATCATCCGCATCCATCCATCTCCTGTGCTGACATTCCACAAAGCTGGGTTCTTAGGAGCCCGCGGCTTGTCCCAGTGTCCCTTCGCCTCCCGTCTGCTGGACTCCATGTTCTTCAACGGCCTGGTCGCTGAACGCGGCCCACCCTGGCGGCCCACAGACATATGGGACGAACTAGTGCAAAATCTGCCGGAACAACTGAGGCTGGAGTCGTTGAACCCGGAGCTGGAGCTGCAGCACATACAAGACTTGGCGATGCAGCTACATTTGAACGAAAATCCGAATCCACAGGCCCAGTCCACCCAGCCGTACGCTCAGCGTGTTCTCCGACCCCCTGAAGGGGCCTCGGCCCGCATACACCAGCCGCCTCTCCCGACTCTGGACCCGCGCGCGGTGCACGCCGTCATGAGAGACCTCGCCGCAAGGAACACACCATCTGTTAAGATGTCATCACTCCGTCTTCCAGCTCCAAGGATTATACCACCGGGAGCGTCTCCAACTGGTGCCGTGGAACACACGCAGCTTATACTCACTAACTCTGCGAGGAGACTCGAGGTGTTGCGGTCGTGTATAGCGGCTATATTCGAGTGTCGGTACGCGGACGCTAGGAAGTCTCTCCCGGGGGTGGTGCGCGCGCTGCGGGCTCCGGCGGCGAGGGCAGCGCTGGTGAGGGACCTGGCCGCGAGGCTACCCACCAACAAACATCTGCTGCAGCATCATCAGTTCGAGCTGGTTGTCAGGTGA

Protein sequence:

>DPOGS216106-PA
MRSTALRLRLERKSALDEESLDSNRPVANITHHSIMYAPKCMVIVSRQDYIDTFRNCLGIIYTVWVENLGVPLETLVGNLLGCVSVPPAGGPQVRFSIGAGDRQALQPPAAPPMPVTHTAVHMLLRLLGIHNSITLWCAVMSEHKVLLVSLAAARLSAACRALAALMFPFRYAHVYIPLLPTGLAEVLATPTPFLIGVHASLKEEVSELLDVIVADLDVGSLHIPASVNIPRPEGKLLSSLQEALALVLQPELKSADSAFAPPPPSASPPHMLDKEIRAVFMRTLAKLLQGYRHCLTIIRIHPSPVLTFHKAGFLGARGLSQCPFASRLLDSMFFNGLVAERGPPWRPTDIWDELVQNLPEQLRLESLNPELELQHIQDLAMQLHLNENPNPQAQSTQPYAQRVLRPPEGASARIHQPPLPTLDPRAVHAVMRDLAARNTPSVKMSSLRLPAPRIIPPGASPTGAVEHTQLILTNSARRLEVLRSCIAAIFECRYADARKSLPGVVRALRAPAARAALVRDLAARLPTNKHLLQHHQFELVVR-