Monarch geneset OGS2.0

DPOGS213959
TranscriptDPOGS213959-TA1659 bp
ProteinDPOGS213959-PA552 aa
Genomic positionDPSCF300226 + 145418-149605
RNAseq coverage283x (Rank: top 39%)
Annotation
HeliconiusHMEL0152925e-10688.50% 
BombyxBGIBMGA003376-TA0.081.79% 
DrosophilaCG5026-PB4e-14344.30% 
EBI UniRef50UniRef50_Q7QKK48e-15650.81%AGAP003266-PA n=6 Tax=Endopterygota RepID=Q7QKK4_ANOGA
NCBI RefSeqXP_970048.13e-17052.17%PREDICTED: similar to AGAP003266-PA [Tribolium castaneum]
NCBI nr blastpgi|910819076e-16952.17%PREDICTED: similar to AGAP003266-PA [Tribolium castaneum]
NCBI nr blastxgi|910819072e-16852.36%PREDICTED: similar to AGAP003266-PA [Tribolium castaneum]
Group
Gene OntologyGO:00163114.6e-34dephosphorylation
GO:00167914.6e-34phosphatase activity
KEGG pathwaycfa:4922112e-67 
 K01112 (E3.1.3.-)maps-> Thiamine metabolism
    Riboflavin metabolism
    Fructose and mannose metabolism
InterPro domain[157-268] IPR0105694.6e-34Myotubularin-related
Orthology groupMCL13401 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213959-TA
ATGGAATTTATAGAGCTTATCCTTATAAATAAGTTAGACGGAGTGATATTACGATATCCTCATCATGATAACGTAGATGGAACAGTGTGTATAACTGGTCACCACCTCATATTAAGTTCGCGAAAAGAAGGTGTTCGCGAACTATGGCTATTGCACAGGAACATAGACAGTATCGAAAAAAAGGAGAACAAATTGAGTGGTGGAGTTGTACAAGGTGGTTTTCTGTATTTAAAATGCAAAGATTTGAGAATATTTCAACTGGATATCAATAACACGACAGAATTATCCTTGCTAGCTCAGACATTGGAAAACTTGTCAAGTATACAGGACCCTACCCTTTTCTATCCCTTCTTTTACAGGCATATGCATCCCATAATGGAAAATGGTTATACCCTTTACAGTATTGAAGGAGAATTCACAAAAGTGATTGCCACAGAGGAATGGCGTATATCGAGAGTGAATCAAAATTACACAGTCTGTCAAAGTTATCCGAAAGCTGTGGTTGTACCAAAGTGTATAGATGATGAGACCCTGGCCGCGGCAGCAACATTCAGACAGGGCGGCCGCTTCCCTGTGCTCTCGTATAGACATAACAATGGTGCTGTTCTCCTTCGAGCCGGCCAGCCTCTTTATGGTCCAAAACATCGCAGGTGTCGTCCGGACGAGCAAATATTAAATTCAATAGTTAATGTTGGCATGAAGGGTGTCATATATGATCTCAGAAGCTCCAATATGATCTCGCAACAGCAAAATAAAGGCGGAGGCAGTGAGAGTCCATCAAACTATTCTCAATGGAAAATATACAACAGACCGATGGATGAAGTCGACAATCAACAACCGCTGCTGGAGAGCTTCTCAAAACTGATTGAAACCTGTATAGATCGTGAGATATCGTGTGACAAATGGCTGTCTCGTCTGGAGTCCTGCGGGTGGCCGGAGTCGGTCAGGAATTCTCTGCACACAGCTTGTATCATCGCACAACATATTCACCAGAGATCCGAGCCAGTTTTGGTGCATGGTACCAGAGGTGAAGACGCTACCTTACTCATCTGTTCCTTAGTACAAATCATCCTCAATCCAGACAGTCGGACTATCAGAGGACTGCAGGCCTTAATCGACCGTGAATGGCTGCAAGGCGGCCATCCCTTCCAGAGCCGTGTCACTTCCGGTCCGTACTCGTCCCGCCCCCGCGCGGCGCCCACTTTCACCCTCTTCCTCGACTGCGTGCGGCAGTTCCTGGAGCAGTTCCCATGCAGCTTCGAGTACCGACAGTGCTTCCTCATAACGCTGTTCGAGCACGCCTACGCCAGTCAGTTTGGATCGTTCTTATGCGATAGTGACCGCGAGCGGTCCGCACTGGGCGTTTACGACCAAACGACCAGCTTGTGGTCGTGGATGAACCAGCCTGAAGAGCTGGCCGTGTACATCAACCCCTTGTATGACCCCACGCATGACGTCATCTGGCCGTCGGTGGCACCCATGAGCTATGTTATATGGGAAGAGCTCTATTTACGCTGGTTGGTGAAACAACGTACAGAGGAAAAGGAGGAACTGTACAGAGTAATTCGAACTAGAGAGCAAGCGTTAAGGGCTCAGGCGCAACAATTACGGCGAGAACTGAATGACCTCGCGCAGCAGTATTACGCTCAAGATAAGTGA

Protein sequence:

>DPOGS213959-PA
MEFIELILINKLDGVILRYPHHDNVDGTVCITGHHLILSSRKEGVRELWLLHRNIDSIEKKENKLSGGVVQGGFLYLKCKDLRIFQLDINNTTELSLLAQTLENLSSIQDPTLFYPFFYRHMHPIMENGYTLYSIEGEFTKVIATEEWRISRVNQNYTVCQSYPKAVVVPKCIDDETLAAAATFRQGGRFPVLSYRHNNGAVLLRAGQPLYGPKHRRCRPDEQILNSIVNVGMKGVIYDLRSSNMISQQQNKGGGSESPSNYSQWKIYNRPMDEVDNQQPLLESFSKLIETCIDREISCDKWLSRLESCGWPESVRNSLHTACIIAQHIHQRSEPVLVHGTRGEDATLLICSLVQIILNPDSRTIRGLQALIDREWLQGGHPFQSRVTSGPYSSRPRAAPTFTLFLDCVRQFLEQFPCSFEYRQCFLITLFEHAYASQFGSFLCDSDRERSALGVYDQTTSLWSWMNQPEELAVYINPLYDPTHDVIWPSVAPMSYVIWEELYLRWLVKQRTEEKEELYRVIRTREQALRAQAQQLRRELNDLAQQYYAQDK-