Monarch geneset OGS2.0

DPOGS210374
TranscriptDPOGS210374-TA1017 bp
ProteinDPOGS210374-PA338 aa
Genomic positionDPSCF300025 + 672718-674774
RNAseq coverage342x (Rank: top 34%)
Annotation
HeliconiusHMEL0138587e-14071.73% 
BombyxBGIBMGA011939-TA4e-13968.86% 
DrosophilaCG15743-PA2e-7142.15% 
EBI UniRef50UniRef50_E0VYW22e-8248.94%Bisphosphate nucleotidase, putative n=1 Tax=Pediculus humanus corporis RepID=E0VYW2_PEDHC
NCBI RefSeqXP_002431306.13e-8348.94%bisphosphate nucleotidase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420217506e-8248.94%bisphosphate nucleotidase, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420217504e-8549.39%bisphosphate nucleotidase, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00044371.6e-104inositol or phosphatidylinositol phosphatase activity
KEGG pathwaydre:6415709e-59 
 K01092 (E3.1.3.25, IMPA, suhB)maps-> Phosphatidylinositol signaling system
    Inositol phosphate metabolism
    Streptomycin biosynthesis
InterPro domain[16-336] IPR0007601.6e-104Inositol monophosphatase
Orthology groupMCL15981 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210374-TA
ATGAATTTCGGCGGCACTCTAAGATTTAACAAATTTGCTTGTTTTACGCTAGGTTTTATTTTATTTCTAATTATTTATTGGCGCTCTGGTGGTTCAGGATATCCCATTGAAAAAGAAGACCTCATTAATCTGAAATCTTTACTTAAAGCTGCTATATATGCTGCCGAGCGAGGTGGTAAGAAAGTCATAGATGGCAAGAATCACGACTTAAACATAAAGAGTAAGGGGAAAACGAAGGAAGGTCTAAACGACCCCGTCACTGATGCAGACTACGCATCTCATTGTGCCATGTACTATAGCCTTAAAAACACATTTTCAAATTTAAAAATTATATCAGAAGAACATTCAAGTGATGACCCGAGTTGTAAAAATCAAGAAAAGATAGATGTTGACTCGGTGATACCCGAACACAGGATTATAGAACACTTGAATGATGAACATGTGTTAACCAATCAGGTCACTATGTGGATTGATCCCCTCGACGCAACTAAAGAATATACAGAGGGACTATATGAGTATGTGACGACCATGGCCTGTGTAGCCATCAATGGAGTGCCAATTGTGGGTGTTATACACTACCCATTCCCTCCTCGAACTTATTGGGGCTGGTTCACAAAGAAAACATCCAGTAACATAGCTAATATACAACATATAGCTGAGAACAAGGAACATCCAAGAGTTGTTATCTCACGTTCACATCCAGGTAAAGTTGAGGATTTAGTTAAAAGATCATTCGGCCCCAAGACTACAGTGATCCAAGCGGCCGGGGCGGGAGACAAAGTCATGGGAGTGGTGAATGGTAACTTTGATGTTTACCTTCACGCTACTGCCATCAAGAAGTGGGATCTTTGCGCGGGAAACGCCATCATAAAAGCCGTCGACGGTAAAATGACGACCCTGAAAGGCGAAGACATTAATTATTCTTCCGACAGCGAACCGAAAGTCACAGATGGCATTTTAGTATCGAGATACGACCACGATTATTATTTGAGTAAAATTCCCAAAAATGACGCCTGA

Protein sequence:

>DPOGS210374-PA
MNFGGTLRFNKFACFTLGFILFLIIYWRSGGSGYPIEKEDLINLKSLLKAAIYAAERGGKKVIDGKNHDLNIKSKGKTKEGLNDPVTDADYASHCAMYYSLKNTFSNLKIISEEHSSDDPSCKNQEKIDVDSVIPEHRIIEHLNDEHVLTNQVTMWIDPLDATKEYTEGLYEYVTTMACVAINGVPIVGVIHYPFPPRTYWGWFTKKTSSNIANIQHIAENKEHPRVVISRSHPGKVEDLVKRSFGPKTTVIQAAGAGDKVMGVVNGNFDVYLHATAIKKWDLCAGNAIIKAVDGKMTTLKGEDINYSSDSEPKVTDGILVSRYDHDYYLSKIPKNDA-