Monarch geneset OGS2.0

DPOGS210258
TranscriptDPOGS210258-TA2160 bp
ProteinDPOGS210258-PA719 aa
Genomic positionDPSCF300216 - 306533-319197
RNAseq coverage356x (Rank: top 33%)
Annotation
HeliconiusHMEL0169785e-12975.51% 
BombyxBGIBMGA000032-TA0.058.13% 
Drosophila5PtaseI-PF9e-12657.91% 
EBI UniRef50UniRef50_E2A2Z91e-13663.03%Type I inositol-1,4,5-trisphosphate 5-phosphatase n=15 Tax=Coelomata RepID=E2A2Z9_CAMFO
NCBI RefSeqXP_973481.13e-13763.23%PREDICTED: similar to IP3phosphatase [Tribolium castaneum]
NCBI nr blastpgi|3800162848e-13763.87%PREDICTED: uncharacterized protein LOC100866560 [Apis florea]
NCBI nr blastxgi|2700133285e-13263.23%hypothetical protein TcasGA2_TC011918 [Tribolium castaneum]
Group
Gene OntologyGO:00044373.8e-37inositol or phosphatidylinositol phosphatase activity
KEGG pathwaytca:6622791e-136 
 K01106 (E3.1.3.56)maps-> Phosphatidylinositol signaling system
    Inositol phosphate metabolism
InterPro domain[4-375] IPR0003003.8e-37Inositol polyphosphate-related phosphatase
[3-354] IPR0051355.7e-28Endonuclease/exonuclease/phosphatase
Orthology groupMCL14417 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210258-TA
ATGGGTTCCGATAAAGTGCCGTTGTTATTGGTGACTGCGAACGTCGGATCAGTGTTTGAGGACCCATCGGTGATGCTTCCAATATGGACGTCAGAGTTTCTGCAAGCTGTGTCCAGGATGGATCCAAAGTTTATAGCTCTTCATCTTCAAGAAGTCGGCGGAAAAACTTACGAGAAATCAATGCAATACGTTAAGGATTTCGTACAGAGGCTGTGTGACTGTCCCGAGTTGAGACTGTTTGATAAGATCAGGATGTATTTGGACGAAGACTTCAGTTCCTCAGAAAAATTTACGGCCCTGGGCAATATGTACTTTGCTCATACCACATTAACCGACCTCAAGATGTGGGACTTCGAATTGAAGGCGTACGTTGATGTTGTTGGCAAAGAGGTCTACAACGGTAACATAGAGAAAGTACCCACCAAAGAGAAGGCAAAGTTTCCCCAGCAATTTTTCCCTGAGTGTAAGTGGTCTCGCAAGGGATTCCTCAGGACTCGTTGGTCCATACGAGGAACAGCCGTCGAATTTATCAACATTCACTTATTCCACGACGCATCCAACTTACTCGCCATGGAGCCGTTTCCATCTGTGTATTGTCGCAGTCGCCGTCGCGCCCTCCGCCACACCCTCCGTCACCTCCACTCTGACGTGAACGCGGCGCCGTACTTCATTTTCGGTGACTTCAACTTCAGAACAGACACCGGCGGAGTTGTGAAGAAAGTAACGGAAGATTTGCACGCCTGCCGTCTCCAGAATTCTAACAACGTAGAGTCGTCCAAGCTGCAGTACCTGAAGGAAGATCGCGTGGTGCTTACCGTTGGGAAGAAGGAGTTCGCTCATGTAGACCATCAGAAGGTGTTCAGGGAGCCCTGGTTACAGAAATACGATCGCGAGCTGGAAGCGCTACGTCCCCATCTGTTCGAGTTCCCCGTAAAATTCCCGCCGAGCTATCCCTTCGAGGAAGATATACATTTGCCAACGCATTACATGAAAACGAGATGTCCTTCGTGGTGTGATCGTGTACTGCTGTCTCCGTCCGCCAGAGTTCTGGTGCAGCATGATAGAGACAAACACCTGCACACTTCCAGGAAGTCGGTAGCTGATTCTGATAGTGGAAGAGTGTCTTCATCTGATAGTAGCCCGGGACGATCTGGATCCCAGTCTCCGAAGATTCTAGAGAAGAAGCCGAGCTCGTCAGAGTTGGATGGACTGGTGGTGCCACATATGACCGGAGCCAGGAGGAGCATCGCAGACCCCACTGGCATACAACAGGCGATCAGCGCCAGGGTATCCGACTCCGAAGCGTCACCAGGTCGTCGTAAATTGGTGAGGAATCAATCGGAGGGCTCGCCCAAAAGTGGTGAAACTTCAGCCGAGTTGAGAAGATTAGTCGACGCACCGACGCGAAGACGGAGCGAATACGGAGTCATAGGAGACACCACGTGCATGGGTGACCATAAGCCGATATACCTCCGCGTGATGCTGCAGTGTGACCGAGGTATCGTACAGTGTTGTGACCACCTGCCTTGTGCCTTATGCGTGTGCGCACTCAATTACACCAAAAAACCCCAAAAACTGCCCCGCCTACCTACAGACCCCGATCTATATGCTAAAAAAACTATTCAGAGTTACGATCACTCGAGCAGCCACCCGGAGACGCTGTCCGACGAACAGAGACTTAGACGCACCAGAACCGGCTCGCTCAATGACAAAATATTTCGTATTCCGAACGTTAAAATAACCTGCGCCGACTCCATTTACAACGGCATCTTTGTCAACGACATCGACAGTTCGTTATTGAGTCCGAGCCGATGTCTGGGTCCGTACACCCCGGAGAGCGTGGACTCTCACACGCCCACCGCTGACGTGTCCAACGGTTCGGACGACCGTGACGTCATAGACGCTGTCGACAAACAGAAATACAGTCACGACAGGAGCGTCTCCCCGACGCAGCTGAAGTCCCGCTTAGACCGGCTCCTGAGCGATAAGGAGAAACAAAGCAACGATAGCACGCCGGAGATACAGAGGCGGAACAGCAATGAGTCGTGTAAGTCGGACGGCGGCAAGGGATTGTGCTGTCTGGCGCTGAAGTGCTACGGGTTCTGTAGGCGTAAGGCGCGCAAAGTCAACTGTAGTGGCTTGAAATGTTGCACCTCATGA

Protein sequence:

>DPOGS210258-PA
MGSDKVPLLLVTANVGSVFEDPSVMLPIWTSEFLQAVSRMDPKFIALHLQEVGGKTYEKSMQYVKDFVQRLCDCPELRLFDKIRMYLDEDFSSSEKFTALGNMYFAHTTLTDLKMWDFELKAYVDVVGKEVYNGNIEKVPTKEKAKFPQQFFPECKWSRKGFLRTRWSIRGTAVEFINIHLFHDASNLLAMEPFPSVYCRSRRRALRHTLRHLHSDVNAAPYFIFGDFNFRTDTGGVVKKVTEDLHACRLQNSNNVESSKLQYLKEDRVVLTVGKKEFAHVDHQKVFREPWLQKYDRELEALRPHLFEFPVKFPPSYPFEEDIHLPTHYMKTRCPSWCDRVLLSPSARVLVQHDRDKHLHTSRKSVADSDSGRVSSSDSSPGRSGSQSPKILEKKPSSSELDGLVVPHMTGARRSIADPTGIQQAISARVSDSEASPGRRKLVRNQSEGSPKSGETSAELRRLVDAPTRRRSEYGVIGDTTCMGDHKPIYLRVMLQCDRGIVQCCDHLPCALCVCALNYTKKPQKLPRLPTDPDLYAKKTIQSYDHSSSHPETLSDEQRLRRTRTGSLNDKIFRIPNVKITCADSIYNGIFVNDIDSSLLSPSRCLGPYTPESVDSHTPTADVSNGSDDRDVIDAVDKQKYSHDRSVSPTQLKSRLDRLLSDKEKQSNDSTPEIQRRNSNESCKSDGGKGLCCLALKCYGFCRRKARKVNCSGLKCCTS-