Monarch geneset OGS2.0

DPOGS209759
Transcript: DPOGS209759-TA (1671 bp)
Protein: DPOGS209759-PA (556 aa)
Genomic position: DPSCF300314 - 33656-47689
RNAseq coverage: 320x (Rank: top 36%)
Annotation
Heliconius      HMEL011904         2e-128   47.33%
Bombyx          BGIBMGA006041-TA   1e-40    35.06%
Drosophila      CG9784-PA          5e-75    36.43%
EBI UniRef50    UniRef50_D7EHS9    4e-107   42.29%   Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EHS9_TRICA
NCBI RefSeq     XP_971116.2        2e-111   42.69%   PREDICTED: similar to skeletal muscle/kidney enriched inositol 5-phosphatase [Tribolium castaneum]
NCBI nr blastp  gi|189241858       4e-110   42.69%   PREDICTED: similar to skeletal muscle/kidney enriched inositol 5-phosphatase [Tribolium castaneum]
NCBI nr blastx  gi|189241858       3e-107   42.89%   PREDICTED: similar to skeletal muscle/kidney enriched inositol 5-phosphatase [Tribolium castaneum]
Group
Gene Ontology    GO:0004437   1.8e-81   inositol or phosphatidylinositol phosphatase activity
KEGG pathway     tca:659746   5e-111
                 K01106 (E3.1.3.56) maps-> Phosphatidylinositol signaling system
                                           Inositol phosphate metabolism
InterPro domain  [2-318] IPR000300   1.8e-81   Inositol polyphosphate-related phosphatase
                 [2-312] IPR005135   4.3e-55   Endonuclease/exonuclease/phosphatase
Orthology group  MCL11642   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209759-TA
ATGGATACCCTCAGGTTCTATTTTGTAACCTGGAATGTGGCTACAAAGTCTCCAGGTCAAGATCTGAACGCATTATTGGATTTTCCATCAGTTTTCAACAAAAATAAGCCACTGCCAGATTTCTATGTCATTGGCTTGCAAGAAGTGAAATCTCAACCTCAAAATATGTTAATGGACAGTCTGTTCACTGACCAATGGACTTCAATGTTCAACAAGATCCTATGTAGACAGGGCTATATAGTTGCTAAAAGTGTCAGATTACAGGGAATAATACTTCTAGTGTATACACATATGAAACATGTGGTGCATCTCAGGGATATAGAAGCTCAGTATACAAAGACAGGCCTAGGAGGTCTATGGGGCAACAAAGGTGCTGTGAGTGTGAGGTTCAATATATACGGTTGTTCTGTGTGTCTGGTGAACTCTCACCTTACGGCTCACGATCACCTGCTGGCGGATCGGATCAACGATTACAATACCATCATATCGGACCACCAGTACCATATTACGGAGACACACAATATACTATACCACGACTACGTGTTCTGGATAGGTGATCTAAATTTCCGGACCGATCACCCAGCTGAGAACAGTCCGAGCGCCGAGGAGATAGTTGCTACCCTGGAAAAAGTCGAAAAGGACAAATTTAACACACTCCTGCGACATGACCAGCTTCTGGCGGTGATGGAGAGCGGGGAAGCTTTCTCGGAATTCTCGGAAGCAGATATCAAATTCGCGCCGACTTACAAATTTATGATCGGCACGGACGACTACGACATCAAACGTAAACCCTCGTGGACGGACAGAATCTTGTTCAAAGTTATTACTAACACTTACGAGAATGTAACTCTTCGTGCTGACGTCATCTCGTACAACTCCCTACCACAGTACACCATCAGCGATCATAAACCAGTGGTTGCACAGTTCAATATAAAAACACAACGAAAAATATCAGCGAGCGATGTGAAACCTAAGGTAGGGAAGGCACAGATCCGAATGCGTTCAGCCGTTGTATCAGATTCAGCGGAAATGCCCAATGAGGTTGCAGTAGCCACTTTCGCCCATATACCTGATGAAGATCAAGATGAACACCAGCCTGAGGCGTTTTCCAACTACGCAGTCCGTGTCGTTGAATTCGAGCCGATCTCAAGGATTTGGTACATCGGTGACGCTGACTTCAGGACACAGTGCACCCTCACACCAGACGTTGAAGTCAATCCTAACGACTGGATAGGGATATATGATGCGAACTTCCATAGTCTTGACGACTATATAGCGTACGAGTACCTGTCCAAAGTGTGCGTGGCGGGTGCGGCGGGTGCTGAGGGTGCGAGGGGTGCGGAGGGAGCGGAGAGTGCAGAGGGTGCGAGGAGTGCGGGTAGACCTCGCACCTTCACCCTCAGCTTCCCCGTCGGCAGCGGCGTCCGTACACCCGGCTTCTACCGCTTTATATACTTCAGCCAGCCAAATAATGACGTCAGGAGTGTACTCGGTATCTCGGAGCCGTTCGAAGTGAGCAGTAAGGAGGATAGGTTAGTGAATATAGATCCGTGCATTGAGGCGTCTACATCGAAACCGTCGACAGCAACTAGTGACGTTTTCACGGATTTCACCGGTCTGGACGTGGCGAAGCTCTCACGTCACTTCTCAAACGACCTAAGCATCGACTGA

Protein sequence:

>DPOGS209759-PA
MDTLRFYFVTWNVATKSPGQDLNALLDFPSVFNKNKPLPDFYVIGLQEVKSQPQNMLMDSLFTDQWTSMFNKILCRQGYIVAKSVRLQGIILLVYTHMKHVVHLRDIEAQYTKTGLGGLWGNKGAVSVRFNIYGCSVCLVNSHLTAHDHLLADRINDYNTIISDHQYHITETHNILYHDYVFWIGDLNFRTDHPAENSPSAEEIVATLEKVEKDKFNTLLRHDQLLAVMESGEAFSEFSEADIKFAPTYKFMIGTDDYDIKRKPSWTDRILFKVITNTYENVTLRADVISYNSLPQYTISDHKPVVAQFNIKTQRKISASDVKPKVGKAQIRMRSAVVSDSAEMPNEVAVATFAHIPDEDQDEHQPEAFSNYAVRVVEFEPISRIWYIGDADFRTQCTLTPDVEVNPNDWIGIYDANFHSLDDYIAYEYLSKVCVAGAAGAEGARGAEGAESAEGARSAGRPRTFTLSFPVGSGVRTPGFYRFIYFSQPNNDVRSVLGISEPFEVSSKEDRLVNIDPCIEASTSKPSTATSDVFTDFTGLDVAKLSRHFSNDLSID-
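
Consistency check (illustrative sketch, not part of the original record): the listed transcript length (1671 bp) equals three nucleotides per residue of the 556 aa protein plus one stop codon. The Python below shows one way such a check could be run on FASTA text like the two records above. The function names `parse_fasta` and `cds_length_from_protein` are hypothetical helpers introduced here, and the sketch assumes the transcript is a pure coding sequence with no UTRs and that a trailing "-" or "*" on the protein marks the stop codon.

```python
def parse_fasta(text):
    """Parse FASTA text into a dict mapping record ID to sequence string."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def cds_length_from_protein(protein, stop_markers="-*"):
    """Expected coding-sequence length in nucleotides for a protein.

    Assumption: a trailing '-' or '*' marks the stop codon and is not a
    residue; the stop codon itself contributes 3 nucleotides.
    """
    residues = protein.rstrip(stop_markers)
    return 3 * len(residues) + 3


# Toy records (not the real sequences): ATG GAT TGA encodes M, D, stop.
toy = ">toy-TA\nATGGATTGA\n>toy-PA\nMD-\n"
recs = parse_fasta(toy)
print(len(recs["toy-TA"]) == cds_length_from_protein(recs["toy-PA"]))
```

To apply the same check to this record, the nucleotide and protein entries would be passed through `parse_fasta` and the length of DPOGS209759-TA compared with `cds_length_from_protein` of DPOGS209759-PA.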