Monarch geneset OGS2.0

DPOGS210020
TranscriptDPOGS210020-TA3399 bp
ProteinDPOGS210020-PA1132 aa
Genomic positionDPSCF300372 - 73225-84332
RNAseq coverage280x (Rank: top 39%)
Annotation
HeliconiusHMEL0020891e-11172.54% 
BombyxBGIBMGA010836-TA0.078.45% 
DrosophilaCG7956-PC2e-17949.51% 
EBI UniRef50UniRef50_D6WZ410.046.50%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WZ41_TRICA
NCBI RefSeqXP_974291.20.048.20%PREDICTED: similar to suppressor of actin (sac) [Tribolium castaneum]
NCBI nr blastpgi|1892411460.048.20%PREDICTED: similar to suppressor of actin (sac) [Tribolium castaneum]
NCBI nr blastxgi|1892411460.048.06%PREDICTED: similar to suppressor of actin (sac) [Tribolium castaneum]
Group
Gene OntologyGO:00425784.2e-88phosphoric ester hydrolase activity
KEGG pathwaytet:TTHERM_000795707e-57 
 K01099 (E3.1.3.36)maps-> Phosphatidylinositol signaling system
    Inositol phosphate metabolism
InterPro domain[49-450] IPR0020134.2e-88Synaptojanin, N-terminal
[630-744] IPR0221581.3e-19Inositol phosphatase
Orthology groupMCL13989 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210020-TA
ATGGAATTGTTTCGATCTGAATCCTATTTCATATTTGTGAGGAACGAGTCAAGTCTTTGGTGGAATCGGCTAACGGGAGCCTTTTCAGTCCGATCAGCATGGGATCTTTCCGATATCGAGGACATAGAATGCCTTGGTATAACAGAAGGTATTATAGGAAAAGTAGAACATTCAAATATATACGAGCCTCGTTTAATGATTATAAAAGAAAGCGTACCTATGGGTCAGATATATTTTCACCATACCATATACAAGATCAAATCAATATGCTTTTTGAATATGGGTGTGAATAATCAGGAACTTGAGCTTTCTCCATGTACGAAACATGGATCGTCAACCCTCTTGGAGAATGCAAGATCAAGCAGTAAGAAAATGGGGGCTCGTTTGTTTGAAAATTCCGCCTTTTTAAATAAAACTGTAGGTGCCGTTAAAAATGTTAGCAATACAATTAAAACCACAACACAGCAAGCCGCTACTCAGGTAAAACAAACAGTGAAGAAGCAACGTGATCCAAAACTAGCTGAACGGTTTGAGAAGCGTCTGACGGATGAGCTACACAAGATATTTGATGACTCTGACAGCTTCTACTATTCAAGAACATTAGATCTAACTAACTGCTTACAACGACAATATGAAATTGAAAAAATTTTGGAAACCGAAGAGGGCAATGGGAAACCAATCACTGACATAACAAGATGGTGGAAATATGTGGACGATAGATTCTTCTGGAACAAACATATGCTCAAAGATATTATCGCTTTGGAGAGTCCTGGTTGTGATGAATGGGTTCTGCCGGTCATCCAAGGCTATGTACATCTGTCACAAATAGCCGTCGAACCACCTGATGCCAATCCCTTGAATACCGAATCATTGTCGAGTACGAATTCATGCGATGAAACTTTCACTCTAGGTCTTATATCAAGAAGATCTAGGTACCAGGCTGGAACTCGGTACAACCGTCGTGGTATAGAGCCCGGTGGGAGAGTTGCAAACTATGTTGAAACTGAGCAGATTGTGTCCATTGTGTGCTCGGATAGCATTCACAGAGCATCATTTGTACAGGTCCGTGGATCTGTGCCAATATACTGGAGCCAGCCTGACTACAAGTTCAGGCCGCCGCCGAGGCTTGACAGAACCGAAGAAGAATCCCACCAGGCTTTTAAGAAGCACTTCGAAGAGGAGTTAAAACTTTATAAACAGATTTGTATAGTGAATTTGGTAGAGCAGCAGGGGAGAGAACGCATCATATGGGAGGCCTATAGCAACCACGTCCTCAAGTACAACAGTCCTAATATAATATACGCTACCTTCGACTTCCACGAATACTGCCGCGGCATGCACTATGAGAACGTTAGCATATTAATAAACGCTATATCGGATATCATCGGTGACATGCGTTTCTGTTGGCGTGACGACCGCGGCCTCATCTGTACACAGACCGGCGTTTTCAGAGTCAACTGTATAGACTGTCTCGATCGCACCAACGTCGTACAGACAGCGATAGCCAAGTACGTGTTGGAGTTGCAGCTATGTAGGTTAGGTCTCGGAGCCCCGGGCTTCGGTCTACCCGTGGGGCTCCGACAGGCCTTTCTGGCTATGTGGGCTGATAGCGGAGATCTCGTATCAAGGCAATACGCTGGCACCAAAGCTCTCAAGGGTGATTATACTCGCACAGGAGAGAGGAAGTTAACTGGGATGATGAAAGATGGCGTCGCATCCGCTAACAGATATTATCTGTCAACATTCAAGGACGCTCTCCGTCAAGTGGCTATTGATGTAATGACAGGAGAATCCAAAACTATACCAGAACAACTCATTGTACCCGACTGTACCCCGTGTACCTCAGTCAAGGTTCTTATGTTTAACGATCAATCAGTACCAGATACAGCGGCTATGGCACAGCATGTGAAGAGCCTCATAGATGACTGTAAGAAGCTCTTGGTGGATACGGAACCAGTCCTCGGCTCCTGGGGACTTATAGATGCCGACCCACATACCGGAGATCCTCAGGAAACGGAAATGGATAGCGTCCTGGTGTTAACGGGAGAGGCGTATTACGTCACAGACTATGACGAGACCTCCGACAGGTTGTTGTCGGTACAGAGGGTGCCTCTGAAGGATGTTACGTCTATAGAACTTGGCACTTTGGACTCTAGTGCTACGATATTCAGCGTGGCCCGCAAGAGTAACGCCGAGCCGGTGCACTGTATACGTATCAACTACATGTATAACAACGAACCGGGATACTTCCACATGTTCAGGTCAACATCGCTGAGGTTCTTCAATAACATGGCTGTGGCTATAAATACCAAGGACGAGATGATAGAATCCCTTCACTCGATATGTGAATCACTAGTTGTGGCGAGGGATGTAGCGAAATTATCACCTGTACCATTCCACGACGGAGTCAAGCTAGAAAGGAAAAAGTCTAAGATACATCCAACACAAGGGTCTTCAGGCGCTAAATCTTCCCTGTACTTGGACCTATCGAGACTGCCAACACTCACTAGAAATGTCAGCGAGACTCAGCTGGTGGCGGACATAAGGAGTGTCGGATCAAAAGCCCTAAACAATATGTCGGAGCAATTCAGCAAGCTGAATAAACTGAGTCACTCCCTGAACGCCAGAGCCAGACCTACATTACAATTGAAATTCGACCAAGGGACTTCAAAGACAAAGAAAATATTTACATTAGGGCAGAAGAGTGATGGCAAAAAGAAAGGAAGCCTATCAGACGGCGCGAGCTCGGACTATTCATCAGACGACGAGGCGAGGACCAACATCTTCGAGCCCACGCTAGACAACTTCGAACATCTACAACACTACATCGGAGATCAGGAGAGAAAGGACGAGAATGATTGCGATCTAGTAGAAAATCCACTGTATTCATCCAAAATTGAACCAAACTACGACATGGACACCACGATTTCCGATACCAGGACAAACGTATCGAAAACGCCAAGCAACAAGATGAACCCGTTCAACAGTGACGTCACACCGGAAATACAAGTGGACTCCAAGCCGATACCGCCGAATTCGCTGCTGCTGAACCAAAAACTGTCGCAGAGCTCCAGTTACCTCAACTTTGAACCTACGGTTAACTACGTAAGGTCTAATTCCCAGCACGAGATAACATTGAACATAGCGCAGTCGCATAGCGAATCAGCGTTACGGCAGTTGAAGAATATAACAAGTCCTGTGTCCACAGCCACCAAAGAAATGGTACTCTCGCCTCTCTCAAAACTGGCTAAGGGCGTGCAAACATTGGGCGCCAATCTAGATCCGAGGAAGATAAAGGCTCCGGCATCGGTGAAACATATATCAGAACAGCAGTATGAAGAACACAAGAGATTACAAGAAAAATGGCAGGATAGCAACACACGGCTGATTGCTCTGTGA

Protein sequence:

>DPOGS210020-PA
MELFRSESYFIFVRNESSLWWNRLTGAFSVRSAWDLSDIEDIECLGITEGIIGKVEHSNIYEPRLMIIKESVPMGQIYFHHTIYKIKSICFLNMGVNNQELELSPCTKHGSSTLLENARSSSKKMGARLFENSAFLNKTVGAVKNVSNTIKTTTQQAATQVKQTVKKQRDPKLAERFEKRLTDELHKIFDDSDSFYYSRTLDLTNCLQRQYEIEKILETEEGNGKPITDITRWWKYVDDRFFWNKHMLKDIIALESPGCDEWVLPVIQGYVHLSQIAVEPPDANPLNTESLSSTNSCDETFTLGLISRRSRYQAGTRYNRRGIEPGGRVANYVETEQIVSIVCSDSIHRASFVQVRGSVPIYWSQPDYKFRPPPRLDRTEEESHQAFKKHFEEELKLYKQICIVNLVEQQGRERIIWEAYSNHVLKYNSPNIIYATFDFHEYCRGMHYENVSILINAISDIIGDMRFCWRDDRGLICTQTGVFRVNCIDCLDRTNVVQTAIAKYVLELQLCRLGLGAPGFGLPVGLRQAFLAMWADSGDLVSRQYAGTKALKGDYTRTGERKLTGMMKDGVASANRYYLSTFKDALRQVAIDVMTGESKTIPEQLIVPDCTPCTSVKVLMFNDQSVPDTAAMAQHVKSLIDDCKKLLVDTEPVLGSWGLIDADPHTGDPQETEMDSVLVLTGEAYYVTDYDETSDRLLSVQRVPLKDVTSIELGTLDSSATIFSVARKSNAEPVHCIRINYMYNNEPGYFHMFRSTSLRFFNNMAVAINTKDEMIESLHSICESLVVARDVAKLSPVPFHDGVKLERKKSKIHPTQGSSGAKSSLYLDLSRLPTLTRNVSETQLVADIRSVGSKALNNMSEQFSKLNKLSHSLNARARPTLQLKFDQGTSKTKKIFTLGQKSDGKKKGSLSDGASSDYSSDDEARTNIFEPTLDNFEHLQHYIGDQERKDENDCDLVENPLYSSKIEPNYDMDTTISDTRTNVSKTPSNKMNPFNSDVTPEIQVDSKPIPPNSLLLNQKLSQSSSYLNFEPTVNYVRSNSQHEITLNIAQSHSESALRQLKNITSPVSTATKEMVLSPLSKLAKGVQTLGANLDPRKIKAPASVKHISEQQYEEHKRLQEKWQDSNTRLIAL-