Monarch geneset OGS2.0

DPOGS201001
TranscriptDPOGS201001-TA1722 bp
ProteinDPOGS201001-PA573 aa
Genomic positionDPSCF300147 - 137148-146553
RNAseq coverage435x (Rank: top 28%)
Annotation
HeliconiusHMEL0137232e-7269.63% 
BombyxBGIBMGA009071-TA9e-7258.30% 
DrosophilaPten-PH3e-6542.39% 
EBI UniRef50UniRef50_C5IX044e-9250.87%Phosphatase and tensin-like A n=9 Tax=Apis mellifera RepID=C5IX04_APIME
NCBI RefSeqNP_001155985.17e-9350.87%phosphatase and tensin-like [Apis mellifera]
NCBI nr blastpgi|3072155098e-9551.90%Phosphatidylinositol-3,4,5-trisphosphate 3-phosphatase PTEN [Harpegnathos saltator]
NCBI nr blastxgi|3072155099e-9648.83%Phosphatidylinositol-3,4,5-trisphosphate 3-phosphatase PTEN [Harpegnathos saltator]
Group
Gene OntologyGO:00055151.2e-26protein binding
GO:00081381.7e-07protein tyrosine/serine/threonine phosphatase activity
GO:00064701.7e-07protein dephosphorylation
KEGG pathwayame:4118592e-92 
 K01110 (E3.1.3.67, PTEN)maps-> Inositol phosphate metabolism
    Prostate cancer
    Tight junction
    Glioma
    Melanoma
    Phosphatidylinositol signaling system
    Pathways in cancer
    Small cell lung cancer
    Endometrial cancer
    p53 signaling pathway
    Focal adhesion
InterPro domain[154-303] IPR0089731.2e-26C2 calcium/lipid-binding domain, CaLB
[155-299] IPR0140206.2e-22Tensin phosphatase, C2 domain
[46-126] IPR0003401.7e-07Dual specificity phosphatase, catalytic domain
Orthology groupMCL15111 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201001-TA
ATGGGCTTCCCCGCCGAGAAGCTGGAGGGTGTGTACAGGAACCACATAGATGAAGTGTACCGCTTCCTGGAACAGAAGCACAAGGACCACTACATGATATACAACCTGTGCGCCGAGAGAGAGTACGACTACACCAAGTTCAACAAGAGGGTGCAAGTGTTCGCGTTCAAGGACCACGAGCCGCCGAAGATCGGTCAGATCCAGCCGTTCTGTGAGGACGTTCACAACTGGCTCAGCAAGGACCCTCGGAACGTGGCCGCCGTGCACTGCAAGGCTGGCAAAGGAAGAACCGGTACAATGGTGTGCTGCTACCTGCTGTACAGCGGACAGAAGACCACGGCGGATGAGGCTCTTAAATACTATGGGAACAAACGCACACACGACGAGAAAGGTGTTACGATCCCGTCCCAGCGACGCTACGTGGAGTACTACGCGGCGCTGGTCCGAGGAGGGCTCTCGTACCGCGCCACCAGGGTGCACGTCCGGGAGCTCCTCATGTACCCTCCGCCGGCCTTCAACGGCTCGCAGTGTACCCTGCAGCTGACGGTCACGCAGGCTGACCCCTTCCACAAGACGTCGCTGGGCAGCCACGAGGTGCGCCGCCACGAGTCGGTGGCGCGGGTGGTGGCGACGGCGTGCGCTCCGCTGGCCGGGGACGTGCGGGTGGACGTGTACTGCAAGCCCAAGATGAAGATGAGGAAGGAGAGGCTGTTCCACTTCTGGTTCAACACGTACTTCGTCACCGCTGGCGTCGGAGCCGCCAACGTGCCCGCGCCCATCGACAGTCAAAATCAAGAAACATTCAAGTTGACGCTGGACAAGTGGCAGCTGGACGACGCGCACAAAGACAAGCAACACAAAATGTACAGCGCCGACTTTAAGGTGGAGCTGATAGTTCACAAGCTGCCCGAGTCGTCGACGTTCAGCGTTCCCCGCGGGTCCTCTCCGGCCAGCTCCTCCTCGGACCCGGACACGGAACACGAGCAGGAGTGGGACTCCGGTGAGACAGACTACACTACGGACAAATATCTCCAGACAGATCCCGAGATAGATCCCGAGATAGACCGCGGGATATCCGACGCGGCCGACTCCGACAGGCAGCTTCACACCAGTCACCAGTACCATCGCCAGCGGGAATGCGACGCTCTACCGGGGTCGAACCCGTTCTATCCCCGGAGCGATCCGTTCGCTCCCTCCGACCTCCCTCCCCGGTTACGTCCCGGGCCCGACCCGCAGCACTCCCCTCCCTACCGCTATCTGTCAAACGTGCCCGAATATCCCGAACACTTCCCTCGAGACTGTACGAGGGTGGATCGCCCGCCCCCGACACACACCACCCACCCTCCCGAGACGAACACCAACGGTGACGAGGACCTCCTGTACACCGACACCGAGCCGGTCATGTTCGGCGGACACAAACTAGCAGACTATCGCTTAGAGAAATTGCCGGAGTACACTCACAGTGGCTACTACGACGACAGAAATAACTTCAACGTACCGATGGCGCGGTACGACGTGTGTCCCGGGGACGAGTACCCGGCGGGGGAGGACGCCGACACTATGGAGTACCGCGGGGACGACGCCAAACAATACGATAGGAATAAGAAGTGCAAGGACCCTCTCACCAAGCCGCGTCTGTCGCTGGGGGACATGAGGGCCTCGTGGAGAGAGTTCAGCGGCAAGTTCGGCGAGAACAGGAAAAAGAAGAATAGGGACGACTGA

Protein sequence:

>DPOGS201001-PA
MGFPAEKLEGVYRNHIDEVYRFLEQKHKDHYMIYNLCAEREYDYTKFNKRVQVFAFKDHEPPKIGQIQPFCEDVHNWLSKDPRNVAAVHCKAGKGRTGTMVCCYLLYSGQKTTADEALKYYGNKRTHDEKGVTIPSQRRYVEYYAALVRGGLSYRATRVHVRELLMYPPPAFNGSQCTLQLTVTQADPFHKTSLGSHEVRRHESVARVVATACAPLAGDVRVDVYCKPKMKMRKERLFHFWFNTYFVTAGVGAANVPAPIDSQNQETFKLTLDKWQLDDAHKDKQHKMYSADFKVELIVHKLPESSTFSVPRGSSPASSSSDPDTEHEQEWDSGETDYTTDKYLQTDPEIDPEIDRGISDAADSDRQLHTSHQYHRQRECDALPGSNPFYPRSDPFAPSDLPPRLRPGPDPQHSPPYRYLSNVPEYPEHFPRDCTRVDRPPPTHTTHPPETNTNGDEDLLYTDTEPVMFGGHKLADYRLEKLPEYTHSGYYDDRNNFNVPMARYDVCPGDEYPAGEDADTMEYRGDDAKQYDRNKKCKDPLTKPRLSLGDMRASWREFSGKFGENRKKKNRDD-