Monarch geneset OGS2.0

DPOGS204658
TranscriptDPOGS204658-TA930 bp
ProteinDPOGS204658-PA309 aa
Genomic positionDPSCF300170 - 505471-515959
RNAseq coverage3029x (Rank: top 4%)
Annotation
HeliconiusHMEL0040144e-12097.60% 
BombyxBGIBMGA007460-TA0.098.38% 
Drosophilamts-PA2e-17792.88% 
EBI UniRef50UniRef50_P677751e-17694.17%Serine/threonine-protein phosphatase 2A catalytic subunit alpha isoform n=418 Tax=root RepID=PP2AA_HUMAN
NCBI RefSeqXP_002426726.11e-17995.79%serine/threonine-protein phosphatase PP-V, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3228016314e-17996.12%hypothetical protein SINV_14598 [Solenopsis invicta]
NCBI nr blastxgi|3228016311e-17996.12%hypothetical protein SINV_14598 [Solenopsis invicta]
Group
Gene OntologyGO:00167872.8e-154hydrolase activity
KEGG pathwayphu:Phum_PHUM2690203e-179 
 K04382 (PPP2C)maps-> Meiosis - yeast
    Cell cycle - yeast
    Tight junction
    Wnt signaling pathway
    TGF-beta signaling pathway
    Chagas disease
    Long-term depression
    Oocyte meiosis
InterPro domain[23-293] IPR0061862.8e-154Serine/threonine-specific protein phosphatase/bis(5-nucleosyl)-tetraphosphatase
[51-243] IPR0048431.5e-41Metallophosphoesterase domain
Orthology groupMCL11392 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204658-TA
ATGGAGGACAAGGCGTCATTAAAGGAACTGGATCAATGGATAGAACAGCTAAACGAATGCAAGCAATTGACAGAAAACCAAGTGAAAACGTTATGCGAAAAGGCAGAAGAGATCCTGACCAAAGAGTCTAACGTCCAAGAGGTGAACTGTCCAGTGACGGTATGTGGGGACGTTCACGGCCAGTTCCACGACCTGATGGAACTGTTCCGTATAGGTGGAAGATCACCGGACACCAACTACCTGTTCATGGGCGACTATGTTGACCGTGGCTACTACAGTGTTGAGACAGTCACGTTGCTTGTAGCGCTTAAGGTCAGATATCGGGAGAGAATAACAATCCTCCGCGGCAACCACGAGTCACGTCAGATAACTCAGGTATACGGCTTCTACGACGAGTGTCTACGTAAATACGGCAACGCGTCCGTGTGGAAGCATTTCACGGATCTTTTCGACTTCCTGCCGCTGACGGCCCTCGTCGATGGTCAGATCTTCTGTCTCCACGGAGGCCTGAGTCCGTCTATAGACACCTTGGATCACATTAGGGCTTTGGACCGCGTGCAGGAAGTACCACACGAGGGACCCATGTGTGACCTACTCTGGAGTGATCCCGATGATAGAGGCGGTTGGGGTATATCCCCACGTGGTGCTGGCTATACCTTCGGTCAGGACATCTCGGAGACCTTCAATCACAGCAACGGCCTGACCTTGGTGTCCCGAGCCCACCAGCTGGTGATGGAGGGGTACAATTGGTGTCACGATAGAAATGTGGTCACTATATTCTCAGCACCCAATTACTGCTACAGATGTGGTAACCAAGCTGCCATAATGGAATTGGATGATGCGCTCAAGTATTCATTCCTTCAGTTCGACCCGGCGCCTCGACGCGGCGAGCCGCATGTCACGCGGCGGACGCCGGATTATTTCATGTAG

Protein sequence:

>DPOGS204658-PA
MEDKASLKELDQWIEQLNECKQLTENQVKTLCEKAEEILTKESNVQEVNCPVTVCGDVHGQFHDLMELFRIGGRSPDTNYLFMGDYVDRGYYSVETVTLLVALKVRYRERITILRGNHESRQITQVYGFYDECLRKYGNASVWKHFTDLFDFLPLTALVDGQIFCLHGGLSPSIDTLDHIRALDRVQEVPHEGPMCDLLWSDPDDRGGWGISPRGAGYTFGQDISETFNHSNGLTLVSRAHQLVMEGYNWCHDRNVVTIFSAPNYCYRCGNQAAIMELDDALKYSFLQFDPAPRRGEPHVTRRTPDYFM-