Monarch geneset OGS2.0

DPOGS200794
TranscriptDPOGS200794-TA975 bp
ProteinDPOGS200794-PA324 aa
Genomic positionDPSCF300454 - 66234-70554
RNAseq coverage2967x (Rank: top 4%)
Annotation
HeliconiusHMEL0169533e-16492.21% 
BombyxBGIBMGA002237-TA9e-16889.59% 
DrosophilaPp2A-29B-PD2e-13577.56% 
EBI UniRef50UniRef50_Q173Y94e-13675.87%Serine/threonine protein phosphatase 2a regulatory subunit a n=7 Tax=Coelomata RepID=Q173Y9_AEDAE
NCBI RefSeqNP_001034525.15e-14679.50%protein phosphatase 2, regulatory subunit A, alpha isoform [Tribolium castaneum]
NCBI nr blastpgi|3291304277e-16594.32%protein phosphatase 2 regulatory subunit A alpha isoform [Helicoverpa armigera]
NCBI nr blastxgi|3291304274e-16695.21%protein phosphatase 2 regulatory subunit A alpha isoform [Helicoverpa armigera]
Group
Gene OntologyGO:00054881.3e-61binding
KEGG pathwaytca:6416021e-145 
 K03456 (PPP2R1)maps-> Meiosis - yeast
    Cell cycle - yeast
    Tight junction
    Wnt signaling pathway
    TGF-beta signaling pathway
    Chagas disease
    Long-term depression
    Oocyte meiosis
InterPro domain[8-312] IPR0119891.3e-61Armadillo-like helical
[9-314] IPR0160242.2e-49Armadillo-type fold
Orthology groupMCL10980 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200794-TA
ATGGCGGCCAGCGACTCAGGGACGGATGAGTCACTTTATCCCATCGCGGTACTTATCGACGAATTAAAAAATGAGGATGTTCAACTCAGATTGAATTCTATTAAGAAGCTATCAACCATCGCTCTGGCTTTGGGTGTGGAAAGAACAAGATCAGAACTCGTTCCGTTCCTAACTGAGACTATATATGATGAAGATGAAGTACTTTTGGCACTCGCTGAACAACTTGGCAACTTCATCAATCTTGTCGGCGGAGGTGAATATGCACACTGTCTTTTGCCCCCACTGGAGTCGCTCGCTACTATCGAAGAAACTGTTGTCAGGGATAAGGCTGTGGCCTCACTGAGAGCGGTCGCCGCCCATCATTCACCACAAGCTTTAGAACAGCATTTTGTGCCATTAGTACAACGTCTTGCGGGCGGAGATTGGTTTACATCTAGAGCATCCGCCTGTGGACTCTACAGCGTCTGTTACCCTCGTGTCAGTGCACCAGTGAAGGCTGAGCTCCGAGAACATTTCCGTGTCCTGTGCCAGGATGACACACCCATGGTGAGACGAGCGGCCGCATTCAAGCTGGGAGAGTTTGCTAGAGTGGTCGAAGTGGAATATGTGAAGAGCGATCTCATACCAATGTTCATACATCTGGCCCAGGATGAACAGGACTCGGTACGTCTACTGGCTGCAGAGGCCTGTGCCGCTGTGGCTGCCCTGCTTCCGCCTGAAGATATGGAACAGCTAGTAATGCCCACGGTGAGGGCCCGTGCTGGAGACACCTCCTGGAGGGTGCGGTTCATGGTGGCTGAAAAGTTTGTGGAGCTGCAACAGGCTGTGGGTCCGGAGCTGGCCCGCTCAGACCTGGCCCAGATCTTCCAGGCGTTGCTCAAGGACAGCGAGGCTGAAGTACGAGCTGCAGCAGCTGCTAAGGTTAAAGACTTCTGTATGACTTGGATAAAGCCCACCAGGAACACATCATCATGA

Protein sequence:

>DPOGS200794-PA
MAASDSGTDESLYPIAVLIDELKNEDVQLRLNSIKKLSTIALALGVERTRSELVPFLTETIYDEDEVLLALAEQLGNFINLVGGGEYAHCLLPPLESLATIEETVVRDKAVASLRAVAAHHSPQALEQHFVPLVQRLAGGDWFTSRASACGLYSVCYPRVSAPVKAELREHFRVLCQDDTPMVRRAAAFKLGEFARVVEVEYVKSDLIPMFIHLAQDEQDSVRLLAAEACAAVAALLPPEDMEQLVMPTVRARAGDTSWRVRFMVAEKFVELQQAVGPELARSDLAQIFQALLKDSEAEVRAAAAAKVKDFCMTWIKPTRNTSS-