Monarch geneset OGS2.0

DPOGS200347
Transcript: DPOGS200347-TA (1017 bp)
Protein: DPOGS200347-PA (338 aa)
Genomic position: DPSCF300026 + 536610-538102
RNAseq coverage: 833x (Rank: top 15%)
Annotation
Heliconius: HMEL000039 | 0.0 | 94.38%
Bombyx: BGIBMGA005641-TA | 4e-178 | 87.57%
Drosophila: lic-PA | 2e-135 | 68.24%
EBI UniRef50: UniRef50_Q9U983 | 1e-132 | 67.92% | MAPKK n=16 Tax=Bilateria RepID=Q9U983_DROME
NCBI RefSeq: XP_002428268.1 | 5e-146 | 77.12% | cAMP-dependent protein kinase catalytic subunit, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|261335949 | 0.0 | 94.38% | putative mitogen-activated protein kinase (MAPKK) [Heliconius melpomene]
NCBI nr blastx: gi|261335949 | 0.0 | 94.38% | putative mitogen-activated protein kinase (MAPKK) [Heliconius melpomene]
Group
Gene Ontology
GO:0016772 | 2.9e-67 | transferase activity, transferring phosphorus-containing groups
GO:0005524 | 1.8e-63 | ATP binding
GO:0004674 | 1.8e-63 | protein serine/threonine kinase activity
GO:0006468 | 1.8e-63 | protein phosphorylation
GO:0004672 | 1.8e-50 | protein kinase activity
GO:0004713 | 3.4e-12 | protein tyrosine kinase activity
KEGG pathway: phu:Phum_PHUM372730 | 1e-145
K04432 (MAP2K3, MKK3) maps to:
    GnRH signaling pathway
    Amyotrophic lateral sclerosis (ALS)
    Fc epsilon RI signaling pathway
    Toll-like receptor signaling pathway
    MAPK signaling pathway
InterPro domain
[37-338] | IPR011009 | 2.9e-67 | Protein kinase-like domain
[51-314] | IPR002290 | 1.8e-63 | Serine/threonine-protein kinase domain
[56-313] | IPR017442 | 1.8e-50 | Serine/threonine-protein kinase-like domain
[51-313] | IPR020635 | 3.4e-12 | Tyrosine-protein kinase, catalytic domain
Orthology group: MCL11586 | Multiple-copy universal gene

Nucleotide sequence:

>DPOGS200347-TA
ATGTCTAACAGAAGAATTAAAAAACCACCACCTCCCGCATTCAGTTTTAAGCCACAAGAAGAACAAGTCACTGTAACACCTCCAAGGAATTTGGATAAACAGACCACCATCACAGTTGATGACAAGACCTTTACTGTTCATGCTGACGATCTGGTAAAATTATGTGATCTTGGCCGAGGAGCTTATGGCATTGTGGAGAAGATGCACCACAAACCCAGTAACACCATCATGGCAGTCAAGAGGATCACAGCTTCATTCAATAGCCAATCATTAGAGTTGAAGCGTTTGTTAATGGATCTCGATGTGTCCATGAGGGCCAGTGCATGTCCTTACACTGTACATTTTTATGGTGCCATGTTCAGAGAGGGGGATGTTTGGATCTGTATGGAGGTTATGGATATGAGCCTTGATAAGTTCTATACAAAGGTATACAAGAACAATAAAACAATAACAGAGAATATTCTTGGGAAAATTGCATTTTCGGTTGTGAGTGCACTGCATTACCTCTATTCAAAGCTGAGGGTTATACACAGAGATGTGAAACCTTCAAATATATTGATAAATAGGAAAGGGGAGGTCAAAATGTGTGATTTTGGGATTTCAGGATATTTAGTAGATTCAGTAGCTAAGACCATAGATGCTGGCTGTAAGCCTTACATGGCACCCGAAAGGATTGATCCAAGCGGTAATCCTGGACAATATGACATCAGAAGTGACGTATGGTCCCTTGGAATATCCATGATCGAACTTGCTACGGGAAAGTTTCCTTACAACACCTGGGGAACACCATTTGAACAGCTCAAGCAAGTCGTAGAGGATGATCCACCAAGTCTTCCTATAGGACAGTTCTCACCCGAGTTTGAGGACATAATCACCCAATGCCTCAAAAAGGATTACAGGCAGAGACCGAATTATGACGCACTGCTTTCACATCAATTCTGCCAGGAACACAGCGAGAAAGAAACAGACGTGGCTTCCTTTGTCAAGGAGATACTTGATATACCCGACGATTCCTAG
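
The lengths reported in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the 1017 bp transcript is a complete coding sequence that includes the stop codon, and that the genomic coordinates are 1-based and inclusive; the record states neither, and the function names are illustrative only.

```python
def protein_length(cds_bp, includes_stop=True):
    """Number of amino acids encoded by a coding sequence of cds_bp bases."""
    codons, remainder = divmod(cds_bp, 3)
    if remainder:
        raise ValueError("coding sequence length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if includes_stop else codons


def genomic_span(start, end):
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


aa = protein_length(1017)                # 339 codons, minus the stop codon
span = genomic_span(536610, 538102)      # footprint on scaffold DPSCF300026
print(aa, span, span - 1017)             # 338 1493 476
```

Under these assumptions the 338 aa protein length agrees with the 1017 bp transcript, and the 476 bp difference between the genomic span and the transcript would correspond to intronic sequence.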

Protein sequence:

>DPOGS200347-PA
MSNRRIKKPPPPAFSFKPQEEQVTVTPPRNLDKQTTITVDDKTFTVHADDLVKLCDLGRGAYGIVEKMHHKPSNTIMAVKRITASFNSQSLELKRLLMDLDVSMRASACPYTVHFYGAMFREGDVWICMEVMDMSLDKFYTKVYKNNKTITENILGKIAFSVVSALHYLYSKLRVIHRDVKPSNILINRKGEVKMCDFGISGYLVDSVAKTIDAGCKPYMAPERIDPSGNPGQYDIRSDVWSLGISMIELATGKFPYNTWGTPFEQLKQVVEDDPPSLPIGQFSPEFEDIITQCLKKDYRQRPNYDALLSHQFCQEHSEKETDVASFVKEILDIPDDS-
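
The relationship between the two sequences above can be checked by translating the nucleotide sequence with the standard genetic code. This is a rough sketch, not the pipeline used to produce the record: it assumes the standard code (NCBI translation table 1) applies and that the trailing "-" in the protein sequence marks the stop codon. The `translate` helper and its `stop_symbol` parameter are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stop codons are written as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five and last four codons of DPOGS200347-TA.
print(translate("ATGTCTAACAGAAGA"))  # MSNRR
print(translate("GACGATTCCTAG"))     # DDS-
```

Passing the full DPOGS200347-TA sequence to `translate` and comparing the result with DPOGS200347-PA would confirm whether the two entries are consistent end to end.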