Monarch geneset OGS2.0

DPOGS200381
TranscriptDPOGS200381-TA1230 bp
ProteinDPOGS200381-PA409 aa
Genomic positionDPSCF300026 + 1129178-1132412
RNAseq coverage499x (Rank: top 25%)
Annotation
HeliconiusHMEL0053860.091.93% 
BombyxBGIBMGA007238-TA0.094.46% 
DrosophilaMkk4-PA1e-14373.65% 
EBI UniRef50UniRef50_Q17GI13e-14965.21%Dual specificity mitogen-activated protein kinase kinase 4 MAPKK4 n=9 Tax=Opisthokonta RepID=Q17GI1_AEDAE
NCBI RefSeqXP_001603456.13e-16280.59%PREDICTED: similar to dual specificity mitogen-activated protein kinase kinase 4 MAPKK4 [Nasonia vitripennis]
NCBI nr blastpgi|3320307632e-16174.93%Dual specificity mitogen-activated protein kinase kinase 4 [Acromyrmex echinatior]
NCBI nr blastxgi|1565543955e-15679.37%PREDICTED: dual specificity mitogen-activated protein kinase kinase 4-like [Nasonia vitripennis]
Group
Gene OntologyGO:00167729.8e-75transferase activity, transferring phosphorus-containing groups
GO:00055245e-73ATP binding
GO:00046745e-73protein serine/threonine kinase activity
GO:00064685e-73protein phosphorylation
GO:00046724.1e-56protein kinase activity
GO:00047134.3e-12protein tyrosine kinase activity
KEGG pathwaynvi:1001194568e-162 
 K04430 (MAP2K4, MKK4)maps-> GnRH signaling pathway
    Fc epsilon RI signaling pathway
    Toll-like receptor signaling pathway
    MAPK signaling pathway
    Chagas disease
    ErbB signaling pathway
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[92-400] IPR0110099.8e-75Protein kinase-like domain
[106-369] IPR0022905e-73Serine/threonine-protein kinase domain
[108-369] IPR0174424.1e-56Serine/threonine-protein kinase-like domain
[106-369] IPR0206354.3e-12Tyrosine-protein kinase, catalytic domain
Orthology groupMCL12908 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200381-TA
ATGTCGAAGAATGGCGAAGTGTCGTCAAATCAAGGTCCCAGCAGGCCTAGTATGCCGAAACCAGATCTGAATTTGTTCAGTACAGATAAACGTAAAGTATTAAACCTTCAATTAGGAGGCTCATCAAGTGAAAGCGCCGCATTTATGCCTTTTTCCCCGAATGCATCAGCTCAAAAAACTCGACTACCATCTAGAACTATCCGAGATGTTTTGCCAGAAAACCCAAGGGATCGATGTCGCATCTATCCTTCAATGCAATCATCGGGAAAATTGCAATTATCAGCAACAGAGGTGTATGATTTTACATCTGACGACTTACAAGACCTCGGTGAAATAGGAAGAGGTGCATTTGGAGCGGTCAACAAAATGGTCCATAGAAAGAGCAACAGAGTTATGGCAGTAAAACGTATAAGATCCACTGTTGATGAGAAGGAGCAGAAACAGTTGTTGATGGATCTTGAAGTTGTCATGAAAAGTAACGACTGCCCATACATTGTACAGTTTTATGGGGCTTTGTTTAAAGAAGGTGACTGTTGGATATGTATGGAATTAATGGATACCTCATTAGATAAATTCTATAAGTTCATCTGCGAAAGGATGCAGACTCGAATACCTGAAAATATTATTGCTAAAATAACACTAGCGACTGTTAAAGCTTTAAATTATTTAAAAGAAAAACTTAAAATAATTCACAGGGATGTCAAGCCATCTAATATACTTCTAGATCGCAGAGGGAACATAAAGTTATGCGACTTTGGTATTTCAGGAAAATTAGTTGATTCTATAGCTCGTACACGGGATGCAGGTTGTAGACCTTATATGGCGCCTGAACGCATTGACCCCGGCCGAGCAAGAGGATATGATGTTAGATCAGATGTATGGTCACTAGGCATCACACTGATGGAGGTAGCAACAGGATCCTTTCCTTACCCTCGCTGGGGCTCAGTGTTTGAACAGTTACAGCAAGTTGTTCAGGGAGACCCTCCTCGCCTTACCAACAAAAATAATATATTTTCAAATGATTTTGTCAACTTTGTTAATACCTGTTTAATAAAAGAAGAAACACAAAGGCCCAAATATAATAGGCTATTGGAGCATCCATTCATCAAGGGTATTGATCAGAGTAGAGTGGATGTTGCCGCATATGTATGTGAGATATTAGATGCTATGGAACGTAACGGAGTTAGTCCGTTCACCACAGACCAGCCAGCACAGGCTTGGATAGACTAA

Protein sequence:

>DPOGS200381-PA
MSKNGEVSSNQGPSRPSMPKPDLNLFSTDKRKVLNLQLGGSSSESAAFMPFSPNASAQKTRLPSRTIRDVLPENPRDRCRIYPSMQSSGKLQLSATEVYDFTSDDLQDLGEIGRGAFGAVNKMVHRKSNRVMAVKRIRSTVDEKEQKQLLMDLEVVMKSNDCPYIVQFYGALFKEGDCWICMELMDTSLDKFYKFICERMQTRIPENIIAKITLATVKALNYLKEKLKIIHRDVKPSNILLDRRGNIKLCDFGISGKLVDSIARTRDAGCRPYMAPERIDPGRARGYDVRSDVWSLGITLMEVATGSFPYPRWGSVFEQLQQVVQGDPPRLTNKNNIFSNDFVNFVNTCLIKEETQRPKYNRLLEHPFIKGIDQSRVDVAAYVCEILDAMERNGVSPFTTDQPAQAWID-