Monarch geneset OGS2.0

DPOGS207034
TranscriptDPOGS207034-TA2952 bp
ProteinDPOGS207034-PA983 aa
Genomic positionDPSCF300001 + 1673245-1683706
RNAseq coverage565x (Rank: top 22%)
Annotation
HeliconiusHMEL0094160.082.55% 
BombyxBGIBMGA012976-TA0.087.06% 
DrosophilaTao-1-PD0.068.67% 
EBI UniRef50UniRef50_G6CHU00.0100.00%Serine/threonine protein kinase n=7 Tax=Endopterygota RepID=G6CHU0_DANPL
NCBI RefSeqXP_001658893.10.066.57%serine/threonine protein kinase [Aedes aegypti]
NCBI nr blastpgi|1571176980.066.57%serine/threonine protein kinase [Aedes aegypti]
NCBI nr blastxgi|3174193410.052.71%Serine/threonine-protein kinase TAO2 [Dicentrarchus labrax]
Group
Gene OntologyGO:00055242.4e-91ATP binding
GO:00046742.4e-91protein serine/threonine kinase activity
GO:00064682.4e-91protein phosphorylation
GO:00167721.6e-81transferase activity, transferring phosphorus-containing groups
GO:00046722.2e-66protein kinase activity
GO:00047133.5e-27protein tyrosine kinase activity
KEGG pathwayaag:AaeL_AAEL0002170.0 
 K04429 (TAO)maps-> MAPK signaling pathway
InterPro domain[25-278] IPR0022902.4e-91Serine/threonine-protein kinase domain
[20-374] IPR0110091.6e-81Protein kinase-like domain
[26-278] IPR0174422.2e-66Serine/threonine-protein kinase-like domain
[25-278] IPR0206353.5e-27Tyrosine-protein kinase, catalytic domain
Orthology groupMCL10474 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207034-TA
ATGCCACGTCCGGGAAGTTTGAAGGATCCGGACATTGCGGAGTTATTCAGCAAACATGATCCAGAGAAGATTTTTGAAGATCTCCGTGAGATTGGTCATGGCTCCTTCGGAGCCGTGTACTATGCTCGTTGCAATCTCACCAAAGAGATAGTAGCTATCAAGAAGATGTCCTACCTCGGGAAGCAGAGTGAAGAGAAATGGCAGGACATTTTGAAAGAGATAAAGTTTCTCAAGCAATTGGATCACCCGAACACTATAGAATACAAGGGTTGCTACAAACGGGAGCACACCGCCTGGCTGGTCATGGAGTACTGTGTCGGCTCGGCTTCTGACATTATTGAGGTTCACAAGCGTCCTCTGCGCGAGGAGGAGATAGCTGCGATATGCGAGGGCGTCGTGTGCGGTCTGTCTTACTTGCATTCCTTGGGGCGGATCCATCGCGACATCAAGGCTGGTAATATCCTACTGACGGAGAACGGGACGGTAAAGCTAGCTGACTTCGGCAGCGCCAGCATCAAGTGCCCAGCCAACAGCTTTGTCGGCACTCCCTACTGGATGGCGCCGGAGGTCATCCTCGCCATGGACGAGGGCCAGTACGATGGAAAGGTTGATGTGTGGTCGCTCGGTATAACGTGCATTGAACTGGCTGAGAGGAAGCCTCCGTATTTCAACATGAACGCCATGTCCGCTTTGTACCATATCGCGCAGAACGATTCACCTCCATTGCAAGCACCTGAATGGACCGATACATTCCGTTACTTCGTGGAGGCGTGCCTACAAAAGAATCCACAGGATCGTCCGTCGTCTACCAAACTTCTGTCCCACCCTTACATCACGCGGCCGAGATCACCGAACGTGCTGGTCGACCTTATACAGAGAACAAAAGCGGCTGTGAGGGATCTAGACAATTTGAATTACAGGAAGATGAAGAAAATACTCATGGTGGACGCCGATAATGAGAGCGCTATGGGTGATAATGAAGAGACGGCTGAGGAGCGTTCGGGTGGAGATAGCTCCAAATCCAATTCCGCTACATCAGAGCACAGTGCGGCCGGAGCATCCAGTCAGAGTTCCTCGTCAGGCAGTCTACGGCGCAGGCCTATCGCGCTGAACGCTAATCACAATCAGCAACAGCAGCAATACCAGCAATACAACCACCAACCAGCCCACCCCACCAGAGACGAGAGCCCTATACCGGGCTCCCACGACGACTACGTGAACAGGGAAGTTCTACGTGATTACGCGAACAGGGATTCCCTGTGTGACGATTACGGCGATGATTACGCGAATCGGGAGGCGATACGGGAGGCTCAGAGGGAAAGGGAGAGGGAACGCCAGGCTCTAGAATACAGGGAGTACGTTAACGCGCCCTCCACGTGGCAACAGGACAGCGGCGACGACAATAGAAACACGCAGAGAAGACGTGTAAGCAACAACGTTTGTGCCGCTATATCTCTCGTGTCCGAACACGGTGCGAACAACTTTGCTACTATACGGACTACTAGCATTGTCACAAAACAGCAAAAGGAGCATAATCAGGAAATGCATGAACAGATGTCCGGTTATAAACGCATGCGTCGCGAACATCAGGCTGCACTTCTCAAGTTGGAGGAGCGCTGTAAGGCGGACGTCGAGGCTCATAAGTCGCAGCTCGATCGTGAATATGACTCGCTGCTGCAACAGCTGTCTAGAGACCTGGAAAGGCTGCAGACTAAACACGCTCAAGAGCTGGAACGGAAACAGAAACAGAATTCGACAGCCGAGAAGAAGCTGATAAAGGATATAACATCACGCCAAGAACAAGAGAGGAAAGCCTTCGAGACACAGCGCAAGAGAGAGTACAAGGCGAATAAGGAGAGGTGGAAAAAGGAATTGAGTATGGACGACGCCACGCCCAAACGGCAAAGGGATGCTACATTACAGTCCCATAAGGACAATCTGAAGCAGGCGGAGGCGGCAGAGGAGGCGCGTTTGGTGCGCTCTCAGCGAGAATACCTCGAATTGGAACTGAGAAGATTCAGGAGGAGACGAATGCTGGCCCTGCATCATAAGGAACAGGAACTACTCAGAGAGGAGTTAAACAAGCGTCAGACTCAGTTGGAGCAGGCGCATGCGATGCTGTTACGTCATCACGAGAAGACTCAGGAGTTGGAGTACAGACAACAGAAGGGCGTACATGCTCTCAGGGAGGAACAGTTGTCCAACCAGCACGCGACCGAGTTGGCCAATCAGAGGGATTATATGCAGAGAGCTGAACATGAGCTCAGGAAGAAACACGCGCTGCAACTCAAACAACAGCCCAAGAGTCTTAAGCAAAAAGAAATGCTGATCCGCAAACAGTTCCGTGAGACTTGCAAGATACAAACGCGCCAGTACAAAGCACTCAAAGCACAAATCTTGCAAATGACACCTAAGGAGCAACAAAAGGAAGTCATCAAATCGCTGAAGGATGAGAAGCGTCGCAAGCTCGTGTTGTTAGGAGAACAGTATGATCAGAGTATTAGCGAGATGTTGCAGAAACAGACAGTGCGCCTTGATGAGAGCCAAATGATGGAGTGCCAGCAATTGAAGATGCAGTTGGAACACGAGCTGGATATGTTGACGGCCTACCAGAGCAAGAGTAAGATGCAGGCGGAGGCTCAGAGGAACAGAGAGCGACGGGAGCTGGAGGAGAGAGTCGCTGTCAGGAGGGCGCTGTTAGAGCAAAAGATGGAGTGCGAGTGTGGTCAGTTTGTCGCTGAACGAGCGGAGAGGACTCGCATGTTGCACGAGCGGCACGAGAGAGACCTCGATCACTTCGACAACGAGAGCGCGAGACTTGGATTCAGTGCAATGGCGATCGCCGAAGGTAGCAGGGAGGGTTACGGGGAGGAGGAGCAGTCACTCTCCGGGTCGATGCTGTCCCTCGCCCACAGCAACTCGTCGGCCAGTTTCCCGGCCGGCTCCCTCTAA

Protein sequence:

>DPOGS207034-PA
MPRPGSLKDPDIAELFSKHDPEKIFEDLREIGHGSFGAVYYARCNLTKEIVAIKKMSYLGKQSEEKWQDILKEIKFLKQLDHPNTIEYKGCYKREHTAWLVMEYCVGSASDIIEVHKRPLREEEIAAICEGVVCGLSYLHSLGRIHRDIKAGNILLTENGTVKLADFGSASIKCPANSFVGTPYWMAPEVILAMDEGQYDGKVDVWSLGITCIELAERKPPYFNMNAMSALYHIAQNDSPPLQAPEWTDTFRYFVEACLQKNPQDRPSSTKLLSHPYITRPRSPNVLVDLIQRTKAAVRDLDNLNYRKMKKILMVDADNESAMGDNEETAEERSGGDSSKSNSATSEHSAAGASSQSSSSGSLRRRPIALNANHNQQQQQYQQYNHQPAHPTRDESPIPGSHDDYVNREVLRDYANRDSLCDDYGDDYANREAIREAQRERERERQALEYREYVNAPSTWQQDSGDDNRNTQRRRVSNNVCAAISLVSEHGANNFATIRTTSIVTKQQKEHNQEMHEQMSGYKRMRREHQAALLKLEERCKADVEAHKSQLDREYDSLLQQLSRDLERLQTKHAQELERKQKQNSTAEKKLIKDITSRQEQERKAFETQRKREYKANKERWKKELSMDDATPKRQRDATLQSHKDNLKQAEAAEEARLVRSQREYLELELRRFRRRRMLALHHKEQELLREELNKRQTQLEQAHAMLLRHHEKTQELEYRQQKGVHALREEQLSNQHATELANQRDYMQRAEHELRKKHALQLKQQPKSLKQKEMLIRKQFRETCKIQTRQYKALKAQILQMTPKEQQKEVIKSLKDEKRRKLVLLGEQYDQSISEMLQKQTVRLDESQMMECQQLKMQLEHELDMLTAYQSKSKMQAEAQRNRERRELEERVAVRRALLEQKMECECGQFVAERAERTRMLHERHERDLDHFDNESARLGFSAMAIAEGSREGYGEEEQSLSGSMLSLAHSNSSASFPAGSL-