Monarch geneset OGS2.0

DPOGS204039
TranscriptDPOGS204039-TA1122 bp
ProteinDPOGS204039-PA373 aa
Genomic positionDPSCF300138 + 134871-151180
RNAseq coverage73x (Rank: top 66%)
Annotation
HeliconiusHMEL0074965e-9153.25% 
BombyxBGIBMGA004790-TA0.086.25% 
DrosophilaCG7236-PA2e-13666.97% 
EBI UniRef50UniRef50_B4JC591e-13267.69%GH11042 n=33 Tax=Coelomata RepID=B4JC59_DROGR
NCBI RefSeqXP_001600503.12e-14269.91%PREDICTED: similar to cdkl1/4 [Nasonia vitripennis]
NCBI nr blastpgi|3454926229e-14269.71%PREDICTED: cyclin-dependent kinase-like 1-like isoform 2 [Nasonia vitripennis]
NCBI nr blastxgi|3454926223e-13969.71%PREDICTED: cyclin-dependent kinase-like 1-like isoform 2 [Nasonia vitripennis]
Group
Gene OntologyGO:00055247.5e-91ATP binding
GO:00046747.5e-91protein serine/threonine kinase activity
GO:00064687.5e-91protein phosphorylation
GO:00167725e-87transferase activity, transferring phosphorus-containing groups
GO:00046722.5e-74protein kinase activity
GO:00047132.6e-14protein tyrosine kinase activity
KEGG pathway 
InterPro domain[29-320] IPR0022907.5e-91Serine/threonine-protein kinase domain
[15-351] IPR0110095e-87Protein kinase-like domain
[29-319] IPR0174422.5e-74Serine/threonine-protein kinase-like domain
[29-320] IPR0206352.6e-14Tyrosine-protein kinase, catalytic domain
Orthology groupMCL14978 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204039-TA
ATGAGGGGAAAACTTAACACCTCTATAATATCCCCATACGTAATAACCGCCAGATCGCGGGCCAGCAGCCGAGCTATGGAGCGTTATGAGAAACTGGCGAAGCTGGGCGAGGGTTCGTACGGGTTGGTGTACAAGTGCAGGAACAGGGAGACTGGGGAAGTGGTTGCCGTGAAGAAGTTTGTGGAGAATGAGGACGATCCTCTCATACGGAAAATAGCCCTGCGGGAGATACGTATGCTGAAGAATCTGAAACATCCAAACTTGGTAAACCTGATCGAGGTGTTCCGAAGGAAGCGCAAGCTGCACCTCGTGTTCGAATACTGCGACCACACCGTACTGCACGAACTGGAAAAATATCCAGCTGGTTGTCCAGAACTGTTGTCGAAGCAGATTATCTGGCAAACGCTTCAGGGAGTCGCCTACTGCCACCGACACAACTGTATCCATCGAGACGTGAAACCCGAGAACATTCTGCTAACCAGCGACGGTGTCGTCAAATTGTGTGACTTCGGATTTGCGAGAATGATAAGTCCCGGCGAGAGCTACACGGACTACGTGGCGACCAGGTGGTACAGGGCTCCGGAACTGCTGGTCGGAGACACGCTCTACAGCACACCCGTCGACGTCTGGGCTATCGGCTGTGTGTTCGCCGAGTTACTGTCGAGCGAGGCTCTGTGGCCTGGCAAGAGCGACCTCGACCAGCTCTACCTGATCCGTAAGACCCTGGGCGACCTGCTGCCGCGACACATGACGATATTCTCGCAGAATACCTTCTTTCAGGGCATGGAGCTGCCGGAGCCGACGAGTCTGGAGCCCCTTGAGAAAAAAATACCGCCGCGCTACGCCAATAACGATTTGGTCTTAGACTTTTTAAAGGCGAGCTCTTTATACATAAAGTGCCTGGACAAGGATCCCTTGGCTCGCGCCACGTGTGAGCAGTTACTTCGTCACGCTATATTCGAGAATTTTCTATACGCCGTCCCGCGAACCGATCACGACGACTTCGAGAAGGCTAGGCGAGCACAGCAAGAATTCCAAGATGGCGCTGCAACAAATTTCGATGACGAGTCTACTTTAACTCCCACTAACGCCACCCATACTAAGAAGACTAAGAAGAAATGA

Protein sequence:

>DPOGS204039-PA
MRGKLNTSIISPYVITARSRASSRAMERYEKLAKLGEGSYGLVYKCRNRETGEVVAVKKFVENEDDPLIRKIALREIRMLKNLKHPNLVNLIEVFRRKRKLHLVFEYCDHTVLHELEKYPAGCPELLSKQIIWQTLQGVAYCHRHNCIHRDVKPENILLTSDGVVKLCDFGFARMISPGESYTDYVATRWYRAPELLVGDTLYSTPVDVWAIGCVFAELLSSEALWPGKSDLDQLYLIRKTLGDLLPRHMTIFSQNTFFQGMELPEPTSLEPLEKKIPPRYANNDLVLDFLKASSLYIKCLDKDPLARATCEQLLRHAIFENFLYAVPRTDHDDFEKARRAQQEFQDGAATNFDDESTLTPTNATHTKKTKKK-