Monarch geneset OGS2.0

DPOGS210561
Transcript: DPOGS210561-TA (1017 bp)
Protein: DPOGS210561-PA (338 aa)
Genomic position: DPSCF300304 + 184058-186287
RNAseq coverage: 270x (Rank: top 40%)
Annotation
Heliconius       HMEL009554         5e-135   93.88%
Bombyx           BGIBMGA013453-TA   5e-133   87.92%
Drosophila       cdc2c-PC           7e-95    66.12%
EBI UniRef50     UniRef50_P24941    2e-105   69.14%   Cyclin-dependent kinase 2 n=105 Tax=Eukaryota RepID=CDK2_HUMAN
NCBI RefSeq      XP_393450.3        2e-105   68.60%   PREDICTED: similar to cyclin-dependent kinase 2 [Apis mellifera]
NCBI nr blastp   gi|378404922       3e-130   87.92%   cyclin dependent kinase 2 [Bombyx mori]
NCBI nr blastx   gi|378404922       3e-139   87.92%   cyclin dependent kinase 2 [Bombyx mori]
Group
Gene Ontology
GO:0016772   2.1e-82   transferase activity, transferring phosphorus-containing groups
GO:0005524   1.3e-79   ATP binding
GO:0004674   1.3e-79   protein serine/threonine kinase activity
GO:0006468   1.3e-79   protein phosphorylation
GO:0004672   1.1e-66   protein kinase activity
GO:0004713   7.3e-11   protein tyrosine kinase activity
KEGG pathway: gga:429642   1e-106
K02206 (CDK2) maps to:
    Small cell lung cancer
    Pathways in cancer
    Prostate cancer
    p53 signaling pathway
    Progesterone-mediated oocyte maturation
    Cell cycle
    Oocyte meiosis
InterPro domain
[2-269]   IPR011009   2.1e-82   Protein kinase-like domain
[4-274]   IPR002290   1.3e-79   Serine/threonine-protein kinase domain
[5-227]   IPR017442   1.1e-66   Serine/threonine-protein kinase-like domain
[4-249]   IPR020635   7.3e-11   Tyrosine-protein kinase, catalytic domain
Orthology group: MCL11145, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210561-TA
ATGGAAAACTTTTCGAGAGTTGAAAAAATTGGTGAAGGAACATATGGCGTAGTTTATAAGGCAAGAGACAAAGTAACCGGTAAAGAAATTGCTCTGAAGAAAATAAAACTTGAAAATGAACCTGAGGGGGTCCCATCAACTGCACTCAGAGAGATATCAGTATTGCGGGAGTTAAAGCACCCGGCTGTGGTGCGCTTACTAGATGTGCTGCTGGCAGACACAAAGCTGTTTCTGGTCTTTGAGTTTTTACACATGGACCTCAAACGGCTTATGGATATAACCAAAGGACCACTGCAGCTGGATCTTGTGAAGAGTTACTTGCGACAGCTGTTAGAGGGAGTTGCCTACTGTCACGCACATCGTGTGTTACATCGCGACCTCAAGCCACAGAATCTTCTGGTGGATGTGGAGGGTCACATAAAGCTGGCCGACTTCGGTCTAGCGCGCGCCTTCGGTATACCCGTGCGCGCTTACACTCACGAGGTGGTGACGCTCTGGTACCGGGCGCCCGAGATCCTGCTAGGAGCCAAGTTCTACTCCACTGCCGTCGACGTCTGGAGTCTCGCCTGCATCTACGCCGAGATGGCCAGCGGCAGGACGCTATTTCCGGGCGACAGCGAGATCGATCAGCTGTTCAGAGTGTTCCGCGCCCTGGGTACGCCCGGCGAGGACGTGTGGCCCGGAGCGCGCCTGCTGCCCGACTACCGCGCCGCCTTTCCTCGCTGGCCGCGCCGCGAGGCCCGCCTGCTTCTGCCGGCGGCGGACTCGGGCTCCGCCGCACGCCCCACACGCCGCGAGCCTGTTCGAGAGCATGTTGCGGTACGAGCCGAGCGAGCGCGTGTCGGCGCGCGCCGCCCTGCTGCACCCTTACCTGTCGAACGCGCGCCTGGTGCCGCCCGCCCTGCCGCCGCAGCATTCTCCGCCTTCCACCGACTGCGCCGATCGTCCCGACGACTTCTGATCTCCGTCGAGTTGATTTCGTTGTCACCTGTAAATAAATATTTATATTTTTTTTAA

Protein sequence:

>DPOGS210561-PA
MENFSRVEKIGEGTYGVVYKARDKVTGKEIALKKIKLENEPEGVPSTALREISVLRELKHPAVVRLLDVLLADTKLFLVFEFLHMDLKRLMDITKGPLQLDLVKSYLRQLLEGVAYCHAHRVLHRDLKPQNLLVDVEGHIKLADFGLARAFGIPVRAYTHEVVTLWYRAPEILLGAKFYSTAVDVWSLACIYAEMASGRTLFPGDSEIDQLFRVFRALGTPGEDVWPGARLLPDYRAAFPRWPRREARLLLPAADSGSAARPTRREPVREHVAVRAERARVGARRPAAPLPVERAPGAARPAAAAFSAFHRLRRSSRRLLISVELISLSPVNKYLYFF-
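
Consistency check (illustrative sketch, not part of the original record):

The record lists a 1017 bp transcript and a 338 aa protein, and the protein sequence ends in "-", which appears to mark the stop codon. The Python sketch below shows how those two lengths relate and translates the first 30 nucleotides of DPOGS210561-TA. It assumes the standard genetic code (NCBI translation table 1); the names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon.

    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First 30 nucleotides of DPOGS210561-TA, copied from the record above.
prefix = "ATGGAAAACTTTTCGAGAGTTGAAAAAATT"

print(translate(prefix))              # MENFSRVEKI
print(expected_protein_length(1017))  # 338
```

The translated prefix matches the first ten residues of DPOGS210561-PA, and 1017 bp corresponds to 338 codons plus one stop codon, which agrees with the listed protein length. Pasting the full FASTA sequence into `translate` would extend the same check to the whole transcript.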