Monarch geneset OGS2.0

DPOGS204246
Transcript        DPOGS204246-TA  807 bp
Protein           DPOGS204246-PA  268 aa
Genomic position  DPSCF300046 - 450229-451431
RNAseq coverage   156x (Rank: top 52%)
Annotation
Heliconius      HMEL015178        8e-156  97.39%
Bombyx          BGIBMGA007520-TA  2e-17   93.02%
Drosophila      CycC-PA           5e-123  80.60%
EBI UniRef50    UniRef50_P25008   6e-121  80.60%  Cyclin-C n=31 Tax=Bilateria RepID=CCNC_DROME
NCBI RefSeq     XP_001663584.1    1e-135  87.31%  g1/s-specific cyclin c [Aedes aegypti]
NCBI nr blastp  gi|312380568      2e-135  87.69%  hypothetical protein AND_07358 [Anopheles darlingi]
NCBI nr blastx  gi|312380568      2e-133  87.69%  hypothetical protein AND_07358 [Anopheles darlingi]
Group
Gene Ontology  GO:0006355  9.2e-171  regulation of transcription, DNA-dependent
               GO:0019901  9.2e-171  protein kinase binding
               GO:0000079  9.2e-171  regulation of cyclin-dependent protein kinase activity
               GO:0051726  4.9e-163  regulation of cell cycle
               GO:0016591  4.9e-163  DNA-directed RNA polymerase II, holoenzyme
KEGG pathway  ang:An15g06050  7e-27
              K06634 (CCNH) maps-> Nucleotide excision repair
                                   Cell cycle
InterPro domain  [1-268]   IPR015429  9.2e-171  Cyclin C/H/T/L
                 [1-268]   IPR023598  4.9e-163  Cyclin C/H
                 [1-185]   IPR013763  5.5e-56   Cyclin-like
                 [31-149]  IPR006671  1.7e-14   Cyclin, N-terminal
Orthology group  MCL13733  Single-copy universal gene

Nucleotide sequence:

>DPOGS204246-TA
ATGGCTGGAAACTTTTGGCAAAGTTCTCACCATCAGCAATGGATTTTAGACAAACAAGATCTCATACGAGATCGCCAACACGATTTAGCTAAATTGACGGAAGAGGAATATCAAAAAATATTCAATTTCTTTGCTAGCATAATACAAGTATTAGGCGAACAGTTAAAATTGCGTCAGCAAGTAATAGCGACCGCTACTGTTTACTTTAAAAGATTCTATGCAAGAAACTCCTTGAAATGCATAGATCCGTTGCTTCTAGCTCCTACCTGCGTTTTTTTGGCGTCAAAAGTAGAAGAATTTGGTGTTATTTCTAATTCAAGATTAATAACAACATGTCAAACAGTTATCAAGAATAAATTCAGTTATGCTTATGGTCAGCAGGAGTTTCCTTACAGAACAAATCATATTTTGGAATGTGAATTCTATTTACTGGAGAACTTAGACTGTTGTCTGATTGTATATCAACCATACAGACCCCTGTTACTCTTTGTTCAAGACATTGGCCAAGATGATCAACTTCTTACCTATGCTTGGAGAATTGTTAATGACTCTTTACGAACTGATGTTAGCTTACTATATCCCCCATATCAGATTGCGATAGGAGCACTTCATATTGCATGTGTAATGCTAGGCAAGGAAAATTTAAAGCCTTGGTTCGCCGAATTGAATGTTGACATGGACAAGATCCAAGAAATAGTAAGGTTAATTATCAACTTATATGAAATGTGGAAAAGTTATGATGAAAAGAAAGAAATCCAGGGCCTCTTAGGAAAAATGCCAAAACCAAGTCCAGCACCTCAAAGATAA

Protein sequence:

>DPOGS204246-PA
MAGNFWQSSHHQQWILDKQDLIRDRQHDLAKLTEEEYQKIFNFFASIIQVLGEQLKLRQQVIATATVYFKRFYARNSLKCIDPLLLAPTCVFLASKVEEFGVISNSRLITTCQTVIKNKFSYAYGQQEFPYRTNHILECEFYLLENLDCCLIVYQPYRPLLLFVQDIGQDDQLLTYAWRIVNDSLRTDVSLLYPPYQIAIGALHIACVMLGKENLKPWFAELNVDMDKIQEIVRLIINLYEMWKSYDEKKEIQGLLGKMPKPSPAPQR-
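
The transcript and protein lengths in this record (807 bp and 268 aa) are related by codon arithmetic: 807 / 3 = 269 codons, of which the last is a stop codon, shown as "-" at the end of the protein sequence. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) and a transcript that is a complete coding sequence with no untranslated regions. The function names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration; they are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order so they line up with the amino acid string below.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; this record uses "-".
    Assumes an unambiguous DNA sequence (A, C, G, T only).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residue count for a CDS-only transcript ending in one stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1  # the final codon is the stop


if __name__ == "__main__":
    # First 18 and last 12 nucleotides of DPOGS204246-TA, copied from above.
    print(translate("ATGGCTGGAAACTTTTGG"))
    print(translate("CCTCAAAGATAA"))
    print(expected_protein_length(807))
```

Passing the full DPOGS204246-TA sequence to `translate` and comparing the result with DPOGS204246-PA is one way to confirm that the two FASTA entries agree.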