Monarch geneset OGS2.0

DPOGS202214
TranscriptDPOGS202214-TA1521 bp
ProteinDPOGS202214-PA506 aa
Genomic positionDPSCF300149 + 203864-210434
RNAseq coverage412x (Rank: top 29%)
Annotation
HeliconiusHMEL0092125e-17770.61% 
BombyxBGIBMGA013507-TA4e-16568.38% 
DrosophilaCycE-PF2e-9146.70% 
EBI UniRef50UniRef50_E5Q8K31e-17571.66%Cyclin E splice variant 4 n=6 Tax=Bombyx mori RepID=E5Q8K3_BOMMO
NCBI RefSeqXP_002090013.12e-9046.48%GE19392 [Drosophila yakuba]
NCBI nr blastpgi|3129643265e-17571.66%cyclin E splice variant 4 [Bombyx mori]
NCBI nr blastxgi|3129643260.071.87%cyclin E splice variant 4 [Bombyx mori]
Group
Gene OntologyGO:00517268.4e-13regulation of cell cycle
GO:00199018.4e-13protein kinase binding
GO:00000798.4e-13regulation of cyclin-dependent protein kinase activity
GO:00056343.4e-07nucleus
KEGG pathwaydya:Dyak_GE193927e-90 
 K06626 (CCNE)maps-> Small cell lung cancer
    Pathways in cancer
    Prostate cancer
    p53 signaling pathway
    Cell cycle
    Oocyte meiosis
InterPro domain[289-464] IPR0137633.7e-40Cyclin-like
[167-295] IPR0066712.4e-38Cyclin, N-terminal
[21-450] IPR0144008.4e-13Cyclin A/B/D/E
[349-410] IPR0043673.4e-07Cyclin, C-terminal
Orthology groupMCL15236 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202214-TA
ATGGCACAAACAGGGTATGTACAGCTGATTGCACTCTTGTACAATACTGACCTTGGTATCATGACTACTAAATCCCTAAATCATTTTAGATCCTCCTGTGAAGCCAACGAGAAGAGGATATGTTTGAAGAGGAAAAGAAATTCAACAGATGATGAGGTAGAGAACATGCCACCTATAAAAATAACATCAAAGTTAGAGGAAGATGTATGTGAACTCCCACCACACACAGTTGTTGAGTCATCTTCGTGTTCGAGTGATGATGAAGGCCCAGAGCAGCCGAGGAGTGTGTTCACAGACTTAGATTATAGTGCAGATAGTTTCTTAAGCCCACCGAGTATACCTGAGCTACCCAATTGCGTCCTGAGTCCTCTTGAGAATGTTGCCAGAGGAGAATGCACCCCACACTCCAATAAAAGACCTAGCACAAGCAAGGTATATCCCACTCCACCGAAGCGCAAATGTCCCCTGCCTGGGCTGTCCTGGGCGGATCCCAGTGATGTGTGGAAGAGCATGTGTGAGGTGGACGCCAGGTCCACAATGATGAAGAATCCCAACATGTTTGACAACCACCCCAACCTCCAGCCTAGGATGAGAGCCATCTTGTTGGACTGGNAGATAACTTGGGTGTGTGAAGTATACAAGCTCCACCGAGAGACCTTCCACCTGACCGTGGACTACGTGGACCGCTACCTCTCCAACACTGAGGACGTGCAGAAAGGAAGACTACAGCTCATAGGTATAACCTGCTTATTCATAGCCGCGAAGGTCGAGGAAGTGTATCCGCCGAAGATAGGCGAGTTCGCTTACGTGACGGACGGCGCGTGCACTACGGACGAGATCCTGCTGGAGGAGCTGCTCATACTGAAGATACTGTCCTGGAGCATCACGCCCATCACCATCAACAGCTGGCTGAACATATACATGCAACTGGCCAGCGAAGGCCGCAGCGCCAAGAGGAGGCTGCTCAGTGAGAGCGATCTGGGGATCAATGCGTTACGGGGCTACACTTTTGTGTTCCCACAATACTCGTCGTTGGAGTCCGTGGCGTGCGGACAGCTGGTAGACTTGGCGGTGCTGCACGCTGACGTCACAAGATACGCCTCCAGCGCTGTAGCCGCGGCCGCCGTGGCGCACGCCTTCAATGTAGACCTGGCTTTAAGGGTATCAGGTTACAGCTGGTCGTCATTAGAGCCTTGCTACACGTGGCTGGCTCCGTTCGTGTCCGCCGTGCGCGAGGCGGGCTGCGTGGTGGGTGTGCGCGGCGGGGACGGCGACCACGTGCAGCGAGCCGCGGGACTGAGGCTCATCTGTCCCGACCTCAACCTGGACGAGAGTCACCGCATACAGACACACAACGTCTCACTCGACATGTTTGACAAAGTATACCAACAGCTGGCGGAGCAGCCGGAGACATCGGCGGGGTCCGAGTTGCACGCCTTCCCGCCCACTCCGCCCCACTCCGACCACAAGAGTCCCCACACGCCCGCCGCCAAGACCCCCTCCACGCGGACCTGCGAGTGA

Protein sequence:

>DPOGS202214-PA
MAQTGYVQLIALLYNTDLGIMTTKSLNHFRSSCEANEKRICLKRKRNSTDDEVENMPPIKITSKLEEDVCELPPHTVVESSSCSSDDEGPEQPRSVFTDLDYSADSFLSPPSIPELPNCVLSPLENVARGECTPHSNKRPSTSKVYPTPPKRKCPLPGLSWADPSDVWKSMCEVDARSTMMKNPNMFDNHPNLQPRMRAILLDWXITWVCEVYKLHRETFHLTVDYVDRYLSNTEDVQKGRLQLIGITCLFIAAKVEEVYPPKIGEFAYVTDGACTTDEILLEELLILKILSWSITPITINSWLNIYMQLASEGRSAKRRLLSESDLGINALRGYTFVFPQYSSLESVACGQLVDLAVLHADVTRYASSAVAAAAVAHAFNVDLALRVSGYSWSSLEPCYTWLAPFVSAVREAGCVVGVRGGDGDHVQRAAGLRLICPDLNLDESHRIQTHNVSLDMFDKVYQQLAEQPETSAGSELHAFPPTPPHSDHKSPHTPAAKTPSTRTCE-