Monarch geneset OGS2.0

DPOGS209949
TranscriptDPOGS209949-TA1149 bp
ProteinDPOGS209949-PA382 aa
Genomic positionDPSCF300148 - 314551-316211
RNAseq coverage1385x (Rank: top 9%)
Annotation
HeliconiusHMEL0135491e-15778.38% 
BombyxBGIBMGA011338-TA3e-13391.98% 
DrosophilaCycK-PA2e-11567.68% 
EBI UniRef50UniRef50_G6CNN00.0100.00%Putative cyclin k n=3 Tax=Coelomata RepID=G6CNN0_DANPL
NCBI RefSeqXP_001607256.19e-13163.10%PREDICTED: similar to cyclin k [Nasonia vitripennis]
NCBI nr blastpgi|3454958532e-12963.10%PREDICTED: cyclin-K-like [Nasonia vitripennis]
NCBI nr blastxgi|3454958537e-13661.89%PREDICTED: cyclin-K-like [Nasonia vitripennis]
Group
Gene OntologyGO:00063551.3e-153regulation of transcription, DNA-dependent
GO:00199011.3e-153protein kinase binding
GO:00000791.3e-153regulation of cyclin-dependent protein kinase activity
KEGG pathway 
InterPro domain[2-377] IPR0154291.3e-153Cyclin C/H/T/L
[3-173] IPR0137631.3e-52Cyclin-like
[3-135] IPR0066713.7e-17Cyclin, N-terminal
Orthology groupMCL12199 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209949-TA
ATGCCTTATTGGTATTATGATAAAAAAGATTTGCAAAACACACCATCATTTCGTGATGGGATCCCAAATGAAACAGAAAACCGTTATCGAAAAGAGGGAGCGAGGTTCATCATCGACACTGGTTCGAAAATGGACTTAGGTTATAACACTGTTGCGACTGGTGTTGTTTATTTTCATCGCTTCTACATGTTTCAATCGTTCAGGACATTCCCGAGATACATAACCGCTTGTTGCTGTCTGTTTCTGGCTGGTAAAGTGGAGGAGACGCCAAAGAAATGTAAAGATATTATTAAAGTAGCAAAATCGTTGTTGACAGAAGAAAAGTTTAGTTCTTTCGGAGAGGACCCTAAGGAGGAAGTTATGACATTAGAAAGAATATTGCTGCAAACGATCAAGTTCGATCTGCAGGTGGAGCATCCTTATGGGTACCTTCTGAAGTATGCAAAGTGCCTTAAAGGGGATAAAGCGAAGTTACCGAAAATGGTACAAATGGCCTGGACTTTCGTTAACGATAGTCTATGTACAACCCTGTGCCTCCAGTGGGAGCCAGAGGTGATAGCTGTGGCACTGTTGTTCTTGGCGGGGAAGTTAAGCAAGTTTGAGGTCGCCGACTGGAATGGACGATCAGCAAAACATTCAGCCTGGTGGGACATGTTTGTGGAAGACATCACGATGGAGTTGCTCGAGGATATTTGTCACCAAGTGCTGGATCTCTACTCGCCACAGACTCAGCCGTCGGGGAGTGACTCTCCTCCCGTCGCATCTTCTACTAAACTTCCAAAAAACGACAAGTTGTCCGTTACACCGCCCACTTCAGCATCGCCTGTCATCGTCCCCCCTAAGCCCGCGGTGACTCCTCTCAAGAACGGAGCGGACATGAAGCCCGAGCTGGCCAAGATGGACATGAGGTTCACGTACCCCGGCTACCCGGGGCTGCCGGGCTACCAGGCCGTGTACACTGCGCCGCCCCCCGCGCTGCCCGCCCATCCTCCGCCGCCGCTAGTGTACGCGGAGCCTCCGCCCGCGCCGCCGGGGGCTCGCTTCCCGCCCGTCAACGTACCGCCGCCCAACTTCTTTCCCCCTCTAGGAAAGCGCCCGCCGCCGCCTCGGGCTCCGCTGCCGCCCCGCCCCTACTACCCGCCGCCGTGA

Protein sequence:

>DPOGS209949-PA
MPYWYYDKKDLQNTPSFRDGIPNETENRYRKEGARFIIDTGSKMDLGYNTVATGVVYFHRFYMFQSFRTFPRYITACCCLFLAGKVEETPKKCKDIIKVAKSLLTEEKFSSFGEDPKEEVMTLERILLQTIKFDLQVEHPYGYLLKYAKCLKGDKAKLPKMVQMAWTFVNDSLCTTLCLQWEPEVIAVALLFLAGKLSKFEVADWNGRSAKHSAWWDMFVEDITMELLEDICHQVLDLYSPQTQPSGSDSPPVASSTKLPKNDKLSVTPPTSASPVIVPPKPAVTPLKNGADMKPELAKMDMRFTYPGYPGLPGYQAVYTAPPPALPAHPPPPLVYAEPPPAPPGARFPPVNVPPPNFFPPLGKRPPPPRAPLPPRPYYPPP-