Monarch geneset OGS2.0

DPOGS211318
TranscriptDPOGS211318-TA939 bp
ProteinDPOGS211318-PA312 aa
Genomic positionDPSCF300125 + 64576-65930
RNAseq coverage23x (Rank: top 78%)
Annotation
HeliconiusHMEL0093621e-10367.24% 
BombyxBGIBMGA013453-TA2e-6142.12% 
DrosophilaCG6800-PA1e-7246.45% 
EBI UniRef50UniRef50_G3N6X08e-9455.48%Uncharacterized protein n=6 Tax=Bilateria RepID=G3N6X0_GASAC
NCBI RefSeqXP_002741922.12e-10057.28%PREDICTED: cyclin-dependent kinase 20-like isoform 1 [Saccoglossus kowalevskii]
NCBI nr blastpgi|2949564842e-9957.81%novel protein similar to H.sapiens cell cycle related kinase (CCRK, zgc:101530) [Danio rerio]
NCBI nr blastxgi|1565463002e-9855.88%PREDICTED: cyclin-dependent kinase 20-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055243.2e-83ATP binding
GO:00046743.2e-83protein serine/threonine kinase activity
GO:00064683.2e-83protein phosphorylation
GO:00167721.8e-82transferase activity, transferring phosphorus-containing groups
GO:00046722e-67protein kinase activity
GO:00047132.1e-10protein tyrosine kinase activity
KEGG pathway 
InterPro domain[8-291] IPR0022903.2e-83Serine/threonine-protein kinase domain
[5-303] IPR0110091.8e-82Protein kinase-like domain
[8-291] IPR0174422e-67Serine/threonine-protein kinase-like domain
[8-291] IPR0206352.1e-10Tyrosine-protein kinase, catalytic domain
Orthology groupMCL14894 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211318-TA
ATGGCGAGTAATATAACAAATTATTCCGTCCTGGGACGTATCGGTGAAGGAGCTCATGGTCTTGTGTTTAAGGCTCGGCATCTTCCTACTGGTCGTGATGTAGCTTTGAAGAAGATTCTAATAAAAAATTTGGAGGATGGTATTCCTCTTAATGTAATGCGAGAAATAAAAGCACTGCAGTTACTTCGGTGTAAATATGTTATAAAATTATACGATATGTTTCCCCGTGGAATGTGCTTAGTGTTAGTATTGGAGTATATGTGCTCTGGACTATGGGAAATGCTACATCAAAAACAACAGGAGTTGACACTACCAAGAGTGAAAACTTACGCCCAAATGTTACTGAAAGGGACACGTTACATGCACGCACACTATGTTATGCATAGGGACCTTAAACCTGCAAATTTGCTAATCAATCATGAGGGTATACTTAAAATAGCGGACTTGGGACTTGCTCGTTTGTATTGGCCTGATGGTGGAAGACCTTATTCACATCAAGTAGCGACGAGATGGTACCGCGCACCAGAACTTTTATACGGCGCTAGATATTATAGTCAAAATGTAGACATATGGGCTGTCGGATGTATTATAGCTGAAATGATCACAAAACAACCGCTCTTTGCAGGAGAGTCTGATATTGAACAATTGGCAATAGTCTTACAACGTCTTGGTACCCCCACTGAGGAGACGTGGCCGAAGCACTCGGAATTGCCGGATTACCACAAGATAACATTTCCCGAATCATCGCCTATGCCGTGGACGGAACTTTTACCAGGAGTCGAACCTGACGCTATTCATCTCATCAAATCTTTCATACTCTACGACGCACAAAAGAGAATATCAGCTAAAGAGGCTCTAAATCATCCTTGGTTCCACACAAAACCACTGCCAGCCGCACTAGAGGACATGCCCAAAGCCAACACAATCAAAACAAAATGA

Protein sequence:

>DPOGS211318-PA
MASNITNYSVLGRIGEGAHGLVFKARHLPTGRDVALKKILIKNLEDGIPLNVMREIKALQLLRCKYVIKLYDMFPRGMCLVLVLEYMCSGLWEMLHQKQQELTLPRVKTYAQMLLKGTRYMHAHYVMHRDLKPANLLINHEGILKIADLGLARLYWPDGGRPYSHQVATRWYRAPELLYGARYYSQNVDIWAVGCIIAEMITKQPLFAGESDIEQLAIVLQRLGTPTEETWPKHSELPDYHKITFPESSPMPWTELLPGVEPDAIHLIKSFILYDAQKRISAKEALNHPWFHTKPLPAALEDMPKANTIKTK-