Monarch geneset OGS2.0

DPOGS200830
TranscriptDPOGS200830-TA3189 bp
ProteinDPOGS200830-PA1062 aa
Genomic positionDPSCF300071 - 502561-509719
RNAseq coverage961x (Rank: top 13%)
Annotation
HeliconiusHMEL0114670.079.05% 
BombyxBGIBMGA009883-TA0.071.86% 
DrosophilaCdk12-PA0.054.50% 
EBI UniRef50UniRef50_Q7Q2R50.058.12%AGAP004780-PA n=2 Tax=Opisthokonta RepID=Q7Q2R5_ANOGA
NCBI RefSeqXP_397595.30.053.21%PREDICTED: similar to CG7597-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3784049240.072.41%cyclin dependent kinase 12 transcript variant A [Bombyx mori]
NCBI nr blastxgi|3784049240.074.42%cyclin dependent kinase 12 transcript variant A [Bombyx mori]
Group
Gene OntologyGO:00055249.3e-96ATP binding
GO:00046749.3e-96protein serine/threonine kinase activity
GO:00064689.3e-96protein phosphorylation
GO:00167729.4e-92transferase activity, transferring phosphorus-containing groups
GO:00046722.2e-72protein kinase activity
GO:00047131.8e-13protein tyrosine kinase activity
KEGG pathway 
InterPro domain[696-990] IPR0022909.3e-96Serine/threonine-protein kinase domain
[682-1021] IPR0110099.4e-92Protein kinase-like domain
[696-990] IPR0174422.2e-72Serine/threonine-protein kinase-like domain
[696-988] IPR0206351.8e-13Tyrosine-protein kinase, catalytic domain
Orthology groupMCL11496 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200830-TA
ATGGAGCGCGGTTACGAGAAGAGACACAGTAAGCACCACAAGGAAAAGCATAAGAAGCGAGCACACAAGAAGTATAGAAGTCATTCGGGCTCCCATGAACAGTCATATGCTACTAGTAACTCCATGAAACCCCTGGTGGAATATTCCGATGTCAGCTCCGAGGATCTGTCGGCCCCGGAGGCCGGTGAGATAGAGAGTGAAGCCAGTTCCATCGGTCGGCACATTGCAGACGACTTGGATAGAACGAGAAAATCCCACCCTATACGTACTTTCCTTGATAACAAGATATCTGTGACGACTAGCAGCCGACGTGTGGTCGAGGAATACAGCCTCACTCGTGATTCCTCTTCCACGGCGAGGAAGAGACGAGGCGTTGAATACGAGGAGCCAGTCCCAGACTTCGATGATCATTCCCGATATAAGAAGAAGAAGGAGAAGCGAAAGAAAGACAAGAAGAAGAAAAAGAAGAAGTCCAAACACAGGTCACGCTCAGCCAGTCTAGAGAGCTTGTCTCCTGGTGATATTGTACCAACTTGTACACCACTTCGGCCCCTGACTCCACAACGGTACACCCAGGTGCCAGTCAGCGAATGGGAAAAGGGATCATCTCCATTACGGAACGGTTCCTGTTCCCCAGTGTCGCCCTCAACACCTCCGCTGAGGCATGAGGTGTCACCTCGACACCGACCCATTCACCGTGAGCCATCTATGCACAATGTTATCCCATTCTCGCCACATCCAGAGAGATCTCCTATTATTAGGAGGAGGCAGAAGTCCACAACTCCTCACACACCCCTGGCGCCTCCCTACCATGAGACCGTTACTATTGATTCAGATACAGAACAGGACTACGACAGGAGACGCGACTACTGGCACGAACAGCGCATCTCTAATGAAATCATGGTTATATCTGACTCCCCGGTCCACGAATCGCGAAGCCGCGACTACAGTCCGAGACGTCATCGCCGCCGCTCGCCTCGGAGACGACGCAGTAGAGACAGGGAGAGACACAGGGAGAAGGTCCATCATCGATCTCACAGTAGGTCGTCATTGAAGCGTCGCCGTTCTGGGTCACGTTCGAGACGCCGTTCATCATCACCTCCACGACATCGTCCGAGACATTCAGACATCCCCAGAGACAGGCACAGGAGTAAACATGAATCCCCGAGTCCACCCACACTGTTACAGAGAAAAATTGACTTCAAAGAAAAGATTAGCGATACCAGTTTGTTTGCGGAACTTGTTAAAGACAAACACAAGCGTGCCAAGAAACTACAAGAAATATTAGAACAGAAAGAAGAATCACAGGGAGCTGCCTCAAGTAGTGCTTCAGTTACCAACACTGATGCGCCAACAGAAGACCTGGCCGATAATGCGGATAGCTTGCAGAGTTCAAAGGAGAATGGTGAGAACTCTAAGTCGGATGTAGTTGACATTCCAATGCCCGTGCAGACTTCAGAGGAATCCAATGCTCAACCAGCAGAGGCTGAGCCCCAAGCTGCCCAGCAAACAAACGCAACCACCAATGGGGAAGCTAATGAGAATTCTAATAGCTCTGAGACTCAAAAAGTTCCCGTCACACCGCCCGCCCCCCTACCAAGCGCCGCGTCCACTTTACCGCCCCTCCCAAAGGTGGGTGGGAGCGTGGTAGAGGGCGTCTTTATGAATAGCCAACAACCGCCTCCGCCGAAACCAAAAAGCCTTACGAAACTGCCCATGCCGCCCAATACTCAGGTCGAAGATTTAAAGACCCTCGCCAACGACAGTCCATTGAGCACGCCGTCCCCAAGTCCCGTCAAGAAACCGGAAAAACCTAAGAGAACCGGTATTATGAATTTACCTATGCCGCCAGTGATCCCTGGGTCAGAGGAGCTGAGCGGGGACGAATTGGACGGGTCCACTCCGCCCGCGCGTCCTCCCACACACAGGGATCAATACTCACACGTGTTCAGCGGCAGGAGGCAGGCCGATGCGGGATCTAAACTGAAGAGGCCGCGTATTCTGAAACGCCGAGGATCTAAGGTCGTCCCGGTAGCGACTCCGACACACCACGCCAAGGACTGGGGCGAGAAATGTGTGGACGGTTTCCAGGTAATAACTCAAATAGGCGAAGGCACGTACGGTCAAGTATATAAGGCGCGCGACAAAAACACGGCCCAACTCGTAGCACTGAAGAAGGTGAGGTTGGAGAACGAAAAAGAAGGCTTTCCCATCACCGCTGTCAGAGAAATTAAGATTCTGAGACAACTCAATCACAAGAACATTGTTAATCTGAGAGAGATCGTTACCGATAAACAGGACGCCATGGACTTTAGAAAGGACAAGGGCTCGTTCTATCTGGTGTTTGAATACATGGACCACGACCTCATGGGTCTCTTGGAGTCGAAGATGGTAGACTTCACCGAGTCACATAACGCATCCATAATGCGTCAGCTGTTAGACGGACTGGCGTACTGTCATCGCAAGAATTTTCTACATCGAGACATCAAATGCAGTAATATATTAATGAACAATAAAGGGGAAGTAAAGCTAGGGGATTTCGGTCTGGCTCGACTGTGGTCAGCCGAGGACAGAGCTCGGCCATACACTAACAAAGTTATCACACTGTGGTATAGACCACCAGAGCTATTGTTAGGAGAGGAACGATATGGCCCCGCTGTGGATGTGTGGTCCATGGGATGTATACTGGGCGAGCTGTTCCTGAAACACCCGCTGTTCCAGGCTAACACGGAGATGATGCAATTGGAGATGATATCTCGCATCTGTGGCACTCCAGCGCCGGGAGTGTGGCCCAACGTGGTGAAGCTGCCCCTCTGGCACACACTGCGGCCCAAACGTTTCCACAAGCGGTGTGTTCGGGAACAGTTTGCGTTTATGCCACCAGCGGCTCTACAGCTGTTGGATCGTATGCTGGAGTTAGATCCAGACAAGCGAATCACGGCGGACGACGCGCTGAAGAGCGTGTGGCTGAAGAATGTTGTCCCGGATCAAATGCCGGCCCCGGAGCTACCGACATGGCAAGACTGTCACGAGCTCTGGTCGAAGCAACGCCGGCGGCAGCAGAGGGAACAGGAACAGTCGTCGAAACCTAAATCCTATACATTCACACAGGATGAGAACTCTGAATCGAGTTTCAAAAGTGATATCAAGTCAGAGACTAACCCGGAACCTGTCGGACAAGTTAAATAG

Protein sequence:

>DPOGS200830-PA
MERGYEKRHSKHHKEKHKKRAHKKYRSHSGSHEQSYATSNSMKPLVEYSDVSSEDLSAPEAGEIESEASSIGRHIADDLDRTRKSHPIRTFLDNKISVTTSSRRVVEEYSLTRDSSSTARKRRGVEYEEPVPDFDDHSRYKKKKEKRKKDKKKKKKKSKHRSRSASLESLSPGDIVPTCTPLRPLTPQRYTQVPVSEWEKGSSPLRNGSCSPVSPSTPPLRHEVSPRHRPIHREPSMHNVIPFSPHPERSPIIRRRQKSTTPHTPLAPPYHETVTIDSDTEQDYDRRRDYWHEQRISNEIMVISDSPVHESRSRDYSPRRHRRRSPRRRRSRDRERHREKVHHRSHSRSSLKRRRSGSRSRRRSSSPPRHRPRHSDIPRDRHRSKHESPSPPTLLQRKIDFKEKISDTSLFAELVKDKHKRAKKLQEILEQKEESQGAASSSASVTNTDAPTEDLADNADSLQSSKENGENSKSDVVDIPMPVQTSEESNAQPAEAEPQAAQQTNATTNGEANENSNSSETQKVPVTPPAPLPSAASTLPPLPKVGGSVVEGVFMNSQQPPPPKPKSLTKLPMPPNTQVEDLKTLANDSPLSTPSPSPVKKPEKPKRTGIMNLPMPPVIPGSEELSGDELDGSTPPARPPTHRDQYSHVFSGRRQADAGSKLKRPRILKRRGSKVVPVATPTHHAKDWGEKCVDGFQVITQIGEGTYGQVYKARDKNTAQLVALKKVRLENEKEGFPITAVREIKILRQLNHKNIVNLREIVTDKQDAMDFRKDKGSFYLVFEYMDHDLMGLLESKMVDFTESHNASIMRQLLDGLAYCHRKNFLHRDIKCSNILMNNKGEVKLGDFGLARLWSAEDRARPYTNKVITLWYRPPELLLGEERYGPAVDVWSMGCILGELFLKHPLFQANTEMMQLEMISRICGTPAPGVWPNVVKLPLWHTLRPKRFHKRCVREQFAFMPPAALQLLDRMLELDPDKRITADDALKSVWLKNVVPDQMPAPELPTWQDCHELWSKQRRRQQREQEQSSKPKSYTFTQDENSESSFKSDIKSETNPEPVGQVK-