Monarch geneset OGS2.0

DPOGS213784
Transcript:        DPOGS213784-TA  3066 bp
Protein:           DPOGS213784-PA  1021 aa
Genomic position:  DPSCF300212 + 546405-559287
RNAseq coverage:   3812x (Rank: top 3%)
Annotation
Heliconius:      HMEL003132        0.0  77.41%
Bombyx:          BGIBMGA009267-TA  0.0  63.53%
Drosophila:      hipk-PB           0.0  62.46%
EBI UniRef50:    UniRef50_D1ZZH6   0.0  80.39%  Putative uncharacterized protein GLEAN_07452 n=2 Tax=Tribolium castaneum RepID=D1ZZH6_TRICA
NCBI RefSeq:     XP_972748.1       0.0  80.39%  PREDICTED: similar to GA14321-PA [Tribolium castaneum]
NCBI nr blastp:  gi|270005401      0.0  80.39%  hypothetical protein TcasGA2_TC007452 [Tribolium castaneum]
NCBI nr blastx:  gi|340710684      0.0  47.65%  PREDICTED: homeodomain-interacting protein kinase 2-like [Bombus terrestris]
Group
Gene Ontology:
  GO:0016772  3.8e-80  transferase activity, transferring phosphorus-containing groups
  GO:0005524  5.5e-79  ATP binding
  GO:0004674  5.5e-79  protein serine/threonine kinase activity
  GO:0006468  5.5e-79  protein phosphorylation
  GO:0004672  2.1e-54  protein kinase activity
  GO:0004713  2.4e-05  protein tyrosine kinase activity
KEGG pathway:
InterPro domain:
  [88-431]   IPR011009  3.8e-80  Protein kinase-like domain
  [102-430]  IPR002290  5.5e-79  Serine/threonine-protein kinase domain
  [102-430]  IPR017442  2.1e-54  Serine/threonine-protein kinase-like domain
Orthology group:  MCL10691  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213784-TA
ATGGATTTGTCCATACCTAGACTTAAGTTTGAAACAGAAGATTACTTTGACGATGATTACGAGCTCGGACAAACTGTGGCAGATAGCGGGTCAGGAGTGGCGGCGGGAGTGGCGGCTGTGGCGACTGTGGCGGCGGGTGTGGCTGTGGCGGGTCGGCAGCGGACGGGCGGCACGACGACGGCGCACAGCAAGCAACTGGCGGCGGCCGGGGCCAACCCGGCCCAGCCGCAAGGCTCCGGCTGTGGCGGAGATGGGGACTACCAGCTGGTGCAGCATGAGGTGCTCTACTCCTCGTCCAATAGGTACGAGGTGTTGGAGTTCCTCGGCCGCGGAACCTTCGGCCAGGTGGTGAAGTGTTGGAAGAAGGGCACCAACGAAATAGTCGCCATCAAGATACTCAAGAACCATCCCAGCTATGCCAGGCAGGGGCAGATTGAGGTGTCGATCCTGTCGCGCCTGTCCCAGGAGTCTGCGGACGAGTTCAACTTCGTCCGAGCGTACGAATGTTTCCAGCACCGCTCTCACACCTGCCTCGTGTTCGAGATGCTGGAACAGAACCTGTACGACTTCTTGAAACAGAACAAGTTCTCGCCGCTGCCCCTCAAGTACATCAGGCCCATCCTGCAGCAGGTCCTCACCGCGCTCTTGAAACTTAAACAACTCGGTTTGATCCACGCTGATCTGAAGCCTGAGAACATAATGCTGGTGGAGCCGGCGCGCCAGCCCTACAGGGTCAAGGTCATAGACTTCGGCAGCGCCTCGCACGTCAGCAAGGCCGTCTGCAACACATACCTCCAGAGCAGATACTATAGGGCCCCCGAGATAATCCTGGGTCTAGCGTTCTGTGAGGCGATCGACATGTGGTCCCTGGGCTGTGTGGTCGCCGAGCTGTTCCTGGGCTGGCCGCTGTACCCGGGCTCGTCGGAGTACGATCAGATCAGATACATATCACAGACCCAGGGTCTACCCACCGAACATATGCTTAACAGCGCATCAAAGACAGCCAAGTTTTTCTATCGCGACGTGGACAGTACTTATCCGTTTTGGAGGCTGAAGACTCCCGAGGAACACGAGCTGGAGACCGGCATCAAGAGCAAGGAGGCCAGGAAGTACATCTTCAACTGTCTCGACGACATAGGACAGGTAAACGTTCCGACGGATCTCGAAGGCGGACAACTGCTGGCCGAGAAAGCGGACAGGAGGGAATTCATAGACTTACTGAAGAGGATGCTTACTATGGACCAGGAGCGTCGCATCACCCCCGGGGAGGCCCTGAACCACGCCTTCGTCACCCTCGCACACCTCGTGGACTACGCGCACTGCAATAATGTTAAGGCGTCGGTACAGATGATGGAGGTCTGCCGTCGCTCAGGTAGTGGCGTGGGCGGAGTGGGCGGCGTGGGCGGTGTAGGTGGCGAGTACGTGGGTGGCGTGGTGGCGCCCGCCGCGCCCACCGCTCACCATCTCGCGCTCACCATCAACCAGCAGCGACTCAGAGCTGCTCCATACGATAACCTGTACCAGTTGTATGGCGGCGGGCGGGTGGCGGTGGGCGGCGGCAGACAGTTCACGCGGGCCCCGGACCCCTTCCCTCATCAGTTCGTGTCGTCCATCCTGTGTCAGCCGGCCTACCAGGTTCAACAACTGGCTGCAGCGGCGGCCGCAGCGGCAGCGGCCCACCACCAGCATGGTGTAGGCGGCGTGGGCGGAGTGGGCGTCGGCGTCGGCGTAGGCGACGTGGGCGAGTGGCGCGCGCCGCTGATTGTGGAGCCAGCGCCACTTCCTGAACCGGAAGTGTGGGACTTCCATCACCATCACCCGCACCATCATCACCCTCACAAGAGATCGACGAAGCAACAAACGAGCGGCGTGCCTCACTACCAGCCGCCGCACCAAACGCATCAATACAGTATGGCGAGCGCCGGCGGCAAGAAAGAGGCGACGCAGCTCTCCCCCGTCAAGAAGAGGGTGAAGGAAGGCACGCCGCCCTCCAGGCACAG
GCAGGAAAACACTGGTCGGTGTAACCAGCCCGTGGCAGGGTGGGAGGCGGGCGGCGCTCACACCATCACCATCAGAGACACGCCGTCACCCGCCGTGTCCGTCATCACCATCTCGGACAGTGACGACGATGATGGCCGACCAGACAGGGGAGGAGTTCTATTATCCCCAACCTCCCCTCCAGGTGTCAACCCGCCGGCGATGGTGGCGCCTCCACCGGCGAGTGTGATCTGTAACGCGAGGACTCCGCGAGACGACCGACACGTGAAGCATGAGCCGCATCACACGCACCAACCCTGCCACCATCACACGCAGCCGCAGCAACCTCAACAACAGACGCAGCAACAACAGCAACAACCTCAGCAGCAACAGCAACAGCAACAACAGCAACAGCCCCGCCAGCCGGCCGGGCCTCAGGTGACGCCACAGAGCCAGAAGAAACGTCTGCTAGCTCTGGCGCAGCAGGAACAGGCGAAGCTGGAACATCACGATCTGCAACACTACCACCAACAGAGCGGGTCGAGCGTCCGGCCGAGCGACTACCAGTGTAACCAATATGTGCCGACACAACACAACAAGAGGGATAGCAGCGGTGGTTGTCGCGAGTACCTGCCCGTGAGCGCGGCCGTGAGCGGCGTCAGCGGGGTGACGGCAGTGAGCGGCGTGGGCGTTGGCGGCGTGGGCATGAGCGTGAACGTGGGCCCGCCCGCCGCACACAAACACGCACACGCCTGGCACCACCATCCACATCCACACTATAAGAGCAGTGTGGGCAGTGGTGTTGGCGTGGGCGTCGGCGTGGGCGCAGGCGTGGGAGTGGGCGGCGGCGTCCTCTCCCCGGGCGGCGTGTCCCCCGGGTACCTGGCACCCCCGCTATACGTACCCACCTACCACTATCACACTGGCGGTGTTAGCGGCGTGGGCGGTGTGGGAGGCGTGGGCGCCGTGGGCGGTCCTCCTCCCGCCCATCACGGCAGCGGCGCGCGGCTCCCGGGCTACCTGCAGCCGTACTCCCCCACATACCTCGTGCCGCAACATCCGCAGCAGCACCTCTGGTACACGGACTGA

Protein sequence:

>DPOGS213784-PA
MDLSIPRLKFETEDYFDDDYELGQTVADSGSGVAAGVAAVATVAAGVAVAGRQRTGGTTTAHSKQLAAAGANPAQPQGSGCGGDGDYQLVQHEVLYSSSNRYEVLEFLGRGTFGQVVKCWKKGTNEIVAIKILKNHPSYARQGQIEVSILSRLSQESADEFNFVRAYECFQHRSHTCLVFEMLEQNLYDFLKQNKFSPLPLKYIRPILQQVLTALLKLKQLGLIHADLKPENIMLVEPARQPYRVKVIDFGSASHVSKAVCNTYLQSRYYRAPEIILGLAFCEAIDMWSLGCVVAELFLGWPLYPGSSEYDQIRYISQTQGLPTEHMLNSASKTAKFFYRDVDSTYPFWRLKTPEEHELETGIKSKEARKYIFNCLDDIGQVNVPTDLEGGQLLAEKADRREFIDLLKRMLTMDQERRITPGEALNHAFVTLAHLVDYAHCNNVKASVQMMEVCRRSGSGVGGVGGVGGVGGEYVGGVVAPAAPTAHHLALTINQQRLRAAPYDNLYQLYGGGRVAVGGGRQFTRAPDPFPHQFVSSILCQPAYQVQQLAAAAAAAAAAHHQHGVGGVGGVGVGVGVGDVGEWRAPLIVEPAPLPEPEVWDFHHHHPHHHHPHKRSTKQQTSGVPHYQPPHQTHQYSMASAGGKKEATQLSPVKKRVKEGTPPSRHRQENTGRCNQPVAGWEAGGAHTITIRDTPSPAVSVITISDSDDDDGRPDRGGVLLSPTSPPGVNPPAMVAPPPASVICNARTPRDDRHVKHEPHHTHQPCHHHTQPQQPQQQTQQQQQQPQQQQQQQQQQQPRQPAGPQVTPQSQKKRLLALAQQEQAKLEHHDLQHYHQQSGSSVRPSDYQCNQYVPTQHNKRDSSGGCREYLPVSAAVSGVSGVTAVSGVGVGGVGMSVNVGPPAAHKHAHAWHHHPHPHYKSSVGSGVGVGVGVGAGVGVGGGVLSPGGVSPGYLAPPLYVPTYHYHTGGVSGVGGVGGVGAVGGPPPAHHGSGARLPGYLQPYSPTYLVPQHPQQHLWYTD-
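
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein lengths listed above (3066 bp, 1021 aa) agree if the coding sequence includes a terminal stop codon: 3066 / 3 = 1022 codons, one of which is the stop. The Python sketch below shows one way to check this for a FASTA pair such as the one above. It assumes the standard genetic code (NCBI translation table 1) and follows this record's convention of writing the stop as "-". The function name `translate` and the short test fragments are chosen here for illustration; the fragments are the first 27 and last 12 nucleotides of DPOGS213784-TA.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0; stop codons become stop_symbol."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


# First nine codons and last four codons of the transcript above.
start_fragment = "ATGGATTTGTCCATACCTAGACTTAAG"
end_fragment = "TACACGGACTGA"

print(translate(start_fragment))
print(translate(end_fragment))

# Length check: 3066 bp -> 1022 codons -> 1021 residues plus one stop codon.
transcript_bp = 3066
print(transcript_bp // 3 - 1)
```

To check the whole record, pass the complete nucleotide sequence (with its line breaks joined) to `translate` and compare the result with the protein sequence listed under DPOGS213784-PA.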