Monarch geneset OGS2.0

DPOGS210391
Transcript  DPOGS210391-TA  1326 bp
Protein  DPOGS210391-PA  441 aa
Genomic position  DPSCF300291 - 76284-81953
RNAseq coverage  897x (Rank: top 14%)
Annotation
Heliconius  HMEL014836  0.0  79.70%
Bombyx  BGIBMGA008240-TA  1e-151  77.62%
Drosophila  slpr-PB  1e-66  49.39%
EBI UniRef50  UniRef50_D2A0C2  1e-100  58.94%  Putative uncharacterized protein GLEAN_07298 n=3 Tax=Tribolium castaneum RepID=D2A0C2_TRICA
NCBI RefSeq  XP_975604.1  1e-100  58.94%  PREDICTED: similar to mixed lineage kinase [Tribolium castaneum]
NCBI nr blastp  gi|270005270  3e-100  58.94%  hypothetical protein TcasGA2_TC007298 [Tribolium castaneum]
NCBI nr blastx  gi|270005270  2e-98  58.94%  hypothetical protein TcasGA2_TC007298 [Tribolium castaneum]
Group
Gene Ontology  GO:0004709  3.9e-91  MAP kinase kinase kinase activity
               GO:0016772  6.5e-28  transferase activity, transferring phosphorus-containing groups
               GO:0004672  1.4e-21  protein kinase activity
               GO:0006468  1.4e-21  protein phosphorylation
               GO:0004713  1.1e-06  protein tyrosine kinase activity
               GO:0005524  8.7e-05  ATP binding
               GO:0004674  8.7e-05  protein serine/threonine kinase activity
KEGG pathway  rno:500690  1e-68
              K04417 (MAP3K9, MLK1) maps-> MAPK signaling pathway
InterPro domain  [12-264]  IPR015785  3.9e-91  Mitogen activated protein kinase kinase kinase 9/10/11-like
                 [19-150]  IPR011009  6.5e-28  Protein kinase-like domain
                 [26-95]  IPR001245  1.4e-21  Serine-threonine/tyrosine-protein kinase
                 [1-119]  IPR020635  1.1e-06  Tyrosine-protein kinase, catalytic domain
Orthology group  MCL10295  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210391-TA
ATGCTGAAACATATCATGCAATACTACTTTTCGTTACTACTGCTTAGCGAGGCCATTCTCTCTGACGACACTCTGGAGGAGAAGACTCTCAAAATAACAGATTTCGGCCTGGCCCGGGAGGTGTACAAAACTACCCGCATGTCAGCGGCCGGGACATACGCCTGGATGCCGCCTGAGGTGATAAAAAATTCCACATTCTCCCATGCATCAGACGTGTGGTCCTATGGTGTCCTCCTATGGGAGCTTCTCACTGGAGAAACTCCTTACAAAGGGATCGACGCTCTGGCGGTGGCTTATGCCTGTTGGCGCTCAAATCCCCGGGAGCGTCCCTTATTTCCTGAAATCCTTGACCAATTGGAACACATCAGACAGTCGGAATTCACGAGGGCGCCGCACGAATCCTTCCACACCATGCAGGATGGTTGGAGGCTGGAAATTGAGGAGGTCCTGAGAGATCTACGGAGGAAGGAAAAGGAACTTCGTTGTAGAGAGGAGGAGCTAACCAGAGCTCAGCTTCAGCAGCGGTTGATGGAACAGAATCTTGCGCAGAAAGAGAGGGAATTGGAGATGAGGGAAATAGACCTGGCCGCCAGGGAATTACACATACTGATAGTGGCCAGTAACATGTCGCATACACAACCGCCGCAGCCCAACAAGAGAAAGGGCAAATTTCACAAGATGAAGTTACTGAAAAAGGACACGATCTCATCTCCACTGGACTTTAGGCACACGTTAACAGTTCGGCAGTCGGACGACGAGTCCAGTGTTAAACAACTCGTAGCAGTCGCTAAGGACACGCCGCCAGGGTCGCCAGCCATTATGAGACCCATCGTTTTGCCAGCGGACGGAGTGAAAGGGAAAACTTGGGGTCCATCGACGGGTCACCAACGGGCTCGAGCACATCTACCGCTGCCGGCGCTCAGACCTCGCGCCCACCGACCATCCACCTCTGCCCCGCATCTACCACCGCACGCACCTCGCGCGCCTCATCCAGGGCTTATAACGATAAACGCGATCGAGGAAACGAAGCGCAAACCGCGCAAGTCGAAATCACAAGTACGCGCGCCCAAGGCCGAGATAGCGAAGAAGAGGTCGGGCAGTCACGACGACCTGCTCGACGCCGAGCCCAGGAGGAACAAGTTCTTTATGTGTCCCATGTTCACGCCCGAGCTGCCCCACCACTACGACACGGTGTTCGACGCGCCGCAGAACAGGAAGGAGAAGAAGACCTTCCTCAAGAGCATCCGCTCCAACAGCCTGGCGGGGCTGCTGGCGGGGACCGGCAACCTCGCGCCCGACACCGCCGAGCTGCTCCGGGACGAGTGA

Protein sequence:

>DPOGS210391-PA
MLKHIMQYYFSLLLLSEAILSDDTLEEKTLKITDFGLAREVYKTTRMSAAGTYAWMPPEVIKNSTFSHASDVWSYGVLLWELLTGETPYKGIDALAVAYACWRSNPRERPLFPEILDQLEHIRQSEFTRAPHESFHTMQDGWRLEIEEVLRDLRRKEKELRCREEELTRAQLQQRLMEQNLAQKERELEMREIDLAARELHILIVASNMSHTQPPQPNKRKGKFHKMKLLKKDTISSPLDFRHTLTVRQSDDESSVKQLVAVAKDTPPGSPAIMRPIVLPADGVKGKTWGPSTGHQRARAHLPLPALRPRAHRPSTSAPHLPPHAPRAPHPGLITINAIEETKRKPRKSKSQVRAPKAEIAKKRSGSHDDLLDAEPRRNKFFMCPMFTPELPHHYDTVFDAPQNRKEKKTFLKSIRSNSLAGLLAGTGNLAPDTAELLRDE-
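
Length check: the reported transcript length (1326 bp) and protein length (441 aa) agree if the transcript is a complete coding sequence, since 441 codons plus one stop codon come to 1326 nt. The nucleotide sequence above ends in TGA, and the trailing "-" in the protein sequence marks that stop. A minimal Python sketch of the arithmetic follows. It assumes the transcript contains only the CDS with no UTRs, which this record does not state, and the function name is hypothetical.

```python
def cds_length_matches(transcript_nt: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a CDS of transcript_nt nucleotides encodes protein_aa residues.

    Assumption: the transcript is coding sequence only (no UTRs); has_stop says
    whether the terminal stop codon is included in the nucleotide count.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_nt == 3 * codons


# Lengths taken from the record: 1326 bp transcript, 441 aa protein.
print(cds_length_matches(1326, 441))
```

A mismatch here would suggest UTR sequence in the transcript, a partial gene model, or a frameshift in the annotation.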