Monarch geneset OGS2.0

DPOGS210390
Transcript: DPOGS210390-TA (687 bp)
Protein: DPOGS210390-PA (228 aa)
Genomic position: DPSCF300291 - 99083-99769
RNAseq coverage: 1082x (Rank: top 12%)
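
The lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive, and that the transcript length includes the stop codon; `span_length` is a hypothetical helper written for this illustration, not part of any database tool.

```python
import re


def span_length(position: str) -> int:
    """Return the inclusive length of the trailing 'start-end' range in a position string."""
    match = re.search(r"(\d+)-(\d+)\s*$", position)
    if match is None:
        raise ValueError(f"no start-end range found in {position!r}")
    start, end = int(match.group(1)), int(match.group(2))
    # Assumption: coordinates are inclusive, so both endpoints count.
    return end - start + 1


genomic_bp = span_length("DPSCF300291 - 99083-99769")
transcript_bp = 687   # transcript length from the record
protein_aa = 228      # protein length from the record

# Genomic span versus transcript length.
print(genomic_bp, genomic_bp == transcript_bp)
# Codons minus one stop codon versus protein length.
print(transcript_bp // 3 - 1, transcript_bp // 3 - 1 == protein_aa)
```

Under these assumptions the genomic span equals the transcript length, which is consistent with a coding region uninterrupted by introns, and 687 bp corresponds to 228 codons plus one stop codon.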
Annotation
Database         Hit                      E-value  Identity  Description
Heliconius       HMEL014837               8e-118   97.16%
Bombyx           BGIBMGA012076-TA         2e-114   90.09%
Drosophila       slpr-PB                  3e-80    62.96%
EBI UniRef50     UniRef50_UPI00022474AB   7e-88    71.96%    UPI00022474AB related cluster n=1 Tax=unknown RepID=UPI00022474AB
NCBI RefSeq      XP_975604.1              2e-89    71.63%    PREDICTED: similar to mixed lineage kinase [Tribolium castaneum]
NCBI nr blastp   gi|91081193              3e-88    71.63%    PREDICTED: similar to mixed lineage kinase [Tribolium castaneum]
NCBI nr blastx   gi|270005270             1e-84    71.63%    hypothetical protein TcasGA2_TC007298 [Tribolium castaneum]
Group
Gene Ontology:
GO:0004709   2.9e-109   MAP kinase kinase kinase activity
GO:0016772   6.8e-37    transferase activity, transferring phosphorus-containing groups
GO:0004672   6.9e-28    protein kinase activity
GO:0006468   6.9e-28    protein phosphorylation
GO:0005515   9.8e-21    protein binding
GO:0004713   1.4e-08    protein tyrosine kinase activity
GO:0005524   5.8e-06    ATP binding
GO:0004674   5.8e-06    protein serine/threonine kinase activity
KEGG pathway:
mdo:100020082   7e-78
K04417 (MAP3K9, MLK1) maps-> MAPK signaling pathway
InterPro domain:
[9-224]    IPR015785   2.9e-109   Mitogen activated protein kinase kinase kinase 9/10/11-like
[62-221]   IPR011009   6.8e-37    Protein kinase-like domain
[92-224]   IPR001245   6.9e-28    Serine-threonine/tyrosine-protein kinase
[11-71]    IPR001452   9.8e-21    Src homology-3 domain
[92-228]   IPR020635   1.4e-08    Tyrosine-protein kinase, catalytic domain
[92-226]   IPR002290   5.8e-06    Serine/threonine-protein kinase domain
Orthology group: MCL10295 (Multiple-copy universal gene)

Nucleotide sequence:

>DPOGS210390-TA
ATGGCGGCAGCCGATGATCGTCCGCCGCTCGCTCTGTTTACCGCCGTCTATGACTACATAGCCCAGGGAGAGGACGAGCTCTCTCTACGCCGAGGCGAGATCGTCGAGGTTCTGTCCAAGGACGCCAACATCTCCGGAGATGAGGGCTGGTGGACGGGTAAGATCGGTGATCGCGTCGGCATCTTCCCAGCCTCGTATGTCACCGAGGACGATCCGTTGGCCGTCTCCTCGGTGATAGGGGACGTCGACCCTCCCCGCGTCTCCTTCTCGGAGCTGAAACTCGAGGAGGTGATCGGAGTGGGCGGTTTCGGTAAAGTGTATCGTGGCTATTGGAACGACGAGGTAGTGGCAGTGAAGGCGGCGCGGCAAGACGCAAACGAGGATATAGAAGTGATTAAAGAGAGTGTGTTACAGGAAGCTAAACTGTTTTGGATGTTACAACATGATAATATCGTGTCGTTGAAAGGAGTGTGCTTAGAAGAGCCTAATTTGTGTTTGGTGATGGAGTACGCTCGCGGAGGGCCATTGAACAGAGTGCTATCGGGACGGAAGATCCGGCCAGGTATCTTGGTGGACTGGGCGATACAGGTGGCACGCGGGATGGCCTACCTGCACGTAGACGCACCAATATCACTCATCCACCGGGACTTGAAGAGCTCTAATGGTATGATTTTAACACACCATTAA

Protein sequence:

>DPOGS210390-PA
MAAADDRPPLALFTAVYDYIAQGEDELSLRRGEIVEVLSKDANISGDEGWWTGKIGDRVGIFPASYVTEDDPLAVSSVIGDVDPPRVSFSELKLEEVIGVGGFGKVYRGYWNDEVVAVKAARQDANEDIEVIKESVLQEAKLFWMLQHDNIVSLKGVCLEEPNLCLVMEYARGGPLNRVLSGRKIRPGILVDWAIQVARGMAYLHVDAPISLIHRDLKSSNGMILTHH-
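
The protein sequence can be reproduced from the nucleotide sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code and that the trailing `-` in the protein record marks the stop codon; `translate` and `CODON_TABLE` are hypothetical names used only for this illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


# First ten codons of DPOGS210390-TA.
print(translate("ATGGCGGCAGCCGATGATCGTCCGCCGCTC"))  # MAAADDRPPL
# Last seven codons of DPOGS210390-TA, ending in the TAA stop codon.
print(translate("ATGATTTTAACACACCATTAA"))           # MILTHH-
```

Passing the full DPOGS210390-TA sequence to `translate` should give a string that can be compared directly against DPOGS210390-PA.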