Monarch geneset OGS2.0

DPOGS214696
Transcript         DPOGS214696-TA  2955 bp
Protein            DPOGS214696-PA  984 aa
Genomic position   DPSCF300022 - 1454590-1464365
RNAseq coverage    276x (Rank: top 39%)
Annotation
Heliconius       HMEL013589        0.0  71.00%
Bombyx           BGIBMGA000180-TA  0.0  73.28%
Drosophila       Ire1-PB           0.0  43.61%
EBI UniRef50     UniRef50_D6WVT7   0.0  46.31%  Putative uncharacterized protein n=8 Tax=Endopterygota RepID=D6WVT7_TRICA
NCBI RefSeq      XP_001809663.1    0.0  46.08%  PREDICTED: similar to serine threonine-protein kinase [Tribolium castaneum]
NCBI nr blastp   gi|270012165      0.0  46.31%  hypothetical protein TcasGA2_TC006276 [Tribolium castaneum]
NCBI nr blastx   gi|270012165      0.0  46.06%  hypothetical protein TcasGA2_TC006276 [Tribolium castaneum]
Group
Gene Ontology    GO:0016772  3.2e-46  transferase activity, transferring phosphorus-containing groups
                 GO:0005524  6.9e-32  ATP binding
                 GO:0006468  6.9e-32  protein phosphorylation
                 GO:0004674  6.9e-32  protein serine/threonine kinase activity
                 GO:0006397  2.9e-31  mRNA processing
                 GO:0016891  2.9e-31  endoribonuclease activity, producing 5'-phosphomonoesters
                 GO:0004672  9.8e-23  protein kinase activity
                 GO:0005515  1.5e-13  protein binding
                 GO:0004713  6.7e-08  protein tyrosine kinase activity
KEGG pathway     tca:658142  0.0
                 K08852 (ERN1)  maps->  Alzheimer's disease
                                        Protein processing in endoplasmic reticulum
InterPro domain  [507-786]  IPR011009  3.2e-46  Protein kinase-like domain
                 [521-755]  IPR002290  6.9e-32  Serine/threonine-protein kinase domain
                 [762-882]  IPR010513  2.9e-31  KEN domain, ribonuclease activator
                 [524-669]  IPR017442  9.8e-23  Serine/threonine-protein kinase-like domain
                 [816-870]  IPR006567  1.5e-13  PUG domain
                 [108-250]  IPR011047  2.9e-11  Quinonprotein alcohol dehydrogenase-like
                 [521-753]  IPR020635  6.7e-08  Tyrosine-protein kinase, catalytic domain
Orthology group  MCL11483  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214696-TA
ATGAAGTTTTTTTTCATAATATTATTAGTGTTGGAGTGCTATGGGTTAGAATCGAATGTGGACAATGATAGAAAGGTGAATACAGAAGTGTCTAGAGCTTTAGATGACCGGCCTTTGTTGTTTGCTACGCTCGGAGGCGGCATGGTGGCCGTGGATCCCTCAACAGGGAACGTTATATGGAAACTAAAAGATGAACCCGTAGTCAAAGTGCCGAATCAGCATGGGAATCTCATGTCCCAATTCCTACCAGATCCACGTCATGGCTTCCTATACATGTACGGTCCGAGAGGGGATAGTAAAATGCTGAAGAAGCTGCCCTTTACCATACCAGAGCTGGTTGCGAACGCACCTTGCAGGTCTACAGAGGGTATATTATATACTGGGAAGAAGAGCGACACGTGGTTCATACTGGACCCATTAACTGGTACCAGAGAACATGTATCTGGATTCGACAGTTCAAAGATATTCAAAAGCGAGGGAGGTACCTGTCCGCCGAACAGAAAACGCGGCGTCTACATCGCCCGCACCGAGTACAATATACTGATGCACGATTCAAACAACGAGAATCAGAAGTGGAATGTAACATTCTTTGATTACACCTCGCATGCCATGGCGAAGGAAATGGTCAACGATTACGGTATAATACATTTCACGTCCACATCGAATGGTAGGATAATGTCCTTCAATAGGAAGACCGGAGATCTTCTTTGGTCTCATAACTTTGAGACTCCGGTTGTTGCTACATATTTGTTAGACAGGGAGGGCCTTATATCCGTTCCTTTCAATTCCATTGGTGATGATACTTTGGACCACATTATGGAAGACGCGACCACTTTGAGGAATGGTCAAGGAATCAAGAATTCCAATATAGAGTTGTATCCCACTCTATACATCGGCGAGCACAACCATGGTCTATACGCATTATCGTCTCTCGTAGACAAGAACACAGTCACAATATCAACTGGACACACAAAACCCCTGCTTCTCGAGGGTCCAGTGACGGAAAACCAAGCGAACTCCGAGAAGGTTACCTACGAGCCCTTCAAGAACATCCACTACCAGCTGAACGATCTAAATCTCCACGTGACACCGCCATACCTGTTACTAGGGCATTACAAAGTACCAGAACTGACAACTAATTGGATGCCGCATCTACCGAATATAAACACTATATCGCACTCACAGAACTCAGTGAAGCTGATTAACGGGGAGGTCCGGGTCACTGACCTTGATGGGGAAAATGAAACTAATTTGAAGTCCAATACCAATTCAGTGTCTGTGTCTGTTCAAACAGACGAGTTTTTCCAGGAGTTCAGTTTTAGACCTGATCTGTGGTATAAGAAGGCCTATATATGGTTGCACCAGCAAGAAAATAAAGCTCTCAAAGTACTGCTGATAATTCTGATGGGTTTGGTTGTAACCTTATTCTGGTATTTAAGGTATCAGGCCCGTGAATTTCAACAGCTCTCTCAGTCTGGCTCTAGGGGTTCCTCCACACAGGAAGTGACTGCTCAGCTGGAGGAGTTAGGGGAGGGGGAGGTGAGAGTGGGGAAGATCAGTTTCTTCACGGACCAAGTTCTGGGGAAAGGCTGTGAAGGAACATTCGTTTATAGGGGTACGTTCGACAAGCGAGCTGTGGCTGTTAAGCGACTTCTACCAGAGTGTTTCACCTTCGCCGACCGCGAGGTGGCGCTGTTAAGAGAGTCCGATGCGCACGCGCATGTAGTACGATACTATTGTACTGAGAGGGACAAACAGTTTAGATACATAGCTCTGGAGCTCTGCTCGGCGACTTTGCAGGACTACGTGGAAAAGAAATTAAACTTTGTATGCAAGATAGAGTCCTTGGAGGTTCTACGTCAGGCCACTTTAGGGTTGTCGCATCTGCATTCCATGGATATAGTCCACAGAGATATCAAGCCCCATAATGTGTTGCTGTCGATGCCGAACGGTACCGGCGAGGTGCGGGCTATGATATCAGACTTCGGGCTGTCGAAGAA
ACTGAATATAGGGAAGACGACCTCAGTGGACATGTTTTCTCTGGGATGTGTGTTCTACTTCGTGTTGTCCAGAGGCCTGCATCCGTTTGGAGACGTCTTGCGAAGACAAGCGAACATACTGACAGGGGACTATAACCTGGATCACCTGGATAAGGTTCTGCCTTCAGAGCAGGTGTTACTGTCCAAGATGCTGATCCGCTGTATGATATCCGTCCGCCCCAGCCGCCGCCCGCCCTGCGACTCAGTACTCAAATTCCCGCTCTTCTGGAATCGACAGGGAGTTCTTAATTATCTACAGGACGTGAGTGACCATATAGAGAGCGTCTCCCAGACAAGCGATCATCCGCTGGAGTTCGGAGGTCGTAAGGTCATCAGAGGTGACTGGAGGCTACACGTGTGTAGCCGCGTAGCGGGGGATCTACGAGCGAGGAGGACCTACAGGGGGGATCGCGTAGCACATCTATTAAGAGCCGTTAGGAATAAGAAACATCACTATAGAGAATTAGAACCAGAAATACGCGAAAGTCTGGGCCGTTTGCCGGACGGCTTCGTCACGTATTGGCTGAAGAGATTCCCCCTCCTGCTCCCGCACGTGTGGCTTCAGATGCAGCAGTACAGGAACGAGGATATACTGCAAGCATACTATCCGTACTCCTTCACATTTTATAGAGAAGAAGTCCCAGAGCTAGCGGACGATGACAATGAAACGCCGCCAGAATCAGAGGATCCGCACAAAAACGAACTGTTCGCGAAAAGCAGGGTGTACTACGATGAGAGCAAAGAGAAAAAGTTCTATAGGAAAGATTGGTCGCCAAAGAAGCAAGTGGATTGGCGGAGTCAGGGGCAGGAATTTCAGTTGAAACAGGATGACGTCAGATTAAGAGATAGACATCACTATAAGAAGAGAGAGAAGAAGAGAGAGGAGATGCCTGTGTGGAGCTTACCTCCGCAGTGA

Protein sequence:

>DPOGS214696-PA
MKFFFIILLVLECYGLESNVDNDRKVNTEVSRALDDRPLLFATLGGGMVAVDPSTGNVIWKLKDEPVVKVPNQHGNLMSQFLPDPRHGFLYMYGPRGDSKMLKKLPFTIPELVANAPCRSTEGILYTGKKSDTWFILDPLTGTREHVSGFDSSKIFKSEGGTCPPNRKRGVYIARTEYNILMHDSNNENQKWNVTFFDYTSHAMAKEMVNDYGIIHFTSTSNGRIMSFNRKTGDLLWSHNFETPVVATYLLDREGLISVPFNSIGDDTLDHIMEDATTLRNGQGIKNSNIELYPTLYIGEHNHGLYALSSLVDKNTVTISTGHTKPLLLEGPVTENQANSEKVTYEPFKNIHYQLNDLNLHVTPPYLLLGHYKVPELTTNWMPHLPNINTISHSQNSVKLINGEVRVTDLDGENETNLKSNTNSVSVSVQTDEFFQEFSFRPDLWYKKAYIWLHQQENKALKVLLIILMGLVVTLFWYLRYQAREFQQLSQSGSRGSSTQEVTAQLEELGEGEVRVGKISFFTDQVLGKGCEGTFVYRGTFDKRAVAVKRLLPECFTFADREVALLRESDAHAHVVRYYCTERDKQFRYIALELCSATLQDYVEKKLNFVCKIESLEVLRQATLGLSHLHSMDIVHRDIKPHNVLLSMPNGTGEVRAMISDFGLSKKLNIGKTTSVDMFSLGCVFYFVLSRGLHPFGDVLRRQANILTGDYNLDHLDKVLPSEQVLLSKMLIRCMISVRPSRRPPCDSVLKFPLFWNRQGVLNYLQDVSDHIESVSQTSDHPLEFGGRKVIRGDWRLHVCSRVAGDLRARRTYRGDRVAHLLRAVRNKKHHYRELEPEIRESLGRLPDGFVTYWLKRFPLLLPHVWLQMQQYRNEDILQAYYPYSFTFYREEVPELADDDNETPPESEDPHKNELFAKSRVYYDESKEKKFYRKDWSPKKQVDWRSQGQEFQLKQDDVRLRDRHHYKKREKKREEMPVWSLPPQ-