Monarch geneset OGS2.0

DPOGS210718
TranscriptDPOGS210718-TA3054 bp
ProteinDPOGS210718-PA1017 aa
Genomic positionDPSCF300013 - 189375-194315
RNAseq coverage823x (Rank: top 16%)
Annotation
HeliconiusHMEL0070810.088.74% 
BombyxBGIBMGA006330-TA0.089.49% 
Drosophilawts-PA0.075.30% 
EBI UniRef50UniRef50_F4WFL80.066.43%Serine/threonine-protein kinase LATS1 n=11 Tax=Endopterygota RepID=F4WFL8_ACREC
NCBI RefSeqXP_395146.30.064.52%PREDICTED: similar to warts CG12072-PA [Apis mellifera]
NCBI nr blastpgi|3071732530.065.77%Serine/threonine-protein kinase LATS1 [Camponotus floridanus]
NCBI nr blastxgi|3320268940.068.37%Serine/threonine-protein kinase LATS1 [Acromyrmex echinatior]
Group
Gene OntologyGO:00055249.6e-92ATP binding
GO:00046749.6e-92protein serine/threonine kinase activity
GO:00064689.6e-92protein phosphorylation
GO:00167721.1e-80transferase activity, transferring phosphorus-containing groups
GO:00046721.8e-39protein kinase activity
GO:00055151.1e-07protein binding
GO:00047132.9e-07protein tyrosine kinase activity
KEGG pathway 
InterPro domain[624-928] IPR0022909.6e-92Serine/threonine-protein kinase domain
[606-988] IPR0110091.1e-80Protein kinase-like domain
[625-780] IPR0174421.8e-39Serine/threonine-protein kinase-like domain
[43-114] IPR0090601.1e-07UBA-like
[624-853] IPR0206352.9e-07Tyrosine-protein kinase, catalytic domain
Orthology groupMCL11488 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210718-TA
ATGAACCCGCCGGCGCCCGGCAAGACGGCCGCGCGCTCATCAGGGTACAACCAGAAAGCGCTCGCCGAAATCCGAAACTCATTACTCCCGTTTGCTAACATCGGGAGCTCAGAGCCTCCGGGCTCTTCCGCTGCCAGTACTGTCAGTTCGGGAGTCAGTTCTGGCTTCAGTTCCACCTCAGGGAATGGGCTGGATAAGGACTTGAGCGTACTACCTCAGTCATTAAATCAGCTTATAGCTCTGGGTTACGATGAAGATCCAGCAGTCAAGGCGCTCAAATATGCCGGCGGTCGTTTCGACGCGGCACTCGATTATCTTTCGAAGAAGCAGGAGCCGCTCAACGGCGTCCTCAAGTCGAGTAACCTGAGCGCGCTCAGCACCAAATTAATAAGGAAACCAAGTTTAGAGCGCGAGATTAACCTCCACCGAGGCAGTCCGGCGCTCGACTCCGGAGCGGGGAGCTCACGGTCGGACAGTCCGAGGCAGTCGGACGCACCGCCTTTGCCCCATGAGAAGCTCAGTCGGCAGTACTCGCCGTCAGGGTTCTCGGAGCCTCCGCCGCCGCCGCCGCCGCGATGTCCCTCGACGCCGCCCGTGCCGCCTTCGGTGCAGCAATACATGAAGAGGATATCGCCAGCCCCGCCTCTACCGCCGGCAAGGGGCACGAGTCCTGTGGCTGCCGGGGCTCCGACGCCGCCCGCGCGGACCCCCATGATCGTCCAGAACGGCCCGCAGGTGCAGCAGCAGCTGACACAGCAGATCCAGGCGCTGAGCATCTACCAGACGGGGCCGACGGGCGCGGAGCTTCCACCGCCATATCCTCTCTCGAGCGTCCCTCCACCGCCTTCGTACTCCGTGTCCATGCAGAATCGACAGAGTCCCACTCAGTCCCAGGACTACAGGAAGAGTCCGTCCTCGGGGATATACTCGGTGGGGACGTCAGCGGGTTCTCCGAGCCCCATAACGGTGACTCAGTCGTCTGGATCTTCGGCGGGTATGACGCGTCCGACTCCCATCCAGGCGTGGACGGCTCGGCAGGCCGTCCAGCCGCCTATCATAATGCAGTCGGTGAAGAGCACGCAGGTTCAGAAGCCAGTACTGCAGACGGCCATCGCTCCGGTGGCTCCTCCGCCGGTGGCTCCGGGTCAGCCTCCGCCGCCGTCTTACGCGAGCTCAATACAACAAAAGCAGTCGCAGTCGAAGCCCGGGTACCCTCTCGCGCCGAAGCCGTCGCCCCCGGGGTCCACGCCGCCGGCGACGACCCCCGTCCCTACCACGGAGCCCCCCAGCTACGCCATCACAATGCAGGCGCTGGCCGTTCAAAGAGGGATGCACCCTATTCCTCCGCCGCCCTACGGCAACCAGGCCGAGACGACCACGACCGTCAACTCTCACCATTCTCCATTACATAAAAAGTTCTCTAATAACTGTGATGTTAAGGGTGAATCCTCGCATATGGAACTGAAGTGTTCGAATCAAAACTGTAGTATGATGAAAGATAGTGTTCCGTCCGGTTCCGATAAGAGTTCGAATGGCTCGACGGAGAGGCGGTCGAAGGGCGCCCCCGACAAGATCAGGCATCAGTCGCCGATCCCCGAGAGAAAGAACATCAGTAAAGAGAAGGAGGACGAGAGGAGGGACTGTAAGGTTAGGAATTATTCCCCGCAAGCCTTTAAATTCTTTATGGAACAACACGTCGAAAATATTCTCAAGGCGTACAAGCAGCGGACGTATCGAAGGATGCAGTTGGAGAAAGAAATGACGAAGATAGGCCTCAGCTCCGAGGCGCAGGATCAGATGAGGAAGATGCTGTCCCAGAAGGAATCCAACTACATCAGGCTGAAGAGGGCGAAGATGGACAAGTCCATGTTCGACAAGATAAAACCGATAGGCGTCGGGGCGTTCGGGGAGGTGACGCTGGTGAGGAAGATCGATACTAGTCATCTATATGCCATGAAAACGCTCAGGAAAGCTGATGTGTTGAAGCGAAACCAAGTGGCTCATGTCAAGGCCGAGCGGGACATCCTTGCGGAGGCCGACAACGAGTGGGTCGTGAAGTTGTACTACAGCTTTCAGGATAAGGACAATTTGTATTTCGTTATGGACTATATACCGGGGGGCGATCTGATGTCCCTGTTGATAAAGTTAGGTATCTTCGAGGAGAACCTGGCGCGGTTCTATATCGCGGAACTGACGTGCGCGGTGGAGAGCGTGCACAAGATGGGCTTCATCCACCGGGACATCAAGCCGGACAACATCCTCATCGACCGCGACGGACACATCAAGTTGACGGACTTCGGTCTCTGCACCGGCTTCAGGTGGACGCACAATTCAAAGTACTACCAGAGAAACGACCACGGTCGACAGGATTCAATGGATCCCGTGGACGGAGAGTGGGGCGCGATGGGGGAATGTCGCTGTCACCAGCTGAAGCCTTTAGAGAGGCGGCGGAAACGGGAGCACCAGCGCTGCTTGGCCCACTCCCTCGTCGGGACACCGAACTACATAGCACCCGAGGTTTTGCAGAGGACTGGATACACGCAGCTCTGCGACTGGTGGTCGGTGGGGGTCATACTGTTTGAGATGCTGGTTGGGTCTCCGCCTTTTCTGGCGTCCACGCCCGCCGAAACTCAGTTAAAGGTCATAAACTGGGAGAGCACGCTGCACATACCGGACGCGGCGAACTTGTCGCGGGAGAGCAAAGACCTCATACTACAACTGTGCTCCGGCCAGGAGACCAGGTTGGGCAAGGACGCCAACGAAGTCAAAAACCATCCCTTCCTCAAGAGCATCGACTTCGAGAAGGGTCTGCGGCGGCAGACGGCGCCCTACATACCCAGGATAGACTACCCCACGGACACCTCCAACTTCGACCCCATAGACCCCGACAAGCTGAGGAACTCCGGTTCCGACTCCAACAAGTCGGACAGCGAGCTGGACAACGGCAAGACCTTCCACGGCTTCTTCGAGTTCACCTTCAGGAGGTTCTTCGACGACGGTTACACCAACAAGATCAACCTGGACGACGACCATCAGGGACCTGTGTATGTGTGA

Protein sequence:

>DPOGS210718-PA
MNPPAPGKTAARSSGYNQKALAEIRNSLLPFANIGSSEPPGSSAASTVSSGVSSGFSSTSGNGLDKDLSVLPQSLNQLIALGYDEDPAVKALKYAGGRFDAALDYLSKKQEPLNGVLKSSNLSALSTKLIRKPSLEREINLHRGSPALDSGAGSSRSDSPRQSDAPPLPHEKLSRQYSPSGFSEPPPPPPPRCPSTPPVPPSVQQYMKRISPAPPLPPARGTSPVAAGAPTPPARTPMIVQNGPQVQQQLTQQIQALSIYQTGPTGAELPPPYPLSSVPPPPSYSVSMQNRQSPTQSQDYRKSPSSGIYSVGTSAGSPSPITVTQSSGSSAGMTRPTPIQAWTARQAVQPPIIMQSVKSTQVQKPVLQTAIAPVAPPPVAPGQPPPPSYASSIQQKQSQSKPGYPLAPKPSPPGSTPPATTPVPTTEPPSYAITMQALAVQRGMHPIPPPPYGNQAETTTTVNSHHSPLHKKFSNNCDVKGESSHMELKCSNQNCSMMKDSVPSGSDKSSNGSTERRSKGAPDKIRHQSPIPERKNISKEKEDERRDCKVRNYSPQAFKFFMEQHVENILKAYKQRTYRRMQLEKEMTKIGLSSEAQDQMRKMLSQKESNYIRLKRAKMDKSMFDKIKPIGVGAFGEVTLVRKIDTSHLYAMKTLRKADVLKRNQVAHVKAERDILAEADNEWVVKLYYSFQDKDNLYFVMDYIPGGDLMSLLIKLGIFEENLARFYIAELTCAVESVHKMGFIHRDIKPDNILIDRDGHIKLTDFGLCTGFRWTHNSKYYQRNDHGRQDSMDPVDGEWGAMGECRCHQLKPLERRRKREHQRCLAHSLVGTPNYIAPEVLQRTGYTQLCDWWSVGVILFEMLVGSPPFLASTPAETQLKVINWESTLHIPDAANLSRESKDLILQLCSGQETRLGKDANEVKNHPFLKSIDFEKGLRRQTAPYIPRIDYPTDTSNFDPIDPDKLRNSGSDSNKSDSELDNGKTFHGFFEFTFRRFFDDGYTNKINLDDDHQGPVYV-