Monarch geneset OGS2.0

DPOGS202920
TranscriptDPOGS202920-TA1851 bp
ProteinDPOGS202920-PA616 aa
Genomic positionDPSCF300126 + 435619-439669
RNAseq coverage264x (Rank: top 40%)
Annotation
HeliconiusHMEL0145810.064.70% 
BombyxBGIBMGA004156-TA8e-17658.36% 
DrosophilaCG42856-PB6e-4838.36% 
EBI UniRef50UniRef50_UPI00021A40EC1e-6438.83%UPI00021A40EC related cluster n=1 Tax=unknown RepID=UPI00021A40EC
NCBI RefSeqXP_973074.11e-7044.90%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastpgi|910869612e-6944.90%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastxgi|910869611e-6544.27%PREDICTED: similar to predicted protein [Tribolium castaneum]
Group
Gene OntologyGO:00055246.6e-78ATP binding
GO:00046726.6e-78protein kinase activity
GO:00064686.6e-78protein phosphorylation
GO:00046744.7e-76protein serine/threonine kinase activity
GO:00167721.6e-68transferase activity, transferring phosphorus-containing groups
GO:00047134.5e-12protein tyrosine kinase activity
KEGG pathway 
InterPro domain[14-299] IPR0133346.6e-78Hormonally upregulated Neu-associated kinase
[18-298] IPR0022904.7e-76Serine/threonine-protein kinase domain
[2-358] IPR0110091.6e-68Protein kinase-like domain
[21-298] IPR0174422.7e-60Serine/threonine-protein kinase-like domain
[18-297] IPR0206354.5e-12Tyrosine-protein kinase, catalytic domain
Orthology groupMCL24993 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202920-TA
ATGGCGAACATAGCGCGGAGTAAGGGACAAGTGTATAGTGTTGGAAATTACTTGATGACCGGTAAAGTCTTAGGGAAGGGACATTTCGCTAGGGTAGAAGAAGCTACTCACCGTATAATCGGAAAAAAGGTGGCAATAAAAATCATAGACCTTACGTGCATCAAAGAAGATTATGCCCGCCGTAATTTACATCGAGAGCCTAAAGTCATGGCCAAGCTCCGACACCCTTGCATCGCCGCTTTATATGAAACCATGATGCATGGTCCGCGTCTGTACGTGGTGATGGAGGCAGCGGGCGGGGGCGACCTGTGCGCGCACGTGCTGGCAGCCCGCGGCGCCGCGCGAGGTCTGCCCGAGCAGAGGGCGCGGGCCCTGGCCGCTCAGCTGGTGTCGGCAGTCCGGCACATGCATGCTCGCGGTGTAGTGCACCGCGACTTAAAGATGGAAAATATAATGTTGGACAGCACGAAACAATTTATTAAGATCGTTGATTTCGGTTTGTCCAACGTGTGGAGCGCGGGCGGGTCGCTGCGCACACCGTGCGGCTCGCTGGAGTACGCCGCGCCTGAACTGTTCGTTGATGGACGGAAGTACGGGCCGGAGGTCGACTTGTGGAGTATAGGCGTGATTGTATTCGGTATGGTGACGGGTGGTCTGCCGTTTGCGGGTAGTGGTAGTGGTGGTGGTGGTGGTGGTGGGGCGGCCGCAGCCGGGGCCGGGGAGGGGAGCTCGCGACCTCAGTTGCGCGCCGCCATCGCGCGCGGTTACACGCGTAAACAGAGAGCGGCGCTTGTATGTGTGTCAGCAGAATGCAAGACGTTCATACAGCAGTTACTGGAGCCGAAGGTGGAGTTAAGAATGAAGATCGAGGAGGCAGCGAGGCATCGCTGGATCAGGAGACCGGGCATGAGGATGAGGACACATCCACTGCCGGGAGTCGAGCCCAGGGCTAACAGGGAGATTTACAGACAAATCTCCGAGCTGTGTGGAGAGACAATGCTAGATGTCGTCGCTCACATAAAGGCGGACCCGTTCGGCGCGATAGCCGGCATTTACAACATTAAGTCGCACCTGCAACAGATGTCGTCCAGCACAGGAGGCGAGCTCTTGTGGTCTTCCAGCACCGAGGAACCCCGACCCGCGAGCCCGCCGGAGTATCGCGTCAAATACGACGAGTACGCACCATCTAGCAGTCGCATGTCGGTGTCGCAAATATCCAGTTTCAACTTCCCCACACATGTCCGACAGTTTGAACAAAAAACGGAAGGAAAGCCGCAGATCCCGAAGATGCCTCGGGCCGCGAGGACGCAAAAGCCTGCGGCGGCCCCGAAGTGTCAAGAGATCGGTCAGAAGGTTCAGAACGACAACCCGTCGGTGGCCAAGACATGTTACGCGATGAAGAAGCCGGTGTTTAAGGTTTATGACAACGGAGACGGCTTCGATCCGCGGCTGCCGAGTATAGACGAACACTCCAACATCGACGTGACGGACGGAAAGACCCTCCACAAGACGAGGCAGTGTGTGAGGAAGGTCAGTCTGGAGGAGCCGGCCAGGATCAGCTCCTGCCCGGACAGAGCTGATGCCAAGAGAACGGACTTCTACAAGAAGGCTGGCGACGGCCTACAGAGAAACATCGTAAGCCCCCACTCATCATCAAACAGTCACTGCAAAAGCGAAAGGGCGATGCCGCTCATCATCTGCAATAAGTATGTTGGGTTGGCCATGGGCGTGAGACTCACCATCAGTGGTAGGGCATGCGTGCTGGGTTGGCCATGGGCGCGAGACTCACCATCAGTGGTAGGGCATGCGTGCTGGGTTGGCCATGGGCGCGAGACTCACCATCAGTGGTAG

Protein sequence:

>DPOGS202920-PA
MANIARSKGQVYSVGNYLMTGKVLGKGHFARVEEATHRIIGKKVAIKIIDLTCIKEDYARRNLHREPKVMAKLRHPCIAALYETMMHGPRLYVVMEAAGGGDLCAHVLAARGAARGLPEQRARALAAQLVSAVRHMHARGVVHRDLKMENIMLDSTKQFIKIVDFGLSNVWSAGGSLRTPCGSLEYAAPELFVDGRKYGPEVDLWSIGVIVFGMVTGGLPFAGSGSGGGGGGGAAAAGAGEGSSRPQLRAAIARGYTRKQRAALVCVSAECKTFIQQLLEPKVELRMKIEEAARHRWIRRPGMRMRTHPLPGVEPRANREIYRQISELCGETMLDVVAHIKADPFGAIAGIYNIKSHLQQMSSSTGGELLWSSSTEEPRPASPPEYRVKYDEYAPSSSRMSVSQISSFNFPTHVRQFEQKTEGKPQIPKMPRAARTQKPAAAPKCQEIGQKVQNDNPSVAKTCYAMKKPVFKVYDNGDGFDPRLPSIDEHSNIDVTDGKTLHKTRQCVRKVSLEEPARISSCPDRADAKRTDFYKKAGDGLQRNIVSPHSSSNSHCKSERAMPLIICNKYVGLAMGVRLTISGRACVLGWPWARDSPSVVGHACWVGHGRETHHQW-