Monarch geneset OGS2.0

DPOGS203914
TranscriptDPOGS203914-TA2493 bp
ProteinDPOGS203914-PA830 aa
Genomic positionDPSCF300005 - 749336-779619
RNAseq coverage10x (Rank: top 84%)
Annotation
HeliconiusHMEL0080480.090.45% 
BombyxBGIBMGA000491-TA0.083.46% 
DrosophilaDdr-PA3e-7135.14% 
EBI UniRef50UniRef50_E9HQT93e-13134.44%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9HQT9_DAPPU
NCBI RefSeqXP_966707.25e-17136.38%PREDICTED: similar to Discoidin domain receptor CG33531-PA [Tribolium castaneum]
NCBI nr blastpgi|1892348941e-16936.38%PREDICTED: similar to Discoidin domain receptor CG33531-PA [Tribolium castaneum]
NCBI nr blastxgi|1892348946e-16736.48%PREDICTED: similar to Discoidin domain receptor CG33531-PA [Tribolium castaneum]
Group
Gene OntologyGO:00047134.2e-56protein tyrosine kinase activity
GO:00046721.2e-53protein kinase activity
GO:00064681.2e-53protein phosphorylation
GO:00167721.3e-46transferase activity, transferring phosphorus-containing groups
GO:00071551.7e-12cell adhesion
GO:00055242.3e-10ATP binding
GO:00046742.3e-10protein serine/threonine kinase activity
KEGG pathway 
InterPro domain[544-817] IPR0206354.2e-56Tyrosine-protein kinase, catalytic domain
[550-817] IPR0012451.2e-53Serine-threonine/tyrosine-protein kinase
[506-817] IPR0110091.3e-46Protein kinase-like domain
[1-152] IPR0089791.6e-35Galactose-binding domain-like
[11-147] IPR0004211.7e-12Coagulation factor 5/8 type, C-terminal
[544-825] IPR0022902.3e-10Serine/threonine-protein kinase domain
Orthology groupMCL26556 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203914-TA
ATGCAGAACGGCATGATATCAAATGAGTCCCTCGCCGCTAGTTCCAGTTATGACCAGAGCGTGTCTCCACTAAGTTCAAGAATTAGAACAGAGATACGAGGTGGAGCTTGGTGTCCTAATGGTTTGATATCCCCTCGAAGTCGTCAATACTTGGAAATAGATTTACACGATGAATACCTGATCACTGCAACGGAGTCACAGGGCAGGTTTGCCAACTCTGTTGGAGTAGAGTTTGTTGAGAGTTATTCGGTTGAGTACTGGCGGAACGTTTTGAGCCGTTGGGTCAAATACAAGGACTTCAACGGCAGCCGTCTAATTCCAGGCAATGTCAATACTTATACGCCGAGAAAGACCACATTAGAAGCCCCATTTGTTGCTTCAAAAGTGAGATTCTTTCCATACGCCGCACATCCTCGAACAGCTTGCATGCGTGTCGAAATCTACGGCTGCCGCTGGAAACAGGCCCTCGTCGCGTATTCAGCCCCTCAAGGAGGAGATATGCAAGCTATGACTGGTGGAGCACGTTTTGAAGATCTCTCATATGATGGTATCATGCGCAACGGCAATATGATAAATGGACTTGGTCAAATAACTGATTCCTTCATTGGTCCAAATGACTTCGAACTACCAGATCCCCTAGATACTCGGGGTACCCGTTGGGTCGGTTGGAATAAAACAATGTTAACCAATGATGAAATCAACATAACATTCAACTTTACCGACCCAAGACTTTTTCATTTCATGGACGTTCATACTAATAACATGTTTACAAAAGACGTGCAGTTGTTTAAAGAAGCTGAGGTGTACTTTTCTCTGGAAGGGGAACGTTGGCAAGAAGACTTTATTTCTTACGAGCCTAAACAAGACAGGGTCTCCGAACATGCTCGTACAATTCATATTGATCTAGAGAACAGGACAGCGAAACATATCAAGATGAAACTAAAGTTTCAACATGAATGGATCCTTATCAGCGAAGTCACTTTTAAATCTGTTCCAACAGTTATAAATTCATCGGCGGAATTTCTTGAGGAATATTATCGGAGCGACTCACCTCCATATTCAAGAAAGAAAAGAAATTCCTCTTTACGAGTTTCTGTGGGTCTTGCTTGCGGAGCCGTCGTTGGAGTCGGAGCATGTATTGCAGCCATAGCTGTGTTAATAACAAAGCGTACCAGAAGGAGGATACCGAATGTTTTGAAGAAGCCTTTTTGTCCTTCACTAAGAAATGCTAATGCTCGAAATAATCGCAGAGCCCCGAGGCTTGCTTTGGCGTTAGCAAATTGTCCACCAATTCACATGCTCCGGCCTGCTGTAGTTGACGAGGACTACAGGGAACCATACAATATATGGAGAGAAACACTTGGTCGACGAGATAAACAAGAGATTCATGATCAAAACGAATACAACGAAATTTGCGAGGATCCTGAGTTCCCCCGACCCTACATGATGAAAAGACCCGATCCGCATGAAAGTTTCTACGCCGCCACTGACATTATTCATCACACTTATCCCGACGAGCGAACTCTAAGACGCGAACGTCCTGCCCCAGGACTCTTCACGTCCATTAAACTACCTGAAGCCGAGCCCCCAAACGGAGTGGCGCCCCTGCTGGATTTCCCGCGGGGGCGCATGCGACCGGTTACTTTCCTAGGCGAGGGACAACATGGCACTCTCCAGATTTGCGAGACGGACGGCATCGAAGAATTGAACGACGAAGATACACCGATTGGCCATCGCCGCCGCCTTGTTGTCGTCAAGACGTTATGGCGCGGATGCCATAGCGATATCAAAGCGGCGTTTGCCCGCGAAGCTACATGGGGTGCGGGCCTGAAACATCCACAGCTCGCACGTGTGCTTGGATTGAGCCTTCTAGAACCCCCATGCGCGGCACTCGACCGCGGAGACGCTGTCCCTCTCCCGACGATACTGAAAATGGAACGGAGATTGAATTATTCGAGTTTGATTCACATATGCTGTCAAATAGCCAGCGGCATGAAGTATCTGGAATCGTTCGAGTTAGTACACCGGGATCTCGCGGCAAGAAACGTAACTGTGAGCGACGACCTCCATATTAAAATATCTGACTACGCTATGTTCTGCGAGGAATTTGTTGGTGACTACCATATTTTAGCTGACGGATCTCGCATCCCACTACGATGGATGGCCTGGGAAAGCTTATTATTGGGTGTCTTCTCACCAGCTAGCGATGTTTGGTCCTTTGGTGTAACCGTTTGGGAAGTGCTTACATACTGTTTGGTCAAGCCATTCGAGGAGATGAACGATGACCAGGTTGTAGCCAACGCTAACGAATGGCGTTCAGGCGGCCGTAATGCAAGAGTGCCGGCTGCTCCTCCACCACGCTGTCGCCGGGAACTGTACGATCTGATGCATGAGTGCTGGCGTCGAGAACCCATGCAACGCCCGCGCTTCCATGAACTCCACCGCTTTCTTGACCAGATGACCCACGGCTACAAACCACCTATTCGCCGCTGA

Protein sequence:

>DPOGS203914-PA
MQNGMISNESLAASSSYDQSVSPLSSRIRTEIRGGAWCPNGLISPRSRQYLEIDLHDEYLITATESQGRFANSVGVEFVESYSVEYWRNVLSRWVKYKDFNGSRLIPGNVNTYTPRKTTLEAPFVASKVRFFPYAAHPRTACMRVEIYGCRWKQALVAYSAPQGGDMQAMTGGARFEDLSYDGIMRNGNMINGLGQITDSFIGPNDFELPDPLDTRGTRWVGWNKTMLTNDEINITFNFTDPRLFHFMDVHTNNMFTKDVQLFKEAEVYFSLEGERWQEDFISYEPKQDRVSEHARTIHIDLENRTAKHIKMKLKFQHEWILISEVTFKSVPTVINSSAEFLEEYYRSDSPPYSRKKRNSSLRVSVGLACGAVVGVGACIAAIAVLITKRTRRRIPNVLKKPFCPSLRNANARNNRRAPRLALALANCPPIHMLRPAVVDEDYREPYNIWRETLGRRDKQEIHDQNEYNEICEDPEFPRPYMMKRPDPHESFYAATDIIHHTYPDERTLRRERPAPGLFTSIKLPEAEPPNGVAPLLDFPRGRMRPVTFLGEGQHGTLQICETDGIEELNDEDTPIGHRRRLVVVKTLWRGCHSDIKAAFAREATWGAGLKHPQLARVLGLSLLEPPCAALDRGDAVPLPTILKMERRLNYSSLIHICCQIASGMKYLESFELVHRDLAARNVTVSDDLHIKISDYAMFCEEFVGDYHILADGSRIPLRWMAWESLLLGVFSPASDVWSFGVTVWEVLTYCLVKPFEEMNDDQVVANANEWRSGGRNARVPAAPPPRCRRELYDLMHECWRREPMQRPRFHELHRFLDQMTHGYKPPIRR-