Monarch geneset OGS2.0

DPOGS207149
Transcript        DPOGS207149-TA  2619 bp
Protein           DPOGS207149-PA  872 aa
Genomic position  DPSCF300001 + 4213708-4226693
RNAseq coverage   291x (Rank: top 38%)
Annotation
Heliconius      HMEL011796        0.0     66.04%
Bombyx          BGIBMGA000587-TA  0.0     79.25%
Drosophila      Ddr-PA            4e-108  40.60%
EBI UniRef50    UniRef50_E0W014   0.0     48.01%  Discoidin domain receptor, putative n=22 Tax=Neoptera RepID=E0W014_PEDHC
NCBI RefSeq     XP_001946164.1    0.0     51.31%  PREDICTED: similar to Discoidin domain receptor CG33531-PA [Acyrthosiphon pisum]
NCBI nr blastp  gi|350406222      0.0     54.59%  PREDICTED: discoidin domain-containing receptor 2-like [Bombus impatiens]
NCBI nr blastx  gi|350406222      0.0     54.74%  PREDICTED: discoidin domain-containing receptor 2-like [Bombus impatiens]
Group
Gene Ontology  GO:0004713  6.3e-86  protein tyrosine kinase activity
               GO:0004672  5.3e-66  protein kinase activity
               GO:0006468  5.3e-66  protein phosphorylation
               GO:0016772  4.1e-62  transferase activity, transferring phosphorus-containing groups
               GO:0005524  4.6e-27  ATP binding
               GO:0004674  4.6e-27  protein serine/threonine kinase activity
               GO:0007155  3.9e-21  cell adhesion
KEGG pathway 
InterPro domain  [592-864]  IPR020635  6.3e-86  Tyrosine-protein kinase, catalytic domain
                 [593-864]  IPR001245  5.3e-66  Serine-threonine/tyrosine-protein kinase
                 [564-864]  IPR011009  4.1e-62  Protein kinase-like domain
                 [21-179]   IPR008979  2.1e-51  Galactose-binding domain-like
                 [592-864]  IPR002290  4.6e-27  Serine/threonine-protein kinase domain
                 [20-177]   IPR000421  3.9e-21  Coagulation factor 5/8 type, C-terminal
Orthology group  MCL15555  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207149-TA
ATGCTACAGGTGTTCCAGTTGATAAAAGCCGCTGAGGAACACTCGCTCGACGGAACTCAATGCATCGCACCGCTCGGCATGGAGAGCGGTCTGATCCCGGATCGAGACCTAACCGCTAGCTCCTCCTTCGACGATGGCAACGTGGGACCGCAGAATGGACGGTTAAATCAAGAGACAAAAGGCGGTGCTTGGTGCCCTAAGCCGCAGATCACGACGGAATCCACGGAGTGGCTGGAGATAGATCTTCATACGGTACACGTTATCACGGGCACGGGTACACAAGGGCGCTTCGGTAACGGACAGGGGGCTGAGTACGCGGAGGCCTACGTGCTAGAGTATTGGAGGCCGAAACTAGGGAAGTGGGTTCGATACAGGGGCATCGATGGACAAGAGATCCTGACTGGCAACACGAACACCTACCTGGAGAAGAAGCATCACCTAGAGCCGCCTGTGTGGGCCTCAAAGATCCGTTTCATTCCATACAGCTCCCACCGCCGTACCGTCTGCATGCGGGTAGAACTCTACGGATGCTACTGGAGTGTGGGGATAGTATCGTATTCGATGCCGAAGGGCGATCAGAGGAGCAATGGAGTCGAGTTGACGGATATTATATACGACGGCCAGTGGGGGGAGGAACTCAGGGGGGGTCTGGGTCAACTGGTCGATGGACAGTTCGGGGGAGATGAAATCAGAGAGGCCTCGAAGAACTCCGTGTGGGTCGGATGGAGGAACGATTCGAGGCCAACCCCTCCAGTTATGATATTCGAGTTCGACAAAGTGAGAGAATTCTCCGCCGTCCACCTATACTGCAACAACAAGTTCATGAGAGACGTGCAGGTGTTCTCGGAGGCTATAATATCGTTTTCGATAGGCGGCCGTCAGTTCCAGCCGGAGCCGGTCAGGTTCACTACAATAGAGAACAACATCTTCGAGAGCGCGAGGAACGTCACCATCAAACTGCACCACCGCATAGGACGCTGGGTCAGAATCGAGCTAAGGTTCGCCGCTACTTGGATACTCATCAGCGAAGTTGTCTTCGACTCCGATGTCGCTCAAGGGAATTACACTCCGGAGAGTCAAAAGCCGGGAGTGAAAGATAAAACGAAGTTACCAGCCACGAAAGAGCTGCCGATATCGACAGCACACACAGAAGATCCGTTGTACATGGCTATCGTAGTGGGAGTTCTAACAGCACTGGTCATACTTCTAGCCGTAGCGGTATTCTTTATAATACATCGCCATAGACATAGGAAGTGCTTCGCGTCTCCTCTAGCTAAGACAACGATAGCCCAACGGACAGTGTCAGAGAATTACAACCCGTGTACCTCGATGCTGCCGGAGAACAAGATTGTTATAGACTGTCCATTGGACGTGAAGTCTGATGAATACCAGGAGCCGTACCAGGCATTAAAATGCGCTCCGTACTTCAGTTACAGTACTGTCCTGTTAGAGATGAGGGACTTCGTTAAGGATTCCAATGCAGCGCTATCTGATAGTTCAAACTACGATTACGCAGTGCCCGAACTAAGCTCAGCGCCGCTTCTGACGAAGCGATTAGACACGCTCTCGGACCGAGCCACCGAGCTGGTGTCTGACGTGGAACACCTGGAACAGCTGGGACACCTGGAACACATGGACGCGATGGACTGCGAACTGAACCGATCAGTGAACTCTATACGGTCCAAGCATAGCAATAGGAGTCAGCAAGAGATGCTTATCGAGTTGAAGCGTCGTCTGGAGAGTACCGAGGTGATAGAGTTCCCGAGACATAGACTCCGCATGATATCCAAACTAGCTGAAGGAGCGTTTGGAACTGTTTACGTGGCGGAGGCTGACGGCGTCCCGGAGTACGAGGGAGGCGTCTGCTCAGAGAAGAGATTAGTTGCTGTTAAATTCCTATGCAGCGACGCGTCACTCAACGAGAGGTATATCATGGAGGAGTTTGAGCGTGATGTTCGTATATTGGCCGCATTGTCATCGCCGTACCTGGCCCGTGTGTT
GGGAGCGTGTCGGTCACCGCCGCTAGCTGTAGTGCTCGAGTATCTCGAACTGGGTGACCTCTGTGCGTTCCTACGAACGACATCACCTCCGAGCACGAGCACGTTGCTCCACATCGCCACGCAGATCGCTTCCGGAATGCACTATTTAGAGTCGCTGAACTTCGTGCACAGAGATCTGGCGACCAGGAACATCCTGATAGGTAAGAACTATCAGATCAAGATCAGCGACTTCGGAACGGACAACGAGTCGTACAGCTGCGATTACTATAAGGTGGACGGTCGTATACCTCTGCCACTTCGCTGGGCCGCCTGGGAGTGTGTACTGCGGGGCGAGTATACCACCAAGACGGACGTGTGGGCCTTCGCTGTAACACTGCACGAGATATTCTCACTGTGCCGTCGACGACCGTACGAGCAGTTCACCGACGCCGAGGTGTTAGAGAACCTGTCCCACTTGGAGGCGGACGACGGTCTGTTCACATACATACCCCGCAGCCCGGGCTGTCCTCGCTCGTTGTATGACGCGATGCGTGCGTGTTGGAGACGTCGTGACGCCGACAGGCCGACCTTCAGCGAGCTCCACTCCTTCCTCCAGCGGACCGGTCGCGGGTACTCGTGA

Protein sequence:

>DPOGS207149-PA
MLQVFQLIKAAEEHSLDGTQCIAPLGMESGLIPDRDLTASSSFDDGNVGPQNGRLNQETKGGAWCPKPQITTESTEWLEIDLHTVHVITGTGTQGRFGNGQGAEYAEAYVLEYWRPKLGKWVRYRGIDGQEILTGNTNTYLEKKHHLEPPVWASKIRFIPYSSHRRTVCMRVELYGCYWSVGIVSYSMPKGDQRSNGVELTDIIYDGQWGEELRGGLGQLVDGQFGGDEIREASKNSVWVGWRNDSRPTPPVMIFEFDKVREFSAVHLYCNNKFMRDVQVFSEAIISFSIGGRQFQPEPVRFTTIENNIFESARNVTIKLHHRIGRWVRIELRFAATWILISEVVFDSDVAQGNYTPESQKPGVKDKTKLPATKELPISTAHTEDPLYMAIVVGVLTALVILLAVAVFFIIHRHRHRKCFASPLAKTTIAQRTVSENYNPCTSMLPENKIVIDCPLDVKSDEYQEPYQALKCAPYFSYSTVLLEMRDFVKDSNAALSDSSNYDYAVPELSSAPLLTKRLDTLSDRATELVSDVEHLEQLGHLEHMDAMDCELNRSVNSIRSKHSNRSQQEMLIELKRRLESTEVIEFPRHRLRMISKLAEGAFGTVYVAEADGVPEYEGGVCSEKRLVAVKFLCSDASLNERYIMEEFERDVRILAALSSPYLARVLGACRSPPLAVVLEYLELGDLCAFLRTTSPPSTSTLLHIATQIASGMHYLESLNFVHRDLATRNILIGKNYQIKISDFGTDNESYSCDYYKVDGRIPLPLRWAAWECVLRGEYTTKTDVWAFAVTLHEIFSLCRRRPYEQFTDAEVLENLSHLEADDGLFTYIPRSPGCPRSLYDAMRACWRRRDADRPTFSELHSFLQRTGRGYS-
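
The transcript and protein lengths in this record are consistent with each other: 872 residues plus one stop codon correspond to 3 x 873 = 2619 bp. The following is a minimal Python sketch of that check, assuming the standard genetic code applies to this nuclear gene. The function names `translate` and `expected_cds_length` are hypothetical helpers introduced here for illustration and are not part of the database. The sketch writes a stop codon as `*`, whereas the protein sequence above ends in `-`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard table is the appropriate one for this nuclear gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(n_residues: int, has_stop: bool = True) -> int:
    """Length in bp of a CDS encoding n_residues, optionally with a stop codon."""
    return 3 * (n_residues + (1 if has_stop else 0))


if __name__ == "__main__":
    # First 18 bases and last 12 bases of DPOGS207149-TA, copied from above.
    print(translate("ATGCTACAGGTGTTCCAG"))
    print(translate("GGGTACTCGTGA"))
    print(expected_cds_length(872))
```

Passing the full DPOGS207149-TA sequence to `translate` in the same way would allow a residue-by-residue comparison against DPOGS207149-PA, after replacing the trailing `*` with `-`.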