Monarch geneset OGS2.0

DPOGS206801
Transcript: DPOGS206801-TA (4341 bp)
Protein: DPOGS206801-PA (1446 aa)
Genomic position: DPSCF300001 - 4257026-4266469
RNAseq coverage: 500x (Rank: top 25%)
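The transcript and protein lengths in this record agree with each other if the transcript is read as coding sequence only. A minimal sketch of that arithmetic follows, assuming one codon per residue plus a single stop codon and no UTRs (the record does not state this explicitly):

```python
# Lengths copied from the record.
transcript_bp = 4341
protein_aa = 1446

# Assumption: the transcript is CDS only, so its length should be
# three nucleotides per residue plus one three-nucleotide stop codon.
expected_bp = 3 * (protein_aa + 1)

print(expected_bp == transcript_bp)
```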
Annotation
Heliconius: HMEL015796 | 0.0 | 79.13%
Bombyx: BGIBMGA000602-TA | 0.0 | 81.41%
Drosophila: Egfr-PB | 0.0 | 58.87%
EBI UniRef50: UniRef50_P0CY46 | 0.0 | 60.47% | Epidermal growth factor receptor n=15 Tax=Coelomata RepID=EGFR_APIME
NCBI RefSeq: XP_001602830.1 | 0.0 | 59.14% | PREDICTED: similar to epidermal growth factor receptor [Nasonia vitripennis]
NCBI nr blastp: gi|383865152 | 0.0 | 61.61% | PREDICTED: epidermal growth factor receptor-like [Megachile rotundata]
NCBI nr blastx: gi|383865152 | 0.0 | 61.68% | PREDICTED: epidermal growth factor receptor-like [Megachile rotundata]
Group
Gene Ontology:
    GO:0004713 | 3.8e-130 | protein tyrosine kinase activity
    GO:0006468 | 1.5e-88 | protein phosphorylation
    GO:0004672 | 1.5e-88 | protein kinase activity
    GO:0016772 | 1.4e-74 | transferase activity, transferring phosphorus-containing groups
    GO:0005524 | 8.4e-40 | ATP binding
    GO:0004674 | 8.4e-40 | protein serine/threonine kinase activity
    GO:0016020 | 9.6e-25 | membrane
    GO:0007169 | 9.6e-25 | transmembrane receptor protein tyrosine kinase signaling pathway
    GO:0004714 | 9.6e-25 | transmembrane receptor protein tyrosine kinase activity
KEGG pathway: nvi:100118970 | 0.0
 K04361 (EGFR, ERBB1) maps->
    Prostate cancer
    Cytokine-cytokine receptor interaction
    Regulation of actin cytoskeleton
    MAPK signaling pathway
    Gap junction
    Dorso-ventral axis formation
    Glioma
    Melanoma
    Pathways in cancer
    Endometrial cancer
    Focal adhesion
    ErbB signaling pathway
    MAPK signaling pathway - fly
    GnRH signaling pathway
    Adherens junction
    Pancreatic cancer
    Endocytosis
    Bladder cancer
    Calcium signaling pathway
    Non-small cell lung cancer
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain:
    [880-1136] IPR020635 | 3.8e-130 | Tyrosine-protein kinase, catalytic domain
    [883-1135] IPR001245 | 1.5e-88 | Serine-threonine/tyrosine-protein kinase
    [864-1188] IPR011009 | 1.4e-74 | Protein kinase-like domain
    [880-1137] IPR002290 | 8.4e-40 | Serine/threonine-protein kinase domain
    [515-638] IPR009030 | 1e-26 | Growth factor, receptor
    [196-347] IPR006211 | 9.6e-25 | Furin-like cysteine-rich domain
    [362-490] IPR000494 | 1.6e-23 | EGF receptor, L domain
    [506-557] IPR006212 | 3e-09 | Furin-like repeat
Orthology group: MCL10382 | Multiple-copy universal gene
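The InterPro coordinates listed above are residue ranges on the 1446 aa protein. A small sketch of how they could be parsed and sanity-checked follows; the inclusive, 1-based reading of the ranges is an assumption, and the `hits` string is a hand-copied subset of the record, not a format the database is known to export:

```python
import re

# Start-end ranges and InterPro IDs copied by hand from the record.
hits = """
[880-1136] IPR020635
[883-1135] IPR001245
[864-1188] IPR011009
[880-1137] IPR002290
[515-638] IPR009030
[196-347] IPR006211
[362-490] IPR000494
[506-557] IPR006212
"""

protein_aa = 1446  # protein length from the record

# Capture start, end, and InterPro accession from each line.
pattern = re.compile(r"\[(\d+)-(\d+)\]\s+(IPR\d{6})")
domains = [(int(s), int(e), ipr) for s, e, ipr in pattern.findall(hits)]

# Assumption: ranges are 1-based and inclusive, so length = end - start + 1.
lengths = {ipr: end - start + 1 for start, end, ipr in domains}

# Every range should fall inside the protein.
in_bounds = all(1 <= start <= end <= protein_aa for start, end, _ in domains)

# List domains in order along the protein.
for start, end, ipr in sorted(domains):
    print(f"{ipr}\t{start}-{end}\t{lengths[ipr]} aa")
```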
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206801-TA
ATGTTGGCTGGCCGCAGCTATCTGTGGTGGCTGTGCGTGTGCGCGTGCGCGACGGTGCTGGACGCGAGGCGCCTGCACAACGACGCGAACGCCAGGCACAAACATTCGGAATTTGTCAAGGGAAAAATATGCATCGGTACCAATGGGCGGATGTCGGTGCCTTCGAATCGAGATACTCACTACAGAAACCTACGCGATCGATTCACCAACTGCACCTATGTCGATGGTAACTTAGAATTGACGTGGCTCGAGAACGAAACGATGGACCTTTCATTCCTACAACACATAAGAGAAGTGACTGGTTATGTTCTCATTTCGTATGTGAAAGTGGCTCAAATCATATTACCAAAACTACAAATTATTCGTGGAAGAACACTTTTCAAACTTAATGTTAGAGTTGAGGAATTTTCCCTTTTGGTAACTATGTCTTCGGCTCTTACACTGGAACTTCCTGCTCTTAGAGATGTCCTTCGTGGCAGTGTGGGCATCTATAATAATTACAACCTTTGTCATGTTAAAACCATTAATTGGGATGAAATTATTACTGGTGTTAATGCAACATACGTATTTGTATACACCTTTAATGTACCAGAAAGAGAGTGTCCGCCGTGTCATCCTAGTTGTGAGGCAGGTTGCTGGGGAGAAGGAATACATAACTGTCAGAAGTTCTCAAAAACAAACTGTAGTCCTCAATGTGAAGACGGAAGATGTTTTGGCCCAAACCCTCGGGATTGCTGCAATACTTTCTGTGCTGGGGGGTGTAAAGGTCCTCTCCCTAGTCAATGCCTTGCTTGCCGAAACTTTTATGATGAAGGAACATGTTCCCAAGAATGCCCACCAATGCAAAAATATAACCCAACAACTTATTCTTGGGAACCAAATCCTAACGGAAAATATGCATACGGTGCTACTTGTGTTAGGAATTGTCCAGAACACCTATTAAAAGACAACGGAGCCTGCGTTAGGAGCTGTCCCCCTAACAAGACAGCAGTGAATGGTGAATGTATACCATGCAATGTGACTTGCCCAAAAACTTGCCGAGCTGAAAAACCTATCCACTCCGGAAATATCGATAGTTTTAAGGATTGCACTATTATAGACGGATCGATAGAAATTCTTGAAATGACTTTTACTGGGTTTCAACACGTTAACGCTGATTACTCTTTCGGAGATAGATATCCAAAAATGGAACCTGATGCTTTGGAAGTATTTAGCACTGTACGTGAAGTTACCGGTTACTTAAATGTTCAAGCTCATCATCCAAACTTCACAAGTCTCTCGTACTTTAGAAATCTAGAAGTTATAGGCGGGCGTCAAGTTGTAGAAAACTTATTTGCATCTCTTTATATTGTAAAGACCTCTTTAAAATCTCTGGGACTTAAATCTCTCAAAAGAGTGAACTCTGGTGCTATAGCAATCATGGAAAATCGACAACTATGTTTCGCTGATAAAATAGCATGGAATAAGCTGGGCAAATCAAAAGATCATAAGCAAATAATTCAGAAAAATGGGGATCAAAGAAATTGTGAAAAATTGAATTTGGTGTGCGATCCTCAATGCTCAGCGGATGGCTGTTGGGGTCCCGGGCCGGATCAATGTTTATCATGTGAAAATTATAAATTTGGAGAAACTTGTGTTCAAAATTGCTCCGTACTTCCTGGACTTTACAAAGGCGGACCAAAAGTTTGCAGACAATGTCATGCTGAGTGTTTAGACGGGTGCTCTGGTCCAACAAGAGCAAATTGTACTAAGTGCAAACACGTACGGGATGGACCCTACTGTTTCGCGGAGTGTCCAAAGTCTAGATACACAACAGGAAACGGCACCTGCTTGCCATGCCATCAAAATTGTTTTAATGGATGCACTGGTCCTGAAAATATTGTTGGCGAAGGAGGATGTAATTCCTGTAAGAAGGCTATAATAAGTGTTGAAGCCACTGTTGCTAGTTGCCTTAAAGAAGATGAACCATGCCCTGAAGGGTATTATAACGAATGGGTTGG
CAATGTAAAACCTCTTGAAGGAAAAGTAAAAGTTGTTTGTCGTAAATGTCATCCATTATGTTATAGATGTACTGGTTATGGCATACACGAGCAGGTATGTCAAGTGTGCAACGGATTTAAGAGAGGTGACCAATGTGAACATGAATGTCCACCAGATCATTATACAGATAAAGCCAACCGTCTCTGCACACCATGTGATCCAGAATGTCGAGAATGTACAGGGCTAACAGCAAAGGACTGTATAAAATGTAATAATCTTAAAGTATTTCTCGGAGATGATAATGCTAGCCCATTTAGCTGCACTGATAAATGCCCAAGCGAGAAACCACACAAAATATATTTTGACGAACTTCTTCAAAACCCTATAGATGAACCATACTGTTCGGCTTCGTCAAATGGTCTTCCTAATATGGCTACTGCAAAAATCCCCACAGTCCTAGTTATAATATTCATTATAGCTTTTATCCTTCTCATCATTCTTGCCATAATTGGTTATACATGCAGACAGAAAGCCAAAGCAAAGAAAGAAGCCGTAAAAATGACTAGGGTACTCACTGGATGTGAAGATAATGAACCACTGAGACCAACTAACGTCAAGCCGAACCTTGCAAAATTACGAATCGTAAAGGAAGCAGAGTTACGTCGAGGAGGAATGCTTGGTTTTGGTGCTTTTGGTAAAGTTTACAAAGGCGTTTGGGTACCAGAAGGAGAAAATGTGAAAATTCCAGTAGCCATAAAGGTTTTGAAAGAAGGAACAGGAGCCAACACAAGTAAAGAATTTTTAGAAGAAGCTTATATTATGGCAAGCGTGGAACATCCCAATTTATTACAATTGCTGGCAGTATGTATGACCAATCAAATGATGCTTATCACTCAATTGATGCCTCTTGGCTGTTTACTTGACTATGTTAGGACTCACAAAGAAAAAATTGGATCAAAAGCATTTCTAAATTGGTGTACACAAATTGCTCGTGGTATGGCATATTTAGAAGAAAAAAGATTGGTTCATAGAGATTTGGCAGCTAGAAATGTTCTAGTGCAAACACCCAATTGTGTTAAAATTACCGATTTCGGACTTGCCAAATTACTTGATATCAATGAAGATGAATATAAAGCAGCAGGCGGAAAAATGCCCATAAAATGGTTGGCTCTAGAGTGTGTTCAACACAGAATATTCACTCACAAAAGCGATGTTTGGGCTTTTGGTGTTACTATATGGGAAATTCTCAGCTACGGAGCAAGACCATATGCAAATATTTCAGCTAGAAATGTTCCTCAATTAATTGAAAATGGTTTAAAATTACCTCAACCAAGTATTTGTACATTGGATATTTATTGCATTATGGTCTCGTGCTGGATGTTGGACGTCGACAGCCGACCAACGTTCAAGCAACTAGCTGAAACTTTCGCTGAAATGGCTCTTGATCCTGGACGTTATCTTGTAATACCTGGAGACAAATTTATGCGACTGCCATCCTATTCACATCAGGATGAAAAGGAACTAATAAGGACTCTTTCTTCGGGAATGGAAGGACCTGAACCTCTAGTAGAAGCGGATGAGTATTTACAACCACAACTAAAGCCTTCAGTGCCCGGTACGGCGACAACCAATTCTACTGCTACCGTGACTCTTGAAACAAACAGACACACTACAAAGCCCTGTACTTCTTCCTCCTGGACTAATAACGCAACCGGACAAGAAAATGTTACTACGGACAATTCCAATAGGCAAGAATGGGATCAAGATGCGTTGCATTATAATAATTCTTCAAATGCAGATGGAACAGAACTTCGACACTATTATAATAATGGTGTTTGTGGGTCTGAAAATACCAGTTCCCGTTACTGCAGTGACCCAATGAGAAGTCGACCTGACTGTGCCGAGACGAAGTTCGATAGTATGAGTAAAGTCAGAGAAGCACACGTAGGAAATCTGAAATTAAACTTGCCTCTTGATGAAGACGATTACTTAATGCCATCCCCCCAACAAAATCAAACAA
CTTCTACCTATATGGATTTAATAGCTGAGGGAGCAGAAGGACAAGAAAACCAGGATTTACGTTACAGTGGTTTCCTTGCATCTAAGCGATGCATTGATAATCCGGAGTATTTGATGTCTGATCAGAACGTTCCTCAGCAAACCCTGGGAATACCCACAGAGCCAGTGGCGCCGGAGTCGCTCCCAAGTGAAACCAGTGAGAGTGTCAATACGCCACAGCCAGGTCCTAGCCGATATGCACCGCAGCGGTCTGTAGAGGAAGAGTCCATGTCGGACCACGAATACTACAATGACCTTCAACGCGAGCTGCAGCCGCTACGCCGGAACGAAACAACCGTCTGA

Protein sequence:

>DPOGS206801-PA
MLAGRSYLWWLCVCACATVLDARRLHNDANARHKHSEFVKGKICIGTNGRMSVPSNRDTHYRNLRDRFTNCTYVDGNLELTWLENETMDLSFLQHIREVTGYVLISYVKVAQIILPKLQIIRGRTLFKLNVRVEEFSLLVTMSSALTLELPALRDVLRGSVGIYNNYNLCHVKTINWDEIITGVNATYVFVYTFNVPERECPPCHPSCEAGCWGEGIHNCQKFSKTNCSPQCEDGRCFGPNPRDCCNTFCAGGCKGPLPSQCLACRNFYDEGTCSQECPPMQKYNPTTYSWEPNPNGKYAYGATCVRNCPEHLLKDNGACVRSCPPNKTAVNGECIPCNVTCPKTCRAEKPIHSGNIDSFKDCTIIDGSIEILEMTFTGFQHVNADYSFGDRYPKMEPDALEVFSTVREVTGYLNVQAHHPNFTSLSYFRNLEVIGGRQVVENLFASLYIVKTSLKSLGLKSLKRVNSGAIAIMENRQLCFADKIAWNKLGKSKDHKQIIQKNGDQRNCEKLNLVCDPQCSADGCWGPGPDQCLSCENYKFGETCVQNCSVLPGLYKGGPKVCRQCHAECLDGCSGPTRANCTKCKHVRDGPYCFAECPKSRYTTGNGTCLPCHQNCFNGCTGPENIVGEGGCNSCKKAIISVEATVASCLKEDEPCPEGYYNEWVGNVKPLEGKVKVVCRKCHPLCYRCTGYGIHEQVCQVCNGFKRGDQCEHECPPDHYTDKANRLCTPCDPECRECTGLTAKDCIKCNNLKVFLGDDNASPFSCTDKCPSEKPHKIYFDELLQNPIDEPYCSASSNGLPNMATAKIPTVLVIIFIIAFILLIILAIIGYTCRQKAKAKKEAVKMTRVLTGCEDNEPLRPTNVKPNLAKLRIVKEAELRRGGMLGFGAFGKVYKGVWVPEGENVKIPVAIKVLKEGTGANTSKEFLEEAYIMASVEHPNLLQLLAVCMTNQMMLITQLMPLGCLLDYVRTHKEKIGSKAFLNWCTQIARGMAYLEEKRLVHRDLAARNVLVQTPNCVKITDFGLAKLLDINEDEYKAAGGKMPIKWLALECVQHRIFTHKSDVWAFGVTIWEILSYGARPYANISARNVPQLIENGLKLPQPSICTLDIYCIMVSCWMLDVDSRPTFKQLAETFAEMALDPGRYLVIPGDKFMRLPSYSHQDEKELIRTLSSGMEGPEPLVEADEYLQPQLKPSVPGTATTNSTATVTLETNRHTTKPCTSSSWTNNATGQENVTTDNSNRQEWDQDALHYNNSSNADGTELRHYYNNGVCGSENTSSRYCSDPMRSRPDCAETKFDSMSKVREAHVGNLKLNLPLDEDDYLMPSPQQNQTTSTYMDLIAEGAEGQENQDLRYSGFLASKRCIDNPEYLMSDQNVPQQTLGIPTEPVAPESLPSETSESVNTPQPGPSRYAPQRSVEEESMSDHEYYNDLQRELQPLRRNETTV-
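
The two sequences above can be spot-checked against each other by translating the ends of the transcript. The sketch below is an illustration only: it assumes the standard genetic code, includes just the codons needed for the first 24 nt and last 12 nt of DPOGS206801-TA, and writes the stop codon as "-" to match the trailing character of the protein record.

```python
# Standard genetic code, restricted to the codons used in this example.
CODON_TABLE = {
    "ATG": "M", "TTG": "L", "GCT": "A", "GGC": "G",
    "CGC": "R", "AGC": "S", "TAT": "Y", "CTG": "L",
    "ACA": "T", "ACC": "T", "GTC": "V",
    "TGA": "-",  # stop codon, written as "-" as in the protein record
}

def translate(nt: str) -> str:
    """Translate a nucleotide string codon by codon in frame 1."""
    if len(nt) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))

cds_start = "ATGTTGGCTGGCCGCAGCTATCTG"  # first 24 nt of DPOGS206801-TA
cds_end = "ACAACCGTCTGA"                # last 12 nt of DPOGS206801-TA

print(translate(cds_start))  # compare with the start of DPOGS206801-PA
print(translate(cds_end))    # compare with the end of DPOGS206801-PA
```

A full check would need the complete 64-codon table and the whole transcript joined into one string.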