Monarch geneset OGS2.0

DPOGS212041
TranscriptDPOGS212041-TA2745 bp
ProteinDPOGS212041-PA914 aa
Genomic positionDPSCF300054 + 73443-96674
RNAseq coverage299x (Rank: top 37%)
Annotation
HeliconiusHMEL0180690.089.86% 
BombyxBGIBMGA010090-TA0.080.33% 
Drosophilahtl-PC2e-14941.71% 
EBI UniRef50UniRef50_Q1JUB80.073.71%Fibroblast growth factor receptor n=2 Tax=Obtectomera RepID=Q1JUB8_BOMMO
NCBI RefSeqNP_001037558.10.073.71%fibroblast growth factor receptor [Bombyx mori]
NCBI nr blastpgi|989608410.065.27%fibroblast growth factor receptor [Spodoptera frugiperda]
NCBI nr blastxgi|989608410.065.81%fibroblast growth factor receptor [Spodoptera frugiperda]
Group
Gene OntologyGO:00047131.2e-132protein tyrosine kinase activity
GO:00046721.6e-91protein kinase activity
GO:00064681.6e-91protein phosphorylation
GO:00167723.6e-77transferase activity, transferring phosphorus-containing groups
GO:00055241.4e-43ATP binding
GO:00046741.4e-43protein serine/threonine kinase activity
KEGG pathwayaga:AgaP_AGAP0031081e-163 
 K05093 (FGFR2)maps-> Pathways in cancer
    Prostate cancer
    Endocytosis
    Regulation of actin cytoskeleton
    MAPK signaling pathway
InterPro domain[610-872] IPR0206351.2e-132Tyrosine-protein kinase, catalytic domain
[611-872] IPR0012451.6e-91Serine-threonine/tyrosine-protein kinase
[599-901] IPR0110093.6e-77Protein kinase-like domain
[610-876] IPR0022901.4e-43Serine/threonine-protein kinase domain
[370-483] IPR0137833.4e-17Immunoglobulin-like fold
[217-282] IPR0035981.1e-16Immunoglobulin subtype 2
[218-292] IPR0130983.8e-13Immunoglobulin I-set
[379-482] IPR0035999.6e-12Immunoglobulin subtype
Orthology groupMCL10319 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212041-TA
ATGAAGCAGGCCGCCATTGCGTTTTGGCGCGAAATGCCAATCGCTGTCACAATCGCATTCGTGGCTCTGGCATGTACGGTTTCAGCCAGAGACATTGTTTTGGATGATACATACACCGTAATACAAGCTGCAACAAATGAAAGGCTGCGTCTCATCTGTGGGCTGAAGCCACGAAGCCAGATGCCAGCTGTTAGGTGGTTCTTCGATGGAAAACCAGTGGACTACCAGCTGAGGCAAAGAGTAATGCAGCAGAAACAGTGGTTAAGAATAAAAGGGTTTCGCGCAAAAGACGCCGGTGTGTACACGTGCCAGAACGAAGAAGACAAAGGGAAGGAAATGAGTGTTGCCATCAAACACAAACCCGATCTTGGAACGGAAAATATTGACGAATACCAGTCCGATGTTGACGTTTTACGACCTATGCCTGAAATATTGGCACAGCATTCAGCACCATTGGTTGATAATAATACAATCGAAGAGAATACAACAGATCCTAGGAACAAATACAAAGAGTACGATACCAAAGACGATGATCTCGACGATAGGAATGAAAGGGAACATTTGAACTACGGCCACGTTGATGACACCAAATCAAAATATGCACCGAAGTTTAAACATCCGTCCAAAATGTTCAACATGGAGATGAAACCAGCTGGCAGTTCAATAAGATTCAAATGCGCTGCTGAAGGTAATCCAATGCCAAACATAACATGGTACAAAAATAGTGGAACGCCGATAGCCAGGAGCTACTTCAAACCGTCTTACGGTAAATGGTCCATGGCTTTAGATGAGCTCACAAAAGCTGATAATGGAAATTACACTTGCAAAGTTTGTAACGAACTGGGTTGCATTCAACACAAATACACACTCCACATACAAGAACGCTATCCATCGAAGCCGTACATAAAAGAAGGGCATCCTGGAAACATAACAGTGCTTGTGAATGAGACAGTACAATTGACATGTCCACCTGTATCAGACCTAGAACCTTATCTGTACTGGATCAGACCAACGAACTATTCTGTGAAAGACACAGAGGTTGGACCTAGCGATGCCCCAGCGCCTATAGGCGACGTTATTGAGGTTAAATTTGATTTAGAGCGTCTATACGCCAAGCCGGTGTTAACTCAGGCCGCGGTGAACCAGACCAAGTTAGTTGGAGAGACCGCAAAATTCAGTTGCGAATACCTCAGTGATCTACACCCGCTTGTGTATTGGATGTATTTCACAAAACACGAATATATATATAATGAAACAACAACGGATTCTAGCAATGAGTCCATTGTGTATGATGATATTACGAAAATTGTTACGAGTGAGAATCCAGAGGACAAACCCGAACAATTAACTATATATAATGTGACCAAAGAGGACGAGGGTTGGTACGTCTGTGTAGCTCTCAATACACTTGGCAACACGACTGCTAAGGGATACCTCACTGTTTTGGATTCTCTGCCCATCACAGAAGCGTTGGATCACGGCAAGCATACATTATTCATAAATATACTAACAGCGGTCCTTGGTGCTATGTTCTTTGTGGCGGCTATCATAGTAGTCATGATATTCAAGAAATTGAAACGTGAGAAAATCAAGAAGCAGTTGGCGATAGAAACAGCTCGAGCTGTCATAGTGACACATTGGACGAAGAAGGTGACAGTAGAGAAGCCGCAGATGAACGGAACGGAAAATACAGCTGAAGGACTGTTAATGCCAGTGGTGAAGATCGAAAAACAGAAGCTATCACAAGTCCAGAGCCCCTGTGACTCCATGTTGATGTCAGAATATGAACTGCCAATGGATATAGACTGGGAGGTGTCCAGGGACGCTCTTTGTATTGGAAAAGTTTTGGGGGAGGGAGAGTTCGGGAAAGTAGTGAAGGCAGAATGTCAAGGAATCGTGAAACCAGAAGGTCACACGGACGCGGAGATGATGGCATTAGTATCGGAGATGGAAATGATGAAGATGATCGGTAAACACGTGAACATTATCAACCTGCTGGGATGCTGTACGCAGGATGGACCGCTCTACGTCATAGTGGAGTATGCACCAAATGGAAACCTAAGAGAGTTCCTGAGAAATCACAGACCTGGTAACAGGTATGAATCACCAAACGAAGACCTGAAGGAAAAGAAAACTTTAACCCAAAAAGATCTTGTATCGTTCTCATATCAAGTAGCGAGGGGCATGGAGTATTTGGCTTCTCGGAGATGCATACACCGTGATCTGGCCGCTCGTAACGTGTTGGTATCAGATGATTGCGTACTGAAGATCGCGGATTTCGGTCTCGCTAAGGACGTGCAGTCCAACGACTACTATCGCAAGAAGACCGAGGGCAGACTCCCCGTACGATGGATGGCGCCAGAGAGCTTGTATCATAAAGTCTTCACAACGCAAACTGACGTGTGGTCATTCGGTGTACTGCTGTGGGAGATCATGACACTTGGTGGTACCCCGTACCCGACTGTCCCGGGACAGTATATGTACCAACATCTCAGCGCCGGCCATCGGATGGAGAAGCCGCCCTGTTGCAGTCTTGAAATTTACATGCTAATGCGAGAGTGTTGGTCCTTCTCCCCCGGCGACCGGCCTTCGTTTACAGAACTGGTTGAGGATTTGGACAAAATACTCACAGTAACGGCCAACCAGGAGTATTTGGATCTCGGTCTGCCGCAACTGGATACGCCCCCGTCCAGCTACGACGGCTCCGGCGACGAGAGTGACAGCGAGTTCCCTTTCATTAAATAA

Protein sequence:

>DPOGS212041-PA
MKQAAIAFWREMPIAVTIAFVALACTVSARDIVLDDTYTVIQAATNERLRLICGLKPRSQMPAVRWFFDGKPVDYQLRQRVMQQKQWLRIKGFRAKDAGVYTCQNEEDKGKEMSVAIKHKPDLGTENIDEYQSDVDVLRPMPEILAQHSAPLVDNNTIEENTTDPRNKYKEYDTKDDDLDDRNEREHLNYGHVDDTKSKYAPKFKHPSKMFNMEMKPAGSSIRFKCAAEGNPMPNITWYKNSGTPIARSYFKPSYGKWSMALDELTKADNGNYTCKVCNELGCIQHKYTLHIQERYPSKPYIKEGHPGNITVLVNETVQLTCPPVSDLEPYLYWIRPTNYSVKDTEVGPSDAPAPIGDVIEVKFDLERLYAKPVLTQAAVNQTKLVGETAKFSCEYLSDLHPLVYWMYFTKHEYIYNETTTDSSNESIVYDDITKIVTSENPEDKPEQLTIYNVTKEDEGWYVCVALNTLGNTTAKGYLTVLDSLPITEALDHGKHTLFINILTAVLGAMFFVAAIIVVMIFKKLKREKIKKQLAIETARAVIVTHWTKKVTVEKPQMNGTENTAEGLLMPVVKIEKQKLSQVQSPCDSMLMSEYELPMDIDWEVSRDALCIGKVLGEGEFGKVVKAECQGIVKPEGHTDAEMMALVSEMEMMKMIGKHVNIINLLGCCTQDGPLYVIVEYAPNGNLREFLRNHRPGNRYESPNEDLKEKKTLTQKDLVSFSYQVARGMEYLASRRCIHRDLAARNVLVSDDCVLKIADFGLAKDVQSNDYYRKKTEGRLPVRWMAPESLYHKVFTTQTDVWSFGVLLWEIMTLGGTPYPTVPGQYMYQHLSAGHRMEKPPCCSLEIYMLMRECWSFSPGDRPSFTELVEDLDKILTVTANQEYLDLGLPQLDTPPSSYDGSGDESDSEFPFIK-