Monarch geneset OGS2.0

DPOGS201775
TranscriptDPOGS201775-TA2622 bp
ProteinDPOGS201775-PA873 aa
Genomic positionDPSCF300404 + 24801-29592
RNAseq coverage148x (Rank: top 54%)
Annotation
HeliconiusHMEL0093970.054.25% 
BombyxBGIBMGA008597-TA2e-15947.28% 
DrosophilaInR-PC1e-5829.64% 
EBI UniRef50UniRef50_E2A5302e-7332.44%Tyrosine-protein kinase receptor n=2 Tax=Formicidae RepID=E2A530_CAMFO
NCBI RefSeqXP_320130.31e-7327.95%insulin receptor (AGAP012424-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3071854619e-7332.44%Insulin-like receptor [Camponotus floridanus]
NCBI nr blastxgi|3505368276e-8227.02%insulin-like receptor-like [Apis mellifera]
Group
Gene OntologyGO:00160202.9e-21membrane
GO:00071696.7e-10transmembrane receptor protein tyrosine kinase signaling pathway
GO:00055246.7e-10ATP binding
GO:00064686.7e-10protein phosphorylation
GO:00047146.7e-10transmembrane receptor protein tyrosine kinase activity
KEGG pathwayaga:AgaP_AGAP0124243e-73 
 K04527 (INSR)maps-> Aldosterone-regulated sodium reabsorption
    Insulin signaling pathway
    Adherens junction
    Type II diabetes mellitus
InterPro domain[277-386] IPR0004942.9e-21EGF receptor, L domain
[133-263] IPR0090302.7e-12Growth factor, receptor
[513-700] IPR0089573.7e-11Fibronectin type III domain
[143-262] IPR0062116.7e-10Furin-like cysteine-rich domain
[630-695] IPR0137838.6e-06Immunoglobulin-like fold
Orthology groupMCL25513 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201775-TA
ATGGCAAATATAAAAAAGTTACAAAACTGCACAGTTGTGGTTGGGGATCTGATAATAACACTTCTAGAGAGAACTAAACCAAAAGACTTTCGGGATATAAGTTTTCCTAAATTAAAAGAGGTTACAGGATTTATGGTTGTTTACCGAGTGTCGGGATTGGAGTCTCTGGGGGACCTCTTTCCGAATTTAGCGAGAATCCGCGGTAACACGCTTCTATACAACTACGCGCTCATTGTTTATGACATGCCCCGTTTGAGAGAGATAGGTTTCTATAACCTGCTCAAAATCGATAGAGGCGGAGTCATCATATGGGGCGGTAAACTTACTTGCTTCATTGATTCCATTGATTGGAACGTTATTGCGCCCAAATCTCGTCACGTTCTCAGCATACCAGACAAAGGGACACGGTGCATGTTTGTTTGCACTTGTACAAGAAACGCTGTCTCCAATCGCTGTTGGAATAATAAGAAATGCCAACGTTTCCTTGAGGGTCCGGATGCAGAAGATTGTGATGTGAATTGCTTTGGATGCCGCAAGACCAACCCGAAGAGCTGCACATTATGTAGGAACTACACCATCAATAATACATGTGTGAACCGCTGCCCTAACAATACCATAATATTAACGGAGAGCAATTATTGCGTGACGATTGACGAATGTAAACATTTAAATAGATTTGAATTCAATAATACATGCGTGGAAAAGTGTCCGAATAATTATGAAATGGTGACCATTGGAAGAGATACATCATGCAAACCCTGCGTTAATTGCGATAAGACTTGTAAAAGCCTCATTATTCAAACATTGGCTTCCATACAAGCTACAGAAAAGTGTGTATATGTGAATGGCTCATTGACAATACACGTTAGATCAGTTCCCGGAGCGATGGATGAATTGAGATATTATTTGAAAAATATCAAAGAAGTTTCTGGATACATTCTAATTTATGGTTCTATTTCAGTTACATCACTAGATTTTCTATCATCGCTAAAAAGTATCAAGGGCAATACACTATTAAACGGAAAGTATAGTTTAGTCGTTTACGATATGCAAAACCTTCAGATGCTATTTTCAGACAATGTTACCAAAAAACTTAAAATAAACAAGGGTTCAATGAGATTTTACCGAAACCCCATCCTTTGTATGAGCCAAATCGAAAAGTTAAAGCCATTATTTCCGGTGGCTCCTAATGAAATTGATTTACCTCAGGGACTCAATGGTTATAGCGGGGGTTGTAAAGAAATAAATTTGGGTCTAAAAATTAACGTCAAGAATCAAACGTTTGCAGTTGCCACTTTTGATGGTGAGACTGGAACTGACGTGTTTTACACTATTTTATATATCGAAATATCTCACGATACAAAAGTGCCCATTGGACCGGAAGCATGTAGTGAGTCAGAATGGAATGCTATAAGCGTTTCATATTCTTCAAATAGGCTAATTGAAGTTCCCCTACACTCTCTTCGACCGGCTTCGATGTATGCTGTTTGTATAGAAAAGTATGAACCTTCCACACGTCATCTCGCTCGCAGTGCTATAGTAAATTTTACAACGCCACCTGGTAAACCAGAGCCGCCATTCATAACAGAACTTGTGGCTTCTTCCTCTGACGTAGTTGTAGTAAGATGGGTTGATCACAAAAACTATGAACGGCACATTACTAGATACGAGTTAGACGTGTACTTAATAGAAAAGAATCAAAACCATATAAATACAAGAGATTATTGCCAAAATTATAATGATATTGATGAAATTGACTATTCACGTCACGCGAAAGTTATGAGACCACCGCGTAATTATGGAAAAGGTTGTGAAAGTATGTGCGGTATTTTATCATCTTTTACTTTTGGTGCAATGGTCGATGAGTATTTTGATATATGCAATTCAATAAAAGGCTGTGAGAAAGAAGTGGATCGTCCTAAAGTTGATTATATCAAAGGATTACTTAAAACGGTATCGTTAGACATTACTGCCCCAAGAAAAGTTTATCAAATTGGAGGATTAGCACCTTTTAGAGATTATAGATTTCACCTTCGGGCTTGTATTAAAGATTTGTGTAGCCGTTCTGCTAGAGAGGTAGTGCGGACCTTAAGGTTAGAAAACATTGATATAGCCTCTATTACATTTACAAGCGCTGAAGAGAATGGTTTAATAGTCGTGAACTGGGATCCACCGGCAATATCAAACGGAGTTATATTGTCATACACTGTGGAAATTTGTCCAGATAATAATTTAAATGACATGAGTCATTTATTGCCTCAAGTTATGTGCGTTTTTGGAAACGAGACAAGTCTCACAGTAAAATCTCATAAAGCAAATATTTATCTTATAAGAGTGTGTACAACGACGCTGGCTTATTCGTATGTTTGTAACAATTGGACTAAAGTGATGGTTATTCAACAAAATTATCTTTCCATATGGATTGGTGGTGTAGTCTTCGGAATATTACTGTGTGTTATATCCATAAAATTTGGATGGCACTGGAAACAAACTACTATCAAATCGGACGATATACCGTTGGTAGACGCTACTTCTGCTAATCGCAATGAATCTGAACCACCAGCAATTATGATGTCGGATTTTATGCCACTGTATAGCATAGATTTTGGACATTCAGAATAG

Protein sequence:

>DPOGS201775-PA
MANIKKLQNCTVVVGDLIITLLERTKPKDFRDISFPKLKEVTGFMVVYRVSGLESLGDLFPNLARIRGNTLLYNYALIVYDMPRLREIGFYNLLKIDRGGVIIWGGKLTCFIDSIDWNVIAPKSRHVLSIPDKGTRCMFVCTCTRNAVSNRCWNNKKCQRFLEGPDAEDCDVNCFGCRKTNPKSCTLCRNYTINNTCVNRCPNNTIILTESNYCVTIDECKHLNRFEFNNTCVEKCPNNYEMVTIGRDTSCKPCVNCDKTCKSLIIQTLASIQATEKCVYVNGSLTIHVRSVPGAMDELRYYLKNIKEVSGYILIYGSISVTSLDFLSSLKSIKGNTLLNGKYSLVVYDMQNLQMLFSDNVTKKLKINKGSMRFYRNPILCMSQIEKLKPLFPVAPNEIDLPQGLNGYSGGCKEINLGLKINVKNQTFAVATFDGETGTDVFYTILYIEISHDTKVPIGPEACSESEWNAISVSYSSNRLIEVPLHSLRPASMYAVCIEKYEPSTRHLARSAIVNFTTPPGKPEPPFITELVASSSDVVVVRWVDHKNYERHITRYELDVYLIEKNQNHINTRDYCQNYNDIDEIDYSRHAKVMRPPRNYGKGCESMCGILSSFTFGAMVDEYFDICNSIKGCEKEVDRPKVDYIKGLLKTVSLDITAPRKVYQIGGLAPFRDYRFHLRACIKDLCSRSAREVVRTLRLENIDIASITFTSAEENGLIVVNWDPPAISNGVILSYTVEICPDNNLNDMSHLLPQVMCVFGNETSLTVKSHKANIYLIRVCTTTLAYSYVCNNWTKVMVIQQNYLSIWIGGVVFGILLCVISIKFGWHWKQTTIKSDDIPLVDATSANRNESEPPAIMMSDFMPLYSIDFGHSE-