DPGLEAN22017 in OGS1.0

New model in OGS2.0DPOGS201775 
Genomic Positionscaffold1298:+ 24801-29592
See gene structure
CDS Length2622
Paired RNAseq reads  596
Single RNAseq reads  1429
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008597 (4e-139)
Best Drosophila hit  Insulin-like receptor, isoform D (3e-52)
Best Human hitinsulin receptor isoform Long preproprotein (4e-60)
Best NR hit (blastp)  insulin receptor (AGAP012424-PA) [Anopheles gambiae str. PEST] (1e-72)
Best NR hit (blastx)  PREDICTED: similar to insulin receptor [Apis mellifera] (1e-70)
GeneOntology terms















  
GO:0004714 transmembrane receptor protein tyrosine kinase activity
GO:0043434 response to peptide hormone stimulus
GO:0004672 protein kinase activity
GO:0004713 protein tyrosine kinase activity
GO:0005515 protein binding
GO:0005524 ATP binding
GO:0006468 protein amino acid phosphorylation
GO:0007169 transmembrane receptor protein tyrosine kinase signaling pathway
GO:0016020 membrane
GO:0016021 integral to membrane
GO:0046777 protein amino acid autophosphorylation
GO:0016301 kinase activity
GO:0016740 transferase activity
GO:0004872 receptor activity
GO:0000166 nucleotide binding
GO:0048589 developmental growth
GO:0009790 embryo development
InterPro families



  
IPR000494 EGF receptor, L domain
IPR006211 Furin-like cysteine-rich domain
IPR009030 Growth factor, receptor
IPR008957 Fibronectin type III domain
IPR013783 Immunoglobulin-like fold
Orthology groupMCL31185

Nucleotide sequence:

ATGGCAAATATAAAAAAGTTACAAAACTGCACAGTTGTGGTTGGGGATCTGATAATAACA
CTTCTAGAGAGAACTAAACCAAAAGACTTTCGGGATATAAGTTTTCCTAAATTAAAAGAG
GTTACAGGATTTATGGTTGTTTACCGAGTGTCGGGATTGGAGTCTCTGGGGGACCTCTTT
CCGAATTTAGCGAGAATCCGCGGTAACACGCTTCTATACAACTACGCGCTCATTGTTTAT
GACATGCCCCGTTTGAGAGAGATAGGTTTCTATAACCTGCTCAAAATCGATAGAGGCGGA
GTCATCATATGGGGCGGTAAACTTACTTGCTTCATTGATTCCATTGATTGGAACGTTATT
GCGCCCAAATCTCGTCACGTTCTCAGCATACCAGACAAAGGGACACGGTGCATGTTTGTT
TGCACTTGTACAAGAAACGCTGTCTCCAATCGCTGTTGGAATAATAAGAAATGCCAACGT
TTCCTTGAGGGTCCGGATGCAGAAGATTGTGATGTGAATTGCTTTGGATGCCGCAAGACC
AACCCGAAGAGCTGCACATTATGTAGGAACTACACCATCAATAATACATGTGTGAACCGC
TGCCCTAACAATACCATAATATTAACGGAGAGCAATTATTGCGTGACGATTGACGAATGT
AAACATTTAAATAGATTTGAATTCAATAATACATGCGTGGAAAAGTGTCCGAATAATTAT
GAAATGGTGACCATTGGAAGAGATACATCATGCAAACCCTGCGTTAATTGCGATAAGACT
TGTAAAAGCCTCATTATTCAAACATTGGCTTCCATACAAGCTACAGAAAAGTGTGTATAT
GTGAATGGCTCATTGACAATACACGTTAGATCAGTTCCCGGAGCGATGGATGAATTGAGA
TATTATTTGAAAAATATCAAAGAAGTTTCTGGATACATTCTAATTTATGGTTCTATTTCA
GTTACATCACTAGATTTTCTATCATCGCTAAAAAGTATCAAGGGCAATACACTATTAAAC
GGAAAGTATAGTTTAGTCGTTTACGATATGCAAAACCTTCAGATGCTATTTTCAGACAAT
GTTACCAAAAAACTTAAAATAAACAAGGGTTCAATGAGATTTTACCGAAACCCCATCCTT
TGTATGAGCCAAATCGAAAAGTTAAAGCCATTATTTCCGGTGGCTCCTAATGAAATTGAT
TTACCTCAGGGACTCAATGGTTATAGCGGGGGTTGTAAAGAAATAAATTTGGGTCTAAAA
ATTAACGTCAAGAATCAAACGTTTGCAGTTGCCACTTTTGATGGTGAGACTGGAACTGAC
GTGTTTTACACTATTTTATATATCGAAATATCTCACGATACAAAAGTGCCCATTGGACCG
GAAGCATGTAGTGAGTCAGAATGGAATGCTATAAGCGTTTCATATTCTTCAAATAGGCTA
ATTGAAGTTCCCCTACACTCTCTTCGACCGGCTTCGATGTATGCTGTTTGTATAGAAAAG
TATGAACCTTCCACACGTCATCTCGCTCGCAGTGCTATAGTAAATTTTACAACGCCACCT
GGTAAACCAGAGCCGCCATTCATAACAGAACTTGTGGCTTCTTCCTCTGACGTAGTTGTA
GTAAGATGGGTTGATCACAAAAACTATGAACGGCACATTACTAGATACGAGTTAGACGTG
TACTTAATAGAAAAGAATCAAAACCATATAAATACAAGAGATTATTGCCAAAATTATAAT
GATATTGATGAAATTGACTATTCACGTCACGCGAAAGTTATGAGACCACCGCGTAATTAT
GGAAAAGGTTGTGAAAGTATGTGCGGTATTTTATCATCTTTTACTTTTGGTGCAATGGTC
GATGAGTATTTTGATATATGCAATTCAATAAAAGGCTGTGAGAAAGAAGTGGATCGTCCT
AAAGTTGATTATATCAAAGGATTACTTAAAACGGTATCGTTAGACATTACTGCCCCAAGA
AAAGTTTATCAAATTGGAGGATTAGCACCTTTTAGAGATTATAGATTTCACCTTCGGGCT
TGTATTAAAGATTTGTGTAGCCGTTCTGCTAGAGAGGTAGTGCGGACCTTAAGGTTAGAA
AACATTGATATAGCCTCTATTACATTTACAAGCGCTGAAGAGAATGGTTTAATAGTCGTG
AACTGGGATCCACCGGCAATATCAAACGGAGTTATATTGTCATACACTGTGGAAATTTGT
CCAGATAATAATTTAAATGACATGAGTCATTTATTGCCTCAAGTTATGTGCGTTTTTGGA
AACGAGACAAGTCTCACAGTAAAATCTCATAAAGCAAATATTTATCTTATAAGAGTGTGT
ACAACGACGCTGGCTTATTCGTATGTTTGTAACAATTGGACTAAAGTGATGGTTATTCAA
CAAAATTATCTTTCCATATGGATTGGTGGTGTAGTCTTCGGAATATTACTGTGTGTTATA
TCCATAAAATTTGGATGGCACTGGAAACAAACTACTATCAAATCGGACGATATACCGTTG
GTAGACGCTACTTCTGCTAATCGCAATGAATCTGAACCACCAGCAATTATGATGTCGGAT
TTTATGCCACTGTATAGCATAGATTTTGGACATTCAGAATAG

Protein sequence:

MANIKKLQNCTVVVGDLIITLLERTKPKDFRDISFPKLKEVTGFMVVYRVSGLESLGDLF
PNLARIRGNTLLYNYALIVYDMPRLREIGFYNLLKIDRGGVIIWGGKLTCFIDSIDWNVI
APKSRHVLSIPDKGTRCMFVCTCTRNAVSNRCWNNKKCQRFLEGPDAEDCDVNCFGCRKT
NPKSCTLCRNYTINNTCVNRCPNNTIILTESNYCVTIDECKHLNRFEFNNTCVEKCPNNY
EMVTIGRDTSCKPCVNCDKTCKSLIIQTLASIQATEKCVYVNGSLTIHVRSVPGAMDELR
YYLKNIKEVSGYILIYGSISVTSLDFLSSLKSIKGNTLLNGKYSLVVYDMQNLQMLFSDN
VTKKLKINKGSMRFYRNPILCMSQIEKLKPLFPVAPNEIDLPQGLNGYSGGCKEINLGLK
INVKNQTFAVATFDGETGTDVFYTILYIEISHDTKVPIGPEACSESEWNAISVSYSSNRL
IEVPLHSLRPASMYAVCIEKYEPSTRHLARSAIVNFTTPPGKPEPPFITELVASSSDVVV
VRWVDHKNYERHITRYELDVYLIEKNQNHINTRDYCQNYNDIDEIDYSRHAKVMRPPRNY
GKGCESMCGILSSFTFGAMVDEYFDICNSIKGCEKEVDRPKVDYIKGLLKTVSLDITAPR
KVYQIGGLAPFRDYRFHLRACIKDLCSRSAREVVRTLRLENIDIASITFTSAEENGLIVV
NWDPPAISNGVILSYTVEICPDNNLNDMSHLLPQVMCVFGNETSLTVKSHKANIYLIRVC
TTTLAYSYVCNNWTKVMVIQQNYLSIWIGGVVFGILLCVISIKFGWHWKQTTIKSDDIPL
VDATSANRNESEPPAIMMSDFMPLYSIDFGHSE