DPGLEAN21438 in OGS1.0

New model in OGS2.0DPOGS212132 
Genomic Positionscaffold1706:- 14261-21959
See gene structure
CDS Length3048
Paired RNAseq reads  1796
Single RNAseq reads  4880
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006582 (0.0)
Best Drosophila hit  ia2, isoform C (7e-140)
Best Human hitreceptor-type tyrosine-protein phosphatase N2 isoform 1 precursor (8e-125)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC012616 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC012616 [Tribolium castaneum] (0.0)
GeneOntology terms


  
GO:0005886 plasma membrane
GO:0005001 transmembrane receptor protein tyrosine phosphatase activity
GO:0006470 protein amino acid dephosphorylation
GO:0004725 protein tyrosine phosphatase activity
InterPro families



  
IPR000242 Protein-tyrosine phosphatase, receptor/non-receptor type
IPR000387 Protein-tyrosine/Dual-specificity phosphatase
IPR021613 Protein-tyrosine phosphatase receptor IA-2
IPR016130 Protein-tyrosine phosphatase, active site
IPR003595 Protein-tyrosine phosphatase, catalytic
Orthology groupMCL12459

Nucleotide sequence:

ATGAGCCGTACGTGCTATCGGCAGGCTCTGTGGGCGCTGGCGGTGATATGTACGCTGGCG
CCCTCGAATGCCGACGGGAATATTGGTTGTTTGTTTAGCTCGTCCTTATGCATTGACGGA
GCTGAGTGGTGCTATGATGATTTCGCATTTGGAAAATGCATTCCGATCTATGACAACGAT
CCCGAAGAGGGGTCACTTTACCAATATGATATGAGCTCTACGCAGCTACAGTGGTTCGAG
AGGGAGTTGCAACAACTAGCAGCCCAAGGCTATCGCTGGGAGCACGCATTCACGCAATGC
ATGCTGCAAAGTATGTTGTACGCTCTACGACATCACCTGGATCCAAATCAAGTTAACTCA
AAGCTGTGTGAGCATTTCGCGGATCCCAAGCTAAGTGCGGGAGTAACAAACGTTGGAGAC
GAAACGTTGGATGCTAACTCTGATGAAACAGCGTACATAAGATTCGTACCAAACACTAAA
TTGATCGACTCAGATTATGCAAATGAAGTATATAATCCACCCTTACTTGACGATGAAGAA
CCTAAAGGCGATGATTCATCTATGAAAATTGAAGATTTAGAACCTGTTGAAGACACAAAC
GATAAAATACGAAATATAATGCTAATGAGTGGGGTCGAATCACCTGTGATTGTACCATTT
GAAGGCTTTAGAGAACGCTTACAAGCTGAAGAGGAAGCACGACATCATGTACCTAAGGAC
AGTATCATTAATTATGATAAAGAAAAAAAATCCTTGAACGCAAATAACAATCAAGAAAAG
CCAGTTAATGAACTCCCAGATGAAGAACGGTTATTGGCTCATTTTCGTAAATACAAAATA
AAACCACCGCCCTTCACAGCGGAATACTTGACTGCTAACAGATTCTCACCATTGGATGAA
GAAATAAGATCGAATGCTCTGGAAAAATACAAGCAAAGCTTTCTAGAAAAGAACTTCCCT
TTTGAATATGAAAACCCAGAAGATCTTTCAGAAGCAAGGAGCTATGTAGAAACCCCACCT
AGCGAATTAAATGGAGACGAAGGAACAACTAATGAAAAGGAAAACAATTCCAAAGAAATA
AACCCAAAGAACATGGAATATTTAATGAACTATTGGCGTGAAATTGTTGGTGCAAAATTA
AAACCTCAAGAAAATTTATATGCTGAAGGAGGTCCATTAAAAACGGATGAACTGCAAGGT
GAAAATTCAAAATTTTATTTATCTCAAGATTTGCAAGACTTAGTTAATAGAGAATGGGGA
TTTAAGCGTAGGGAAAGAGATGATGTTAAAAAGCCTGGGCCGCGTGTGGACGCAAAAGCA
TTAAAGATTTTATACAGCAATAAATCAGTGACAGCCCAATCATCTAATCAGAATCAGATT
ATATCGGACCACGATCACAACGATTACGACTACGATCCATCTTACGCGTTTGTAACTTTT
CACAATAGGTTTTTGACAGACTGGGAGAAAGGTATTTCATTCATAACACGTCTTGAAGAG
ATGTTGGGCTTAGAAAAAAATACGTTTACAAATCCCCGAGTCGATCCCAGCGAAGTCACT
TTTAAAGTAGAAAAAAATAGCAAAGGCTACGATGCAGCAGATGTTGCTAAGCAAATTGAC
GTTATCAAGGAAAAAGTACGTAAGGACACTGGAGCACAAATACAATCGGCTGGAGTTGGA
GATAGGAGCAAATATCCAATGATTCGTAACTCCGAGTCCAAGGAGAATCAACTATTTGGT
TTGGATTATCCAGTACTACTAGCACTTGTGGGTAGTTTGTCAGTTCTTATCGTGGGAGCA
GTGGTGTTTGCTGTTTTGTTGAAGAGGGATATGAGTGCTAGGCGGAAGATGCAGGGCTTG
GCTTCAGCAGCTGAGATCGACGCTGAGGCTACAAGGGATTATCAGGAACTTTGTCGTGCT
CGCATGTCCGGTAAATGGACGGGCACGCAGACCGCAGTCGCTCCTCCAACTGAACCTCCG
CAAAGGATTACGTCGCTATCACGTGACCCAGACGGGAATTCACCCTCTACTAGATCAAGC
ACTTCATCTTGGAGTGAGGAACCGGCTTTGACTAATATGGACATTTCCACTGGACATATG
GTTTTGGCTTACATGGAAGACCATCTCCGAAACAAGGATCGCCTGGAACAAGAATGGCAA
GCGCTTTGCGCTTATGAAGCTGAACCATGTGCTACCGCAGCGGCCCTGAAACCTGAGAAT
AACGGCAAGAACCGTTGCGCCGATGTCTTGCCTTACGACCATTCTAGAGTCATACTCAAC
ACTCTCTCCAATCACCTTGGATCTGATTATATCAACGCATCTACGATAACTGACCACGAC
CCACGTAACCCGGCCTACATAGCGGCAGCTGGTCCATTGGTGCAAACAGCTCCGGATTTC
TGGCAAATGGTATGGGAACAAGGCAGTGTAGTCATGGTGATGTTAACCCGCCTCACTGAA
AACGGACAACAGCTCTGTCATCGATATTGGCCTGAAGAGGGTTCAGAACTGTACCACATT
TATGAGGTCCATCTCGTGAGCGAGCACATTTGGTGTGACGACTATTTGGTCCGAAGCTTC
TATCTGAAGAACCAACGTACTGGCGAAACTCGTACTGTCACACAGTTCCACTTCCTCTCG
TGGCCCGAGAATGGAGTACCAGCTTCTACCAAGGCATTGCTTGAGTTCAGAAGGAAGGTT
AATAAGTCTTACCGCGGAAGATCTTGTCCGATTGTTGTCCATTGCAGTAATGGAGCCGGT
CGAACCGGTACATACTGTTTGATCGACATGGTTCTCAACCGCATGGCTAAAGGTGCAAAG
GAAATTGACATCGCCGCTACTTTGGAGCACATCCGCGACCAACGCACACGCACTGTCGCT
ACCAAACAGCAGTTTGAATTCGTACTGATGGCTGTTGCAGAAGAGGTACACGCTATACTA
AAAGCCTTACCAGCCCATCTACAACAGCTGCAGGAGAAGAAGGACAAAGAGAAGGAGAAA
GAAAAAGGATCAGAGAAAGAAGGCACTGATAAAGATAAACCAAACTAA

Protein sequence:

MSRTCYRQALWALAVICTLAPSNADGNIGCLFSSSLCIDGAEWCYDDFAFGKCIPIYDND
PEEGSLYQYDMSSTQLQWFERELQQLAAQGYRWEHAFTQCMLQSMLYALRHHLDPNQVNS
KLCEHFADPKLSAGVTNVGDETLDANSDETAYIRFVPNTKLIDSDYANEVYNPPLLDDEE
PKGDDSSMKIEDLEPVEDTNDKIRNIMLMSGVESPVIVPFEGFRERLQAEEEARHHVPKD
SIINYDKEKKSLNANNNQEKPVNELPDEERLLAHFRKYKIKPPPFTAEYLTANRFSPLDE
EIRSNALEKYKQSFLEKNFPFEYENPEDLSEARSYVETPPSELNGDEGTTNEKENNSKEI
NPKNMEYLMNYWREIVGAKLKPQENLYAEGGPLKTDELQGENSKFYLSQDLQDLVNREWG
FKRRERDDVKKPGPRVDAKALKILYSNKSVTAQSSNQNQIISDHDHNDYDYDPSYAFVTF
HNRFLTDWEKGISFITRLEEMLGLEKNTFTNPRVDPSEVTFKVEKNSKGYDAADVAKQID
VIKEKVRKDTGAQIQSAGVGDRSKYPMIRNSESKENQLFGLDYPVLLALVGSLSVLIVGA
VVFAVLLKRDMSARRKMQGLASAAEIDAEATRDYQELCRARMSGKWTGTQTAVAPPTEPP
QRITSLSRDPDGNSPSTRSSTSSWSEEPALTNMDISTGHMVLAYMEDHLRNKDRLEQEWQ
ALCAYEAEPCATAAALKPENNGKNRCADVLPYDHSRVILNTLSNHLGSDYINASTITDHD
PRNPAYIAAAGPLVQTAPDFWQMVWEQGSVVMVMLTRLTENGQQLCHRYWPEEGSELYHI
YEVHLVSEHIWCDDYLVRSFYLKNQRTGETRTVTQFHFLSWPENGVPASTKALLEFRRKV
NKSYRGRSCPIVVHCSNGAGRTGTYCLIDMVLNRMAKGAKEIDIAATLEHIRDQRTRTVA
TKQQFEFVLMAVAEEVHAILKALPAHLQQLQEKKDKEKEKEKGSEKEGTDKDKPN