DPGLEAN16172 in OGS1.0

New model in OGS2.0DPOGS210324 
Genomic Positionscaffold1126:- 7859-27015
See gene structure
CDS Length5256
Paired RNAseq reads  101
Single RNAseq reads  288
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011962 (0.0)
Best Drosophila hit  CG31665, isoform B (4e-08)
Best Human hitprotocadherin Fat 1 precursor (2e-12)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC003136 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC003136 [Tribolium castaneum] (0.0)
GeneOntology terms










  
GO:0005509 calcium ion binding
GO:0005515 protein binding
GO:0005886 plasma membrane
GO:0007015 actin filament organization
GO:0007155 cell adhesion
GO:0007156 homophilic cell adhesion
GO:0007163 establishment or maintenance of cell polarity
GO:0016021 integral to membrane
GO:0016337 cell-cell adhesion
GO:0030027 lamellipodium
GO:0030054 cell junction
GO:0030175 filopodium
InterPro families







  
IPR016187 C-type lectin fold
IPR000859 CUB
IPR000742 Epidermal growth factor-like, type 3
IPR001304 C-type lectin
IPR006209 EGF
IPR006210 Epidermal growth factor-like
IPR001881 EGF-like calcium-binding
IPR016186 C-type lectin-like
IPR013032 EGF-like region, conserved site
Orthology groupMCL19626

Nucleotide sequence:

ATGCCAAGCAAACGTCGCAGACTGGAGCTAGTCTTAAATGAAGATGAATGTTACATAACA
GCACTCCAGCCAGTCATTACGTCATTAGAAAGACTTCAACAGAATTCAAAAACGTTGTCG
ACCGCACTTTTCATAATTACTAACTCTATACAACGGCGGGCTTTATCACAGTCGTGTATG
GACGTGAAACAGGTACTGTTCAACGGTTACTGCTACCTTTTCTCCGGCTACCCACAGGCT
TCTTGGTCAACCGCGAAACAGGTTTGTGAAGGTCTCAACATGCATCTGTCCTCGATACAT
ACAGCGGAAGAGGAGCGCTTTATAGTGACCGGTATCAGACAATCCAGCGATTACAGCGCT
GGATCCGTATACTGGCTCGGTTCCCGTCTAGACGATAATGCAATAAGTTGGATCGATGGA
AGCACTTTAGATTACCAAGCCTGGCCTCCTTATAATGACACCGAGGAAGTCGAAGACAGC
TGTTTAGGTGTTCAGTGGAAAACCTCTCCAGTACCCTCACAGCCATCGGGACTGTACTGG
ACTCCATACAAGTGCTCGGCCACTGGTGGCTACGTGTGCAGGAGACGCCTCACATCAGAG
CACGTCCTCAGAAATACTACCGTCGAGGGCACTTCGGGTACATTAAGGAGTCCTAACTAT
CCCGGTCTGTATGACAACGATCTGGATTATTGGGTTCATGTCAGAAGTGCCCCGGACACA
CGCCTAGTATTCGTTTTCACATCCATCAACTTGGAATATCAAAACGACTGCCTTTATGAC
TTTATAGAGGTGCGTATTTTGATAGACTCAGAACCTTATTTCAAGTGCGACGATGCTCTG
AAACGCGTAAAAAGTTACGAGATCGTAAAGTCAGTTCGAAGTCATCGAGATACTGCGGTT
CGGTCGGAGAGACGAGATGGGTCGCCGCCACCAACGAAGCGATACTTCACTTTCATTCTG
ACTATAAAACCTCTCTTCAACAGTACCCAGGGTGCAGGTTTCTCAGTGAACTGGTGGGCG
GTCGAGCTGGCGGGATGCCCATCTCAGACGTTCACGTCCAAGGAAGGAATCATTCACAGT
CCTAATTACCCCCACTTTTTACTACCCGATATGGATTGTACAATCGACATATTTGCTCCA
GCGGGAAAAAGGGTGTACTTAAATATAAGTTTTTTTGATTTCGGTTATGGACAATTCGAG
AACGGAATTCCGAACAACGTGTCGGATGTCATATCCGAAGACAATTATTTAGAAATTCAA
GTCGACTCTCAAAGTCGACCTATAAGACCATTCCAAAATTCAAAAATTTTAACAAACGGA
CTGTTCGTGTCGCAATCTGAAATCATGAGAATACGACTGAAAACCGGAGAGAACGTTACC
GGGATCGGATTTCTTGCTCATTTCAAGACTGTGTGGCATTTAAATGCTTCCATCACGATC
AGTTTATCGAACGCCGGTCGTCTGGCTTCTATAAACTATCCAGAGATGGGTCCGTCCCGG
AGTACGCTCCGCATGCGACTGGTGGCTCCCCACGGGCACACGCTCGCAGTAGCGTTCAGT
TCCACTGCCCTTGTACCAGCTGGTGAATACCCATGTGGAAGGGAGGCAGGCTGGATCGAG
GTGGTTGATAGTTACACGGACAATAATGGCACACAATGGACTCTTTGCGAGGCGAACCTT
CGAAAAAGAGCCGTCGAGAGTGCTCCACTTGTCATCACCTCATATTTGCACTCATTGATA
GTGACGCATCACTCTGGTGAAGAACCTATGGGACTAGACGTAGCTGTAGTAGTAAATATT
GATACAGAATATCACAATAAGGTTCTCCTACTACCGGATGAAACTAATTTAGAGTCCTGT
TACCCGAACCCGTGTCTGCACGGTGGACAATGCGCTTTTGAAGATTCCAGAAACATGTGT
CAATGCTCAGGATATTATACAGGTGTTTTCTGTCTCCTGACTGCATGCGAGCGATCCCCC
TGTGTGAACGGTAACTGTTCGCTGGCGGCGGACGGTGCGCTGTGTGCCTGTGCTCGTGGC
TGGAGAGGCCGGCGCTGTGCCGAGCGAGTGAGACCTTGCGCAGCGCGGCCCTGCAACCAC
CGGGGGGCTTGCGTTGAAAGGGACGCCGGCTTCTTGTGTCAGTGTAACCCTTCATGGAAA
GGAAAAAGATGTGAAATACCTAATCCTACGCCAAATATCGTAGGTTTGGGTACAAGAATG
ATGCAAGAACCATTTTGGCTCGGTTTATTCGCTGTTTTCGTCGTCCTCGGTTTTATTGGC
CTAATTTGGTGTGGAAAAAGACATTTCCCTGAAAAGATAGAAAAACTCCTAGCTGAAGAA
GCCGACAGATGCGCGAGACGGGTGGCGGGTCCAGGGCCGGGGCCGGAGGCGCGCACACTG
CTCACGCGGCTGGGATCTCCTTCGCCTCGTAAAAAACGCAACAATTCAACACCAACAAAG
AAAAACGCAGCGGAGAAAAAACAGATATTACAGCAACTCATAAGCCCGGCGCCAGCTAAT
GATCACTCGAAGAAAATAAGTATGGGGGAATTAATACAGATGTCAGAGAAGAGAACACTG
AGTACAGTAGAGAGTGCCCCTGCGTTTGTAGAATATGGATTAGACATTAAAGACACGTCA
TTCGCTAGTGAAGCCTCGTCACCCTCCACGTCACCGTCAATGCGTCAAGTATCTGACCCT
AAACTGGAAAAGAAAGTGACGTTTGCAAGATTATTGAACAAAGTTTCAGCCGAAATGAGC
TCGAGTTCAGATGTCGATATGGTTAATACTATTGCAATTCCGATGGCCGTCATAGCTGAT
AGAACTATAAAGACAAAAGCCTCGAGCACTCCACCCTCACCTGGTGTAGAAGTAAGGTCC
CCCCATAGTACGTCAAGTAATCAAGGAAGCGACTCCCTGTCCAGTCTTGATTTGACCTTG
GCTAACGGTGCTATAAAGAAATTTTCTAGAGCACCAAAAATATCAAGCGCTGATTCCATA
CTCGCAATGTTCCGAAACTTTTCTTCGTCAGCTGCTATAGTTTCTCGCGCTTCCTCTGGA
GCTATATCCGCATCCAGCACACCCACAGCATCATCACCTCAAGACGACACCGTCGATGGC
GATGATCTGTCAATAGCCTCTTCACATATACCTTCTTTAGCACCCGATTCTCCCATTTCA
AGACCACACACAACTATCGAAATACAGGTTGTTGACCCACTAAGCTCTCATAAATCTTCT
ACATCTGGTAATCTCCTACACCCTCCATCCATCCTACTTGAAGTACCCAGTAGCATCAAT
AAGTGTCTGTCTCCTATTCGAGAACTCCCAACACCACTGCCAACTCCGCTCCCTACACCA
CTGCCAACCCCACTACCATCACCACGAATGCCACGGGTCAAACTGGAACAGGATATCAAA
AAAGATTCGGTAATTTCTTGCATATCACCAAGCGAAGACGAATTAGAGGAGATAAGATAT
GAGAAAGCTGAAAGACCAACAATGGGACTGAGATTAAAGCTTCGACAGCCGGCAGTGTGT
TCTGAAACCCCAACGCCGTCCACTCACGTCACGGACTCACCTAGCCCTAGTTTTACACAC
AGTCAGGACTCTGAGAGAGAAATGTCACCTCTCTCCCCCGCCCCGCCCTCGCTGAGAGTT
CCCGTCCTCACTATTGAACGACCCTCACCTGGATCCCCTCCACCGAGAAGAACCCCACCA
CATTTAGACTATCAACCACCGCCATTAATAACAGTCACATATAACCCCAGTGAAGAATCA
GATGAACCTATGTCACCAAGACCACCTCCGCCAACGGCAAATATGTGCTATCTGAGCCCC
TTTTCGATGTCGGCAAGAGGAGAGAGAGCACCATCAGAATCTAATTTGTCATCATCGGGA
TACAGTTCAATGGCAAGTCCTGGCCCGTCGAGATGTGGGTCTAGTAATCCCTTGTGTCCA
TCTGAAATGGAAGATCCTGGATCAGGAGGAGGTCCATCTTGTTTTCAATCTAGGCGGAGA
CCACTCATGAAGACAAATTCAAGTCCAGCGGGCTCTAATGATGGAGGCAATGAGAGAAGA
AGAGGTAGGTCTGATTCTGAAACACTTTCAGACGATCCATTGTTAGAATCCAACGACGAA
GGAATCGGTACGGATGAAAGAGTAGACGATGTTCCTTCAAGTGCAAAAGAAATGGAAACT
TTGACAGTGTTAAAAGAATGCTTAGATATACCACAAACAACTTTATGCTCCCCGAGCGGT
GTCACTAAATGTACCATCGTAAAGTGCATAAGTGTTGAACGAGGTTTAGATGAAAAGGCG
AGTCTTAAACCGCCAATTTTGTTCTCAGATTGTAGTAGACCATTGAGCCCTGTCAGTTCA
AGAAGTGAGAGTCCTTTAAGTGACAAAACTGGTTTAGGTAGATTCTCTCCACAATTTTAC
GGTAGACAACTACCTTTTACTGATTCCGATGGACTTTATGATTTTCCAAGTTCCGAATGC
GTTAAAGGCGGTAGTTGTAAGAGTGGGAGCGCATCTCACAGAAAAGCCGGTAGAAGAAGA
GACAGAAAAACGACCAGGACAACGTCACATGAGCCCACAGGAACTACTAAATCTACTCTA
CCACACATGCCGCACTCCATGCATAATTTATTAGAGGTGCCGTATGGCAATCGTGGTCGC
AAAGGAGGAAGAAGGAGATCGAGGTCTCAGGCGCCAGCTCTAGCTACCTCATCATCATCT
GAAGAGTCTGTGTCCAACGCATCCGTGGCCTCTGTGGCATCAGCTCGAGAACTCAGACTA
CCCGACTTGGAAATGCAGTATGTCTGTTCACAGCCTGAGCCTGTCAGAACTAAGAAGCCG
TTGAAGCGTCAAAAATGCCGAAGCTCTGAGGACACGTCGTCTAAAATATCATCAAGCTTG
GACCTAACCGAGGACTCGAAGAAACCGAATAAAATTAGCAAACTCCGGTCCATCGGAAAT
CAAATAAGATTTCTCCGTCGGTTAGAAAAAAGTTTAAAAATGAAAGAAAGCTACCCGGCG
ATCTCAGACGACGAAGGAGACGAGTCGTCCAGCGTGACGTCACCCTTGTTACAAGGTAGG
AAGGATCTGAACCGTATGACGGGCCACGCGATCTCCGCTCCTCTGCTGGGGGCTGCAAGA
CCAAAGATCTCGCGACAGAGACGGTACGAGAGATCCCTCCTTGGCGAAGACACGAGGACT
TTGAGTACTGCGCCTGGCTACGATAACTCCGATTGA

Protein sequence:

MPSKRRRLELVLNEDECYITALQPVITSLERLQQNSKTLSTALFIITNSIQRRALSQSCM
DVKQVLFNGYCYLFSGYPQASWSTAKQVCEGLNMHLSSIHTAEEERFIVTGIRQSSDYSA
GSVYWLGSRLDDNAISWIDGSTLDYQAWPPYNDTEEVEDSCLGVQWKTSPVPSQPSGLYW
TPYKCSATGGYVCRRRLTSEHVLRNTTVEGTSGTLRSPNYPGLYDNDLDYWVHVRSAPDT
RLVFVFTSINLEYQNDCLYDFIEVRILIDSEPYFKCDDALKRVKSYEIVKSVRSHRDTAV
RSERRDGSPPPTKRYFTFILTIKPLFNSTQGAGFSVNWWAVELAGCPSQTFTSKEGIIHS
PNYPHFLLPDMDCTIDIFAPAGKRVYLNISFFDFGYGQFENGIPNNVSDVISEDNYLEIQ
VDSQSRPIRPFQNSKILTNGLFVSQSEIMRIRLKTGENVTGIGFLAHFKTVWHLNASITI
SLSNAGRLASINYPEMGPSRSTLRMRLVAPHGHTLAVAFSSTALVPAGEYPCGREAGWIE
VVDSYTDNNGTQWTLCEANLRKRAVESAPLVITSYLHSLIVTHHSGEEPMGLDVAVVVNI
DTEYHNKVLLLPDETNLESCYPNPCLHGGQCAFEDSRNMCQCSGYYTGVFCLLTACERSP
CVNGNCSLAADGALCACARGWRGRRCAERVRPCAARPCNHRGACVERDAGFLCQCNPSWK
GKRCEIPNPTPNIVGLGTRMMQEPFWLGLFAVFVVLGFIGLIWCGKRHFPEKIEKLLAEE
ADRCARRVAGPGPGPEARTLLTRLGSPSPRKKRNNSTPTKKNAAEKKQILQQLISPAPAN
DHSKKISMGELIQMSEKRTLSTVESAPAFVEYGLDIKDTSFASEASSPSTSPSMRQVSDP
KLEKKVTFARLLNKVSAEMSSSSDVDMVNTIAIPMAVIADRTIKTKASSTPPSPGVEVRS
PHSTSSNQGSDSLSSLDLTLANGAIKKFSRAPKISSADSILAMFRNFSSSAAIVSRASSG
AISASSTPTASSPQDDTVDGDDLSIASSHIPSLAPDSPISRPHTTIEIQVVDPLSSHKSS
TSGNLLHPPSILLEVPSSINKCLSPIRELPTPLPTPLPTPLPTPLPSPRMPRVKLEQDIK
KDSVISCISPSEDELEEIRYEKAERPTMGLRLKLRQPAVCSETPTPSTHVTDSPSPSFTH
SQDSEREMSPLSPAPPSLRVPVLTIERPSPGSPPPRRTPPHLDYQPPPLITVTYNPSEES
DEPMSPRPPPPTANMCYLSPFSMSARGERAPSESNLSSSGYSSMASPGPSRCGSSNPLCP
SEMEDPGSGGGPSCFQSRRRPLMKTNSSPAGSNDGGNERRRGRSDSETLSDDPLLESNDE
GIGTDERVDDVPSSAKEMETLTVLKECLDIPQTTLCSPSGVTKCTIVKCISVERGLDEKA
SLKPPILFSDCSRPLSPVSSRSESPLSDKTGLGRFSPQFYGRQLPFTDSDGLYDFPSSEC
VKGGSCKSGSASHRKAGRRRDRKTTRTTSHEPTGTTKSTLPHMPHSMHNLLEVPYGNRGR
KGGRRRSRSQAPALATSSSSEESVSNASVASVASARELRLPDLEMQYVCSQPEPVRTKKP
LKRQKCRSSEDTSSKISSSLDLTEDSKKPNKISKLRSIGNQIRFLRRLEKSLKMKESYPA
ISDDEGDESSSVTSPLLQGRKDLNRMTGHAISAPLLGAARPKISRQRRYERSLLGEDTRT
LSTAPGYDNSD