DPGLEAN21089 in OGS1.0

New model in OGS2.0DPOGS210220 
Genomic Positionscaffold1200:+ 407-12731
See gene structure
CDS Length4716
Paired RNAseq reads  4954
Single RNAseq reads  12422
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002547 (0.0)
Best Drosophila hit  neurexin IV, isoform A (0.0)
Best Human hitcontactin-associated protein-like 2 precursor (3e-174)
Best NR hit (blastp)  GK25462 [Drosophila willistoni] (0.0)
Best NR hit (blastx)  GK25462 [Drosophila willistoni] (0.0)
GeneOntology terms

















  
GO:0005886 plasma membrane
GO:0005918 septate junction
GO:0008104 protein localization
GO:0007391 dorsal closure
GO:0016080 synaptic vesicle targeting
GO:0016081 synaptic vesicle docking involved in exocytosis
GO:0004888 transmembrane receptor activity
GO:0008065 establishment of blood-nerve barrier
GO:0016021 integral to membrane
GO:0019991 septate junction assembly
GO:0005887 integral to plasma membrane
GO:0045216 cell-cell junction organization
GO:0007163 establishment or maintenance of cell polarity
GO:0035151 regulation of tube size, open tracheal system
GO:0007155 cell adhesion
GO:0005919 pleated septate junction
GO:0060857 establishment of glial blood-brain barrier
GO:0021682 nerve maturation
GO:0008366 axon ensheathment
InterPro families








  
IPR000421 Coagulation factor 5/8 type, C-terminal
IPR012680 Laminin G, subdomain 2
IPR008278 4'-phosphopantetheinyl transferase
IPR006209 EGF
IPR008979 Galactose-binding domain-like
IPR008985 Concanavalin A-like lectin/glucanase
IPR001791 Laminin G domain
IPR000742 Epidermal growth factor-like, type 3
IPR006210 Epidermal growth factor-like
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
Orthology groupMCL10312

Nucleotide sequence:

ATGACCTACTCCGTGTTTTATAATATTTTGCTATTTTGCTACACTCTTTCTTGTACTAAA
GCAGAATTAGCAAGATATTATTACGACTACGAATGTAATGAGCCTTTAATCGAGGGAGCT
AAACTCACCGCTACTTCCAGTCTGAGAGACAGAGGACCTGAGAATGCAAAGTTGCTTGAT
GATGCAAAATCATCACTTGTAGGACGTCTCATGTTACGGAGATGTGTTCATCTGATGCTG
GACATTCCTTATGAAAATATTAGATTTGGAAGGGACGAACACAGGAAGCCATATCTCATA
GGTGCTGGCGACATTCCAGCTTACTTCAATGTATCACATCAGGGAGATTATGTTGTTCTA
GCTGGTAGCACTAAACAGAATATTGGTGTTGATACCATGAAAATAGAACCACCAGCAAAT
AAAAACTTGTCAGAGTTCTTTCGACTCATGAAGAGACAAATGTCTGATCATGAATGGTCG
ACTATATACAGTTATCCCACGGAAGCTCAGCAGATCGCATGTTTTTACAGATTGTGGTGC
TTGAAAGAGAGTTACGTAAAGAATATAGGAGTCGGTATAACTGTAGCGCTCCATAAAATA
AGTTTTGACATTAAATCACCATTGAAAGTAGGACAAATTGTGACGGACACAAACTTGTAT
GTGAATAATGTGCTGAAATCCGATTGGAGGTTCGAAGAGACCTTGCTGGATGAGAAACAT
GCTGTGGCCACTTCACTCCAGGTCGAGGATGAATCTATAGCACCTGCTGTGTACGACTTA
TTATCTTTTGATGAAATTGTCAAGGAAGCTAAACCATTTGTCGAACCAGATGTCAGTTTG
AACGCGTGGACAGCATCAGAGAACGATTTCGATCAGCAGCTGATCATAGACTTGGGGACG
GTGAAGAACATCACCCGGGTGGCCACACAGGGCCGGCAACACTCGCAGGAGTTCGTTCAG
GAGTATCACATTAGCTACGGTACTAATGGACTCGATTATGTCATGTACAAAGCGCCGGGG
GGCGAAGTTAAGAGTAGCCACCATCGACTAACAAGTAGCGCTAACATTGGTGAAGGAAAA
TCAGCATGGACGACGGTCGAGAGCTCTTACTACCAGCACCTGACGATCAATCTACACTCA
CGGAAAGAGTTGCGCGGTGTAGCCACCAGGGGGCGCTTCGCCACCGACGAGTACGTCTCG
GAGTACATGATACAGTACTCCGATGACGGTGAGACCTGGACCGCGGTCGCTGACGCGGAC
GGCTACACGCAGATGTTCGAAGGCAACCATGATGGCAACACTGTAGTCAAGAACGAGTTC
GACGTTCCTATCATAGCTCAGTACATCAGGATCAATCCCATGAGGTGGAGGGACAAGATA
TCCATGAGGGTCGAGCTGTACGGGTGTGATTATGTTGCTGATACGTTGTTCTTCAACGGG
TCGTCCCTGGTTCGTATGGACCTGCTTCGCTCCCCCGTGTCCTCCTCCCGCGAGGCCATC
CGTTTCCGGTTCAAGACCTCTGCCGCTTCCGGCGTGCTGTTATACTCTCGCGGCACCCAG
GGCGACTACCTCGCGCTGCAGCTCCGAGACAACCGCCTCGTCCTCAACATAGACCTCGGG
TCGGGGAAGGCGACGTCGTTGTCGGCGGGCAGCTTGTTGGACGACAACACTTGGCACGAC
GCGCTGGTGTCCCGCGCCCGCCGCGACCTCGTGTTTTCGGTGGACCGCGTGGTCATGAGG
GCGCGGATCAAGGGCGAGTTCTCCCGACTCAACCTCAACAGAGCGATTTATATCGGCGGG
GTGCCAAATTTCCAAGAGGGTCTGGTGGTGACACAGAACTTCACAGGGTGTATAGAGAAC
ATGTATCTGAACGCCACCAACGTCATCCAGGAGCTCAAGATGGGGTACGAGGCCGCGGAG
CCCTTCAAGTATCAGAAAGTCAACACCTTGTATTCCTGCCCTGAGCCGCCGGTCGTGCCG
ATCACTTTCCTGAAAGAGGGCTCGTACGCCAAGCTGCGCGGCTACGGCGGGGGCGGCGTG
CTCAACGTGTCGCTGGAGTTCCGCACGTACGAGCACCACGGACTGCTCGTATACCATCAG
TTTAAAAGTGAAGGATACGTCAAGGTGTTCCTTGAGGAGGGCAAGGTGAAGGTGGAGTTG
TTCACGGAGGGGTCTCCCAAGGTGAAGCTGGACAACTTCGAGGACACCTTCAACGACGGT
CGCTGGCACGCTCTCATGCTGACCATGGCCCAGGACAGCCTCACCCTGTCTCTCAACTAC
AGGGCCGTCAGAACCAGCAAGAAGATGAAGTTCTTCACCGGGGGCTACTACTACATAGCA
GGTGGCAAGGCGCCGCCCCGTGGGTTCGTGGGCTGTATGCGGAAGCTGGCCGTGGACGGC
AACTACCGCTCGCCCACGGACTGGACGCGCGAGGAGTACTGCTGCCCCGACGAACTCGTG
TTTGACGCCTGCCACATGATAGACAGGTGTAACCCCAACCCGTGCGAGCACGGCGGCGTG
TGTACGCAGAGCGCGGACGAGTTCGCCTGCGACTGCACCGACACCGGCTACGCGGGCGCC
GTCTGCCACACGTCGATCCACCCCGTGTCGTGTGCGGCGTATGCGTGGTCGGGGGCCACG
GGGCGGCGCTCCACCCGGGTGCTGTTGGACGTGGACGGCTCGGGTCCCCTGCCGCCCTTC
CCCGCCACCTGCCACTTCTACGCGGACGGTCGCATCATAACGTCGGTGCAGCACTCGGCG
GTGTCCAGCACGCAGGTGGACGGCTTCCAGGAGGCGGGCAGCTTCAGGCAGGACGTGACG
TACGACGCAACGCGGCCGCAGCTCGAGGCGCTGCTCAACAGGAGCCACTCGTGCAGCCAG
CGATTGGAGTACATGTGCCGACACTCCAGACTGCTCAATTCGCCCAGCGAGGAGGCCACG
TTCCAGCCGTTCGCGTGGTGGGTGTCTCGCAGCGGGCAGCGGATGGACTACTGGGCGGGG
GCGCAGCCAGGATCCCGCATGTGTGAGTGCGGGGTGCTCGGCACGTGTCTCGACCCTACC
AAGTGGTGCAACTGTGACGCCGAGCACTCGCCCATGCCTCACGACGAGTTTCAAACTGAC
GGCGGAGACATCACGGAGAAGGAGTTCCTGCCGGTGAAGCAACTTCGCTTCGGTGACACG
GGCAGCCACCTCGACGAGAAGATAGGGAGGTACTCGCTGGGACCCCTGCTGTGCGAGGGA
GACGACCTGTTCTCTAACGCCGTCACGTTCCGTATCTCGGACGCGGTCATCACCCTGCCG
ACCTTCGACCTGGGTCACAGTGGAGACATCTACTTTGAATTCCGGACCACTAAAGAAAAC
GCTGTCCTGTTACATTCGAAGGGCACTCAAGACTATATAAAGCTGTCAATAATCGGCGGA
GACCAGCTGCAGTTCCAGTTCCAGGTGGGGGACACGCCCCTCGGGGTCTCCGTGGAGACC
AGTAACCGGTTGGCCGACGACCAGTGGCATTCCGTCTCTATCGAGAGGAACAGGAAGGAG
GCCCGCGTAGTTGTGGACGGAGCCTTGAAGAACGAGATACGAACGGCCAAGGAACCGGTC
CGCGCCCTCCAGCTAGCCACCCCGCTGGTGCTGGGGGCCAGCCTCGACAGGAAGGACGGG
TTCGTGGGCTGCATGAGGGCGCTCCTACTGAACGGACGGCCTGTGGACCTGCGGGGACAC
GCCAGGAGAGGTCTGTACGGCGTGTCGGAGGGCTGCGTGGGCAAGTGCTCGTCATCGCCG
TGTCTGAACAACGGCACGTGTCTGGAGCGGTACGACTCGTACTCCTGCGACTGCCGCTGG
ACCGCCTTCAAGGGCCCCATCTGTGCTGACGAGATCGGCGTCAACCTCCGCCCCAACTCC
ATGGTGAAGTACGACTTCCTGGGCTCGTGGCGCTCCACCATCAACGAGAAGATCCGCGTG
GGCTTCACCACCACCAACCCCAAGGGCTTCCTGCTCGGCTTCTACTCCAACATATCCGGG
GAGTACCTCACGCTCATGGTCTCCAACTCAGGTCACCTGCGCGTGGTGTTCGACTTCGGG
TTCGAGAGGCAGGAGATCATCTTCGAGGGGAAGCACTTCGGCCTGGGACAGTACCACGAC
GTGCGCCTCTCCAGGAAGGACAGCGGCGCTACCATGGTGCTGCAGGTGGATAACTATGAG
ACCCAGGAGTACCAGTTCAACATCCGCGAGTCTGCGGACGCGCAGTTCAACAACATCCAG
TACATGTACGTGGGCAGGAACGACTCCATGGCCGAGGGCTTCGTGGGCTGCGTCAGTCGC
GTGGAGTTCGACGACATCTACCCTCTCAAGCTGCTGTTCCAGCAGGACCCGCCGCCCAAC
GTCAGGAGCATCGGCGGTCCCCTGCACGAGGACTTCTGTGGCGTGGAGCCGGTGACGCAC
CCGCCCGTGATCCCGGAGACCCGGCCGCCGCCGCCCGCCGACCTCGCCGCGGACCTCGAC
TTCCACCGGACCGACGAGGCCATACTAGCCACGGTGCTGGCGTTCGTGTTCCTGCTGCTG
ATAGCGGTGGCCGTGGTGCTGGTGAGGGCGCTGTCCCGCCACAAGGGAGAGTACCTCACA
CAGGAGGAGCGCGGGGCGGCGGGAGCGGCGGGGCCTGACGACGCCGCGCTGGCGGCCGCC
ACGGGGGCCCGGGTCACCAAGCGGTTCTTTATATAG

Protein sequence:

MTYSVFYNILLFCYTLSCTKAELARYYYDYECNEPLIEGAKLTATSSLRDRGPENAKLLD
DAKSSLVGRLMLRRCVHLMLDIPYENIRFGRDEHRKPYLIGAGDIPAYFNVSHQGDYVVL
AGSTKQNIGVDTMKIEPPANKNLSEFFRLMKRQMSDHEWSTIYSYPTEAQQIACFYRLWC
LKESYVKNIGVGITVALHKISFDIKSPLKVGQIVTDTNLYVNNVLKSDWRFEETLLDEKH
AVATSLQVEDESIAPAVYDLLSFDEIVKEAKPFVEPDVSLNAWTASENDFDQQLIIDLGT
VKNITRVATQGRQHSQEFVQEYHISYGTNGLDYVMYKAPGGEVKSSHHRLTSSANIGEGK
SAWTTVESSYYQHLTINLHSRKELRGVATRGRFATDEYVSEYMIQYSDDGETWTAVADAD
GYTQMFEGNHDGNTVVKNEFDVPIIAQYIRINPMRWRDKISMRVELYGCDYVADTLFFNG
SSLVRMDLLRSPVSSSREAIRFRFKTSAASGVLLYSRGTQGDYLALQLRDNRLVLNIDLG
SGKATSLSAGSLLDDNTWHDALVSRARRDLVFSVDRVVMRARIKGEFSRLNLNRAIYIGG
VPNFQEGLVVTQNFTGCIENMYLNATNVIQELKMGYEAAEPFKYQKVNTLYSCPEPPVVP
ITFLKEGSYAKLRGYGGGGVLNVSLEFRTYEHHGLLVYHQFKSEGYVKVFLEEGKVKVEL
FTEGSPKVKLDNFEDTFNDGRWHALMLTMAQDSLTLSLNYRAVRTSKKMKFFTGGYYYIA
GGKAPPRGFVGCMRKLAVDGNYRSPTDWTREEYCCPDELVFDACHMIDRCNPNPCEHGGV
CTQSADEFACDCTDTGYAGAVCHTSIHPVSCAAYAWSGATGRRSTRVLLDVDGSGPLPPF
PATCHFYADGRIITSVQHSAVSSTQVDGFQEAGSFRQDVTYDATRPQLEALLNRSHSCSQ
RLEYMCRHSRLLNSPSEEATFQPFAWWVSRSGQRMDYWAGAQPGSRMCECGVLGTCLDPT
KWCNCDAEHSPMPHDEFQTDGGDITEKEFLPVKQLRFGDTGSHLDEKIGRYSLGPLLCEG
DDLFSNAVTFRISDAVITLPTFDLGHSGDIYFEFRTTKENAVLLHSKGTQDYIKLSIIGG
DQLQFQFQVGDTPLGVSVETSNRLADDQWHSVSIERNRKEARVVVDGALKNEIRTAKEPV
RALQLATPLVLGASLDRKDGFVGCMRALLLNGRPVDLRGHARRGLYGVSEGCVGKCSSSP
CLNNGTCLERYDSYSCDCRWTAFKGPICADEIGVNLRPNSMVKYDFLGSWRSTINEKIRV
GFTTTNPKGFLLGFYSNISGEYLTLMVSNSGHLRVVFDFGFERQEIIFEGKHFGLGQYHD
VRLSRKDSGATMVLQVDNYETQEYQFNIRESADAQFNNIQYMYVGRNDSMAEGFVGCVSR
VEFDDIYPLKLLFQQDPPPNVRSIGGPLHEDFCGVEPVTHPPVIPETRPPPPADLAADLD
FHRTDEAILATVLAFVFLLLIAVAVVLVRALSRHKGEYLTQEERGAAGAAGPDDAALAAA
TGARVTKRFFI