DPGLEAN01134 in OGS1.0

New model in OGS2.0DPOGS201950 
Genomic Positionscaffold1376:- 6444-45149
See gene structure
CDS Length3378
Paired RNAseq reads  1979
Single RNAseq reads  4490
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000164 (4e-17)
Best Drosophila hit  sticks and stones, isoform A (0.0)
Best Human hitnephrin precursor (2e-90)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC003361 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC003361 [Tribolium castaneum] (0.0)
GeneOntology terms




  
GO:0007155 cell adhesion
GO:0005886 plasma membrane
GO:0007520 myoblast fusion
GO:0007523 larval visceral muscle development
GO:0007157 heterophilic cell-cell adhesion
GO:0007514 garland cell differentiation
InterPro families






  
IPR013783 Immunoglobulin-like fold
IPR007110 Immunoglobulin-like
IPR003961 Fibronectin, type III
IPR008957 Fibronectin type III domain
IPR003599 Immunoglobulin subtype
IPR003598 Immunoglobulin subtype 2
IPR013098 Immunoglobulin I-set
IPR013162 CD80-like, immunoglobulin C2-set
Orthology groupMCL11063

Nucleotide sequence:

ATGCGCAGCACGGTACAGCTCAGTGTACTATATCCACCTGGAGCACCATATATTGAAGGT
TATGCAGAGGGCGAAACTGTTAGAAGAGGACAAAGTCTCGAGCTGGTTTGCCGGAGTAGG
GGAGGGAATCCGCCAGCACAGTTGATATGGTACAAGAACGGCGAGCAAATACGAATGGCA
TACAGGACAAGCGGCAGGATGTCCGAAAACGTGTTGTCTTTCAAGGCGGATGCGTCTGAT
AACAAAGCGAGATACACTTGTGAGGCTAAAAACATCATGATAAGCAACACGCTGAAGGCC
GAAATAGATCTCACTGTGCTATTTGCACCATCCCACGTGACGATATCTGGCCCTTCAGAG
GCGAGGGTCGGTGATCCAGTACCCTTAAGTTGCAGCACGGCCCCTTCAAACCCAGCTGCT
GATATCAAATGGTTGGTCTTAGGTAAACATCACAGGGAAGCAAGCAACAGAACCGTCATA
TCTCCCGATGGTGGTTGGATCACCACATCTAATATCACAGTGGTGGTGGAGCCACATCGG
CGGTCGATCGTCGTGGTATGCCACGGCATTAACGGACAACTGACTGAGAACGTGGTCGCC
ACACACACCATCAATGTACTATATCCACCTTCAGCTCCAATGATAACTGGTTACATTCCC
GGGACGACTCTCTCAGCTGGGACGGTTCAAAAGCTGTCCTGTATATCCACTGGTGGAAAT
CCGTTGGCTACCTTGACCTGGTTTAAGAATGACAAAAAGATACATTCAATAACTAAAACC
ACGGACAAGTCGGTGTCGGCTGAGATATCAATACTGACAAACGTGACTGACAACCAAGCG
CAGTATAGATGCGAGGCGACAAACAGCGCCACAGAGATACCGCTCTTTGAAACTGTCACT
CTGAATGTACATTTCGCACCCGAGACTGTAAAAGTTAGAGCATCACCCGCCGAGCTAACT
CCTGGTATAGAAGGCACACTGTACTGCGACGCCGCTTCTAGCAATCCACCCGCGACGCTA
TCCTGGTGGAGGGATGGGATACCAGTTCAAGGCCTGCCGATGCAGTTAAAGAAGGGTCTC
CACGGTGGTACCGTCTCTACCGTAGAGTTAAAGCTGAATATCACCAAGGAGCTAAATGGC
GCCGTTTATACCTGCCAGGCTTCAAACGACGCTCTACAAAGGAGCGTCCACGACGCTTTG
ACTCTTAAAGTATTCTATCCGCCGATATTCGACGACACGCCCCTCTCGATTGTGGGGGTT
GAAAACGACCCGTTGGTTGTGATGCTGCGAGCTGACGGGAACCCCTCCAGCATCACGTAC
ACCTGGACCAAGGACGGTCTCCCGGTCACACAAGCTTCATACAGCAGTGCCAACGATCGT
ATCGTCTCGTCGGGCGGGACTTTGAATATGACTCGTGTGTCACGACACGATGCTGGGACT
TACTCCTGTGAAGCTCTGAACGCTTATGGCAGCGCTCGGATTAACATAACAGTTAATGTG
CACTATCCAGCCGATATTAAATCTGTCTGGCAGACAGGTATTGTGGATCCTAATGACAAC
GCTGTACTGGCGTGTACGGCCAGCGGGAACCCTTTAACATCGGATCACATAAAATGGGAG
CGCAAAAACTATGACATGTCGACGAAATTAGTGACATTTGAATCTAAGAACCAAACAAGC
TATCTAACGATAGAGAGAGCGGCGAGAGAGGACGTCGGCTCGTTTGAATGCGTGGTGAAC
AACGGCATCGGCGGCGAGACTCGCCACGAAGTGATGTTGGTTGTCAAATTCAAACCTGAA
ATGAACACCTCGCCGACACTCGCCAAATCCGCGTCCAATGTCGGCCAAGTCGGGCGGCTA
ACTTGCAAATGCAAGTCCGCCCCGGCACCTAACTTCACGTGGTCGAAGGGCGGCGTTAAG
CTCCCCGTGAACACGTCTACGAAGTACTTCGCTGAGTATCACAGGAACGACCAGATCACA
TACACCTCTGTGTTATTAATAAACGACATAAGCACGTCGGATTACGGCGCATACGAGTGC
GGGGCGAGGAACGACCTCGGCTTCGGGTCAGTCTCCGTAAAATTGGATGTCACCGGTCCG
CCAGACCCGGTGTCGTCTATAGTTGTAACCAACGTCACCCACGACACCATAACCCTAGAG
TGGGTGCCAGGCTTCGACGGCGGACTGACCTCCTGGTTCAGAGTTCGTTATCGCAAACCC
CACGACTCTACATACACGTATCACGACGTAACCCCCAACACGACCCACTATACTGTGTCG
GGGTTGGAGCGACACACGGACTACGTACTGTCCGTCATGGCGGTCAACGGAATGGGGGAG
AGTCGCTACAGACCTGATGATACGAAGGCTACGACCCTCACTTCATCAGAAGTTGGTGAA
CTGAACGTAGTTTCTACGGAGCATGTAGAAACGGCCGACGTGTCTAAATCAGTAGTTTTA
TACGTGTGCGTGACAGTCGCTGTGTTAGTTATTATAAACGCTGTTTTAGTTGCTTGTTTT
GTATTGAAACGACGCTCCAAGCGCTCCAAAGAGCAAGCTGGGCAGTCGTCAAAATCAACG
CCCATAGAAATGTACGCTCCGTCATCTTACAACGACACTATGGGCGAAACACTAAGCTCA
GTGTCAGAGAAATCTGAAACGTATTCACAAGATGAAGCGCCCCCCGTCCCTGACGTACCC
AGCATGCCGAGACACATGATGAACCAGTCGGATTCTTATCTCCTGGATGAGAACCTGGTC
CCTCCCCCCTTAGACTACCCTCCGCCGAACTACGTGTATGACGAACACGCGAGGACTCTA
CCACATCCACACAGACTACGAGAGGTCCGGGGACACAGCACCCTCGGGCGGACGGCCGGT
AAACAAGCGTACGTACCGACGCCGAGTCCGATGCCACCATTAGACGGCTCCTACTACAAT
ATGGCGTCCGATAGATACCTGTCTTACCCACCACTCATTGGAGAATATTTACAACAGCAA
GCCGGTAGAACTCCGACTCCGCCACAGCAATACTCTAGAGATAATCATCTGAGTCCACCG
AACTGTGGAATCGATGGTGAACGAGCTGTCCCCCCTGATGTGACTGTTCTTCACCCCCCA
GTATGTACACAGCAATTTCCGTTAAACCCATCTCTATCTGTGAAGCAACCGCAGTCCATA
CTAAAAGATCCGTCGAGGCATAAATATAGTAACCAATACGGCAGTCCCATATCCTCTAGT
TCGCCTCAGAACCAAAGTCAAATATTGACAGTTCAGAATTTAACGGATGTACCACAGTAC
GGTACCATAAAGAAAGACAAGAAACAAAACGTCACTATAGACGAATCATTCAACAAACAA
CAAACGCACGTAGTTTAA

Protein sequence:

MRSTVQLSVLYPPGAPYIEGYAEGETVRRGQSLELVCRSRGGNPPAQLIWYKNGEQIRMA
YRTSGRMSENVLSFKADASDNKARYTCEAKNIMISNTLKAEIDLTVLFAPSHVTISGPSE
ARVGDPVPLSCSTAPSNPAADIKWLVLGKHHREASNRTVISPDGGWITTSNITVVVEPHR
RSIVVVCHGINGQLTENVVATHTINVLYPPSAPMITGYIPGTTLSAGTVQKLSCISTGGN
PLATLTWFKNDKKIHSITKTTDKSVSAEISILTNVTDNQAQYRCEATNSATEIPLFETVT
LNVHFAPETVKVRASPAELTPGIEGTLYCDAASSNPPATLSWWRDGIPVQGLPMQLKKGL
HGGTVSTVELKLNITKELNGAVYTCQASNDALQRSVHDALTLKVFYPPIFDDTPLSIVGV
ENDPLVVMLRADGNPSSITYTWTKDGLPVTQASYSSANDRIVSSGGTLNMTRVSRHDAGT
YSCEALNAYGSARINITVNVHYPADIKSVWQTGIVDPNDNAVLACTASGNPLTSDHIKWE
RKNYDMSTKLVTFESKNQTSYLTIERAAREDVGSFECVVNNGIGGETRHEVMLVVKFKPE
MNTSPTLAKSASNVGQVGRLTCKCKSAPAPNFTWSKGGVKLPVNTSTKYFAEYHRNDQIT
YTSVLLINDISTSDYGAYECGARNDLGFGSVSVKLDVTGPPDPVSSIVVTNVTHDTITLE
WVPGFDGGLTSWFRVRYRKPHDSTYTYHDVTPNTTHYTVSGLERHTDYVLSVMAVNGMGE
SRYRPDDTKATTLTSSEVGELNVVSTEHVETADVSKSVVLYVCVTVAVLVIINAVLVACF
VLKRRSKRSKEQAGQSSKSTPIEMYAPSSYNDTMGETLSSVSEKSETYSQDEAPPVPDVP
SMPRHMMNQSDSYLLDENLVPPPLDYPPPNYVYDEHARTLPHPHRLREVRGHSTLGRTAG
KQAYVPTPSPMPPLDGSYYNMASDRYLSYPPLIGEYLQQQAGRTPTPPQQYSRDNHLSPP
NCGIDGERAVPPDVTVLHPPVCTQQFPLNPSLSVKQPQSILKDPSRHKYSNQYGSPISSS
SPQNQSQILTVQNLTDVPQYGTIKKDKKQNVTIDESFNKQQTHVV