DPGLEAN00045 in OGS1.0

New model in OGS2.0DPOGS207490 
Genomic Positionscaffold130:- 119577-140153
See gene structure
CDS Length2127
Paired RNAseq reads  896
Single RNAseq reads  2046
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000963 (3e-09)
Best Drosophila hit  tenascin accessory, isoform E (0.0)
Best Human hitteneurin-3 (2e-109)
Best NR hit (blastp)  type II transmembrane protein [Culex quinquefasciatus] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Tenascin major CG5723-PB isoform 1 [Apis mellifera] (0.0)
GeneOntology terms




  
GO:0005578 proteinaceous extracellular matrix
GO:0007155 cell adhesion
GO:0005576 extracellular region
GO:0007160 cell-matrix adhesion
GO:0005886 plasma membrane
GO:0007366 periodic partitioning by pair rule gene
InterPro families


  
IPR013032 EGF-like region, conserved site
IPR000742 Epidermal growth factor-like, type 3
IPR006210 Epidermal growth factor-like
IPR013111 EGF, extracellular
Orthology groupMCL10165

Nucleotide sequence:

ATGACGATGAAATCGCTGAAGTACAGCGAAAGGGAGGATATGATGGAAGGCTTTAGCCCG
GTCCCTCCGCCGCTGCCGCGTCGTCAGCCCGTCAGGAATATTTACGCAGAGCCCTTCGTT
GATCCTAGAATGGGTGGAACTGGTATGACAAGCGGAATGGGTGGTGGTACTCTGGGCCAC
TGTCAGCTGGCAAACCAGCCCCTGGTGATGCCAGGGTTTCCTTTGCGAGCCAGCCAACCG
AGCCCTGTTCCAATGTACTCGCCTTCAAGATTCCATATAGACAAGCGATGTCAACACCGA
TGCACATGGAAATGTCTAGCCATAGCCATGATTTGCCTGTGTGTCATTCTTGCGGCCATG
CTAGCCTACTTTATAGCATTATCGTCAATTAAAAGTAATATAGACAATTCAAACTGTATT
CTGGTACAAGACGTAAAGGCAGTAACCCATTCTCACGGTGTTAGAGACGCTCTTTCGACA
TCCTCACCATCGGACGAATCTATATCAACATCATCATTGGGAGGCTGGTCAACAACTGCG
GAGGCTGCACTTGAATCTGCAGTAATACCACAGCAAGCGCAACAAGCAGCTCCACCTCCT
TGGACTCCAGCTTTAGAACTTAGAGAATTTGATGAATTACACAGTGTTACAATACCTGCT
TATCAGTTTTGGAGTTCTGAATTTCGTAACAAACAACCGGCGTTCGTCAGTCTTAACTTC
ACCGTTCCTTGGGGAGCTAATTTCGCTGTCTATGGTAGAAGAAACGTTGCTCCTAGTGTA
ACCCAATACGATTTTGTTGAATTTATCCGTGGTGGTAGAGTTGATCACAGACTACGAAAA
AGAAGAAGTCTACACGAATCAGACCCACATGATCATGGTTTTAACGGCTTTGAGTCATTG
TTTAGTACATTTAGCCCATTTGATAGATGGAACCACACGATAATCAAGAGATCTACAGAT
ATGAGGGTTAATGTCAGTATTATTCAATATCTTGATACTGGTAGATGGTTCATATCGATA
TATAATGACGAACTTCAGCCACACCACGTTGAAATGATTGTCTCAGAAGCTGAAGGGGTG
ACAAACTCTTGCCCTGATGACTGTTCTGGTCACGGTTCTTGCTATTTAGGAAAATGTGAA
TGTATGGATGGTTTTGAAGGTCACGATTGTTCAAAAAGTGTATGTCCTGTTCTGTGTTCT
GGACACGGTGCCTACGCTGGCGGGATTTGTCATTGTTCTGAGGGATGGAAAGGAGCTGAA
TGTGATGTACCAGCTCACGATTGTGAACCAGCAGATTGTTCTGGTCGGGGGCAATGCATA
GCAGGCCAATGTCAATGTAAAGCAGGATGGAAAGGAGCAAAATGTGATGAAGAGGATTGC
TTGGATCCAACGTGTGGTGGTCATGGATCTTGTGTTCGCGGTCGATGCGTCTGTCGCGCT
GGTTGGCGTGGAGCAGCTTGTACTGAACGCGATGCTCGTGTACAACGTTGTCTGCCAGCA
TGTTCTCAAAGAGGCGTGTACGACTTAGATGCTGGAAGATGTGTTTGTGATCCCTTGTAT
ACAGGCGACGATTGTTCTCAAGTGGTATGCTCGTTGGATTGCGGACCTCACGGTGTTTGT
GCTGAAGGAGTATGTCGTTGTGATGATGGGTGGACCGGTTCATTGTGTGACCAACGACCA
TGCGATATTCGCTGTCATGAACATGGTCAATGCAAGAACGGTACTTGTGTTTGCACACAA
GGTTGGAATAGTAAACATTGTACACTGCCCGGTTGTCCAAACGGATGCTCTCGACACGGC
CAATGCCTACTGGAGGAAGGAGTCTACCGATGTTCCTGTGCAGATGGCTGGGCTGGCACA
GATTGTTCTATTGAGTTGGAACTATCCTGCAACGACAATGAAGACAATGACGAAGATGGC
ATGACGGACTGCTCGGACTCTGAATGCTGCAGCCGCCCAGAATGTTCTGAGCACATTATG
TGCTTAGCTTCCAATGATCCTGTAGAAGTATTACTTCGTAAGCAGCCTCCATCCGTCACA
GCTTCCTTCTATCAACGTGTCAAATTTCTTATTGAAGAAAACTCAGTACAGAGTTACGCG
CACATGGACGAATATTCTGAAAGGTAA

Protein sequence:

MTMKSLKYSEREDMMEGFSPVPPPLPRRQPVRNIYAEPFVDPRMGGTGMTSGMGGGTLGH
CQLANQPLVMPGFPLRASQPSPVPMYSPSRFHIDKRCQHRCTWKCLAIAMICLCVILAAM
LAYFIALSSIKSNIDNSNCILVQDVKAVTHSHGVRDALSTSSPSDESISTSSLGGWSTTA
EAALESAVIPQQAQQAAPPPWTPALELREFDELHSVTIPAYQFWSSEFRNKQPAFVSLNF
TVPWGANFAVYGRRNVAPSVTQYDFVEFIRGGRVDHRLRKRRSLHESDPHDHGFNGFESL
FSTFSPFDRWNHTIIKRSTDMRVNVSIIQYLDTGRWFISIYNDELQPHHVEMIVSEAEGV
TNSCPDDCSGHGSCYLGKCECMDGFEGHDCSKSVCPVLCSGHGAYAGGICHCSEGWKGAE
CDVPAHDCEPADCSGRGQCIAGQCQCKAGWKGAKCDEEDCLDPTCGGHGSCVRGRCVCRA
GWRGAACTERDARVQRCLPACSQRGVYDLDAGRCVCDPLYTGDDCSQVVCSLDCGPHGVC
AEGVCRCDDGWTGSLCDQRPCDIRCHEHGQCKNGTCVCTQGWNSKHCTLPGCPNGCSRHG
QCLLEEGVYRCSCADGWAGTDCSIELELSCNDNEDNDEDGMTDCSDSECCSRPECSEHIM
CLASNDPVEVLLRKQPPSVTASFYQRVKFLIEENSVQSYAHMDEYSER