DPGLEAN20680 in OGS1.0

New model in OGS2.0  DPOGS209171
Genomic Position  scaffold602:- 9140-12111
CDS Length  1290
Paired RNAseq reads  312
Single RNAseq reads  1157
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011483 (4e-107)
Best Drosophila hit  notum, isoform A (1e-64)
Best Human hit  protein notum homolog precursor (5e-62)
Best NR hit (blastp)  hypothetical protein BRAFLDRAFT_275198 [Branchiostoma floridae] (7e-94)
Best NR hit (blastx)  hypothetical protein BRAFLDRAFT_275198 [Branchiostoma floridae] (1e-76)
GeneOntology terms
GO:0016787 hydrolase activity
GO:0005576 extracellular region
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families  ND
Orthology group  MCL13228

Nucleotide sequence:

ATGACCGTCAGTCATTCCTGCTGGCTCAAATGGACCCTCTGTGCGGTGGTCATATCAGTC
TGCGAAAGTCTGGTCCAGGCAGACAGTCTGCGACTGGTGTGGCTCACAAACACTTCACTG
ACCTGTAATGATGGATCACCCGCAGGATATTACATCCGTCGTGGCAGTAACAGCCGTCAC
TGGGTGTTGTATTTGGAAGGTGGTGGCTATTGTTGGGACGCGGGCTCATGTGGCGCGCGG
TGGACGAGACGCCCTGGCCTAATGTCTTCCACACGTTGGCCTCGAGCGCGAAGAGCTCCA
GCCTTGCTATCTTCCGACCCCCAAGCAAACCCTCTCTGGCACGCCTCCAATCATGTTTTA
TTACCGTATTGCTCTAGCGATATGTGGGCAGGAACTCGTCTCCATACAAGAACTAATGGC
AGTTTCGCGTTTGTGGGGCACCTTATTGTCCGATCGGTCCTCAATGAACTATTGCACCTA
GGCCTCGCGGGCCGTTTGCTACTTGTAGGATCTAGTGCTGGAGGTACGGGTGTCATGCTT
CACGCTGACTCTACAAGAAGAACTCTTAGAGCTCACAGTGTACGAGTTGCGGCTATAGCA
GATTCTGGATGGTTCTTGGATCGTCCACCAAGAGCGAGACGTGCATCATCAGCTAACGCT
GTAGCTCGTTTAGGCCACACATTATGGTTAGGGGCACCACCCAATTCCTGCGTTAGGGAT
TTCCACGACAAGCCCTGGCTATGCTATTTTGGGTATCGGCTCTACCCTCACATACGCACG
CCCCTTTTTGTTTTCCAATATCTTTTTGACTCTGCCCAGCTTACAGCAGAAGGAGTACGC
GCTCCTAGGACGAGAGCGCAATGGGACGCCGTTCATGAGACGGGCGCGGCTATTCGGGCT
AGCTTGAAGACCGTACGCGCTACCTTCGCGCCTGCATGTATAGCCCACGGCGCCCTCGCA
CGCCCGGAGTGGCTGGCAATAAATGTGTCGGGCATATCATTGCCAAACGCGATCGCCTGC
TGGGAACGCCGGTTCAGAGACGGTAATAGGAAGGAACGCCCTAGATGTGCACCTCGGAGA
CTGATTGAGCGTTGTTCTTGGCCGCAATGTAACAGTTCGTGTCCTAGACTGCGAGATCCT
CGGACTGGTGAGGAAGTCGCTCTGGCGGCTTTGCTACAAAGTTTCGGTCTAGACGTCCGT
GGTGCTGCAGCCGCGATGGGTCTTGATGCTCGAGCTTTGTCTCGTATGAGTCGAGCCGAG
CTACTGCCACTCTTGGCACCCCACACGTGA

Protein sequence:

MTVSHSCWLKWTLCAVVISVCESLVQADSLRLVWLTNTSLTCNDGSPAGYYIRRGSNSRH
WVLYLEGGGYCWDAGSCGARWTRRPGLMSSTRWPRARRAPALLSSDPQANPLWHASNHVL
LPYCSSDMWAGTRLHTRTNGSFAFVGHLIVRSVLNELLHLGLAGRLLLVGSSAGGTGVML
HADSTRRTLRAHSVRVAAIADSGWFLDRPPRARRASSANAVARLGHTLWLGAPPNSCVRD
FHDKPWLCYFGYRLYPHIRTPLFVFQYLFDSAQLTAEGVRAPRTRAQWDAVHETGAAIRA
SLKTVRATFAPACIAHGALARPEWLAINVSGISLPNAIACWERRFRDGNRKERPRCAPRR
LIERCSWPQCNSSCPRLRDPRTGEEVALAALLQSFGLDVRGAAAAMGLDARALSRMSRAE
LLPLLAPHT
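
Consistency check (sketch):

The CDS length of 1290 nt corresponds to 430 codons, i.e. 429 residues plus a stop codon, which matches the 429-residue protein sequence above. The Python sketch below shows one way to re-derive the protein from the nucleotide sequence. It assumes the standard nuclear genetic code and an unambiguous A/C/G/T sequence; the names `CODON_TABLE` and `translate` are illustrative and are not part of this record or of any database tool.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the CDS uses the standard nuclear code and contains no ambiguous bases.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
first_30 = "ATGACCGTCAGTCATTCCTGCTGGCTCAAA"
last_30 = "CTACTGCCACTCTTGGCACCCCACACGTGA"

print(translate(first_30))  # start of the protein
print(translate(last_30))   # end of the protein, stop codon excluded
print(1290 // 3 - 1)        # residues expected from the CDS length
```

Pasting the full nucleotide sequence into `translate` and comparing the result with the protein sequence gives a complete check; a base outside A/C/G/T would raise a `KeyError` in this sketch.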