DPGLEAN12572 in OGS1.0

New model in OGS2.0  DPOGS207335
Genomic Position  scaffold474:- 16171-22484
CDS Length  2184
Paired RNAseq reads  720
Single RNAseq reads  1758
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010110 (3e-76)
Best Drosophila hit  medea, isoform B (4e-108)
Best Human hit  mothers against decapentaplegic homolog 4 (2e-94)
Best NR hit (blastp)  PREDICTED: similar to Xsmad4a [Tribolium castaneum] (1e-123)
Best NR hit (blastx)  smad4, putative [Pediculus humanus corporis] (5e-118)
GeneOntology terms
GO:0007179 transforming growth factor beta receptor signaling pathway
GO:0009950 dorsal/ventral axis specification
GO:0005634 nucleus
GO:0005515 protein binding
GO:0009953 dorsal/ventral pattern formation
GO:0005737 cytoplasm
GO:0030718 germ-line stem cell maintenance
GO:0042078 germ-line stem cell division
GO:0030616 transforming growth factor beta receptor, common-partner cytoplasmic mediator activity
GO:0003677 DNA binding
GO:0007507 heart development
GO:0007476 imaginal disc-derived wing morphogenesis
GO:0007480 imaginal disc-derived leg morphogenesis
GO:0008283 cell proliferation
GO:0006355 regulation of transcription, DNA-dependent
GO:0045705 negative regulation of salivary gland boundary specification
GO:0003702 RNA polymerase II transcription factor activity
GO:0030509 BMP signaling pathway
GO:0005667 transcription factor complex
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0045887 positive regulation of synaptic growth at neuromuscular junction
GO:0031290 retinal ganglion cell axon guidance
GO:0030707 ovarian follicle cell development
GO:0035019 somatic stem cell maintenance
GO:0048666 neuron development
InterPro families
IPR013019 MAD homology, MH1
IPR001132 SMAD domain, Dwarfin-type
IPR017855 SMAD domain-like
IPR008984 SMAD/FHA domain
IPR003619 MAD homology 1, Dwarfin-type
IPR013790 Dwarfin
Orthology group  MCL14760

Nucleotide sequence:

ATGAATACGACGGCGCCAACCTCGGCAGATGCATGCCTAAGCATTGTCCACTCACTGATG
TGCCATAGGCAAGGTGGTGAGAGTGAAGGCTTTTCAAAGCGGGCTATTGAGTCACTAGTC
AAGAAATTGAAAGAAAAGAGAGATGAACTTGATTCTTTGATCACAGCAATCACCACAAAT
GGTGCCCATCCCAGTAAATGTGTTACTATTCAGAGAACTCTAGACGGTCGATTACAGGTT
GCGGGAAGGAAAGGATTTCCCCATGTGATATATGCTCGCATATGGCGTTGGCCCGATCTA
CACAAGAATGAATTGAAACATGTTAAATTCTGTCAGTTTGCTTTTGATCTGAAATGTGAC
TCGGTGTGTGTTAATCCATACCATTATGAAAGAGTTGTATCTCCAGGTATTGATCTTTCT
GGGCTGACTCTTCAGTCGGGTCCTAGTAGGTTAGTAAAAGATGAGTATACAGCTGGTCTG
AGCGGGAATGGCATGGACATGGATACTGGAGAACTCGTAACAATCCAGCACCATGCCACA
AGCCCCAGACATCATCACTCCACCATTCCCCATCACCATCAACAGTTCCAGACCTCTAAC
ATTATAATAAATCAAGGACAAACGCCAGATGGCGTTGCCAATATGTTTTCTGCTACTCAT
GGACCTCGACCGCAAATTCGAGCTGGAGCACCTATGGTTCCACAAATGGTACATTCGCCA
GGGGCGCAAATGATGGCTAATCATCAGGGACAAATGGCGGGCGCACCTCAAATGGGTCCG
GGAAACCCACAAATGGGTCCCGTTAACCCACAAATGGGGCCTGGAACGCCACAAATGGGT
CCAGGCACTCCTCAAATGGGGCCGGGAACACCTCAAATGGGTACAAACGTCCCACAAATG
GCTTCACCAAGAATGGCGTCAGCTCCCACCCAAATGTCCCCAGGAACCCCACAAATACCA
AATATAAGTCAGGGAATGTCAATACCGAGCCCACAACAAATGGCAATGGCACAACAAAGA
ACTATAGCCCCAAAACTAGAACCGCCCGATGCTATGGATGCAAGAGCTATGTGGCTGCCA
AAGAGAATGAATCATCCTTCAATGCCTGTCAGTATGTCTCCCGGTGGGACGACGCCGTTA
ATAGACGGCTCCAATAATGCATTCTTTACAAACGAGCAGACTTCTACGGATACTCAAATG
ACTCAGACCATGCCAGCTGGGAGTCAATCGGTGTCAGCTGTGGTGCCAGTGACGTCATCA
GCTATGCCAAGTGAAGCCCAGAATGGTTTCGCCGCGACCAGCCCACCACCACAACCCAGT
CCTATACCACATCGCACCCAACATCAACAGGGCACCTGGACCGGGAACAACACCTTGACT
TATACACAGAGCCTGGCGCCGCCGCCCGCCGCTCCTATGCAGGATGTACCCACTCACCAC
CATCACTACTATAATGGCAACCCAGGTGGTTTATTGTCAAGCCAGCCAGCTCCGGAGTAT
TGGTGTTCGGTGGCTTACTTTGAGCTGGATACTCAAGTGGGGGAAACATTCAAAGTGCCA
TCCAGCAGACCAAACGTTACGGTCGATGGTTATGTGGATCCGTCGGGTGGCAACAGATTC
TGTTTGGGTGCTCTCAGTAATGTACACAGAACTGAACAGAGTGAAAGGGCTCGACTCCAC
ATCGGCAAGGGTGTACAGTTGGATCTCCGTGGTGAAGGAGACGTGTGGCTGAGATGTCTC
TCAGATCACTCGGTGTTTGTGCAGTCCTACTACTTGGATAGAGAGGCAGGCCGGGCCCCG
GGAGACGCTGTTCATAAGATATACCCATCAGCATGTATCAAGGTGTTCGATCTCCGTCAG
TGTCACCGTCAGATGCAAACGCAGGCGGCTACAGCCCAGGCGGCGGCGGCAGCGCAGGCT
GCAGCTGTCGCAGGACACATACAGCCAGCACATCCGGGAATGAACAAATGTTTGTCAGCG
GCGGCTGGTATCGGCGTGGATGATCTTCGGAGGCTGTGTATAGTCCGTCTGTCGTTCGTG
AAGGGCTGGGGGCCAGACTACCCTCGCACCTCCATCAAGGAGACGCCCTGCTGGGTTGAG
GTCCATTTACATAGGGCTCTACAGTTACTGGACGAGGTGCTCCACACTATGCCCATAGAT
GGTCCTCGGACTAGCATCGAGTAG

Protein sequence:

MNTTAPTSADACLSIVHSLMCHRQGGESEGFSKRAIESLVKKLKEKRDELDSLITAITTN
GAHPSKCVTIQRTLDGRLQVAGRKGFPHVIYARIWRWPDLHKNELKHVKFCQFAFDLKCD
SVCVNPYHYERVVSPGIDLSGLTLQSGPSRLVKDEYTAGLSGNGMDMDTGELVTIQHHAT
SPRHHHSTIPHHHQQFQTSNIIINQGQTPDGVANMFSATHGPRPQIRAGAPMVPQMVHSP
GAQMMANHQGQMAGAPQMGPGNPQMGPVNPQMGPGTPQMGPGTPQMGPGTPQMGTNVPQM
ASPRMASAPTQMSPGTPQIPNISQGMSIPSPQQMAMAQQRTIAPKLEPPDAMDARAMWLP
KRMNHPSMPVSMSPGGTTPLIDGSNNAFFTNEQTSTDTQMTQTMPAGSQSVSAVVPVTSS
AMPSEAQNGFAATSPPPQPSPIPHRTQHQQGTWTGNNTLTYTQSLAPPPAAPMQDVPTHH
HHYYNGNPGGLLSSQPAPEYWCSVAYFELDTQVGETFKVPSSRPNVTVDGYVDPSGGNRF
CLGALSNVHRTEQSERARLHIGKGVQLDLRGEGDVWLRCLSDHSVFVQSYYLDREAGRAP
GDAVHKIYPSACIKVFDLRQCHRQMQTQAATAQAAAAAQAAAVAGHIQPAHPGMNKCLSA
AAGIGVDDLRRLCIVRLSFVKGWGPDYPRTSIKETPCWVEVHLHRALQLLDEVLHTMPID
GPRTSIE
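
Consistency check (illustrative sketch, not part of the original record): the CDS length of 2184 nt should equal three times the protein length plus one stop codon, and the final nucleotide line should translate to the final protein line. The snippet below assumes the standard genetic code (NCBI translation table 1); the helper name `translate` and the protein length of 727 residues (12 lines of 60 plus 7, counted from the listing above) are this sketch's own assumptions, not fields of the record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G at each position. "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Last line of the nucleotide listing and last line of the protein listing.
last_nt_line = "GGTCCTCGGACTAGCATCGAGTAG"
last_aa_line = "GPRTSIE"

# Lengths: 2184 is the CDS Length field; 727 is counted from the listing.
cds_length = 2184
protein_length = 727

print(translate(last_nt_line) == last_aa_line)
print(cds_length == 3 * (protein_length + 1))
```

The same `translate` helper can be applied to the full nucleotide sequence (with line breaks removed) to compare against the full protein sequence.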