DPGLEAN03839 in OGS1.0

New model in OGS2.0DPOGS211550 
Genomic Positionscaffold8729:+ 621-13594
See gene structure
CDS Length3111
Paired RNAseq reads  2114
Single RNAseq reads  4836
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014363 (1e-07)
Best Drosophila hit  cramped, isoform A (2e-43)
Best Human hitprotein cramped-like (7e-35)
Best NR hit (blastp)  PREDICTED: similar to cramped [Tribolium castaneum] (1e-57)
Best NR hit (blastx)  PREDICTED: similar to cramped [Tribolium castaneum] (2e-55)
GeneOntology terms



  
GO:0045449 regulation of transcription
GO:0007379 segment specification
GO:0005634 nucleus
GO:0003677 DNA binding
GO:0006260 DNA replication
InterPro families

  
IPR009057 Homeodomain-like
IPR017884 SANT, eukarya
IPR001005 SANT domain, DNA binding
Orthology groupMCL13233

Nucleotide sequence:

ATGGTGGGGATAGCCATGAGGATTCTTTGGAAGTGTACTCCACAGGTAGCGATCAAAGCA
GGCGTCATGTCTTCGGGAGTTCAACAGGATGAAGAGCGGACCGAGCTGTTGGGATCGTTA
ACAACACAACAACAACAGAGGACCAGCGCCAGGGTCATTAAGAAACTGAGACTGGAACCG
CAAATTGACAAACGAGACGTCATAGAATGCGAGACTCCAAACAAAATAGACGATAAAGAC
CCTTTAAAGTTCCCAACGGTCAAACAACGTATGCCCAAGGCGTTGTGGTCGGCAGATGAG
AAAAGCCTTTTCTTCGAAGCCTTGAACGAATACGGCAAGGACTTCGACTCAATCACCTCG
TACATTTGCGCTAAAATGAAGAAGAAGGGCATGCTAGATGGAAATCTGAAGACTAAGACC
CAAGTCAGCCATTTTTATTACAGGACGTGGCATAAGCTGTCCAAACACGTGCGTTTTGAT
GAAAATGTCAAAAAAGTAGCCCAGGAGCTTTACGCTCTGATAAATTATGGCGAGTTGAGA
AGGAAGCTGGTGTCTGTTAATGAGAAGATCTGCGCTCGTTTGGGAGAAATGGTCCGTGGA
GGATCCATAGCTGTGAGGACCAAGGGAAAGACGATCAGAGTCAGAACACCAATGTGCAGA
GCTCTCAGACGACTCAATCAAATCGCGGAGCGCGCGTACGGCGCTCGTGTGTGCACCCGG
GCTCAAGTGATATTGCGCGCTCGGGACGCGGCGTCTTGGACGCGCGTGCAGGCCGCCGCA
CACAACCCGCGGGCGGTCGTCGCGCTCACTCTCAGGACGAGGCTCGTGGCGCTGCTGTGG
GCTCTCGAGAGACGGTGGAATTGTAAGCCATGTGTTGTAAAAAAAAGTGATTTAAAAACC
GAGGAATATTCTTCGCCATTTTGTGAGAACGGAGACAAGCTGGACGAACACCTCAACTTG
GAAACAGACAGGACCATAATCCAAGAGCGTGCGCCCCCCGGTGTGTCGCTCCACGTCGGT
CCTCGGCCTGAGGCCGACGTCCGTCTGCCGGAGCTTCGCCCTCGCGAGCCTCTGTCGAGT
CAGAAGATCTGCTTTGCTTCATATCTGGAAAGAATGGGCGCTTTGAGAAGACAGGACGGA
GATGTCAAAATTCGTACGCCGAAACGGCACAGAAAAGACAGCGTGTCCGATAAAGATAAG
GAAAATGACAAGAAAATTAAAATAGAGGACATAGAAACAAACAAACTGATTAACATAGAA
GAAACAGCCATAGACGGCATAGAACTGATGGCGCATTACAAGAACAACCAGGAAGATGAA
GAGAAACCCAGCACTGAGGAAGAGAAGGAGATGAGGTCGGAGGAGGTCGTGGAGGAAGAT
TCAGAGAGGGACAAAGACGTGATAGAGAGAGACAATGACGTGTTGGACAGAGACAGAGAC
ATGCGAGAAAGAGACAAAGATATGTTGGACAGAGAGAAAGAGATGCCAGAGAGAGACAAG
GATATGTTAGAGAGAGAGAAAGACAGCTTCTCAGAAATGGAAGACGATGAGAAATATAAC
AAGAGTGACACAGACAATGAGAGCGATGGACGAGAGAAGAAACAGATGAAATATAAGAAT
TTGAAGGTGAAGTTCCGTATACGTCCCAAGAAGAGAGGTGGCTCCGTCTACACCCTCGTG
TTGGATCATGATAAGAATAAAGAGGAAGATGCCAAGATAGATGAAAAGGAGGAGAAGGAA
GAAGAACAGAAGCCGGATATAGATGTAGACTTCGCGATGAGGCAGGTCAGGAAAGGATGG
AGCGTTTGGGACGCGGGCGATCTTACCATAGGAGATTTGTATTTGATGTTCGGTTCCCGC
TCGAGATTAGAATTGGACTACTGGTGGGCGGAGCCTACACCCCCACTACCTAAGACGAGG
ACGGAAGATAGGCTCGATAGGAGAGAGAAAGGGAACAAAACGCCAGACAAGGGAGACAGC
TCAGTAGAAGACGAGAGGGAGAGGTGTGACAACGACATACTCTCACCCAAGAACACGTTC
AGCCAAGACAGTAACGACGGCCTGTCCGGCGACGAGAGGAAGAGTGAGCAGCTGTCGTCG
CCGGACCACAAGTCCGGGACCTTGAAGCTGGTCAGTAAACTGATCAACCGACCCACGCAC
GTCTCCACCAACAACGGCTACTCGCTGGTTAGCGACAGACTTCGGCGACTGCTGGCGCTG
GCGGGGAACAGCCACTTTGGTGGAGGGGCGAGAGGAGTGCACGCGCGGAAACACGGAGCG
ACACATGTGCAGAAGCCCCCCGCGTGTAGATTGAGTCCGACGAGGTCTCCACCCGCCACG
GGCCCCGCGCTCTTCAGACACCCCGCACCCATCGCTCCCAAGCCAGAGCCGGAGTCGTGT
CAGTCGCCTATTAGTCTCAATGGTCTGCCTAAGTGGCGCCGCGGGCGACCTTCCACCGAC
AGACGGGTCGTGGTCCAACGTCTCCTGCCCCTCATGCCCAAACTACCACCACCAAATAAT
CTGATTCCCGTGAAGATGGTGTCCAACTCCCAGCCGGTCCAACCGAGGCTGGTCCCCAAG
CCGCCGCCGTCAAGCTCCTCCGACCTGTCGATGTACTACGTGCTGAGTCAGTCCAACGGA
CAGTTCTTCTTCCACGACGGCGACAGACGGATCCCCATCCTCACCGACTCCGGGAACACC
TCCCAGAACGCCGAAGACGCCAAGGCTGATGATGGAGAGGTTAAGGAAGACAAGGAGTCG
GACATCAAGATAGTTAAAGTAGAAGCTGGCGTCGAGAACGGGGGGGAGGCAGGGGAGGGG
CGGACATACGAGCAGAATGATATCTCAAGTTTCCTCCCGTCCGAGTCCCTGTCGCTGTCT
CCATCCCGCTTGCTCCGCGCGCCGGGCGAGGGGGGCGAGGCGGACTGGCTCGACACACAC
GACTTCTCACTCAGCAGCTTCCTCTCCCACCTGGAGAAGGCCCAGCAGGAGCTACCGGTG
GATTCCCATCTCCAGTCGTTGATGGCTGAAAGCAGTGTGGACTACGTGGCCAAGTTCGCT
GACCTCGCCGCTGAGGTGGACGACCAAGACCTCAGTGACGACCTGCCTTAA

Protein sequence:

MVGIAMRILWKCTPQVAIKAGVMSSGVQQDEERTELLGSLTTQQQQRTSARVIKKLRLEP
QIDKRDVIECETPNKIDDKDPLKFPTVKQRMPKALWSADEKSLFFEALNEYGKDFDSITS
YICAKMKKKGMLDGNLKTKTQVSHFYYRTWHKLSKHVRFDENVKKVAQELYALINYGELR
RKLVSVNEKICARLGEMVRGGSIAVRTKGKTIRVRTPMCRALRRLNQIAERAYGARVCTR
AQVILRARDAASWTRVQAAAHNPRAVVALTLRTRLVALLWALERRWNCKPCVVKKSDLKT
EEYSSPFCENGDKLDEHLNLETDRTIIQERAPPGVSLHVGPRPEADVRLPELRPREPLSS
QKICFASYLERMGALRRQDGDVKIRTPKRHRKDSVSDKDKENDKKIKIEDIETNKLINIE
ETAIDGIELMAHYKNNQEDEEKPSTEEEKEMRSEEVVEEDSERDKDVIERDNDVLDRDRD
MRERDKDMLDREKEMPERDKDMLEREKDSFSEMEDDEKYNKSDTDNESDGREKKQMKYKN
LKVKFRIRPKKRGGSVYTLVLDHDKNKEEDAKIDEKEEKEEEQKPDIDVDFAMRQVRKGW
SVWDAGDLTIGDLYLMFGSRSRLELDYWWAEPTPPLPKTRTEDRLDRREKGNKTPDKGDS
SVEDERERCDNDILSPKNTFSQDSNDGLSGDERKSEQLSSPDHKSGTLKLVSKLINRPTH
VSTNNGYSLVSDRLRRLLALAGNSHFGGGARGVHARKHGATHVQKPPACRLSPTRSPPAT
GPALFRHPAPIAPKPEPESCQSPISLNGLPKWRRGRPSTDRRVVVQRLLPLMPKLPPPNN
LIPVKMVSNSQPVQPRLVPKPPPSSSSDLSMYYVLSQSNGQFFFHDGDRRIPILTDSGNT
SQNAEDAKADDGEVKEDKESDIKIVKVEAGVENGGEAGEGRTYEQNDISSFLPSESLSLS
PSRLLRAPGEGGEADWLDTHDFSLSSFLSHLEKAQQELPVDSHLQSLMAESSVDYVAKFA
DLAAEVDDQDLSDDLP