DPGLEAN06737 in OGS1.0

New model in OGS2.0  DPOGS201463
Genomic Position  scaffold124:+ 73058-79303
CDS Length  1986
Paired RNAseq reads  88
Single RNAseq reads  348
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002615 (0.0)
Best Drosophila hit  defective proventriculus, isoform A (3e-44)
Best Human hit  DNA-binding protein SATB1 isoform 1 (3e-11)
Best NR hit (blastp)  PREDICTED: similar to defective proventriculus CG5799-PA [Tribolium castaneum] (1e-164)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL012091 [Aedes aegypti] (4e-124)
GeneOntology terms
GO:0007494 midgut development
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0003680 AT DNA binding
GO:0045449 regulation of transcription
GO:0007476 imaginal disc-derived wing morphogenesis
GO:0016348 imaginal disc-derived leg joint morphogenesis
InterPro families
IPR009057 Homeodomain-like
IPR001356 Homeobox
IPR012287 Homeodomain-related
Orthology group  MCL16062

Nucleotide sequence:

ATGGAGTATCAACAAAATATTAATAAAGTTGGTAGCAAAGCTGGGGATCCAAGTAAAACC
TTGCCTGTCCATTGTGTGGTGGAAGCCGTTGCGTCATTGGAAGAAGGTGTTTGGAGGCGA
AGGGCAGTTGTTGAAACTGACAGCTATGTCATCATTCCCGCTGCTACCGCCTTCCATGAA
CTCGTGCCAGCGGCCATGATGCGACTTGGATACCCCCACGAGCTAGCTGCTTCTGCGAAA
GGTTCAGTGGTAATTAATAACTGGAAGCCGTTGCCGTTCGAGCGCATATCCGATGGGCCT
TTAGCCACTGTCGGTGAAGTGTTGGGTGAGTTGACGACCGTGGCCACCCTTAGGATCCAA
CTGTTACGGCCGAGACCCACACCCTTGCAGGATATCAAGGACAAACTCCTGAGACTTTTG
CTACTCCAGAGCAGACCCTTGTTGATGTCGACTGGATGTCCTTTAGATGAGCGAGTTCGG
TACAGTGAGCTGAGCCAGATTCCCAATGTCCCGGCAATGTTACGTTCTTTAATACCTGAT
TGTAAACCCGACCCGTACACAATGGCCACCCCTACAACCCCCCAAACCCCCAACCCCACA
CCCCCACACCCCAACCCCCTGTATTACTTTGTAACGTTAACTCAGATTTGTCGTGGACAA
GATAGGTCTTCGGGTCCTCATAATTTTCATGAACCGACAGAGGAAACTCGTCGTAAATTT
GAATCTTGGTGGAGTGCCCAAGTTTCACCTCGCCCCCCACCATTCAGTCCTCGGCGCTAC
CCATCTCCAGGTCCTAAAAATCGATCCCCTACACTTAACACCATCCCAGACCATCTGCAC
CCAGCACTACAGACCGTTCAAAACCAATATCCAACACAAAAAACTAGAATGAGAACAAGT
TTTGATCCCGAACTAGAACTACCAAAGTTACAACGATGGTTCTCTGAAAATCAACACCCT
AGTAGGCAGCAAATCCAACAATATGTCAGAGAGTTAAATAATTTAGAATCAAGACGGGGA
CGAAAGCCCTTAGACGTCAATAATGTCGTTTACTGGTTTAAAAATGCAAGAGCTGCTCAA
AAACGGGCTGAGTTACGTAATATTGGAGGGATAGGGGGACATCTTGGTGTCAACGGCTTT
AATAGCAGGAGTCATAGTCCATCGAATGGATCACTAATGGCTGGTAATGATAACTATAGT
TCTCATGACCATAATTCTTTGAAGAGTCCCATGCAATTATCAGGAAGTCCTGGTAGATAC
CCAATGTCAGTTATGTCTGAAGACAATCTTTCAAACCCTGGATCTGATTTGGAAGATGAC
GGAGTCCATGATATCAAACAAGAGCCAAAGGACTTAAGCAAACAAGAACAAGTGCACTCA
CCTCAGCGTTCACCGACCAAAAATAGTGACAGCTCTCACAATAATAATAATAACAATGAA
GATGAAAATGGAGGCGCCGAAGATCATGACATTCCTTCGGATGAAGAAGTCGTTCAAGAG
CGTCATTACAGACCATCGTCTCCTCATCTCGACCGTTTACCGTTTCCAATGGTACCGAAT
CATCCCATGTTCGGTCACGGTATAATGTACATGAGCCAATACATGGGAGGATTCCCAGGT
GTAGGGGGTGTTCCAGGTGAAGGGGCTAGCGGCTTAAATTTAGCCCTAGCAGGCGCGTCT
GACGAGCGCCGCAAACGCAATCGTACCTTCATAGACCCCGTCTCTGAGGTTCCCGTGTTA
GAGCAGTGGTTTTCAATGAACACACATCCTTCGCACAATCTCATACTTAAATATACAGAA
GAGTTGAACAGGATGCCATATAGGCAAAAATTTCCACGACTGGAATCTAAAAATGTTCAG
TTCTGGTTCAAGAACCGTCGGGCTAAGTGCAAGAGGCTGAAGATGTCTCTTTACGAGCCG
ACTTCACCTGGTCATTACTCCCATCCCGGTCATCCACACGCAATTGCTGAAAGAAAATTG
GTGTAA

Protein sequence:

MEYQQNINKVGSKAGDPSKTLPVHCVVEAVASLEEGVWRRRAVVETDSYVIIPAATAFHE
LVPAAMMRLGYPHELAASAKGSVVINNWKPLPFERISDGPLATVGEVLGELTTVATLRIQ
LLRPRPTPLQDIKDKLLRLLLLQSRPLLMSTGCPLDERVRYSELSQIPNVPAMLRSLIPD
CKPDPYTMATPTTPQTPNPTPPHPNPLYYFVTLTQICRGQDRSSGPHNFHEPTEETRRKF
ESWWSAQVSPRPPPFSPRRYPSPGPKNRSPTLNTIPDHLHPALQTVQNQYPTQKTRMRTS
FDPELELPKLQRWFSENQHPSRQQIQQYVRELNNLESRRGRKPLDVNNVVYWFKNARAAQ
KRAELRNIGGIGGHLGVNGFNSRSHSPSNGSLMAGNDNYSSHDHNSLKSPMQLSGSPGRY
PMSVMSEDNLSNPGSDLEDDGVHDIKQEPKDLSKQEQVHSPQRSPTKNSDSSHNNNNNNE
DENGGAEDHDIPSDEEVVQERHYRPSSPHLDRLPFPMVPNHPMFGHGIMYMSQYMGGFPG
VGGVPGEGASGLNLALAGASDERRKRNRTFIDPVSEVPVLEQWFSMNTHPSHNLILKYTE
ELNRMPYRQKFPRLESKNVQFWFKNRRAKCKRLKMSLYEPTSPGHYSHPGHPHAIAERKL
V
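
Consistency check (illustrative sketch, not part of the original record):

The CDS length and the two sequences above can be cross-checked with a few lines of Python. The sketch below assumes the standard genetic code (NCBI translation table 1) and a complete CDS read in frame 0; the names translate, expected_protein_length, FIRST_CDS_LINE and LAST_CDS_LINE are hypothetical helpers introduced here for illustration. It translates only the first and last lines of the nucleotide sequence and derives the residue count implied by a CDS length of 1986.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codon bases ordered T, C, A, G.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence listed in this record.
FIRST_CDS_LINE = "ATGGAGTATCAACAAAATATTAATAAAGTTGGTAGCAAAGCTGGGGATCCAAGTAAAACC"
LAST_CDS_LINE = "GTGTAA"

print(translate(FIRST_CDS_LINE))      # first 20 residues of the protein
print(translate(LAST_CDS_LINE))       # final residue followed by the stop marker
print(expected_protein_length(1986))  # residues implied by the CDS length
```

Passing the full nucleotide sequence (with line breaks removed) to translate should reproduce the protein sequence listed above, followed by a trailing '*' for the stop codon.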