New model in OGS2.0 | DPOGS201463  |
---|---|
Genomic Position | scaffold124:+ 73058-79303 |
See gene structure | |
CDS Length | 1986 |
Paired RNAseq reads   | 88 |
Single RNAseq reads   | 348 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002615 (0.0) |
Best Drosophila hit   | defective proventriculus, isoform A (3e-44) |
Best Human hit | DNA-binding protein SATB1 isoform 1 (3e-11) |
Best NR hit (blastp)   | PREDICTED: similar to defective proventriculus CG5799-PA [Tribolium castaneum] (1e-164) |
Best NR hit (blastx)   | hypothetical protein AaeL_AAEL012091 [Aedes aegypti] (4e-124) |
GeneOntology terms    | GO:0007494 midgut development GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0005634 nucleus GO:0003680 AT DNA binding GO:0045449 regulation of transcription GO:0007476 imaginal disc-derived wing morphogenesis GO:0016348 imaginal disc-derived leg joint morphogenesis |
InterPro families    | IPR009057 Homeodomain-like IPR001356 Homeobox IPR012287 Homeodomain-related |
Orthology group | MCL16062 |
Nucleotide sequence:
ATGGAGTATCAACAAAATATTAATAAAGTTGGTAGCAAAGCTGGGGATCCAAGTAAAACC
TTGCCTGTCCATTGTGTGGTGGAAGCCGTTGCGTCATTGGAAGAAGGTGTTTGGAGGCGA
AGGGCAGTTGTTGAAACTGACAGCTATGTCATCATTCCCGCTGCTACCGCCTTCCATGAA
CTCGTGCCAGCGGCCATGATGCGACTTGGATACCCCCACGAGCTAGCTGCTTCTGCGAAA
GGTTCAGTGGTAATTAATAACTGGAAGCCGTTGCCGTTCGAGCGCATATCCGATGGGCCT
TTAGCCACTGTCGGTGAAGTGTTGGGTGAGTTGACGACCGTGGCCACCCTTAGGATCCAA
CTGTTACGGCCGAGACCCACACCCTTGCAGGATATCAAGGACAAACTCCTGAGACTTTTG
CTACTCCAGAGCAGACCCTTGTTGATGTCGACTGGATGTCCTTTAGATGAGCGAGTTCGG
TACAGTGAGCTGAGCCAGATTCCCAATGTCCCGGCAATGTTACGTTCTTTAATACCTGAT
TGTAAACCCGACCCGTACACAATGGCCACCCCTACAACCCCCCAAACCCCCAACCCCACA
CCCCCACACCCCAACCCCCTGTATTACTTTGTAACGTTAACTCAGATTTGTCGTGGACAA
GATAGGTCTTCGGGTCCTCATAATTTTCATGAACCGACAGAGGAAACTCGTCGTAAATTT
GAATCTTGGTGGAGTGCCCAAGTTTCACCTCGCCCCCCACCATTCAGTCCTCGGCGCTAC
CCATCTCCAGGTCCTAAAAATCGATCCCCTACACTTAACACCATCCCAGACCATCTGCAC
CCAGCACTACAGACCGTTCAAAACCAATATCCAACACAAAAAACTAGAATGAGAACAAGT
TTTGATCCCGAACTAGAACTACCAAAGTTACAACGATGGTTCTCTGAAAATCAACACCCT
AGTAGGCAGCAAATCCAACAATATGTCAGAGAGTTAAATAATTTAGAATCAAGACGGGGA
CGAAAGCCCTTAGACGTCAATAATGTCGTTTACTGGTTTAAAAATGCAAGAGCTGCTCAA
AAACGGGCTGAGTTACGTAATATTGGAGGGATAGGGGGACATCTTGGTGTCAACGGCTTT
AATAGCAGGAGTCATAGTCCATCGAATGGATCACTAATGGCTGGTAATGATAACTATAGT
TCTCATGACCATAATTCTTTGAAGAGTCCCATGCAATTATCAGGAAGTCCTGGTAGATAC
CCAATGTCAGTTATGTCTGAAGACAATCTTTCAAACCCTGGATCTGATTTGGAAGATGAC
GGAGTCCATGATATCAAACAAGAGCCAAAGGACTTAAGCAAACAAGAACAAGTGCACTCA
CCTCAGCGTTCACCGACCAAAAATAGTGACAGCTCTCACAATAATAATAATAACAATGAA
GATGAAAATGGAGGCGCCGAAGATCATGACATTCCTTCGGATGAAGAAGTCGTTCAAGAG
CGTCATTACAGACCATCGTCTCCTCATCTCGACCGTTTACCGTTTCCAATGGTACCGAAT
CATCCCATGTTCGGTCACGGTATAATGTACATGAGCCAATACATGGGAGGATTCCCAGGT
GTAGGGGGTGTTCCAGGTGAAGGGGCTAGCGGCTTAAATTTAGCCCTAGCAGGCGCGTCT
GACGAGCGCCGCAAACGCAATCGTACCTTCATAGACCCCGTCTCTGAGGTTCCCGTGTTA
GAGCAGTGGTTTTCAATGAACACACATCCTTCGCACAATCTCATACTTAAATATACAGAA
GAGTTGAACAGGATGCCATATAGGCAAAAATTTCCACGACTGGAATCTAAAAATGTTCAG
TTCTGGTTCAAGAACCGTCGGGCTAAGTGCAAGAGGCTGAAGATGTCTCTTTACGAGCCG
ACTTCACCTGGTCATTACTCCCATCCCGGTCATCCACACGCAATTGCTGAAAGAAAATTG
GTGTAA
Protein sequence:
MEYQQNINKVGSKAGDPSKTLPVHCVVEAVASLEEGVWRRRAVVETDSYVIIPAATAFHE
LVPAAMMRLGYPHELAASAKGSVVINNWKPLPFERISDGPLATVGEVLGELTTVATLRIQ
LLRPRPTPLQDIKDKLLRLLLLQSRPLLMSTGCPLDERVRYSELSQIPNVPAMLRSLIPD
CKPDPYTMATPTTPQTPNPTPPHPNPLYYFVTLTQICRGQDRSSGPHNFHEPTEETRRKF
ESWWSAQVSPRPPPFSPRRYPSPGPKNRSPTLNTIPDHLHPALQTVQNQYPTQKTRMRTS
FDPELELPKLQRWFSENQHPSRQQIQQYVRELNNLESRRGRKPLDVNNVVYWFKNARAAQ
KRAELRNIGGIGGHLGVNGFNSRSHSPSNGSLMAGNDNYSSHDHNSLKSPMQLSGSPGRY
PMSVMSEDNLSNPGSDLEDDGVHDIKQEPKDLSKQEQVHSPQRSPTKNSDSSHNNNNNNE
DENGGAEDHDIPSDEEVVQERHYRPSSPHLDRLPFPMVPNHPMFGHGIMYMSQYMGGFPG
VGGVPGEGASGLNLALAGASDERRKRNRTFIDPVSEVPVLEQWFSMNTHPSHNLILKYTE
ELNRMPYRQKFPRLESKNVQFWFKNRRAKCKRLKMSLYEPTSPGHYSHPGHPHAIAERKL
V