New model in OGS2.0 | DPOGS206411  |
---|---|
Genomic Position | scaffold303:- 76755-85672 |
See gene structure | |
CDS Length | 822 |
Paired RNAseq reads   | 54 |
Single RNAseq reads   | 228 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013872 (4e-65) |
Best Drosophila hit   | tailup, isoform B (7e-61) |
Best Human hit | insulin gene enhancer protein ISL-2 (6e-48) |
Best NR hit (blastp)   | insulinprotein enhancer protein isl [Aedes aegypti] (1e-102) |
Best NR hit (blastx)   | insulinprotein enhancer protein isl [Aedes aegypti] (4e-86) |
GeneOntology terms    | GO:0007362 terminal region determination GO:0008293 torso signaling pathway GO:0008258 head involution GO:0046665 amnioserosa maintenance GO:0007390 germ-band shortening GO:0005634 nucleus GO:0003704 specific RNA polymerase II transcription factor activity GO:0006357 regulation of transcription from RNA polymerase II promoter GO:0007391 dorsal closure GO:0007399 nervous system development GO:0008270 zinc ion binding GO:0006355 regulation of transcription, DNA-dependent GO:0003700 sequence-specific DNA binding transcription factor activity GO:0043565 sequence-specific DNA binding GO:0008045 motor axon guidance GO:0008407 bristle morphogenesis GO:0070983 dendrite guidance GO:0035310 notum cell fate specification |
InterPro families    | IPR001781 Zinc finger, LIM-type IPR012287 Homeodomain-related IPR009057 Homeodomain-like IPR001356 Homeobox IPR017970 Homeobox, conserved site |
Orthology group | MCL11887 |
Nucleotide sequence:
ATGAGAGCAAAGACGAAGATATATCATATAGACTGCTTCAGATGCTGCGCTTGCGCACGA
CAACTTATACCCGGTGACGAGTTCGCGTTGAGAGAAGGCGGAGCTTTATATTGTAGAGAA
GATCACGATGTATTAGAAAAGAGCGCTAACACAAGCGGCAGCAGCGCCGGCAACGCCGAG
AGCAACAACAACACAACACTCAGCAACAACAATTCGCATCACCCGCACGAGTTAGGATCT
ATGTCGGATTCAGGAAGTGAGTCTGGCTCGCATAAGAGTGGAAGAGCCAGGGCTGGCGCT
GCGGCTGATGGTAAACCCACCAGGGTGAGGACTGTCCTCAATGAGAAACAATTACACACA
CTAAGAACCTGTTATGCTGCGAATCCTAGACCTGACGCTCTCATGAAGGAACAGCTGGTT
GAAATGACAGGTCTTAGTCCTCGAGTGATAAGAGTGTGGTTCCAGAACAAGAGATGCAAA
GACAAGAAGAAGACTATACAGCTGAAGATGCAGATGCAGCAAGAGAAGGAAGGCCGCCGT
TTGGGCTATATGTCTATGGGAGTGCCGTTAGTGGCCGGTTCGCCTGTAAGACATGAGGCT
GGGTCTCTAGCTCTAGAGGTGACGGCGTATCAGCCGCCGTGGAAGGCCCTCAGCGACTTC
GCACTCCACGCGGACCTTGACAGGCCTCAACACAGCGCCGCCTTCCAACAGCTCGTGAAC
CAGATGCACGGTTACGACATCCCCTCTCTGCCCCCTCCACGTCACGAGGACAACTACGTC
ACCTATCTCGAGAGTGACGACAGTCTGCCGCCGTCACCCTAG
Protein sequence:
MRAKTKIYHIDCFRCCACARQLIPGDEFALREGGALYCREDHDVLEKSANTSGSSAGNAE
SNNNTTLSNNNSHHPHELGSMSDSGSESGSHKSGRARAGAAADGKPTRVRTVLNEKQLHT
LRTCYAANPRPDALMKEQLVEMTGLSPRVIRVWFQNKRCKDKKKTIQLKMQMQQEKEGRR
LGYMSMGVPLVAGSPVRHEAGSLALEVTAYQPPWKALSDFALHADLDRPQHSAAFQQLVN
QMHGYDIPSLPPPRHEDNYVTYLESDDSLPPSP