New model in OGS2.0 | DPOGS210967  |
---|---|
Genomic Position | scaffold307:- 86340-88349 |
See gene structure | |
CDS Length | 1647 |
Paired RNAseq reads   | 71 |
Single RNAseq reads   | 195 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006395 (2e-123) |
Best Drosophila hit   | proboscipedia, isoform A (1e-20) |
Best Human hit | homeobox protein Hox-B3 (2e-26) |
Best NR hit (blastp)   | zerknullt [Bombyx mori] (9e-150) |
Best NR hit (blastx)   | zerknullt [Bombyx mori] (9e-150) |
GeneOntology terms    | GO:0005634 nucleus GO:0006355 regulation of transcription, DNA-dependent GO:0051216 cartilage development GO:0001525 angiogenesis GO:0060216 definitive hemopoiesis GO:0021615 glossopharyngeal nerve morphogenesis GO:0030878 thyroid gland development GO:0007275 multicellular organismal development GO:0009952 anterior/posterior pattern formation GO:0048704 embryonic skeletal system morphogenesis GO:0003700 sequence-specific DNA binding transcription factor activity GO:0043565 sequence-specific DNA binding |
InterPro families    | IPR001356 Homeobox IPR020479 Homeobox, eukaryotic IPR012287 Homeodomain-related IPR009057 Homeodomain-like IPR017970 Homeobox, conserved site |
Orthology group | MCL40068 |
Nucleotide sequence:
ATGTCCTCAAGCTCCCATTCTCCACCCCCAAGTGATCGGTCAGACGAAGATTCTAAGGAC
TCGATTTCATCTGAATACGCTGAAACAAGTCAAATACACAACACCAGCCATATAACATTG
GCACAAAAAAGTTTTTACAGAGTTAACTCTTTAACGTCACACGGCGTTGAAGATATACTT
TCTGAAGGCAATAGCTACAATACTCAAGAGAATACTTATCCGGATTGTAAAACCGAAGTG
CTTTCGTCAATCTCAAATCCCTTCGTTCCAGATTATAATTCTAGAGATTCAACAGGATTT
AGTATCCATGACATACTTGGCCTTCAACAAGCCTACAATGTGGCCAACGCTCAAGACGAA
TTGGAATCCCGATACGAGTATCAAATACCGAACTATGACAATATAAGTAATAGTTCTCAA
AATAATTATGGTGAAGAGAGAATTTCTGACCATACGATACCAAAGAGCGCAGATATTTTC
AACGTAACCGAATCAGAAATTCGAAATGAAGTTGTGTTTCAAAGGAATTACTCAAATAAC
GAAACTATTTCTTGTCACCAAAGAAGTGATTTGGACAATGATGTTGTAAACAATGCTGAA
AGGGAAGAAAGCGATATTAACGAATCTAGTTTTCCCGGACAGAATTCTTCTTGGTGTGAA
AAAAACTCGCTGATTAGTAGTCAAGTCATCAGTACAGCCTCTTCTATATCTAACGATATG
TCTACCGATTCGTCATCATATCCAAAAGGTTTTACAAAACGAGCACGCACTGCTTACACA
AGTTCCCAACTTGTAGAATTGGAAAATGAGTTTCATCAAAATCGATACTTGTGTCGTCCT
AGGAGAATAGAATTGGCCAATTATTTGCAGCTTTCGGAACGCCAAATCAAAATATGGTTT
CAAAATAGGAGGATGAAATACAAGAAAGATAATAAACACAATAAACCAAGCTCGTCCGTA
GACGATAACAGTCCTACAACAAGTTCTAAGGAAATGTCTCCAACTCAGGATCATAAATTG
AGCCACAGTCGTGGCTGCGGAGGTCATGATAGACATAGACGTTTACTTAACGAAAGCCAT
GCAACTCATCATAAAATGTATCTTCCAACCAACGAAACTATACCAAGACCTCCCGATTAT
TCTTCAATTAGTCCGATTAAATCAGTTGTTAAACCTGGTTCTCAGAGCACTATAGAATTG
CCAGCATATACACCTAACTTATCTTACTCGTCCTACTACACAGGAGCAAGCCGGAGCGGT
TACTCACCGATATCTGAGGTTTATCGATACAACAGCGATGAATCATTGCAGCAAACTTCT
CACACATTGTCGTTATTACAATCGGATAGTTACGTACCTAATGGGATGAATCTAAAGCTC
GCCGAAGACATGACTCGATGCCCAACTGGATCTCCATATTATAACACGCTTTCAAACGGA
GTGGTTATGCATATTCCAACTACAGATGCATATGGTTACGCCAGCACTATTCCCGCTCTT
TCAGCATCTGCTTTCGAAGATAATACCGTTCACACAAGATCAAGCATATCTCAAGATCCT
TACTTTGCTTATTTATCATCAGCAGAGACGTCTAACCAACAAACCTCTTCGACAGCTAAC
AAGTTTTCTTCGTACATTTCACTCTAA
Protein sequence:
MSSSSHSPPPSDRSDEDSKDSISSEYAETSQIHNTSHITLAQKSFYRVNSLTSHGVEDIL
SEGNSYNTQENTYPDCKTEVLSSISNPFVPDYNSRDSTGFSIHDILGLQQAYNVANAQDE
LESRYEYQIPNYDNISNSSQNNYGEERISDHTIPKSADIFNVTESEIRNEVVFQRNYSNN
ETISCHQRSDLDNDVVNNAEREESDINESSFPGQNSSWCEKNSLISSQVISTASSISNDM
STDSSSYPKGFTKRARTAYTSSQLVELENEFHQNRYLCRPRRIELANYLQLSERQIKIWF
QNRRMKYKKDNKHNKPSSSVDDNSPTTSSKEMSPTQDHKLSHSRGCGGHDRHRRLLNESH
ATHHKMYLPTNETIPRPPDYSSISPIKSVVKPGSQSTIELPAYTPNLSYSSYYTGASRSG
YSPISEVYRYNSDESLQQTSHTLSLLQSDSYVPNGMNLKLAEDMTRCPTGSPYYNTLSNG
VVMHIPTTDAYGYASTIPALSASAFEDNTVHTRSSISQDPYFAYLSSAETSNQQTSSTAN
KFSSYISL