New model in OGS2.0 | DPOGS211229  |
---|---|
Genomic Position | scaffold525:+ 17269-49379 |
CDS Length | 1446 |
Paired RNAseq reads   | 35 |
Single RNAseq reads   | 119 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003208 (7e-116) |
Best Drosophila hit   | Lim3, isoform A (9e-107) |
Best Human hit | LIM/homeobox protein Lhx4 (8e-81) |
Best NR hit (blastp)   | lim homeobox protein [Culex quinquefasciatus] (3e-149) |
Best NR hit (blastx)   | lim homeobox protein [Culex quinquefasciatus] (8e-121) |
GeneOntology terms    | GO:0005634 nucleus; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0006355 regulation of transcription, DNA-dependent; GO:0003704 specific RNA polymerase II transcription factor activity; GO:0007399 nervous system development; GO:0043565 sequence-specific DNA binding; GO:0008270 zinc ion binding; GO:0008045 motor axon guidance |
InterPro families    | IPR009057 Homeodomain-like; IPR017970 Homeobox, conserved site; IPR001781 Zinc finger, LIM-type; IPR001356 Homeobox; IPR012287 Homeodomain-related |
Orthology group | MCL11104 |
Nucleotide sequence:
ATGGATGTAAGCATCGGGGGTGTGTCGTACGACTCGGGAAAATTAGTTCGGGGCGGGGGA
GCCCCTCACCCCTATTCCGGGGTTTATCAGATTCAGGGGTTCCGAGTTGAGCGTTCAATA
GCATTTGCAGTCAACGAGAATTTTCCTTGTAATCGATCGCTGGGTGTACCCGACACTAGA
GATTGGTTGAGGGGCGTGGTGGATGTTATCATTAGGGTGACGAGGACCAATCGCATGTTG
GTACTGATAAGGGGGTATGGGTACGCCGGACGCAGCGGCGCAGGGCGGCCGCAGATCATG
CTGGGCTCCATGATGTACCCCGGCGCGGAAGACGAGCTCGACATGCGCGTACCACCCATC
CAACTAGAACACCTACCTGAAGTGTTCCTATCCAGCATACCAAAATGTGGAGGCTGTCAT
GAGATGATAGTAGATCGATACGTGCTAAAAGTGTCAGACAGGACTTGGCACGCTGGCTGT
TTGAGGTGTGTCGAGTGTCGAGCCATGCTGTCAGGAAAGTGCTTCGCTAGAAATAACCAG
CTCTACTGCACCGAAGACTTCTTCAAGCGTTACGGCACTAAGTGCGCGGGCTGCGGGCAG
GGCATCCCGCCGACACAGGTCGTACGGCGGGCCCAGGCTCACGTTTACCATCTACGGTGT
TTCGCATGTGCTGCCTGTGCACGCACACTTAATACCGGAGACGAGTTTTATCTAATGGAG
GATGGCAAGCTTGTTTGCAAACCGGACTATGAAGCTGCGAGAGCGAAAGGTGAAGGTTCG
TTGGATGGAGACGCAGCTAGCAAGAGACCGCGAACTACAATTACTGCAAAACAGCTTGAA
ACTCTTAAAAGTGCTTATAGCAGCAGTCCAAAGCCAGCTCGCCACGTGAGGGAACAGCTT
GCTCAAGATACAGGCTTAGATATGAGGGTCGTTCAAGTTTGGTTTCAAAATCGGAGGGCA
AAAGAGAAGCGACTAAAGAAGGACGCGGGTCGAACAAGATGGTCACAATACTTCAGATCT
ATGAAGGGCGGAGGAAGTGGCTCACCGCGTCACGATCGTCTTCTGGATAAAGACGAACTG
AAAATAGATTTAGACTCTTTCAGTCATCATGAGCTAAGCAACGATAGTTATAGCACTGCT
GCGCTGGGTGGTGAGGAGGGATCACCAGCCGGCGGCGCGGCGGGAGGGACTCGGTTTGGT
ACTACGCCACCATATCTTCGTGCTCATTCACCCCCCCACGCACATTATCATTATCCCCCC
GATCACCTCGTATATACCAATATCGGTCAATCAATGGGTGGCGCTGGTTTAGGCGTGGGG
GCGGGGGGTGCGTCTGATATGAGCAGCTCTTCTTCTCCTGCGGCTGGTGGTTACCCTGAC
TTTCCTCCGTCTCCGGACTCTTGGCTCGGCGAACCACATCACTATTCACCCCGAGGCTAC
CCCTAG
Protein sequence:
MDVSIGGVSYDSGKLVRGGGAPHPYSGVYQIQGFRVERSIAFAVNENFPCNRSLGVPDTR
DWLRGVVDVIIRVTRTNRMLVLIRGYGYAGRSGAGRPQIMLGSMMYPGAEDELDMRVPPI
QLEHLPEVFLSSIPKCGGCHEMIVDRYVLKVSDRTWHAGCLRCVECRAMLSGKCFARNNQ
LYCTEDFFKRYGTKCAGCGQGIPPTQVVRRAQAHVYHLRCFACAACARTLNTGDEFYLME
DGKLVCKPDYEAARAKGEGSLDGDAASKRPRTTITAKQLETLKSAYSSSPKPARHVREQL
AQDTGLDMRVVQVWFQNRRAKEKRLKKDAGRTRWSQYFRSMKGGGSGSPRHDRLLDKDEL
KIDLDSFSHHELSNDSYSTAALGGEEGSPAGGAAGGTRFGTTPPYLRAHSPPHAHYHYPP
DHLVYTNIGQSMGGAGLGVGAGGASDMSSSSSPAAGGYPDFPPSPDSWLGEPHHYSPRGY
P
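
The CDS length listed in the record (1446) and the protein sequence can be cross-checked against each other. The sketch below assumes the CDS includes the terminal stop codon, which appears to hold here since the nucleotide sequence ends in TAG. The helper names `expected_protein_length` and `cds_is_consistent` are hypothetical and not part of the database; the short sequences in the usage lines are toy inputs, not the record's data.

```python
# Stop codons of the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_length: int) -> int:
    """Protein length implied by a CDS length that includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


def cds_is_consistent(cds: str, protein: str) -> bool:
    """Check basic agreement between a CDS and its protein sequence.

    Whitespace and line breaks (as in the wrapped blocks above) are ignored.
    This only checks frame, start codon, stop codon, and length; it does
    not translate the sequence.
    """
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    return (
        len(cds) % 3 == 0
        and cds.startswith("ATG")
        and cds[-3:] in STOP_CODONS
        and len(cds) // 3 - 1 == len(protein)
    )


# Length arithmetic for this record: 1446 nt -> 482 codons -> 481 residues.
print(expected_protein_length(1446))

# Toy example (not the record's sequence): 4 codons, 3 residues plus stop.
print(cds_is_consistent("ATGGAT\nGTATAG", "MDV"))
```

To check the full record, the nucleotide and protein blocks can be pasted in as multi-line strings and passed to `cds_is_consistent`.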