DPGLEAN15077 in OGS1.0

New model in OGS2.0DPOGS211229 
Genomic Positionscaffold525:+ 17269-49379
See gene structure
CDS Length1446
Paired RNAseq reads  35
Single RNAseq reads  119
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003208 (7e-116)
Best Drosophila hit  Lim3, isoform A (9e-107)
Best Human hitLIM/homeobox protein Lhx4 (8e-81)
Best NR hit (blastp)  lim homeobox protein [Culex quinquefasciatus] (3e-149)
Best NR hit (blastx)  lim homeobox protein [Culex quinquefasciatus] (8e-121)
GeneOntology terms






  
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0007399 nervous system development
GO:0043565 sequence-specific DNA binding
GO:0008270 zinc ion binding
GO:0008045 motor axon guidance
InterPro families



  
IPR009057 Homeodomain-like
IPR017970 Homeobox, conserved site
IPR001781 Zinc finger, LIM-type
IPR001356 Homeobox
IPR012287 Homeodomain-related
Orthology groupMCL11104

Nucleotide sequence:

ATGGATGTAAGCATCGGGGGTGTGTCGTACGACTCGGGAAAATTAGTTCGGGGCGGGGGA
GCCCCTCACCCCTATTCCGGGGTTTATCAGATTCAGGGGTTCCGAGTTGAGCGTTCAATA
GCATTTGCAGTCAACGAGAATTTTCCTTGTAATCGATCGCTGGGTGTACCCGACACTAGA
GATTGGTTGAGGGGCGTGGTGGATGTTATCATTAGGGTGACGAGGACCAATCGCATGTTG
GTACTGATAAGGGGGTATGGGTACGCCGGACGCAGCGGCGCAGGGCGGCCGCAGATCATG
CTGGGCTCCATGATGTACCCCGGCGCGGAAGACGAGCTCGACATGCGCGTACCACCCATC
CAACTAGAACACCTACCTGAAGTGTTCCTATCCAGCATACCAAAATGTGGAGGCTGTCAT
GAGATGATAGTAGATCGATACGTGCTAAAAGTGTCAGACAGGACTTGGCACGCTGGCTGT
TTGAGGTGTGTCGAGTGTCGAGCCATGCTGTCAGGAAAGTGCTTCGCTAGAAATAACCAG
CTCTACTGCACCGAAGACTTCTTCAAGCGTTACGGCACTAAGTGCGCGGGCTGCGGGCAG
GGCATCCCGCCGACACAGGTCGTACGGCGGGCCCAGGCTCACGTTTACCATCTACGGTGT
TTCGCATGTGCTGCCTGTGCACGCACACTTAATACCGGAGACGAGTTTTATCTAATGGAG
GATGGCAAGCTTGTTTGCAAACCGGACTATGAAGCTGCGAGAGCGAAAGGTGAAGGTTCG
TTGGATGGAGACGCAGCTAGCAAGAGACCGCGAACTACAATTACTGCAAAACAGCTTGAA
ACTCTTAAAAGTGCTTATAGCAGCAGTCCAAAGCCAGCTCGCCACGTGAGGGAACAGCTT
GCTCAAGATACAGGCTTAGATATGAGGGTCGTTCAAGTTTGGTTTCAAAATCGGAGGGCA
AAAGAGAAGCGACTAAAGAAGGACGCGGGTCGAACAAGATGGTCACAATACTTCAGATCT
ATGAAGGGCGGAGGAAGTGGCTCACCGCGTCACGATCGTCTTCTGGATAAAGACGAACTG
AAAATAGATTTAGACTCTTTCAGTCATCATGAGCTAAGCAACGATAGTTATAGCACTGCT
GCGCTGGGTGGTGAGGAGGGATCACCAGCCGGCGGCGCGGCGGGAGGGACTCGGTTTGGT
ACTACGCCACCATATCTTCGTGCTCATTCACCCCCCCACGCACATTATCATTATCCCCCC
GATCACCTCGTATATACCAATATCGGTCAATCAATGGGTGGCGCTGGTTTAGGCGTGGGG
GCGGGGGGTGCGTCTGATATGAGCAGCTCTTCTTCTCCTGCGGCTGGTGGTTACCCTGAC
TTTCCTCCGTCTCCGGACTCTTGGCTCGGCGAACCACATCACTATTCACCCCGAGGCTAC
CCCTAG

Protein sequence:

MDVSIGGVSYDSGKLVRGGGAPHPYSGVYQIQGFRVERSIAFAVNENFPCNRSLGVPDTR
DWLRGVVDVIIRVTRTNRMLVLIRGYGYAGRSGAGRPQIMLGSMMYPGAEDELDMRVPPI
QLEHLPEVFLSSIPKCGGCHEMIVDRYVLKVSDRTWHAGCLRCVECRAMLSGKCFARNNQ
LYCTEDFFKRYGTKCAGCGQGIPPTQVVRRAQAHVYHLRCFACAACARTLNTGDEFYLME
DGKLVCKPDYEAARAKGEGSLDGDAASKRPRTTITAKQLETLKSAYSSSPKPARHVREQL
AQDTGLDMRVVQVWFQNRRAKEKRLKKDAGRTRWSQYFRSMKGGGSGSPRHDRLLDKDEL
KIDLDSFSHHELSNDSYSTAALGGEEGSPAGGAAGGTRFGTTPPYLRAHSPPHAHYHYPP
DHLVYTNIGQSMGGAGLGVGAGGASDMSSSSSPAAGGYPDFPPSPDSWLGEPHHYSPRGY
P