DPGLEAN10290 in OGS1.0

New model in OGS2.0  DPOGS212336
Genomic Position  scaffold101:- 365616-373753
CDS Length  1107
Paired RNAseq reads  119
Single RNAseq reads  434
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009688 (7e-17)
Best Drosophila hit  knirps-like (5e-49)
Best Human hit  nuclear receptor ROR-beta (8e-16)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC003413 [Tribolium castaneum] (9e-68)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL011231 [Aedes aegypti] (7e-57)
GeneOntology terms
GO:0004879 ligand-dependent nuclear receptor activity
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0007427 epithelial cell migration, open tracheal system
GO:0035151 regulation of tube size, open tracheal system
GO:0046845 branched duct epithelial cell fate determination, open tracheal system
GO:0007088 regulation of mitosis
GO:0007424 open tracheal system development
GO:0043565 sequence-specific DNA binding
GO:0008270 zinc ion binding
GO:0006355 regulation of transcription, DNA-dependent
InterPro families
IPR001628 Zinc finger, nuclear hormone receptor-type
IPR013088 Zinc finger, NHR/GATA-type
IPR008946 Nuclear hormone receptor, ligand-binding
Orthology group  MCL17369

Nucleotide sequence:

ATGAATCAGAAGTGCAAAGTCTGTGGAGAACCGGCTGCCGGCTTTCATTTCGGAGCTTTC
ACCTGCGAGGGATGCAAGTCATTCTTCGGACGCTCCTACAATAATCTCAACTCCATTACG
GAATGCAAAAACAACGGCGAGTGTGTTATCAACAAGAAGAATAGGACTGCATGTAAAGCT
TGCCGACTTCGAAAATGTTTGATGGTGGGCATGTCAAAGTCTGGTTCCAGATACGGAAGA
CGTTCCAATTGGTTTAAGATTCACTGTCTTCTTCAAGAACAACAGCAAGCAGCTCAATCA
CAGTCACCCCCTAGAATGCCACAATCGCCGCACATGGCGCCTCCCTTTCCACCTCATCTG
TTCCCTGGACTAGCGAGACCACGGTCAAAAGAAGAACTCGCTCTTTTGAGCCTCGATGAT
TACAAGATGCCCTGCTCTGGATCCCCAGATTCCCACCGAAGCGGGTCCTCACCTAAATTA
GATGAGAAATCTAGGCTCACACAATCCCGGCCGCCTGACAGACCTCTGACACCGCCCAGA
GACGCTTTTCTCCATCTTCCTCTAGCCAATATATCCTTGCCACACTTCCCGCATTCGCCG
TTTCTACCGCCACACCATTTCAATACATTCCCTCCGAACCATCCACTATTATTTCCACCT
GGTTTCCATCCGATTTATTCTAGACATTTACTGGATCATGCGGCACTCAGACAGGCCGCT
GAAAACAACAATGATGTCAGAATCGACGATAACAACACAGACTCGTCGAAGCGATTCTTT
TTGGACGAGATATTAAAGCAACAGAGATCCAACCAGCCCGCACAAGAAGATGTCATATCG
GAGGCTGAGTTCGTGCCAACACCTCCGGCGGAAAGAAGGACGTCAGAATCACCGTTACAG
GAAAACCCGATGGATCTGTCGGTGAAATCCGACGGTAGATCGAGTTCAGCGAGACGAAGG
TCCGATGATAGCGAGATAATCACCCCAGACAATGATGACCCGGAGAGTGGCAGTGATCGA
GCATCGGCCAGTGACGAAGAGGACATGGCATACTCTCAAATAAAGAGGATCAAACTCCAT
CCTCTCGACCTGACGACTAAAGTCTGA

Protein sequence:

MNQKCKVCGEPAAGFHFGAFTCEGCKSFFGRSYNNLNSITECKNNGECVINKKNRTACKA
CRLRKCLMVGMSKSGSRYGRRSNWFKIHCLLQEQQQAAQSQSPPRMPQSPHMAPPFPPHL
FPGLARPRSKEELALLSLDDYKMPCSGSPDSHRSGSSPKLDEKSRLTQSRPPDRPLTPPR
DAFLHLPLANISLPHFPHSPFLPPHHFNTFPPNHPLLFPPGFHPIYSRHLLDHAALRQAA
ENNNDVRIDDNNTDSSKRFFLDEILKQQRSNQPAQEDVISEAEFVPTPPAERRTSESPLQ
ENPMDLSVKSDGRSSSARRRSDDSEIITPDNDDPESGSDRASASDEEDMAYSQIKRIKLH
PLDLTTKV
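
Consistency check (sketch):

The nucleotide and protein listings above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. It translates the first and last lines of the nucleotide listing (the last line starts in frame because 1080 nt precede it) and checks that 1107 nt corresponds to 368 residues plus one stop codon. The residue count of 368 is taken by counting the protein listing above.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing above.
first_line = "ATGAATCAGAAGTGCAAAGTCTGTGGAGAACCGGCTGCCGGCTTTCATTTCGGAGCTTTC"
last_line = "CCTCTCGACCTGACGACTAAAGTCTGA"

print(translate(first_line))  # MNQKCKVCGEPAAGFHFGAF
print(translate(last_line))   # PLDLTTKV*

# CDS length from the record versus residues counted in the protein listing.
cds_length = 1107
protein_length = 368
print(cds_length == 3 * (protein_length + 1))  # True
```

Both translated fragments match the start and end of the protein sequence listed above, and the terminal TGA accounts for the three nucleotides not represented in the protein.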