DPGLEAN13721 in OGS1.0

New model in OGS2.0DPOGS210071 
Genomic Positionscaffold430:- 199133-212770
See gene structure
CDS Length2118
Paired RNAseq reads  1296
Single RNAseq reads  2989
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012712 (1e-154)
Best Drosophila hit  Rfx, isoform B (3e-42)
Best Human hittranscription factor RFX3 isoform b (2e-45)
Best NR hit (blastp)  PREDICTED: similar to GA19507-PA [Tribolium castaneum] (3e-122)
Best NR hit (blastx)  DNA-binding protein RFX2 [Xenopus laevis] (2e-48)
GeneOntology terms





  
GO:0006355 regulation of transcription, DNA-dependent
GO:0010843 promoter binding
GO:0030528 transcription regulator activity
GO:0031018 endocrine pancreas development
GO:0050796 regulation of insulin secretion
GO:0060285 ciliary cell motility
GO:0060287 cilium movement involved in determination of left/right asymmetry
InterPro families
  
IPR003150 DNA-binding RFX
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL10936

Nucleotide sequence:

ATGGGTGAAGTAACGAAAAGTGAAGCTGTTTATGTAAATGACGGTACAAACTTAACCGTG
TATGTATACGATGTTGATGTGGAAGATTTTGAGTGCGCGGGCGACGAAGTGTTGGTGGAG
TCGTCTCCGCCCGCTTCACCAGTCATGGCGGCGCGACTAGCGGCGCCCTCACAGGGCTCG
GGCGCGGGCGGCGCCTCGCCGGACGCTGTGCGTGAGTTGATCGTTATACCGGAGCTCCCT
AATTCAATCCACCTGCAACACGCCATACAGCAGGTGTCGAGTACGGTGGTAGAAGTCAAC
GGAGACAGTTCAGGACACTCCAGTCCCACGACGGAGGCACAGCACACATACATCGTCACA
AGCGAGGGCGGGAACGGCGTCAACTATCACGTCCAGTATGTGGAGCCGCAGGAGATATAC
GCACAGACAGGACATCAGACACATATGGAGACCCTCCGCTCATATCCCGTGTACGGCGTG
GCGAGCGTGCCCGCGGACGGCGCGGTGACGGCCGTGACCGCGGTCACAGCGGTCACCGCC
TCCGATGACACCTGGCCGGCTGAGTTCACCTTCGAACAACCGGCCTCGCCAGCACCGGCA
GCTCGTATGCCGCCGGCCACCGTGCAGTGGCTGCTGGATCACTACGAGACGGCCGAAGGT
GTGTCTCTCCCGCGTTCGACGCTGTACGCCCATTACCTCCGTCACTGTTCCACACATCGC
CTGGAGCCGGTGAACGCGGCGTCTTTCGGGAAGCTCATCAGATCGGTGTTCGTGGGTCTG
AGGACCCGCCGGCTCGGGACCCGGGGAAACTCCAAGTATCACTACTACGGCATCAGGGCG
AAGCATTCCGCGCCCCGAGACCTGCCGCCCACCGTACAGAAGATAGACGAGGAACCGCAC
TCGTCAGACGAATCCCGTCCCCGTGAGCCGGAGAGTCCCGTGGGTCTGTCTGGTATCGCT
CACAGACAGTACTTGGGCTCGGTGAGCGCCCCTGACCCGCCGCCGCTGCAGCTAGACGAC
CCACCGCCAGACGTGACGCCTGAAGCGATGCAGCAGTTCAGGGATCACCACAGGCAACAC
GGGGTGGAGTTCCTCGAGGCCGTGGCGTCCCTGGACACGGGAGCTGCGGAGCGCTCTCGT
CGGTGGTTCTGGAGGCGCGTGGGCAGGAGCGGGGCCCGCCTGGCCGGTCGCAGGGACGTG
TGCACCTGGCTCAGGAGGGCCGAGCTCGAGCTGCACCAGCGAGCCGTGGACCTCCTGCTG
CCCGACGTACTCAGGCCCATACCCTCACAACTCACACAGGCCATCCGTAACTTCGCCAAG
AGCCTGGAGGGCGCGCTGTCGTCGGGGTCCTCCGGAGCCCCGGCCCCAGCGGCGCGCGCT
CAGGCGTTGGCTGCGGGGGCTCTGTCGGCCGCCCTCAGGCGCTACACCTCCCTCAACCAC
CTGGCGCAGGCCGCGCGGGCCGTCCTCAACAACCACCATCAGATCCAGCAGATGTTGTCG
GACCTGAACCGCGTGGACTTCCGCGTGGTGCGCGAGCAGGCGGCCTGGGCCTGCGCCTGT
GGCAGTGCGGCCACCGCGCACCGCCTCGAGGCTGACTTTAAAGCCCGCCTCGGTCGCGGG
TCGTCGCTGGAGTCGTGGGCGTCGTGGCTGGAGAGCTGCGTCCGCGCCGCGTTGGCCCCG
CACGAGCGCCGCGCCGACTACACGCCGCGTGCGCGACGACTGCTGCTCGACTGGTCCTTC
TACTCCTCGCTCGTCATCAGGGAACTCACGCTCAGGTCGGCGGCGTCGTTCGGGTCGTTC
CACTTGATCCGCCTGCTGTACGACGAGTACGTCTCCTTCCTCATAGAGCGGCGCGTGGCC
GAGCACCGCCAGGAGCCGCCCATAGCTGTGATGCAGCGAGCGATGGATGACGACGATGAA
CTGCCGGAGGAGGTTCCCCGCGACGACGACGACATGAACGGAGAGATGGTGGACGAGGGG
CTCGACCACGGGGAGGGAGAGGGGGACGGAGACGGCGAGGGGAACGGAGAGGAGGGGGAG
GGGGAGTGGGAGTGGGAGGACGACGACGACGAGCACGAGGAGAGGGAGCAGAAGAGGGCC
CGCCTGGACCGAGGCTAA

Protein sequence:

MGEVTKSEAVYVNDGTNLTVYVYDVDVEDFECAGDEVLVESSPPASPVMAARLAAPSQGS
GAGGASPDAVRELIVIPELPNSIHLQHAIQQVSSTVVEVNGDSSGHSSPTTEAQHTYIVT
SEGGNGVNYHVQYVEPQEIYAQTGHQTHMETLRSYPVYGVASVPADGAVTAVTAVTAVTA
SDDTWPAEFTFEQPASPAPAARMPPATVQWLLDHYETAEGVSLPRSTLYAHYLRHCSTHR
LEPVNAASFGKLIRSVFVGLRTRRLGTRGNSKYHYYGIRAKHSAPRDLPPTVQKIDEEPH
SSDESRPREPESPVGLSGIAHRQYLGSVSAPDPPPLQLDDPPPDVTPEAMQQFRDHHRQH
GVEFLEAVASLDTGAAERSRRWFWRRVGRSGARLAGRRDVCTWLRRAELELHQRAVDLLL
PDVLRPIPSQLTQAIRNFAKSLEGALSSGSSGAPAPAARAQALAAGALSAALRRYTSLNH
LAQAARAVLNNHHQIQQMLSDLNRVDFRVVREQAAWACACGSAATAHRLEADFKARLGRG
SSLESWASWLESCVRAALAPHERRADYTPRARRLLLDWSFYSSLVIRELTLRSAASFGSF
HLIRLLYDEYVSFLIERRVAEHRQEPPIAVMQRAMDDDDELPEEVPRDDDDMNGEMVDEG
LDHGEGEGDGDGEGNGEEGEGEWEWEDDDDEHEEREQKRARLDRG