New model in OGS2.0 | DPOGS210071  |
---|---|
Genomic Position | scaffold430:- 199133-212770 |
See gene structure | |
CDS Length | 2118 |
Paired RNAseq reads   | 1296 |
Single RNAseq reads   | 2989 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012712 (1e-154) |
Best Drosophila hit   | Rfx, isoform B (3e-42) |
Best Human hit | transcription factor RFX3 isoform b (2e-45) |
Best NR hit (blastp)   | PREDICTED: similar to GA19507-PA [Tribolium castaneum] (3e-122) |
Best NR hit (blastx)   | DNA-binding protein RFX2 [Xenopus laevis] (2e-48) |
GeneOntology terms    | GO:0006355 regulation of transcription, DNA-dependent GO:0010843 promoter binding GO:0030528 transcription regulator activity GO:0031018 endocrine pancreas development GO:0050796 regulation of insulin secretion GO:0060285 ciliary cell motility GO:0060287 cilium movement involved in determination of left/right asymmetry |
InterPro families    | IPR003150 DNA-binding RFX IPR011991 Winged helix-turn-helix transcription repressor DNA-binding |
Orthology group | MCL10936 |
Nucleotide sequence:
ATGGGTGAAGTAACGAAAAGTGAAGCTGTTTATGTAAATGACGGTACAAACTTAACCGTG
TATGTATACGATGTTGATGTGGAAGATTTTGAGTGCGCGGGCGACGAAGTGTTGGTGGAG
TCGTCTCCGCCCGCTTCACCAGTCATGGCGGCGCGACTAGCGGCGCCCTCACAGGGCTCG
GGCGCGGGCGGCGCCTCGCCGGACGCTGTGCGTGAGTTGATCGTTATACCGGAGCTCCCT
AATTCAATCCACCTGCAACACGCCATACAGCAGGTGTCGAGTACGGTGGTAGAAGTCAAC
GGAGACAGTTCAGGACACTCCAGTCCCACGACGGAGGCACAGCACACATACATCGTCACA
AGCGAGGGCGGGAACGGCGTCAACTATCACGTCCAGTATGTGGAGCCGCAGGAGATATAC
GCACAGACAGGACATCAGACACATATGGAGACCCTCCGCTCATATCCCGTGTACGGCGTG
GCGAGCGTGCCCGCGGACGGCGCGGTGACGGCCGTGACCGCGGTCACAGCGGTCACCGCC
TCCGATGACACCTGGCCGGCTGAGTTCACCTTCGAACAACCGGCCTCGCCAGCACCGGCA
GCTCGTATGCCGCCGGCCACCGTGCAGTGGCTGCTGGATCACTACGAGACGGCCGAAGGT
GTGTCTCTCCCGCGTTCGACGCTGTACGCCCATTACCTCCGTCACTGTTCCACACATCGC
CTGGAGCCGGTGAACGCGGCGTCTTTCGGGAAGCTCATCAGATCGGTGTTCGTGGGTCTG
AGGACCCGCCGGCTCGGGACCCGGGGAAACTCCAAGTATCACTACTACGGCATCAGGGCG
AAGCATTCCGCGCCCCGAGACCTGCCGCCCACCGTACAGAAGATAGACGAGGAACCGCAC
TCGTCAGACGAATCCCGTCCCCGTGAGCCGGAGAGTCCCGTGGGTCTGTCTGGTATCGCT
CACAGACAGTACTTGGGCTCGGTGAGCGCCCCTGACCCGCCGCCGCTGCAGCTAGACGAC
CCACCGCCAGACGTGACGCCTGAAGCGATGCAGCAGTTCAGGGATCACCACAGGCAACAC
GGGGTGGAGTTCCTCGAGGCCGTGGCGTCCCTGGACACGGGAGCTGCGGAGCGCTCTCGT
CGGTGGTTCTGGAGGCGCGTGGGCAGGAGCGGGGCCCGCCTGGCCGGTCGCAGGGACGTG
TGCACCTGGCTCAGGAGGGCCGAGCTCGAGCTGCACCAGCGAGCCGTGGACCTCCTGCTG
CCCGACGTACTCAGGCCCATACCCTCACAACTCACACAGGCCATCCGTAACTTCGCCAAG
AGCCTGGAGGGCGCGCTGTCGTCGGGGTCCTCCGGAGCCCCGGCCCCAGCGGCGCGCGCT
CAGGCGTTGGCTGCGGGGGCTCTGTCGGCCGCCCTCAGGCGCTACACCTCCCTCAACCAC
CTGGCGCAGGCCGCGCGGGCCGTCCTCAACAACCACCATCAGATCCAGCAGATGTTGTCG
GACCTGAACCGCGTGGACTTCCGCGTGGTGCGCGAGCAGGCGGCCTGGGCCTGCGCCTGT
GGCAGTGCGGCCACCGCGCACCGCCTCGAGGCTGACTTTAAAGCCCGCCTCGGTCGCGGG
TCGTCGCTGGAGTCGTGGGCGTCGTGGCTGGAGAGCTGCGTCCGCGCCGCGTTGGCCCCG
CACGAGCGCCGCGCCGACTACACGCCGCGTGCGCGACGACTGCTGCTCGACTGGTCCTTC
TACTCCTCGCTCGTCATCAGGGAACTCACGCTCAGGTCGGCGGCGTCGTTCGGGTCGTTC
CACTTGATCCGCCTGCTGTACGACGAGTACGTCTCCTTCCTCATAGAGCGGCGCGTGGCC
GAGCACCGCCAGGAGCCGCCCATAGCTGTGATGCAGCGAGCGATGGATGACGACGATGAA
CTGCCGGAGGAGGTTCCCCGCGACGACGACGACATGAACGGAGAGATGGTGGACGAGGGG
CTCGACCACGGGGAGGGAGAGGGGGACGGAGACGGCGAGGGGAACGGAGAGGAGGGGGAG
GGGGAGTGGGAGTGGGAGGACGACGACGACGAGCACGAGGAGAGGGAGCAGAAGAGGGCC
CGCCTGGACCGAGGCTAA
Protein sequence:
MGEVTKSEAVYVNDGTNLTVYVYDVDVEDFECAGDEVLVESSPPASPVMAARLAAPSQGS
GAGGASPDAVRELIVIPELPNSIHLQHAIQQVSSTVVEVNGDSSGHSSPTTEAQHTYIVT
SEGGNGVNYHVQYVEPQEIYAQTGHQTHMETLRSYPVYGVASVPADGAVTAVTAVTAVTA
SDDTWPAEFTFEQPASPAPAARMPPATVQWLLDHYETAEGVSLPRSTLYAHYLRHCSTHR
LEPVNAASFGKLIRSVFVGLRTRRLGTRGNSKYHYYGIRAKHSAPRDLPPTVQKIDEEPH
SSDESRPREPESPVGLSGIAHRQYLGSVSAPDPPPLQLDDPPPDVTPEAMQQFRDHHRQH
GVEFLEAVASLDTGAAERSRRWFWRRVGRSGARLAGRRDVCTWLRRAELELHQRAVDLLL
PDVLRPIPSQLTQAIRNFAKSLEGALSSGSSGAPAPAARAQALAAGALSAALRRYTSLNH
LAQAARAVLNNHHQIQQMLSDLNRVDFRVVREQAAWACACGSAATAHRLEADFKARLGRG
SSLESWASWLESCVRAALAPHERRADYTPRARRLLLDWSFYSSLVIRELTLRSAASFGSF
HLIRLLYDEYVSFLIERRVAEHRQEPPIAVMQRAMDDDDELPEEVPRDDDDMNGEMVDEG
LDHGEGEGDGDGEGNGEEGEGEWEWEDDDDEHEEREQKRARLDRG