DPGLEAN18441 in OGS1.0

New model in OGS2.0DPOGS206305 
Genomic Positionscaffold505:- 54574-78709
See gene structure
CDS Length3396
Paired RNAseq reads  207
Single RNAseq reads  497
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014124 (0.0)
Best Drosophila hit  Calmodulin-binding transcription activator, isoform E (6e-82)
Best Human hitcalmodulin-binding transcription activator 1 isoform 1 (1e-46)
Best NR hit (blastp)  PREDICTED: similar to calmodulin-binding transcription activator [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to calmodulin-binding transcription activator [Tribolium castaneum] (0.0)
GeneOntology terms









  
GO:0005634 nucleus
GO:0003677 DNA binding
GO:0045941 positive regulation of transcription
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005516 calmodulin binding
GO:0045449 regulation of transcription
GO:0016059 deactivation of rhodopsin mediated signaling
GO:0003702 RNA polymerase II transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0009416 response to light stimulus
GO:0042803 protein homodimerization activity
InterPro families




  
IPR013783 Immunoglobulin-like fold
IPR020683 Ankyrin repeat-containing domain
IPR000048 IQ motif, EF-hand binding site
IPR014756 Immunoglobulin E-set
IPR005559 CG-1
IPR002909 Cell surface receptor IPT/TIG
Orthology groupMCL11996

Nucleotide sequence:

ATGCTTCTGTACAGCAGAAAAAAAGTCCGTTACCGACGTGATGGATACTGTTGGAAGAAG
CGGAAAGACGGAAAGACCACCCGAGAAGACCACATGAAGCTTAAAGTGCAAGGAACTGAG
TGCATCTATGGTTGTTATGTGCACTCGGCAATTCTACCTACTTTCCATCGGAGATGTTAC
TGGCTTTTGCAGAATCCCGACATAGTACTGGTCCACTATCTGAACGTGCCGTACCCAGAT
GACAACAAGTTGGCTACCGTAGCCCCGAGCCTCGCGCTCTGGGCTGATAAAAAGGAATGG
ACGAAAGATGAACTCGTCAGTCAATTGAAGCCAATGTTCTTTAGCGAAGATGAACCGGAC
GTAAATAGTGACTTGGAAATATCTGGAAAAATGTTTCAGACAACGGAGACGGTGGAAGCT
ATTGTTGGTCAGTTGATGGAGAAGCAAAGAGCAGCGAGAGCTGCCGCTTTGGCTCGCCAA
CTTGAATGCGGCTGCCCAGACTCCACCTGCCAAGATTCAAGGACATGTGCTCACCCAATG
CGGCGTATTCAAACAGCAAAGGCACCGGCTTCTGACCATCATGTTTCCTCAACTACTGGC
CCGTCACCAAGACCAATGGCTCAACCACCACGACAGTACACCAGAGACCATAGAGCTACG
ACACAATCGTCACCGTTACTGCTGTCGCTGGGTCAGATACAAGGTGGTGGGGGACTTCTC
ATATTAAATGGCACCAGCAACAGTTCTCAACAGCATTCATCATTAGTTTCACCTCTGTCT
GTTACATCATTTGTTTGCGAGGAACCTAGAGACAGGTATCGTCAACAGTACAAACCGACA
TTCGTCCTGAAAAGGGAAATACCGGATAGTCAACAAAACACATGTTTGACTAATACTGAA
TCAACGTTTGAAGTGGAGAGTCGAGTTGAAGAAAAAGTTGAAATTGAAACTTTTGATCGA
AAAATAAAGATGGAACCTAGAAGTAGAAATAATATAATAGCTAGTGCACCAGCGACGCCG
TCACGTTACCCAGACTTGGTGGAACGATTGGAAAGTAAAATTCATACAGACCATTGTGAA
GATACGCTGGTTTTGCTTGGGACTGATGCCCATTTGGAATCATCTAGTGGGTTTTTCGAT
GAAACATTGGAGCTATCTCACGAGGATATACAGAAGACATTGTCAGCGAATATGCCAACA
TGTGAATTAAATCGAAGTGGAGTGAGATCAACTGAAACCGCCAATGTGATGGTATCGGGA
ATAGATACTATGGACTTTATAGAGAGTTGTGAAGCTGTCGCTTCCCCTACACATGTGGTT
GATGATAATGTGTTTGTAAATTTAGATGCTTTTGATATGCTCGGTGACTTTCCGGAATTG
GAGGTATTGGATCCCAGCACTATATCTACTAATCCCGCGAATCTTTGTGGAAATTCTCCT
CAAACGGAGGAAAACAACGATAAAATGCAGACTGATAGTCCAAGGGAAGGTGCACTTAGC
ATTACTGACTATTCTCCTGAATGGGCGTATCCTGAAGGTGGTGTCAAGGTACTGGTAGCT
GGGCCTTGGACGGAAACCTCCGATCAGTATACCATTCTTTTTGACAACTTTCCGGTACCT
TCAATATTGGTGCAGAATGGTCTACTTAGATGTTATTGTCCAGCTCATGAGGCCGGGTTG
GCAGCATTGCAAGTAGCTCGAGCTGGTCGCGTAGTATCTGACACGGTGGTGTTCGAATAT
AAGGCAGGTCCAATGTTGGCGCCGTCCTCACCCGCTTCAGCGCCGCTGCCTTCTTTGGAT
CTTAGACGATTCTCGTTGTTGCAGCGTCTGCAGCGGCTGCACGGGCGTTTGCAACTGAAG
ACGGAACCAATGGATGATAATAACCAGATTGAAGATGTGCAGTTATATTCAAATCCAAAA
TTTGAGGATCGTCTCGTGGTTTTTTGCCAATTCTTAAGTAACCGGTCGTTCGGTAACTCC
GAAGGATTCACTACGGAGCCTGGTGAAGACAGTTCCACCATATTACATCTAGCTGCAGCT
TTGGGTTACACGAAGCTGACAACAGCTTTGTTGAGGTGGAGACAGGACGATAATAGCTTA
GCTTTGGAGAAGGAAGTTAATTTGGGAGCTAGAGATAGCGACAATTGTACCCCCTTGATG
GTAGCAAGCGCGCTCGGTCACTCAGACACCGCGTTAGTGTTGGCTCGCTGGGCGGCGGGG
ACGCGGCGGGAGGCCGGGGCTAGAGCGGCGGTTGCTGCAGCACGGCGCGGGGGGCACAGC
ACCCTCGCAGCTGCCTTAGAGAGAATACAGGGGGACTGCGTGTTCAGAAGACCGCTCAGT
TTATCTCAAAAGAATAGAGCTGGCAGTTTGGAGAGTAATTTAGTGAAACGGCCCTCCATC
GACAGCGGTATCAACATGGCTGATGCCTTTAGATCCAGTTCAGCTATAGACAAAACTGAC
ACTAATTCGTCCAGATGGGAACGAAGTATGTCGCTTCCACTGGACTCGGATACCGAGGAC
AGCTTCGGTGACATGAAACTTGGTCGCAGGATGGACCTAGCTCTATGGGAGCAGGATGAC
CGTGTCTTCACGTTGGCTGAGCAGATTATAGCCGCTATGCCGGAGAGAATTAAGAATGAG
GGTATTCTTTCGTGCGACCTGGACAGCGGCGCTTGCAGCGAGGACGTGCTGATGGTGCCG
TTGTTAGATGACGCTTCAACCTTCAGCAGCGAGTTCAGCTTTGAGTTTTGTGATAACACA
TACAGATACACTGGGGCATCTACTCCATCATCAGGCTCCGTGTCTCCAGGCTCTGCGCTG
TCTCCTCCGCCCTCTTCACCCCTCGCCCCCGCCTCTGCCACTCTACAAGAGTTCCTCAAC
ACGACGCACTTTTCCAGCTTAACTCTAAACGACCGGGAGCAGCGCGAGTTATACTCAGCG
GCGATCACGATCCAGAAGGCCTATCGTCAGTACCGCGGTAGACAGTTGCAGCGCCGGGCT
GCCGCCGCTGCAATCACCATACAGAACTGCTATCGTCGATACAAACAGTTCGCGTACTTG
AAGCAAATGCACGCAGCGGCGACGGTTATCCAACGAGGATACCGCGGATTGAGGGAGAGA
CGGCTCAATAACACCAACTACGTCAAGCGAACATACTCTCAGAGGAGACAACACCAAGCA
GCGAGGAAAATCCAACAGTTCATGAGGCAAACCAAGATCAAGTTGCAGAGAGAGCGAGCC
GCAAACGCGAAGGCGGCCCTGCGCTCCCCGGATGCCCACCAAAGCTCGTCGCAGCCCATC
ACCAGTACACCCAATAGGATCATTGACTATCTAGCACCTGAATCACCGATGAACGCAGAT
GATGACCTTCTGATCGAGCTTCTGTTTAAAATGTGA

Protein sequence:

MLLYSRKKVRYRRDGYCWKKRKDGKTTREDHMKLKVQGTECIYGCYVHSAILPTFHRRCY
WLLQNPDIVLVHYLNVPYPDDNKLATVAPSLALWADKKEWTKDELVSQLKPMFFSEDEPD
VNSDLEISGKMFQTTETVEAIVGQLMEKQRAARAAALARQLECGCPDSTCQDSRTCAHPM
RRIQTAKAPASDHHVSSTTGPSPRPMAQPPRQYTRDHRATTQSSPLLLSLGQIQGGGGLL
ILNGTSNSSQQHSSLVSPLSVTSFVCEEPRDRYRQQYKPTFVLKREIPDSQQNTCLTNTE
STFEVESRVEEKVEIETFDRKIKMEPRSRNNIIASAPATPSRYPDLVERLESKIHTDHCE
DTLVLLGTDAHLESSSGFFDETLELSHEDIQKTLSANMPTCELNRSGVRSTETANVMVSG
IDTMDFIESCEAVASPTHVVDDNVFVNLDAFDMLGDFPELEVLDPSTISTNPANLCGNSP
QTEENNDKMQTDSPREGALSITDYSPEWAYPEGGVKVLVAGPWTETSDQYTILFDNFPVP
SILVQNGLLRCYCPAHEAGLAALQVARAGRVVSDTVVFEYKAGPMLAPSSPASAPLPSLD
LRRFSLLQRLQRLHGRLQLKTEPMDDNNQIEDVQLYSNPKFEDRLVVFCQFLSNRSFGNS
EGFTTEPGEDSSTILHLAAALGYTKLTTALLRWRQDDNSLALEKEVNLGARDSDNCTPLM
VASALGHSDTALVLARWAAGTRREAGARAAVAAARRGGHSTLAAALERIQGDCVFRRPLS
LSQKNRAGSLESNLVKRPSIDSGINMADAFRSSSAIDKTDTNSSRWERSMSLPLDSDTED
SFGDMKLGRRMDLALWEQDDRVFTLAEQIIAAMPERIKNEGILSCDLDSGACSEDVLMVP
LLDDASTFSSEFSFEFCDNTYRYTGASTPSSGSVSPGSALSPPPSSPLAPASATLQEFLN
TTHFSSLTLNDREQRELYSAAITIQKAYRQYRGRQLQRRAAAAAITIQNCYRRYKQFAYL
KQMHAAATVIQRGYRGLRERRLNNTNYVKRTYSQRRQHQAARKIQQFMRQTKIKLQRERA
ANAKAALRSPDAHQSSSQPITSTPNRIIDYLAPESPMNADDDLLIELLFKM