DPGLEAN16908 in OGS1.0

New model in OGS2.0  DPOGS201864
Genomic Position  scaffold414:- 6700-15370
CDS Length  3009
Paired RNAseq reads  3128
Single RNAseq reads  7482
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006039 (0.0)
Best Drosophila hit  NFAT, isoform A (3e-102)
Best Human hit  nuclear factor of activated T-cells 5 isoform b (2e-71)
Best NR hit (blastp)  PREDICTED: similar to NFAT CG11172-PA [Apis mellifera] (3e-127)
Best NR hit (blastx)  conserved hypothetical protein [Pediculus humanus corporis] (7e-114)
GeneOntology terms
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006350 transcription
GO:0003702 RNA polymerase II transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0009651 response to salt stress
InterPro families
IPR002909 Cell surface receptor IPT/TIG
IPR011539 Rel homology
IPR008967 p53-like transcription factor, DNA-binding
IPR014756 Immunoglobulin E-set
IPR013783 Immunoglobulin-like fold
IPR008366 Nuclear factor of activated T cells (NFAT)
Orthology group  MCL13734

Nucleotide sequence:

ATGAGCGCCCGCGTTCACCGGAAAGTGATGCGCGCGCCTCATAAGCGAGCGCATCCGGGG
AAAATGCTCCACGCCGGGAAACTGGTGCACCCGGGGAAGGGAATCCACCCGGGCAAGTTC
GCACATGTGGGGAAGTTCGGGAAACTCGGTCACTACACGCACACGCACTCGTTGCGTCCA
CCGGAGCCCTGCGACAATAGTAACGACAGCGGTCTGGGGCCCGACCCCGTCAACAGATTG
TCGGAGGTGGCCGAGGAGTGGGAAGAGCCGGAAACAAAGAGGCGGTGCGAGAGAGAGACG
GGTGTTAAGATAGAGTGCGATGACGCTAACGATGCGTACGCGTTCGCTCTACCCGCCGCG
CCCTCGCACGCGCCCAGCGCCCCTGACACTGCGCCCGCTACACTACCGATTCCCGCAATC
AAAGCCGGTTCCAGCATTTCTAGTCCAAGCATAGCAGAGACGACTGGCAAAAGATTTCAA
AATCCAACGTATTGGTCGTACGGGAAACTGGAGAGTAGCGGCAAGCTAGCCAATAAGTAT
GCCAGTGGTGGCAAATTAGGGGGAAAATTGGGTGGTAAATTATCAGGGAAGCATGGTGGA
AAATTAGCCCAGGCATTGGCGCGACGAAGAGCGCTGGCAGCTATGACACACTCAGGGCCG
CCGGGCTTATCGGCGCCGCTCTCATGCAAATCTAGAGATGGGACGGTTGAGCTGCAAATA
CTATGTCAACCGGAGACACAGCATAGAGCGAGATATCAGACCGAAGGAAGTCGAGGAGCA
GTTAAAGACAATTCTGGGAATGGCTTCCCCGTTGTGAAACTTGTCGGTTATGACAAGCCA
GCCGTGCTTCAGGTATTCATAGGTACTGATACAGGACGCGTCGCCCCTCATATGTTTTAC
CAGGCGTGCCGCGTCTCTGGCAAGAACTCTACGCCGTGCAAAGAAAGGAAAGAAGATGGG
ACTGTTGTCATTGAAATTGACTTGGAACCGGCAAAAAACTGGCAGGTCACTTGCGACTGC
GTAGGAATTCTGAAGGAACGTAACGTGGATGTAGAGCACAGGTTCGGCGAGGCCCTGGGC
GGTGGGGCGGGCGGGGTGCACGCGGCGCGCGGGAAGAAGAAGTCCACAAGATGCCGCATG
GTCTTCCGCACCGAGATACTGGACTCCAACGGACAAACAGAGACCCTGCAAGTCTGCTCC
ACACAGATCATATGCACCCAGCCTCCCGGAGTACCGGAAGTGTGCAGGAAGTCGCTGGTG
TCGTGTCCGGTGACGGGCGGCTTGGAGTTGTACCTGCTGGGGAAGAACTTCCTGAAGGAG
ACCAGAGTGGTGTTCCGCGTCAAACAAGACGGAGTCACCTGGGAGGAGGAGGTCGTTCCG
GACAAGGAGTTCTTGCAACAGACCCACCTGGTGTGCTGCGTGCCGGCGTACTCGCGCCCC
GACATCCAGGAGGCGGTGTGCGTCCAGTTGTTCGTGCGCTCGGGCGGCAAGTCCTCCGAG
CCGCACGCCTTCTACTACACGCCGGCCGGGGCGCGCGCGATGCACTGCACCCAGCACCCC
GCGCACACCCCCCTCACCCCCCACACCCCTCACACAGGTGAGGCTGCTCTCATGCCGCCG
CCTCTCGCCCCCCTCGCGCCTCTCCCGCCCGCTCGCCGGACCTCTCTGCTGCACGACCCT
CACTCTCCGCTGGGACTCAAGAGCGAGGTGGACGAGTCCAGCCAACACTCGCTGCTGGAG
GGCGAGCGCTCGGAGCCCGACGCGCCGCTCGCCGATGACATCATGGACCTGCGGCTCAAG
TCGGAGACCATCACCTGCGACTCGCAGACACAGGTGGGTTTCGTGAGCGGTTACGACTCT
ATCAAGCTGTCTCCGAACACTACGTCGCGGGACGAGTCCCCGTCCGTGATCGCGTCTTTC
ACACAGCAGCTGCAGGCCATACAGAACCAGGTGCAGACGGACAAGATGGTCGAGTCCGTG
ACCGCCGCTATCTTCAACTCTGACAACGCCGGCCAGATGTACGAGCAGCCGCTGCTTCCC
ATCAACACCATGGACACCATGCAGCGGATCATGTCTGCCAAGACCGCCATGGACCCGCTC
GACCGAGACATGAAAGTACTGAACTCAGACCTGATGATGACCGGCGACCCCATGCAGACC
AGTGTACTGCAGACTGCCGGTGAACAGCGACTCATGGTGTACGATCAAGTGCCGCCAGCC
CGCGAGGACGGCTTTAACCCTTTCGGAGCCATCGGCAAGATGGAGGCGACTCAGATAAAA
CAACGGCTGGCGCAGCAAACCGCGCACATGGACGCGCTCGTGGAGGACGCCATGCGCTCC
GCCGGCGCCACCATCATGCCCGGCGACGCCGCCAAGCTCGACGAGCTGGTGAACTCGCGA
GTAGAAGACCACCTGGGCGGCACGGGGACCTCGCCTTCCGGCGCTTCGCACGCTTCCGAC
GTGCTGCTTAGTCCCGGGGCGGCGGTCGTGCCCCGCACCTCCGACCTCCTGCTGCCGCTG
GCCGCCACCACCATGTCCCCGGACGTGATCCTCGATCCTCAAGTGTCCCCCTCGATGCTA
TGCGACTCCTCGCAGCGTATCGTGCTCCCGCCGCGGTCGCAGGACGAGCTGATGATGATG
CCGGACATCCCGTCGTCCGTGAAGACGCCGCCGGCCGCCGTCAAGTCTATGATCCTGAAC
GCGGCCGCCGAGATTCTGACCTCGGACCGCGCCATGAACGCGCTCGTCACTTCCGCCATC
AACACAGCCAACATGGCGGCGGCGGACGCGGCGCCGGCGGACGAGCCCGCGGCCGCCATG
TCGCAGGCCGTGTCGCAGGCCGTCACTCAGGCGGTGTCGCAGGCCGTGTCGCAGGCGGTG
TCGCAGGCCGTGTCGCAGGAGATGACTGCGCCCGTGCAGGGCCTCACGGACATGAGCGAC
CAGGACCTGCTGTCCTACATCAACCCCAGCACCTTCGACCAGGGTGAGTACTACATCGAC
CTGTTCTAG

Protein sequence:

MSARVHRKVMRAPHKRAHPGKMLHAGKLVHPGKGIHPGKFAHVGKFGKLGHYTHTHSLRP
PEPCDNSNDSGLGPDPVNRLSEVAEEWEEPETKRRCERETGVKIECDDANDAYAFALPAA
PSHAPSAPDTAPATLPIPAIKAGSSISSPSIAETTGKRFQNPTYWSYGKLESSGKLANKY
ASGGKLGGKLGGKLSGKHGGKLAQALARRRALAAMTHSGPPGLSAPLSCKSRDGTVELQI
LCQPETQHRARYQTEGSRGAVKDNSGNGFPVVKLVGYDKPAVLQVFIGTDTGRVAPHMFY
QACRVSGKNSTPCKERKEDGTVVIEIDLEPAKNWQVTCDCVGILKERNVDVEHRFGEALG
GGAGGVHAARGKKKSTRCRMVFRTEILDSNGQTETLQVCSTQIICTQPPGVPEVCRKSLV
SCPVTGGLELYLLGKNFLKETRVVFRVKQDGVTWEEEVVPDKEFLQQTHLVCCVPAYSRP
DIQEAVCVQLFVRSGGKSSEPHAFYYTPAGARAMHCTQHPAHTPLTPHTPHTGEAALMPP
PLAPLAPLPPARRTSLLHDPHSPLGLKSEVDESSQHSLLEGERSEPDAPLADDIMDLRLK
SETITCDSQTQVGFVSGYDSIKLSPNTTSRDESPSVIASFTQQLQAIQNQVQTDKMVESV
TAAIFNSDNAGQMYEQPLLPINTMDTMQRIMSAKTAMDPLDRDMKVLNSDLMMTGDPMQT
SVLQTAGEQRLMVYDQVPPAREDGFNPFGAIGKMEATQIKQRLAQQTAHMDALVEDAMRS
AGATIMPGDAAKLDELVNSRVEDHLGGTGTSPSGASHASDVLLSPGAAVVPRTSDLLLPL
AATTMSPDVILDPQVSPSMLCDSSQRIVLPPRSQDELMMMPDIPSSVKTPPAAVKSMILN
AAAEILTSDRAMNALVTSAINTANMAAADAAPADEPAAAMSQAVSQAVTQAVSQAVSQAV
SQAVSQEMTAPVQGLTDMSDQDLLSYINPSTFDQGEYYIDLF