New model in OGS2.0 | DPOGS201864  |
---|---|
Genomic Position | scaffold414:- 6700-15370 |
See gene structure | |
CDS Length | 3009 |
Paired RNAseq reads   | 3128 |
Single RNAseq reads   | 7482 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006039 (0.0) |
Best Drosophila hit   | NFAT, isoform A (3e-102) |
Best Human hit | nuclear factor of activated T-cells 5 isoform b (2e-71) |
Best NR hit (blastp)   | PREDICTED: similar to NFAT CG11172-PA [Apis mellifera] (3e-127) |
Best NR hit (blastx)   | conserved hypothetical protein [Pediculus humanus corporis] (7e-114) |
GeneOntology terms    | GO:0005634 nucleus GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006350 transcription GO:0003702 RNA polymerase II transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0009651 response to salt stress |
InterPro families    | IPR002909 Cell surface receptor IPT/TIG IPR011539 Rel homology IPR008967 p53-like transcription factor, DNA-binding IPR014756 Immunoglobulin E-set IPR013783 Immunoglobulin-like fold IPR008366 Nuclear factor of activated T cells (NFAT) |
Orthology group | MCL13734 |
Nucleotide sequence:
ATGAGCGCCCGCGTTCACCGGAAAGTGATGCGCGCGCCTCATAAGCGAGCGCATCCGGGG
AAAATGCTCCACGCCGGGAAACTGGTGCACCCGGGGAAGGGAATCCACCCGGGCAAGTTC
GCACATGTGGGGAAGTTCGGGAAACTCGGTCACTACACGCACACGCACTCGTTGCGTCCA
CCGGAGCCCTGCGACAATAGTAACGACAGCGGTCTGGGGCCCGACCCCGTCAACAGATTG
TCGGAGGTGGCCGAGGAGTGGGAAGAGCCGGAAACAAAGAGGCGGTGCGAGAGAGAGACG
GGTGTTAAGATAGAGTGCGATGACGCTAACGATGCGTACGCGTTCGCTCTACCCGCCGCG
CCCTCGCACGCGCCCAGCGCCCCTGACACTGCGCCCGCTACACTACCGATTCCCGCAATC
AAAGCCGGTTCCAGCATTTCTAGTCCAAGCATAGCAGAGACGACTGGCAAAAGATTTCAA
AATCCAACGTATTGGTCGTACGGGAAACTGGAGAGTAGCGGCAAGCTAGCCAATAAGTAT
GCCAGTGGTGGCAAATTAGGGGGAAAATTGGGTGGTAAATTATCAGGGAAGCATGGTGGA
AAATTAGCCCAGGCATTGGCGCGACGAAGAGCGCTGGCAGCTATGACACACTCAGGGCCG
CCGGGCTTATCGGCGCCGCTCTCATGCAAATCTAGAGATGGGACGGTTGAGCTGCAAATA
CTATGTCAACCGGAGACACAGCATAGAGCGAGATATCAGACCGAAGGAAGTCGAGGAGCA
GTTAAAGACAATTCTGGGAATGGCTTCCCCGTTGTGAAACTTGTCGGTTATGACAAGCCA
GCCGTGCTTCAGGTATTCATAGGTACTGATACAGGACGCGTCGCCCCTCATATGTTTTAC
CAGGCGTGCCGCGTCTCTGGCAAGAACTCTACGCCGTGCAAAGAAAGGAAAGAAGATGGG
ACTGTTGTCATTGAAATTGACTTGGAACCGGCAAAAAACTGGCAGGTCACTTGCGACTGC
GTAGGAATTCTGAAGGAACGTAACGTGGATGTAGAGCACAGGTTCGGCGAGGCCCTGGGC
GGTGGGGCGGGCGGGGTGCACGCGGCGCGCGGGAAGAAGAAGTCCACAAGATGCCGCATG
GTCTTCCGCACCGAGATACTGGACTCCAACGGACAAACAGAGACCCTGCAAGTCTGCTCC
ACACAGATCATATGCACCCAGCCTCCCGGAGTACCGGAAGTGTGCAGGAAGTCGCTGGTG
TCGTGTCCGGTGACGGGCGGCTTGGAGTTGTACCTGCTGGGGAAGAACTTCCTGAAGGAG
ACCAGAGTGGTGTTCCGCGTCAAACAAGACGGAGTCACCTGGGAGGAGGAGGTCGTTCCG
GACAAGGAGTTCTTGCAACAGACCCACCTGGTGTGCTGCGTGCCGGCGTACTCGCGCCCC
GACATCCAGGAGGCGGTGTGCGTCCAGTTGTTCGTGCGCTCGGGCGGCAAGTCCTCCGAG
CCGCACGCCTTCTACTACACGCCGGCCGGGGCGCGCGCGATGCACTGCACCCAGCACCCC
GCGCACACCCCCCTCACCCCCCACACCCCTCACACAGGTGAGGCTGCTCTCATGCCGCCG
CCTCTCGCCCCCCTCGCGCCTCTCCCGCCCGCTCGCCGGACCTCTCTGCTGCACGACCCT
CACTCTCCGCTGGGACTCAAGAGCGAGGTGGACGAGTCCAGCCAACACTCGCTGCTGGAG
GGCGAGCGCTCGGAGCCCGACGCGCCGCTCGCCGATGACATCATGGACCTGCGGCTCAAG
TCGGAGACCATCACCTGCGACTCGCAGACACAGGTGGGTTTCGTGAGCGGTTACGACTCT
ATCAAGCTGTCTCCGAACACTACGTCGCGGGACGAGTCCCCGTCCGTGATCGCGTCTTTC
ACACAGCAGCTGCAGGCCATACAGAACCAGGTGCAGACGGACAAGATGGTCGAGTCCGTG
ACCGCCGCTATCTTCAACTCTGACAACGCCGGCCAGATGTACGAGCAGCCGCTGCTTCCC
ATCAACACCATGGACACCATGCAGCGGATCATGTCTGCCAAGACCGCCATGGACCCGCTC
GACCGAGACATGAAAGTACTGAACTCAGACCTGATGATGACCGGCGACCCCATGCAGACC
AGTGTACTGCAGACTGCCGGTGAACAGCGACTCATGGTGTACGATCAAGTGCCGCCAGCC
CGCGAGGACGGCTTTAACCCTTTCGGAGCCATCGGCAAGATGGAGGCGACTCAGATAAAA
CAACGGCTGGCGCAGCAAACCGCGCACATGGACGCGCTCGTGGAGGACGCCATGCGCTCC
GCCGGCGCCACCATCATGCCCGGCGACGCCGCCAAGCTCGACGAGCTGGTGAACTCGCGA
GTAGAAGACCACCTGGGCGGCACGGGGACCTCGCCTTCCGGCGCTTCGCACGCTTCCGAC
GTGCTGCTTAGTCCCGGGGCGGCGGTCGTGCCCCGCACCTCCGACCTCCTGCTGCCGCTG
GCCGCCACCACCATGTCCCCGGACGTGATCCTCGATCCTCAAGTGTCCCCCTCGATGCTA
TGCGACTCCTCGCAGCGTATCGTGCTCCCGCCGCGGTCGCAGGACGAGCTGATGATGATG
CCGGACATCCCGTCGTCCGTGAAGACGCCGCCGGCCGCCGTCAAGTCTATGATCCTGAAC
GCGGCCGCCGAGATTCTGACCTCGGACCGCGCCATGAACGCGCTCGTCACTTCCGCCATC
AACACAGCCAACATGGCGGCGGCGGACGCGGCGCCGGCGGACGAGCCCGCGGCCGCCATG
TCGCAGGCCGTGTCGCAGGCCGTCACTCAGGCGGTGTCGCAGGCCGTGTCGCAGGCGGTG
TCGCAGGCCGTGTCGCAGGAGATGACTGCGCCCGTGCAGGGCCTCACGGACATGAGCGAC
CAGGACCTGCTGTCCTACATCAACCCCAGCACCTTCGACCAGGGTGAGTACTACATCGAC
CTGTTCTAG
Protein sequence:
MSARVHRKVMRAPHKRAHPGKMLHAGKLVHPGKGIHPGKFAHVGKFGKLGHYTHTHSLRP
PEPCDNSNDSGLGPDPVNRLSEVAEEWEEPETKRRCERETGVKIECDDANDAYAFALPAA
PSHAPSAPDTAPATLPIPAIKAGSSISSPSIAETTGKRFQNPTYWSYGKLESSGKLANKY
ASGGKLGGKLGGKLSGKHGGKLAQALARRRALAAMTHSGPPGLSAPLSCKSRDGTVELQI
LCQPETQHRARYQTEGSRGAVKDNSGNGFPVVKLVGYDKPAVLQVFIGTDTGRVAPHMFY
QACRVSGKNSTPCKERKEDGTVVIEIDLEPAKNWQVTCDCVGILKERNVDVEHRFGEALG
GGAGGVHAARGKKKSTRCRMVFRTEILDSNGQTETLQVCSTQIICTQPPGVPEVCRKSLV
SCPVTGGLELYLLGKNFLKETRVVFRVKQDGVTWEEEVVPDKEFLQQTHLVCCVPAYSRP
DIQEAVCVQLFVRSGGKSSEPHAFYYTPAGARAMHCTQHPAHTPLTPHTPHTGEAALMPP
PLAPLAPLPPARRTSLLHDPHSPLGLKSEVDESSQHSLLEGERSEPDAPLADDIMDLRLK
SETITCDSQTQVGFVSGYDSIKLSPNTTSRDESPSVIASFTQQLQAIQNQVQTDKMVESV
TAAIFNSDNAGQMYEQPLLPINTMDTMQRIMSAKTAMDPLDRDMKVLNSDLMMTGDPMQT
SVLQTAGEQRLMVYDQVPPAREDGFNPFGAIGKMEATQIKQRLAQQTAHMDALVEDAMRS
AGATIMPGDAAKLDELVNSRVEDHLGGTGTSPSGASHASDVLLSPGAAVVPRTSDLLLPL
AATTMSPDVILDPQVSPSMLCDSSQRIVLPPRSQDELMMMPDIPSSVKTPPAAVKSMILN
AAAEILTSDRAMNALVTSAINTANMAAADAAPADEPAAAMSQAVSQAVTQAVSQAVSQAV
SQAVSQEMTAPVQGLTDMSDQDLLSYINPSTFDQGEYYIDLF