New model in OGS2.0 | DPOGS208770  |
---|---|
Genomic Position | scaffold2102:+ 19122-24551 |
See gene structure | |
CDS Length | 1542 |
Paired RNAseq reads   | 1525 |
Single RNAseq reads   | 3788 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007631 (0.0) |
Best Drosophila hit   | CG2875, isoform A (8e-41) |
Best Human hit | nucleolar complex protein 4 homolog (2e-30) |
Best NR hit (blastp)   | PREDICTED: similar to CG2875-PB, isoform B [Apis mellifera] (7e-112) |
Best NR hit (blastx)   | PREDICTED: similar to CG2875-PB, isoform B [Apis mellifera] (8e-97) |
GeneOntology terms    | GO:0005634 nucleus GO:0031965 nuclear membrane GO:0005730 nucleolus GO:0016020 membrane GO:0005515 protein binding GO:0016021 integral to membrane |
InterPro families   | IPR005612 CCAAT-binding factor |
Orthology group | MCL12202 |
Nucleotide sequence:
ATGGCTGCTGTGCAACAAAATTTATCTAAAATGATTTCTGCACAGTTGCGAAATAAAGCA
AATGAGTTTCTAAACTCTAGAAAAAATGCTAATAATCTAGCCGACATTCTTCAAATGTTT
GAGGCTGAAACGGAAAATTACACCCCGTTGCTTTTAACAATTGAAGTGATCTTTACGGAG
CTATTAAAACGAGGTGATTTAGTACAACATATTGAACCCTTAAAACCAATAGACCGTAGT
CCTGAGGCAGAATATACAAGATGGCTCAACGAGTGTTATGAGACCGCTCTCTCCCGTGTA
TTGGAATGTATTCGACGCGGACGCACCAGCTCTCGCCTTCAGGCCCTAGTCACATCTTGC
AAATTGATGCAAGCTGAGGGAAAATATCCTTTAGAACACACCAGTGGTTACTTCTTCCCC
TCTGTTAGATTGAAGAATATATTTTTGGTACTCCTGGATTCTGAAATTTCCATGTCAGCA
CCAATAGCTCGCTTCCAGGAGTTCACAGAGTACAGAGACGTGCAGCAACATGGCCTTAAA
GTACTGTCGACACTGGCTTGTCATAAATCTCCATCTCAGACGTACATGCAAAATTATCTA
GAGTTGTTCGACAAGTTGTTGGCATCGGAAATACCAGCAGAAGTGAGGAAGACAAAAGAT
AAGATCGGTGAAGAAGATTTCAAAGTACTGTGCGCTAATGAAGGCAAGCCGTCTTTCCCA
TACAACACATCCGTGTGTCGTCGTTATGCCAACCGTTGCTGGGGTTTCTCCTGTCAGTGG
CCCTTATGTGAGTCCCCTCGGTCTCATCGCCGCGCCCTAGTGCTTCTCGTTGAGAAACTG
ATGCCGCTACTGAACAAACCTCACCTGGCCACCGACATGCTCTGTGACAGCCTGGACGCG
GGTGGTCCTATCAGCATGTTGGCTCTGCAGGGCATGCTGGAGCTGGTCCGTCACCACAAC
ATCGACTACCCGGACATGTACGACCGCCTGTACGCCATGTTCGAGCCGGAGATGTTCGCC
ACCAGATACAAGAAACGCCTCATCCACCTCGCCGATATATTCCTGAGTTCCACTCACCTG
CCCGAGAGTCTGGTGGCAGCCTTCGCTAAGCGTCTGTCTCGCCTGGCGTTGGTGGCGTCT
CCCGAAGACGCCATGGGACTGTTGCAACTGGTGGGGAACCTTCTACTGAGACATACTGCA
CTGAAACGAATGATTTGTTGCGAGGACACGCCCGCTGTCATGTCTAACGACCCCTACGTG
ATGGAGGAGTCTTCTGCGTCGCGGTCCAGAGCCCTGGGTTCGTCTTTATGGGAGGTGCGA
GCCTTGACGCGGCACTGGCAGCCCACGCTGGCCACCGTCGCCAGACAGGTCACTGACCCT
GACAGGCGAGCCCCCATCGACATCGATCATGCTGGAGAAGAGATGTTCGATGCGGAACTA
AAGAAGAGGTTCAAGACGATAGAAGTGAACTTCATACGTCCTCAGAGTATGTCGCTGCCG
TCCGGGGAGAGACTCGCGCAGTACTGGGAGATAATGGCCTGA
Protein sequence:
MAAVQQNLSKMISAQLRNKANEFLNSRKNANNLADILQMFEAETENYTPLLLTIEVIFTE
LLKRGDLVQHIEPLKPIDRSPEAEYTRWLNECYETALSRVLECIRRGRTSSRLQALVTSC
KLMQAEGKYPLEHTSGYFFPSVRLKNIFLVLLDSEISMSAPIARFQEFTEYRDVQQHGLK
VLSTLACHKSPSQTYMQNYLELFDKLLASEIPAEVRKTKDKIGEEDFKVLCANEGKPSFP
YNTSVCRRYANRCWGFSCQWPLCESPRSHRRALVLLVEKLMPLLNKPHLATDMLCDSLDA
GGPISMLALQGMLELVRHHNIDYPDMYDRLYAMFEPEMFATRYKKRLIHLADIFLSSTHL
PESLVAAFAKRLSRLALVASPEDAMGLLQLVGNLLLRHTALKRMICCEDTPAVMSNDPYV
MEESSASRSRALGSSLWEVRALTRHWQPTLATVARQVTDPDRRAPIDIDHAGEEMFDAEL
KKRFKTIEVNFIRPQSMSLPSGERLAQYWEIMA