DPGLEAN22174 in OGS1.0

New model in OGS2.0  DPOGS208770
Genomic Position  scaffold2102:+ 19122-24551
CDS Length  1542
Paired RNAseq reads  1525
Single RNAseq reads  3788
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007631 (0.0)
Best Drosophila hit  CG2875, isoform A (8e-41)
Best Human hit  nucleolar complex protein 4 homolog (2e-30)
Best NR hit (blastp)  PREDICTED: similar to CG2875-PB, isoform B [Apis mellifera] (7e-112)
Best NR hit (blastx)  PREDICTED: similar to CG2875-PB, isoform B [Apis mellifera] (8e-97)
GeneOntology terms
GO:0005634 nucleus
GO:0031965 nuclear membrane
GO:0005730 nucleolus
GO:0016020 membrane
GO:0005515 protein binding
GO:0016021 integral to membrane
InterPro families  IPR005612 CCAAT-binding factor
Orthology group  MCL12202

Nucleotide sequence:

ATGGCTGCTGTGCAACAAAATTTATCTAAAATGATTTCTGCACAGTTGCGAAATAAAGCA
AATGAGTTTCTAAACTCTAGAAAAAATGCTAATAATCTAGCCGACATTCTTCAAATGTTT
GAGGCTGAAACGGAAAATTACACCCCGTTGCTTTTAACAATTGAAGTGATCTTTACGGAG
CTATTAAAACGAGGTGATTTAGTACAACATATTGAACCCTTAAAACCAATAGACCGTAGT
CCTGAGGCAGAATATACAAGATGGCTCAACGAGTGTTATGAGACCGCTCTCTCCCGTGTA
TTGGAATGTATTCGACGCGGACGCACCAGCTCTCGCCTTCAGGCCCTAGTCACATCTTGC
AAATTGATGCAAGCTGAGGGAAAATATCCTTTAGAACACACCAGTGGTTACTTCTTCCCC
TCTGTTAGATTGAAGAATATATTTTTGGTACTCCTGGATTCTGAAATTTCCATGTCAGCA
CCAATAGCTCGCTTCCAGGAGTTCACAGAGTACAGAGACGTGCAGCAACATGGCCTTAAA
GTACTGTCGACACTGGCTTGTCATAAATCTCCATCTCAGACGTACATGCAAAATTATCTA
GAGTTGTTCGACAAGTTGTTGGCATCGGAAATACCAGCAGAAGTGAGGAAGACAAAAGAT
AAGATCGGTGAAGAAGATTTCAAAGTACTGTGCGCTAATGAAGGCAAGCCGTCTTTCCCA
TACAACACATCCGTGTGTCGTCGTTATGCCAACCGTTGCTGGGGTTTCTCCTGTCAGTGG
CCCTTATGTGAGTCCCCTCGGTCTCATCGCCGCGCCCTAGTGCTTCTCGTTGAGAAACTG
ATGCCGCTACTGAACAAACCTCACCTGGCCACCGACATGCTCTGTGACAGCCTGGACGCG
GGTGGTCCTATCAGCATGTTGGCTCTGCAGGGCATGCTGGAGCTGGTCCGTCACCACAAC
ATCGACTACCCGGACATGTACGACCGCCTGTACGCCATGTTCGAGCCGGAGATGTTCGCC
ACCAGATACAAGAAACGCCTCATCCACCTCGCCGATATATTCCTGAGTTCCACTCACCTG
CCCGAGAGTCTGGTGGCAGCCTTCGCTAAGCGTCTGTCTCGCCTGGCGTTGGTGGCGTCT
CCCGAAGACGCCATGGGACTGTTGCAACTGGTGGGGAACCTTCTACTGAGACATACTGCA
CTGAAACGAATGATTTGTTGCGAGGACACGCCCGCTGTCATGTCTAACGACCCCTACGTG
ATGGAGGAGTCTTCTGCGTCGCGGTCCAGAGCCCTGGGTTCGTCTTTATGGGAGGTGCGA
GCCTTGACGCGGCACTGGCAGCCCACGCTGGCCACCGTCGCCAGACAGGTCACTGACCCT
GACAGGCGAGCCCCCATCGACATCGATCATGCTGGAGAAGAGATGTTCGATGCGGAACTA
AAGAAGAGGTTCAAGACGATAGAAGTGAACTTCATACGTCCTCAGAGTATGTCGCTGCCG
TCCGGGGAGAGACTCGCGCAGTACTGGGAGATAATGGCCTGA

Protein sequence:

MAAVQQNLSKMISAQLRNKANEFLNSRKNANNLADILQMFEAETENYTPLLLTIEVIFTE
LLKRGDLVQHIEPLKPIDRSPEAEYTRWLNECYETALSRVLECIRRGRTSSRLQALVTSC
KLMQAEGKYPLEHTSGYFFPSVRLKNIFLVLLDSEISMSAPIARFQEFTEYRDVQQHGLK
VLSTLACHKSPSQTYMQNYLELFDKLLASEIPAEVRKTKDKIGEEDFKVLCANEGKPSFP
YNTSVCRRYANRCWGFSCQWPLCESPRSHRRALVLLVEKLMPLLNKPHLATDMLCDSLDA
GGPISMLALQGMLELVRHHNIDYPDMYDRLYAMFEPEMFATRYKKRLIHLADIFLSSTHL
PESLVAAFAKRLSRLALVASPEDAMGLLQLVGNLLLRHTALKRMICCEDTPAVMSNDPYV
MEESSASRSRALGSSLWEVRALTRHWQPTLATVARQVTDPDRRAPIDIDHAGEEMFDAEL
KKRFKTIEVNFIRPQSMSLPSGERLAQYWEIMA
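
The two sequences above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, assuming the standard genetic code (the record does not say which translation table was used); `translate` is a hypothetical helper name, not part of any database tooling.

```python
# Standard genetic code with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene model.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds):
    """Translate a CDS string into protein, dropping one terminal stop codon.

    Whitespace and line breaks are ignored, so the sequence can be pasted
    as the 60-column block shown in the record. Any character other than
    A, C, G, or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nucleotides of the CDS give the first 10 residues of the protein.
print(translate("ATGGCTGCTGTGCAACAAAATTTATCTAAA"))  # MAAVQQNLSK

# A 1542 nt CDS holds 514 codons; minus the stop codon that is 513 residues.
print(1542 // 3 - 1)  # 513
```

Passing the full nucleotide block to `translate` and comparing the result with the protein block (after joining its lines) is a quick way to confirm that the two sequences in a record belong to the same gene model.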