New model in OGS2.0 | DPOGS215209  |
---|---|
Genomic Position | scaffold3249:+ 12086-18159 |
See gene structure | |
CDS Length | 2340 |
Paired RNAseq reads   | 1096 |
Single RNAseq reads   | 2503 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008716 (2e-167) |
Best Drosophila hit   | CG1234 (1e-85) |
Best Human hit | nucleolar complex protein 3 homolog (9e-55) |
Best NR hit (blastp)   | PREDICTED: similar to CG1234 CG1234-PA [Tribolium castaneum] (1e-115) |
Best NR hit (blastx)   | GE25803 [Drosophila yakuba] (2e-89) |
GeneOntology terms    | GO:0008150 biological_process GO:0005488 binding |
InterPro families    | IPR011501 Nucleolar complex-associated IPR005612 CCAAT-binding factor |
Orthology group | MCL15823 |
Nucleotide sequence:
ATGAAAAAACAGGGTAAATTAAAGCTCCAGAGGCACAGGACGAAGGTGCAAAAACAGAAA
CAGCCGGTGAAGGAGACACCAGAATATAGCAGTGACAGTGAGTCCTCGAGTAACGAGTGG
GCGGACATGTTGGATGAGGAGGAACAGCAATACATAGTGAATAGACTGGCCAAGCAACCC
AATCTGCTGTCAAACATCCCAGAAAAAGAAGACAATAAGAGAGGATCTAAACGCAAGAAA
GACAAAGAACGCTTACCCAAAGTGAAGAAGCCAAGAACTGAAAATAGTATGGATGACAAC
ATTGGATCGTCCGATGACTCTGACAGTGAAGTGGAAGTCAAGTATGAGAAGGACCTGGCA
CAGCGGCCAGTGAAGAAGATGAGATCACTGCTACCCATTAAAACCAAGGATGGCCTGGTG
GAGAGAACTGAGGAGTGTGACGACTCAGATACGGAGTCGGCAGAGGAAGATCAGGCAGGT
GTCGATGATGAAGTAGAAAAGGCAGCTGTTGAATCAGGCTCGGAACATGATTCAGATGAA
GAGACGATGGAAAAGTCAGAAGATAACGAGGAGAAAGAGATTACTACTGTTGAACTGCTA
GCAGCCAGACGGGACCGGCTGAGGCATGAGAAGTTGAGGATCGGAGCACTCTGCTCGTCT
CTGTTGGAGAGCCCCGAAAAGAAGCTTAAAAACCTCTATCCAATCTTGTATCTGATGGAT
GAGCACTTGAAAGATGGCACCGCCAACCTGGTGTCAGTCCGTAAGCTGGCCTCCTTGTCT
GCTGCGGAGGTGTTCAGAGACATCCTGCCCGAGTACAAGCTCCGGCATCAGGACTACAGC
GACGTTAAATTGAAGAAAGATACTCTGTCACTATACAAATATGAAAAGGAACTGCTGGAG
TTCTACAAGAGATACCTCCAGAGATTGGAGAAAGCAGCCAGCGTACTGAGACCCAAGAAA
GGAGATAAACGTAAACCCGATGACCCTCGGGTCAGTCTTGGCCTGCTGTCGATCAAGTGT
ATGTGTACCTTGTTAGTGGCGAGACCTAACTTCAACTACGCCACCAATATAGCTCAGAGT
GTGATCCCGTTCCTTGACACCACACCGGAGGCAAGAAACACTGTCACCGAAGCCTGCACC
GACGTGTTCAAGGAGGACAATAAAGGAGAGATCACATTGGCTATAGTTCGTCTAATAAAC
CAGCTCGCTAAACGGCGAGGCTCCCGCCTGAACCCCGCGGCCTTGGACTGTTTGTTACAG
CTGAAGATACAGGACGTGGAGCTGGACGAGGAACACGACCTCAAGATGAAGAAGAAAACG
GAAGAGAAGAAGAAGAAGAGGATCGTCAATCTGTCCAAGAAGGAGAAGAAGAGAGCTAAG
AAGTTGAAAGAGGTCGAACGAGAACTTCTGGAAACAGAAGCCAAGGAGAGCGAGTCGGCG
AGGAGGAAACAGCTGACGGAAGTTACGAAAACCGTCTTCCATATATACTTCCGGTTACTG
AAGAGCTCTCCCACCACTGGCCTACTCATCGCGGCTTTGAACGGGATCGCCAAGTTCACG
CACGTCATCAACCTGGAGTACTACTCGGACCTGGTGTCTATCTTGTCTGGTCTGGTGCGA
TCATCCGCCGACCGCTCCACCCGGCTGGTGGTGGTGGGGACCGTGCTGGCTGTCCTGGCC
AAGGCCGGAGACGCTCTCAACGTGGACCCCGCCGTCTTCCACACACACCTCTATCAGGAC
ATGCTGGCCGTACACGCTGGTTGTTCGCGGTCGGAGACGGCCACGGTGGTGGAGTGTGCG
AGCGCGGTATGCCAGAGACTGAGGAGGGTGTCGTGCGGCGTGCTGAGGGCGCTGGCCAAG
AGATTGCTGACCATGTCCTCGCACGGACAACACCACGCCGCGCTCGCCTCACTCGCGCTC
GTGTGGACCATCATGCAGCACAACAAGCACGTGTCCGCCTTGTTCGAGGCGGAGTGTGCG
GGCTCGGGTAGGTTCGACCCGCTGTCGCCGTCCCCCGAGCACTGCGGATCGCATGCCGCC
CTCGGGTACGAGGGACCCATCCTGACCTCCCACTACCACCCGGTGGTGCGCGCCGCCGCC
GCCGCGCTTCTCCAGAGAGGAGGCTGGCCTCAGGAGTTACACGGACTCACCCCGATGCAG
ATCTTCGATCAGTACGACTGTTCCCAGCTGTGTTTCAAGCCGGCCATTCCTCCGCCCAAG
CCCCGCGACGTCACCAAGCCGAAGACCTCCCAGACCTGGGCCAGGCCGGACTTCAAGACC
TACTGCGAGACCGTAGAAGACAATCTAAATATGGACATCGACAACTTCATAGAAAGATGA
Protein sequence:
MKKQGKLKLQRHRTKVQKQKQPVKETPEYSSDSESSSNEWADMLDEEEQQYIVNRLAKQP
NLLSNIPEKEDNKRGSKRKKDKERLPKVKKPRTENSMDDNIGSSDDSDSEVEVKYEKDLA
QRPVKKMRSLLPIKTKDGLVERTEECDDSDTESAEEDQAGVDDEVEKAAVESGSEHDSDE
ETMEKSEDNEEKEITTVELLAARRDRLRHEKLRIGALCSSLLESPEKKLKNLYPILYLMD
EHLKDGTANLVSVRKLASLSAAEVFRDILPEYKLRHQDYSDVKLKKDTLSLYKYEKELLE
FYKRYLQRLEKAASVLRPKKGDKRKPDDPRVSLGLLSIKCMCTLLVARPNFNYATNIAQS
VIPFLDTTPEARNTVTEACTDVFKEDNKGEITLAIVRLINQLAKRRGSRLNPAALDCLLQ
LKIQDVELDEEHDLKMKKKTEEKKKKRIVNLSKKEKKRAKKLKEVERELLETEAKESESA
RRKQLTEVTKTVFHIYFRLLKSSPTTGLLIAALNGIAKFTHVINLEYYSDLVSILSGLVR
SSADRSTRLVVVGTVLAVLAKAGDALNVDPAVFHTHLYQDMLAVHAGCSRSETATVVECA
SAVCQRLRRVSCGVLRALAKRLLTMSSHGQHHAALASLALVWTIMQHNKHVSALFEAECA
GSGRFDPLSPSPEHCGSHAALGYEGPILTSHYHPVVRAAAAALLQRGGWPQELHGLTPMQ
IFDQYDCSQLCFKPAIPPPKPRDVTKPKTSQTWARPDFKTYCETVEDNLNMDIDNFIER