New model in OGS2.0 | DPOGS200865  |
---|---|
Genomic Position | scaffold1237:+ 9163-21205 |
See gene structure | |
CDS Length | 2259 |
Paired RNAseq reads   | 2242 |
Single RNAseq reads   | 6247 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009854 (3e-87) |
Best Drosophila hit   | lethal (2) NC136 (4e-103) |
Best Human hit | CCR4-NOT transcription complex subunit 3 (5e-84) |
Best NR hit (blastp)   | PREDICTED: similar to MGC80612 protein [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to lethal (2) NC136 CG8426-PA [Apis mellifera] (7e-125) |
GeneOntology terms    | GO:0006350 transcription GO:0005634 nucleus GO:0045449 regulation of transcription GO:0030528 transcription regulator activity |
InterPro families    | IPR012270 CCR4-NOT complex, subunit 3/ 5 IPR007207 Not CCR4-Not complex component, N-terminal IPR007282 NOT2/NOT3/NOT5 |
Orthology group | MCL13817 |
Nucleotide sequence:
ATGGCTGCGACAAGAAAATTACAAGGTGAAATAGACAGGTGTTTAAAAAAGGTCACGGAG
GGGGTGGAGACGTTTGAGGACATCTGGCAAAAGGTACACAATGCGACGAACAGTAATCAA
AAAGAAAAGTATGAGGCGGATCTCAAAAAGGAGATTAAAAAGCTTCAGAGGCTACGAGAT
CAGATTAAGTCATGGATCGCCTCGGGCGAAATTAAGGATAAGAGTACACTTTTAGAATAT
AGGAAACTAATAGAAACGCAAATGGAAAGGTTCAAAGTTGTGGAACGGGAAACAAAAACG
AAAGCATACTCTAAAGAAGGGCTGGGTGCGGCGCAAAAGTTGGACCCTGCCCAGAAGGAA
CGAGAGGAAATGTCATCATGGCTAATATCTTCAATAGATGCACTTAATTTACAGATTGAT
CTATTTGAGTCTGAAGTTGAGTCACTGTTAGTTGGTAAGAAGAAACGTCTGGACAAGGAG
AAACAGGATCGTATGGAGGAACTCAAGCTCAAGTTGGAAAGGCACAGGTTCCACATAAAG
AAGCTAGAAACCTTACTCCGAATGCTAGACAACATGTCCGTAGAAGTGGAACAGATAAAG
AGAATAAAAGAAGATGTTGAGTACTACATAGTATCATCGTTAGAGCCAGGGTACGAAGAG
AATGACTACATCTACGAAGACATTAATGGCCTGGACGAGATCGAGCTCAGTGGAGTGGGA
CTGCCCTCGTCGGCTACAACGGATAGCAATAATAGTAACGATTCACCCGGTTCACCCACC
AGTATACTCTCAGGAACGAGTCCCGTGACGTCACCATCGTTAGACACACACAACCACACG
ACGGATTCCATAGACGTTGACAAAAAGAAAAAAGAAGATATTACAACTAAACCTATCAAG
CCGCTGCCGCTCCGTGCGGTGACGTGCGTCAGTCCGGCTAACGTTAGTTCCTTGCTCAAT
AACTCCGCCGCATCCAATAGCAGTATAAACAATTCTGTGACGTCGGTGACTTCGCTTTCG
GGGTCTTCGACGCCCAGCAAGCCCGCGCCGCCGTCCCCGCACCCCGCCCCGCCCGCGCCG
CATCCCGCCCAGACCCTGCCGCCGGCGACGCACCCCGCGCCCCACACACCCGCCTACCCC
GTACCCAGGGTACCCGAGGTATTGGAGAATGGTCCCGTGTCGAGCGCTGTCCTTACTCAG
CTGCCGGCGCACCCCGTGCTCGTACACGCGTCTCACCCAGTGTCTCACCCCGTGTCGCAC
CCTACATCGCACCCCGTGTCTCATCCTGTGTCACACCCTGTGTCACACCCCGTGTCACAC
CCTGCGCCGGCGCCAGCGCCGGTACCTACGTCAAAGAGTTCATCTGTAACGACGTTGTCG
TCGTCGACGGCGGTCGTCAACTCGTTGTCTCACAACACGTCCGGAGCCCCGTCGCCAGCG
CCGCCGGCGCCCTCGGCCTCTGCCCCCATCCCCGCGACAGCGACCGCGCCCCCACCAGCG
ACCGCTCTCAACGGACCCACGCTGGCCGTAGCACAGGAACACACGCAGTATGTTAACAAT
GTGAGGGCGCTGTCTCCGCCGGCGGTGAGCGGGAACACTACCGCCAACAGCATGGACAGC
GGCGTCACAGGAACCGCCTCGCTGAAGAGCATGGCCCAGGAGGCCGTGCAGAGAGCGGGG
CTCGACCACCACCACACGCAGGCGACGGGTACAGTCGGCTCGCTAACAGGAGGCACGGGC
GCCAGGCGAGGCACAGCACTCTCCCAGGCGCTCATACCGCCCATACTGGGAGTGGCGCCG
CTGGGACCACTGCCACTTAATAATGACCACCAGGTGCAGTTCCAGATGATGGAGGCGGCG
TTCTACCACATGCCGCATCCATCAGACTCGGAGCGCACCCGAGTCTACCTGCCCAGGAAT
ATTTGTCAGACACCGTTATATTACAATCAGGTGTTACTACCCCACTCAGACTCAGTAGAG
TTCTTCCAGCGGTTGTCGACGGAGACGCTGTTCTTCGTGTTCTACTACATGGAGGGGACC
AAGGCGCAGTACCTGGCGGCAAAAGCGCTCAAGAAGCAGAGCTGGCGCTTCCACACCAAG
TACATGATGTGGTTCCAGAGACACGAGGAGCCCAAGGTTATCAATGAGGAATACGAACAG
GGCACATACATTTACTTCGACTACGAGAAGTGGGGCCAGCGGAAAAAAGAAGGCTTCACG
TTCGAGTACAAGTACTTAGAAGACCGCGACCTGAACTGA
Protein sequence:
MAATRKLQGEIDRCLKKVTEGVETFEDIWQKVHNATNSNQKEKYEADLKKEIKKLQRLRD
QIKSWIASGEIKDKSTLLEYRKLIETQMERFKVVERETKTKAYSKEGLGAAQKLDPAQKE
REEMSSWLISSIDALNLQIDLFESEVESLLVGKKKRLDKEKQDRMEELKLKLERHRFHIK
KLETLLRMLDNMSVEVEQIKRIKEDVEYYIVSSLEPGYEENDYIYEDINGLDEIELSGVG
LPSSATTDSNNSNDSPGSPTSILSGTSPVTSPSLDTHNHTTDSIDVDKKKKEDITTKPIK
PLPLRAVTCVSPANVSSLLNNSAASNSSINNSVTSVTSLSGSSTPSKPAPPSPHPAPPAP
HPAQTLPPATHPAPHTPAYPVPRVPEVLENGPVSSAVLTQLPAHPVLVHASHPVSHPVSH
PTSHPVSHPVSHPVSHPVSHPAPAPAPVPTSKSSSVTTLSSSTAVVNSLSHNTSGAPSPA
PPAPSASAPIPATATAPPPATALNGPTLAVAQEHTQYVNNVRALSPPAVSGNTTANSMDS
GVTGTASLKSMAQEAVQRAGLDHHHTQATGTVGSLTGGTGARRGTALSQALIPPILGVAP
LGPLPLNNDHQVQFQMMEAAFYHMPHPSDSERTRVYLPRNICQTPLYYNQVLLPHSDSVE
FFQRLSTETLFFVFYYMEGTKAQYLAAKALKKQSWRFHTKYMMWFQRHEEPKVINEEYEQ
GTYIYFDYEKWGQRKKEGFTFEYKYLEDRDLN