DPGLEAN01172 in OGS1.0

New model in OGS2.0DPOGS200865 
Genomic Positionscaffold1237:+ 9163-21205
See gene structure
CDS Length2259
Paired RNAseq reads  2242
Single RNAseq reads  6247
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009854 (3e-87)
Best Drosophila hit  lethal (2) NC136 (4e-103)
Best Human hitCCR4-NOT transcription complex subunit 3 (5e-84)
Best NR hit (blastp)  PREDICTED: similar to MGC80612 protein [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to lethal (2) NC136 CG8426-PA [Apis mellifera] (7e-125)
GeneOntology terms


  
GO:0006350 transcription
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0030528 transcription regulator activity
InterPro families

  
IPR012270 CCR4-NOT complex, subunit 3/ 5
IPR007207 Not CCR4-Not complex component, N-terminal
IPR007282 NOT2/NOT3/NOT5
Orthology groupMCL13817

Nucleotide sequence:

ATGGCTGCGACAAGAAAATTACAAGGTGAAATAGACAGGTGTTTAAAAAAGGTCACGGAG
GGGGTGGAGACGTTTGAGGACATCTGGCAAAAGGTACACAATGCGACGAACAGTAATCAA
AAAGAAAAGTATGAGGCGGATCTCAAAAAGGAGATTAAAAAGCTTCAGAGGCTACGAGAT
CAGATTAAGTCATGGATCGCCTCGGGCGAAATTAAGGATAAGAGTACACTTTTAGAATAT
AGGAAACTAATAGAAACGCAAATGGAAAGGTTCAAAGTTGTGGAACGGGAAACAAAAACG
AAAGCATACTCTAAAGAAGGGCTGGGTGCGGCGCAAAAGTTGGACCCTGCCCAGAAGGAA
CGAGAGGAAATGTCATCATGGCTAATATCTTCAATAGATGCACTTAATTTACAGATTGAT
CTATTTGAGTCTGAAGTTGAGTCACTGTTAGTTGGTAAGAAGAAACGTCTGGACAAGGAG
AAACAGGATCGTATGGAGGAACTCAAGCTCAAGTTGGAAAGGCACAGGTTCCACATAAAG
AAGCTAGAAACCTTACTCCGAATGCTAGACAACATGTCCGTAGAAGTGGAACAGATAAAG
AGAATAAAAGAAGATGTTGAGTACTACATAGTATCATCGTTAGAGCCAGGGTACGAAGAG
AATGACTACATCTACGAAGACATTAATGGCCTGGACGAGATCGAGCTCAGTGGAGTGGGA
CTGCCCTCGTCGGCTACAACGGATAGCAATAATAGTAACGATTCACCCGGTTCACCCACC
AGTATACTCTCAGGAACGAGTCCCGTGACGTCACCATCGTTAGACACACACAACCACACG
ACGGATTCCATAGACGTTGACAAAAAGAAAAAAGAAGATATTACAACTAAACCTATCAAG
CCGCTGCCGCTCCGTGCGGTGACGTGCGTCAGTCCGGCTAACGTTAGTTCCTTGCTCAAT
AACTCCGCCGCATCCAATAGCAGTATAAACAATTCTGTGACGTCGGTGACTTCGCTTTCG
GGGTCTTCGACGCCCAGCAAGCCCGCGCCGCCGTCCCCGCACCCCGCCCCGCCCGCGCCG
CATCCCGCCCAGACCCTGCCGCCGGCGACGCACCCCGCGCCCCACACACCCGCCTACCCC
GTACCCAGGGTACCCGAGGTATTGGAGAATGGTCCCGTGTCGAGCGCTGTCCTTACTCAG
CTGCCGGCGCACCCCGTGCTCGTACACGCGTCTCACCCAGTGTCTCACCCCGTGTCGCAC
CCTACATCGCACCCCGTGTCTCATCCTGTGTCACACCCTGTGTCACACCCCGTGTCACAC
CCTGCGCCGGCGCCAGCGCCGGTACCTACGTCAAAGAGTTCATCTGTAACGACGTTGTCG
TCGTCGACGGCGGTCGTCAACTCGTTGTCTCACAACACGTCCGGAGCCCCGTCGCCAGCG
CCGCCGGCGCCCTCGGCCTCTGCCCCCATCCCCGCGACAGCGACCGCGCCCCCACCAGCG
ACCGCTCTCAACGGACCCACGCTGGCCGTAGCACAGGAACACACGCAGTATGTTAACAAT
GTGAGGGCGCTGTCTCCGCCGGCGGTGAGCGGGAACACTACCGCCAACAGCATGGACAGC
GGCGTCACAGGAACCGCCTCGCTGAAGAGCATGGCCCAGGAGGCCGTGCAGAGAGCGGGG
CTCGACCACCACCACACGCAGGCGACGGGTACAGTCGGCTCGCTAACAGGAGGCACGGGC
GCCAGGCGAGGCACAGCACTCTCCCAGGCGCTCATACCGCCCATACTGGGAGTGGCGCCG
CTGGGACCACTGCCACTTAATAATGACCACCAGGTGCAGTTCCAGATGATGGAGGCGGCG
TTCTACCACATGCCGCATCCATCAGACTCGGAGCGCACCCGAGTCTACCTGCCCAGGAAT
ATTTGTCAGACACCGTTATATTACAATCAGGTGTTACTACCCCACTCAGACTCAGTAGAG
TTCTTCCAGCGGTTGTCGACGGAGACGCTGTTCTTCGTGTTCTACTACATGGAGGGGACC
AAGGCGCAGTACCTGGCGGCAAAAGCGCTCAAGAAGCAGAGCTGGCGCTTCCACACCAAG
TACATGATGTGGTTCCAGAGACACGAGGAGCCCAAGGTTATCAATGAGGAATACGAACAG
GGCACATACATTTACTTCGACTACGAGAAGTGGGGCCAGCGGAAAAAAGAAGGCTTCACG
TTCGAGTACAAGTACTTAGAAGACCGCGACCTGAACTGA

Protein sequence:

MAATRKLQGEIDRCLKKVTEGVETFEDIWQKVHNATNSNQKEKYEADLKKEIKKLQRLRD
QIKSWIASGEIKDKSTLLEYRKLIETQMERFKVVERETKTKAYSKEGLGAAQKLDPAQKE
REEMSSWLISSIDALNLQIDLFESEVESLLVGKKKRLDKEKQDRMEELKLKLERHRFHIK
KLETLLRMLDNMSVEVEQIKRIKEDVEYYIVSSLEPGYEENDYIYEDINGLDEIELSGVG
LPSSATTDSNNSNDSPGSPTSILSGTSPVTSPSLDTHNHTTDSIDVDKKKKEDITTKPIK
PLPLRAVTCVSPANVSSLLNNSAASNSSINNSVTSVTSLSGSSTPSKPAPPSPHPAPPAP
HPAQTLPPATHPAPHTPAYPVPRVPEVLENGPVSSAVLTQLPAHPVLVHASHPVSHPVSH
PTSHPVSHPVSHPVSHPVSHPAPAPAPVPTSKSSSVTTLSSSTAVVNSLSHNTSGAPSPA
PPAPSASAPIPATATAPPPATALNGPTLAVAQEHTQYVNNVRALSPPAVSGNTTANSMDS
GVTGTASLKSMAQEAVQRAGLDHHHTQATGTVGSLTGGTGARRGTALSQALIPPILGVAP
LGPLPLNNDHQVQFQMMEAAFYHMPHPSDSERTRVYLPRNICQTPLYYNQVLLPHSDSVE
FFQRLSTETLFFVFYYMEGTKAQYLAAKALKKQSWRFHTKYMMWFQRHEEPKVINEEYEQ
GTYIYFDYEKWGQRKKEGFTFEYKYLEDRDLN