New model in OGS2.0 | DPOGS213148  |
---|---|
Genomic Position | scaffold566:+ 9441-14336 |
CDS Length | 1986 |
Paired RNAseq reads   | 177 |
Single RNAseq reads   | 481 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007907 (0.0) |
Best Drosophila hit   | CG7759, isoform A (1e-127) |
Best Human hit | SET and MYND domain-containing protein 4 (1e-23) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL003516 [Aedes aegypti] (1e-155) |
Best NR hit (blastx)   | hypothetical protein AaeL_AAEL003516 [Aedes aegypti] (4e-150) |
GeneOntology terms    | GO:0008270 zinc ion binding; GO:0005634 nucleus; GO:0005737 cytoplasm; GO:0016481 negative regulation of transcription; GO:0016564 transcription repressor activity |
InterPro families    | IPR019734 Tetratricopeptide repeat; IPR001214 SET domain; IPR013026 Tetratricopeptide repeat-containing; IPR011990 Tetratricopeptide-like helical |
Orthology group | MCL16398 |
Nucleotide sequence:
ATGAGTAAAGACAGCGAAGGGTTATTTAAGAATTTTCACGAAAGTATAAACAACTCGCTC
GACGACGTCGTGAGAAATAATTTCGCAAATTTAGAATCGAATGAGAAAAGAATATCGTAT
TTATGCAGCCTTCCGTGCGTGAAAAATTATGATATATCAAAAGAGATTAAAAAATTCGAA
AGTGGTGGTGAGTTTCCGGTAAAAAAAGACCTGGAAAAAGCTCTTCAGTTGAAAGATGAA
GGAAACAAGGCTGTACAGAAAGGGGATTGGGGTCGATCTTTACAATACTACAATGAAAGT
ATTATTCTCATGCCGGAAATAAAAAGCGAAGAGTTGTCGATCGTTTTAGCTAATCGCTCG
GCCGCCTTAAACCATTTGGCACAGTACGAAGACACACTGAGAGATATTCAACGTTGCCTG
GCGCTGGGTTACCCCCGACACTTAAGATATAAAGTTTATGAGAGGAGAGCTCGTTGTCTA
CTAGCTCTTAAAAGAAACCAAGAAGCTGTAACGGCATTTCAAAACACAATCACCGCATTA
GATGAAGCAAAAAATCTTGATAAGGAAAAACGGCTGAAACTGCGAACGGATGCGAAGCTT
ATGTTGGAAGTTCTGAACAAGGGCTTGGTTTTAGCTGGTAATCCAAAAGATCCAGAGCCA
TTTAAAAACACACCGCCAAAACCTAAACTTTCTGGAAAACATAATAAGCTATTCCCAGCA
GCTTCAGAAACTGTTGAAATCGCTTTTGATGAAACTAGAGGCCGATTTGCAAAAGCTGAT
AGAGATATACAAGCTGGTGAAATCTTGCTGATTGAGGAACCGCATGGTGGTGTCTTACTA
TCAGAATTTTCTAAAAGCCACTGTCAAAATTGTTTTAATAAATGTCTCATTCCCTTGCCA
TGTCCGAAATGCCCAAACGTTATTTTTTGCAGCGAAAAATGTTTAGACATTGCATTAAAA
TCTTATCACGGATACGAATGCCATATTCTTCCACTTCTATGGAAGTCTGGCTGTTCTGTA
ACTTGCCACATCGCTTTAAGGATGATAACACAAAACAGTAAAGATTATTTTATGAAAATT
ATGCAGGATTTGAAAGAAAAACCAACTGGCCCGTACAAAACTGAGGACTACCGTAACATT
TATCACCTCGTTTCACACGAAAACAAGAGGACTAAACAAGACATCTTGCATAGGACTGAA
ATGGCAATATTTCTTCTTAAGCTTTTAGAGATTAGCGGTTACTTTAATGATGATGCAGCG
TCATTTGGATGTTTGATACTGAAAAATCTTCAAGTTCTTCAATTTAATGCTCATGAAGTT
TTCGAAATACAATGTCTTAAACCAAAGGATGGTACACGTTTCCTTAAACATGAAGGTAAA
TCGGTGTTTATTGGCGGTGCCGTATATCCGACACTAGCTTTGTTTAATCATTCTTGCGAA
CCTGGTATAGTAAGATACTTCTGTGGTTCGCGAATAGTTGTCTGTGCCGTGAAAAATATA
AGAAAAGGTGAAGAGGTTGCAGAAAACTATGGACCTATATTTACAACAGTGCCCAAGGAT
AAACGACAGTCACAGCTAAAAGAGCAGTACTGGTTCGATTGCAAATGCTTGCCTTGCGAA
CAAAATTGGCCAAAATATGAAGACATGACAGAAAATTACTTGCGATTCAAATGCGATTCA
GACCAGCCATGTTCTAATGTAATACCAGTACCATATGACTGCATGGAATTTATGGTACAG
TGTGGTCTATGCCAACAGTATACAAATATTTTAAAAGGGTTAAAGTCATTACAGGATACA
GAGACGCTTTATAAACTTGGACGAGCGGCGATGGGAGAAGGAAAGTATGGAGAAGCAATT
AAAAAGTTTATCGAAACATTAAAACTCTATGATACAACATTAGCACCACCTTACAAATCT
TATTATGATTGTGTTCAAGATTTAAGAAGTTGTATGCTATCATTGGGAAATTACAGCTTT
GTTTAA
Protein sequence:
MSKDSEGLFKNFHESINNSLDDVVRNNFANLESNEKRISYLCSLPCVKNYDISKEIKKFE
SGGEFPVKKDLEKALQLKDEGNKAVQKGDWGRSLQYYNESIILMPEIKSEELSIVLANRS
AALNHLAQYEDTLRDIQRCLALGYPRHLRYKVYERRARCLLALKRNQEAVTAFQNTITAL
DEAKNLDKEKRLKLRTDAKLMLEVLNKGLVLAGNPKDPEPFKNTPPKPKLSGKHNKLFPA
ASETVEIAFDETRGRFAKADRDIQAGEILLIEEPHGGVLLSEFSKSHCQNCFNKCLIPLP
CPKCPNVIFCSEKCLDIALKSYHGYECHILPLLWKSGCSVTCHIALRMITQNSKDYFMKI
MQDLKEKPTGPYKTEDYRNIYHLVSHENKRTKQDILHRTEMAIFLLKLLEISGYFNDDAA
SFGCLILKNLQVLQFNAHEVFEIQCLKPKDGTRFLKHEGKSVFIGGAVYPTLALFNHSCE
PGIVRYFCGSRIVVCAVKNIRKGEEVAENYGPIFTTVPKDKRQSQLKEQYWFDCKCLPCE
QNWPKYEDMTENYLRFKCDSDQPCSNVIPVPYDCMEFMVQCGLCQQYTNILKGLKSLQDT
ETLYKLGRAAMGEGKYGEAIKKFIETLKLYDTTLAPPYKSYYDCVQDLRSCMLSLGNYSF
V
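
The CDS length in the table and the two sequence listings above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and a CDS that ends in a single stop codon; the helper names `translate` and `expected_protein_length` are illustrative and not part of the database record. It translates the first 60 nucleotides of the listing and computes the residue count implied by a CDS length of 1986.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First line of the nucleotide listing (60 nt, i.e. 20 codons).
first_60_nt = "ATGAGTAAAGACAGCGAAGGGTTATTTAAGAATTTTCACGAAAGTATAAACAACTCGCTC"
print(translate(first_60_nt))         # MSKDSEGLFKNFHESINNSL
print(translate("GTTTAA"))            # V (last two codons of the listing)
print(expected_protein_length(1986))  # 661
```

Passing the full nucleotide listing to `translate` and comparing the result with the protein listing would extend the same check to the whole record.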