New model in OGS2.0 | DPOGS202147  |
---|---|
Genomic Position | scaffold669:+ 11759-17040 |
See gene structure | |
CDS Length | 1386 |
Paired RNAseq reads   | 614 |
Single RNAseq reads   | 1513 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003421 (2e-180) |
Best Drosophila hit   | CG6751 (3e-90) |
Best Human hit | periodic tryptophan protein 1 homolog (4e-78) |
Best NR hit (blastp)   | wd-repeat protein [Aedes aegypti] (8e-115) |
Best NR hit (blastx)   | wd-repeat protein [Aedes aegypti] (2e-115) |
GeneOntology terms    | GO:0016251 general RNA polymerase II transcription factor activity GO:0005669 transcription factor TFIID complex |
InterPro families    | IPR011046 WD40 repeat-like-containing domain IPR001680 WD40 repeat IPR019781 WD40 repeat, subgroup IPR019782 WD40 repeat 2 IPR017986 WD40-repeat-containing domain IPR019775 WD40 repeat, conserved site IPR020472 G-protein beta WD-40 repeat IPR015943 WD40/YVTN repeat-like-containing domain |
Orthology group | MCL14286 |
Nucleotide sequence:
ATGGAAGAGGAGGGTACTCCCACTGTAAGTTTAGTATCATGTATGCATTTTGTGAGACGG
GGAATAGCGAAACCAGTGCCAGAAAAGATTGAATTGACAGAAAATGAATTAGAAAAAATT
ATAAAGCAGACTGCTGAAGATCTTCGTTTAACAGAAGCAGGAGATGATCAAAGTGGAGAA
GAAGATGAAGCTGCACAGAGTATCAGGGAGCCTCCAGCAAACCCCAATGATGAGTTTGAC
TTTGAACATTATGACCAAGAAGATTCGAGTAACCCTGTAGGTATAGGGACTATAGCAACT
CTACCTAACTTAGGTGATCTCAGTGAAAACATACAAATCAGAACAGAAGGTCCAGATAGT
GATGAAGAAGATGACATCATTAAGCCAGATGACAACTTATTACTTGTAGGACATGTTGAA
ACAGATGCCAGTGTCTTAGAAGTTTATATTTTCAACAAAGAAGAGGGATCATTCTACGTC
CATCATGACATAATACTGCCCTGGTTTCCGCTGTGTATAGAGTGGCTCAATCATGACCCC
TCAGATCCACAACCAGGCAATCTTTGCGCTCTCGGTGGCATGGACCCAGTGATACAAGTG
TGGGATTTGGATATTGAAAACTGTTTGGAACCGGCTTTCAAGCTCGGCAGGAAACCAAAT
AAGAAGAAAAAGACAAAAAGAATTGGTCACAAGGATGCTGTTCTGGATCTGTCTTGGAAC
ACGAACTTTTCTCACGTCTTAGCGAGTGGCTCGGCGGACAACACTGTACTACTGTGGGAT
CTCGATCAAGGCTTACCACACACTAAACTAACCTACTTCGAAGACAAAGTCCAATCACTA
TCGTTCCACCCCCTGGAAGCCCAGACCCTCCTGTCTGGTTGTTGTGACGGCCGAGCGCGT
GTGTCGGACTGTCGGGACGAGGCCGCCTTCCGCACGTGGGTGCTCCCCACTGAGATAGAG
CGAGTGCACTGGGATAGGAACCAACCGTTCTGTTTCGCGATGAGCAACAATATCGGTAAA
GTGGCGTACGTGGACGTCAGACAGGAAGAACCGTTGTGGACCATCGACGCTCATCAGAAG
GAAGTCACAGGACTCATTTTAAGTGAAAAGGTTCCAGGGCTGATGATAACTGTCGGCTCG
GATGAAAAACTCAAATGCTGGGATATCACGGGCCCTACTCCGCTACAAATAAACGAGCGC
ACCAACAGGGTCGGACAGGCCTTATGCGCCGCTCAGTGCCCGGAGGCGCCGTTCGCCGTA
GCGGTGGGCGGAGACAACAAAGAGTGCTACATCGAAATGGTAGACCTCAGCAACAACGAT
GAAGTTATGAACCGTTTCGGCCAGCGCGTCACGACCGAATCCAACGCTGAAGCTATGGAC
GCGTAA
Protein sequence:
MEEEGTPTVSLVSCMHFVRRGIAKPVPEKIELTENELEKIIKQTAEDLRLTEAGDDQSGE
EDEAAQSIREPPANPNDEFDFEHYDQEDSSNPVGIGTIATLPNLGDLSENIQIRTEGPDS
DEEDDIIKPDDNLLLVGHVETDASVLEVYIFNKEEGSFYVHHDIILPWFPLCIEWLNHDP
SDPQPGNLCALGGMDPVIQVWDLDIENCLEPAFKLGRKPNKKKKTKRIGHKDAVLDLSWN
TNFSHVLASGSADNTVLLWDLDQGLPHTKLTYFEDKVQSLSFHPLEAQTLLSGCCDGRAR
VSDCRDEAAFRTWVLPTEIERVHWDRNQPFCFAMSNNIGKVAYVDVRQEEPLWTIDAHQK
EVTGLILSEKVPGLMITVGSDEKLKCWDITGPTPLQINERTNRVGQALCAAQCPEAPFAV
AVGGDNKECYIEMVDLSNNDEVMNRFGQRVTTESNAEAMDA