New model in OGS2.0 | DPOGS202124  |
---|---|
Genomic Position | scaffold495:+ 154874-162513 |
See gene structure | |
CDS Length | 3570 |
Paired RNAseq reads   | 1093 |
Single RNAseq reads   | 2746 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006967 (0.0) |
Best Drosophila hit   | TBP-associated factor 2 (0.0) |
Best Human hit | transcription initiation factor TFIID subunit 2 (0.0) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC011774 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC011774 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0006355 regulation of transcription, DNA-dependent GO:0005634 nucleus GO:0006367 transcription initiation from RNA polymerase II promoter GO:0005669 transcription factor TFIID complex GO:0016251 general RNA polymerase II transcription factor activity GO:0008270 zinc ion binding GO:0008237 metallopeptidase activity |
InterPro families    | IPR014782 Peptidase M1, membrane alanine aminopeptidase, N-terminal IPR016024 Armadillo-type fold |
Orthology group | MCL12952 |
Nucleotide sequence:
ATGAAAAAAGAACGTACTGGCGACAATTGTCGCCCATTTAAATTAGCCCATCAAATTTTG
AGCTTAACAGGAATAAGCTTTGAAAGAAGAAGTGTGATAGGTTTTGTTGAGTTAACAATA
GTACCCCTAAAGGATAACTTAAGGTATATTCGCCTAAATGCCAAGCAATGTCGTATATAT
CGTGTGTGCCTAAACGACCAGTATGAAGCCAACTTTCAGTATTTTGATCCTTTCCTTGAT
ATTTGTCAAAGTGATGCCAACACGAGATCCCTCGAGGTGTTTTCTCAAAACCATTTATCA
GCTGCACAAAAGACAGATCCTGATCACAATTCTGGTGAACTCCACATACAAGTTCCAGAT
GATGCTGCCCACTTAGTCGGTGAAGGAAGGGGTTTGAGGATTGGCATTGAATTCTCTCTT
GAATCCCCACAGGGCGGGATGCACTTTGTGGTTCCAGAAGGAGAGGGAACTATGGTTGAG
AAATCAGCACATATGTTTACATACGGCCATTCAGCACGACTCTGGTTTCCTTGTGTGGAT
AGCTTTGCTGAACCTTGCACTTGGAAGCTTGAGTTTACTGTAGATGAGACATTCACAGCT
GTGTCTTGTGGGGAGTTATTAGATGTAGTGTACACACCTGATCATAGACGGAAGACATTC
CACTATGTTGTTAATACTCCAGCCTGTGCTCCGAATATTGCGCTGGCCATTGGGCCATTC
GATACCTATGTTGATCCACATATGAATGAAGTGACACATTACTGTTTGCCTCATCTCCTA
CAAATTCTCAAGAACACTGTGAGATATTTGCATGAGGCCTTTGAATTTTATGAAGAAACA
CTATCTACAAGATATCCTTACCCGTGTTATAAGCAAGTGTTTGTTGATGAGACGGAAGAT
GATGCAACGGCATACACAACAATGTCAATACTTAGCACGCATCTTCTTCATTCAATTGCC
ATCATAGATCAAACATACATCAGTAGAAAGGCCATGGCCCAAGCTGTGGCTGAACAATTC
TTTGGCTGTTTTATAACTATGCAGAACTGGTCCGATCTGTGGCTTGCCAAGGGTATACCT
GATTACTTGTGTGGTCTTTACTCTAAGAAGTGCTTTGGTAATAATGAGTACAGATATTGG
ATTCAACAGGAATTACAAGAGGTGGTGAGTTACGAAGAGCATTATGGTGGTATAGTCCTC
GATCCATGGCAGCCGCCAGCAAGCGGAGCTCGTGTTGAACCCAAGGACGTTTTCTATTTC
CCTGTCAGAAATGTACACACCATGTCCCCTAGATATATCGAGGTAATGCGAAAGAAATCC
CATCTGGTATTGCGGATGTTAGAACAACGAATAGGCCAAGAGCTGTTGTTACAAGTATTC
AATAAACAGCTTTCGTTAGCAACAAACGCAGCAAACACAAAAATCGGTAGCGGTTTGTGG
GGACATCTGCTTTTATCGACAAATTTGTTTGTCAAAGCTATATTTACTGTGACTGGCAAA
GATATGGCCGTGTTTGTAGATCAGTGGGTTAGAACGGGCGGGCATGCTAAGTTTCAATTG
ACTTCCGTTTTCAACAGGAAAAGAAATACAGTTGAATTGGAAATTCGTCAAGACAGCGTT
CATGAGCGTGGGATCAGGAAGTATGTGGGGCCTCTCTTAGTCCAACTACAAGAATTAGAT
GGAACTTTCAAACATACTTTGCAAATAGAAAATACTGTTGTAAAAGCGGATATCACGTGC
CACAGTAAGAGTAGGAGGAATAAAAAGAAGAAAATTCCATTATGCACTGGAGAGGAAGTT
GACATGGATTTATCTGCTATGGATGACTCACCAGTACTATGGATTCGGCTGGACCCAGAG
ATGTCCCTCTTACGAAGTACAGTGATATCCCAACCGGATTACCAATGGCAGTACCAATTA
CGTCACGAACGTGACGTCACAGCTCAAAGCGAGGCTATAGACGCGCTCCACAACTACCCC
GAACCAGCTACCAGGAAGGCCTTGACGGATATCATAGAGAATGAACAAACACATTATAAA
ATCCGATGCCGGGCCGCGCACTGTTTGACTAAGGTTGCTAATGCCATGATAAGCTCGTGG
GCGGGACCGCCGGCTATGTTGACGATATTCAGGAAAATGTTCGGATCATTCGCCGCACCG
CACATCATCAAACAAAATAACTTCGATAATCTACAACATTACTTTTTGCAGAAAACTATA
CCTGTTGCTATGGCCGGTTTGAGAAATATCCATGGTATATGTCCACCGGAAGTCGTAAGA
TTTCTATTGGATCTGTTCAAATATAACGATAATTCAAAGAACCACTTCTCTGACAACTAT
TACAAAGCTGCTCTAGTCGATGCGCTGGCTGCAACCATAACTCCCGTCATATCTGTTTTA
CAACCCGGTGCTCCAATAACCGCGGAATCGTTATCAGCAGACACGCGTTTGGTCCTCGAA
GAGATAACGCGCGTGTTGAATCTGGAGAAGGTGCTGCCGTGCTACAAAAACACGGTGACA
GTCAGTTGTCTACGAGCTATCAGACGTTTACAGCAGTGTGGTCACTTGCCCAGTATACCC
ACAGTGTTTAGAGCGTACGCGCAATACGGGCAGTATATAGATGTGCGTTTAGCAGCGTTT
GAGGGTCTAGTGGACTTCGTACGAGTGGACGGCAAGCCAGAGGACCTGTCATATCTGTTG
ACCGCTATAGAGAACGACCCTGACCCTGGCGTGAGGCATGGTCTGGCGCGACTCATGGTC
TCAATGCCGCCCTTCGAGAGAGCTCAGAGACATAGACTGGATACGGAATCCGTCGTTCAT
AGATTATGGAACAACATAAATAGTCAATTATCAAATGATGCAAGGTTGAGATGCGATCTC
GTCGACTTGTATTACACGCTGTATGGCCTCAAACGACCTATATGCGTGCCCTTGCCCGAA
ATTCAGGCCATGATGAAACAGATGCATCACAAGGAAAGAGAAAGACTTGACAGAGAAAGA
GAACGAGCAGAGAGGGGGAGGGAGAAGGAGAAAATAAGAGAAATGGACATAAAACCGGTT
ATCAAACAAGAAATTGAGGATATACCTGTGAAAGATGAGTTAGATATGGGTTTAGATGAG
ACGATGCAAGTAAGTGAGTTACCAGTACCGGTGTCAAGCATTAAGGAGGAAGATGACAAA
ATTGATGTCACAACAGTCCATGAGTTGCCGATCAGGGTTTACAGTGACGATTCCAAACGC
GAGTTTTCATCAGATAATGCGGTTCCTCTTCCCGGTATTCCGGGCTCGTGTGGTCCTGTG
GGCTTCGAGCCGGGAATGTTTAAACTGGAGAGAGATGACCCAGCTGCACCGAAGGCCAAA
AAGAAGAAGAAGGAGAAAAAGAAGCACAAACACAAGCACAAGCACAAACACAGCAAAGAA
AAAAGCAAAGACAAAGACAAGCTGCCGCGTCCTCCCTCCACAGACACGCTTCGTATTAAA
GAAGAGACGAGGGAAACGCTGAGTTCATTCAGCTCAAGTCAGAGCCCCTCGGAAGATATA
TCATCAATGCCATCTAATATGAGTTTCTAA
Protein sequence:
MKKERTGDNCRPFKLAHQILSLTGISFERRSVIGFVELTIVPLKDNLRYIRLNAKQCRIY
RVCLNDQYEANFQYFDPFLDICQSDANTRSLEVFSQNHLSAAQKTDPDHNSGELHIQVPD
DAAHLVGEGRGLRIGIEFSLESPQGGMHFVVPEGEGTMVEKSAHMFTYGHSARLWFPCVD
SFAEPCTWKLEFTVDETFTAVSCGELLDVVYTPDHRRKTFHYVVNTPACAPNIALAIGPF
DTYVDPHMNEVTHYCLPHLLQILKNTVRYLHEAFEFYEETLSTRYPYPCYKQVFVDETED
DATAYTTMSILSTHLLHSIAIIDQTYISRKAMAQAVAEQFFGCFITMQNWSDLWLAKGIP
DYLCGLYSKKCFGNNEYRYWIQQELQEVVSYEEHYGGIVLDPWQPPASGARVEPKDVFYF
PVRNVHTMSPRYIEVMRKKSHLVLRMLEQRIGQELLLQVFNKQLSLATNAANTKIGSGLW
GHLLLSTNLFVKAIFTVTGKDMAVFVDQWVRTGGHAKFQLTSVFNRKRNTVELEIRQDSV
HERGIRKYVGPLLVQLQELDGTFKHTLQIENTVVKADITCHSKSRRNKKKKIPLCTGEEV
DMDLSAMDDSPVLWIRLDPEMSLLRSTVISQPDYQWQYQLRHERDVTAQSEAIDALHNYP
EPATRKALTDIIENEQTHYKIRCRAAHCLTKVANAMISSWAGPPAMLTIFRKMFGSFAAP
HIIKQNNFDNLQHYFLQKTIPVAMAGLRNIHGICPPEVVRFLLDLFKYNDNSKNHFSDNY
YKAALVDALAATITPVISVLQPGAPITAESLSADTRLVLEEITRVLNLEKVLPCYKNTVT
VSCLRAIRRLQQCGHLPSIPTVFRAYAQYGQYIDVRLAAFEGLVDFVRVDGKPEDLSYLL
TAIENDPDPGVRHGLARLMVSMPPFERAQRHRLDTESVVHRLWNNINSQLSNDARLRCDL
VDLYYTLYGLKRPICVPLPEIQAMMKQMHHKERERLDRERERAERGREKEKIREMDIKPV
IKQEIEDIPVKDELDMGLDETMQVSELPVPVSSIKEEDDKIDVTTVHELPIRVYSDDSKR
EFSSDNAVPLPGIPGSCGPVGFEPGMFKLERDDPAAPKAKKKKKEKKKHKHKHKHKHSKE
KSKDKDKLPRPPSTDTLRIKEETRETLSSFSSSQSPSEDISSMPSNMSF