DPGLEAN13825 in OGS1.0

New model in OGS2.0DPOGS202124 
Genomic Positionscaffold495:+ 154874-162513
See gene structure
CDS Length3570
Paired RNAseq reads  1093
Single RNAseq reads  2746
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006967 (0.0)
Best Drosophila hit  TBP-associated factor 2 (0.0)
Best Human hittranscription initiation factor TFIID subunit 2 (0.0)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC011774 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC011774 [Tribolium castaneum] (0.0)
GeneOntology terms





  
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0006367 transcription initiation from RNA polymerase II promoter
GO:0005669 transcription factor TFIID complex
GO:0016251 general RNA polymerase II transcription factor activity
GO:0008270 zinc ion binding
GO:0008237 metallopeptidase activity
InterPro families
  
IPR014782 Peptidase M1, membrane alanine aminopeptidase, N-terminal
IPR016024 Armadillo-type fold
Orthology groupMCL12952

Nucleotide sequence:

ATGAAAAAAGAACGTACTGGCGACAATTGTCGCCCATTTAAATTAGCCCATCAAATTTTG
AGCTTAACAGGAATAAGCTTTGAAAGAAGAAGTGTGATAGGTTTTGTTGAGTTAACAATA
GTACCCCTAAAGGATAACTTAAGGTATATTCGCCTAAATGCCAAGCAATGTCGTATATAT
CGTGTGTGCCTAAACGACCAGTATGAAGCCAACTTTCAGTATTTTGATCCTTTCCTTGAT
ATTTGTCAAAGTGATGCCAACACGAGATCCCTCGAGGTGTTTTCTCAAAACCATTTATCA
GCTGCACAAAAGACAGATCCTGATCACAATTCTGGTGAACTCCACATACAAGTTCCAGAT
GATGCTGCCCACTTAGTCGGTGAAGGAAGGGGTTTGAGGATTGGCATTGAATTCTCTCTT
GAATCCCCACAGGGCGGGATGCACTTTGTGGTTCCAGAAGGAGAGGGAACTATGGTTGAG
AAATCAGCACATATGTTTACATACGGCCATTCAGCACGACTCTGGTTTCCTTGTGTGGAT
AGCTTTGCTGAACCTTGCACTTGGAAGCTTGAGTTTACTGTAGATGAGACATTCACAGCT
GTGTCTTGTGGGGAGTTATTAGATGTAGTGTACACACCTGATCATAGACGGAAGACATTC
CACTATGTTGTTAATACTCCAGCCTGTGCTCCGAATATTGCGCTGGCCATTGGGCCATTC
GATACCTATGTTGATCCACATATGAATGAAGTGACACATTACTGTTTGCCTCATCTCCTA
CAAATTCTCAAGAACACTGTGAGATATTTGCATGAGGCCTTTGAATTTTATGAAGAAACA
CTATCTACAAGATATCCTTACCCGTGTTATAAGCAAGTGTTTGTTGATGAGACGGAAGAT
GATGCAACGGCATACACAACAATGTCAATACTTAGCACGCATCTTCTTCATTCAATTGCC
ATCATAGATCAAACATACATCAGTAGAAAGGCCATGGCCCAAGCTGTGGCTGAACAATTC
TTTGGCTGTTTTATAACTATGCAGAACTGGTCCGATCTGTGGCTTGCCAAGGGTATACCT
GATTACTTGTGTGGTCTTTACTCTAAGAAGTGCTTTGGTAATAATGAGTACAGATATTGG
ATTCAACAGGAATTACAAGAGGTGGTGAGTTACGAAGAGCATTATGGTGGTATAGTCCTC
GATCCATGGCAGCCGCCAGCAAGCGGAGCTCGTGTTGAACCCAAGGACGTTTTCTATTTC
CCTGTCAGAAATGTACACACCATGTCCCCTAGATATATCGAGGTAATGCGAAAGAAATCC
CATCTGGTATTGCGGATGTTAGAACAACGAATAGGCCAAGAGCTGTTGTTACAAGTATTC
AATAAACAGCTTTCGTTAGCAACAAACGCAGCAAACACAAAAATCGGTAGCGGTTTGTGG
GGACATCTGCTTTTATCGACAAATTTGTTTGTCAAAGCTATATTTACTGTGACTGGCAAA
GATATGGCCGTGTTTGTAGATCAGTGGGTTAGAACGGGCGGGCATGCTAAGTTTCAATTG
ACTTCCGTTTTCAACAGGAAAAGAAATACAGTTGAATTGGAAATTCGTCAAGACAGCGTT
CATGAGCGTGGGATCAGGAAGTATGTGGGGCCTCTCTTAGTCCAACTACAAGAATTAGAT
GGAACTTTCAAACATACTTTGCAAATAGAAAATACTGTTGTAAAAGCGGATATCACGTGC
CACAGTAAGAGTAGGAGGAATAAAAAGAAGAAAATTCCATTATGCACTGGAGAGGAAGTT
GACATGGATTTATCTGCTATGGATGACTCACCAGTACTATGGATTCGGCTGGACCCAGAG
ATGTCCCTCTTACGAAGTACAGTGATATCCCAACCGGATTACCAATGGCAGTACCAATTA
CGTCACGAACGTGACGTCACAGCTCAAAGCGAGGCTATAGACGCGCTCCACAACTACCCC
GAACCAGCTACCAGGAAGGCCTTGACGGATATCATAGAGAATGAACAAACACATTATAAA
ATCCGATGCCGGGCCGCGCACTGTTTGACTAAGGTTGCTAATGCCATGATAAGCTCGTGG
GCGGGACCGCCGGCTATGTTGACGATATTCAGGAAAATGTTCGGATCATTCGCCGCACCG
CACATCATCAAACAAAATAACTTCGATAATCTACAACATTACTTTTTGCAGAAAACTATA
CCTGTTGCTATGGCCGGTTTGAGAAATATCCATGGTATATGTCCACCGGAAGTCGTAAGA
TTTCTATTGGATCTGTTCAAATATAACGATAATTCAAAGAACCACTTCTCTGACAACTAT
TACAAAGCTGCTCTAGTCGATGCGCTGGCTGCAACCATAACTCCCGTCATATCTGTTTTA
CAACCCGGTGCTCCAATAACCGCGGAATCGTTATCAGCAGACACGCGTTTGGTCCTCGAA
GAGATAACGCGCGTGTTGAATCTGGAGAAGGTGCTGCCGTGCTACAAAAACACGGTGACA
GTCAGTTGTCTACGAGCTATCAGACGTTTACAGCAGTGTGGTCACTTGCCCAGTATACCC
ACAGTGTTTAGAGCGTACGCGCAATACGGGCAGTATATAGATGTGCGTTTAGCAGCGTTT
GAGGGTCTAGTGGACTTCGTACGAGTGGACGGCAAGCCAGAGGACCTGTCATATCTGTTG
ACCGCTATAGAGAACGACCCTGACCCTGGCGTGAGGCATGGTCTGGCGCGACTCATGGTC
TCAATGCCGCCCTTCGAGAGAGCTCAGAGACATAGACTGGATACGGAATCCGTCGTTCAT
AGATTATGGAACAACATAAATAGTCAATTATCAAATGATGCAAGGTTGAGATGCGATCTC
GTCGACTTGTATTACACGCTGTATGGCCTCAAACGACCTATATGCGTGCCCTTGCCCGAA
ATTCAGGCCATGATGAAACAGATGCATCACAAGGAAAGAGAAAGACTTGACAGAGAAAGA
GAACGAGCAGAGAGGGGGAGGGAGAAGGAGAAAATAAGAGAAATGGACATAAAACCGGTT
ATCAAACAAGAAATTGAGGATATACCTGTGAAAGATGAGTTAGATATGGGTTTAGATGAG
ACGATGCAAGTAAGTGAGTTACCAGTACCGGTGTCAAGCATTAAGGAGGAAGATGACAAA
ATTGATGTCACAACAGTCCATGAGTTGCCGATCAGGGTTTACAGTGACGATTCCAAACGC
GAGTTTTCATCAGATAATGCGGTTCCTCTTCCCGGTATTCCGGGCTCGTGTGGTCCTGTG
GGCTTCGAGCCGGGAATGTTTAAACTGGAGAGAGATGACCCAGCTGCACCGAAGGCCAAA
AAGAAGAAGAAGGAGAAAAAGAAGCACAAACACAAGCACAAGCACAAACACAGCAAAGAA
AAAAGCAAAGACAAAGACAAGCTGCCGCGTCCTCCCTCCACAGACACGCTTCGTATTAAA
GAAGAGACGAGGGAAACGCTGAGTTCATTCAGCTCAAGTCAGAGCCCCTCGGAAGATATA
TCATCAATGCCATCTAATATGAGTTTCTAA

Protein sequence:

MKKERTGDNCRPFKLAHQILSLTGISFERRSVIGFVELTIVPLKDNLRYIRLNAKQCRIY
RVCLNDQYEANFQYFDPFLDICQSDANTRSLEVFSQNHLSAAQKTDPDHNSGELHIQVPD
DAAHLVGEGRGLRIGIEFSLESPQGGMHFVVPEGEGTMVEKSAHMFTYGHSARLWFPCVD
SFAEPCTWKLEFTVDETFTAVSCGELLDVVYTPDHRRKTFHYVVNTPACAPNIALAIGPF
DTYVDPHMNEVTHYCLPHLLQILKNTVRYLHEAFEFYEETLSTRYPYPCYKQVFVDETED
DATAYTTMSILSTHLLHSIAIIDQTYISRKAMAQAVAEQFFGCFITMQNWSDLWLAKGIP
DYLCGLYSKKCFGNNEYRYWIQQELQEVVSYEEHYGGIVLDPWQPPASGARVEPKDVFYF
PVRNVHTMSPRYIEVMRKKSHLVLRMLEQRIGQELLLQVFNKQLSLATNAANTKIGSGLW
GHLLLSTNLFVKAIFTVTGKDMAVFVDQWVRTGGHAKFQLTSVFNRKRNTVELEIRQDSV
HERGIRKYVGPLLVQLQELDGTFKHTLQIENTVVKADITCHSKSRRNKKKKIPLCTGEEV
DMDLSAMDDSPVLWIRLDPEMSLLRSTVISQPDYQWQYQLRHERDVTAQSEAIDALHNYP
EPATRKALTDIIENEQTHYKIRCRAAHCLTKVANAMISSWAGPPAMLTIFRKMFGSFAAP
HIIKQNNFDNLQHYFLQKTIPVAMAGLRNIHGICPPEVVRFLLDLFKYNDNSKNHFSDNY
YKAALVDALAATITPVISVLQPGAPITAESLSADTRLVLEEITRVLNLEKVLPCYKNTVT
VSCLRAIRRLQQCGHLPSIPTVFRAYAQYGQYIDVRLAAFEGLVDFVRVDGKPEDLSYLL
TAIENDPDPGVRHGLARLMVSMPPFERAQRHRLDTESVVHRLWNNINSQLSNDARLRCDL
VDLYYTLYGLKRPICVPLPEIQAMMKQMHHKERERLDRERERAERGREKEKIREMDIKPV
IKQEIEDIPVKDELDMGLDETMQVSELPVPVSSIKEEDDKIDVTTVHELPIRVYSDDSKR
EFSSDNAVPLPGIPGSCGPVGFEPGMFKLERDDPAAPKAKKKKKEKKKHKHKHKHKHSKE
KSKDKDKLPRPPSTDTLRIKEETRETLSSFSSSQSPSEDISSMPSNMSF