Monarch geneset OGS2.0

DPOGS204303
TranscriptDPOGS204303-TA1941 bp
ProteinDPOGS204303-PA646 aa
Genomic positionDPSCF300046 + 564478-571088
RNAseq coverage282x (Rank: top 39%)
Annotation
HeliconiusHMEL0151637e-15277.26% 
BombyxBGIBMGA007579-TA0.085.23% 
DrosophilaTaf6-PA0.058.55% 
EBI UniRef50UniRef50_P498470.058.55%Transcription initiation factor TFIID subunit 6 n=35 Tax=Coelomata RepID=TAF6_DROME
NCBI RefSeqXP_001844213.10.057.03%transcription initiation factor TFIID subunit 6 [Culex quinquefasciatus]
NCBI nr blastpgi|1700326900.057.03%transcription initiation factor TFIID subunit 6 [Culex quinquefasciatus]
NCBI nr blastxgi|1700326900.056.39%transcription initiation factor TFIID subunit 6 [Culex quinquefasciatus]
Group
Gene OntologyGO:00056342.1e-30nucleus
GO:00063522.1e-30transcription initiation, DNA-dependent
GO:00510905.3e-28regulation of sequence-specific DNA binding transcription factor activity
GO:00036773.8e-21DNA binding
KEGG pathwaycqu:CpipJ_CPIJ0025440.0 
 K03131 (TFIID5, TAF6)maps-> Basal transcription factors
InterPro domain[10-74] IPR0048232.1e-30TATA box binding protein associated factor (TAF)
[294-401] IPR0114425.3e-28Domain of unknown function DUF1546
[9-74] IPR0090723.8e-21Histone-fold
Orthology groupMCL13742 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204303-TA
ATGGGTGATTCTGACTTAACTTATGGATCATCCCTGAATGTAGATTCTATAAAGGTGATATCTGAAAGTGTCGGCATAGCAAGTCTTGGGGATGATGCTGCTAAGGAACTAGCAGATGATGTTTCCTTTAGATTAAAAGTAATTGTTCAAGATGCTATGAAATTCATGCATCATTCAAAAAGACAAAAACTTTCTATTACAGATATAGACAATGCTTTAAAAATAAAAAACACTGAAGCCCAGTATGGTTTTGTGCAGCCTGACTCATTGCCATTCAGGTTTGCATCTGGTGGTGGTAGAGAGCTACATTTTATAGAGGAGAAAGAAATTGATTTATCAGAAATACTGTCAGCTCCACCCCCAAAAATTCCTCTTGATGTGTCCTTGAGAGCACACTGGCTTAGTGTTGATGGGGTTCAGCCAACTGTTCCCGAGAATCCGCCACCATTATCCAAAGAGGCACAAAAATTGGAGTCAGTTGATCCTGTTTCTAAATTAAGCAAGCCTGCCAATAAAGATTCAGCAGGAAAACCAGTTAGTGGTAAAGCAGCTAGACTTAAAGCCTCAGAGTCTGTCCATGTTAAACAACTTGCAACACACGAGCTTAGTGTGGAACAACAGCTATATTATAAGGAAATCACAGAAGCAGGTGTGGGCAGTGATGAGGGACGGAGAGCTGAAGCCCTGCAATCACTGGCATGTGATCCTGGCCTACATGAGATGTTGCCAAGGATGTGTACATTTATATCAGAGGGTGTAAAAGTCAATGTTGTCCAGAATAACTTGGCTCTCCTTATTTATTTGATGAGAATGGTGAAAGCAATGTTGGACAATCAATCACTTTATTTAGAAAAATATCTTCATGAATTGATTCCATCAGTCTCAACGTGTATAGTGTCCCGACAGCTTTGTACGCGGCCAGAAGTTGACAACCACTGGGCGCTCCGAGACTTCGCCGCTCGACTAATGGCCCAGCTGTGCAAAACATTTAATACTTCTACTAATAATCTACAAACAAGAGTTACAAGGTTGTTTGCAAAAGCCCTGCAATGTCCATCACAAACAAACAACGAAAGTGGACCGTCAATGGTTGCTTCTATGAAGGAATCTGAGAAGACTCCTTTAGCCTCGCTCTATGGAGCAGTCCAAGGTTTAGCTGAGTTGGGTCCTGAGGTGGTGAAGGTATTTATCCTGCCTCGTGTGCGATGGTTAGGCGAGCGTGTGGAGGGTGCGCTAGGTGGGGCTGCGGGCGCAGACCGTGTAGCTGCGAGCAACCTTAAACACCAGTTACTCAAGGTGTTGGCTCCAGTAGTGCGACAGCTTCGTCAACCGCCCGACCTTCCTGATGACTACAAACGCGAGTACGGCTACCTCGGTCCGAGTCTACAGCAAGCTGTGAGTAAGCTGCGGTCGTCTCCGACAGGCGGCGGCGGCGGCGGCGCGGTGGCCGTGTTGCCGTGTACCCCGCCTCTGCTACCTCACCCGCCATCACCCGCACCACACGCCAAGTCCATCGAGTCAATATCGACCCGTAACGTTGTGATTACATCAGGCGCCCCCTCCCCCGCGCCCTCAACACCTCCGCCGCAGAAATTCGTCATAGTAGCCTCGCAACAGAAGACGCAACAGAACCAACCAGCCAGTGGGTCCGGTCACATAGTAGTTCATAGCTCGCAGCCTACCATCGTCCGCAGTCAAAATGTACAGTCGGTGGTGGTGACGAGCGGGCCGGCGGGAGCACAGCCGCCGCAGAAGCTGGTGGTGGTGGGGATGAACCCCCTGCACACAGCACACTCGCAACACTCACCGCTGCAGGCCACCACCACGGTGTCGGGCGTCAGTCAGGCGCCCGTGTCAGTGGTCGCCAAGCCGGTGTTCGCTCGCGGCGGCTCGGCCCCGCAGCCTCCGCCGGAGCTGGACGACCTGTCGCACCTCGCTTGA

Protein sequence:

>DPOGS204303-PA
MGDSDLTYGSSLNVDSIKVISESVGIASLGDDAAKELADDVSFRLKVIVQDAMKFMHHSKRQKLSITDIDNALKIKNTEAQYGFVQPDSLPFRFASGGGRELHFIEEKEIDLSEILSAPPPKIPLDVSLRAHWLSVDGVQPTVPENPPPLSKEAQKLESVDPVSKLSKPANKDSAGKPVSGKAARLKASESVHVKQLATHELSVEQQLYYKEITEAGVGSDEGRRAEALQSLACDPGLHEMLPRMCTFISEGVKVNVVQNNLALLIYLMRMVKAMLDNQSLYLEKYLHELIPSVSTCIVSRQLCTRPEVDNHWALRDFAARLMAQLCKTFNTSTNNLQTRVTRLFAKALQCPSQTNNESGPSMVASMKESEKTPLASLYGAVQGLAELGPEVVKVFILPRVRWLGERVEGALGGAAGADRVAASNLKHQLLKVLAPVVRQLRQPPDLPDDYKREYGYLGPSLQQAVSKLRSSPTGGGGGGAVAVLPCTPPLLPHPPSPAPHAKSIESISTRNVVITSGAPSPAPSTPPPQKFVIVASQQKTQQNQPASGSGHIVVHSSQPTIVRSQNVQSVVVTSGPAGAQPPQKLVVVGMNPLHTAHSQHSPLQATTTVSGVSQAPVSVVAKPVFARGGSAPQPPPELDDLSHLA-