Monarch geneset OGS2.0

DPOGS200432
Transcript: DPOGS200432-TA (918 bp)
Protein: DPOGS200432-PA (305 aa)
Genomic position: DPSCF300236 + 53282-55369
RNAseq coverage: 388x (Rank: top 31%)
Annotation
Heliconius: HMEL002502  7e-109  71.47%
Bombyx: BGIBMGA008893-TA  1e-66  61.60%
Drosophila: Taf11-PA  9e-40  69.23%
EBI UniRef50: UniRef50_B0WET7  3e-44  68.15%  Transcription initiation factor TFIID subunit 11 n=4 Tax=Diptera RepID=B0WET7_CULQU
NCBI RefSeq: XP_001847221.1  5e-45  68.15%  transcription initiation factor TFIID subunit 11 [Culex quinquefasciatus]
NCBI nr blastp: gi|170038770  9e-44  68.15%  transcription initiation factor TFIID subunit 11 [Culex quinquefasciatus]
NCBI nr blastx: gi|66511593  1e-47  58.46%  PREDICTED: hypothetical protein LOC411144 isoform 1 [Apis mellifera]
Group
Gene Ontology: GO:0005634  1.3e-65  nucleus
               GO:0006367  1.3e-65  transcription initiation from RNA polymerase II promoter
               GO:0003677  4.4e-34  DNA binding
KEGG pathway: dpo:Dpse_GA17941  3e-39
              K03135 (TFIID9) maps-> Basal transcription factors
InterPro domain: [148-304] IPR006809  1.3e-65  TAFII28-like protein
                 [196-292] IPR009072  4.4e-34  Histone-fold
Orthology group: MCL12798  Single-copy universal gene

Nucleotide sequence:

>DPOGS200432-TA
ATGGCTGATAATCCAAGCAGCGACGGTTTATCCGAAGCGGAAAGATTAAAAGAAGAGGAATTGGCTCAAGAACTGGACCACAGTGATTTTAACGAAGACGACTCTCAGGAAATATCCTTAGAAATAGAAGAGAAACTGTTGAAAGATGAACCAGAATACACAACGTTAGTCAACTATGGTGAGAGTATACAAAGTGAACAGAGTATCACAACAGAGCATTTCAATGATAACTTAGATAAAGGACTAGACTCTGAGGGTCTTGGTAATATATACACTGAAACCACTGACATGTTTATAACACAATCGGATCTCAGTGATCCAAATGATCAGTATGGATCATACATTAATGATTATTCCTTTGATAACTCGCAAACAATTGAGGAGAGCCATTCACAGAGTATAGGAGAATATTCTACAAGTAAGGATGCTAGTGTGATGCCAAACATGGGAGAATTCCAGCAAAGTGAAAATTCAAATGGTGACTATCAAACTGCGGAAGATCTGAGCGAAAGAGAAAGAGAGAAGAAAACAAAGAAGGAATTAGAAGAAGAGGAAAGAGAGAAGATGCAAGTTTTAGTGTCAAACTTTACTGAAGAACAACTAGGAAGGTATGAAATGTATAGACGAGCAGCGTTTCCTAAGGCGGCTGTAAAACGTTTGATGCAGACTATCACGGGATGTTCAGTCGGACAGAATGTTGTGATAGCTATGTCTGGTATCGCCAAGGTGTTTGTTGGTGAAGTCGTTGAGGAAGCATTAGAGGTATTGGAAAAATCTGGAAGATCGAATCTCCATAACCTTTTTAGACCGGAGCCAGGTCCATTACAGCCAAAGCATCTCCGGGAAGCATTGCGTAGGCTAAGAGTAAGGGGTGCCATATCAGCAAGAAAAGCTTATAGGGGATCTTTCAGATTGTAA

Protein sequence:

>DPOGS200432-PA
MADNPSSDGLSEAERLKEEELAQELDHSDFNEDDSQEISLEIEEKLLKDEPEYTTLVNYGESIQSEQSITTEHFNDNLDKGLDSEGLGNIYTETTDMFITQSDLSDPNDQYGSYINDYSFDNSQTIEESHSQSIGEYSTSKDASVMPNMGEFQQSENSNGDYQTAEDLSEREREKKTKKELEEEEREKMQVLVSNFTEEQLGRYEMYRRAAFPKAAVKRLMQTITGCSVGQNVVIAMSGIAKVFVGEVVEEALEVLEKSGRSNLHNLFRPEPGPLQPKHLREALRRLRVRGAISARKAYRGSFRL-