Monarch geneset OGS2.0

DPOGS206547
Transcript        DPOGS206547-TA   1956 bp
Protein           DPOGS206547-PA   651 aa
Genomic position  DPSCF300190 + 179319-183710
RNAseq coverage   324x (Rank: top 35%)
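The summary fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: `parse_position` is a hypothetical helper, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_position(text):
    """Split 'SCAFFOLD STRAND START-END' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("DPSCF300190 + 179319-183710")

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

transcript_len = 1956  # bp, from the Transcript field
protein_len = 651      # aa, from the Protein field

# A coding sequence with a stop codon has 3 * (residues + 1) bases.
cds_matches_protein = transcript_len == 3 * (protein_len + 1)

print(scaffold, strand, genomic_span)
print(cds_matches_protein)
```

Under that assumption the gene spans 4392 bp on the scaffold, more than twice the 1956 bp transcript, which is consistent with the locus containing introns.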
Annotation
Heliconius       HMEL002282        0.0     87.25%
Bombyx           BGIBMGA014036-TA  0.0     83.20%
Drosophila       Taf5-PA           1e-169  46.36%
EBI UniRef50     UniRef50_E2A7B5   0.0     55.98%  Transcription initiation factor TFIID subunit 5 n=25 Tax=Eumetazoa RepID=E2A7B5_CAMFO
NCBI RefSeq      XP_001652518.1    0.0     56.03%  wd-repeat protein [Aedes aegypti]
NCBI nr blastp   gi|157115015      0.0     56.03%  wd-repeat protein [Aedes aegypti]
NCBI nr blastx   gi|282400160      0.0     56.75%  cannonball [Tribolium castaneum]
Group
Gene Ontology    GO:0005515  5.9e-74  protein binding
                 GO:0005634  7.4e-39  nucleus
                 GO:0006355  7.4e-39  regulation of transcription, DNA-dependent
KEGG pathway     aag:AaeL_AAEL007067  0.0
                 K03130 (TFIID4, TAF5) maps-> Basal transcription factors
InterPro domain  [311-650]  IPR015943  5.9e-74  WD40/YVTN repeat-like-containing domain
                 [311-650]  IPR011046  3e-67    WD40 repeat-like-containing domain
                 [52-189]   IPR007582  7.4e-39  TFIID subunit, WD40-associated region
                 [471-508]  IPR019781  5.9e-13  WD40 repeat, subgroup
                 [469-508]  IPR001680  5.7e-12  WD40 repeat
                 [411-425]  IPR020472  1.9e-08  G-protein beta WD-40 repeat
Orthology group  MCL13162   Single-copy universal gene
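As a rough illustration of how the InterPro ranges relate to the 651 aa protein, the sketch below merges the overlapping domain intervals listed above and counts the residues they cover. The function name `merged_coverage` is hypothetical, and the ranges are assumed to be 1-based inclusive residue positions.

```python
# InterPro hits from the record: (accession, start, end)
domains = [
    ("IPR015943", 311, 650),
    ("IPR011046", 311, 650),
    ("IPR007582", 52, 189),
    ("IPR019781", 471, 508),
    ("IPR001680", 469, 508),
    ("IPR020472", 411, 425),
]
PROTEIN_LEN = 651  # aa, from the Protein field

def merged_coverage(intervals):
    """Count residues covered by at least one inclusive (start, end) interval."""
    total = 0
    cur_start = cur_end = None
    for start, end in sorted(intervals):
        if cur_end is None or start > cur_end + 1:
            # Close the previous merged block before starting a new one.
            if cur_end is not None:
                total += cur_end - cur_start + 1
            cur_start, cur_end = start, end
        else:
            cur_end = max(cur_end, end)
    if cur_end is not None:
        total += cur_end - cur_start + 1
    return total

covered = merged_coverage([(s, e) for _, s, e in domains])
all_inside = all(1 <= s <= e <= PROTEIN_LEN for _, s, e in domains)

print(covered, round(covered / PROTEIN_LEN, 3))
print(all_inside)
```

The shorter WD40 hits all fall inside the [311-650] region, so the annotated coverage reduces to two blocks: the TFIID-associated region at 52-189 and the WD40 region at 311-650.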
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206547-TA
ATGGGAGATAAGTCCACCCCACTTTTAGCTGTACTGCAACTTCTCCGAAAATATGATTTGAAAGGCACAGAAGAACTTTTAAGGAAAGAAGCAAACTTAGGAGATGTTGAATACGACAACTTAGATTTGCCAGAAGTAGAACTTGCCAGTATACTTACTGCTCATCACACAGAAAGTGATCCATATAGCTATGAGTTTGCTTATGATAGTCTTAAAAAGTTCGTTGAAAATTCTCTAGACATTTACAAGTATGAATTATCAACACTTCTTTACCCAGTGTTTGTGCATATGTATTTGGCACTAATATTGTATGACCACAATGAACATGCTTATAAGTTCTTTGAAAAATTTGGCCTAGAACAGGAAGATTATTATCAAGAGGATTTAACTCGTCTCTCAATTGTGAAACATAAAGATCAAATTAAAGGAAATGAAATTGCAGAAATATACAGCTCAAACAAATTTCAAGTTCAAGTATCTCGAGATGCATCTACACAGCTTAAGAGATATTTACATGAACAGAAGAGTTCACCGGTGATTATAAATATCTTAAATAATCACATACAAATTGATATACATGACGGACCAGGTCGAACACAAGCACAAGTGAGAGCTACTATTGGGGGGCTACTAGGTGAAGCTTCTAGAAACGAAAATCGCACAAAAGTGTATTATGGATTATTAAAAGAACCTGACATACAAGTCCTTCCACCACCCATTGAAGATGAAGAGGAAGCAGAGGAGACCCCAGATAAACCCAAAAAAAAGAAGGCCAAAAAGGACAATATTTTTCTCAAAAAACCTAAATCTGATCCCAATGCACCACCAAATGACAGAATTCCCTTGCCAGAGTTAAAAGAAACAGACAAGCTGGAAAAGGGTAAAGCCATAAGAGAAGCTGCAAAACGTGTTCAACTTGGACCAGAGAGTCTACCATCTATTTGCTTTTACACTCTCTTAAACAGCGGCCATACAGCCATATGTGCTGATATCTGTGATGACTCAACACTGCTTGCCGTTGGCTTCAACAACTCTTATATTAAGGTTTGGACTTTGACTACAATAAGATTAAGAGGAATGAAATCAGCTGAAAAGTTGCAAGACATCGACAGAGAAGCTGGTGACGTCTTAGTGAGGATGATGGAAGAGAAGGACAGAGATACATGCCGTACACTATACGGCCACTCGGGATCAGTATTCAAAGTGGCATTCGATCCTTTCAAAACTTTGTTACTATCATGCTCTGAAGATTCCACAGTCCGGCTATGGTCCCTGCAGTGTTGGTCGTGCCTAGTGGCGTACCGTGGCCACGCGTGGTCGGTTTGGGACGTACGTTGGTCGCCTCATGGCCACTACTTCGCCAGCGCCGGGCACGACCGGACCGCACGCCTCTGGGCCACCGACCACCATCAACCGCTCCGAATATTCGCTGGACATCTTTCTGACGTCGATTGTGTTCAATTCCATCCAAACTCGAATTACATAGCAACGGGATCCAGCGACCGCACCGTAAGACTATGGGACTGTTTGACGGGAACGCAAGTACGGATCATGACCGGTCACAAGACAACTCCATATACCGTTGCGTTTTCTGTATGTGGTCGTTGGATAGCTTCAGGCGGCGCGGGGGGAGAGATAGTTGTATGGGATATTTCTACCGGTCTACCAATGAGCACTCTGCCTCCAATGCATGTAGCGCCTGTTCACGCCTTAGCCTTCAGTCGAGACGGCACTATCTTATCTTCAGGTTCATTGGACTCCACAATCAAACTGTGGGATTTCACATTGATTACGGATGAAAGTCTGACGGAAGAGACAGGTTCAAGTACTGTCACACAAAAAGAAGAGAAGGTTCTCTTGCGTTCGTTTGCGACGAAAAATTCGCCAATCAAGCATTTGCATTTCACCCGTCGCAACCTTTTGTTGGCTGTAGGGTCATATGAAGGAAGTTCCTAA

Protein sequence:

>DPOGS206547-PA
MGDKSTPLLAVLQLLRKYDLKGTEELLRKEANLGDVEYDNLDLPEVELASILTAHHTESDPYSYEFAYDSLKKFVENSLDIYKYELSTLLYPVFVHMYLALILYDHNEHAYKFFEKFGLEQEDYYQEDLTRLSIVKHKDQIKGNEIAEIYSSNKFQVQVSRDASTQLKRYLHEQKSSPVIINILNNHIQIDIHDGPGRTQAQVRATIGGLLGEASRNENRTKVYYGLLKEPDIQVLPPPIEDEEEAEETPDKPKKKKAKKDNIFLKKPKSDPNAPPNDRIPLPELKETDKLEKGKAIREAAKRVQLGPESLPSICFYTLLNSGHTAICADICDDSTLLAVGFNNSYIKVWTLTTIRLRGMKSAEKLQDIDREAGDVLVRMMEEKDRDTCRTLYGHSGSVFKVAFDPFKTLLLSCSEDSTVRLWSLQCWSCLVAYRGHAWSVWDVRWSPHGHYFASAGHDRTARLWATDHHQPLRIFAGHLSDVDCVQFHPNSNYIATGSSDRTVRLWDCLTGTQVRIMTGHKTTPYTVAFSVCGRWIASGGAGGEIVVWDISTGLPMSTLPPMHVAPVHALAFSRDGTILSSGSLDSTIKLWDFTLITDESLTEETGSSTVTQKEEKVLLRSFATKNSPIKHLHFTRRNLLLAVGSYEGSS-
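
The two FASTA records above can be checked against each other without a full translation. The sketch below is a minimal consistency check, not part of the database: `check_cds` is a hypothetical helper, it assumes the standard genetic code stop codons, and it assumes the trailing "-" in the protein record marks the stop codon. The demonstration uses a toy sequence built from the first three codons of the transcript plus a stop, not the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code

def check_cds(nt, aa):
    """Return a list of inconsistencies between a CDS and its protein."""
    nt = nt.upper()
    problems = []
    if len(nt) % 3 != 0:
        problems.append("length is not a multiple of 3")
    if not nt.startswith("ATG"):
        problems.append("does not start with ATG")
    if nt[-3:] not in STOP_CODONS:
        problems.append("does not end with a stop codon")
    # Assumption: a trailing "-" in the protein stands for the stop codon.
    residues = aa.rstrip("-")
    if len(nt) // 3 != len(residues) + 1:
        problems.append("codon count does not match protein length + stop")
    return problems

# Toy input: ATG GGA GAT (M G D) followed by TAA.
print(check_cds("ATGGGAGATTAA", "MGD-"))
# A truncated sequence trips three of the four checks.
print(check_cds("ATGGGAGATTA", "MGD-"))
```

Applied to the full records, the same check would compare the 1956 bp transcript (652 codons) with the 651 residues plus stop in the protein sequence.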