Monarch geneset OGS2.0

DPOGS211014
Transcript:        DPOGS211014-TA  (1887 bp)
Protein:           DPOGS211014-PA  (628 aa)
Genomic position:  DPSCF300004 + 1246878-1250492
RNAseq coverage:   170x (Rank: top 51%)
Annotation
Heliconius      HMEL006042        0.0     82.93%
Bombyx          BGIBMGA006496-TA  0.0     71.28%
Drosophila      wda-PA            2e-69   29.38%
EBI UniRef50    UniRef50_E0VPQ2   7e-118  38.10%  WD-repeat protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VPQ2_PEDHC
NCBI RefSeq     XP_002428096.1    1e-118  38.10%  WD-repeat protein, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|242014850      2e-117  38.10%  WD-repeat protein, putative [Pediculus humanus corporis]
NCBI nr blastx  gi|242014850      3e-115  38.10%  WD-repeat protein, putative [Pediculus humanus corporis]
Group
Gene Ontology    GO:0005515  4.9e-64  protein binding
                 GO:0005634  4.8e-09  nucleus
                 GO:0006355  4.8e-09  regulation of transcription, DNA-dependent
KEGG pathway     phu:Phum_PHUM362800  4e-118
                 K03130 (TFIID4, TAF5)  maps-> Basal transcription factors
InterPro domain  [341-616]  IPR015943  4.9e-64  WD40/YVTN repeat-like-containing domain
                 [338-616]  IPR011046  3.6e-57  WD40 repeat-like-containing domain
                 [478-514]  IPR019781  3.8e-12  WD40 repeat, subgroup
                 [517-556]  IPR001680  2.9e-10  WD40 repeat
                 [63-129]   IPR007582  4.8e-09  TFIID subunit, WD40-associated region
                 [459-473]  IPR020472  2.6e-07  G-protein beta WD-40 repeat
Orthology group  MCL13645  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211014-TA
ATGAAAAGGACTAGAAATGATGCAGTTAAAGCAGCCGTTACATCCTACTTAGAGCGAAGGAATTATCCCGACATTGATTTCTTTAACCACAATAATTGTACTAGTCGCAGTGCTGAAGAAATGGCAGTGGCTACAATTGTACAATGTGAAGCTAGCCGTGCCAATTCGATTATGTTTTCTCTTATTAATAATGACCCTGGAAATTATGACGCTCAGTACACAAAACTGGTTACTTTTATAAAGGAAATAAATATTGAAAAGGTTAAAAATGAGTTACTTGGTTTACTTACACCATTGTTATGCCACTTATATTTGGAAATGCTGCGCGGTGGCCATGGTGGTGCAGCTCAAATGTTTCTAAAGAGACACTCTGCTACATTACCACAAAAGGAGTTATCATACCACCAACCAATAGACGGCAATCTTCCATCAGCACTCTACCGACCAAACAGCCTTGAGCAATTGTTCAATTCTCTACAAAACGGTACTATAGATAATGAGACCCCAGAGAAAGACTATATGAATCAACTGTTAGATGACATTGGAACTATATATACCTTGCAGGAAGTTGAATCTCGACCTTCCATAGCTGCTTTTCGGTCCTGCAAATATGACATATATCTATCACAAGACTCTCTAAACCTTCTGAAGACTTTTCTGGCTAGACATGGACATGTTTTAATCATACAAGTTCTACAAACATGGTTCCATATTGATATTAATAATGACAATAAGAAAACTTCTGAAGACGATGACGAAGAACACAATGAAGATGAAAATCATATGAATGTTGCGAATTGTGATGAAAAACCAACTGATAAAAATGTGAAATGTTCTGAATCCACAGATGTGTTTTCCAAATGTAATGGTCACACAGAACATCAATCTGTAGACAAAGAATTGAGAGATCTGCAAGATGCTATTAAAGGTGTTAGAGAAACAATTGCACCACTCAAACTGTACAAAATTGCAACTCCTGATAGCCATCTGATATGCGGTAAAACAGACCAATACTGTAATGTGTTATGTGGAGGATTTGAAAACTCAGAAATAAGACTTTGGGATCTTGGACAGAATAATGTTAAGAAAAAGATTAACAGAAACATATCGGAAGTGGAAATTGCTTGTTGTATACCAGCCGAACCCGAAACTTCATTAGACAATACCTTTCAAATAGGAACAGGTTTACCACTTAGGGGTCATTCTGGTCCAATTCAAGCTATCAGTATTCTAGCTCAAGAGCAACTAGTGTTGTCCGCATCCCACGATAATACCATGAGGGCATGGAAATTGTCAGATTATTCATGTGCTTCTATATACCGAGGTCACAATTATCCGATATGGTGCATGGACGTATCCAAAAATGGTTTATTTATTGTAACGGGATCTCATGATAGAACTGCAAAACTATGGTCATTGGATCGCACATTTCCAGTTAGGATTTTTGTGGGACATTTATCTGATGTTACCTGCGTAAAATTTCATCCCAACGAGGCGTACCTGGCGTCAGGAGGCGCGGATCGCACGGTTCGAATGTGGAGTGTATGTGACGCTAGACTTGTTCGTGTATTGTGTGGACATCGCGCTCCACCACGAGCACTGGCCTTCTCACCCTCAGGGAAACATTTGGCTAGTGCAGGTGATGATAAAAAAATTAAAGTGTGGGATCTAGCCGCTTGCAACTGTATTCATGAATACAGAGGACATCATAGTAAAGTGACGTCATTAGATTGGTCAGCGGTCGGAAAGGCTAGCTTAACTAACAGAATATCGTCAGATCCTAATGACACAAATGCAGATAATTCAATATTATGCTCCGCTGGTATGGATGGCATAGTAAAGGTTTTTTATGACACAATGAGTTTTTTGTTCACTCATGATTCATAG

Protein sequence:

>DPOGS211014-PA
MKRTRNDAVKAAVTSYLERRNYPDIDFFNHNNCTSRSAEEMAVATIVQCEASRANSIMFSLINNDPGNYDAQYTKLVTFIKEINIEKVKNELLGLLTPLLCHLYLEMLRGGHGGAAQMFLKRHSATLPQKELSYHQPIDGNLPSALYRPNSLEQLFNSLQNGTIDNETPEKDYMNQLLDDIGTIYTLQEVESRPSIAAFRSCKYDIYLSQDSLNLLKTFLARHGHVLIIQVLQTWFHIDINNDNKKTSEDDDEEHNEDENHMNVANCDEKPTDKNVKCSESTDVFSKCNGHTEHQSVDKELRDLQDAIKGVRETIAPLKLYKIATPDSHLICGKTDQYCNVLCGGFENSEIRLWDLGQNNVKKKINRNISEVEIACCIPAEPETSLDNTFQIGTGLPLRGHSGPIQAISILAQEQLVLSASHDNTMRAWKLSDYSCASIYRGHNYPIWCMDVSKNGLFIVTGSHDRTAKLWSLDRTFPVRIFVGHLSDVTCVKFHPNEAYLASGGADRTVRMWSVCDARLVRVLCGHRAPPRALAFSPSGKHLASAGDDKKIKVWDLAACNCIHEYRGHHSKVTSLDWSAVGKASLTNRISSDPNDTNADNSILCSAGMDGIVKVFYDTMSFLFTHDS-