Monarch geneset OGS2.0

DPOGS200416
Transcript: DPOGS200416-TA, 1398 bp
Protein: DPOGS200416-PA, 465 aa
Genomic position: DPSCF300236 - 402444-403841
RNAseq coverage: 367x (Rank: top 32%)
Annotation
Heliconius: HMEL017178, 0.0, 87.53%
Bombyx: BGIBMGA008992-TA, 0.0, 85.78%
Drosophila: dik-PA, 2e-35, 33.22%
EBI UniRef50: UniRef50_Q1HQA8, 0.0, 85.78%, Transcriptional adaptor 3 n=9 Tax=Obtectomera RepID=Q1HQA8_BOMMO
NCBI RefSeq: NP_001040349.1, 0.0, 85.78%, transcriptional adaptor 3 [Bombyx mori]
NCBI nr blastp: gi|114050821, 0.0, 85.78%, transcriptional adaptor 3 [Bombyx mori]
NCBI nr blastx: gi|114050821, 0.0, 85.59%, transcriptional adaptor 3 [Bombyx mori]
Group
KEGG pathway 
InterPro domain: [331-454] IPR019340, 4.7e-29, Histone acetyltransferases subunit 3
Orthology group: MCL13075, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200416-TA
ATGATGGGTAAGCGTATGCATCATAATAATAAAGGAAGATTAAGCAGCAAAGGCCATGATAACGGCAAACCCTCTAGTCCAGGTGTTTCACCATATAATAAACCTTCAAAAATACCAGGTGCTGTGTCAGCTGGTAAAACAAAAGTAGAAACATGTCCCATTCCATATATAAAGCAGCAAGATAATGTATCTACTTTACCGAGACTAGCTGCTATCTGTGCGAGATCTGCAGATGAACCGATTGGCATGGACGAACTTGACGCATTACAATTAGAGCTTGAATCACTTTTATGTAATACAGCACTACGGATAAGATACTTTCAGAGTGAAATTGAAAGTATTGATACTAATGAATCGAAAAGGGAAAAGAAAGGTAAAGCGGCCGGGAAACAGTTAACATACCCAATGAAAAGAAAATTTCAAGATGACAAATTAGTAAAGACCAAAGATTATGCTAAACTGTCCAACCAACCAAAGGTACCGAAATTGAAAACATTCGGAAATACCTCATCTGGTGCATCTCAAAACTATCTCAATGAAAACAATGCTAACTCTGACAATTCAGTAAAATTAGAACTATCACAGTTAGCTTTACCAAAGAATAATATACCTTATAAGTTCTGGAACTCTGTAGATCCATACTGTGCACCTGTTACACTTGATGATATTAAATTCCTTGAATCATTGCTAGCTCAGAGTAGCAATACAACACTTCCACCCATTCCACCTCTTGGAAAACATTATTCGGAAGTTTGGGCAGATGAACACCTCACAGAAGATCAGAACGCATCAAATCCGAGTAAAATCAAATCATCCGGTATGTCCCCAGACGCATCTAGCTTAAGAAAAAAATTTGATAAATCTTCGGAAAATATGATCACCGGTCCATTAACGCAACGGTTGGTGTCAGCGTTGATGGAAGAAAATGTAATGCCTTATGAAGTCCCAGATATCAAAGTCAAACAGACAACTAACACTAGAAATAGTTATAAGAATTCTTTGACCTTAGAAAAATGTCTAAGGAAAGAACTGGTTGAGCAAGGAATTTTAGACCCGGAAGATTTACCGCCACTCACCAATCCAGCTGATGATGAAATATTGGCAGAAATCAAAAAATGCCAAACTGAATTGACAGCGGTTAGAAAAGAAAATTGTCGAAATCTTAAAAACCTCATTGGTTTATGTAAACAGGAAATGATAAGACTGAATTTAAAGAAGCAGCTCGATCAGGTTGATATGGAATGTATTGATATATACAAGAAGATGGTAGCCGCTAAACAGAAGAAAAGACCTATAACTAAGAAAGAAAAAGATGACGCATGGAGAGCCATTAATGAGCAGATTAGACTTAATAAAGAAATAAATGCTTTACCCTTAACTGGCCCTAATACAAGTTAA

Protein sequence:

>DPOGS200416-PA
MMGKRMHHNNKGRLSSKGHDNGKPSSPGVSPYNKPSKIPGAVSAGKTKVETCPIPYIKQQDNVSTLPRLAAICARSADEPIGMDELDALQLELESLLCNTALRIRYFQSEIESIDTNESKREKKGKAAGKQLTYPMKRKFQDDKLVKTKDYAKLSNQPKVPKLKTFGNTSSGASQNYLNENNANSDNSVKLELSQLALPKNNIPYKFWNSVDPYCAPVTLDDIKFLESLLAQSSNTTLPPIPPLGKHYSEVWADEHLTEDQNASNPSKIKSSGMSPDASSLRKKFDKSSENMITGPLTQRLVSALMEENVMPYEVPDIKVKQTTNTRNSYKNSLTLEKCLRKELVEQGILDPEDLPPLTNPADDEILAEIKKCQTELTAVRKENCRNLKNLIGLCKQEMIRLNLKKQLDQVDMECIDIYKKMVAAKQKKRPITKKEKDDAWRAINEQIRLNKEINALPLTGPNTS-
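
The lengths reported in this record can be cross-checked with simple arithmetic. The Python sketch below is illustrative only and is not part of the database. It assumes that the genomic coordinates are inclusive and that the transcript is a coding sequence ending in a stop codon (the nucleotide sequence above starts with ATG and ends with TAA). The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming both ends are inclusive."""
    return end - start + 1


def expected_protein_length(transcript_bp: int) -> int:
    """Residues encoded by a coding sequence whose last codon is a stop codon."""
    if transcript_bp % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return transcript_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
genomic_start, genomic_end = 402444, 403841
transcript_bp = 1398
protein_aa = 465

# Does the genomic span match the transcript length?
print(span_length(genomic_start, genomic_end) == transcript_bp)
# Does the transcript length match the protein length plus a stop codon?
print(expected_protein_length(transcript_bp) == protein_aa)
```

Under these assumptions both checks agree with the record. A genomic span equal to the transcript length would be consistent with a gene model that has no introns, although the record itself does not state this.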