Monarch geneset OGS2.0

DPOGS204995
Transcript        DPOGS204995-TA   1215 bp
Protein           DPOGS204995-PA   404 aa
Genomic position  DPSCF300123 + 176816-179659
RNAseq coverage   732x (Rank: top 18%)
Annotation
Heliconius      HMEL009487         9e-91    80.32%
Bombyx          BGIBMGA010230-TA   0.0      73.38%
Drosophila      CG2051-PA          5e-102   43.75%
EBI UniRef50    UniRef50_B7PSE2    4e-114   49.25%   Histone acetyltransferase type B catalytic subunit, putative n=2 Tax=Arthropoda RepID=B7PSE2_IXOSC
NCBI RefSeq     XP_625126.2        4e-115   49.25%   PREDICTED: similar to histone aminotransferase 1 [Apis mellifera]
NCBI nr blastp  gi|321464799       2e-116   50.88%   hypothetical protein DAPPUDRAFT_306646 [Daphnia pulex]
NCBI nr blastx  gi|383852762       1e-113   51.26%   PREDICTED: histone acetyltransferase type B catalytic subunit-like [Megachile rotundata]
Group
Gene Ontology    GO:0004402   4.7e-44   histone acetyltransferase activity
                 GO:0016568   4.7e-44   chromatin modification
KEGG pathway
InterPro domain  [3-326]    IPR016181   1.5e-86   Acyl-CoA N-acyltransferase
                 [10-171]   IPR019467   4.7e-44   Histone acetyl transferase HAT1 N-terminal
Orthology group  MCL13941   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204995-TA
ATGGCCGAGACACTAGAACATCTGGTTGTAGACGGCAACGAGGTGCTAGAAATAAAATTAGTTAGGTCTGCAGATGATATAGAAGACGAGCAGACAACATTCGGTCCTGAGATGTGCCATCAAGTTTTTGGCGAAAACGAAAATATTTTTGGTTACACAGATTTACAAATTAAACTCTATTACAGCGCTGCCAGTCTTCAGACATATCTCGGCATCTCATATTCCGATAAGATAGATCCAAAAAAAACAGGAGGTTTGAAGGCTGATGATATTGAGGAAGCATTGAAAAAAGTATTGGCACCCGGTTATGTGACAAATTTAGATGTATTTGTTTCCATGTTAGAAAAAGATAAAAACTTCACACCACATGGCAAGCTCATCCATACATTTTCAACAATACCTTGTGACGGCTGTGAATCTCGAACGTTTGAAGTTTACTATTCAGAGGTCACAACCCCCGGCTTCCTGTCGTTCCACGAACGAATACAAACATTCTTATTGTGGTATGTTGATGGCGCGTCCTTCATTAATGTGGATGATGACCAATGGACGTTCTTTACTGTTTTTGAGAAGTGTCGTAATAGTGTGGGTGAATATCGTTACTCAGTGGCCGCCTATACGACAGTATTCAGATACTATGCCTATCCTAACAACGTTAGACCAAGGGTGTCCCAAGTGCTGACATTGCCACCGTTTCGCAAGATGGGGATATGCGCTAATTTGCTACAGGCCATCTACTCGCATTTCATAGCACAGCCGGAGGTAGTCGACATAACAGTTGAAGATCCATCGGAAAGTTTCCAAAGGATACGAGATTTTGTTGATGTTAAGAATTGTGAATCGTTACCCGCATTCCAACCCTTGAAACTTTTACAAGGTTTCTCTCCAGAAATGATAAATCAGGCTCGTAGCAAGTTTAAAATTAACAAGAAACAGGCTCGTAGAGTGTATGAGATACTCCGTTTGAAGAACACTAACACATCAGACAAGACAGCTTATCTAACTTACAGACTGGATGTTAAGAATAGGTTGAACGCACCTTTTCAGAAAAAAAAGCTTGAATTGAAAAAGCTCCAGAAGTTTTTAAAACCGGAGGAGTTTATAGCTTCGGCGAATGCCAGCGGCGCCGCGGAAACTCATGCGCGTTTAGCGGCACACTACCGCGCGCTAGAAGACGAATACCGCGCCGTACTGCATAGATTAGAATTACAGTGA

Protein sequence:

>DPOGS204995-PA
MAETLEHLVVDGNEVLEIKLVRSADDIEDEQTTFGPEMCHQVFGENENIFGYTDLQIKLYYSAASLQTYLGISYSDKIDPKKTGGLKADDIEEALKKVLAPGYVTNLDVFVSMLEKDKNFTPHGKLIHTFSTIPCDGCESRTFEVYYSEVTTPGFLSFHERIQTFLLWYVDGASFINVDDDQWTFFTVFEKCRNSVGEYRYSVAAYTTVFRYYAYPNNVRPRVSQVLTLPPFRKMGICANLLQAIYSHFIAQPEVVDITVEDPSESFQRIRDFVDVKNCESLPAFQPLKLLQGFSPEMINQARSKFKINKKQARRVYEILRLKNTNTSDKTAYLTYRLDVKNRLNAPFQKKKLELKKLQKFLKPEEFIASANASGAAETHARLAAHYRALEDEYRAVLHRLELQ-
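
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is a complete coding sequence translated with the standard genetic code and that the trailing "-" in the protein record marks the stop codon. The function name `translate` and its `stop` parameter are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop="-"):
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


# First 24 and last 18 bases of DPOGS204995-TA, copied from the record above.
print(translate("ATGGCCGAGACACTAGAACATCTG"))
print(translate("AGATTAGAATTACAGTGA"))

# Length check: 1215 bp is 405 codons, i.e. 404 residues plus one stop codon.
print(1215 // 3 - 1)
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS204995-PA record extends the same check to the whole gene.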