Monarch geneset OGS2.0

DPOGS214428
TranscriptDPOGS214428-TA1131 bp
ProteinDPOGS214428-PA376 aa
Genomic positionDPSCF300069 + 532441-539568
RNAseq coverage103x (Rank: top 60%)
Annotation
HeliconiusHMEL0202223e-5165.99% 
BombyxBGIBMGA011234-TA3e-6156.19% 
DrosophilaCG15766-PA1e-0826.24% 
EBI UniRef50UniRef50_UPI000206345B2e-2732.72%UPI000206345B related cluster n=1 Tax=unknown RepID=UPI000206345B
NCBI RefSeqXP_001602094.15e-2838.12%PREDICTED: similar to predicted acetyltransferase [Nasonia vitripennis]
NCBI nr blastpgi|3287763977e-2732.72%PREDICTED: hypothetical protein LOC724126 [Apis mellifera]
NCBI nr blastxgi|1700288841e-2833.16%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00081526.8e-05metabolic process
GO:00080806.8e-05N-acetyltransferase activity
KEGG pathway 
InterPro domain[165-368] IPR0161812e-10Acyl-CoA N-acyltransferase
Orthology groupMCL10783 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214428-TA
ATGAAGTGGAAAAGACCAGAAACTGTCCCTCTGGGGAGAGTGTGGAGTCGCTTTGAAGGAAAACAGAGAAATGGCAAGCCCGCAGAAATGTATCAGATAGTAGACATGAGCGAGTCCGTGAGGAGGCAGTGCCTCGACATGATGCAGGAGACATTCCTCCGAGACGAGCCGCTCTCACTGGCGTTAAATATAAAAACAGACGCGGAATCTGTCACATCGATACGTAACAATTGGGAGGAGATGCTCTCACAGAACATTTCCATCGCTTGTTTCACTGAAGAAGAGGGGCGTACCAAGGAGCTGGTGGGATTCAACATACTTATAGTGAAGACCAAGGAAGACGGTCACGAAGAGTTTGAAAATGTAAGTGGACGAGACCAGAAACTGTCCCTCTGGGGAGAGTTTGGAGTCGCTTTGAAGGAAAACAGAGAAATGGCAAGCCCGCAGAAATGTGAGACTTTTTCGACATACATATATATAGACATAATACTAAGGTATCAGATAGTAGACATGAGCGAGTCCGTGAGGAGGCAGTGCCTCGACATGATGCAGGAGACATTCCTCCGAGACGAGCCGCTCTCACTGGCGTTAAATATAAAAACAGACGCGGAATCTGTCACATCGATACGTAACAATTGGGAGGAGATGCTCTCACAGAACATTTCCATCGCTTGTTTCACTGAAGAAGAGGGGCGTACCAAGGAGCTGGTGGGATTCAACATACTTATAGTGAAGACCAAGGAAGACGGTCACGAAGAGTTTGAAAATGTCAAAGGTGAAAAATGGGAGAAGCTTCTGAAGACCCTCATCACGGCCGAGGAGCTGGTGGATATTTTCAGTCATTATGACGTTGATACTTACCTGTCCTCGAGCGGACTGACAGTGTCGCCAGCCCACCGAGGACAGAACATCGGGGCGAGGATGATACAAGTCAGAGAGGACATGTGTAAAGCGTTCGGTATCAAGGCGGTGTCCACCGTGTTCACGGCCACCTCATCACAGGTCCTGGCAGCGAAATGTGGATACGAGGTCCTCGCCGCGCTGCCCTACACCCACATGCTGCAGTACGGGATTGACCTGACTATGAGTGAGACCCCCCTCGCTAAAGTCATGGGAAAGAAATACTATTGA

Protein sequence:

>DPOGS214428-PA
MKWKRPETVPLGRVWSRFEGKQRNGKPAEMYQIVDMSESVRRQCLDMMQETFLRDEPLSLALNIKTDAESVTSIRNNWEEMLSQNISIACFTEEEGRTKELVGFNILIVKTKEDGHEEFENVSGRDQKLSLWGEFGVALKENREMASPQKCETFSTYIYIDIILRYQIVDMSESVRRQCLDMMQETFLRDEPLSLALNIKTDAESVTSIRNNWEEMLSQNISIACFTEEEGRTKELVGFNILIVKTKEDGHEEFENVKGEKWEKLLKTLITAEELVDIFSHYDVDTYLSSSGLTVSPAHRGQNIGARMIQVREDMCKAFGIKAVSTVFTATSSQVLAAKCGYEVLAALPYTHMLQYGIDLTMSETPLAKVMGKKYY-