Monarch geneset OGS2.0

DPOGS211442
Transcript: DPOGS211442-TA, 1116 bp
Protein: DPOGS211442-PA, 371 aa
Genomic position: DPSCF300223 - 107849-117149
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius: HMEL007456, 2e-43, 55.84%
Bombyx: BGIBMGA002190-TA, 2e-21, 44.86%
Drosophila: CG4210-PA, 6e-17, 34.90%
EBI UniRef50: UniRef50_D6WL85, 6e-23, 43.33%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WL85_TRICA
NCBI RefSeq: XP_973786.1, 1e-23, 43.33%, PREDICTED: similar to spermidine/spermine N-1 acetyltransferase 2 [Tribolium castaneum]
NCBI nr blastp: gi|91082967, 2e-22, 43.33%, PREDICTED: similar to spermidine/spermine N-1 acetyltransferase 2 [Tribolium castaneum]
NCBI nr blastx: gi|332373346, 8e-22, 44.37%, unknown [Dendroctonus ponderosae]
Group
Gene Ontology: GO:0008152, 1.2e-08, metabolic process
  GO:0008080, 1.2e-08, N-acetyltransferase activity
KEGG pathway: tca:662606, 3e-23
  K00657 (E2.3.1.57, speG) maps to Arginine and proline metabolism
InterPro domain: [2-135] IPR016181, 5.1e-21, Acyl-CoA N-acyltransferase
  [55-128] IPR000182, 1.2e-08, GCN5-related N-acetyltransferase (GNAT) domain
Orthology group:
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211442-TA
ATGCCTGCTGTCGCGGAGATGATCAATGAGCTCGCAGTATACCATAACTCAGTGAGATTTCCGTTGAAACTAGAAAAAGATTTACAGCGCGACGGATTTCAAAGAGAACCGGCGGCTTTTCAATGTATCGTTGCTGAAGTACAGAGACAGAACAAGTGTTCAATAGTCGGGTACGCTCTGTACTACCCCGTGTACTCCACCTGGCGAGGCAAGGCGCTGCTGCTAGAAGATCTGTTTGTGAAGGCACATGAAAGAAAACGTGAAATTGGTAGTTATTTATTCGAAGCTGTAGTGAAGGAGGCGCACCTTGCTGGGTACAGTAGAGTCGACTTTCACGTGGCGGGATGGAACAGCGCGAGGTCATTTTACGAAAGAAAAGGTGCAAAGAACTTGACGGAGACTATGGGTGTATGTCACTATCGACTCACGGGGGCGCCGCTGAGAGCTGCGGCCGCCGCGGCGGACCACGCACATGACTGTTGTTCGAGCGGCTGCACCGTCCTCGACTTCCATGTCGCAGGTTGGAACCGAGCCAGGTCGCTCTATGAACGCTTCGGGGCCGTGATCCTCACGAGCACAGACGAACTACACTACTGGAGACTCAAGGACCAGGCGCTCCACGCGGCCGCGGGACAGGCCCCCGACACCGCGCACTCACCTAAAACCATGTATACAATGATCTGTACCCTCAGAAGTTGGAACCGAGCCAGGTCGCTCTATGAACGCTTCGAGGCCGTGAACCTCACGAGCACAGACGGACTAAACTACTGGAGACTCAAGGCCCAGGCGCCGCGACCGCGCCAGACACCGCGCACTCACCTAAAACCATGTATACCCTCAACGGAGTGTGACACGTTACGACGTAGTTTCATTGTAACAGGATGGTACAGATGGTTCTGTACATCGCTGCTGTGTTGTGTGCAGGCCAGCGGCTGTGGGAACGTTGACGATCAGAATCTTAATTACAAATGCATTACTTGCTTTAATAAAGCGAGCGATAAAGACGCTACAGTTGGACGGTTAGAGTGTGGACGAGAAGTCGTCTACGATAATGAGACAGTCGAGTGTGACCGGAGAGCAGTCGACACACGGCTCAGACGATGTACAAGCATATAA

Protein sequence:

>DPOGS211442-PA
MPAVAEMINELAVYHNSVRFPLKLEKDLQRDGFQREPAAFQCIVAEVQRQNKCSIVGYALYYPVYSTWRGKALLLEDLFVKAHERKREIGSYLFEAVVKEAHLAGYSRVDFHVAGWNSARSFYERKGAKNLTETMGVCHYRLTGAPLRAAAAAADHAHDCCSSGCTVLDFHVAGWNRARSLYERFGAVILTSTDELHYWRLKDQALHAAAGQAPDTAHSPKTMYTMICTLRSWNRARSLYERFEAVNLTSTDGLNYWRLKAQAPRPRQTPRTHLKPCIPSTECDTLRRSFIVTGWYRWFCTSLLCCVQASGCGNVDDQNLNYKCITCFNKASDKDATVGRLECGREVVYDNETVECDRRAVDTRLRRCTSI-
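
The transcript and protein lengths listed above are consistent with each other if the transcript is a pure coding sequence ending in a stop codon: 3 x (371 + 1) = 1116. The sketch below shows one way to check this relationship for a pair of FASTA records. It is a minimal illustration, not part of the database: the function names `parse_fasta` and `expected_cds_length` and the toy records are hypothetical, and treating a trailing `-` or `*` in the protein as a stop marker is an assumption about this record's notation.

```python
def parse_fasta(text):
    """Return {header: sequence} for FASTA-formatted text."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def expected_cds_length(protein):
    """Length in bp of a coding sequence implied by a protein.

    Assumes 3 bp per residue plus one stop codon, and that a trailing
    '-' or '*' is a stop marker rather than a residue.
    """
    residues = protein.rstrip("-*")
    return 3 * (len(residues) + 1)


# Toy records for illustration only (not the full sequences of this gene).
toy = """>toy-TA
ATGCCTGCTTAA
>toy-PA
MPA-
"""

records = parse_fasta(toy)
cds_ok = len(records["toy-TA"]) == expected_cds_length(records["toy-PA"])
print(cds_ok)
```

To apply the same check to this record, the two sequences above would be passed through `parse_fasta` and compared in the same way. A mismatch would suggest that the transcript includes untranslated sequence or that one of the sequences was altered in transfer.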