Monarch geneset OGS2.0

DPOGS205396
Transcript        DPOGS205396-TA  1071 bp
Protein           DPOGS205396-PA  356 aa
Genomic position  DPSCF300407 - 217779-220111
RNAseq coverage   233x (Rank: top 44%)
Annotation
Heliconius        HMEL007155        1e-151   78.13%
Bombyx            BGIBMGA001401-TA  3e-162   78.61%
Drosophila        Sirt6-PB          2e-108   60.44%
EBI UniRef50      UniRef50_E2BAH7   3e-115   54.85%  Mono-ADP-ribosyltransferase sirtuin-6 n=10 Tax=Endopterygota RepID=E2BAH7_HARSA
NCBI RefSeq       XP_001599869.1    3e-118   61.99%  PREDICTED: similar to chromatin regulatory protein sir2 [Nasonia vitripennis]
NCBI nr blastp    gi|156546904      6e-117   61.99%  PREDICTED: NAD-dependent deacetylase sirtuin-6-like isoform 1 [Nasonia vitripennis]
NCBI nr blastx    gi|91077210       9e-117   60.70%  PREDICTED: similar to chromatin regulatory protein sir2 [Tribolium castaneum]
Group
Gene Ontology     GO:0006342  1.8e-98  chromatin silencing
                  GO:0070403  1.8e-98  NAD+ binding
                  GO:0006355  1.8e-98  regulation of transcription, DNA-dependent
                  GO:0016811  1.8e-98  hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amides
                  GO:0008270  1.8e-98  zinc ion binding
                  GO:0006476  1.8e-98  protein deacetylation
KEGG pathway
InterPro domain   [1-273]  IPR003000  1.8e-98  NAD-dependent histone deacetylase, silent information regulator Sir2
Orthology group   MCL15391  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205396-TA
ATGTCCTGTAATTATGCGGAGGGCCTTTCGCCATATGAAGATAAAGGAGTTCTCGGTATACCTGAGAAATTTGACTCTATAGAGAAATTAAATGAGAAGTGCAAAATCTTAGCGGAACTTATAGAAACAAGTAAACACATAGTAGTTCACACTGGGGCTGGCATAAGTACCACAGCTGGGATTCCAGATTTCAGAGGCCCAAACGGCGTGTGGACATTAGAAAAGAAGGGTAAGAAGCCCTCAATTAACATATCATTTACCGATGCTAAACCAACAAAAACACACATGATACTAAAAAATCTAGTCGAATGCAACAAAGTTCAGTACATAATCAGCCAAAACATTGACGGTCTGCATTTGAAGTCAGGGTTACCAAGGAAATACCTGTCGGAACTGCATGGCAATATGTTCATTGATGAATGTAACCTGTGTAAAAAACAGTTTGTAAGAAGCAGTCCCGTTGAAACGGTTGGTAAGAAATGCAGCGGAGTGCCTTGTGCTTCTGCTCATGCCGGTGGTAGACCCTGTAGAGGTCGTTTGTATGACGGAGTATTGGATTGGGAGCACAGTCTGCCAGAAAATGATTTGTTAATGGCCGAGTGGCATTCCAGTGTCGCTGACTTGAGTATATGTTTAGGTACAACCCTACAAATTGTGCCAAGCGGTAACCTACCTTTGGATACAGTGAAATATGGTGGAAAATTGGTTATATGCAATTTGCAGCCGACAAAACATGATAACAAAGCGGATTTAGTTATAAACTACTATGTGGATGACGTGTTAGAGAAAGTCATGGACATTATGAAAATAGAGATCCCACAACATAATGAAGGAGATAATTTACTGATAAAGGCAGAAACATCTATAATAGATTGGACGATAAGCAGAAAGGACGTTTTAGAAATGGAGAAAATATTCAAAGCGAAATGTAAAGGTGTTAAAAAGAAACGTGTTCTAATAAAGAAGAGAAACATTACGGATGTGAACAAGAATGATGATAACACGAAAATGATGAAACTGGAAGCAGACGACAAAGATCTTAGTAATACAAAAGAGATTCTTGCTTTTTAA

Protein sequence:

>DPOGS205396-PA
MSCNYAEGLSPYEDKGVLGIPEKFDSIEKLNEKCKILAELIETSKHIVVHTGAGISTTAGIPDFRGPNGVWTLEKKGKKPSINISFTDAKPTKTHMILKNLVECNKVQYIISQNIDGLHLKSGLPRKYLSELHGNMFIDECNLCKKQFVRSSPVETVGKKCSGVPCASAHAGGRPCRGRLYDGVLDWEHSLPENDLLMAEWHSSVADLSICLGTTLQIVPSGNLPLDTVKYGGKLVICNLQPTKHDNKADLVINYYVDDVLEKVMDIMKIEIPQHNEGDNLLIKAETSIIDWTISRKDVLEMEKIFKAKCKGVKKKRVLIKKRNITDVNKNDDNTKMMKLEADDKDLSNTKEILAF-