Monarch geneset OGS2.0

DPOGS207043
TranscriptDPOGS207043-TA1245 bp
ProteinDPOGS207043-PA414 aa
Genomic positionDPSCF300001 + 1812613-1814793
RNAseq coverage94x (Rank: top 62%)
Annotation
HeliconiusHMEL0068612e-8252.20% 
BombyxBGIBMGA013166-TA2e-5938.34% 
DrosophilaHdac3-PA1e-6538.51% 
EBI UniRef50UniRef50_UPI0001CB9E155e-9143.77%UPI0001CB9E15 related cluster n=1 Tax=unknown RepID=UPI0001CB9E15
NCBI RefSeqXP_002733814.19e-9243.77%PREDICTED: histone deacetylase 8-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|3287026091e-9046.96%PREDICTED: histone deacetylase 8-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287026098e-9146.96%PREDICTED: histone deacetylase 8-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00044075e-81histone deacetylase activity
GO:00165755e-81histone deacetylation
KEGG pathway 
InterPro domain[1-366] IPR0002866.3e-128Histone deacetylase superfamily
[3-366] IPR0238019e-101Histone deacetylase domain
[2-402] IPR0030845e-81Histone deacetylase
Orthology groupMCL26645 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207043-TA
ATGGACAAAAGTAAAATAAGTTATATATGGGACAACAAATTAATTGAATGTTGTGATAGGTTACCCGCTGTACTGGGTCGGGCTTCAATGGTGCATAGCCTTATTGTCACTTACAATTTATTGGATCAAATAAAAGTAGTTCGTTCCAAACCTGCTTCTTACCAAGACCTTAAACTATTTCATACTGAAACTTATATAGAACATGTAAAAAGTTTTACCGATGTTGACGAGGATTATATGCCAACAAAAAAAGACGAAGAATATGGTTTTGGCTACGATTGCACACCAGTATCAGATATGTATGATATAATATCGAATATTGCTGGAGCTTCCATTACTGCAGCAGAATGTCTTCTTCGTGGCATTGCTGATGTCTCTATAAATTGGTGTGGTGGTTGGCATCATGCTCATAAATCAAAGGCTGAAGGTTTCTGTTATGTAAATGATATTGTGATCGCTATTGAAAAACTGAGGAAAAAATTTTCAAAAGTTTTATACATAGATCTCGATGTGCATCATGGCAATGGAGTGCAAGATGCCTATGATTCAAATGAATCTGTATTTACTCTCTCATTCCATAAATATGAACCAGGTTTCTATCCTGGAACTGGTGCTGTCACAGATATCTCATCCAAAGCTCAAGGTTACTCTTGCAACTTTCCATTACATGCATCATACTGTGATAAAACATTTGTACATGCTTTTAATAATATTATTGCTGAGGTCTATTCTTATTTTAAACCTGATGTAATAGTAGTACAGTGTGGAGCTGATGCTTTGTCTCTGGACCCACATGGGGGTGCAGGACTTACAATAAAAGGTTACTGTACATGTGTATATAAAATATTGCAGACAAAAAAACCTACATTACTTTTAGGAGGAGGAGGGTATAATCATGCAAATGCTGCTAAACTTTGGACAGCTATAACAGCACATGCAGCAGATGTTACATTGGATGAGAACATTCCAGAACATACATATTGGCCTGAATACGGACCTGGTTACACTTTGAAAGTGGAGCCACTTTTAGCCAAAGATTTAAATACAACACAGTATATAGAGCATATTACAGCTATTATAAAAGAAAATTTAAACAAATACCTAGGAGAATATAGATTAAGTGCTGTGTTACCCAGAAAAAGGAAACTGGATTGGGGCAATAATTCTAAATGGAATGACAGTAAAAATGATGATCCAGAAAACAAGACTTTAGAGACATCTGACGTTTATGAATTTAATGAATAA

Protein sequence:

>DPOGS207043-PA
MDKSKISYIWDNKLIECCDRLPAVLGRASMVHSLIVTYNLLDQIKVVRSKPASYQDLKLFHTETYIEHVKSFTDVDEDYMPTKKDEEYGFGYDCTPVSDMYDIISNIAGASITAAECLLRGIADVSINWCGGWHHAHKSKAEGFCYVNDIVIAIEKLRKKFSKVLYIDLDVHHGNGVQDAYDSNESVFTLSFHKYEPGFYPGTGAVTDISSKAQGYSCNFPLHASYCDKTFVHAFNNIIAEVYSYFKPDVIVVQCGADALSLDPHGGAGLTIKGYCTCVYKILQTKKPTLLLGGGGYNHANAAKLWTAITAHAADVTLDENIPEHTYWPEYGPGYTLKVEPLLAKDLNTTQYIEHITAIIKENLNKYLGEYRLSAVLPRKRKLDWGNNSKWNDSKNDDPENKTLETSDVYEFNE-