Monarch geneset OGS2.0

DPOGS207273
Transcript: DPOGS207273-TA  1452 bp
Protein: DPOGS207273-PA  483 aa
Genomic position: DPSCF300008 - 47542-52440
RNAseq coverage: 1029x (Rank: top 12%)
Annotation
Heliconius: HMEL015764  0.0  99.71%
Bombyx: BGIBMGA011747-TA  0.0  85.75%
Drosophila: Rpd3-PA  0.0  90.48%
EBI UniRef50: UniRef50_Q92769  0.0  78.13%  Histone deacetylase 2 n=529 Tax=root RepID=HDAC2_HUMAN
NCBI RefSeq: XP_001848775.1  0.0  90.36%  histone deacetylase Rpd3 [Culex quinquefasciatus]
NCBI nr blastp: gi|170042097  0.0  90.36%  histone deacetylase Rpd3 [Culex quinquefasciatus]
NCBI nr blastx: gi|170042097  0.0  90.36%  histone deacetylase Rpd3 [Culex quinquefasciatus]
Group
Gene Ontology: GO:0004407  6.8e-71  histone deacetylase activity
    GO:0016575  6.8e-71  histone deacetylation
KEGG pathway: cqu:CpipJ_CPIJ006836  0.0
    K06067 (HDAC1_2) maps->
        Pathways in cancer
        Huntington's disease
        Cell cycle
        Chronic myeloid leukemia
        Notch signaling pathway
InterPro domain: [7-456] IPR003084  0  Histone deacetylase
    [1-465] IPR000286  0  Histone deacetylase superfamily
    [3-383] IPR023801  4.6e-130  Histone deacetylase domain
Orthology group: MCL10593  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS207273-TA
ATGGCTATGCAACCCCATAGTAAGAAAAGAGTGTGCTATTACTACGATAGTGATATCGGTAATTACTACTATGGACAGGGACATCCCATGAAGCCTCATCGTATACGTATGACTCATAATTTATTGTTAAATTATGGACTTTACAGAAAAATGGAAATCTATAGACCACACAAAGCGACTGCTGACGAGATGACCAAGTTCCACTCAGACGACTACATCCGCTTCTTAAGATCCATCAGACCGGATAATATGTCAGAATATAACAAACAGATGCAAAGATTCAATGTTGGTGAAGACTGTCCCGTTTTTGATGGACTGTATGAGTTCTGCCAGCTATCAGCGGGAGGTTCAGTAGCCGCTGCTGTCAAATTAAACAAACAGGCATCAGAGATTTGTATCAACTGGGGAGGCGGACTCCATCATGCCAAGAAGTCTGAAGCGTCAGGTTTTTGTTATGTAAATGATATTGTTCTTGGAATTCTTGAGTTACTGAAGTATCACCAAAGAGTGCTTTATATTGACATTGATGTACATCACGGTGACGGGGTCGAAGAGGCTTTTTACACTACAGATAGAGTAATGACGGTCTCCTTCCACAAGTACGGGGAATACTTCCCTGGCACGGGTGACCTTAGAGATATCGGGGCCGGTAAAGGAAAGTACTACGCTGTAAACATTCCTCTGAGAGACGGTATGGACGATGAATCATATGAGTCCATCTTTGTTCCTATCATATCCAAAGTTATGGAGACATTTCAACCCAGCGCTGTTGTCCTGCAATGCGGTGCGGATTCATTAACAGGCGACCGACTCGGCTGCTTCAATCTAACAGTGCGTGGTCACGGCCGTTGTGTTGAATTAGTTAAGAGATTTGGTTTACCATTCCTCCTCGTCGGCGGCGGAGGCTACACAATCCGTAACGTATCCCGATGTTGGACGTACGAAACATCGGTGGCACTGGGCGTGGAGATAGCTAACGAGCTGCCGTACAACGACTACTTTGAATACTTCGGACCGGACTTCAAACTACACATCTCTCCCAGTAATATGTCTAATCAGAACACACTGGAATATTTGGAGAAGATAAAGAATAGACTTTTCGAGAATCTTCGCATGTTGCCTCATGCACCTGGAGTTCAAGTTCAAGCTATACCAGAAGACGCAGTCAACGATGAGTCAGAAGACGAAGATAAAGTGGACAAAGACGAACGACTACCACAGAGTGAAAAAGACAAGCGCATCACGAACGACGGCGAGCTGTCAGACTCCGAGGACGAGGGACGGGACGGGGACGGCAGGAGAGACAACCGCTCGTACCGCGCGCCAAGGAAACGACCGCGCCTCGACAAGGACGCCGCTAAAGACGACGGGAAAGCCGAGGATTCAAAGGACGAAGTGAAAAACTTGAGCAATATGGAGGAACCAAAGAAAGAGCTGCCGCCGAACGCCTGA

Protein sequence:

>DPOGS207273-PA
MAMQPHSKKRVCYYYDSDIGNYYYGQGHPMKPHRIRMTHNLLLNYGLYRKMEIYRPHKATADEMTKFHSDDYIRFLRSIRPDNMSEYNKQMQRFNVGEDCPVFDGLYEFCQLSAGGSVAAAVKLNKQASEICINWGGGLHHAKKSEASGFCYVNDIVLGILELLKYHQRVLYIDIDVHHGDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNIPLRDGMDDESYESIFVPIISKVMETFQPSAVVLQCGADSLTGDRLGCFNLTVRGHGRCVELVKRFGLPFLLVGGGGYTIRNVSRCWTYETSVALGVEIANELPYNDYFEYFGPDFKLHISPSNMSNQNTLEYLEKIKNRLFENLRMLPHAPGVQVQAIPEDAVNDESEDEDKVDKDERLPQSEKDKRITNDGELSDSEDEGRDGDGRRDNRSYRAPRKRPRLDKDAAKDDGKAEDSKDEVKNLSNMEEPKKELPPNA-
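
Consistency check (illustrative sketch, not part of the OGS2.0 record): the transcript and protein entries above can be cross-checked by translating the nucleotide sequence and comparing it with the protein sequence. The sketch below assumes the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The names translate, CODON_TABLE, BASES and AMINO are hypothetical helpers introduced here for illustration only.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position
# (assumption: the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol so the result can be compared
    directly with the protein sequence shown in the record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First five and last five codons of DPOGS207273-TA.
print(translate("ATGGCTATGCAACCC"))  # MAMQP
print(translate("CCGCCGAACGCCTGA"))  # PPNA-

# 483 residues plus one stop codon, three bases each.
print(3 * (483 + 1))  # 1452
```

Both fragments match the start and end of DPOGS207273-PA, and 483 aa plus one stop codon accounts for the full 1452 bp transcript length. Passing the complete sequence from the record to translate extends the same check to every codon.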