Monarch geneset OGS2.0

DPOGS212747
TranscriptDPOGS212747-TA1305 bp
ProteinDPOGS212747-PA434 aa
Genomic positionDPSCF300012 + 477986-480550
RNAseq coverage519x (Rank: top 24%)
Annotation
HeliconiusHMEL0177950.096.08% 
BombyxBGIBMGA013166-TA0.090.82% 
DrosophilaHdac3-PA0.078.03% 
EBI UniRef50UniRef50_Q927693e-15458.29%Histone deacetylase 2 n=529 Tax=root RepID=HDAC2_HUMAN
NCBI RefSeqXP_001655108.10.083.57%histone deacetylase [Aedes aegypti]
NCBI nr blastpgi|1571284110.083.57%histone deacetylase [Aedes aegypti]
NCBI nr blastxgi|910878670.084.62%PREDICTED: similar to histone deacetylase [Tribolium castaneum]
Group
Gene OntologyGO:00044078.2e-304histone deacetylase activity
GO:00165758.2e-304histone deacetylation
KEGG pathway 
InterPro domain[1-427] IPR0002860Histone deacetylase superfamily
[2-432] IPR0030848.2e-304Histone deacetylase
[2-379] IPR0238011.8e-125Histone deacetylase domain
Orthology groupMCL13316 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212747-TA
ATGAGTCAGCAGAGGGTTGCGTACTTTTACAACCCTGATGTGGGCAATTTCCACTATGGGCCCGGACACCCCATGAAGCCGCATAGACTCTCGGTCACCCACAGCCTAGTTCTCAACTATGGGCTCCACAAGAAGATGCAAGTGTATAGGCCATATCGAGCGAGTGCTCATGATATGTGCCGTTTCCACAGCGAGGATTACATAGAATTCCTTCAGAATGTCACTCCGCAGAATATTCAAACATATTCCAAGGATCTACTGCACTACAATGTCGGTGATGACTGTCCAGTATTTGAGGGCCTCTTCGATTTCTGTTCAATGTACACGGGAGCATCTCTCGAAGGTGCCATGAAATTAAACAACAACGCCTGTGATATAGCCATCAACTGGTCGGGCGGTTTGCACCATGCTAAGAAATTTGAACCGTCTGGTTTCTGCTATGTCAATGATATTGTGATTGCTGTACTGGAACTGCTGAAGTATCATCCGAGGGTTCTGTACATTGATATAGATGTACATCACGGGGATGGGGTCCAGGAGGCTTTCTATCTGACCGACAGGGTCATGACAGTTAGTTTCCATAAATATGGCAACTATTTCTTCCCCGGAACCGGTGATATGTATGAAATCGGAGCTGAGAGTGGGAGATACTTTTCGGTTAATGTACCACTGAAGGAGGGTATAGACGATCAGAGCTATGTGCAGATATTTAAGCCAGTTATTTCCAATGTGATGGAGTTTTACCGACCAACTGCCATAGTTCTCCAGTGTGGTGCTGATTCCTTAGCTGGTGATAGGCTCGGCTGTTTCTCACTATCCACGCGGGGTCATGGTGAATGTGTGAAGTTCGTCAAGAACTTGAATGTGCCCACTCTAGTCGTGGGCGGTGGTGGATACACCCTGCGTAACGTAGCCAGATGTTGGACGTACGAAACGTCACTGCTGGTCGACGAAAATATATCAAACGAGCTGCCGTACACCGAATATTTAGAGTTCTTTGCACCAGATTTTCAATTGCATCCAGAGATTAACAGTACATCGAATGCAAACAGCAAGCAGTATTTGGAAGCAATAACAAAACATGTCTATGACAATCTGAAGATGTGTCAACACTCCCCCGCAGTTCAAATGACGCATATACCAGGTGATTTCCTGCCAGAAGAATACAGGATAAAGGAGGAGCCAGATCCCGACATCCGAATCAGTCAAGAGGAAGCGGACAAGATGGTGGAACCAAAGAATGAATTCTTTGACGATGACAAAGATAACGACAAAGAATCTCTGCCGGAGGTCAAGGAACCCTAA

Protein sequence:

>DPOGS212747-PA
MSQQRVAYFYNPDVGNFHYGPGHPMKPHRLSVTHSLVLNYGLHKKMQVYRPYRASAHDMCRFHSEDYIEFLQNVTPQNIQTYSKDLLHYNVGDDCPVFEGLFDFCSMYTGASLEGAMKLNNNACDIAINWSGGLHHAKKFEPSGFCYVNDIVIAVLELLKYHPRVLYIDIDVHHGDGVQEAFYLTDRVMTVSFHKYGNYFFPGTGDMYEIGAESGRYFSVNVPLKEGIDDQSYVQIFKPVISNVMEFYRPTAIVLQCGADSLAGDRLGCFSLSTRGHGECVKFVKNLNVPTLVVGGGGYTLRNVARCWTYETSLLVDENISNELPYTEYLEFFAPDFQLHPEINSTSNANSKQYLEAITKHVYDNLKMCQHSPAVQMTHIPGDFLPEEYRIKEEPDPDIRISQEEADKMVEPKNEFFDDDKDNDKESLPEVKEP-