Monarch geneset OGS2.0

DPOGS215725
TranscriptDPOGS215725-TA3591 bp
ProteinDPOGS215725-PA1196 aa
Genomic positionDPSCF300041 + 302742-313697
RNAseq coverage333x (Rank: top 35%)
Annotation
HeliconiusHMEL0096513e-11550.61% 
BombyxBGIBMGA005759-TA0.065.14% 
DrosophilaHDAC6-PC0.045.19% 
EBI UniRef50UniRef50_F4WUL20.047.71%Histone deacetylase 6 n=3 Tax=Bilateria RepID=F4WUL2_ACREC
NCBI RefSeqXP_002431644.10.046.33%histone deacetylase hda2, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3320217680.047.71%Histone deacetylase 6 [Acromyrmex echinatior]
NCBI nr blastxgi|3320217680.046.53%Histone deacetylase 6 [Acromyrmex echinatior]
Group
Gene OntologyGO:00082703.1e-15zinc ion binding
KEGG pathway 
InterPro domain[151-1130] IPR0002860Histone deacetylase superfamily
[598-951] IPR0238012e-120Histone deacetylase domain
[1083-1181] IPR0130836.3e-21Zinc finger, RING/FYVE/PHD-type
[1107-1167] IPR0016073.1e-15Zinc finger, UBP-type
Orthology groupMCL12328 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215725-TA
ATGGGAGGGAAAGCAAAGCCAACAAACGTTGGTCTATTGAATATACCAATTACTACTACAGTTGCTAACACAGCCACCCAATTCAAAGAGTGCCCTGACACCAACCATAGGAGGACAGAGAAAAATATTGTTTCAAATGCATTTGAGAAGCTTCTGCAGCAGTATACCAATACCACATACGAACTCGCAAAAATGTCGACCTCGAGTTCGCCGACTGCGGCTCGTCGTACCGCTAGTGAGGTTAAAAAGACATCTCCTTCATCTATTGTTACAAGAAATGGTGCTAGAAAAGCCAAAATACAGACACGTGCTATGGCAGCAGCAAAGCCGTCAGCGTCTGTGGTGGCAGCCAAACGGAAGGCTCAGCAGAAGAAAAAATTTACAATGGACACAGTGCTTAGAGATCCTTATCAAACGGCCATGGAGTCTAAATTCAAAGTGAAGGGTGCTACAGGCTTTGTGACTGACCCTCGGATGTCTGAACACCGTTGCCTATGGGATGATAACTATCCGGAGTGTCCCGAAAGACTGATAAGTGTCATTAACAGATGTCAGGAGCTGAATCTAATAGAGCAGTGTAAAGTGTATCCTCCCCGGAGCGCCACCCGCGAGGAGGTGTTGGAACTGCACTCGCCATCCGTCTACAGTATGATGGAGGGAACCCATCAGAACCAGGACCTGGAGTATTTGGAGGAACTGTCTGCTGGCTTCGATGCGGTTTATATACACCCTACTACTCACGAGCTGGCTTTATCGTCTGCTGGTTGTACCCTGGATATGGTGGAACGCCTGGTCTCCGACGAGCTGCAGAACGCCGCGTGTATGGTGAGGCCGCCCGGACACCACGCCATGAGGGCGGAGCCCTGTGGATACTGTATCTACAACAACGCAGCCCTCGCGGCGAATAGGGCTCTCAAACTTGGGCTACAGAGAATATTAATAGTGGACTGGGACGTGCATCACGGACAAGCCACACAACAGATGTTCTACGACGATCCGAGAGTGGTGTACTTCTCCATCCATCGTTACGAGCACGGGGCGTTCTGGCCCAACCTGCGGCAGTCAGACTTCCCTTACATAGGGAGCGGGCAGGGCGAGGGTCACAACTTCAACGTGCCCCTCAACAACACCGGCATGACGGACGCGGACTATATCGCTATATGGCATCAGTTGTTACTGCCCATGGCTTTTGAGTACCAGCCGCAGCTGGTGCTGGTGTCTGCGGGATACGATGCCGCAGTCGGCTGTCCTGAGTTCTCCCCAGAACTGATCATAGTGTCAGCTGGTTATGATGCCGCACTCGGTGATGAGAAGGGTGAGATGGAAGTGACCCCCGCGTGCTATGCCTCCCTCCTGCACATGCTGCAGGGAGTGTGCTCCCGCGTGTTGGTGCTGCTAGAGGGCGGGTACTGTCTGCGCTCGCTGGCCGAGGGCGCCGCCCTGACGCTGCGGACGCTACTGGGACACGCGCCGCCCGCCCTGCCGCCGTTGCAAGAACCGTGCGAATCGATAAGAGATTCGATCCTGAACTGTATATACTCGCACAAGAAACACTGGAGGTGCTTCAACAACCAGCCGAGCTACAGCATTGACCCCTCCGTGTTGAACACGGGCGAGCGAGGGGTAGGTCAGCATACAGTGGTCATGAAGTGGGAAGGGGATGAGACCAGAGCCGACAGGTTCGCCACCAGGAACTGCTATCCATTACAGACCGACGACACCAGGAAGAGGATCCAGGACAGACTGAACCATTTACAATTGGCCACGGACGTCAGCTGCTCCGAGCACCCCGTGTGTTACATCTACGACCCGGCCATGTTGAAGCATCATAATGTTTGTGAACCTGGCCACGTGGAGTGTCCAGAACGTATAATGCGTATACATGAGCGGCACAGGGATTTCGGTCTGTTGGAACGCGTCCACCGCCTGCCGCCCAGGAGCGCCGCCGACGACGAGATACTCGCTGTGCATACTGAGAAGTATCTAGATAGCCTGAAGGAGTTGTCGAGTACGAAGCTTCGAGATCTGAACGCACAGAGGAAGTCCTTCGACTCTGTATATTTCCACCCTGACTCCTTACAAAGCGCCGCAGCCGCTGCCGGCGCCGTTATACAGATGGTAGACGCCGTTCTAAGACACGGCAGTGGCGGAGTGTGCGTGGTCCGGCCGCCGGGACATCACGCGGACGAGGACGTGCCGAGCGGGTTCTGTCTCCTCAACAACGTGGCGGTGGGCGCCAGGCACGCTCGCGCCGCCCACGGACTGACCAGGATCATGATACTGGACTGGGACGTTCATCACGGGAACGGCACGCAGAGGATCACATATGAAGATAAAGAGATACTGTACATATCGATCCACCGCTATGACAACGGCTCGTTTTTCCCCAACTCCCCCGCGGCCGACCACACCGCCGTGGGCCAGGGTCGGGGGGAGGGGTACAACATTAACATACCGTGGAATAAACGAGGTATGGGAGACGCGGAGTATCTGTCAGCGATGTGTTCCGTGGTGTTGCCGGTCGCCTACGAGTACGGGCCGCAGCTCGTGCTGGTGTCGGCCGGCTTCGACGCGGCCGTCGGAGATCCCCTCGGAGGTTGCAAAGTAACGCCGGAGTGCTACGGGCGGATGACGCACATGTTGCGGGGTCTGGCCGGGGGGCGGGTCATTGTGTGTCTAGAGGGCGGCTACAACGTCACCAGCATATCGTACGCCATGACCATGTGCACTAAGGCCCTCCTCGGAGACCCGCTGCAACATCAGTACGACCCCAAACAAACGGTCAACCCGTCCGCCGTGGAGAGCATCAATAACGTGATCAGAACACACCAGAAGTATTGGAAATCTCTAAAGTTTCAACTAGCCCTGCCCATGGAGGACGTCATTGGCCCGCTCCCCAAATCCAGAGGCTTACCTGACTCCGAGCCGGCGCCGGACCACGCTAAGATCGTGAGCGACAAACTGAAAAAATATGCACACGATAATAATCTAGCAAAAGAAATATCAAACTTGAGCATAAGGAACGACTGCACCGACGGCATCCACTGCGGCACCGACGACGAAAACGATGACGGCGTTCATAAAACTAAGACAAACGACGGCGCCGGCGCCAGCCACGGGACTGGGAGTGCGGGGCCCAGCGGCAATAAACAGAACCGCACGGGAAGCGAACCCACGACCCTAGTGGACTATCTCTCGGAGAACATGCAGGCCATAGTCAACGAGGAGATGTTCGCTGTGATACCTTTAACCTGGTGCCCTCATCTCGACATGCTGCACGCCGTCCCCGAGGGCGTGCACTTCCAACAAGGAGTCCAATGTGTCAGCTGTGATCACGTAGAAGAGAATTGGGTCTGTCTGCACTGTTACATTACCGCCTGCGGCAGACATGTGAACGGTCATATGCAGGACCACTTCAAGGCAGCGCAGCATCCCTTATCGCTGTCTCTCTCCGACCTCTCGGTGTGGTGCAGTGTGTGCGACGCCTACGTCGACAACCATCTCCTCTACGACGCCAAAAACAACGCCCACAGATGTAAATTCGGCGAAGACATGCCCTGGTGCTATAACAATACAATACAAATGCACTGA

Protein sequence:

>DPOGS215725-PA
MGGKAKPTNVGLLNIPITTTVANTATQFKECPDTNHRRTEKNIVSNAFEKLLQQYTNTTYELAKMSTSSSPTAARRTASEVKKTSPSSIVTRNGARKAKIQTRAMAAAKPSASVVAAKRKAQQKKKFTMDTVLRDPYQTAMESKFKVKGATGFVTDPRMSEHRCLWDDNYPECPERLISVINRCQELNLIEQCKVYPPRSATREEVLELHSPSVYSMMEGTHQNQDLEYLEELSAGFDAVYIHPTTHELALSSAGCTLDMVERLVSDELQNAACMVRPPGHHAMRAEPCGYCIYNNAALAANRALKLGLQRILIVDWDVHHGQATQQMFYDDPRVVYFSIHRYEHGAFWPNLRQSDFPYIGSGQGEGHNFNVPLNNTGMTDADYIAIWHQLLLPMAFEYQPQLVLVSAGYDAAVGCPEFSPELIIVSAGYDAALGDEKGEMEVTPACYASLLHMLQGVCSRVLVLLEGGYCLRSLAEGAALTLRTLLGHAPPALPPLQEPCESIRDSILNCIYSHKKHWRCFNNQPSYSIDPSVLNTGERGVGQHTVVMKWEGDETRADRFATRNCYPLQTDDTRKRIQDRLNHLQLATDVSCSEHPVCYIYDPAMLKHHNVCEPGHVECPERIMRIHERHRDFGLLERVHRLPPRSAADDEILAVHTEKYLDSLKELSSTKLRDLNAQRKSFDSVYFHPDSLQSAAAAAGAVIQMVDAVLRHGSGGVCVVRPPGHHADEDVPSGFCLLNNVAVGARHARAAHGLTRIMILDWDVHHGNGTQRITYEDKEILYISIHRYDNGSFFPNSPAADHTAVGQGRGEGYNINIPWNKRGMGDAEYLSAMCSVVLPVAYEYGPQLVLVSAGFDAAVGDPLGGCKVTPECYGRMTHMLRGLAGGRVIVCLEGGYNVTSISYAMTMCTKALLGDPLQHQYDPKQTVNPSAVESINNVIRTHQKYWKSLKFQLALPMEDVIGPLPKSRGLPDSEPAPDHAKIVSDKLKKYAHDNNLAKEISNLSIRNDCTDGIHCGTDDENDDGVHKTKTNDGAGASHGTGSAGPSGNKQNRTGSEPTTLVDYLSENMQAIVNEEMFAVIPLTWCPHLDMLHAVPEGVHFQQGVQCVSCDHVEENWVCLHCYITACGRHVNGHMQDHFKAAQHPLSLSLSDLSVWCSVCDAYVDNHLLYDAKNNAHRCKFGEDMPWCYNNTIQMH-