Monarch geneset OGS2.0

DPOGS203711
TranscriptDPOGS203711-TA5469 bp
ProteinDPOGS203711-PA1822 aa
Genomic positionDPSCF300010 - 1423572-1435281
RNAseq coverage1582x (Rank: top 8%)
Annotation
HeliconiusHMEL0023890.089.86% 
BombyxBGIBMGA003497-TA0.067.83% 
DrosophilaChd1-PB0.066.35% 
EBI UniRef50UniRef50_E2BBM70.063.32%Chromodomain-helicase-DNA-binding protein 1 n=6 Tax=Neoptera RepID=E2BBM7_HARSA
NCBI RefSeqNP_001106734.10.085.15%chromodomain-helicase-DNA-binding protein 1 [Bombyx mori]
NCBI nr blastpgi|1644486420.085.15%chromodomain-helicase-DNA-binding protein 1 [Bombyx mori]
NCBI nr blastxgi|1644486420.086.11%chromodomain-helicase-DNA-binding protein 1 [Bombyx mori]
Group
Gene OntologyGO:00036773.5e-87DNA binding
GO:00055243.5e-87ATP binding
GO:00043862.8e-25helicase activity
GO:00036762.8e-25nucleic acid binding
GO:00056341.5e-15nucleus
KEGG pathway 
InterPro domain[471-748] IPR0003303.5e-87SNF2-related
[464-658] IPR0140012.1e-35DEAD-like helicase
[807-891] IPR0016502.8e-25Helicase, C-terminal
[251-340] IPR0161978.1e-21Chromo domain-like
[255-342] IPR0009531.5e-15Chromo domain/shadow
[376-430] IPR0237804.2e-14Chromo domain
Orthology groupMCL11373 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203711-TA
ATGCACTTGGCTGGTTCTATGGCTGAATCTGGAAGTGAATCTATTAGTGAGAAGGGAGGCAAAAGTGATTCAAGTGGGTCAGGTTCTGGAAGTGACAGTGACAGTGGGTCTAGTTCTTCTGGTTCTGGCTCCGGAGGCAGGTCTGGTTCTGAAAAATCTTCAAATGGGGATCGTTCACACCTAAGTGATGATACCAAGGGATCTCCAAAGCACAGTAATGCTAATTCTTCAAAGTCTGATCGTCACTCTGACAAAGACTCCTCCGATGATTCTATTACAAAGCGTAAGTCAAGAAACAATCATGGAAAAGTTAAATCAGATCTTTGGGAAGATAACCCTGATATTTATGGTATTCGGAGATCAGCCAGGTCAAGGAAAGAGCCAGACAGACTTAAGGTAGCTGATAGTGATTCAAGTGATAAAGGTCAAAGCCACAGTAGAAAAAGTAGGAAAAGAAGTGACTCATGGAATTCAGATACATCAGATAGTGATAGTGATATGAAAGGTTCTCCACCTCCGCCGTCTAAGCGTCCGGGCCAAAGAAGTGTTCCTCTCAGAAAAAAGAAACCCACAAGGAGAAGAAGGTTTACAAGTGACGAAGAAGAAAGTTCAGAAGCATCTGATGAAGATACCAAAAGTCGGACGGCTACTCGACGAACAGGTGCTGCTGTTAGTTACAAAGAAGCCAGTGATGAACAAACAGACTCTTCCGACCTCTGTGGAGATGCTGAAGCTGAACCGGAGCCTGAACCAGAAGACCATAGTGAGACTATTGAAAGGGTTCTTGGCCATAGACGAGGGAAGAAAGGAGTCACTGGAAATGTGACTACAGTTTACTATATAGAGGAAAACGGTGATCCTAATGAGGATTGTAATCCTGATGATGAAGATTCGACTGAACCACAGTACCTGATTAAATGGAAAGGATGGTCGCATATACATAACACTTGGGAGTCCGAGAAAAGTCTTAATGAACAAAAAGTTAAGGGACTTAAGAAATTAGAGAATTATATAAAGAAGGAGGCTGATCTGTCATGGTGGAGGCAGCAAGCTGGTCCTGAAGATATAGACTATTATGAATGTCAGTCTGAACTGCAGCAGGAATTAGTCAAAACATACAACAATGTTGAAAGGATAATAGCTGAACAAACAAGAGAATTAGAAGGAGGTGGAACTGCTCATGAATATTTCTGTAAATGGGAATCTCTACCATATGCCGACGCAACCTGGGAAGATTCATCTCTTATTGAAAAAAGGTGGCCTGAGAAAGCTGAGAATTTTAAAAGTAGGGAAGCTGCTAAGACAACACCGTCAAGGCATTGCCCTGTTTTAAAAAGACGTCCAAAATTTCATCAAGTCAAAGAACAACCAGAGTATATGGGTAAAGATCAGACATATGTGTTAAGGGATTATCAAATGGATGGATTAAATTGGCTAATACATTCCTGGTGTAAAGACAATTCTGTTATTTTAGCTGATGAGATGGGTTTAGGAAAAACTATACAAACAATTTGTTTTTTATATTATCTTTTTAAATCTCAACAACTATATGGACCATTTCTTTGTGTTGTGCCCTTAAGTACTATGACTGCCTGGCAGAGAGAGTTTGCGCAGTGGGCGCCGGATATTAATGTAGTTACATACATTGGTGATGTTACAAGCAGGGATATTATCAGACAGTTTGAATGGAGCTTTGCCAGTTCAAAGAGACTAAAATTTAATGCCATTTTAACTACCTATGAGATATTATTGAAAGATAGACAATTTCTTAGATCATTTAGTTGGGCCTGTTTATTAGTGGATGAAGCACATAGATTAAAAAATGATGATTCTCTGCTTTACAAAGCCCTTAAGGAATTTGAAACAAATCATAGGTTATTAGTTACTGGTACACCTTTACAAAACTCACTAAAAGAATTGTGGGCACTGCTTCATTTTATAATGCCATACAAGTTTGAAACCTGGGAGGAATTTGAAAAAGATCATGAGGATGCTGCCACTAAGGGATATGAGAAGCTCCACAAACAGCTAGAGCCTTTCATTTTAAGAAGGCAGAAAAAAGATGTAGAGAAATCGTTACCTGCTAAAGTGGAACAAATTTTGCGTGTAGAGATGACTTCAATACAAAAACAATATTACAAGTGGATATTGACAAAGAATTACAGTGCACTACGGAAGGGTGTGAAGGGCTCAATTAATACATTTATCAATATTGTAATAGAACTAAAAAAGTGCTGTAATCATGCACTTCTAACCAAACCTGAAGATTTTGAATCAAGAGCATCTCTTGCTACTACAGATGCTGTTGAGAAACTCTTAAGAGGCTCCGGCAAGCTACTTCTTTTAGATAAATTATTGTGCAGGCTTAAAGAAACAGGTCATAGAGTGCTTATATTTTCTCAAATGGTTAGAATGTTGGATATTTTAGCTGAATACTTGCAACGGAGACATTTTCCTTTTCAACGTCTTGATGGAAGCATAAAGGGGGAAATCAGGAAGCAGGCTCTTGATCATTTCAATGCTGAAGGATCTCAAGATTTTTGTTTTTTACTATCAACACGTGCCGGAGGATTAGGAATTAATTTGGCTACAGCTGACACTGTGATAATTTTTGACTCTGACTGGAATCCACAAAACGATCTCCAAGCTCAGGCTCGCGCTCATCGTATTGGGCAAAAAAATCAGGTCAATATTTATCGGTTAGTAACTGCCAGGTCCGTAGAGGAAGATATTGTTGAAAGGGCTAAAAGAAAAATGGTACTTGACCATCTTGTAATTCAAAGAATGGATACAACAGGGAGGACAGTCTTAAATAAACGGGACGCCACCGGAACCAGCGCAAATAATCCGTTTAATAAAGAAGACTTGAATGCTATTCTTAAATTTGGAGCAGAGGAATTGTTTAAAGATGACGATGAAAATGATGAAGACCCTGTATGTGATATAGATGAAATCTTGCAAAGAGCAGAGACAAGGGACGAAGGGCCTTCAATGGCAGGCGATGAACTGCTTTCTGCATTTAAAGTGGCGAGTTTTGCTTTTGACGAAGACAAGGCGGTTATGGAGGTCAAAAAAGAAGGAGGAGATGAAGAAGCAAAAGATTGGGATGACATTATACCAGAAAATGTTAGAAAAACTATAGCAGAACAAGAGAAAAATAAAGAAATGGAAGACTTGTACTTACCACCCAGAAGAAAAAATAATCAGAATAATGCTGATAGTGGTGGTCGTAAGCGTCGCGGTCGTAGCGGAGGTGATGCGGGTGACGGGGGTGAGGCGGGGGACGGTGACGGTGACGGTGACGACAGCGAAGCCTCGGACGGAGACGGAGACGCCAGCGCCGATGACGACCGACCTCGGAAACGGGGACGGCCACCAGCCTCGCACAGAGAAAAGATCAAAGGATTCACTGATCAAGAGATAAGAAGATTCGTCAAAAGTTACAAGAAATTCTCAGCTCCCTTGAAACATCTAGACAGCATTGCATGTGATGCTGAGCTGCAGGAAAAACCACTAGCAGACCTGAAAAAGTTAGGAGAAATTCTTCAAGAAAGATGTAAAGCTGTATTAAATAATACGGCTGATGCACCTAATGAACAAAGTGATGGTCGTAAAAATGCAAGAAAGACTTTTAAACTTGGAGGAGTTCCTGTAAATGCTAAAACAATGGCTGCCTGCCAGGATGAACTTGCACCATTAGATGAATTCTTGCCACAAACCAAGGAAGAGAGGCTCAAATGGCAGTTAGATTTCAGGACAAGACCAGCTAATTTTGATGTCGAATGGGGAGTAGAAGATGATTCGAAGCTATTGGCCGGCATTTATCAATACGGGATGGGATCTTGGGAAGCCATAAAGATGGACTCATCTTTTGAAATCTGTGATAAAATACTTACAAATGAAGATAAAAAGCCGCAAGCAAAGCATTTACAGTCTAGAGCTGAATATTTGTTGAAACTTATCAAAAAGCTTTTAGACCAGAAGAATGGAAAACAAAAACAGAGAAAACCAAGAGTTAAGAAGGGGGCCAAGGAGCCTGTTACAAAAGATATAGTGGATGATGACGTTAGTACTGGCGAAGAAAACAAAAGAACAAGAAGTGCCAAAAATGATAAAAATGACAAGAACAAATCCAAAGTAGAAGAGGTGTTAACCCATGATGAAACATCAAATGATAGAAAAGAAAAGGATAGAAAACGCACAAAGAAGGATGGGAAAGACCGACTTAAAAATGAAAAGGTTAAAGGTCGCAAAAAACCTGCTGGTCCTATGCATTTTACTGCAAATAATGAACCCAGAGCTTTAGAAGTCCTAGGTGACCTTGACCCTTCGGTATTTGAGGAGTGTAAAGAGAAAATGAGACCGGTTAAAAAGGCACTTCGTGCATTAGATAATCCTGACCACACGTTGTCAGACACAGAACAATTGTCGCGAACAAGGGCTTGCCTCACACAAATAGGAAACCAAATAGATATATGTTTATCTGAATATCCAGACCCAGAAAAGAAAGTTGAATGGAGGAGTAATCTCTGGTATTTTGTATCTAAATTTACCAATTTTGACGCTAAGCAGTTGTACAGACTGTATAAGTATGGCCTTAAGAAAACTGATGGAAAGAAGGACAGTAAACACAAAGAAAATAAGTCCAATCTGAAGAACAATGTAAATAGTACAAATAATCACGTCAAAACTCATAAAAAGGAGGTGAAGCAAAACGATAGTGACCGTAGAGATAAAAGGCCTAGACTCGAGAAGGAAAAAGTAGACAAAAATAATGAGAAAACACCTCATGTGTCGGGAGTTAAGAGGAAACTCGAAGAGGGAGAATGTGATCCTGAGCCAACTGAAAATAAACGTCATGAGAGACATAAACACAAGCACAAGGATCGCAGAGATAGGGATGAAAGCGTGAAGAGGGACAATTTGATGTACAGAGAACGGAGCCGGACGGAGCGGGACCGAGAGCGGGATCGGGATCGAGATCGGGACCGAGACCGGGATCGGGACCGCGACCGGGAGCGGGAGCGAGACCGGGACCGGGAACGAGATCGGGACAGGGAGCGATCGAGGACGAGGCGGGACAGTGGAGGCGTGGGGCAACCTCGAGCGAGGCCGCCGTATGCGGCCCACCCGCACGCCGAGCATGCAGCTCATCCAGCGCATCCCGCACATGCCGCGCATCCCGGCTGGACCCCGCCAGCGTATCCTGGCCCGCGGCAGTATCGCGGAGGCGATCGCAATTCACGATCACACGACAAGAGACGGTTCGGAGGTTACGCAGCTATGGGAGGTCCATACAGTGGTTACTATGAGGGCGCGGCGATGGCCAGCGGTATGGGTTTCCCGCCGGTGCGCGCTTACTCGGACGACTGGCGCGGGTACGCGCAGCCCGAGTACCCGTACGAGGACCGGGAGTATGAGCGCGAGGTTTACCGAAGAGATTACGACCGACGTGCGCCTCCAACCTAG

Protein sequence:

>DPOGS203711-PA
MHLAGSMAESGSESISEKGGKSDSSGSGSGSDSDSGSSSSGSGSGGRSGSEKSSNGDRSHLSDDTKGSPKHSNANSSKSDRHSDKDSSDDSITKRKSRNNHGKVKSDLWEDNPDIYGIRRSARSRKEPDRLKVADSDSSDKGQSHSRKSRKRSDSWNSDTSDSDSDMKGSPPPPSKRPGQRSVPLRKKKPTRRRRFTSDEEESSEASDEDTKSRTATRRTGAAVSYKEASDEQTDSSDLCGDAEAEPEPEPEDHSETIERVLGHRRGKKGVTGNVTTVYYIEENGDPNEDCNPDDEDSTEPQYLIKWKGWSHIHNTWESEKSLNEQKVKGLKKLENYIKKEADLSWWRQQAGPEDIDYYECQSELQQELVKTYNNVERIIAEQTRELEGGGTAHEYFCKWESLPYADATWEDSSLIEKRWPEKAENFKSREAAKTTPSRHCPVLKRRPKFHQVKEQPEYMGKDQTYVLRDYQMDGLNWLIHSWCKDNSVILADEMGLGKTIQTICFLYYLFKSQQLYGPFLCVVPLSTMTAWQREFAQWAPDINVVTYIGDVTSRDIIRQFEWSFASSKRLKFNAILTTYEILLKDRQFLRSFSWACLLVDEAHRLKNDDSLLYKALKEFETNHRLLVTGTPLQNSLKELWALLHFIMPYKFETWEEFEKDHEDAATKGYEKLHKQLEPFILRRQKKDVEKSLPAKVEQILRVEMTSIQKQYYKWILTKNYSALRKGVKGSINTFINIVIELKKCCNHALLTKPEDFESRASLATTDAVEKLLRGSGKLLLLDKLLCRLKETGHRVLIFSQMVRMLDILAEYLQRRHFPFQRLDGSIKGEIRKQALDHFNAEGSQDFCFLLSTRAGGLGINLATADTVIIFDSDWNPQNDLQAQARAHRIGQKNQVNIYRLVTARSVEEDIVERAKRKMVLDHLVIQRMDTTGRTVLNKRDATGTSANNPFNKEDLNAILKFGAEELFKDDDENDEDPVCDIDEILQRAETRDEGPSMAGDELLSAFKVASFAFDEDKAVMEVKKEGGDEEAKDWDDIIPENVRKTIAEQEKNKEMEDLYLPPRRKNNQNNADSGGRKRRGRSGGDAGDGGEAGDGDGDGDDSEASDGDGDASADDDRPRKRGRPPASHREKIKGFTDQEIRRFVKSYKKFSAPLKHLDSIACDAELQEKPLADLKKLGEILQERCKAVLNNTADAPNEQSDGRKNARKTFKLGGVPVNAKTMAACQDELAPLDEFLPQTKEERLKWQLDFRTRPANFDVEWGVEDDSKLLAGIYQYGMGSWEAIKMDSSFEICDKILTNEDKKPQAKHLQSRAEYLLKLIKKLLDQKNGKQKQRKPRVKKGAKEPVTKDIVDDDVSTGEENKRTRSAKNDKNDKNKSKVEEVLTHDETSNDRKEKDRKRTKKDGKDRLKNEKVKGRKKPAGPMHFTANNEPRALEVLGDLDPSVFEECKEKMRPVKKALRALDNPDHTLSDTEQLSRTRACLTQIGNQIDICLSEYPDPEKKVEWRSNLWYFVSKFTNFDAKQLYRLYKYGLKKTDGKKDSKHKENKSNLKNNVNSTNNHVKTHKKEVKQNDSDRRDKRPRLEKEKVDKNNEKTPHVSGVKRKLEEGECDPEPTENKRHERHKHKHKDRRDRDESVKRDNLMYRERSRTERDRERDRDRDRDRDRDRDRDRERERDRDRERDRDRERSRTRRDSGGVGQPRARPPYAAHPHAEHAAHPAHPAHAAHPGWTPPAYPGPRQYRGGDRNSRSHDKRRFGGYAAMGGPYSGYYEGAAMASGMGFPPVRAYSDDWRGYAQPEYPYEDREYEREVYRRDYDRRAPPT-