Monarch geneset OGS2.0

DPOGS213117
TranscriptDPOGS213117-TA5136 bp
ProteinDPOGS213117-PA1711 aa
Genomic positionDPSCF300016 + 338236-351699
RNAseq coverage820x (Rank: top 16%)
Annotation
HeliconiusHMEL0097750.089.54% 
BombyxBGIBMGA000930-TA5e-13143.20% 
Drosophilabrm-PA0.071.73% 
EBI UniRef50UniRef50_E2AFG30.071.52%ATP-dependent helicase brm n=20 Tax=Coelomata RepID=E2AFG3_CAMFO
NCBI RefSeqXP_001650090.10.068.57%helicase [Aedes aegypti]
NCBI nr blastpgi|1571081290.068.57%helicase [Aedes aegypti]
NCBI nr blastxgi|2700012590.064.07%brahma [Tribolium castaneum]
Group
Gene OntologyGO:00036773.6e-101DNA binding
GO:00055243.6e-101ATP binding
GO:00055153.8e-37protein binding
GO:00043861.5e-24helicase activity
GO:00036761.5e-24nucleic acid binding
GO:00168171.8e-11hydrolase activity, acting on acid anhydrides
GO:00056344.9e-08nucleus
GO:00063554.9e-08regulation of transcription, DNA-dependent
GO:00168184.9e-08hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
KEGG pathway 
InterPro domain[890-1182] IPR0003303.6e-101SNF2-related
[883-1075] IPR0140017.4e-38DEAD-like helicase
[1543-1653] IPR0014873.8e-37Bromodomain
[621-693] IPR0139992.5e-28HAS subgroup
[1243-1327] IPR0016501.5e-24Helicase, C-terminal
[621-693] IPR0065626.8e-17HSA
[768-812] IPR0065761.8e-11BRK domain
[320-350] IPR0149784.9e-08Glutamine-Leucine-Glutamine, QLQ
Orthology groupMCL10771 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213117-TA
ATGGCGAGTCCTTCACCGCAAAGTAGTCCTATGCCACCGCCACAAGCTCCAAGCCCAATGGGGCCACCAACTCAGAGCCCCGCTCCACCGCAGTCTCCACATAGTCCATATAATCAGCAGCATGTTAATGGACCGCCACCTTCAGCTCATCCTCCCGGATCAGGCCCACCTCCAATGCAAAATCATATGCCCCCCGGCCCGCCACACACTATCGCGTCCAATACTAATGGACCTCCAAGTGTGGTGCAACATCCTGGCATGCCACCAAGCGGACATCAAATACCACCACATATGGTAGGACCACACATGTCTGGTCCTCCAGGGCATCCAATGAGTGCTTCTGGGTCTCATCCACCTGGTCCAAATGGACATTCCAATATGCCAAGTAACCCGCAACACCCAAACATGCCTGGACAAGGCCCCCCTCATGGTTATATGCAACATCAAATTGGACATATGCCCCCTAACCAGGGTTCCATGGCAATTGGGGGTGGTCCACCACCACCTGGTAATGGGCCTCAAGGTCCCCCAGGTCCATCAGGAGTACCACTGCCCCTTGGTGGACCTCCGCCGCATGGGGTACATCCACAGGGTATGCCCCCTATGCCACCACACCATATGATGTCGTCTCAAGGTGGATACTCACATACGTCTTCGCCGGGTGCTCCAACACCACCCGGTGCCCCATCCGGCCAGACACCTCCAGGAGGACCACAGCAGCCTCCCACCGCACAAACCCCTCCTCATGGGTCTGCACCACCTCCGACCTCATCAGCCAATGGTGCACCATCGTCCTCCCCAATGCAAGCCTCCCTAGCTGCCTCGGGTCCGGACAATCTAAATGCTCTACAGCGTGCTATTGATTCCATGGAAGAGAAAGGCTTGCAAGAGGATCCGCGGTATTCACAATTGTTAGCCATTAGAGCTCGTTCAAACTCCCAGGACCCAAGCAAAGGACTGTTTTCAAACACCCAGCTGAGTCAGTTGAAGGCTCAAATAGCAGCATACAGAAACTTGGCTCGCAACCAGCCCATCACACAACAGATTGCAATGATGGCAGCCGGCAAACGTACTGGAGACTCTCCACCAGAGTGTCCTACACCACCAGCACAGCCGCCCTCACCATACAGCCAAGGACTAAGCCAAGGCGGCTCCCAGAGCGGCGCGGGTGGGAAGGGAACAGCGGGAGATGCGGCCCGAGGCGGCGGAGGAGCTCCGCCTACACCACTACCAATGACGGGACAGATGGCGCCACCTACACAACCAACACCACCCTTGGTCAATCCGAGCGGCGCGGGTGGGAAGGGGACAGCGGGAGATGCGGCCCGAGGCGGCGGAGGAGCTCCACCTACACCACTACCAATGACGGGACAGATGGCGCCACCGACACAACCAACACCACCCTTGGTCAATCCGCCATTGGGCCCTGCTGGTCCTATCCGTGGAACAGCTCCCCGTCCGCCCGGTCCCGGGGGACCAACGCAACAGACGCAACAACAGCCAGGGGTACCGACTCCGGGAGCAAAACAAAATCGTATAACAGCGATACCTAAGCCGGTCGGCATTGATCCACTACAAATACTCAACGAAAGAGAAAATAGAATTGCTGCCCGTATCGCTCACCGTATGGAGGTATTATCAAATTTGCCTGCGAACATATCAGAGGATCTTCGTTTACAAGCACAGATAGAACTACGGGCTCTTAGGGTGCTCAACTTCCAGAAGCAGCTGCGTGCAGAGATATTAGGACAAGTCCGTCGCGACACCACACTAGAAACCGCTGTGAACATAAAGGCGTACAAACGTACTAAGCGGCAAGGACTGCGTGAGGCTCGAGCCACCGAGAAACTCGAGAAGCAACAGAAATTGGAAGCTGAGAGGAAACGCAGGCAGAAGCACCAAGAGTTCCTACAGACGGTCTTACAACATGCGAAGGACTTCAAAGAGTATCACCGCAACAACATCGCTAAGCTGTCGAGACTGAACAAGGCGATCATGACTCACCACGCCAACGCTGAGAAGGAACAGAAGAAGGAACAGGAGCGTATAGAGAAGGAGCGCATGAGACGTTTGATGGCTGAAGATGAAGAGGGCTATAGGAAGCTTATTGATCAGAAGAAAGACAAGCGTCTGGCTTTCCTTCTATCGCAGACGGACGAGTACATCGCCAGCCTCACTGAAATGGTGAAACAACACAAACAGGAACAGAGAAAGAAACAACAAGAGGAAGAGAAACGCAAGAGAAAATCACGCAAGAAGAAGGTCTTGGAAGGTGGGGAGATAGATGCCTTGGATGATAGCTCACAGACATCTGACTCACGAGTCAGTGTTATGGATCCTAAGAGTGGAGAGGTGCTGAAAGGGGAAGAAGCACCGTTGCTGTCTCAGTTAAAGGACTGGATGGAGACGCACCCGGGCTGGGAAGTACTATCAGACTCGGACGATTCGGGAGATGACAGCCAAGACGAATATGGTCGAAAGGGAGGTCACAAGGCTGAAAACAAAGAAAAGAGTGAGGAAGAAAAGAATCGCGAGTTGATTAAGAAGGCCAAAGTTGAGGACGACGAATATAAGACTGAGGAACAGACGTACTACAGCATCGCTCACACTGTCCACGAGTCCGTCACAGAACAAGCTAGTATACTTGTGAATGGAAATCTAAAGGAATATCAAATTAAGGGTCTCGAATGGCTGGTTTCTCTATTTAATAATAATCTCAATGGCATTTTGGCGGACGAAATGGGTCTGGGAAAGACTATACAGACTATAGCATTGGTTACTTACCTTATGGAGAAGAAAAAAGTCAATGGACCATTCCTTATCATTGTACCGCTAAGTACACTTTCAAATTGGGTGTTGGAGTTTGAGAAGTGGGCGCCGACCGTTCAAGTGGTATCATATAAAGGGTCGCCTCAATCGCGACGCCTCTCGCAGAGCCAGCTGAGAGCTTCCAAGTTCAACGTCCTCCTCACTACTTACGAATATGTAATCAAGGATAAGAGCACATTGGCCAAGATACACTGGAAGTACATGATCATCGACGAGGGTCACCGTATGAAGAACCACCACTGTAAGTTGACCCAGGTGTTGAACACGCATTACGTAGCGCCGCACCGCCTGCTGTTAACTGGTACCCCGCTGCAGAACAAACTTCCTGAACTATGGGCGTTACTCAATTTCCTGCTTCCCTCTATATTCAAGAGCTGCTCCACTTTCGAGCAATGGTTCAACGCTCCGTTCGCTACAACGGGAGAAAAGGTTGAATTAAATGAAGAAGAAACAATACTTATCATCCGTCGTTTGCACAAAGTGTTGCGGCCGTTCTTGTTGCGACGTCTCAAAAAGGAGGTGGAGAGCCAGCTACCTGATAAAGTGGAATACATCATTAAGTGTGAGATGAGTGGCCTCCAGAGAGTACTCTACAAACACATGCAGTCGAAGGGCGTGCTACTGACTGACGGCTCTGAGAAGGGAAACAAGGGCAAGGGCGGGGCGAAGGCTCTCATGAACACCATCGTACAACTACGAAAACTATGTAACCATCCCTTCATGTTCCAACATATCGAGGAGAAGTTCTGCGACCACATCGGTACTGGCGGAGGAATTGTCACAGGACCCGATCTGTACCGTGTTTCTGGTAAATTTGAACTCCTGGATCGCATCTTGCCTAAGCTGAAACAAACCGGTCATAGGGTGCTAGTGTTCTGTCAAATGACCCAATGCATGACCATCATCGAAGATTACCTGTCTTGGAGGGGATTCCAGTATCTGAGGTTGGACGGTATGACGAAAGCCGAGGATCGTGGGGAACTGCTGAAGAAGTTCAACGACGTGGGCTCTGATTACTTTATCTTCTTGTTGTCGACTCGTGCCGGAGGTCTGGGCCTCAACCTGCAGAGCGCTGATACTGTTATCATCTTTGACTCTGATTGGAATCCCCATCAGGATCTACAAGCACAAGATCGAGCACATCGTATAGGACAGCGAAACGAAGTGCGCGTTCTACGTCTCATGACGGTGAACTCTGTTGAAGAAAGAATTTTAGCAGCTGCCAGATACAAACTGAATATGGACGAGAAGGTTATTCAAGCTGGTATGTTCGATCAGAAGTCCACCGGATCTGAGCGACAACAGTTCCTGCAGAGCATACTACATCAAGACGGAGACGATGAAGAGGAGGAGAACGAAGTGCCAGACGACGACTTGATCAATGAAATGATAGCACGCTCCGAAGAAGAATTGGAGATATTCAGACGGATTGATCTCGAGCGCAAAAAGACTGAGACACAGACGCGGCTCATTGACGAATCAGAGCTACCTGACTGGCTGGTGAAGACCGATGACGAGGTTGTGTGTAATAAGGGCCAAGGTTGGAATTATCCAGACGAGGATGAGACACTCGGTCGCGGATCGAGACAACGCAAGGAGGTCGACTACACCGACTCACTCACTGAGAAAGATCTCCTGCAGGCTATTGACGAAGATATGGACGAAGAAGACGACGACGACGATGACGATGAGGTGTTGGATAAGAAGAGGCGGCGGGGAAGGAAGCGACGACGGAATCAAGATGATTCTGATGAAGACGAAGTGCCGTGCACCTCGAGGAGAAAAAGCAAAACTGAACTTAATCAGCTCAAGAAACGTTTGAAGAATATTATGAAAAAAGTCATTGATTATTCTGACGAGAATGGTCGCGTACTATCAGAACCGTTTATGAAGCTGCCGTCTCGTCGTGAACTTCCCGATTATTATGATGTCATTAAAAAACCGCTCGACATTAAGAAGATTATGAACAGGATCGAGGATGGAAAGTACACTGACATATCAGACTTGGAGCGGGATTTCTTCACTCTGTGTGCAAACGCTCAAACTTACAATGAAGAACAGTCTCTTATTTACGAGGACTCTGTACGTTTACGGAACGTTTTTATAGAAATACGACGTCGCTATGACAGCGGTCAGAACTCGGACGACTCGGATGAAAACGATAAAGACGACGACGATTCTGATGGGGAATCAAATCGGTCGGTTAAAATGAAAATCAAACTGAAGAACAAGTCGAAAAGCGCACAGTCCAGAAAAGGCAAGAAACGTAAATGTATATCTGACGACGAAGAATACGAAGATGATTAA

Protein sequence:

>DPOGS213117-PA
MASPSPQSSPMPPPQAPSPMGPPTQSPAPPQSPHSPYNQQHVNGPPPSAHPPGSGPPPMQNHMPPGPPHTIASNTNGPPSVVQHPGMPPSGHQIPPHMVGPHMSGPPGHPMSASGSHPPGPNGHSNMPSNPQHPNMPGQGPPHGYMQHQIGHMPPNQGSMAIGGGPPPPGNGPQGPPGPSGVPLPLGGPPPHGVHPQGMPPMPPHHMMSSQGGYSHTSSPGAPTPPGAPSGQTPPGGPQQPPTAQTPPHGSAPPPTSSANGAPSSSPMQASLAASGPDNLNALQRAIDSMEEKGLQEDPRYSQLLAIRARSNSQDPSKGLFSNTQLSQLKAQIAAYRNLARNQPITQQIAMMAAGKRTGDSPPECPTPPAQPPSPYSQGLSQGGSQSGAGGKGTAGDAARGGGGAPPTPLPMTGQMAPPTQPTPPLVNPSGAGGKGTAGDAARGGGGAPPTPLPMTGQMAPPTQPTPPLVNPPLGPAGPIRGTAPRPPGPGGPTQQTQQQPGVPTPGAKQNRITAIPKPVGIDPLQILNERENRIAARIAHRMEVLSNLPANISEDLRLQAQIELRALRVLNFQKQLRAEILGQVRRDTTLETAVNIKAYKRTKRQGLREARATEKLEKQQKLEAERKRRQKHQEFLQTVLQHAKDFKEYHRNNIAKLSRLNKAIMTHHANAEKEQKKEQERIEKERMRRLMAEDEEGYRKLIDQKKDKRLAFLLSQTDEYIASLTEMVKQHKQEQRKKQQEEEKRKRKSRKKKVLEGGEIDALDDSSQTSDSRVSVMDPKSGEVLKGEEAPLLSQLKDWMETHPGWEVLSDSDDSGDDSQDEYGRKGGHKAENKEKSEEEKNRELIKKAKVEDDEYKTEEQTYYSIAHTVHESVTEQASILVNGNLKEYQIKGLEWLVSLFNNNLNGILADEMGLGKTIQTIALVTYLMEKKKVNGPFLIIVPLSTLSNWVLEFEKWAPTVQVVSYKGSPQSRRLSQSQLRASKFNVLLTTYEYVIKDKSTLAKIHWKYMIIDEGHRMKNHHCKLTQVLNTHYVAPHRLLLTGTPLQNKLPELWALLNFLLPSIFKSCSTFEQWFNAPFATTGEKVELNEEETILIIRRLHKVLRPFLLRRLKKEVESQLPDKVEYIIKCEMSGLQRVLYKHMQSKGVLLTDGSEKGNKGKGGAKALMNTIVQLRKLCNHPFMFQHIEEKFCDHIGTGGGIVTGPDLYRVSGKFELLDRILPKLKQTGHRVLVFCQMTQCMTIIEDYLSWRGFQYLRLDGMTKAEDRGELLKKFNDVGSDYFIFLLSTRAGGLGLNLQSADTVIIFDSDWNPHQDLQAQDRAHRIGQRNEVRVLRLMTVNSVEERILAAARYKLNMDEKVIQAGMFDQKSTGSERQQFLQSILHQDGDDEEEENEVPDDDLINEMIARSEEELEIFRRIDLERKKTETQTRLIDESELPDWLVKTDDEVVCNKGQGWNYPDEDETLGRGSRQRKEVDYTDSLTEKDLLQAIDEDMDEEDDDDDDDEVLDKKRRRGRKRRRNQDDSDEDEVPCTSRRKSKTELNQLKKRLKNIMKKVIDYSDENGRVLSEPFMKLPSRRELPDYYDVIKKPLDIKKIMNRIEDGKYTDISDLERDFFTLCANAQTYNEEQSLIYEDSVRLRNVFIEIRRRYDSGQNSDDSDENDKDDDDSDGESNRSVKMKIKLKNKSKSAQSRKGKKRKCISDDEEYEDD-