Monarch geneset OGS2.0

DPOGS212970
Transcript: DPOGS212970-TA, 4638 bp
Protein: DPOGS212970-PA, 1545 aa
Genomic position: DPSCF300057 + 499338-509834
RNAseq coverage: 1007x (Rank: top 13%)
Annotation
Heliconius       HMEL017797        0.0  84.05%
Bombyx           BGIBMGA011615-TA  0.0  84.55%
Drosophila       Top2-PA           0.0  63.53%
EBI UniRef50     UniRef50_O16140   0.0  83.91%  DNA topoisomerase 2 n=6 Tax=Bilateria RepID=TOP2_BOMMO
NCBI RefSeq      NP_001037009.1    0.0  83.91%  DNA topoisomerase 2 [Bombyx mori]
NCBI nr blastp   gi|112983266      0.0  83.91%  DNA topoisomerase 2 [Bombyx mori]
NCBI nr blastx   gi|112983266      0.0  82.91%  DNA topoisomerase 2 [Bombyx mori]
Group
Gene Ontology
GO:0003677  2.7e-210  DNA binding
GO:0005524  2.7e-210  ATP binding
GO:0003918  2.7e-210  DNA topoisomerase (ATP-hydrolyzing) activity
GO:0005694  2.7e-210  chromosome
GO:0006265  2.7e-210  DNA topological change
GO:0006259  5.4e-68   DNA metabolic process
GO:0006355  8.8e-33   regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain
[81-671]     IPR001241  0         DNA topoisomerase, type IIA, subunit B/N-terminal
[432-1214]   IPR013760  6.6e-248  DNA topoisomerase, type IIA, central
[694-1193]   IPR002205  2.7e-210  DNA topoisomerase, type IIA, subunit A/C-terminal
[38-274]     IPR003594  8e-86     ATPase-like, ATP-binding domain
[441-451]    IPR001154  5e-79     DNA topoisomerase II, eukaryotic-type
[1010-1056]  IPR013758  5.4e-68   DNA topoisomerase, type IIA, subunit A/C-terminal, alpha-beta
[432-595]    IPR013759  2e-56     DNA topoisomerase, type IIA, subunit B/N-terminal, alpha-beta
[257-419]    IPR020568  4e-47     Ribosomal protein S5 domain 2-type fold
[275-431]    IPR014721  1.3e-43   Ribosomal protein S5 domain 2-type fold, subgroup
[1057-1184]  IPR013757  1.1e-39   DNA topoisomerase, type IIA, subunit A, alpha-helical
[269-428]    IPR013506  2.1e-34   DNA topoisomerase, type IIA, subunit B, domain 2
[908-1009]   IPR020899  8.8e-33   Arginine repressor, C-terminal
Orthology group: MCL11377, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS212970-TA
ATGGCTGATATTAAGTCAATGTTCAATAGAATGAGCAATGGCACTCAGAATGGTACATCAGAGGCTGCCAAGGGCCAGAAAGGTGCTATTGAAAAAATATATCAGAAGAAATCGCAATTGGAACATATACTGTTACGACCCGACACTTACATAGGGTCGGTTGAACGTTCCACAGAAACAATGTGGGTCTATGATAAAGAGAAAGAGTGTATGGTCCAACGAGAATTGACATATGTACCAGGTTTATATAAAATATATGACGAAATATTAGTGAACGCTGCCGATAATAAGCAAAGAGATCCTAAAATGGATGTCATCAAAATAGACATTAATCAAGAACAAAATACAATATCAGTATATAACAACGGATGCGGGATCCCTGTGGTTATGCACAAGGATGAGAAAATGTTTGTGCCCACCATGATCTTTGGACATTTATTGACATCTTCCAACTACAATGACGAGGAAGAGAAAGTGACGGGAGGAAGAAATGGATACGGAGCAAAACTATGTAACATTTTCTCAACAAAGTTCACAGTGGAGACCGCATCCAAACAATACAAGAAACATTTCAAGCAGACATGGGGTTCCAACATGACCAAGGCCTCGGAACCCAAAGTTAAGGAGTCCGGTAAAGATGACGATTTTACAAAAGTGACTTTTAGCCCTGATTTGGCTAAATTTAAAATGGATAAGTTAGAAGACGACATCGTAGCGTTGATGTCACGGAGAGCCTACGACGTGGCCGCCTCTACGACGGGGGTCAAAGTTTATCTTAATGGAGAAAGATTGAAAATAAATAAGTTCAAGGACTATATCGATCTATACATTAAAGGTAAAGAGGACGAGAACGGCCAACCCCTGAAGGTTGTATACGAGAAAGTTAACGACCGGTGGGAAGTGGCTCTGACTCTATCCGACAGGGGCTTCCAACAAGTGTCCTTCGTGAACTCCATCGCCACGACGAAAGGAGGCAAACACGTCGACACTGTAGCCGACAGTGTGGTCAAGAATGTATTAGAAGTACTGAAGAAAAAAAACAAAGGCGGCGTTAACATCAAACCGTTTCAGGTTAAAACTCACATGTGGGTATTCATAAATTGTCTGATCGTCAACCCGACCTTCGACTCACAGACCAAAGAAAACATGACTCTGCAGGCCAAGAGTTTCGGTTCCAAATGTAATTTCTCCGAGAAGTTTATTAATGCGGTGACAAAATCGGGTTTAGTAGAATCTGTCCTCACGTGGGCCAAGTTCAAGGCGCAGAATGAATTAGTTAAGGCGTCTGGAAAGAAACAGAGTAAGTTAAAGGGCATACCGAAACTAGAAGACGCTAATGACGCGGGTACCAAGAATGCTCATCTATGCACGTTAATCCTCACCGAGGGAGATTCGGCCAAAACTTTAGCCGTTTCTGGACTTAGCGTAGTCGGTAGGGATCATTACGGCGTCTTCCCACTGAAGGGTAAACCGCTCAACGTTAGAGACGCATCACACAAGCAGGTTCTTGAAAATGTTGAAATTAATAATTTAATAAAGATCGTAGGCTTACAGTACAAGAAGAAATATAACAGTGTGGATGACTTGAAGTCGTTGAGGTACGGCAAGGTGATGATTATGGCGGATCAGGATCAAGACGGGTCTCATATAAAAGGTCTCATCATTAACTTCATACACCACAACTGGCCGGAGTTACTCAAACTACCGTTCTTAGAGGAATTCATTACGCCCATAGTGAAAGCTACGAAGAAAGATAAGGAATTCTCATTCTATTCCTTGCCAGAGTTCGAAGAGTGGAAGAGGGAAACCGACAACCATCACACTTACAATATCAAATACTACAAAGGTTTGGGTACCTCCACATCCAAGGAGGCTAAAGAATACTTCCAGAATATGGACAGGCATAGGATCAGATTCAGATACAGCGGGCCCACGGACGATCATCACATCGAACTGGCCTTCTCAAAGAAAGGCGCCGACCAACGCAAGGAGTGGCTCAC
GAATCATATGGACGAAGTTAAAAGGAGGAAGGAGATCGGTCTACCGGAAAGATATTTATATACCAAGGAGACTAAAGCCGTCACTTATTCAGATTTCGTCAATTTAGAGCTGGTTCTCTTTTCTAACGGCGACAATGTCAGATCCATACCATCAATGATGGATGGTCTGAAGCCTGGCCAGCGTAAGGTGATATTTACGTGCATCAAACGTAACGACAAACGAGAGGTCAAAGTGGCTCAGCTGGCTGGTTCCGTAGCAGAGCACTCGGCGTACCATCACGGTCGAAGGATGGGCCCTGAGTTTATAGCGCGACACTTTCACCATTCCCTGGCTATGACGATCGTTAACCTGGCCCAGAACTACGTGGGATCAAACAACATAAACCTGCTGGAACCGCGCGGTCAGTTCGGTACGAGGCTCTGTGGCGGCAAGGACTCGGCCAGCCCGAGATACATCTTCACGCTCATGTCGCCACTCACGAGACTAGTCTTCCATCCTCATGACGATCCGCTGCTAGTGCACGAATTCGAAGACAATCAGAAGATCGAACCCATACATTACGTACCCATCCTGCCCATGGTCCTAGTGAACGGTGCTGAAGGGATCGGTACAGGGTGGTCTACTAAAATACCAAACTACAATCCGAGAGACATTGTAGCTAATATCCGTCGAATGCTTGATGGCGAAGATCCAGTGCCAATGCATCCCTGGTACAAGAACTTCCGCGGAACTATCGAGAGCTTCGGCGACAAATACGTCATTTCCGGCGAGGCGGCCATTTTGCCAAACGAAAAGATAGAAATCACAGAGCTTCCTGTTGGCACTTGGACGCAGAATTATAAGGAGAACGTGCTAGAGCCTATGCTCGGTACAGACAAAGTTAAACCTTTGATATCCGAATACAGGGAATATAACACGGACACGACTGTGAGGTTCGTAGTGACCTTGCTGCCGGGCAAGCTGGCCGAGGTGGAGGCGGAGGGCATACACAAGGTGTTCAAACTGCAGACTACGATATCAATGACGTGCATGAACGCGTTCGATCACAACAACTGTCTCAAGAAGTATGATAAAGTAGAGGAAGTGCTACACGAGTTCTACGGTATACGTCTACGTTACTACGCTCGCCGTAAGGAATGGCTAGAGGGTCAGCTGCAGGCGGAGGCGGACAAGCTCTCCAACCAGGCCAGGTTCATACTGGAGAAGTGCGACAAGGGACTCGTCGTGGAGAATAAGAAGAGAAAGGTTCTGGTGGAAGAACTCATCAAGAGAGGATATGCGCCCGATCCTATCGCTGATTGGAAGAAACGAGCGAGTAAAATGCAGGGATTGACCGCCGTGGAAGAGGAACCGGAGTCCGAGGAGGAGTTGGAACAAGAGGAAGTGAAGGGGAAATCAGCTGATCCTGAAAAAGTGTTCCAGCAACTGAAGGAAGTAAAGAAATTCAATTATCTCCTGGGAATGTCCATGTGGATGTTGACCAAGGAGAAGAAAGACGAGCTGCTCAAACAACGAGATCAAAAACTCACGGAGCTGGAAACGCTGAAAGCGAAGACTCCGGCCATGCTGTGGCGAGAGGATCTGGATTCCTTCCTCATTAAACTCGACGAGGTGGAAGAACAAGAGCGACTCGACGAGAGCAACACGAACAAGAAAACATCAAAGGCATTGGCCGCGAACAAAAAGAATCGCAAATCGATAATGGATATAGTGCCTTCCGACAACGGCAGACGAGTGGAGCCGAAGATATCAGAAGATCTCATCAAGAGAATACAAGCGGCGGAGAAGGCGAAGATAAAGAAGGAGGTCAAGAAGGAATACGACCCGGACGATCCGACCGGCGTGAGTCCCGCCAGCGGAGAGAAGAAGCCGAAGGGACGAGTGAAGAAGGAGAAACCTGAGAAAACAGAAAAGACAGATAAACCCGATAAAGAGGGTCTCAAACAAACCAAATTAACGTTCAAGAAGGAGCCCAAAAAGAAGAAGGCTGCTCTTAGTA
ACGATAGCTCCGACGAAATGTCCGTAACATCGGACGTGGAGATAACGTCCGCGCCCGCGCCCAGAGACAGGACGATCGTCAGAAGAGCCGCTACTAAGGTACAAAAGTACAAGGACGGTTCGGACGAGTCGTCGGATTCAGAGCCAGAGCTGCTAGACAACAAGATCGACTCTGATAGAGACGTTCCCGAACCGATGTCCGATGACGATTTCACCGTCAAGAAGAATAATAAGAAACCTGCTGAAATGGACTCCGATTGTCTATTCGATTCGTTAATCGAGGACACCAAGAAGGAGGAGTCGGCGAAAAATAATGAACCGATACCAACAACGGTCCTGTCGAGTGACGACGATTCAGTTCAAGAAACTGCTCCCAAGAAGAAGCAGACCAAGAGGAAGTTGATGAACGTCGAGAAAGAGAAGCCTAAAGCGAGGAAGAGGCCGGCCAAGGTGGTGTTCAGCCACAGCACGGACGGGGAGGACGAGGAGGCCTTCGAGACTAAAAAGACAAAGAAGAAGGCCAATCCGAAAAAACGACAGAAGAAAGCCGACTCGGACAGCTCTTGCGAAGAAATTACTCGCGGGAGAGATAAGGACAAGGTGTACCGCTTCAGCGACGGATCCGACAGCGATCTATAA

Protein sequence:

>DPOGS212970-PA
MADIKSMFNRMSNGTQNGTSEAAKGQKGAIEKIYQKKSQLEHILLRPDTYIGSVERSTETMWVYDKEKECMVQRELTYVPGLYKIYDEILVNAADNKQRDPKMDVIKIDINQEQNTISVYNNGCGIPVVMHKDEKMFVPTMIFGHLLTSSNYNDEEEKVTGGRNGYGAKLCNIFSTKFTVETASKQYKKHFKQTWGSNMTKASEPKVKESGKDDDFTKVTFSPDLAKFKMDKLEDDIVALMSRRAYDVAASTTGVKVYLNGERLKINKFKDYIDLYIKGKEDENGQPLKVVYEKVNDRWEVALTLSDRGFQQVSFVNSIATTKGGKHVDTVADSVVKNVLEVLKKKNKGGVNIKPFQVKTHMWVFINCLIVNPTFDSQTKENMTLQAKSFGSKCNFSEKFINAVTKSGLVESVLTWAKFKAQNELVKASGKKQSKLKGIPKLEDANDAGTKNAHLCTLILTEGDSAKTLAVSGLSVVGRDHYGVFPLKGKPLNVRDASHKQVLENVEINNLIKIVGLQYKKKYNSVDDLKSLRYGKVMIMADQDQDGSHIKGLIINFIHHNWPELLKLPFLEEFITPIVKATKKDKEFSFYSLPEFEEWKRETDNHHTYNIKYYKGLGTSTSKEAKEYFQNMDRHRIRFRYSGPTDDHHIELAFSKKGADQRKEWLTNHMDEVKRRKEIGLPERYLYTKETKAVTYSDFVNLELVLFSNGDNVRSIPSMMDGLKPGQRKVIFTCIKRNDKREVKVAQLAGSVAEHSAYHHGRRMGPEFIARHFHHSLAMTIVNLAQNYVGSNNINLLEPRGQFGTRLCGGKDSASPRYIFTLMSPLTRLVFHPHDDPLLVHEFEDNQKIEPIHYVPILPMVLVNGAEGIGTGWSTKIPNYNPRDIVANIRRMLDGEDPVPMHPWYKNFRGTIESFGDKYVISGEAAILPNEKIEITELPVGTWTQNYKENVLEPMLGTDKVKPLISEYREYNTDTTVRFVVTLLPGKLAEVEAEGIHKVFKLQTTISMTCMNAFDHNNCLKKYDKVEEVLHEFYGIRLRYYARRKEWLEGQLQAEADKLSNQARFILEKCDKGLVVENKKRKVLVEELIKRGYAPDPIADWKKRASKMQGLTAVEEEPESEEELEQEEVKGKSADPEKVFQQLKEVKKFNYLLGMSMWMLTKEKKDELLKQRDQKLTELETLKAKTPAMLWREDLDSFLIKLDEVEEQERLDESNTNKKTSKALAANKKNRKSIMDIVPSDNGRRVEPKISEDLIKRIQAAEKAKIKKEVKKEYDPDDPTGVSPASGEKKPKGRVKKEKPEKTEKTDKPDKEGLKQTKLTFKKEPKKKKAALSNDSSDEMSVTSDVEITSAPAPRDRTIVRRAATKVQKYKDGSDESSDSEPELLDNKIDSDRDVPEPMSDDDFTVKKNNKKPAEMDSDCLFDSLIEDTKKEESAKNNEPIPTTVLSSDDDSVQETAPKKKQTKRKLMNVEKEKPKARKRPAKVVFSHSTDGEDEEAFETKKTKKKANPKKRQKKADSDSSCEEITRGRDKDKVYRFSDGSDSDL-
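
The transcript and protein lengths listed in the record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function name `check_lengths` is hypothetical, and it assumes that the transcript is entirely coding sequence (no UTRs), that its final codon is a stop codon, and that the genomic coordinates are inclusive. None of these assumptions is stated by the record itself.

```python
def check_lengths(transcript_bp, protein_aa, start, end):
    """Check that transcript and protein lengths agree; return (codons, genomic_span).

    Assumes the transcript is pure coding sequence ending in one stop codon,
    and that start/end are inclusive genomic coordinates.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    codons = transcript_bp // 3
    # One codon is the stop codon, so it does not contribute an amino acid.
    if codons - 1 != protein_aa:
        raise ValueError("protein length does not match transcript length")
    genomic_span = end - start + 1
    return codons, genomic_span


# Values copied from the record for DPOGS212970.
codons, genomic_span = check_lengths(4638, 1545, 499338, 509834)
print(codons, genomic_span)  # prints: 1546 10497
```

Under these assumptions, 4638 bp gives 1546 codons, that is 1545 amino acids plus one stop codon, which matches the listed protein length. The genomic span of 10497 bp is longer than the transcript, which would be consistent with the presence of introns, although the record does not list the exon structure.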