Monarch geneset OGS2.0

DPOGS202418
Transcript        DPOGS202418-TA  3789 bp
Protein           DPOGS202418-PA  1262 aa
Genomic position  DPSCF300233 + 197626-208504
RNAseq coverage   287x (Rank: top 38%)
Annotation
Heliconius      HMEL007421        0.0     56.62%
Bombyx          BGIBMGA003443-TA  0.0     51.83%
Drosophila      mus101-PA         7e-94   31.54%
EBI UniRef50    UniRef50_E2AE32   5e-156  30.07%  DNA topoisomerase 2-binding protein 1 n=2 Tax=Formicidae RepID=E2AE32_CAMFO
NCBI RefSeq     XP_001812071.1    3e-125  28.72%  PREDICTED: similar to DNA topoisomerase 2-binding protein 1 [Tribolium castaneum]
NCBI nr blastp  gi|307180232      2e-155  30.07%  DNA topoisomerase 2-binding protein 1 [Camponotus floridanus]
NCBI nr blastx  gi|383848509      6e-155  31.01%  PREDICTED: DNA topoisomerase 2-binding protein 1-like [Megachile rotundata]
Group
Gene Ontology    GO:0005622           2.5e-21  intracellular
KEGG pathway
InterPro domain  [618-749] IPR001357  2.5e-21  BRCT
Orthology group  MCL13407             Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202418-TA
ATGAATGAAGATATAAATGTTACGTTTATTATCCCTCCGGAGTGCAGAAATGAAAATGATTGTTCTGAGGAAATGCAGCTGGCTTTCACTGCTTGTGAGCAACACAGCGGGGGAGGTATCAAGGCGCAGTGGCAAGCCCAGGCTACTTGGAAGAAATTTGTAGGACTAACAAAGAAAGATGTTTTTGTTGTTTCTGAGTTTCAAGGAGAGTTTTTTGAGAGACTCAGATTAGTAGGGCCTCGATGTATTTCGTGTTGTCTAACGGAGGGCATATCAATACCATCAGGCCCTGAGCCGGTGTTCACTATAGCCATGAGAGGTCTCGTGGTCACAGCCAGTGGTCTTTCAAAACAACAGAAGGAAAATATCAAGAAGAAAGTTCATTGGATGGGAGGATTATATAGCACGGTGCTGACAGAAGACACCACTCACCTTGTAGCAGACACAGTGCTCTCTGATAAATATGTTAAAGCTGTGGAAAGGAATTTACCCGTGATGAAGGATACCTGGATTGAAGCGGTATGGGAAACATCTCTGAGGCTAAACATAAGTGCTGCCTCATCAGACTTCGACGATCATAAATTACCACCATTTGCAAACTTACAGGTCACAACAACCGGCATAACTAAAAAGGACAAGGCGTTGGTAATGAAGCTTGTCAGTGAAAATGGCGGAACATTCTCCGGAGCATTCCAGAGCGAGACAACAGATATTGTTGTTCTCAATAAGGACAGTATAGGAAGTGAGAAATATAAAGCTGCTTTAGAGTATGGTAAGGCATGTGTTCTACCGTCCTGGGTGATTGATTCCGCTGCCCGTGGTGTGGCGCTGCCTTTATCAAAATATAGAGTCGCTGGGGCCTCCACATCATCACCTCTATTGGAACATAGAGCACCGGATATGAGTTTAAATTTCTCAAGAATAACAAATCTGAGACCTCCCAGTAATTTTGTCGATGAGAGCAGAGCCGTCGATATTTCAACAATGTCTGGTAGAATGAAGCTATCCCAGGAAACAAAGAAATCAAATGATACTTCAGCAGAGAAGGAATTGTTGTCTCATTTTGAGAAGCTTGATGTGACTACTATTAAAAAAGCCGGACCCATATTCGATGGATTCTGTATTTGGTTGACGGGTGTGGAGGGCCCGGCCCGCGACCGCGCCGCCGCCTGTATCTCTCGCTGCGGCGGCGTGCGGTACGACTGTCCACACGAGCGCGTCACACACGCACTGGCCGGCACCGCTGCCGCTGCTGTTAGTGCCAGTGCTGCGCTGCCTAGTGTACCCGTTCTGCAGCCTATGTGGCTGGTGAAGAGCGTGCTGGCTGGACGGCCGCTGGGAGAAGCTGAGGCGAGTGACGTGTACATGTCGTATGCTTGTAGATTTTTAATAAATACAAAACCTGCAACACCAGTCAAAGCTCGCACAAAGATCGAACCCGCGTCTCCGATGAGCAAGCGAAACATGCAGCTACTGAGACACGGACCCCTGGACCTGCCGCCGCCCACGCCCGAGGTGGAGGAACCGCCGGACGATATCGTTCATCATTACTTGAGTCAAAGACAAGAGCAAGAGAAGACTCCAGAACGTGTGATGCCAGCTGAGCCGGAGCAACCGGATGTGACAGAAGACCTCGACGAGCCCTCTGAGGAGGACATAGAGCAGATATTTAAGGGCGTAACGATACAAGTCCGAGGTCTGGATGAGGAGGCGATCACTGAGATAGCTGCGGAGGTGTCCGCGGCTGGCGGGCTGTTGGTGGCGGGGGGCGGGGTGAGGGGTGGTTCTGGGGCGGCGCCCTACGTGCTGGTGCCGCTGGAGTGGGACGGGGACGAGGACGGGGATGGGGACCTCGTCACGGTGTTCTGGATTAAAGATTGTTTGTCCCGACAAGAACTCTTAGAGATACAGTACTATCATCGTCCTGTTAAACTGCCGCAGTGGAGCGTGCCTCCCTTAGAAGGCGTGGTGGCCAGTTTGAGCACGTACAGCGGATTCGA
GAGGGCGTTCCTAGACGAGCTCGCCAAACTACTGGGCGCCACCACGCAGCTTCGCTTCTGTCGCCGCAACACGGCGAACGCTCTGGCCTCTACTCACCTGATATGTCCGACGGCCTCGGGCGATAAGTACCAAGGCGCCGTGAAGTGGGGTCTGCCCGCCGTCACCGCGGACTGGCTCGTCGACTGCGCTACTGCTGGTACTCGAAAGAAAGAAACAAAATATTTAGTAGGGGAAACTAAAGCACCGCCGTCTCCGGAACAGCAGCGGGGACAGGAAGAGAGGCCGCGTGATGACACGAGTGTAGAAAACAGGCAGACGGACAAAGAGAATGCTATGTTCCCACCGAACGCCGCGCCTAGACGTGGCTCCATACCGGAACAGGTCACAAAAGCCGACGTCCAAGTCCGCACACCTCAGTTAGACGACGCGTTGTCCACTCCCAATCTCATGGGGCTGAGTCCTACCACCCGGCGGAGGTTGCGGGCCGTCAGGAGGGGGGAGATGCCCTCCGACCCCATCAGGACACCCACTGATCCCTTCCCCCGTGCTCCGTGTACCCCCGACAGTTCGTTCTGTGCGGCCGTGGCGGGCGGTCGCCTGTCTCCGGGCGCGAGGAAGACGCTCTGGTCGCTGGTGGACCGCCTGCCTAGCGGGACACCGGCCGCCGCGAGGTCGCACACGCCGTTATCAGAGATCAGGAATCGTTTTTTGTCTCAGTTCAACGACTCGCCAACAGATGGCAGCGTCAGTAACAACACTGGGCCCAGGAAATTGCACTTGCACGAACAAGCAGAAACACCGCCGGCGAAAATGGCGAAAACTTTTGAAGATCAGAGCGCGGGTGGGATGAATATATCCGAAGATAACGACGGCTCCAAAAGCGCTGTCAGCGCTACAACAGTTGATACGCCGAAAAGCACAACATTGCCACCGGTAGTCGACGCACAGCTGCAGCGCTTGAGCGCTGTACTCGCCAGCCGACTGAGTTCACAACGGAAACGAACAAGAGACTCCGTTCCTTCACATAATAGCGATCCGCCCCCGCCTCAAGACAGCGAGCCGGCCCCCGAGTCACAACCGAACACAGTCGGCTGGGACGACACCGCGAGGAAGACGGAACACACCAACAGACCGACAGTCAGGAAGTTCATGTTGTCGTCCAATGTCGACAACCGTGAAGAGATCGCCGCTATGATCCAGTGTCTGGGCGGGGAAGTGTCGGACAGTCCTGAACTAGATTTGTTCGCTACGCACTTACTGTGCGCCGCGCCGGGGAGGAGTGAGAAGATGCTGGGTTCAGTGGCCGCTGGGAAGTGGATCCTTCATCCCGCTTATGTATCCAGGAGCCGAGCCGCTGGGAGGTTCCTCGAGGAAGAAGAATACGAGTGGGGAAACCCTAAAGCGGCGTGTCTACCGACTCTGACCGGGGCCGAGAAGAGCTTAGCGCAGGCGGCCTATCGCTGGAGGGCGGTGAGGAAGGCCGGGGGGCGAGGGCCCTTCGACGGCATGAGAGCTATACTACACGTGCCCGCGGCTAGGAAGAGGCTACTGGCCAGGTTGCTGGAGGCTGGCGACGGAATCGCTTGTGACGATGAGCCCCCGTACTCCGAGGTGAACGTGACGGTGTGTTTCGCTGATCTCAAGCGCTATCCCTTGTCTGAGAGGGATTCCTCTTGGTTCGTGTCTCGCGGCACCCCGGTCTGCCCCCCGGTGCTGCTGAGCTCCTACCTCACCGAGGACACGCCCCCCGACCCGCTGCAACACTGCTTCCCCAATTTTAGACCTTAA

Protein sequence:

>DPOGS202418-PA
MNEDINVTFIIPPECRNENDCSEEMQLAFTACEQHSGGGIKAQWQAQATWKKFVGLTKKDVFVVSEFQGEFFERLRLVGPRCISCCLTEGISIPSGPEPVFTIAMRGLVVTASGLSKQQKENIKKKVHWMGGLYSTVLTEDTTHLVADTVLSDKYVKAVERNLPVMKDTWIEAVWETSLRLNISAASSDFDDHKLPPFANLQVTTTGITKKDKALVMKLVSENGGTFSGAFQSETTDIVVLNKDSIGSEKYKAALEYGKACVLPSWVIDSAARGVALPLSKYRVAGASTSSPLLEHRAPDMSLNFSRITNLRPPSNFVDESRAVDISTMSGRMKLSQETKKSNDTSAEKELLSHFEKLDVTTIKKAGPIFDGFCIWLTGVEGPARDRAAACISRCGGVRYDCPHERVTHALAGTAAAAVSASAALPSVPVLQPMWLVKSVLAGRPLGEAEASDVYMSYACRFLINTKPATPVKARTKIEPASPMSKRNMQLLRHGPLDLPPPTPEVEEPPDDIVHHYLSQRQEQEKTPERVMPAEPEQPDVTEDLDEPSEEDIEQIFKGVTIQVRGLDEEAITEIAAEVSAAGGLLVAGGGVRGGSGAAPYVLVPLEWDGDEDGDGDLVTVFWIKDCLSRQELLEIQYYHRPVKLPQWSVPPLEGVVASLSTYSGFERAFLDELAKLLGATTQLRFCRRNTANALASTHLICPTASGDKYQGAVKWGLPAVTADWLVDCATAGTRKKETKYLVGETKAPPSPEQQRGQEERPRDDTSVENRQTDKENAMFPPNAAPRRGSIPEQVTKADVQVRTPQLDDALSTPNLMGLSPTTRRRLRAVRRGEMPSDPIRTPTDPFPRAPCTPDSSFCAAVAGGRLSPGARKTLWSLVDRLPSGTPAAARSHTPLSEIRNRFLSQFNDSPTDGSVSNNTGPRKLHLHEQAETPPAKMAKTFEDQSAGGMNISEDNDGSKSAVSATTVDTPKSTTLPPVVDAQLQRLSAVLASRLSSQRKRTRDSVPSHNSDPPPPQDSEPAPESQPNTVGWDDTARKTEHTNRPTVRKFMLSSNVDNREEIAAMIQCLGGEVSDSPELDLFATHLLCAAPGRSEKMLGSVAAGKWILHPAYVSRSRAAGRFLEEEEYEWGNPKAACLPTLTGAEKSLAQAAYRWRAVRKAGGRGPFDGMRAILHVPAARKRLLARLLEAGDGIACDDEPPYSEVNVTVCFADLKRYPLSERDSSWFVSRGTPVCPPVLLSSYLTEDTPPDPLQHCFPNFRP-
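
The lengths reported in the record can be cross-checked against each other. The sketch below is only an illustration, not part of the database entry: the helper names `cds_length_bp` and `span_bp` are hypothetical, and it assumes that the transcript is coding sequence only (start codon through stop codon) and that the genomic coordinates are 1-based and inclusive.

```python
def cds_length_bp(protein_aa: int, has_stop: bool = True) -> int:
    """Coding-sequence length implied by a protein length (3 nt per codon)."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def span_bp(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3789
protein_aa = 1262
genomic_start, genomic_end = 197626, 208504

# 1262 codons plus one stop codon should account for the whole transcript.
print(cds_length_bp(protein_aa) == transcript_bp)

# Genomic footprint of the gene model on scaffold DPSCF300233.
print(span_bp(genomic_start, genomic_end))
```

Under these assumptions the 1262 aa protein plus a stop codon accounts for the full 3789 bp transcript, and the genomic interval is longer than the transcript, which would be consistent with the gene model containing introns.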