Monarch geneset OGS2.0

DPOGS208662
TranscriptDPOGS208662-TA1872 bp
ProteinDPOGS208662-PA623 aa
Genomic positionDPSCF300281 + 264403-267926
RNAseq coverage132x (Rank: top 56%)
Annotation
HeliconiusHMEL0117420.079.94% 
BombyxBGIBMGA007760-TA0.066.43% 
Drosophilamus101-PA2e-1229.38% 
EBI UniRef50UniRef50_E2AE323e-1332.21%DNA topoisomerase 2-binding protein 1 n=2 Tax=Formicidae RepID=E2AE32_CAMFO
NCBI RefSeqXP_394416.33e-1331.88%PREDICTED: similar to DNA topoisomerase 2-binding protein 1 (DNA topoisomerase II-binding protein 1) (DNA topoisomerase IIbeta-binding protein 1) (TopBP1) [Apis mellifera]
NCBI nr blastpgi|3071802321e-1232.21%DNA topoisomerase 2-binding protein 1 [Camponotus floridanus]
NCBI nr blastxgi|3071802321e-1032.21%DNA topoisomerase 2-binding protein 1 [Camponotus floridanus]
Group
Gene OntologyGO:00056222e-08intracellular
GO:00056345.1e-07nucleus
GO:00082705.1e-07zinc ion binding
KEGG pathway 
InterPro domain[393-479] IPR0013572e-08BRCT
[4-47] IPR0129345.1e-07Zinc finger, AD-type
Orthology groupMCL25386 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208662-TA
ATGGTTATGTCTTTTGCCAATGTCCAAATCCACGAAGGAGATGGTTTATCAGATCGTGTATGTACAGCTTGCATGGAGAATTTAAGTACTGCCTATTTATTTAAACAGCAGTGTGAGAGAATTGATGCATTGATGAGAAAATTTCCAGATGCCCAGATTGATAAATCAGAAATCACAAATTACAATGATGTATACAAACAGGAATTTACAATGCACACACAATATATTAATGAAGAAAACGATGAAAATTCAGTCAAAGCGACACATGATTCAAGCCATGACTCTAGACTCAGCCCTTGTTTTGAGACAAGTGATGAAACTGACAGCGATATAGATTCAGTTAAATGTGTTTATTGCAGTCAGTCTTACAACTACGGCGCCCATACATGTCATGTCACTTGCACAATTGAGCACAATGAACTTCAACAATCGAGCAGCAATGAAACACTTGTAGCCACTGATTTAACACCAACCAAATCAAAAATGCCCGCCATAGACCTGCAAAACGTCAGTGTCTCGTGTGTATTGTGCGATGAAAAATTTCAAAGATATGATTCTTATGTCGTACATTTAAATAAATGCACCACTAATGTTAAACTTCACCATTTCGTGTGTCCCGTCTGTCATGATATGTTTAATGAGAAATTTATGTATCTAGAGCATCTGAAAGTAGATCATTTCAAAGAAGATGAGTCAATGTTAAGTGATCCTGGTGTTGACTGTGTTGATTTTTCACCGATTGTCGTTAAGACAAGGAAACCGATGGCTGTTAGACGACAGATTGGTTGGTCTGTTGAGGACATATACCAAGAGATAGATTGTCATAAAATAGAACCGAAACCAAGTCCAGCGGCCAGTCCATTGAAGAATTTCTTTTCTAAGTTGGGGAACGAGTCTTTCAGCAGTAAGCAAAGTACACCTAAGAAGGTCAGCTTTCGTAAGTTTATTGAAAATGGTAAAGCAAAGACCTCTGTATATCTACCATTTAAGAAATACATTCAGAATTATAAACTTAAAAAGAAGATGTCAAACTACAGCCCTATAAATTCCAAGCAACAAATTACTAGCACAATACAAGCTACTATTCCAGAATATGTTTCAGACAGTGATTATGGCTCACCAAGCGGAACATCGGAGGATTCCTGGAAGATGAAACAAACTTTGATATGTGCATGTGACAAGAAAATATTCATGCTCTCTGAACACATCCATGATCGTGACAGAATAGTTGCAATGATAAATGAGCTGGGTGGAGTGGTTGCCGAGAATACTAAAATGGAGATGTTGGCAACACATTTTATTTCAGTGTTGCCAAATGATACATTCACTGGCATGATGGTTTGTTCCTTGGCCACAGGAAAATGGCTACTGCACATCAGCTTCATCTATGATAGTTTTAGATGCAAGAAGTTTTTACAAGAAAATATGTATGAATGGATGAGACATCCCAAAATATTAGAAATTGACAATACCAGCATAGAAATTGCGAAATCAGCTGTGTTCTGGCAAATGGAACTACAAAACAAGAAATCTAAATATCCATTTGAAGGGAAACAGATTGTTCTCATCATGAAGAAGAAGTATAGACAGTACTATCACATGATATTCAAGAGTCTCAAAGCGAAACCAGTCACTTATGATCCAAGAACACCTGGAAGTTGCTGTTCTGCTGACTATTGCTTTGTCGACATGAAAATCATTGAAAGAGTCAAATTACGTTTTTTCTTTCATCACAATGTCCCAGTATTCCCCTACCAGTATATTCTGGTATATCTACTCAAGAGAGGAAAAGTGGACGATGAACACAAATATTTACTTCAAGACTGTAGTAAAATAAAAAACGAAGATTTTTTCTTCAACAATACTTATTAG

Protein sequence:

>DPOGS208662-PA
MVMSFANVQIHEGDGLSDRVCTACMENLSTAYLFKQQCERIDALMRKFPDAQIDKSEITNYNDVYKQEFTMHTQYINEENDENSVKATHDSSHDSRLSPCFETSDETDSDIDSVKCVYCSQSYNYGAHTCHVTCTIEHNELQQSSSNETLVATDLTPTKSKMPAIDLQNVSVSCVLCDEKFQRYDSYVVHLNKCTTNVKLHHFVCPVCHDMFNEKFMYLEHLKVDHFKEDESMLSDPGVDCVDFSPIVVKTRKPMAVRRQIGWSVEDIYQEIDCHKIEPKPSPAASPLKNFFSKLGNESFSSKQSTPKKVSFRKFIENGKAKTSVYLPFKKYIQNYKLKKKMSNYSPINSKQQITSTIQATIPEYVSDSDYGSPSGTSEDSWKMKQTLICACDKKIFMLSEHIHDRDRIVAMINELGGVVAENTKMEMLATHFISVLPNDTFTGMMVCSLATGKWLLHISFIYDSFRCKKFLQENMYEWMRHPKILEIDNTSIEIAKSAVFWQMELQNKKSKYPFEGKQIVLIMKKKYRQYYHMIFKSLKAKPVTYDPRTPGSCCSADYCFVDMKIIERVKLRFFFHHNVPVFPYQYILVYLLKRGKVDDEHKYLLQDCSKIKNEDFFFNNTY-