Monarch geneset OGS2.0

DPOGS212784
TranscriptDPOGS212784-TA2049 bp
ProteinDPOGS212784-PA682 aa
Genomic positionDPSCF300012 + 1344931-1352680
RNAseq coverage145x (Rank: top 54%)
Annotation
HeliconiusHMEL0070530.058.72% 
BombyxBGIBMGA013227-TA0.055.56% 
DrosophilaSse-PA7e-3825.33% 
EBI UniRef50UniRef50_UPI00021A78FA2e-5528.82%UPI00021A78FA related cluster n=2 Tax=unknown RepID=UPI00021A78FA
NCBI RefSeqXP_001951178.12e-4326.08%PREDICTED: similar to DNA double-strand break repair Rad50 ATPase, putative [Acyrthosiphon pisum]
NCBI nr blastpgi|2700105135e-5625.90%hypothetical protein TcasGA2_TC009919 [Tribolium castaneum]
NCBI nr blastxgi|2700105133e-5525.74%hypothetical protein TcasGA2_TC009919 [Tribolium castaneum]
Group
Gene OntologyGO:00056341.2e-60nucleus
GO:00082331.2e-60peptidase activity
GO:00065081.2e-60proteolysis
KEGG pathwayapi:1001621135e-43 
 K02365 (ESP1)maps-> Meiosis - yeast
    Cell cycle - yeast
    Cell cycle
    Oocyte meiosis
InterPro domain[218-601] IPR0053141.2e-60Peptidase C50, separase
Orthology groupMCL17426 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212784-TA
ATGGAAGATTTCGATATTTTTACAAATGAAATAGACTTCGCCAAACCCGTTCATAAAAATATACTCAATAACTATGTCAACAATGTACCAAAGACTCCGAGTGGACCTAATTATGAAAGTGGCCGGAAACTACTGGGTCAGGAGTGTCTGGCTGATGGTTTGGAAAATGAATGCGCATTTCACTTCAGTGAATCCGCTTCCCCCACCTTGAGGACTCTAGCTGTGTACAGAGATGAACAGCTGAAGACTGCCGATAATAGATTAAATAAAATAACAGATTATATAAAGCCCATTAAGAATATTGTTGACCAGAATCCTATTACCAAAGAGGATGATGCGGTACTCGATCTCTTTATTAACGACGTGGACAACGTCACGAAAATACTTGATAAATATAATATGGAGGAGGAAATGCCATTTTATAAGCAATACATGAGATTCAGTTCCGACAATAATGTCGATAATTTTATGAAGATTTTAAAGGAATTACCCAAAGAGTGGACAATAATTCAGTTGACAGCACCATACAATGCCAATGAAAATCTGAAACCGTTCACAGACTACCGCACTGAGATAAAGTCTCTGTACATAACGTTGCTCGGTAACAAGTATTTCGATAGTCCACTGACGATAGAGGTTCCAGCTAATGTTACGAAAGAGGGTGAAAAACCACTGTTTGAGGAGCTGTACTCTCTGCTAGAGGAGAATTATAGGACGATAGACAATGCACAGTTCCTGAACAACAAACGTTTGGTCCAGAATTATTGGAGCAAACGCGAGGACATTGACCTCAGGATGAAGAGCATTATTAACGTGATGTACAAAGAATGGCTCGGCGGCTTCGCAAGTCTCTTGACTGGACGGCTGTTAGACGACCAGCTACGGGATAGAGTTGTTAGTTTAGTTGATACCGCCATCAATGATTGGGGTTTCATAAGGTTGACGGAAAAACAGAAAATGCTCCTATACAACCTCATTGAGAGCAGTGCGTTATTATCATCGCAGCAAATAAAATCTTGTTTGCGGAAAATATTGACTGAACATGGCAACATTGATGATATAAGAAGAACTTTGAAGACAGACTGTGTTAACTGTAGCGCTGAATTTAGACTAGCGAGTGAATTATGTTTGAAATGTTTGTCCCAGTGTTTCGAGGTCATTCACCATTTTACGCTCGTTGATGGCATCAAAGCGTTTTCACAAGTGGCAACACAGGTTAAAGATGGCGACGAGTGGGCAAGTTTGAAGAAGGCAAAACGACAACCCGTGATTCTGATTGTTGATGAGCTGTTGGACACGTTCCCATGGGAAACCCTGCCCACACTCAACCAACATCCGGTCACGCGGATGGAAAACATACATTTCCTGTACGCGCTGTACAAAATGCATGAAAATAAAATCATAAACGGCTACTACACAGCCAGCTCGAGAGTCGGCAGATACGTTATTAATCCAGAAAAGAATCTCGATCGTATGGAGCACAGGATGAGGTCGTTTGTGAACTATTGGTGTAAGTCTTGGACCGGACACGCTGGCGAGACGCCCAGTGCGGACCAATACCTTAAATGTCTTACCGAAGCTGATGTATTCCTGTACTGCGGACACGGGGATGGTCTCCAATTGGCTAGCACCTCGTCTCACATCGAGGGTGCTTCGTGTGGTGGTGTGTGTGTACTATCAGGGTGTGGGTCGCTGAGGCTGGTGCGGGAGGGGGGCAGAGCACCCCCGACCGCCGCTCATCACCACCTACACGTGGCCGGATGTCCAATGGTGATCGGTATGCTCTGGGAGGTGACTGACCTTGAAGTGGACAAGATGGTGACAACAATGCTGTCTTTGTTTGTACCATCGGAGGCGCCGTGCGATTGGAAATGTATAGGGAAGAGCAAGTGGAGCCAGGGGACTATCGACATCCCCTCCCCTCCGAGTCCGCCACAGTCCCGGTGTAGCGATGTGCTGTTAGCGGCGAGCAAATCACGTTTAGCTACAGGCTTCATGATGATATCTAGCAGTCTAGTCGTTAGAGGTATACCAGTTGTTATCAATTAG

Protein sequence:

>DPOGS212784-PA
MEDFDIFTNEIDFAKPVHKNILNNYVNNVPKTPSGPNYESGRKLLGQECLADGLENECAFHFSESASPTLRTLAVYRDEQLKTADNRLNKITDYIKPIKNIVDQNPITKEDDAVLDLFINDVDNVTKILDKYNMEEEMPFYKQYMRFSSDNNVDNFMKILKELPKEWTIIQLTAPYNANENLKPFTDYRTEIKSLYITLLGNKYFDSPLTIEVPANVTKEGEKPLFEELYSLLEENYRTIDNAQFLNNKRLVQNYWSKREDIDLRMKSIINVMYKEWLGGFASLLTGRLLDDQLRDRVVSLVDTAINDWGFIRLTEKQKMLLYNLIESSALLSSQQIKSCLRKILTEHGNIDDIRRTLKTDCVNCSAEFRLASELCLKCLSQCFEVIHHFTLVDGIKAFSQVATQVKDGDEWASLKKAKRQPVILIVDELLDTFPWETLPTLNQHPVTRMENIHFLYALYKMHENKIINGYYTASSRVGRYVINPEKNLDRMEHRMRSFVNYWCKSWTGHAGETPSADQYLKCLTEADVFLYCGHGDGLQLASTSSHIEGASCGGVCVLSGCGSLRLVREGGRAPPTAAHHHLHVAGCPMVIGMLWEVTDLEVDKMVTTMLSLFVPSEAPCDWKCIGKSKWSQGTIDIPSPPSPPQSRCSDVLLAASKSRLATGFMMISSSLVVRGIPVVIN-