Monarch geneset OGS2.0

DPOGS206874
Transcript: DPOGS206874-TA (1161 bp)
Protein: DPOGS206874-PA (386 aa)
Genomic position: DPSCF300001 - 2287744-2289047
RNAseq coverage: 66x (Rank: top 67%)
Annotation
Heliconius: HMEL010200, E-value 5e-158, identity 68.77%
Bombyx: BGIBMGA013150-TA, E-value 7e-154, identity 69.39%
Drosophila: Rad9-PA, E-value 1e-35, identity 31.60%
EBI UniRef50: UniRef50_UPI000206484F, E-value 3e-64, identity 32.84%, UPI000206484F related cluster n=1 Tax=unknown RepID=UPI000206484F
NCBI RefSeq: XP_002155070.1, E-value 3e-56, identity 39.26%, PREDICTED: similar to predicted protein [Hydra magnipapillata]
NCBI nr blastp: gi|328790673, E-value 9e-64, identity 32.84%, PREDICTED: cell cycle checkpoint control protein RAD9A-like [Apis mellifera]
NCBI nr blastx: gi|380013996, E-value 6e-64, identity 32.29%, PREDICTED: cell cycle checkpoint control protein RAD9A-like [Apis florea]
Group
Gene Ontology: GO:0006281, E-value 3.7e-77, DNA repair
KEGG pathway: (none listed)
InterPro domain: [1-265] IPR007268, E-value 3.7e-77, Rad9
Orthology group: MCL14172, Single-copy universal gene

Nucleotide sequence:

>DPOGS206874-TA
ATGAAATGTCACGTACCTGGTCCGAATGTTAAAGTTTTAGCAAGAACTGTTCATGCTTTAGCAAGATTTGGTGATGAATTGTATTTGGAATCCCTCCCAGATTGTATACTATTGAGAACACTCAATGCCTCCGAAAGCGCCTATGCAATGGTTAAATTAAACAAAAATTTCTTTTCGCATTTTAATTATAATTATTATTCTATAGAAAATGATGGATTAAAATGTAAAATTTCCATGAAATCTGCATTAAATGCTTTTAAATCCCCTGCACATATGGACAAACAGGTTGAGAATCTTGAGATTAAACTTGATCCTCACTCTTCAAAATTAATATTTCAACTCAAATGTAAGCATGGTATTGTTAAGACCCATTATGTGTCAATATTGGATTGCAAAGCCATGCAAGCAATTTACACAAAAGATTTAGTGCCTAACAGAATAACATCATCTCAAAGGCTGTTCTCAGAAGCTATAGGAAATTTTCAATGTTCAGATGATCAAGTAACGCTAGAGGTGACAAGTGAATCACTTATAATTAAAAATTTTGGCGATACCCCTACAGACCTTTCAAGGATTATTAGAAGTCAAGTCACAATTAAACCGTTTGAGTTCAGCAGTTATACTATTGGGACGGACACCAATATTACATTTACAATGAAAGAATTCAGAGCATTGTTAGGCTTTGCTGAAGGCTTAAATCTGCCCGTGCAACTACATTTTGAGATTACCGGTAAACCTGCAGTATTTATAGTACATAATGGTACTACTATTGAGGCACATTTTGTATTAGCTACCTCAAAACCTGATATAGCAACTCAATATACATCTCAACAAACAACAAATTCAGAAAGAAAAAGGAAAGATGACTCAAATGATAACAATATCTCAGCTAAAAAGGCACATCTAGAAGATCTATCGAATCAGTTTCAAGAAGATTCAAATTTGTTTAATTATATACCAAATAATATATCACAGAATGTGATTGAAAATATAAATAATTTAGAAAAGGCAGATTCCATGGATACAAACCGTGATAACATTCCAGCATCGCCGACGTCCAAAATGAAAATAGCATCAGTATTTAAAAGGTGCTTTGAGAGCACTTTTGACCCTAGGATAATTCATGGTGTAGTGTTAGCTGAGAACTCTGATAGTGACTAA

Protein sequence:

>DPOGS206874-PA
MKCHVPGPNVKVLARTVHALARFGDELYLESLPDCILLRTLNASESAYAMVKLNKNFFSHFNYNYYSIENDGLKCKISMKSALNAFKSPAHMDKQVENLEIKLDPHSSKLIFQLKCKHGIVKTHYVSILDCKAMQAIYTKDLVPNRITSSQRLFSEAIGNFQCSDDQVTLEVTSESLIIKNFGDTPTDLSRIIRSQVTIKPFEFSSYTIGTDTNITFTMKEFRALLGFAEGLNLPVQLHFEITGKPAVFIVHNGTTIEAHFVLATSKPDIATQYTSQQTTNSERKRKDDSNDNNISAKKAHLEDLSNQFQEDSNLFNYIPNNISQNVIENINNLEKADSMDTNRDNIPASPTSKMKIASVFKRCFESTFDPRIIHGVVLAENSDSD-
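
Consistency check (illustrative sketch, not part of the original record): the transcript length of 1161 bp corresponds to 387 codons, that is, 386 amino acids plus one stop codon, which agrees with the listed protein length. The short Python sketch below translates the first and last few codons of DPOGS206874-TA with the standard genetic code and compares them to the ends of DPOGS206874-PA. The function name `translate` and the assumption that the trailing "-" in the protein sequence marks the stop codon are illustrative choices, not something the record states.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame DNA string; stop codons become stop_symbol.

    Hypothetical helper for illustration. Assumes the input length is a
    multiple of three and contains only the letters A, C, G and T.
    """
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 8 codons and last 7 codons copied from DPOGS206874-TA above.
first_codons = "ATGAAATGTCACGTACCTGGTCCG"
last_codons = "GAGAACTCTGATAGTGACTAA"

print(translate(first_codons))  # start of the protein
print(translate(last_codons))   # end of the protein, including the stop

# Length bookkeeping: 1161 bp -> 387 codons -> 386 residues + 1 stop.
transcript_bp = 1161
print(transcript_bp // 3 - 1)
```

Running the same function over the full nucleotide sequence and comparing the result with the full protein sequence would extend this spot check to the whole record.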