Monarch geneset OGS2.0

DPOGS209112
TranscriptDPOGS209112-TA1398 bp
ProteinDPOGS209112-PA465 aa
Genomic positionDPSCF300483 - 48351-52818
RNAseq coverage119x (Rank: top 58%)
Annotation
HeliconiusHMEL0108131e-4766.94% 
BombyxBGIBMGA001792-TA7e-3851.52% 
DrosophilaSnm1-PA1e-1029.53% 
EBI UniRef50UniRef50_D6WZE52e-4439.30%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZE5_TRICA
NCBI RefSeqXP_001122361.19e-4639.55%PREDICTED: similar to Artemis protein (DNA cross-link repair 1C protein) (SNM1-like protein) (A-SCID protein) (hSNM1C) [Apis mellifera]
NCBI nr blastpgi|1107595532e-4439.55%PREDICTED: protein artemis-like isoform 1 [Apis mellifera]
NCBI nr blastxgi|1107595538e-4439.55%PREDICTED: protein artemis-like isoform 1 [Apis mellifera]
Group
KEGG pathwayame:7266383e-45 
 K10887 (DCLRE1C, ARTEMIS, SCIDA)maps-> Non-homologous end-joining
    Primary immunodeficiency
Orthology groupMCL16381 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209112-TA
ATGTTTAGGAAAGCACAAACCGCCTTCCACGGCGCTATAGAAGAATTACCGGGTATTTACGTTGATAATTTCGAAAACGCTGCTAAAGTAAATGCCAGAGCTTATTTTTTAAGCCACTGCCACGCTGATCATATGCACGGATTGAGTTCTGAGGAGTTAATGGCTACGCTGAAAAAGAGTGGAGCCAAGATTTACACAACTGAATTATCTGCAGCCATTATAAAGACCGATGTAAATAAAGATATCGGTGATCATGTACAGAGTTTGAAAATGGGTGGTACACAAATATTAAGTTTTCCTTCCATACCCGAACAGAATATTCCAGAACTACTTCTCACCGTGACTCTCATTCCGGCCGGCCACAGCGCCGGTTCGACTATGTTTTTGTTCAGGACCACGACTAAAACTATTCTATTCACTGGTGACTTCAGAATGAACCCAAACGATTTGCCCAAATATTCGGCACTTCATGACGACGGCCACCCTATAAAGTTGACCAGTCTCTATGTAGACACAACCTTCTTAAGCTACAATTACGACAATTTTCCCAAACGTAGCGAGAGTATAGAAAAAATGTGCTCCGAAATCAAGAAGTGGTTGAGTTACGAACAAAATGCCGTGTCCTTGCACACCTCAGCCAAGTATGGTTACGAGTTTGCATTCAATGAGATATATCGGAGGTTGGGTTTGAAGGTACATGTACCGACGGAGAGGTGGAGTTTGTACAGTTCCATACAACATTTGGTGCCCGGTGTCACAAACGAGTCGACAAAGATACATTTATGCAAGAAACATGTCACCGACCAAAGTCATCAGCTTCCCAAACCCTTACAGGCCACTCAAGAGGAAGATCCAGGCATACTGATTTATATAACGAGCGTGTTTTATCCCAAAAACGTGAAACCTATTAACCACACGAGACCGTCGTCTAAATACGACAATCCTAATTTCATTAGAGACAGATATTATGATGAAATATGGAAACCGATGGTTAGCGACAGCTGGAAGAACCGTTTCCGCCAACCATTTCATCCAGCGTGGATGGATTACTGCGATCCTTATCATTGTAACGATTACCACAAAATAGCCTGCGGTTTGAACCGGATGACAATGAGGTTCAAGTGGTTTCAGAGCCAATGTCACATCATTCTGAACAATATGTGTTCAAACTACAGGGGATCTCTGCAATACGACGTCGTAGATACGAAATACTGTTCGTATTACGTAATGTTCCTACGGACAGGTTGTCCGAATGTCTGTCCTGATGTATTGGAGCCAGTTTGTTGTATGAGCACCGTCGATAGCCATGTGGTATTGTTTAAGAACAGTTGCGAAATGGAGAAAGCTAATTGTAAGGGCGGAATGTTGGAAGGTAAGTTGCCCGTTATACGACAATAG

Protein sequence:

>DPOGS209112-PA
MFRKAQTAFHGAIEELPGIYVDNFENAAKVNARAYFLSHCHADHMHGLSSEELMATLKKSGAKIYTTELSAAIIKTDVNKDIGDHVQSLKMGGTQILSFPSIPEQNIPELLLTVTLIPAGHSAGSTMFLFRTTTKTILFTGDFRMNPNDLPKYSALHDDGHPIKLTSLYVDTTFLSYNYDNFPKRSESIEKMCSEIKKWLSYEQNAVSLHTSAKYGYEFAFNEIYRRLGLKVHVPTERWSLYSSIQHLVPGVTNESTKIHLCKKHVTDQSHQLPKPLQATQEEDPGILIYITSVFYPKNVKPINHTRPSSKYDNPNFIRDRYYDEIWKPMVSDSWKNRFRQPFHPAWMDYCDPYHCNDYHKIACGLNRMTMRFKWFQSQCHIILNNMCSNYRGSLQYDVVDTKYCSYYVMFLRTGCPNVCPDVLEPVCCMSTVDSHVVLFKNSCEMEKANCKGGMLEGKLPVIRQ-