Monarch geneset OGS2.0

DPOGS207019
TranscriptDPOGS207019-TA990 bp
ProteinDPOGS207019-PA329 aa
Genomic positionDPSCF300001 + 1375848-1377823
RNAseq coverage147x (Rank: top 54%)
Annotation
HeliconiusHMEL0106278e-11061.16% 
Bombyx% 
Drosophilaslx1-PB1e-5446.09% 
EBI UniRef50UniRef50_D2A4Q15e-6953.81%Putative uncharacterized protein GLEAN_15403 n=1 Tax=Tribolium castaneum RepID=D2A4Q1_TRICA
NCBI RefSeqXP_971859.11e-6953.81%PREDICTED: similar to GIY-YIG domain containing 2 [Tribolium castaneum]
NCBI nr blastpgi|3072050954e-6952.21%GIY-YIG domain-containing protein 1 [Harpegnathos saltator]
NCBI nr blastxgi|910843011e-7053.81%PREDICTED: similar to GIY-YIG domain containing 2 [Tribolium castaneum]
Group
Gene OntologyGO:00062811.4e-10DNA repair
GO:00056221.4e-10intracellular
GO:00045181.4e-10nuclease activity
KEGG pathwaymdo:1000144492e-34 
 K13882 (CORO1A)maps-> Phagosome
InterPro domain[11-85] IPR0003051.4e-10Excinuclease ABC, C subunit, N-terminal
Orthology groupMCL12768 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207019-TA
ATGACAGAACCAGAAATTGTTGAAGATTTCTATGGCGTATACCTGCTTTATTGTATAAACCCTAAATATAAAGGTCGTACCTACATTGGTTACACTCGGGACCCAAATAGAAGAATTATACAACATAACCGTGGCACCTGGGCTGGGGGAGCTCATAGAACTAGTAAAAGGGGACCATGGAAAATGGTTATGATTGTGCATGGATTTCCAAATAACATTTCAGCATTGAGGTTTGAATGGGCCTGGCAAAACCCAGGCAAGACAACAAGATTACAACACCTGGGGTTGTTTAAGAATGGTCGGAAGGAAACTGTGTTCCAGTTCAAATTACGAGTCCTCAGTGAGATGCTCAGAGTGGGGCCTTGGTGCCGTCTGCCATTGGTCATAAGATGGCTGGAAAATGATTTCAGGGAAGAGTTTCCAGAAGCAAGGATGCCTCCTGAACACATGATTATATGCCAAGGCCCTGTAAAGAGTCATAATCTTAGAAACACAACAAACACCTCATCACCTGACATAATTTGTCGTTTATGTTCTGGACGTTTGAAAGCCTCAGAGCAACTGAGCTGTCCGAACTCAAATTGTGACTTAGTCGCTCACATAACATGCTTGGCGGACAAGATGTTGCCTCCAGGAGAATATATACCAATTGATGGTAAATGTCCGCTATGTTATTTGAAGCTGAAGTGGGGAGATTTGATAAGAAAAATGAAAGGTTGCTTGGACGCTGATATTAATGATGTATTAAGCCAGAATAAGGTTTTAGAGGACTGTGGTAACTTTGTCAATTATGACGTAATGGAGTCATCTAATGATGATGACGTATATACTCAAGATGATTTATCATTAAGGGCTAGTGATTCGTTAAATGGTGATATAAGAGATTCTGATGGTGAAAACGAAGACCAGGACGTCATTCGGATCCCAACACACGTTCTGGATGAACCGTCCTGGTTTACAGACTGTCAAGATGATATAATAAATATGTAA

Protein sequence:

>DPOGS207019-PA
MTEPEIVEDFYGVYLLYCINPKYKGRTYIGYTRDPNRRIIQHNRGTWAGGAHRTSKRGPWKMVMIVHGFPNNISALRFEWAWQNPGKTTRLQHLGLFKNGRKETVFQFKLRVLSEMLRVGPWCRLPLVIRWLENDFREEFPEARMPPEHMIICQGPVKSHNLRNTTNTSSPDIICRLCSGRLKASEQLSCPNSNCDLVAHITCLADKMLPPGEYIPIDGKCPLCYLKLKWGDLIRKMKGCLDADINDVLSQNKVLEDCGNFVNYDVMESSNDDDVYTQDDLSLRASDSLNGDIRDSDGENEDQDVIRIPTHVLDEPSWFTDCQDDIINM-