Monarch geneset OGS2.0

DPOGS214793
Transcript         DPOGS214793-TA   1773 bp
Protein            DPOGS214793-PA   590 aa
Genomic position   DPSCF300059 - 695283-699754
RNAseq coverage    141x (Rank: top 55%)
Annotation
Heliconius       HMEL004977            0.0        61.84%
Bombyx           BGIBMGA012103-TA      4e-175     56.51%
Drosophila       tos-PA                2e-104     57.78%
EBI UniRef50     UniRef50_Q7PQB9       4e-109     53.02%   AGAP004491-PA n=1 Tax=Anopheles gambiae RepID=Q7PQB9_ANOGA
NCBI RefSeq      XP_001657573.1        8e-116     62.42%   exonuclease [Aedes aegypti]
NCBI nr blastp   gi|157112576          2e-114     62.42%   exonuclease [Aedes aegypti]
NCBI nr blastx   gi|157112576          9e-110     43.77%   exonuclease [Aedes aegypti]
Group
Gene Ontology    GO:0006281            7.2e-128   DNA repair
                 GO:0004518            7.2e-128   nuclease activity
                 GO:0003677            5.7e-20    DNA binding
                 GO:0003824            5.7e-20    catalytic activity
KEGG pathway     aag:AaeL_AAEL006209   2e-115
                 K10746 (EXO1) maps-> Mismatch repair
InterPro domain  [1-529]     IPR006084   7.2e-128   DNA repair protein (XPGC)/yeast Rad
                 [75-162]    IPR006086   2.2e-24    XPG/RAD2 endonuclease
                 [147-293]   IPR020045   5.7e-20    5'-3' exonuclease, C-terminal subdomain
Orthology group  MCL14827 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214793-TA
ATGTTGCAATCAAGGAACATAAAACCAATTTTAGTTTTTGACGGGAGACATCTTCCTGCTAAGGCAATGACTGAGCTGAAAAGAAGAGAAACACGAGATATATCTAAGAAAAGAGCAGCGGAATTACTTAGCTTAGGAAAGATTGACGAAGCGAGGTCCTTCATGCGTCGCAGTGTAGACATAACTCATGCAATGGCCCTAGCTTTGATAAAAGAATGTAGGAAACGTAACATAGACTGTATAGTAGCGCCATACGAAGCTGATGCACAACTAGCATACTTAAATATCAAGAACTACGCTCAGTTAGTTATAACAGAAGATTCCGATTTAATACTTTTTGGTTGCACAAAGGTCCTATTCAAAATGGATTTAGAGGGTAATGGAACTTTAGTAGAAACTATCAAGCTGCCTCTAGTTATGAAATGTCCCATAGAACATTACACATTTGATAAATTCAGGAGAATGTGTATATTGTCGGGGTGTGATTATTTAAACTCGCTACCCGGCATCGGTTTGGCCAAAGCGCGTCAGTTTGTCAATGCTTCACAGGACACAAATTTCGCTAACGCCCTAAAAAAGCTGCCAAGTTTTTTCAACAGATCATTGCAAGTGAGTGATGATTATAGAGAGAATTTTCTCAAAGCCGAAGCAACATTCAAACATCAGTACGTCTATGACCCTTCACAGAGATGTATGACCCGACTCACACCTGTTTATGATGAAGAAATCGAAGCGGCTTTGTGTTCTAATGCCGGAGAGCTTTTGGATCCTCAGATAGCCTTCCAGTTAGCTTTGGGTAACTTAGACCCTTTCACATTGAAGAAGATGGATAATTGGGATCCCGATAGTAGAAGTGATGTGACAGATCATATAAGGAGTTCAAATTGGAAGGATGCGGGGGTATCAAATAAGCCAAGTATATGGAGTGAGTCTTACAAGGAATATTTAGATGAATCTCAACCTTGGATGAAAAAGGTTCAAAAACAGGAACCCATTATATCAACACAAACACGATCAAGGAAGAAGGTTGTCACCTTAACAACTAAATATGTACCGGAGACACAGGATGATAGTTTATCGATAGAGACACTCAGCGGCATGTACTGTATGGAACCAGCCAGCAAGAAACAGAAAGTTGAACAAAAGAAAAATAACATCAATATAGACTATGATAGAAATAATTTTAATCTCAAACAGAAATCACCCATACTGGAAAACAAAGGCAGATCGTTCAAGAAGTGTCTCAGTTCCGGTAGTTTTTCAGTTTTGAAGAAATTGAGCGCTTTCCCAAGGACAGTTCTGGATGATGATATCATTGAAAGCAAATTCTTCAGTTCGTGCGAAAAGGACTCCAATGATACGTGTAACAGAGTTGATAATCAGACGATAATACAGGAATCACCGGAAAAAGATTTAGATACTGCTATGATAGACACATGCACAGGTTCCAGCTCACAGAAAGAGAATTCTCCCAGTCCCGCAAAGAAAAGTCCTATATTAGTGAGCCCTAGAACAAGAAATCCGTTCAAGTTAAAAGACTCACAGTCAACAAACGACACGGGTTTCAGTGAGTCCGTCATAGAAAATACATGTCCCATTGAATCTGAGCCACCCATATGTACCTCGCCCATCAAACCCGCTGTTCCTCAGGATAAGTTTAAGTTCAATCAAAATATAAAGAAGACAAAAGCCCCTGCTAAAAAGGTTCAAAGTTCTCAACCAACTCTCTTAAGTATGTTCGGTTTTCAGAAAAAGCCAGTATTAAAAAGGTGA

Protein sequence:

>DPOGS214793-PA
MLQSRNIKPILVFDGRHLPAKAMTELKRRETRDISKKRAAELLSLGKIDEARSFMRRSVDITHAMALALIKECRKRNIDCIVAPYEADAQLAYLNIKNYAQLVITEDSDLILFGCTKVLFKMDLEGNGTLVETIKLPLVMKCPIEHYTFDKFRRMCILSGCDYLNSLPGIGLAKARQFVNASQDTNFANALKKLPSFFNRSLQVSDDYRENFLKAEATFKHQYVYDPSQRCMTRLTPVYDEEIEAALCSNAGELLDPQIAFQLALGNLDPFTLKKMDNWDPDSRSDVTDHIRSSNWKDAGVSNKPSIWSESYKEYLDESQPWMKKVQKQEPIISTQTRSRKKVVTLTTKYVPETQDDSLSIETLSGMYCMEPASKKQKVEQKKNNINIDYDRNNFNLKQKSPILENKGRSFKKCLSSGSFSVLKKLSAFPRTVLDDDIIESKFFSSCEKDSNDTCNRVDNQTIIQESPEKDLDTAMIDTCTGSSSQKENSPSPAKKSPILVSPRTRNPFKLKDSQSTNDTGFSESVIENTCPIESEPPICTSPIKPAVPQDKFKFNQNIKKTKAPAKKVQSSQPTLLSMFGFQKKPVLKR-
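
Consistency check (illustrative sketch):

The transcript and protein records above can be cross-checked against each other. The snippet below is a minimal sketch, not part of the original record. It assumes the standard genetic code and assumes the trailing "-" in the protein record marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here. It translates the first ten and the last five codons of DPOGS214793-TA and confirms that 1773 bp corresponds to 590 residues plus one stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop="-"):
    """Translate an in-frame coding sequence; stop codons are written as `stop`."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop)


# First 30 nt and last 15 nt of DPOGS214793-TA, copied from the record above.
head = "ATGTTGCAATCAAGGAACATAAAACCAATT"
tail = "GTATTAAAAAGGTGA"

print(translate(head))        # MLQSRNIKPI
print(translate(tail))        # VLKR-
print(3 * (590 + 1) == 1773)  # True
```

Both fragments agree with the start (MLQSRNIKPI) and end (VLKR-) of DPOGS214793-PA. Running `translate` on the full nucleotide sequence would extend the same check to every residue.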