Monarch geneset OGS2.0

DPOGS200726
Transcript: DPOGS200726-TA (1893 bp)
Protein: DPOGS200726-PA (630 aa)
Genomic position: DPSCF300030 - 135283-139316
RNAseq coverage: 1241x (Rank: top 10%)
Annotation
Heliconius: HMEL008959 (E-value 0.0, identity 64.47%)
Bombyx: BGIBMGA001124-TA (E-value 0.0, identity 61.43%)
Drosophila: Rrp1-PB (E-value 2e-105, identity 37.13%)
EBI UniRef50: UniRef50_G6DB75 (E-value 0.0, identity 100.00%) Ap endonuclease n=7 Tax=cellular organisms RepID=G6DB75_DANPL
NCBI RefSeq: XP_001654881.1 (E-value 2e-113, identity 63.67%) ap endonuclease [Aedes aegypti]
NCBI nr blastp: gi|157127237 (E-value 3e-112, identity 63.67%) ap endonuclease [Aedes aegypti]
NCBI nr blastx: gi|157127237 (E-value 1e-120, identity 42.39%) ap endonuclease [Aedes aegypti]
Group
Gene Ontology:
  GO:0006281 (E-value 5.3e-144) DNA repair
  GO:0004518 (E-value 5.3e-144) nuclease activity
  GO:0003677 (E-value 1.2e-83) DNA binding
  GO:0005622 (E-value 1.2e-83) intracellular
  GO:0004519 (E-value 1.2e-83) endonuclease activity
KEGG pathway: dme:Dmel_CG3178 (E-value 1e-103)
  K10771 (APEX1) maps to Base excision repair
InterPro domain:
  [216-620] IPR004808 (E-value 5.3e-144) Exodeoxyribonuclease III xth
  [371-619] IPR000097 (E-value 1.2e-83) AP endonuclease, family 1
  [370-622] IPR005135 (E-value 2.6e-73) Endonuclease/exonuclease/phosphatase
Orthology group: MCL12262 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS200726-TA
ATGGCGCCACGTACAGCTAAGGCTAAGAAAAATGCTGATGTTAAAGTTTCTGAGGGTGAAGTTCCAAAAAAAGGTAGAGGGAAAGCTAAAAATGTGACAGAACAAGAACAAGTTGTCCCTGAAGTTGTTATTGATAAAAATCCCTCAGTAGAAAAAAAAACAAAAAGGGGTAAAAATGCAGTTGTTGAAAACAATGATGAGGCACAAATTATAGTCAAAGATGCACCACCAGCTAAAAAGGGAAGGAAGAAGGCTGTTGAGGAACCCATTCCTGAATCAAACTCTCTAAATGGAGATAACTCGGATGAAAACGAGCCAGCAGTTCAATCGGATGATAACGAACAGAATGAAGAACATTCTGAATCAAATGATAATGCTGAAGACACTCAAGCCAGTGCAGGCAAAGGTAGGAAAAAAGTTAACAAGAAAGAACCTATAGAAAAAGAGGTTAAACCTAAAGAAACTGGCAGAGGAAAAAAAAATGTGAAGCAGGAAAATATTACAGCAAAGAAAGATGCAGAAGAAAAACCTAAAGCTAGAGGGAGAGGACGTAAAGTACAGCCCAAGGCAGAAGACGTTCAAGATAATGATGAAGTAGAAGTTAAACCTAAAGGTAGAAAGAAGGCTCAACCAAAAGTAGTTGAAAAAGTACAGAAAGCTGATGATGAAGATGATGAACAAGAAGAAATACCAGATGAAGAAGCTGAAGAAGAGAAGCCAGTTGAAGAAGTAAAGAAGAAAGGTCGGAAGAATGCTGACAAAAAGACAACACAGAAAGAAGACTCCGAACAAAAAGATGATGAAGTTAAAGAACAAATGCCAGTCAGCAAAAGTCGGAAGGGTGCTAAGAAAGATGAAAAAGCAAAAGGAGACACAAAAGACGATGATAAAGATGATGTAGCAGAATCCAAACCGGTTAAAGGGAAACGTGGTCAAAAGAAAGCTGAAGCCAGCGAGCTACAAGATACGGGGGAACCGATAAACAAACGTCGCCGTAAAGATGACAAGGCCACCGAGGACAATAAAAAGAAAACTAAAGCCGCAACGGACTATGAATCTATTGATTTCTCTAACAAATCACAGACGTCTCAGGGTAAAGAGTGGAATTTTAAAATAGCCAATTGGAACGTGGACGGCATTAGGGCTTGGATGGGAAAAGGCGGATTGGACTACCTTAAATACGAAAAACCGGATATATTGTGTCTACAGGAAACGAAATGCGCTCTAGATAAATTGCCGTACGAAGTGAAAAATATACCCGGATATCACGCGTACTGGCTGTCTAGTGATAAAGATGGCTACGCCGGCGTAGGAATTTACACTACAAAGTTAGCTATGAATGTACAATACGGTTTACAAAACGAGGAATTGGATTCCGAAGGTCGGATAATAACGGCTGAGTACGAACAATTCTACTTAATATGCACGTACGTACCTAACGCGGGACGAAAATTAGTTTCACTGCCCAAGAGATTAAAGTGGAACGACGAGTTCAGGGAACACGTTAAGGCGCTGGACGAAAAGAAACCTGTCATTATATGCGGTGACATGAACGTGGCTCACAACGAAATAGATCTAACGAATCCAAAAACGAATAAGAAGAACGCCGGCTTCACGGAGGAGGAACGAGCTGGTATGACGGAGCTGCTCGGGGACGGATTCGTAGACACGTTCAGACATTTTCATCCTGAGAAAGTCGCTTATACGTTCTGGAGTTACATGGCCAATAGTAGAGCTAAGAACGTCGGATGGCGTTTGGACTACTTCATCGTGTCAGAGAGACTTTTACCGTCTATATGCGACAGTTCGATCCGCGGCGAGGTGTATGGGAGTGACCACTGTCCTATAGCACTCTACCTACACTTAACGAGCGCCGACAAACCCAAGGAATAG

Protein sequence:

>DPOGS200726-PA
MAPRTAKAKKNADVKVSEGEVPKKGRGKAKNVTEQEQVVPEVVIDKNPSVEKKTKRGKNAVVENNDEAQIIVKDAPPAKKGRKKAVEEPIPESNSLNGDNSDENEPAVQSDDNEQNEEHSESNDNAEDTQASAGKGRKKVNKKEPIEKEVKPKETGRGKKNVKQENITAKKDAEEKPKARGRGRKVQPKAEDVQDNDEVEVKPKGRKKAQPKVVEKVQKADDEDDEQEEIPDEEAEEEKPVEEVKKKGRKNADKKTTQKEDSEQKDDEVKEQMPVSKSRKGAKKDEKAKGDTKDDDKDDVAESKPVKGKRGQKKAEASELQDTGEPINKRRRKDDKATEDNKKKTKAATDYESIDFSNKSQTSQGKEWNFKIANWNVDGIRAWMGKGGLDYLKYEKPDILCLQETKCALDKLPYEVKNIPGYHAYWLSSDKDGYAGVGIYTTKLAMNVQYGLQNEELDSEGRIITAEYEQFYLICTYVPNAGRKLVSLPKRLKWNDEFREHVKALDEKKPVIICGDMNVAHNEIDLTNPKTNKKNAGFTEEERAGMTELLGDGFVDTFRHFHPEKVAYTFWSYMANSRAKNVGWRLDYFIVSERLLPSICDSSIRGEVYGSDHCPIALYLHLTSADKPKE-
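
The transcript and protein records above can be cross-checked against each other: 1893 bp corresponds to 631 codons, i.e. 630 amino acids plus a stop codon, which the protein record writes as a trailing "-". The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1). The function name translate and the stop_symbol parameter are illustrative and not part of the database; only the first and last codons of DPOGS200726-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    used in the protein record (an assumption about that convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 15 nt of DPOGS200726-TA, copied from the record.
cds_start = "ATGGCGCCACGTACAGCTAAGGCTAAGAAA"
cds_end = "AAACCCAAGGAATAG"

print(translate(cds_start))  # start of the translated protein
print(translate(cds_end))    # end of the translated protein, with stop
print(1893 == 3 * (630 + 1))  # length consistency of the two records
```

To check the whole record, pass the full nucleotide sequence to translate and compare the result with the DPOGS200726-PA sequence.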