Monarch geneset OGS2.0

DPOGS210816
TranscriptDPOGS210816-TA1173 bp
ProteinDPOGS210816-PA390 aa
Genomic positionDPSCF300027 - 612549-614785
RNAseq coverage77x (Rank: top 65%)
Annotation
HeliconiusHMEL0085161e-14064.97% 
BombyxBGIBMGA007132-TA2e-11471.32% 
DrosophilaCG9272-PA1e-8355.73% 
EBI UniRef50UniRef50_UPI0000E479261e-8864.20%UPI0000E47926 related cluster n=3 Tax=unknown RepID=UPI0000E47926
NCBI RefSeqXP_001623365.19e-9066.07%predicted protein [Nematostella vectensis]
NCBI nr blastpgi|3287207361e-9268.16%PREDICTED: endonuclease III-like protein 1-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287207367e-9160.54%PREDICTED: endonuclease III-like protein 1-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00062814.3e-58DNA repair
GO:00038244.3e-58catalytic activity
GO:00062844.8e-53base-excision repair
GO:00036773.3e-06DNA binding
KEGG pathwaynve:NEMVE_v1g1378073e-89 
 K10773 (NTH)maps-> Base excision repair
InterPro domain[39-258] IPR0112574.3e-58DNA glycosylase
[84-234] IPR0032654.8e-53HhH-GPD domain
[165-269] IPR0231702.2e-32Helix-turn-helix, base-excision DNA repair, C-terminal
[146-169] IPR0004453.3e-06Helix-hairpin-helix motif
Orthology groupMCL13326 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210816-TA
ATGCTTGATTTAAACAAATTTAAATTTGAAAAGAAGCCACCTGTAAAAATTGAATTTGATAAGGAGTCTCCCACTAAACAGGATCAAGAAGTTTTGTGGGAACCACCGAAATGGCGAGAATTTTTGATAAATTTGAGAAATATGAGAGCAAACAACGATGCTCCTGTGGATTCAATGGGTTGTCACATGTCCATGGATGAAGATGCTCCTCCAAAAGTAATGAGGTATCAAAGTCTAATTTCCCTCATGCTGTCCAGTCAAACCAAGGATCAAGTTACATTTGCAGCCATGGAAAGACTAAGAGCTAAAGGACTGACGGTGGACAACATCTTGGATATGAGTGATGAGGAATTAGGTCAACTTATTTATCCAGTAGGGTTTTGGAAGACTAAAGTAAAATACATAAAGAAGACAACACAAACATTGAAAGATCAGTACGATGGAGACATACCAGACTCGGTGGATAAACTCTGCAAGCTTACCGGAGTTGGACCTAAAATGGCACATATTTGTATGAAAGTTGCTTGGAATAAAGTGACTGGCATTGGTGTTGACACCCACGTCCATAGAATAAGCAACAGAATAGGATGGGTTAAAAAATCTACATCTACACCAGAAGATACTCGAAAAGCATTACAATCATGGCTGCCATTTGAGCTTTGGAGTGAAGTCAATCATTTAATGGTAGGATTCGGTCAGACGATCTGTTTACCCATCGGACCCAACTGTCAGGAATGTTTAAATAATGATATTTGTCCTTCAAGCGAGAAGGATAAGAAGTCTCCATATAAGAGGTCACCAAAGAAATCACCAGCAAAGATTATTAAAAGTGAACCAATGGAAATGGGTTTGGATAAAATCAACAATCATGAGGTTAAAGAGCTAACCCACACAAGTTTACAAGATGGAAATGCTGATATTCTTAAAGTGAAAGGTTTAATTTCATCCAAACTAGAAAATGAAACGGTTGTAAAAACTACAAAATCACCCAAACAAGAAGTTCAAACATGCAATTTGTTAGAAAACATAGAGTGTCCTGACATCGTGATAACTAATGACAGGAGTTCTAAGAAAATCCCTTCAGAAATAAAAAAACGAAAGTCACCCAGAGTACTAAAACAGAGTTTGGCCGCTAGCGATACAAAGATAAAAAAGATAAAACAAAAGAAATGA

Protein sequence:

>DPOGS210816-PA
MLDLNKFKFEKKPPVKIEFDKESPTKQDQEVLWEPPKWREFLINLRNMRANNDAPVDSMGCHMSMDEDAPPKVMRYQSLISLMLSSQTKDQVTFAAMERLRAKGLTVDNILDMSDEELGQLIYPVGFWKTKVKYIKKTTQTLKDQYDGDIPDSVDKLCKLTGVGPKMAHICMKVAWNKVTGIGVDTHVHRISNRIGWVKKSTSTPEDTRKALQSWLPFELWSEVNHLMVGFGQTICLPIGPNCQECLNNDICPSSEKDKKSPYKRSPKKSPAKIIKSEPMEMGLDKINNHEVKELTHTSLQDGNADILKVKGLISSKLENETVVKTTKSPKQEVQTCNLLENIECPDIVITNDRSSKKIPSEIKKRKSPRVLKQSLAASDTKIKKIKQKK-