Monarch geneset OGS2.0

DPOGS200346
TranscriptDPOGS200346-TA2205 bp
ProteinDPOGS200346-PA734 aa
Genomic positionDPSCF300026 + 533791-535995
RNAseq coverage54x (Rank: top 69%)
Annotation
HeliconiusHMEL0000400.072.33% 
BombyxBGIBMGA005640-TA0.069.97% 
DrosophilaHel89B-PB3e-7636.67% 
EBI UniRef50UniRef50_D0AB880.072.33%Putative DNA excision repair protein ERCC-6 n=53 Tax=Heliconius RepID=D0AB88_9NEOP
NCBI RefSeqXP_001602814.10.046.53%PREDICTED: similar to hCG32740 [Nasonia vitripennis]
NCBI nr blastpgi|2613359500.072.33%putative DNA excision repair protein ERCC-6 [Heliconius melpomene]
NCBI nr blastxgi|2613359500.072.16%putative DNA excision repair protein ERCC-6 [Heliconius melpomene]
Group
Gene OntologyGO:00036772.1e-76DNA binding
GO:00055242.1e-76ATP binding
GO:00043865.7e-20helicase activity
GO:00036765.7e-20nucleic acid binding
KEGG pathwaynvi:1001189520.0 
 K10841 (ERCC6, CSB, RAD26)maps-> Nucleotide excision repair
InterPro domain[1-279] IPR0003302.1e-76SNF2-related
[334-417] IPR0016505.7e-20Helicase, C-terminal
[1-167] IPR0140011.5e-07DEAD-like helicase
Orthology groupMCL16948 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200346-TA
ATGGGTTTAGGGAAAACTGTCCAAGTAATAGCATTCTTAGCTGGGCTTTCTATGACTGACAGTGGGTCTTGGGGAGGTCTTGGTCCTTGTATAATTCTGTCTCCTGCCACAGTTATATATCAGTGGGTATCACACTTTCATTACTGGTTTCCACAAATAAGAGTTGCAGTTCTTCATCACTCAGGATCACATGCGGGAAGTCACCATAAGCTTATCCGTGACATGCACTCCTCACATGGCATTTTGCTAGTTACATACGCTGGCATTGTGAAATACATAAAAGATCTTTTATCAAGAAAATGGCATTACATAATTTTGGATGAGGGTCATAAAATAAGAAATCCAGACACCCAAGTCAGTAAAATGGTGAAGAGGTTTGAAACATCTCATAAACTTCTAATTACAGGTTCCCCCATGCAGAATAGTTTACAAGAATTATGGTCATTATTTGATTTTATGAGGCCTGGCCTATTAGGGAGTCACACTGCTTTTATGGAGCATTTTGCTGTTCCTATTACCCAAGGGGGATATGCTAATGCTAGTGAATTCCAAGAAGCTACTGCATTAGAAATTGCAAAGGCTCTTAAGAATCTTATCACTCCATATATGTTGCGGAGAACAAAAACTGAAGTTCAGGATCACATTCAATTACCAGAAAAAAATGAGCAAGTATTATTTTGTTCACTTACTCAGGAGCAAAAGGATTTGTACATGGGCTATCTTATGAGCAGTACCATCCGAAGCATATTAGACAAGGACAGTAAACATGGAGAACCAATGAGAGCTAGGATACTTGTTGCATTATCAACTCTTAGAAAAATATGTAACCATCCTGACATATATCTCTATGAAGCTTATGAGGAGACTGATGATATTGATGAAAAATCTTTCGGTAATTGGAAAAGATCTGGTAAAATGTCAGTGGTACATTCCTTGTTAAAAATATGGCTCAAGCAGGGACATAGGGCTTTAATTTTTACTCAATCAAGGGCAATGCTTTGCATTTTAGAACAACATTTGCAAAACCACAGCTTTAAGTATTTAAGAATGGATGGTAGTGTTAATGTGGGTGTGAGGCAGAATTTGATTAAAACATACAATGAAAATCCTGAATATTTGGTATTTTTAGCTACCACAAGAGTTGGTGGTCTCGGTGTTAATTTAACGGGAGCTGATAGAGTTATAATATATGACCCTGATTGGAACCCAGCAACAGATAACCAGGCCAAAGAAAGAGCCTGGAGAATTGGCCAGGAGAGAAATGTCACAGTTTACCGATTGTTGTCAGCCGGAACAATAGAAGAAAAAATATATCAAAGACAGATATTTAAGAATTTCTTAAGTAATAAAATATTAATAGATCCCAACCAAAAAAATGTATTAACAACTAGTAATTTGCAAAGTTTATTTAGCTTGGAGAATTTAAATTATGATGGAGATACTGAAACTACTGCCCTTTTTAAGCATACTAAGGTAAATATCAATGGTAAAAAAAATTACAAAAGTGATTTGAGTAAAGGTATGTCTTATTCTAAGAAAAAAATAGAAGCCATGAAAAGACGAGCTAGAGAGATAAGTAAACAAATCAAAAAGTATGCAGAAGTAGGATCATCAGCACCAAAGGATCCGAGACAGGCATATAAAGAAAAAAGAGATTTAATGTTAAATCCTCTACCAAAAGAAGACGAACCGGAAATAAATATCGTTAATAATGAAATCACAAATGTGCCATTTGAACATGCTCTCTCTGAATCAGATATAGTTTATCAACATACAAAGAAGGATTATGAAAATAATTTAATAAAAGAAGCCTTAAGTTCAGCAATTGAAATTCAAGAAAAAAATGATGAAAAGGAGACAGTAGAAGAAAATGTAGTTGCTGAAAATCCACCATTAGATTCAATTTCGATAGCAAATGATCTACAGACAAAAAAGGAAAGTAAAAAACGCCAATCATCCCCAGATACAAAAATTGATAATTTAGTTGCAATTAAAAAAGCGAAAATTGAGAAATCTAAAAAAGACGTCAAGCTCGGTGATGACGATTATGTTTTAAGTAAACTGTTTGCGAAGTCTGCTGTTAAAAATGCTTTACATCATGATGGAGTTGTTGGTTTGGCTGAAAAGGAAAAAAGACACAGAATTAAAGAAGATGCTATCAAAGCTGCCAAAAAAGCCGTTAGGGCTATAAAATTTTCTTCTTAA

Protein sequence:

>DPOGS200346-PA
MGLGKTVQVIAFLAGLSMTDSGSWGGLGPCIILSPATVIYQWVSHFHYWFPQIRVAVLHHSGSHAGSHHKLIRDMHSSHGILLVTYAGIVKYIKDLLSRKWHYIILDEGHKIRNPDTQVSKMVKRFETSHKLLITGSPMQNSLQELWSLFDFMRPGLLGSHTAFMEHFAVPITQGGYANASEFQEATALEIAKALKNLITPYMLRRTKTEVQDHIQLPEKNEQVLFCSLTQEQKDLYMGYLMSSTIRSILDKDSKHGEPMRARILVALSTLRKICNHPDIYLYEAYEETDDIDEKSFGNWKRSGKMSVVHSLLKIWLKQGHRALIFTQSRAMLCILEQHLQNHSFKYLRMDGSVNVGVRQNLIKTYNENPEYLVFLATTRVGGLGVNLTGADRVIIYDPDWNPATDNQAKERAWRIGQERNVTVYRLLSAGTIEEKIYQRQIFKNFLSNKILIDPNQKNVLTTSNLQSLFSLENLNYDGDTETTALFKHTKVNINGKKNYKSDLSKGMSYSKKKIEAMKRRAREISKQIKKYAEVGSSAPKDPRQAYKEKRDLMLNPLPKEDEPEINIVNNEITNVPFEHALSESDIVYQHTKKDYENNLIKEALSSAIEIQEKNDEKETVEENVVAENPPLDSISIANDLQTKKESKKRQSSPDTKIDNLVAIKKAKIEKSKKDVKLGDDDYVLSKLFAKSAVKNALHHDGVVGLAEKEKRHRIKEDAIKAAKKAVRAIKFSS-