Monarch geneset OGS2.0

DPOGS201295
Transcript        DPOGS201295-TA  3312 bp
Protein           DPOGS201295-PA  1103 aa
Genomic position  DPSCF300176 - 363514-367427
RNAseq coverage   302x (Rank: top 37%)
Annotation
Heliconius       HMEL017245        0.0     61.74%
Bombyx           BGIBMGA003045-TA  0.0     56.83%
Drosophila       mus201-PA         8e-90   43.19%
EBI UniRef50     UniRef50_D6WM39   7e-176  37.69%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WM39_TRICA
NCBI RefSeq      NP_001153363.1    3e-98   47.49%  mutagen-sensitive 201 [Nasonia vitripennis]
NCBI nr blastp   gi|270006914      3e-175  37.69%  hypothetical protein TcasGA2_TC013347 [Tribolium castaneum]
NCBI nr blastx   gi|307206657      0.0     36.32%  DNA-repair protein complementing XP-G cells [Harpegnathos saltator]
Group
Gene Ontology    GO:0005634  1.2e-216  nucleus
                 GO:0006281  1.2e-216  DNA repair
                 GO:0006289  1.2e-216  nucleotide-excision repair
                 GO:0004518  1.2e-216  nuclease activity
                 GO:0003697  1.2e-216  single-stranded DNA binding
                 GO:0004519  1.2e-216  endonuclease activity
                 GO:0003677  5.6e-26   DNA binding
                 GO:0003824  5.6e-26   catalytic activity
KEGG pathway     nvi:100117629  8e-98
                 K10846 (ERCC5, XPG, RAD2)  maps-> Nucleotide excision repair
InterPro domain  [2-1058]   IPR001044  1.2e-216  Xeroderma pigmentosum group G protein
                 [2-1058]   IPR006084  1.2e-216  DNA repair protein (XPGC)/yeast Rad
                 [1-98]     IPR006085  1.9e-44   XPG N-terminal
                 [718-787]  IPR006086  1.5e-29   XPG/RAD2 endonuclease
                 [787-949]  IPR020045  5.6e-26   5'-3' exonuclease, C-terminal subdomain
                 [789-822]  IPR008918  5.4e-09   Helix-hairpin-helix motif, class 2
Orthology group  MCL16598  Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201295-TA
ATGGGAGTGACTGGCCTGTGGAGACTTATCGAGCCGGCCGGAAAACCCGTGCCTGTTGAAACATTAGAGAATAAAGTTTTAGCAGTCGATATTTCAATATGGTTGCATCAGATGGTGAAAGGCTATCAGGATGCTAAAGGAGCTCCTCTACCGAATGCTCATCTCATTGGGTTGTTTCAAAGACTATGTAAATTACTATATTTTAGAATCAAGCCAGTTTTTGTATTTGATGGTGGATTTCCAGATCTTAAAAGAGAGACAATTGCTAAAAGACAAGACAACAAAACGAAATATAATTCTGCATCAGAAAAATTAAAGAGAGAAATCACTTTGCTTTTGGGCAAAAAAACTGCAATTGGTTCATTGCTAGGAAAACAGATTTCTCCCACTAAGAATAAACAACCTCAGGCTAACGATGACATCTTTAAACTACCGGAATTACCAGAAAAGGGTGCATATTCTGAATCTGAGTCTGAAGATGAACAAGATTCAAGTGCATCAACAGTGGACTTACACTCTGTAGATTTTGACTCAGATAAATTTAAAAATATGCCCATAAAAGAAAAATATGACCTTCTAATTGAACTGAAGGAAACAAGGAAAATGAATTCTTGGGGCAAAATAAACACATTACCTAAGAAAAGTGACAACTTTTCGGATTTTCAAATGCAGAGATTGCTAAAACGGAGGCAACTACAAGAATGTTTGGAAGAAACTGAAAAAGAGATGGGTGATGAGGGCATGTCCTTAAACGAATTGGAATCATTGCTCAATGAAGAGGGCATAGACACCAAAATAGAGAGCTTGCCGAGCCGCAGAATAGCGTCAAATAATACAACGAGATATCTACTTATAAGTAATGTAAGGCAGGCGTTGGAAAATGCTAAGAGAAAGGAAGAGGCTGCCCAACAAGCACAAATGCAAATATCAGCGACAACTGAAGTAGAAAAACATGATACCAAAGATATTCAGAAAAATGATGAATGCGACGATGATTTAGAAAAGGCTATTAAAATGTCGCTGGAATGTGTGGAAGAGGCCGACACGAGCGCTTGTACGTCAAAAACTGATGAGTCCTGGACATCTTTCCTCACAGAATCTGATTATTCAGATGACGAAGACGAAGATGGATTCGCCCCTCCTGATATGACGTCAGCAAAATCATACATCAAACAATATACCGACTTAAACTACAAAGTCATTGATAACATTGTTGCCGCAAAACAAAAAGAAAAAAATAAACCTTCTAAACCAAGAGTTAATGAAATCATTGAAGAGCTCGCACAAGAAAAGACTATTATAGAAGATGAGATTGAGTTAATGTCTAGTGATGATGACAAAGATAAGTGTGAAGTAGTAGAAAAAGAAAACGATAAAGATAAGTCATGTGCTGTAGAAGCGGTAATGGAAAATGAAGTCATAAGTAAAAAGTGTGAATTGGAAGAAAGCAATCATGAAGAGGCTTCTATAATATGTGTTGAGAAATCTATTGCTGATGTTATTTCACTTGATTCAAGCTTAGGAGAAGCGGAAACAGATAACGATGAAGTTAAAGTTGAACCTGCGAAAGATAATATTAAATTAGAAAGTTCAAGCTCCAGTGAGGATGACTTCGAAGATGTTTCTGATGAAGAGACCGAGTCTAAGAAACCAGTTGTTACATTAACCCTCAATATGGGCAATACGATCGAAGATGATATATTTGCCGATATATTTGAAAGTAAAGCTGGTGAAAAATGTTTAGCACCTAATATAAAGGAAGAGCAAGAAATAAAGGAAGATGTTAGCAAACACGAGAACGAAGTGAAAGATATAATTAAATCAAATGTCACAGCTGAAGTTGAAACAACTTCAAAAAGTATAGAAACAACTTCAAAAAATATAGAAACATGCTTGAATGAAGAAAGTGAAAAACAGAGAAATGAAGATCAAAAAGATGTAGTGGATAGAGACCAAACAGTATCAGAGGAGCTTGCTACAACACAAAAGGCAGTGATACC
AGAAAGACAAAAAATATCAGCTGAAGAACTTAACACTATGGTAACGGAAATTGAGAATGAGGAACAGTTACTGCTTCAAGAAAAGGGCAAATTAGATCGTATTGGTCGCAACATAACTGAACAGATGACCAAAGAGGCACAGGAACTGCTTCAAATCTTCGGCATCCCGTACATCGTAGCTCCGATGGAGGCTGAAGCGCAGTGCGCGGTGTTGGAGGCTCTCAAACTTACCGACGGCACCATAACAGATGACAGTGACATTTGGCTGTTCGGGGGCAGAACGGTGTACAAAAACTTTTTCAATCAGAAAAAACATGTCTTGCAATTTTTGCGGGAACGAATTGAAAAATCGTTTAATTTGAGTCGTGAGAAGTTGGTGCTGCTGGCTCTGCTCGTGGGAAGCGATTACACAGTCGGAGTTACTGGCGTGGGACCTGTGACCGCTTTGGAGATTTTGGCTTCATTTCCTTTTAACAAGAAAAAAACAATAGCTGAAGACGCAAAATTCACCGATTATCAAGAAATTGTAGCGGGACTGCAAGATTTTAAGAAATGGGTGAAGGCGGGGAGAAGAACGGATAATGTTAGTTTGAAGAAGAAACTAAAAAATGTCAGCCTCTCGGATGACTTTCCCAGTGTGAGGGTAGTTCAAGCATATTTCGAACCGAATGTAGAAAAGAGCAGTGAAAAGTTCTCCTGGGGCGATCCAGACATCACTGAGTTAAGAGAGTACGCGAGGGCCAAGTTTGGGTGGTCCCAACACAAGTTGGATGAAATAATAAAACCAGTCATTAAAAGGATGCAAGAAAATAAGACTCAGAAAACTGTCCACGACTATTTTAAAAAGAAACTCGTGTTGGATTCTTTGGAGGACCAGATGAGTAAGAGAGTCAAAGCTGCAATACAAAAGATGGGAACCGAGGCTTCAGGAGAAGAATTAAATGCACCTGAAAAACCGAAACCCAAAAGAGCAAGAAAAAGATCTACAAATCAAGATGCTACAAAGCCAGGACCTTCCAAGGAAAAACGTAAAAGGCGCAACGATGAAACCAAACAGATAAGCGAAGTTATACAAGCGGCGCCAGCAAACGATCCAGATAAGACATTTGATATAGCAGTTCCAAAGACGGATAGGTATCAGGAAATTATACCGCAGAGAGAGAGGGACAGGAAATGCATTTTAGAAAATAAGTTAAAGGCTATAGAGGTGTTCCGCAAGAGTAAGATTGATCCTAAAAAGAAAACGACGAAGAGGCGGCTTCCTGTACCCAAAGAAAAAGCTGATCTCTCAGAGAGTAGTGACACAGATTAA

Protein sequence:

>DPOGS201295-PA
MGVTGLWRLIEPAGKPVPVETLENKVLAVDISIWLHQMVKGYQDAKGAPLPNAHLIGLFQRLCKLLYFRIKPVFVFDGGFPDLKRETIAKRQDNKTKYNSASEKLKREITLLLGKKTAIGSLLGKQISPTKNKQPQANDDIFKLPELPEKGAYSESESEDEQDSSASTVDLHSVDFDSDKFKNMPIKEKYDLLIELKETRKMNSWGKINTLPKKSDNFSDFQMQRLLKRRQLQECLEETEKEMGDEGMSLNELESLLNEEGIDTKIESLPSRRIASNNTTRYLLISNVRQALENAKRKEEAAQQAQMQISATTEVEKHDTKDIQKNDECDDDLEKAIKMSLECVEEADTSACTSKTDESWTSFLTESDYSDDEDEDGFAPPDMTSAKSYIKQYTDLNYKVIDNIVAAKQKEKNKPSKPRVNEIIEELAQEKTIIEDEIELMSSDDDKDKCEVVEKENDKDKSCAVEAVMENEVISKKCELEESNHEEASIICVEKSIADVISLDSSLGEAETDNDEVKVEPAKDNIKLESSSSSEDDFEDVSDEETESKKPVVTLTLNMGNTIEDDIFADIFESKAGEKCLAPNIKEEQEIKEDVSKHENEVKDIIKSNVTAEVETTSKSIETTSKNIETCLNEESEKQRNEDQKDVVDRDQTVSEELATTQKAVIPERQKISAEELNTMVTEIENEEQLLLQEKGKLDRIGRNITEQMTKEAQELLQIFGIPYIVAPMEAEAQCAVLEALKLTDGTITDDSDIWLFGGRTVYKNFFNQKKHVLQFLRERIEKSFNLSREKLVLLALLVGSDYTVGVTGVGPVTALEILASFPFNKKKTIAEDAKFTDYQEIVAGLQDFKKWVKAGRRTDNVSLKKKLKNVSLSDDFPSVRVVQAYFEPNVEKSSEKFSWGDPDITELREYARAKFGWSQHKLDEIIKPVIKRMQENKTQKTVHDYFKKKLVLDSLEDQMSKRVKAAIQKMGTEASGEELNAPEKPKPKRARKRSTNQDATKPGPSKEKRKRRNDETKQISEVIQAAPANDPDKTFDIAVPKTDRYQEIIPQRERDRKCILENKLKAIEVFRKSKIDPKKKTTKRRLPVPKEKADLSESSDTD-
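
Length check:

The reported lengths (3312 bp transcript, 1103 aa protein) agree with each other if the transcript record is a complete coding sequence with no untranslated regions: 3312 / 3 = 1104 codons, one of which is the stop codon (shown as the trailing "-" in the protein record). That the transcript has no UTRs is an assumption. The sketch below illustrates the arithmetic and a simple way to read FASTA records whose sequence is wrapped across several lines; the function names `read_fasta` and `expected_protein_length` are hypothetical helpers, not part of any database tooling.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: number of codons minus the stop codon.

    Assumes the sequence is coding from the first base to the last (no UTRs).
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Lengths as reported in the record above.
print(expected_protein_length(3312))  # 1103
```

To check the actual records, the two FASTA entries can be passed to `read_fasta` and the transcript length fed to `expected_protein_length`; the trailing "-" should be stripped from the protein sequence before its length is compared.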