Monarch geneset OGS2.0

DPOGS204318
Transcript         DPOGS204318-TA   2169 bp
Protein            DPOGS204318-PA   722 aa
Genomic position   DPSCF300046 + 817223-822347
RNAseq coverage    107x (Rank: top 60%)
Annotation
Heliconius       HMEL015134         0.0      60.84%
Bombyx           BGIBMGA007587-TA   0.0      58.65%
Drosophila       Gen-PA             5e-116   45.83%
EBI UniRef50     UniRef50_D6WYS6    3e-122   49.15%   Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WYS6_TRICA
NCBI RefSeq      XP_002008145.1     9e-123   37.65%   GI11983 [Drosophila mojavensis]
NCBI nr blastp   gi|270011949       1e-121   49.15%   hypothetical protein TcasGA2_TC006044 [Tribolium castaneum]
NCBI nr blastx   gi|195127377       2e-125   37.47%   GI11983 [Drosophila mojavensis]
Group
Gene Ontology     GO:0006281   4.7e-117   DNA repair
                  GO:0004518   4.7e-117   nuclease activity
                  GO:0003677   1.1e-09    DNA binding
                  GO:0003824   1.1e-09    catalytic activity
KEGG pathway      tad:TRIADDRAFT_23638   1e-21
                  K10846 (ERCC5, XPG, RAD2)   maps-> Nucleotide excision repair
InterPro domain   [1-493]     IPR006084   4.7e-117   DNA repair protein (XPGC)/yeast Rad
                  [127-219]   IPR006086   7.8e-23    XPG/RAD2 endonuclease
                  [1-95]      IPR006085   5.4e-19    XPG N-terminal
                  [204-386]   IPR020045   1.1e-09    5'-3' exonuclease, C-terminal subdomain
Orthology group   MCL14923   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204318-TA
ATGGGTATAAAGGGGCTATGGACGGTGTTAGCTCCATACTCTGAGAAGATATCATTACACGAAATCAGTGGTCAAACTGTTGCCATAGACTTAGCTGGATGGGTTTGTGATAGCCAAAATGTTACTGATTATTATATTCAGCCTAAGCTATATCTAAGAAATCTATTCTTCCGAACTCTTTACTTAGTGCTAAGTGATGTAAATCCTATATTTGTGCTTGAGGGTGATGCTCCAGAACTTAAGAGAGATGTTATGGCTGCTAGAAATGCATTGCAGTTTAAAGGTGCAGCCCCTAAAGCTACCACAGAGAAAACAAAACAAACTAACATAACAAGAACAAGGTTCAAAGGGGTATTAAAAGAGTGTGAAAATCTCTTAAGGACAATGGGAGTGAGATGTGTAAAAGGTAGAGGAGAAGCTGAAGCTGCTTGTGCTAGATTAAATGCTGAAGGTTTAGTAGATGCTGTTGTATCCCAAGATTCTGACTGCTTCGCATATGGAGCTAAGAAAGTGTATCGCAACTTCAGTGTATCAAGCGCTGGTGGTGGAGGAGCCACACATGGCTCGGTGGACGTGTATGATGCTGTCAAGATGTTTAATAATAAGGGGTTTGGACGTAACAAGATGGTTGCGTTAGCATTGCTCTGTGGTTCTGACTACGGAGTTGGTGTCTGCGGGTCGTCTAAAACAACCGTTGTTTCTTTTCTCCACACTGTCCCAGAAGATCAAGTTATATCGAGGTTATTATCGTGGGTGAGTGATCCACAGCACTACGAGGCGCAGTCCCGCTGGGTGTCCGTCCCGGGTCGCTGTGACCGCTGCGGGCACGCGGGCCGCACTCACCTCAAGAAGGGTTGCTCCACGTGCGCCACTCACCAAGGATGTAATGATACTGGACATAAATCAAAATTATGTGATGTAAAACGTGAACTATTGCTTCGCAATAAAGCCTTGTCGTCCGGCATACCGTTTCCGGAGCCGAAAGTTATGAAGGAATTCCTGAACAGTACACCAGAAGATATAGATCTAGACACTTTGAAGATTCCTAAACCCAGTTTAATACAATTCGTGAAAATTATGTCGTACAAATTGGATTGGCCGCAGCGGTACTGCGTTGAAAAGTTTCTGCCATTATTAACGAAATGGCACCTCCAGGATAATGTTGCGTCTAGGACGTTACGACCGCTTGAAATCAGAAAGAAAAGACATCCGAAAGGCGTACCGAGTTATGAAGTTGTATGGGGGGATATTGATGGGCATTATGAAGGACTAATACCTGACGAGCAGTTGGAAGAGGACGAAGATGTTTCAGCACCTTGGGTGACGATCGAGAGGCAGGATTTGATGTTTAAATATTATCCCGGTATAGTAGAGAAGTATGAAGAGTCTATAAAGAAGCCCCCCAAAGAAAAGAAAACAAGGGGCAGGAAAAAGAAGGAAGAAAATGAAAATTCTGAAGAAGTTCACAAGACAAAGAGGAAATATACTAGAAAACCAAAACCAATTACAGACTTTATGACTACTCTCAATAGGTCTTTGAAAAATCTAAGTCTGTCTAAGAAAGATACAGAACTGAATTCCAGTAAAGTGAGCGTTAATGTAAATAAACTGAAGAGAAAAATCAAAAATAACGGTAAAGTGAAGACAAAAAATACAATAGAAAGCTATTTGAAACCGTGCAAAAAGAAAAAGTCTTCGACTATGACCAGTTTGATAAGTGGAAGTCAAAACAAGTCTAAGTCCTTTAAACTGAATTTGGGAAATAAAGAAAATATGGAACCGAAAAAGCATTTGTTGAGCATGTTCAATGATAGTTCAAGTGATGTAGAGGCAAACGATTTATCGGATATTGTAGAAAAAATCGTATCGAGATCGGCTCCTTCAACTAAAGCAAAAGTGGACAGTAATTACGTGAAATTGATATTTGAGAATAAACTTGATAGGAAATCTATTCTGACGCAAAGATTTGATCAAAAAAATTGTTCGACTCCTATAAGCAG
TCCGATAAGGAAAAGTTGTCAGCAGAGAAGGACGTCCAAATCCTCAATATCGGAAGCTGTAGACACCAGCTATTTCTTTGATAAATTAACAGAGGAACGAGATGCTTTTGAATTGTCTCTGGAATTCTCAGCGAACTTGGACTATAGCCTACCCAAAGTGGAGCTATAG

Protein sequence:

>DPOGS204318-PA
MGIKGLWTVLAPYSEKISLHEISGQTVAIDLAGWVCDSQNVTDYYIQPKLYLRNLFFRTLYLVLSDVNPIFVLEGDAPELKRDVMAARNALQFKGAAPKATTEKTKQTNITRTRFKGVLKECENLLRTMGVRCVKGRGEAEAACARLNAEGLVDAVVSQDSDCFAYGAKKVYRNFSVSSAGGGGATHGSVDVYDAVKMFNNKGFGRNKMVALALLCGSDYGVGVCGSSKTTVVSFLHTVPEDQVISRLLSWVSDPQHYEAQSRWVSVPGRCDRCGHAGRTHLKKGCSTCATHQGCNDTGHKSKLCDVKRELLLRNKALSSGIPFPEPKVMKEFLNSTPEDIDLDTLKIPKPSLIQFVKIMSYKLDWPQRYCVEKFLPLLTKWHLQDNVASRTLRPLEIRKKRHPKGVPSYEVVWGDIDGHYEGLIPDEQLEEDEDVSAPWVTIERQDLMFKYYPGIVEKYEESIKKPPKEKKTRGRKKKEENENSEEVHKTKRKYTRKPKPITDFMTTLNRSLKNLSLSKKDTELNSSKVSVNVNKLKRKIKNNGKVKTKNTIESYLKPCKKKKSSTMTSLISGSQNKSKSFKLNLGNKENMEPKKHLLSMFNDSSSDVEANDLSDIVEKIVSRSAPSTKAKVDSNYVKLIFENKLDRKSILTQRFDQKNCSTPISSPIRKSCQQRRTSKSSISEAVDTSYFFDKLTEERDAFELSLEFSANLDYSLPKVEL-
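
The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database. It assumes the transcript is coding sequence only (it begins with ATG and ends with TAG), that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
# Values copied from the record: transcript length, protein length,
# and the genomic interval on scaffold DPSCF300046.
TRANSCRIPT_BP = 2169
PROTEIN_AA = 722
GENOMIC_START, GENOMIC_END = 817223, 822347


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def genomic_span(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


# Does 722 aa plus one stop codon account for the whole 2169 bp transcript?
cds_ok = expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP

# Genomic footprint of the gene, and the part not covered by the transcript.
span = genomic_span(GENOMIC_START, GENOMIC_END)
non_transcript_bp = span - TRANSCRIPT_BP

print(cds_ok, span, non_transcript_bp)  # True 5125 2956
```

Under these assumptions the transcript and protein lengths agree exactly, and the gene spans 5125 bp of the scaffold, of which 2956 bp fall outside the transcript. That remainder would be intronic sequence if the transcript really has no untranslated regions.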