Monarch geneset OGS2.0

DPOGS215742
Transcript: DPOGS215742-TA, 1464 bp
Protein: DPOGS215742-PA, 487 aa
Genomic position: DPSCF300041 + 733148-735761
RNAseq coverage: 216x (Rank: top 45%)
Annotation
Heliconius: HMEL004078  0.0  83.86%
Bombyx: BGIBMGA003618-TA  0.0  83.44%
Drosophila: phr-PA  9e-178  60.67%
EBI UniRef50: UniRef50_Q24443  6e-177  60.88%  Photolyase n=30 Tax=root RepID=Q24443_DROME
NCBI RefSeq: XP_001653955.1  0.0  66.10%  DNA photolyase [Aedes aegypti]
NCBI nr blastp: gi|157123882  0.0  66.10%  DNA photolyase [Aedes aegypti]
NCBI nr blastx: gi|157123882  0.0  66.10%  DNA photolyase [Aedes aegypti]
Group
Gene Ontology: GO:0003904  2.7e-263  deoxyribodipyrimidine photo-lyase activity
    GO:0006281  2.7e-263  DNA repair
    GO:0003913  1.6e-62  DNA photolyase activity
KEGG pathway:
InterPro domain: [8-480]  IPR008148  2.7e-263  DNA photolyase, class 2
    [231-477]  IPR005101  1.6e-62  DNA photolyase, FAD-binding/Cryptochrome, C-terminal
    [39-235]  IPR006050  6.2e-45  DNA photolyase, N-terminal
    [41-158]  IPR014729  7.3e-17  Rossmann-like alpha/beta/alpha sandwich fold
Orthology group: MCL17762  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215742-TA
ATGAAACAAATACATAAAAAGCGAGAAGAAACTGCTAAGTCCATTTTAGATTTTAACTTCAATAAGTCTCGTTTAAGAATAATATCACAAGAGCAGATGGTATCTGATGATTGCGAAGGAATTGTATATTGGATGTCGAGAGACAGCAGAGTTCAAGACAATTGGGCTTTTCTATACGCACAGGAACTGGCGTTAAAAAATAAAGTACCGCTCCATGTATGTTTCTGTTTAATAGCAAAATATTTGGATGCATCTGTTAGACAATTTCACTTTCTTATCAAAGGTCTCGAAAAAGTTGCTGCTGATTGTGACAAGCTTAACATTTCATTTCACTTACTGGAAGGCAATGGTGCAGAAGTTTTACCTCAATGGGTTATCGATCACAGGATAGGGGCTGTGGTTTGTGATTTCAATCCTCTAAGAGTGCCATTAGGCTGGGTCGAGGGGGCAAAGAAAAAATTAAAAAAGGATGTGCCATTAATTCAGGTTGATGCCCATAATGTTGTGCCGTGTTGGGTGGCATCTAACAAACAGGAGTATTCCGCTAGAACCATAAGAAATAAGATCAACTCAAAACTTGATGAATACCTGACCGAGTTTCCTCCGGTTATTAAACATCCACATTCAAGCAGTTTTAAACCAGAGCCAATAGATTGGGATAAGGCGATAGAGACGAGAGAAGCAGACAAATCTGTCGGTCCAATAGGATGGGCGGGTCCTGGCTATGACAATGCTGTCAAAACATTGAAGAGTTTTCTTGACAAACGTCTCAAAGTCTTTGCAACCAAAAGGAATGATCCCACTCAGGATGCACTTAGCAATTTATCACCATGGTTTCATTTTGGTCAAATATCAGCACAACGGGTAGCCTTGTGTGTGAAGGAGTACAAAACCAAGTATACAGAGAGCGTCAATTCTTATTTAGAAGAGGCTATAGTGCGAAGAGAATTGGCTGACAATTTTTGTTTTTACTGTGAACATTATGATAGCATCAAAGGTGCGAGCCAGTGGGCACAGAAGACTTTAGACGACCATAGAAATGACAAAAGAACACATATATATACACTTGAACAGTTCTGCAAAGCAGAAACCCATGACGACCTGTGGAACTCGGCTCAAATACAAATGGTTAAAGAGGGGAAGATGCATGGGTTTCTAAGAATGTACTGGTGTAAGAAGATCCTAGAGTGGACCTCGAGTCCGGAAGAGGCATTGAAATATGCCATATATTTGAACGATCATTACAGTGTTGACGGCAGGGACCCTAGCGGATATGTTGGTTGTATGTGGTCTATCTGCGGCGTCCACGACCAGGGCTGGGCGGAGCGTGCTGTGTTTGGCAAAATCCGTTTCATGAACTATGACGGCTGCAAACGAAAGTTTAACGTACCAGCCTTCGTATGCAGATACGGAGGGAAAGTCCACAAATATAACAACTTGACCGACAAACAGAAGAAAAAGTAG

Protein sequence:

>DPOGS215742-PA
MKQIHKKREETAKSILDFNFNKSRLRIISQEQMVSDDCEGIVYWMSRDSRVQDNWAFLYAQELALKNKVPLHVCFCLIAKYLDASVRQFHFLIKGLEKVAADCDKLNISFHLLEGNGAEVLPQWVIDHRIGAVVCDFNPLRVPLGWVEGAKKKLKKDVPLIQVDAHNVVPCWVASNKQEYSARTIRNKINSKLDEYLTEFPPVIKHPHSSSFKPEPIDWDKAIETREADKSVGPIGWAGPGYDNAVKTLKSFLDKRLKVFATKRNDPTQDALSNLSPWFHFGQISAQRVALCVKEYKTKYTESVNSYLEEAIVRRELADNFCFYCEHYDSIKGASQWAQKTLDDHRNDKRTHIYTLEQFCKAETHDDLWNSAQIQMVKEGKMHGFLRMYWCKKILEWTSSPEEALKYAIYLNDHYSVDGRDPSGYVGCMWSICGVHDQGWAERAVFGKIRFMNYDGCKRKFNVPAFVCRYGGKVHKYNNLTDKQKKK-