Monarch geneset OGS2.0

DPOGS210113
TranscriptDPOGS210113-TA1113 bp
ProteinDPOGS210113-PA370 aa
Genomic positionDPSCF300017 + 1291632-1292744
RNAseq coverage134x (Rank: top 56%)
Annotation
HeliconiusHMEL0050630.086.18% 
BombyxBGIBMGA000221-TA9e-15278.06% 
Drosophilaphr6-4-PB1e-13358.11% 
EBI UniRef50UniRef50_Q8SXK51e-13158.11%RE11660p n=81 Tax=cellular organisms RepID=Q8SXK5_DROME
NCBI RefSeqXP_001658195.15e-14162.47%DNA photolyase [Aedes aegypti]
NCBI nr blastpgi|1337543440.099.46%(6-4) photolyase [Danaus plexippus]
NCBI nr blastxgi|1337543440.099.46%(6-4) photolyase [Danaus plexippus]
Group
Gene OntologyGO:00062811.2e-101DNA repair
GO:00039131.2e-101DNA photolyase activity
KEGG pathwayaag:AaeL_AAEL0011751e-140 
 K02295 (CRY)maps-> Circadian rhythm - mammal
InterPro domain[81-356] IPR0051011.2e-101DNA photolyase, FAD-binding/Cryptochrome, C-terminal
Orthology groupMCL20655 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210113-TA
ATGCAACATACTGTATACGATTTTAATAGTGTGGTAAAGAAAAACAATGGCAGCATTCCACTCACATATCAAAAGTTTCTTTCGTTAGTCTCTGATGTGCAAGTTAAAGACATCATACAAATATCCAAGGGTGTGTCTGATGAGTGCAAAGCAAGTGATTATGATTCCCAAGGATATGACATTCCTTCTTTAGAAGAATTTGGAGTAAATGAATCTGAGCTTTCAGAATGTAAATATCCTGGTGGAGAATCTGAGGGTTTAAAAAGATTAGATGTGTACATGGCAAAAAAACAGTGGGTCTGTAATTTTGAGAAACCTAAATCCTCACCAAATAGTATTGAACCAAGTACTACAGTATTGAGCCCATATATTAGCCATGGCTGCTTGTCAGCTAAGTTGTTTTATCATAAGCTAAAGCAAGTTGAGAATGGCAGCAAACACACTTTGCCACCTGTTTCACTGATGGGACAGCTCATGTGGAGAGAATTTTATTACACAGCTGGTTCAGGTACTGAAAATTTTGATAAAATGGTAGGAAACTCCGTCTGTACACAAATACCTTGGAAGAAAAACGATGCTCACTTAAAAGCATGGGCAGAAGGTAAAACTGGCTACCCCTTTGTAGATGCAATCATGCGCCAGTTAAAACAAGAGGGTTGGATTCATCATTTGGCGAGACATATGGTTGCCTGCTTCTTGACCAGAGGTGACTTATGGATTTCTTGGGAAGAAGGAGCAAAAGTGTTTGAAGATTTCCTTCTGGACTATGATTGGTCTTTGAATGCTGGAAACTGGATGTGGCTATCTGCATCTGCTTTTTTCTACAAATACTACAGAGTATATAGCCCAGTAGCTTTTGGTAAAAAAACAGATAAAGATGGGCTCTATATAAGAAAATATGTTCCCGAGTTGAAAAAATATCCTAGTGAGTTTATCTATGAACCATGGAAGGCTCCAAAGGGTGTTCAGAAAACGGCTGGTTGTGTAATTGGTGAGGGATATCCCAATAGAATTGTTGATCATGATAAGGTCCATAAAGATAATATTCAGAAAATGAATTCTGCTTATAAAGTAAATAAAGAGAAAAAGGCAATGAAGAGACCGAGACAGTAA

Protein sequence:

>DPOGS210113-PA
MQHTVYDFNSVVKKNNGSIPLTYQKFLSLVSDVQVKDIIQISKGVSDECKASDYDSQGYDIPSLEEFGVNESELSECKYPGGESEGLKRLDVYMAKKQWVCNFEKPKSSPNSIEPSTTVLSPYISHGCLSAKLFYHKLKQVENGSKHTLPPVSLMGQLMWREFYYTAGSGTENFDKMVGNSVCTQIPWKKNDAHLKAWAEGKTGYPFVDAIMRQLKQEGWIHHLARHMVACFLTRGDLWISWEEGAKVFEDFLLDYDWSLNAGNWMWLSASAFFYKYYRVYSPVAFGKKTDKDGLYIRKYVPELKKYPSEFIYEPWKAPKGVQKTAGCVIGEGYPNRIVDHDKVHKDNIQKMNSAYKVNKEKKAMKRPRQ-