Monarch geneset OGS2.0

DPOGS210824
Transcript: DPOGS210824-TA, 1605 bp
Protein: DPOGS210824-PA, 534 aa
Genomic position: DPSCF300027 - 450860-456743
RNAseq coverage: 578x (Rank: top 22%)
Annotation
Heliconius: HMEL012765, E-value 0.0, identity 88.20%
Bombyx: BGIBMGA007140-TA, E-value 0.0, identity 81.12%
Drosophila: cry-PA, E-value 0.0, identity 57.57%
EBI UniRef50: UniRef50_Q29SR5, E-value 0.0, identity 79.74%, Antennal cryptochrome n=5 Tax=Noctuidae RepID=Q29SR5_MAMBR
NCBI RefSeq: NP_001182628.1, E-value 0.0, identity 83.37%, cryptochrome 1 [Bombyx mori]
NCBI nr blastp: gi|62001759, E-value 0.0, identity 100.00%, cryptochrome [Danaus plexippus]
NCBI nr blastx: gi|62001759, E-value 0.0, identity 100.00%, cryptochrome [Danaus plexippus]
Group
Gene Ontology: GO:0006281, E-value 6.4e-92, DNA repair
 GO:0003913, E-value 6.4e-92, DNA photolyase activity
KEGG pathway: aag:AaeL_AAEL004146, E-value 0.0
 K02295 (CRY) maps-> Circadian rhythm - mammal
InterPro domain: [208-495] IPR005101, E-value 6.4e-92, DNA photolyase, FAD-binding/Cryptochrome, C-terminal
 [3-219] IPR006050, E-value 3e-46, DNA photolyase, N-terminal
 [5-128] IPR014729, E-value 2.6e-33, Rossmann-like alpha/beta/alpha sandwich fold
 [344-360] IPR002081, E-value 4.9e-06, Cryptochrome/DNA photolyase, class 1
Orthology group: MCL18807, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210824-TA
ATGCTTGGTGGTAATGTCATTTGGTTCCGTCACGGTCTCCGTCTCCACGACAACCCTTCGCTTCACAGCGCTCTGGAAGATGCAAGCTCACCGTTCTTCCCTATATTCATATTTGATGGAGAGACAGCTGGTACAAAGATGGTGGGCTACAATCGTATGCGATACCTGCTGGAGGCGCTGAACGATTTGGACCAGCAGTTCAGGAAGTACGGCGGGAAGCTGCTCATGATTAAGGGGAGACCTGATTTAATATTCAGGAGGCTGTGGGAGGAATTTGGTATACGTACGCTATGCTTCGAGCAGGACTGTGAGCCAATATGGCGTCCGCGCGACGCGAGCGTGCGTGCTCTGTGCCGCGACATAGGCGTGTCGTGCCGCGAGCACGTCGCACACACGCTGTGGAACCCGGACACAGTCATCAAGGCCAATGGAGGAATACCGCCGCTTACATACCAGATGTTCCTGCATACAGTTGAAATCATCGGTAATCCTCCGCGTCCCGTAGACGACGTCGACCTGAACGGCGTCAACTTCGGATCGCTGCCTGAGAGCTTTTACAGGGAATTCGTTGTCTTTGATAAGGCCCCAAAACCAGAAGATCTGGGTGTGTTTCTGGAAAACGAAGATATTCGTATGATTCGCTGGGTGGGAGGAGAGACGGCGGCCTTGAAGCAGATGCAGGAGAGATTGGCTGTGGAGTACGAGACATTCTGCAGGGGTTCTTATTTGCCGACCCATGGCAACCCCGACCTCCTTGGACCGCCGATATCTCTGAGTCCAGCCTTGCGCTTCGGCTGTCTGTCTGTCCGTCGCTTCTACTGGAGTCTCCAGGACCTGTTCCAGCAGGTGCATCAGGGACGCCTGGCTTCCACTCAGTTTATCACTGGTCAGTTAATATGGCGGGAGTATTTCTACACCATGAGCGTCAATAACCCCAACTACGCCCAAATGTCGGGGAATCCTATCTGCCTGGACATACCGTGGAAGGAACCGGAAAATGACGAGTTACAGAGATGGAAGGAGGGTCGTACGGGGTTCCCATTCGTGGACGCGGCCATGCGCCAGCTGCGTACGGAGGGCTGGTTGCATCACGTTGTACGGAACACCGTGGCCTCGTTCCTCACCCGCGGGACCCTGTGGCTGTCCTGGGAACACGGGCTGCAGCACTTCCTCAAGTATCTGCTGGATGCTGATTGGTCGGTGTGCGCGGGTAACTGGATGTGGGTGTCGTCCAGTGCGTTCGAGGCCTTATTGGACTCCGGCGAGTGCGCGTGTCCCGTCAGACTGGGCCGAAGACTGGAGCCCACTGGCCATTATGTACGGAGATACGTACCAGAACTGGCTCGGATGCCCGGAGAGTACATTTACGAGCCGTGGCGTGCCCCGCTCGAGGTGCAGGAGGCTGCGGGCTGTGTCATAGGTCGAGACTACCCCGCGCCGGTCGTCGACCACACAGCTGCGGCCGCCAGGAACAGGGCCAACATGCAGGAGCTGCGCCGCCTGTTGGAGAAAGCTCCTCCTCACTGCTGTCCGTCATCTGAAGACGAGGTGCGCCAGTTCATGTGGCTTGGAGACGACTCGCAGCCTGAGCTCACCACCACATGA

Protein sequence:

>DPOGS210824-PA
MLGGNVIWFRHGLRLHDNPSLHSALEDASSPFFPIFIFDGETAGTKMVGYNRMRYLLEALNDLDQQFRKYGGKLLMIKGRPDLIFRRLWEEFGIRTLCFEQDCEPIWRPRDASVRALCRDIGVSCREHVAHTLWNPDTVIKANGGIPPLTYQMFLHTVEIIGNPPRPVDDVDLNGVNFGSLPESFYREFVVFDKAPKPEDLGVFLENEDIRMIRWVGGETAALKQMQERLAVEYETFCRGSYLPTHGNPDLLGPPISLSPALRFGCLSVRRFYWSLQDLFQQVHQGRLASTQFITGQLIWREYFYTMSVNNPNYAQMSGNPICLDIPWKEPENDELQRWKEGRTGFPFVDAAMRQLRTEGWLHHVVRNTVASFLTRGTLWLSWEHGLQHFLKYLLDADWSVCAGNWMWVSSSAFEALLDSGECACPVRLGRRLEPTGHYVRRYVPELARMPGEYIYEPWRAPLEVQEAAGCVIGRDYPAPVVDHTAAAARNRANMQELRRLLEKAPPHCCPSSEDEVRQFMWLGDDSQPELTTT-
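
Length check (illustrative, not part of the original record): the reported transcript and protein lengths can be cross-checked with a short Python sketch. It assumes the transcript is coding sequence only, which fits the nucleotide sequence starting with ATG and ending with the stop codon TGA, and that the trailing "-" in the protein marks that stop. The function name cds_matches_protein is hypothetical.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if a coding sequence of transcript_bp bases encodes
    protein_aa residues followed by exactly one stop codon.

    Assumption: the transcript has no UTRs, so every base belongs to a codon.
    """
    if transcript_bp % 3 != 0:
        # A pure coding sequence must be a whole number of codons.
        return False
    codons = transcript_bp // 3
    # One codon per residue, plus the terminal stop codon.
    return codons == protein_aa + 1


# Lengths as reported for DPOGS210824-TA (1605 bp) and DPOGS210824-PA (534 aa).
print(cds_matches_protein(1605, 534))
```

Here 1605 / 3 = 535 codons, i.e. 534 residues plus the stop codon, so the two reported lengths agree under the stated assumption.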