DPGLEAN06594 in OGS1.0

New model in OGS2.0  DPOGS210824
Genomic Position  scaffold908:- 42004-47887
CDS Length  1605
Paired RNAseq reads  1376
Single RNAseq reads  3463
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007140 (0.0)
Best Drosophila hit  cryptochrome (4e-170)
Best Human hit  cryptochrome-1 (2e-103)
Best NR hit (blastp)  cryptochrome [Danaus plexippus] (0.0)
Best NR hit (blastx)  cryptochrome [Danaus plexippus] (0.0)
GeneOntology terms
GO:0000060 protein import into nucleus, translocation
GO:0005634 nucleus
GO:0005641 nuclear envelope lumen
GO:0005737 cytoplasm
GO:0009882 blue light photoreceptor activity
GO:0016564 transcription repressor activity
GO:0042752 regulation of circadian rhythm
GO:0045892 negative regulation of transcription, DNA-dependent
GO:0050660 FAD binding
InterPro families
IPR005101 DNA photolyase, FAD-binding/Cryptochrome, C-terminal
IPR006050 DNA photolyase, N-terminal
IPR002081 Cryptochrome/DNA photolyase, class 1
IPR014729 Rossmann-like alpha/beta/alpha sandwich fold
Orthology group  MCL19482

Nucleotide sequence:

ATGCTTGGTGGTAATGTCATTTGGTTCCGTCACGGTCTCCGTCTCCACGACAACCCTTCG
CTTCACAGCGCTCTGGAAGATGCAAGCTCACCGTTCTTCCCTATATTCATATTTGATGGA
GAGACAGCTGGTACAAAGATGGTGGGCTACAATCGTATGCGATACCTGCTGGAGGCGCTG
AACGATTTGGACCAGCAGTTCAGGAAGTACGGCGGGAAGCTGCTCATGATTAAGGGGAGA
CCTGATTTAATATTCAGGAGGCTGTGGGAGGAATTTGGTATACGTACGCTATGCTTCGAG
CAGGACTGTGAGCCAATATGGCGTCCGCGCGACGCGAGCGTGCGTGCTCTGTGCCGCGAC
ATAGGCGTGTCGTGCCGCGAGCACGTCGCACACACGCTGTGGAACCCGGACACAGTCATC
AAGGCCAATGGAGGAATACCGCCGCTTACATACCAGATGTTCCTGCATACAGTTGAAATC
ATCGGTAATCCTCCGCGTCCCGTAGACGACGTCGACCTGAACGGCGTCAACTTCGGATCG
CTGCCTGAGAGCTTTTACAGGGAATTCGTTGTCTTTGATAAGGCCCCAAAACCAGAAGAT
CTGGGTGTGTTTCTGGAAAACGAAGATATTCGTATGATTCGCTGGGTGGGAGGAGAGACG
GCGGCCTTGAAGCAGATGCAGGAGAGATTGGCTGTGGAGTACGAGACATTCTGCAGGGGT
TCTTATTTGCCGACCCATGGCAACCCCGACCTCCTTGGACCGCCGATATCTCTGAGTCCA
GCCTTGCGCTTCGGCTGTCTGTCTGTCCGTCGCTTCTACTGGAGTCTCCAGGACCTGTTC
CAGCAGGTGCATCAGGGACGCCTGGCTTCCACTCAGTTTATCACTGGTCAGTTAATATGG
CGGGAGTATTTCTACACCATGAGCGTCAATAACCCCAACTACGCCCAAATGTCGGGGAAT
CCTATCTGCCTGGACATACCGTGGAAGGAACCGGAAAATGACGAGTTACAGAGATGGAAG
GAGGGTCGTACGGGGTTCCCATTCGTGGACGCGGCCATGCGCCAGCTGCGTACGGAGGGC
TGGTTGCATCACGTTGTACGGAACACCGTGGCCTCGTTCCTCACCCGCGGGACCCTGTGG
CTGTCCTGGGAACACGGGCTGCAGCACTTCCTCAAGTATCTGCTGGATGCTGATTGGTCG
GTGTGCGCGGGTAACTGGATGTGGGTGTCGTCCAGTGCGTTCGAGGCCTTATTGGACTCC
GGCGAGTGCGCGTGTCCCGTCAGACTGGGCCGAAGACTGGAGCCCACTGGCCATTATGTA
CGGAGATACGTACCAGAACTGGCTCGGATGCCCGGAGAGTACATTTACGAGCCGTGGCGT
GCCCCGCTCGAGGTGCAGGAGGCTGCGGGCTGTGTCATAGGTCGAGACTACCCCGCGCCG
GTCGTCGACCACACAGCTGCGGCCGCCAGGAACAGGGCCAACATGCAGGAGCTGCGCCGC
CTGTTGGAGAAAGCTCCTCCTCACTGCTGTCCGTCATCTGAAGACGAGGTGCGCCAGTTC
ATGTGGCTTGGAGACGACTCGCAGCCTGAGCTCACCACCACATGA

Protein sequence:

MLGGNVIWFRHGLRLHDNPSLHSALEDASSPFFPIFIFDGETAGTKMVGYNRMRYLLEAL
NDLDQQFRKYGGKLLMIKGRPDLIFRRLWEEFGIRTLCFEQDCEPIWRPRDASVRALCRD
IGVSCREHVAHTLWNPDTVIKANGGIPPLTYQMFLHTVEIIGNPPRPVDDVDLNGVNFGS
LPESFYREFVVFDKAPKPEDLGVFLENEDIRMIRWVGGETAALKQMQERLAVEYETFCRG
SYLPTHGNPDLLGPPISLSPALRFGCLSVRRFYWSLQDLFQQVHQGRLASTQFITGQLIW
REYFYTMSVNNPNYAQMSGNPICLDIPWKEPENDELQRWKEGRTGFPFVDAAMRQLRTEG
WLHHVVRNTVASFLTRGTLWLSWEHGLQHFLKYLLDADWSVCAGNWMWVSSSAFEALLDS
GECACPVRLGRRLEPTGHYVRRYVPELARMPGEYIYEPWRAPLEVQEAAGCVIGRDYPAP
VVDHTAAAARNRANMQELRRLLEKAPPHCCPSSEDEVRQFMWLGDDSQPELTTT
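
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names translate, FIRST_LINE and LAST_LINE are illustrative and not part of the record; the two strings are copied from the first and last lines of the nucleotide sequence.

```python
# Standard genetic code (assumed: NCBI translation table 1), in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds):
    """Translate an in-frame CDS up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Unknown codons (e.g. containing N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
# Each full line holds 60 nt, so every line starts in frame.
FIRST_LINE = "ATGCTTGGTGGTAATGTCATTTGGTTCCGTCACGGTCTCCGTCTCCACGACAACCCTTCG"
LAST_LINE = "ATGTGGCTTGGAGACGACTCGCAGCCTGAGCTCACCACCACATGA"

print(translate(FIRST_LINE))  # start of the protein
print(translate(LAST_LINE))   # end of the protein, stop codon dropped

# A 1605 nt CDS holds 535 codons: 534 residues plus one stop codon.
CDS_LENGTH = 1605
print(CDS_LENGTH // 3 - 1)
```

Passing the whole nucleotide sequence to translate (line breaks are ignored) should reproduce the protein sequence listed above, if the stated assumption about the genetic code holds.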