Monarch geneset OGS2.0

DPOGS203011
TranscriptDPOGS203011-TA2091 bp
ProteinDPOGS203011-PA696 aa
Genomic positionDPSCF300068 + 190348-198689
RNAseq coverage414x (Rank: top 29%)
Annotation
HeliconiusHMEL0110260.073.76% 
BombyxBGIBMGA003870-TA0.070.10% 
Drosophilacyc-PA7e-11750.83% 
EBI UniRef50UniRef50_Q8N0R50.064.75%Cycle like factor BmCyc b n=3 Tax=Bombycoidea RepID=Q8N0R5_BOMMO
NCBI RefSeqNP_001036982.10.064.75%Cycle like factor b [Bombyx mori]
NCBI nr blastpgi|381761460.099.54%cycle [Danaus plexippus]
NCBI nr blastxgi|381761460.099.54%cycle [Danaus plexippus]
Group
Gene OntologyGO:00056344.4e-42nucleus
GO:00063554.4e-42regulation of transcription, DNA-dependent
GO:00037004.4e-42sequence-specific DNA binding transcription factor activity
GO:00055152.6e-08protein binding
GO:00071651.8e-07signal transduction
GO:00048711.8e-07signal transducer activity
KEGG pathwaytca:6555164e-158 
 K02296 (ARNTL, BMAL1, CYC)maps-> Circadian rhythm - fly
    Circadian rhythm - mammal
InterPro domain[150-165] IPR0010674.4e-42Nuclear translocator
[136-190] IPR0115985.9e-20Helix-loop-helix DNA-binding
[136-187] IPR0010923e-15Helix-loop-helix DNA-binding domain
[438-515] IPR0136552.6e-08PAS fold-3
[210-278] IPR0000141.8e-07PAS
[228-273] IPR0137673.9e-06PAS fold
Orthology groupMCL13437 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203011-TA
ATGAATGTCGCTGTTCGACCTGATGTGGTTCGAGTATTTGGACAAATTTCTAGAATATTCTTCGGATTCGGGTCGAACGCGAGCTCGACGTCGTACGAGTTGACGGTGGGGGCGGGAGCGGGAGCGGGACCGGGAGCGTGCGACGACATGGCCGGAGCGGGAGCGGTTCTCAGCTTACAACCACACGCGCCCCACAACCCACCACCAGACCCACGGAAGAGGAAGAACCAGCAATATTCGGATGGTTACGGGCTGGTTTGCGAGGTACCGATGTCTGGACATGCCGCTGTCGTGCCGCAGATCGCGACTACCAACAGGAAACGGAAACCGTCATCCTACGGCACGGCCAGCGCTTATGACGATGACGGCTCCGAAGACACGCGCTCGAGAAGCACCGTACCAGATAAGAGGCAGAACCACAGCGAGATCGAGAAACGGCGCCGCGACAAAATGAATACTTACATATCAGAGTTGAGCTCCATGGTGCCCATGTGCGGCGCCATGGCTAGGAAGCTGGATAAGCTGACGGTCTTACGCATGGCGGTGCAGCATCTGCGCTCGGTGAGGGGAGCCCTGTCGGCCGGCCCGCTGACGGTCAGGCCGCGCCCAGCTTTCCTGTCAGAGACGGAACTGAACGCGTTGGTACTCCAAGCTGCCAGGGATTGCTTCCTGATGGTTGTCGGCTGCGATCGAGGCCGTCTCCTCTACGTTTCCGCTTCCGTCTCAAGAATGCTGAACTATGATCAGTCGGAGCTGATAGGTCAGAGCCTTTTCGATATGCTGCATCCAAAAGACGTCGGCAAGGTGAAGGAGCAGCTCTCATCATCCGACCTCAGTCCGCGAGAAAGACTAATCGATGCTAAAACCATGCTGCCGTTGAAGCCGGATGTCGTTGCCGATGCGTCTCGACTGTGTCCGGGAGCACGGCGCTCGTTCTTCTGCCGCATCAAGTGCCGTACTGAGCCTAGCCAGGCCAGCCAGAGCAGCCAGTCTAGCCAGGGCTGCCAGCCGTGCACAAACACCCAGCCCAGCGAGGTCAAGGAGGAGGAGCCCAACTCCAAGATGAAGAAAAAACAGGCGCATGAGAAGAAATACTGCGTTGTTCAATGCACAGGGTACCTGAAGTCCTGGGCCCCGGCTGAGATGTCTGACGGATGCTGCACGGACGCCGGGTCTGAAGAGGAATCCTGCAATCTGTCGTGTCTGGTGGCTGTGGGGAGGGCACTGGCTGATCTCGCCCAGCATCCCGACTCCGCCCCACAAACGAGATATCTTCAGTACATCTCCAGACACGCCCCTGATGGGAAATTCCTGTTCGTTGATCAGAGGGTAACTATAGCTCTCGGCTTTCTGCCTCAAGAGCTTCTGGGCACGAGCATGTACGAGTATATTTTGGCCCCGGAATTGGGGTCCGTGGCTAGGATCCACAAAGCGGCGTTACTGAGACGCGACTCCCTCAGGACCCCTCCCTACTGTTTCAGGAAAAAAGACGGCACATACGTCAGAATACAGACCCATTTCAAACCCTTCAAGAATCCGTGGACCAAGGATGTAGAAAGCCTTGTCGCCAACAACACCGTGGTGGTAGGGAACAGGGTCCAACCACCGCCGGAGGATTGCTCCCAATATGACATTTACAAAATAAATGACATGGACCGACCCCTGTCAGAAGCTGACGTTGAGATGCAGAGACTGATAGACTCCCAAGTTGAGTCCCACAAGATCGGCTCTACCATCGCTGAAGAAGCTCTGAGGAGGTCCTCCTCCGATTTCTCAGATCTGACTTCCGACCTGCTCCAAGACGCCGTCTTCAGTCAACAGTCGTCGCTCGTGGACAATATCCTCGGCGGGGAGGTGAACTACTCCAACCAGGTTCGCAACAACGTCCCCCTGAGCGCTGCCGGCGGCAGTTCGCCATCACCGGAAGCCGAGCTGCCACTCAGCCCCTTACCAGTGTCTCCCCCCCTCCCCCCTCTCGGATTAGACGGCAATGGTGAGGCTGCTATGGCCGTCATCATGAGCCTGCTAGAAGCCGACGCTGGTCTTGGAGGACCGGTCAATATCTCTGGACTCCCATGGCCCCTACCTTAG

Protein sequence:

>DPOGS203011-PA
MNVAVRPDVVRVFGQISRIFFGFGSNASSTSYELTVGAGAGAGPGACDDMAGAGAVLSLQPHAPHNPPPDPRKRKNQQYSDGYGLVCEVPMSGHAAVVPQIATTNRKRKPSSYGTASAYDDDGSEDTRSRSTVPDKRQNHSEIEKRRRDKMNTYISELSSMVPMCGAMARKLDKLTVLRMAVQHLRSVRGALSAGPLTVRPRPAFLSETELNALVLQAARDCFLMVVGCDRGRLLYVSASVSRMLNYDQSELIGQSLFDMLHPKDVGKVKEQLSSSDLSPRERLIDAKTMLPLKPDVVADASRLCPGARRSFFCRIKCRTEPSQASQSSQSSQGCQPCTNTQPSEVKEEEPNSKMKKKQAHEKKYCVVQCTGYLKSWAPAEMSDGCCTDAGSEEESCNLSCLVAVGRALADLAQHPDSAPQTRYLQYISRHAPDGKFLFVDQRVTIALGFLPQELLGTSMYEYILAPELGSVARIHKAALLRRDSLRTPPYCFRKKDGTYVRIQTHFKPFKNPWTKDVESLVANNTVVVGNRVQPPPEDCSQYDIYKINDMDRPLSEADVEMQRLIDSQVESHKIGSTIAEEALRRSSSDFSDLTSDLLQDAVFSQQSSLVDNILGGEVNYSNQVRNNVPLSAAGGSSPSPEAELPLSPLPVSPPLPPLGLDGNGEAAMAVIMSLLEADAGLGGPVNISGLPWPLP-