Monarch geneset OGS2.0

DPOGS206046
TranscriptDPOGS206046-TA1815 bp
ProteinDPOGS206046-PA604 aa
Genomic positionDPSCF300028 - 1174241-1184461
RNAseq coverage221x (Rank: top 45%)
Annotation
HeliconiusHMEL0087847e-16055.83% 
BombyxBGIBMGA000498-TA8e-4350.43% 
DrosophilaClk-PD4e-12755.53% 
EBI UniRef50UniRef50_G0YQM30.078.75%Clock n=2 Tax=Endopterygota RepID=G0YQM3_SPOEX
NCBI RefSeqXP_001662706.17e-16048.82%circadian locomoter output cycles kaput protein (dclock) (dpas1) [Aedes aegypti]
NCBI nr blastpgi|381761440.098.65%clock [Danaus plexippus]
NCBI nr blastxgi|381761440.098.84%clock [Danaus plexippus]
Group
Gene OntologyGO:00055153.7e-16protein binding
GO:00056344.1e-13nucleus
GO:00063554.1e-13regulation of transcription, DNA-dependent
GO:00037004.1e-13sequence-specific DNA binding transcription factor activity
GO:00071652.4e-07signal transduction
GO:00048712.4e-07signal transducer activity
KEGG pathwayaag:AaeL_AAEL0125622e-159 
 K02223 (CLOCK, KAT13D)maps-> Circadian rhythm - fly
    Circadian rhythm - mammal
InterPro domain[253-339] IPR0136553.7e-16PAS fold-3
[26-41] IPR0010674.1e-13Nuclear translocator
[7-82] IPR0115981.8e-11Helix-loop-helix DNA-binding
[86-152] IPR0000142.4e-07PAS
[17-67] IPR0010923.1e-07Helix-loop-helix DNA-binding domain
[90-147] IPR0137678.6e-07PAS fold
[302-345] IPR0016108.9e-07PAC motif
Orthology groupMCL11439 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206046-TA
ATGGATGACGACGGGGATGATAAAGATGACACCAAAAGGAGAACTCGCAATCTTAGTGAGAAGAAGAGGAGAGACCAGTTCAACATGCTTGTCAATGAACTCGGTTCTATGGTATCAACAAATAATAGAAAAATGGATAAATCAACTGTTCTTAAGTCTACTATATCATTCTTAAAGAACCATAATGAGATAACAGTGCGATCACGAGCTCATGATGTCCAAGAAGATTGGAAGCCGGCTTTCTTATCTAATGAGGAATTCACCTACTTAGTCTTGGAGGCCTTGGAGGGTTTTGTTATGGTATTTTCAGCTAGCGGTTGCATTTACTATGTATCGGAAAGTGTGACATCTTTGCTAGGGCATACTCCGGGAGACATTATCAATAAAAGTATATTTGATTTGGCATTTGTTGATGATCGTCCAAATCTATACAACATTTTGCAAAATGGCGGTACCCTCGATCCGACCCAAGTTGTGACGACAGATAATCCTATAAGCTTCCGTTGCCGCCTGCAAAGGGGAACATTAGATTTCAGAGATGAAGTAACCTACGAATTGGTCCAATTCGATGGCCACTTCCGTAAAAATCTGGAGTCGAATGAGAACGGCCATCATTCGTATCAGGATGAACACGAATCGAGATTACTATTCGTGTGCACCGGCAGGCTGTATATGCCACAGCTAGTTCGCGACGTGTCTCTCGTTGATACTATTCGTAGCGAGTTCACATCGCGCCACAGCCTGGAGTGGAAGTTTTTGTTCTTGGACCACCGCGCCCCTCCCATCATAGGATACTTGCCATTCGAAGTCCTAGGCACATCAGGATACGATTACTACCATTTTGATGATCTAGAGAAGGTCGTGTCCTGTCACGAAGCCTTGATGCAAAAGGGCGAGCTGACTTCGTGCTACTATAGGTTTCTGACCAAGGGTCAACAGTGGATCTGGCTCCAGACACGTTTTTATATAACATACCATCAGTGGAACTCTAAACCCGAATTCGTTGTTTGCACTCATCGAGTCGTTAGTTATGCTGATATAATAAAAAGCACAAAACAGGAGCGTACAGAGACAGAGGAGTCTGTTCGTGACTGCGATCATAACGGATCGTCTTTGAAGGATCCTTCCACTGAGGACGCTATGGTGCCCGTATCACCCTCATATATGTCAGAGGCAAGCGACGCCTTCGCCACCTCATATAATTCTATGTCCAAGCTGGCATCGGTGAAGTCTGCGGCCACATCAGGCAGTACAAGTGCGACGGTGGCCACACTTGGAACTGCCATCACCACAGCGAGTGCCACATGGCCACCACGATCGTCCTATCTGCTGTACACCACTGGTTCTGACACCACTTCCGTATCCGGTGGATCTCGATCCTCGCAGAGGAATAGCTCTCAGGAGTTGCAGAGGTTACCTGAACCGGCCCTGGTGCCGCAACACGGTATTGGTGCTCAGTATCTGGAGCCCGCCCCCTACGTGGGCGCTGTCGGCGTCCCCGCTGTACTGCCGCTATCTCTACCACCCATACCGGTTATAGTCGCACAAGATCAGGCCCAGTTACAGGAGCAGTTGCAGCGAACCCATCGTGAGCTGCAGCAGATGATCGTGAGGCAGCAGGAGGAGCTGCGCCAGGTGAAGGAACAGCTGCTGTTCGCGAGGCTCGGTATACTGCAGCCGGTTATCAACGTCCAGGATCCGTTCACGAATCCAGAGCAAATGCCGAACCGGTCGTCCATCATGTACGATGGTAACAGACAGCTAAGCTATCCGCAGACGAGCCACCAGCAAAATCACAACATGCCGCCACAGTAA

Protein sequence:

>DPOGS206046-PA
MDDDGDDKDDTKRRTRNLSEKKRRDQFNMLVNELGSMVSTNNRKMDKSTVLKSTISFLKNHNEITVRSRAHDVQEDWKPAFLSNEEFTYLVLEALEGFVMVFSASGCIYYVSESVTSLLGHTPGDIINKSIFDLAFVDDRPNLYNILQNGGTLDPTQVVTTDNPISFRCRLQRGTLDFRDEVTYELVQFDGHFRKNLESNENGHHSYQDEHESRLLFVCTGRLYMPQLVRDVSLVDTIRSEFTSRHSLEWKFLFLDHRAPPIIGYLPFEVLGTSGYDYYHFDDLEKVVSCHEALMQKGELTSCYYRFLTKGQQWIWLQTRFYITYHQWNSKPEFVVCTHRVVSYADIIKSTKQERTETEESVRDCDHNGSSLKDPSTEDAMVPVSPSYMSEASDAFATSYNSMSKLASVKSAATSGSTSATVATLGTAITTASATWPPRSSYLLYTTGSDTTSVSGGSRSSQRNSSQELQRLPEPALVPQHGIGAQYLEPAPYVGAVGVPAVLPLSLPPIPVIVAQDQAQLQEQLQRTHRELQQMIVRQQEELRQVKEQLLFARLGILQPVINVQDPFTNPEQMPNRSSIMYDGNRQLSYPQTSHQQNHNMPPQ-