Monarch geneset OGS2.0

DPOGS203908
TranscriptDPOGS203908-TA3288 bp
ProteinDPOGS203908-PA1095 aa
Genomic positionDPSCF300005 - 951402-967368
RNAseq coverage392x (Rank: top 31%)
Annotation
HeliconiusHMEL0039790.060.48% 
BombyxBGIBMGA000486-TA0.059.19% 
Drosophilaper-PA2e-10546.38% 
EBI UniRef50UniRef50_Q7Z0C90.095.98%Period protein n=2 Tax=Obtectomera RepID=Q7Z0C9_DANPL
NCBI RefSeqNP_001036975.10.055.98%period [Bombyx mori]
NCBI nr blastpgi|324833530.095.98%period protein [Danaus plexippus]
NCBI nr blastxgi|324833530.095.98%period protein [Danaus plexippus]
Group
Gene OntologyGO:00055152.1e-07protein binding
GO:00071651.9e-06signal transduction
GO:00048711.9e-06signal transducer activity
GO:00063557.5e-06regulation of transcription, DNA-dependent
KEGG pathwaytca:6590152e-143 
 K02633 (PER)maps-> Circadian rhythm - fly
    Circadian rhythm - mammal
InterPro domain[910-1072] IPR0227285.1e-34Period circadian-like, C-terminal
[329-413] IPR0136552.1e-07PAS fold-3
[305-371] IPR0000141.9e-06PAS
[171-240] IPR0137677.5e-06PAS fold
Orthology groupMCL15652 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203908-TA
ATGGACAACTTGGATGACTCCGAGAACAATGCTAAGATATCCGACTCCGCATATTCCAACAGCTGCAGCAACAGTCAATCGAGGAGAAGCCACAGCTCCAAATCTACACATTCTGGAAGCAATTCAAGTGGAAGCAGTGGGTATGGCGGTAAGCCATCGACTTCCGGCTACAGCAATAACTTAAGTCAGCCACCGGAAAAATGGATTAAAGAAAAAGAATCAAAAAAGAAAAAGCCTGTACAGGTAGAATTGAAACCATCTGAAGAGAAAATAGAAGAATGTCCACAGGAGCTCGCACCTGTATGTGAGGCGCCTAAAGAAGAAACTAAAGATGTTAACCGCACCCCGACTCCATCTTTGGTGCAAACCGAAAAGGGTCCGGAGAACATGGAAATAACAACGTTAAAAAGAAACGATGAAAAAGATGAGTCTGTTTCGTCTAACGCGCCTATGGTGACTTCGCTCAATTTAGTCACGGGACGCTCAAATTCTCCTTCCTGTCCTGAAAGCTTCTCCTGTGTCATATCAATGCAAGATGGTGTGGTCATGTTCACAACGTCCTCCATAGTTACTGCTCTCGGCTTTCCAAAGGACATGTGGATTGGCAGATCATTTATAGATTTCGTTCATCCAAGGGATCGGAACACCTTTGCATCGCAGATCACTAGTGGTCTGGCTGTGCCTAAAAATGTTAATGGTACGCAAGAAAAAGCTCCTGTTCCAGGAAATCATGTTTCGACGATGGTATGTCGCATACGGCGTTACAGAGGTCTCAATCTGGGTTTCGGCATTAAAGAGAAAACCGTTTCATTTATGCCGTTCCTATTAAAGTTCTTCTTCAAAAACATTAACGATGAAGATGGCCAGGTTATATACTTGGTTATACAGGCGACACCATTTTTTTCCGCTTTTAAAACTTCGGCAGAGATAATACATAATGCAATACCATTTGTGATAAGGCACTCGGCTACGGGAAGTTTAGAGTACATAGACCATGAATCGGTGCCTTACCTTGGCTACTTGCCACAGGATATTGTTGAGAAAGACGCCTTGCAGTTGTACCATCCAGGCGATCTTGGATACCTGAGGCAGATTTACGAGACGATTGTAAAGGAAGGCGGAGTTCAACGGTCCAAACCTTACAGAATGATGGCTCAGAACGGGGATTATCTGAAATTAGAGACGGAATGGTCTTCATTCATAAATCCTTGGTCAAAGAAACTAGAATTTGTTATTGGTAAACACTACATCGTGGAGGGTCCATCGAATCCAGACGTTTTCCAAATGCCTGATCCTGAAAAATCTCTTAAATTTACCGATGATGAGAAGGCTAAAGCTGCTGCACTAAGGGAAAAAATTACTAGAGTTATGACCGAGGTCCTTACAAAACCAGCTGAAATAGCTAAACAGCAGATGAGTAAAAGATGCCAGGATCTCGCTTCCTTCATGGAGAGTCTGATGGAGGAGACACCCAAGATAGAAGAAGAACTACGTCTTGAAATACAGGATCAAGATCATAGTTATTATGAACGTGATTCTGTAATGCTCGGCGGTATCTCGCCACACCACGACCACAGCGACAGCAAGTCAGGCACCGACACACCAGTCAGCTACAACCAACTTAATTACAACGAAACTCTTCAAAGATATTTTGACAGTCACGAACCATACAGCTTCGAAGATTATTATTTAATGGATAGCGAAAATAAGATCCAAATGAAAGAAAATGAAGAAGGTTCTGTGAGCAAATGTATATCTCCGATGGCACAAGCTTCGACGGAGTGTGATCGGACCAGTTCCTCCGAATGCAGTGGTCTTGGTATAGGGAACTCTTGTCCTTGTGACTACCAGCCAATGCGATTGACAGAGTCATTGCTTAACAAGCACAACGCAGAAATGGAAAGAGAACTAATAAAAATGTATCGTGAAAACCGTTCAAGTAAAGGAGATAGAGAGAAAGCCTCCAACGAAACGAGACAAAAGAAGAAACAACATTTAGCAAGATGTAATGCGGCTTTTCAACCGACGTCGTTGGGACTGCCCGATTCTCAGCCCCATGGAGTGAAGCGTCCCTCAAAACAGGCGGAAGAAGCCAGCGCCCACAAACACAGATGCTCGTCGCCACGTCCGATCCGACATTCAGCGGTATCAAACAACCAGCCAGTAGCCATCAACTCAGTCGTGACTAATATGTGGCCTACCACTGCAGCTAACACCATGAACACGTGTCACCTCCAAGGATTGGGGATGCCACCACAGGTTTCATTCATGACACCAATGGCTATGCCGGGTCAATATCCGATGTGCTATATTCCAGTGCCCGTACAACCTATACAGCCGCAGTCAGATTCATATCAAAATACAAATTCAAATAACAATTATCCTTATCAACCTCAACCGATGCCATACATGATGTATGGCCATGCTATGTACGGATCTCCGTTTATGTATCCGTCTGTGGATCCGAGGACGTACGTGCCTCAAACTACGTCCGGTCATAACATACCACCTTTCGGACTGTCCAGTAGTAACTACCAAGAAGCTTGTAAACTAACTGTGCCATTGAAGACATCTAAAGCGTGTCGCATCACAAGGGAGAATCACAATCAAGCACTAAGAAGAGACGGCGTCAATTATTCAGCTGGCACATCGAATCGGAATACCGAAGTGAATAACGATAAGGACATTCGTAAGCCTCGTGCCACAAACAGTAATCGCACAGTTGAGAAAACTGACGAAGAATCAAGTTTCTCATCGTTTTATTCATCTTTCTTTAAAACTGAATCTGGTAGTGCTGAAGATAGCGATGCCAAAAAGAGTTGGCACAAAAATCATAAGGGCGATGATCTTATGTCATTGCAAAGTTCTACAGAGGCTGTCACGTATGCACCAAACAAGAGCCAGGCACAACAGAAAAAGGTTGATCCGTCGTGGGTTGAAGAAGTTTGCGTAACATCAGAATTAATTTACAAATATCAGATTCGGACGAAGAGTTTAGAGGAAGTACTTTCGGAAGATAAAAAGAAATTGGAAACTTTGGAACAGCCTCTACTCGTGAGTCAACAGCTGGGTCAATTGTACTTGGATCTGCAACTTCAAGGAGTTGCTGCCAGATTGACTTTAGAAGAAGGAATCACCAGTTCTAGCAGTTCGGGTGAAGAGAATTCGTCGATGTCATCTAAGAAGATACGTCGTAGGAAACGGGAATACAGCAAATTAGTGATGATATATGAAGAGGACGCTCCACTACCACCTCCAGATAACGTTGCCGGTACATCAGATTCTTAG

Protein sequence:

>DPOGS203908-PA
MDNLDDSENNAKISDSAYSNSCSNSQSRRSHSSKSTHSGSNSSGSSGYGGKPSTSGYSNNLSQPPEKWIKEKESKKKKPVQVELKPSEEKIEECPQELAPVCEAPKEETKDVNRTPTPSLVQTEKGPENMEITTLKRNDEKDESVSSNAPMVTSLNLVTGRSNSPSCPESFSCVISMQDGVVMFTTSSIVTALGFPKDMWIGRSFIDFVHPRDRNTFASQITSGLAVPKNVNGTQEKAPVPGNHVSTMVCRIRRYRGLNLGFGIKEKTVSFMPFLLKFFFKNINDEDGQVIYLVIQATPFFSAFKTSAEIIHNAIPFVIRHSATGSLEYIDHESVPYLGYLPQDIVEKDALQLYHPGDLGYLRQIYETIVKEGGVQRSKPYRMMAQNGDYLKLETEWSSFINPWSKKLEFVIGKHYIVEGPSNPDVFQMPDPEKSLKFTDDEKAKAAALREKITRVMTEVLTKPAEIAKQQMSKRCQDLASFMESLMEETPKIEEELRLEIQDQDHSYYERDSVMLGGISPHHDHSDSKSGTDTPVSYNQLNYNETLQRYFDSHEPYSFEDYYLMDSENKIQMKENEEGSVSKCISPMAQASTECDRTSSSECSGLGIGNSCPCDYQPMRLTESLLNKHNAEMERELIKMYRENRSSKGDREKASNETRQKKKQHLARCNAAFQPTSLGLPDSQPHGVKRPSKQAEEASAHKHRCSSPRPIRHSAVSNNQPVAINSVVTNMWPTTAANTMNTCHLQGLGMPPQVSFMTPMAMPGQYPMCYIPVPVQPIQPQSDSYQNTNSNNNYPYQPQPMPYMMYGHAMYGSPFMYPSVDPRTYVPQTTSGHNIPPFGLSSSNYQEACKLTVPLKTSKACRITRENHNQALRRDGVNYSAGTSNRNTEVNNDKDIRKPRATNSNRTVEKTDEESSFSSFYSSFFKTESGSAEDSDAKKSWHKNHKGDDLMSLQSSTEAVTYAPNKSQAQQKKVDPSWVEEVCVTSELIYKYQIRTKSLEEVLSEDKKKLETLEQPLLVSQQLGQLYLDLQLQGVAARLTLEEGITSSSSSGEENSSMSSKKIRRRKREYSKLVMIYEEDAPLPPPDNVAGTSDS-