Monarch geneset OGS2.0

DPOGS207197
TranscriptDPOGS207197-TA2637 bp
ProteinDPOGS207197-PA878 aa
Genomic positionDPSCF300001 + 5749334-5788647
RNAseq coverage420x (Rank: top 29%)
Annotation
HeliconiusHMEL0098182e-11158.73% 
BombyxBGIBMGA000657-TA4e-13455.31% 
Drosophilagce-PC3e-8239.37% 
EBI UniRef50UniRef50_B0LL822e-16653.29%Juvenile hormone resistence protein II n=8 Tax=Bombyx mori RepID=B0LL82_BOMMO
NCBI RefSeqNP_001108457.14e-16753.29%juvenile hormone resistence protein II [Bombyx mori]
NCBI nr blastpgi|2943453675e-16653.29%methoprene-tolerant homolog-2 [Bombyx mori]
NCBI nr blastxgi|2943453674e-16248.55%methoprene-tolerant homolog-2 [Bombyx mori]
Group
Gene OntologyGO:00056346.2e-14nucleus
GO:00063556.2e-14regulation of transcription, DNA-dependent
GO:00037003.8e-10sequence-specific DNA binding transcription factor activity
GO:00071652.2e-09signal transduction
GO:00048712.2e-09signal transducer activity
GO:00055152.6e-06protein binding
KEGG pathwayphu:Phum_PHUM4263004e-27 
 K02223 (CLOCK, KAT13D)maps-> Circadian rhythm - fly
    Circadian rhythm - mammal
InterPro domain[75-134] IPR0115986.2e-14Helix-loop-helix DNA-binding
[77-129] IPR0010921.6e-13Helix-loop-helix DNA-binding domain
[90-105] IPR0010673.8e-10Nuclear translocator
[145-211] IPR0000142.2e-09PAS
[146-205] IPR0137679.6e-07PAS fold
[338-419] IPR0136552.6e-06PAS fold-3
Orthology groupMCL26651 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207197-TA
ATGGCAGATTGGCCAATGCTGGAGTGCGACTATCACAATCGGTACGATTCCTATCAGTACAACTACTATCAAGAGAAGGATGAGCAGCCGTTGCCCTCACAATGTCTGCAAACGCAAGCGCAACCCTACCGTATGCCGTCACCTACTCTCACCCTACTACTATCCCCACCGCCTCCCGAGCAAACATCGTCATGCACCTATAAGCCTGCCTCCATAGGAGAAAGTCCCAGGGAAGTCAGAAACAAAGCTGAGAAACAGCGACGGGATAAAATGAACCAGTCCATATCACGTCTAGCAACCATTGTCCCAACCGTCGTAAGACCCGGTCGAAAATTAGATAAAACAAGTATATTGCGGCTAACAGCGCATTACTTGCGCTCCCATCAACATGTATTCGGAAATAGCATCGATCGCTCCCCAGAATTCAGCATGCAGTTTATACAAGGACTTTTGAAAAATCTGAAAGGCTTCCTCATCACTATCACATATAAAGGATTAGTTGTCGTTGTGTCTCCAAATGTTCAAGAATACCTAGGATATTCCGAGGTTGAACTTCTTGGTCAAAATATTTTAAATATCATCCATGAAGACGACCATCAACTCTTAAGAGAACAGATTTACCCTAGAAGTTGCACTCTAGGATCAAACGGGGAATTATTACTACCAAGGGAATCAGAAGCTGAGAAAAAGGTTATGAAAAGCTTGATTAATGAAAAACGCAATTTCATTTTAAGGTTCAAAAAAATGACACAGCAGCGTTCTAATCCGCCTGAATACATCACATGTCACGTGGAAGGGTCTTTAAGAAAATCAGATCGAGCCGGGGTTTATTTTGACAGCATTGTTCACATCGGTCGTCGGGTTCGGGCAAGAGGAGAAAATCCATTCGCTAGTGGAAATGATGTAGTATTCATTGGCATGGTAAGACCTACAACAGAGACTTTTATAACAGAGAGTGGTCTCGAGTCTTTTAAGATGGAGTATCGAACTCGACATTCTATAGATGGCGAGATAATACAGTGTGAGCAACGCATTGCTCTTGTCACAGGCTACATGACTCATGAAGTGAATGGAGTTAATGCAATGAACTTTATGCATAGAGATGACGTTCGCTGGGTGATCATCGCATTAAGGGAAATGTACGACAAACATCGCTTGGTCGGTGAATCATGCTATCGATTGATGACAAAAAATGGTCAGTTCATATATATGAGAACCCTTGGCCATCTAGATGTAGACCAAAACTCCAAAGAAGTAACCAGCTTCGTATGCACGAACACTGTTGTCGCGGAGCACGAAGGCAAGAAACTTATAAAATTGATGAAGAAGAAGTTCACTCTAATGATCAATAACAACGAAGCTGTTAAAGACCTTGAAGACTCCGAGGTTAATGATAAAAAAAATCTACCTGTAGAAGATCCTAGACAACTTGAAAAAGTTATATTGCATCTGGTGACAAATTTACCGTCGTTTAAATCAGGTGATATTTTTCAACAGTCAACATATAATGGTGAAATATCTCCTCCAGAATTAGCTATCATACCCCCAAGGAAAGAAAAAATTCAAAAGGCAATTGAGAGAAGTTACAGCCTTATTAAAAATCTTCGAGACTCTGAGTCATCCAAAAAACAGTCCCCAATGTTTAATCACCATCAATCAAACGAACCGTTAGAACCGAAAGAACCAATGAAACTTATGGACACATTGGAACACCTGGAATCTCTTGAACTAAAAGAGCCACTTAGAGAAGAAACTCATACATCAGCTTTTGTCCCAGTTTCAACAAAATGTAATAGTTTAGTTTTATATCGTCCGAATCCTTCAAATGCATTTGCATCCAGAAATCATCCGAGTGTTCAAGAAAATAATAATTTACAATTTCCAAATGTAAGTGAAATAGAGAAGACTCTCCCAATGGAATATATAGCTTACCCCGGTGCACCACATGGTTACAAATCACACAACATCCCAACACGAGTGGTTTCAGCCCCAATTCCAACACCAGAGTCAAGTTTTATGCCTATTCCAGAAACTGTAATAGGAATGACTACCAAAACTGACCCAGAACCGTTGAGAATGACTCCAGAACTATCACCCGTCATCTCTCCTGATTTTGATCTTGACTATGAAGCTACTCAAAAAATTCTTGAAGATTTCTTTGAACTCGAAAAAAAAGAGGGGAGCCCTAAACCAATCCAATTTGTCAAAAGCCCATTTGAAAATCTTTTTCCACCTGTCGGTCCTAATGGCGACAACCAAGCTTTTAATTGTTTCGGCGAACCAACGCCAAGCACATCGGGAATTAAACGACACTTTGAGGATTTCGATGACGATACAAACTTGGATGATTCTAGTTTGTCTGAAACATTCAAAACACCACGTCAACCAAAAAAGTATCAAACAAAAAAACTTAAAAGATCCCAAACTAGTGCGGAATATGAAATTGAGAAACTTATTTCACGCTTAAATAAAATTCATGTACCCAAAAAGAAGAATAAAAAATTATTCTCGATAATACGAGATGTTGAAAGATCTCGTGGTATCCGGATAAGAAATTGCAATAAAACTATCGGCGAAACTCAGTACAGTGATAACTCTGAGAAAGAAGATGGAAGTATTGATGGTTACATTTGCTAA

Protein sequence:

>DPOGS207197-PA
MADWPMLECDYHNRYDSYQYNYYQEKDEQPLPSQCLQTQAQPYRMPSPTLTLLLSPPPPEQTSSCTYKPASIGESPREVRNKAEKQRRDKMNQSISRLATIVPTVVRPGRKLDKTSILRLTAHYLRSHQHVFGNSIDRSPEFSMQFIQGLLKNLKGFLITITYKGLVVVVSPNVQEYLGYSEVELLGQNILNIIHEDDHQLLREQIYPRSCTLGSNGELLLPRESEAEKKVMKSLINEKRNFILRFKKMTQQRSNPPEYITCHVEGSLRKSDRAGVYFDSIVHIGRRVRARGENPFASGNDVVFIGMVRPTTETFITESGLESFKMEYRTRHSIDGEIIQCEQRIALVTGYMTHEVNGVNAMNFMHRDDVRWVIIALREMYDKHRLVGESCYRLMTKNGQFIYMRTLGHLDVDQNSKEVTSFVCTNTVVAEHEGKKLIKLMKKKFTLMINNNEAVKDLEDSEVNDKKNLPVEDPRQLEKVILHLVTNLPSFKSGDIFQQSTYNGEISPPELAIIPPRKEKIQKAIERSYSLIKNLRDSESSKKQSPMFNHHQSNEPLEPKEPMKLMDTLEHLESLELKEPLREETHTSAFVPVSTKCNSLVLYRPNPSNAFASRNHPSVQENNNLQFPNVSEIEKTLPMEYIAYPGAPHGYKSHNIPTRVVSAPIPTPESSFMPIPETVIGMTTKTDPEPLRMTPELSPVISPDFDLDYEATQKILEDFFELEKKEGSPKPIQFVKSPFENLFPPVGPNGDNQAFNCFGEPTPSTSGIKRHFEDFDDDTNLDDSSLSETFKTPRQPKKYQTKKLKRSQTSAEYEIEKLISRLNKIHVPKKKNKKLFSIIRDVERSRGIRIRNCNKTIGETQYSDNSEKEDGSIDGYIC-