Monarch geneset OGS2.0

DPOGS209925
TranscriptDPOGS209925-TA1272 bp
ProteinDPOGS209925-PA423 aa
Genomic positionDPSCF300180 - 115530-130653
RNAseq coverage544x (Rank: top 23%)
Annotation
HeliconiusHMEL0167502e-11268.17% 
BombyxBGIBMGA010929-TA2e-11260.83% 
Drosophilacwo-PA5e-3448.59% 
EBI UniRef50UniRef50_D6X1U32e-3836.04%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X1U3_TRICA
NCBI RefSeqXP_001812240.11e-4736.16%PREDICTED: similar to class b basic helix-loop-helix protein (bhlhb) (differentially expressed in chondrocytes) (mdec) (sharp) [Tribolium castaneum]
NCBI nr blastpgi|1892408373e-4636.16%PREDICTED: similar to class b basic helix-loop-helix protein (bhlhb) (differentially expressed in chondrocytes) (mdec) (sharp) [Tribolium castaneum]
NCBI nr blastxgi|1892408372e-4636.38%PREDICTED: similar to class b basic helix-loop-helix protein (bhlhb) (differentially expressed in chondrocytes) (mdec) (sharp) [Tribolium castaneum]
Group
Gene OntologyGO:00056341.6e-15nucleus
GO:00063551.6e-15regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[59-149] IPR0115981.6e-15Helix-loop-helix DNA-binding
[64-117] IPR0010923.2e-14Helix-loop-helix DNA-binding domain
Orthology groupMCL18267 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209925-TA
ATGGAAACACGTCATTATTGGGAAGAGAATGGACATGCCGTCAAATATGACAACTACTCCAATGAGGAGTTCGCGCGCGAGCCGCTGAGCTTCGCGCCGCCGTCCGAGGACGAGGCGGAGTACCCTCCAGGGTACAAGAAGGGGAAGGTCTCGAGGGCAAGTCACACTGAATATGCCCGTCAAGATCCAATGTCGCACCGCATCATCGAGAAGCGGCGGCGGGACCGGATGAACAACTGCCTGGCGGACCTGTCCCGCCTCATACCACCCGAGTACCTCAAGAAGGGCCGCGGCCGCGTGGAGAAGACGGAGATCATAGAGATGGCCATACGACACCTCAAGTATTTACAGGACAGAGTCCACGTTCTGGAGCGGGGGTCGGAGTTCCTCGCGGGGTACCAGAGGGCGGGGGCGGAGGCGGTGCGGTTCGTGGAGCTCCAGGGCTCCCGTGACGGCCTGGCGGATCAGCTCGCTGCACACCTACACTCACACGCTGACATGATGGCCAAAGAAGCTGTACACGAAAAGCGTGTGTATCCGAACTCCTCGTCGGAGACGACCAGCTCGTCGAGCAGTTCCCAGGGCTTCGCCGTGAAGGTGATCCAGCGGCCGGAGCCCCCGCCCGCGTTCCCGGAGCCCTACGAGCCTGACAGACAGGAACATTTCGCGGACTGCGAGAGATTGTCGGTGCACCAACCAGCTGTAATGGAGCCGTTGGAAGGCGAGCCTCTCCCGCTGGACGGACGAGTGAAGAAGGAAGTGACGCTGAGGAAGATTAGGAAGCCAGAACACGAGGACTACTTGCACTCGTACAAGTTCAAGAACTCCATAGAGAGGAGGTTCTCCAGGTCGCAGGACTCCGAGGGTGACGCTTGGAGCGCCGGGCCGACAGCGAAGGCCTACAATCATAAACGTCGGAGGCCTACCAAGCCCGCGCCGCCCTCCACGTCCACTTCCGCCTCGGGCTCCACCGAGGAAGCGCGCGACACAAGCCCACAAGACACGTGCAGCGACTCCCCCCACCACCACTCGTTCGACAAGCCGCCGCCGCCCGCGCAGTACGTGCCCGTGTTCGCCTTGAACGCGCTCGGCAAGTACTACGTGCCGCTGAGCGTGGAGTACGGCTGCGTGTCTCGTCAGCTGGGCGCGGGCGTGACGTCACTGGAGGCGGCCGAGGCGCGCGCGCTTCACCCCGTCACCATACACGTGAACTTCCAACCCTGCATCGACTACCTCAAGCGGGAGCCCGACCCTCACTGGCGCCCGCTCTAA

Protein sequence:

>DPOGS209925-PA
METRHYWEENGHAVKYDNYSNEEFAREPLSFAPPSEDEAEYPPGYKKGKVSRASHTEYARQDPMSHRIIEKRRRDRMNNCLADLSRLIPPEYLKKGRGRVEKTEIIEMAIRHLKYLQDRVHVLERGSEFLAGYQRAGAEAVRFVELQGSRDGLADQLAAHLHSHADMMAKEAVHEKRVYPNSSSETTSSSSSSQGFAVKVIQRPEPPPAFPEPYEPDRQEHFADCERLSVHQPAVMEPLEGEPLPLDGRVKKEVTLRKIRKPEHEDYLHSYKFKNSIERRFSRSQDSEGDAWSAGPTAKAYNHKRRRPTKPAPPSTSTSASGSTEEARDTSPQDTCSDSPHHHSFDKPPPPAQYVPVFALNALGKYYVPLSVEYGCVSRQLGAGVTSLEAAEARALHPVTIHVNFQPCIDYLKREPDPHWRPL-