Monarch geneset OGS2.0

DPOGS207699
TranscriptDPOGS207699-TA1467 bp
ProteinDPOGS207699-PA488 aa
Genomic positionDPSCF300042 - 1710921-1713613
RNAseq coverage14x (Rank: top 82%)
Annotation
HeliconiusHMEL0084264e-17075.71% 
BombyxBGIBMGA009932-TA6e-14671.81% 
Drosophiladpn-PA1e-4760.93% 
EBI UniRef50UniRef50_E0VAV51e-4666.17%Transcription factor hes-1, putative n=7 Tax=Eumetazoa RepID=E0VAV5_PEDHC
NCBI RefSeqXP_002423249.12e-4766.17%transcription factor hes-1, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420047665e-4666.17%transcription factor hes-1, putative [Pediculus humanus corporis]
NCBI nr blastxgi|3838553345e-4432.61%PREDICTED: uncharacterized protein LOC100876689 [Megachile rotundata]
Group
Gene OntologyGO:00063558.5e-15regulation of transcription, DNA-dependent
GO:00056341.2e-13nucleus
GO:00036772.7e-09DNA binding
KEGG pathway 
InterPro domain[116-173] IPR0010928.5e-15Helix-loop-helix DNA-binding domain
[115-175] IPR0115981.2e-13Helix-loop-helix DNA-binding
[190-230] IPR0036502.7e-09Orange
Orthology groupMCL20515 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207699-TA
ATGAATGGTACTGATGGGACAAATTTAAATAATGCCCTATTTCGTCGGACGTCAGTTGATTGCGAACCGTCGTCCGGTGAACATATAGGAAAAATGCAGTGCAGTGAGGACGATGACTCCCAAAGCGGATTCAATTCAGAACCGACCCAAGTTCTATCTAAGGCTGAATTAAGAAAGACGAACAAACCTATAATGGAAAAGAAGAGGCGAGCGCGCATTAATAACTGTCTCAACGAATTAAAAGATCTCTTGATGGATGCCATGGATAAAGACTGCAGTGAGGACGATGACTCCCAAAGCGGGTTCAATTCAGAACCGACGCAAGTTCTATCTAAGGCTGAATTAAGAAAGACGAACAAACCTATAATGGAAAAGAAGAGGCGAGCGCGGATTAATAACTGCCTCAACGAATTAAAAGATCTCTTGATGGATGCCATGGATAAAGACCCAGCTCGGCACTCGAAACTGGAAAAGGCCGACATCCTGGAGCTGACCGTTAAACACCTCCAGACCCTCCAGCGTCAGCAGCTAGCTGCAGCGATCGCTGCGGACCCCGCCGTCCTACACCGATTCAAGGCCGGCTTCGGAGACTGTGCCGGGGAAGTCCGAAGATACCTCTCCAGACTCGCCAGTGTACCCACCGGACTTCGCTACAGACTTGGAAACCATCTCAACACCTGTCTATCAGGGATAGAACGATTACACTCCACTGACTACCCACCACTGATACCGGACCCTTTAAGACTAGACGACGAAAGGCCGAGTGCGTTCCACTACGTAAGATCGACTAAGCCGACCAGCCCGCCCTTGAGTCCCTTGTCCTGTGATTCCGCGTGTGATTCTTCTACGGAACTCGAGACTCCGCCGCGACCCGTACAGAAATACCCCTTCCCGACACCTCCCAGCCATTCCGCATCGGATCAAGATTCTAGTCCAGAGCAAAAACCTACCGTATCCTCCACAACGATATCACCAGACACATTGAAACAAGATGTCCTGAGGAACAAAACAGTGATGGAACCGCTATCAATTGTTATCGATGTCGAGAATTACAGAATAGGAATCGACGCGTCACCAAAGAGAGCGGTCGATTATTCGATACGTCACAAGCTGAAACGACATTCAGATGCCAAAGGAGCAATGCCAAAGCTGATCAAACTTGACGGGGAGAGGAGGGAAAAGCAATTCCTAGATAGATTGCAAGCGCCTGAAGTCAAGACTGAACGTTCCGCTTTTGTGCGATTACCAGACAAGGTGATCCCTGAAAGAAAGTTGACTATACCCGAAGGTATACCAGCTGCGAGGGAAAAGTCCTCAGAAATGAAAGCCAACGTGCCCAGAACAGTCATAAGCCACACAGCGGCGTCACTGGCACAGTCGAGTCTTCAAAATATTAAGACTGAAGATAAAGACAGTCCTAAATCACCAAAACAAGGAACCAGTTCAGAAATGTGGAGACCATGGTGA

Protein sequence:

>DPOGS207699-PA
MNGTDGTNLNNALFRRTSVDCEPSSGEHIGKMQCSEDDDSQSGFNSEPTQVLSKAELRKTNKPIMEKKRRARINNCLNELKDLLMDAMDKDCSEDDDSQSGFNSEPTQVLSKAELRKTNKPIMEKKRRARINNCLNELKDLLMDAMDKDPARHSKLEKADILELTVKHLQTLQRQQLAAAIAADPAVLHRFKAGFGDCAGEVRRYLSRLASVPTGLRYRLGNHLNTCLSGIERLHSTDYPPLIPDPLRLDDERPSAFHYVRSTKPTSPPLSPLSCDSACDSSTELETPPRPVQKYPFPTPPSHSASDQDSSPEQKPTVSSTTISPDTLKQDVLRNKTVMEPLSIVIDVENYRIGIDASPKRAVDYSIRHKLKRHSDAKGAMPKLIKLDGERREKQFLDRLQAPEVKTERSAFVRLPDKVIPERKLTIPEGIPAAREKSSEMKANVPRTVISHTAASLAQSSLQNIKTEDKDSPKSPKQGTSSEMWRPW-