Monarch geneset OGS2.0

DPOGS200685
TranscriptDPOGS200685-TA1155 bp
ProteinDPOGS200685-PA384 aa
Genomic positionDPSCF300353 - 29861-31573
RNAseq coverage144x (Rank: top 54%)
Annotation
HeliconiusHMEL0083018e-15375.38% 
BombyxBGIBMGA008916-TA2e-14266.26% 
DrosophilaHLHmgamma-PA4e-2436.94% 
EBI UniRef50UniRef50_C5NS703e-8090.18%Enhancer of split mbeta-2 (Fragment) n=1 Tax=Bombyx mori RepID=C5NS70_BOMMO
NCBI RefSeqXP_001949270.11e-3459.84%PREDICTED: similar to AGAP012342-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|2517527781e-7990.18%enhancer of split mbeta-2 [Bombyx mori]
NCBI nr blastxgi|2517527781e-7690.18%enhancer of split mbeta-2 [Bombyx mori]
Group
Gene OntologyGO:00056344.5e-13nucleus
GO:00063554.5e-13regulation of transcription, DNA-dependent
GO:00036774.9e-08DNA binding
KEGG pathway 
InterPro domain[13-71] IPR0115984.5e-13Helix-loop-helix DNA-binding
[14-65] IPR0010922.6e-12Helix-loop-helix DNA-binding domain
[77-115] IPR0036504.9e-08Orange
Orthology groupMCL25552 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200685-TA
ATGTCGGACCCGGCGCCGCTATCCAAAACAGCCAAATACAAGAAGATAACTAAACCGTTACTGGAGAGGAAACGAAGGGCGCGCATCAATAGATGTCTGGACGAATTAAAGGACCTGATGATCGATGACGACAACCTGAGCAAGCTGGAGAAGGCTGATATCCTTGAGCTAACCGTGAATCACCTCACAAAGTTGCACAGACCCAAGGATCCCGTTATGGAAGCGAAGAAATTTCAAGCCGGATTCGGACAATGCGCGGCTGAGGCTTGTAGATTTATTATGTCCGTACCAGATTTAGACTCCAAAGTTAGTCAAAATCTCGTTGGACATCTGTCGAGACTGATCACATCCCAGCCGCTGACGATACAAGTACCGGAGAGGTCTTCATTCTCGCCGCCGACATCTCCGTCGTCCGTTGTCTCCGATAGACATCATTACTACAGCGATCACGAGAGATCATCCTCAGACGCTGAGGACTCTGTATACTCGGGAGACAGCGCAACAAAACAATGGACATACAAACCCAGTAATAAACAAAGTTTACCAGTTACCGGATTACTTACGACAGTTGACAAGCTATCGCCCCATAACCCAGAACACACCTTCAACGGCCATCGGAACGGGACTTACTTTAACAAAGTTCCAGCTGAAGCGAAAGACGTTATATTGCAGAAGATAAGACAACACATCATGGACAAACGCGGCAACGAGTACGTCGGACACATGGATGTCAACGCAAACGCGGACATCCCCAACGAGAACAGGTACCTTCGTGAGGAGGTCTACAGGAGTGAAGCGCATCACTATCCAGTGCATTACACGACTAACGACACGTTGGATCTCAGAAAAGTAAGATCGCCAGCAAGACAAGCACAAGAGAAATTGAATTCCCCGCCGGAACCGATCCATCCGAACTCAGAATCGGAACATTTAGACAAAAAGCTCCAAATATCCGTCAGCGTTCCGGAACATTGCGAGTCACCGATGGACTACAGCAACTTGCCGCCCAAGAAGAAGAGGAAGTTGATCGAGTACCAGGAGTACAAAAAACAGGAGGAGGCGAGGAGGCAGAACGCTTTCTACGAGGAAAAGGAACGCCGATACGGAGCGCCTTCCGACAGCGAGAATGACGCTAACAAGTGGCGACCTTGGTGA

Protein sequence:

>DPOGS200685-PA
MSDPAPLSKTAKYKKITKPLLERKRRARINRCLDELKDLMIDDDNLSKLEKADILELTVNHLTKLHRPKDPVMEAKKFQAGFGQCAAEACRFIMSVPDLDSKVSQNLVGHLSRLITSQPLTIQVPERSSFSPPTSPSSVVSDRHHYYSDHERSSSDAEDSVYSGDSATKQWTYKPSNKQSLPVTGLLTTVDKLSPHNPEHTFNGHRNGTYFNKVPAEAKDVILQKIRQHIMDKRGNEYVGHMDVNANADIPNENRYLREEVYRSEAHHYPVHYTTNDTLDLRKVRSPARQAQEKLNSPPEPIHPNSESEHLDKKLQISVSVPEHCESPMDYSNLPPKKKRKLIEYQEYKKQEEARRQNAFYEEKERRYGAPSDSENDANKWRPW-