Monarch geneset OGS2.0

DPOGS213896
Transcript        DPOGS213896-TA  1392 bp
Protein           DPOGS213896-PA  463 aa
Genomic position  DPSCF300218     367064-374563
RNAseq coverage   824x (Rank: top 16%)
Annotation
Heliconius      HMEL006056        6e-47  88.46%
Bombyx          BGIBMGA004632-TA  3e-20  86.76%
Drosophila      Mitf-PC           6e-50  62.50%
EBI UniRef50    UniRef50_D6W756   2e-70  42.22%  Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D6W756_TRICA
NCBI RefSeq     XP_394278.2       7e-76  48.17%  PREDICTED: similar to CG17469-PA.3 [Apis mellifera]
NCBI nr blastp  gi|380026445      8e-76  48.60%  PREDICTED: microphthalmia-associated transcription factor-like [Apis florea]
NCBI nr blastx  gi|380026445      3e-74  48.60%  PREDICTED: microphthalmia-associated transcription factor-like [Apis florea]
Group
Gene Ontology    GO:0006355  5.3e-78  regulation of transcription, DNA-dependent
                 GO:0005634  6.4e-20  nucleus
KEGG pathway
InterPro domain  [11-264]   IPR024097  5.3e-78  Basic helix-loop-helix leucine zipper transcription factor
                 [160-237]  IPR011598  6.4e-20  Helix-loop-helix DNA-binding
                 [164-222]  IPR001092  2.8e-15  Helix-loop-helix DNA-binding domain
Orthology group  MCL17704  Insect specific

Nucleotide sequence:

>DPOGS213896-TA
ATGATACGGTTATACCAAACATGGGACAGTCCGCCCACATTCAAAACCTTAACGCCCACATCCCGCACGCAGCTTAAACAACAGTTGATGAGAGAGCATGCCCAGGAGCAACTACGGAGGGAATCGTTACAGGTGCGAACAGTTTTGGAGAATCCCACCAGGTATCACGTGATCCAGAAGCAGAAGAGCCAGGTGCGCCAGTACCTCAGCGAGTCATTCACACCACAAACGCAGGTGTCAGCTGTCCGTGGTCCGGTGCAGAGCGCCCCGGAGCTAAGGTCGTCATCACCAGAACGTGGAACTGTCCTCAGTCCAGGACTATGCTCGGCAGGAAACTCAGAAACGGATGAATTTCTGGATGACATCCTATCCCTGGATAGCGGGGCTGGTCCCCTGTCGTCTTCGGAGCCCCCCTCTACAGCCAGCTCCGTGGCCGGGGACTGCGCCCTCTCAGACGCAGACATGCACGCGCTCGCTAAGGATAGACAGAAGAAAGACAACCATAATATGATCGAACGCCGCCGTCGTTTCAATATAAACGATAGAATTAAAGAGTTGGGTACCTTACTGCCCAAAACGAACGATCCCTTCTACGAGGTGATACGGGACGTGCGACCTAACAAGGGGACCATCCTCAAGAGCAGCGTCGACTACATCAAGTGTCTGCGGGACGAAGTCAACAGGCTCAAGCAGAGCGAACAGAGGCGGAAACAGATTGAGCTGCACAACCGGAAACTCATGCTGAGGATACAGGAGTTGGAACGTCTGGCGAGAGTTCATGGACTTCCGGTCAATGAAAGCTGGTCGGCATCACAGGAGGACTCGGGGGTCGAAGCCTCCCCGGAATGTTACACTGACAAGAACCCAGTACACCAAGAGCCTCCAGCTGTGCAGCCCAAGAGTGAACCAGCGCCGATGGAACTGTCCGATGGAAGGGACGCCCTTGCAGCACTCACAGCGCTTGACGGTTTGAAGCTGGGCTCATGTTCTCCCCTGGACCGCGGAGCATCTCTGTCCTTGGACTGCCTGGAACCAGACCTCTGTCTCGACACACCTGGAGACCTCTTCCACAAAGATATCAAGCAGATGCGTTTGTCACCCACGGCTGGTCTCCTTGATGATGAAGCGGTGATGAACCTGGCTCAGATAGAAGACCTCATGGATGACGACTCACACAATCCCGTCACACAGGGTGACCCGATGTTGTGTTCGTCGCCGAGCGCGATGGGGCCGGCGGGAGATTCGTCCTGCGCCATGCTGCACATAGACCTCGCGCTGCACAACACAGACTACGGCTCACGATCTCTCCTGTCCGAGCTGAGTGACGGCCTGCCTCTGTTGATGGGTGCTCCGCCCCCCCGGGCCTGCTTCGACATGGATCTAGGGGCGTAG

Protein sequence:

>DPOGS213896-PA
MIRLYQTWDSPPTFKTLTPTSRTQLKQQLMREHAQEQLRRESLQVRTVLENPTRYHVIQKQKSQVRQYLSESFTPQTQVSAVRGPVQSAPELRSSSPERGTVLSPGLCSAGNSETDEFLDDILSLDSGAGPLSSSEPPSTASSVAGDCALSDADMHALAKDRQKKDNHNMIERRRRFNINDRIKELGTLLPKTNDPFYEVIRDVRPNKGTILKSSVDYIKCLRDEVNRLKQSEQRRKQIELHNRKLMLRIQELERLARVHGLPVNESWSASQEDSGVEASPECYTDKNPVHQEPPAVQPKSEPAPMELSDGRDALAALTALDGLKLGSCSPLDRGASLSLDCLEPDLCLDTPGDLFHKDIKQMRLSPTAGLLDDEAVMNLAQIEDLMDDDSHNPVTQGDPMLCSSPSAMGPAGDSSCAMLHIDLALHNTDYGSRSLLSELSDGLPLLMGAPPPRACFDMDLGA-
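
The two records above appear to be related by conceptual translation: 1392 bp equals 3 x (463 + 1), so the transcript would encode 463 residues plus one stop codon, which the protein record seems to mark with a trailing "-". A minimal Python sketch of that check follows. It assumes the standard genetic code (the record does not state which table was used), and the name `translate` and the `stop_symbol` parameter are hypothetical helpers introduced here for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumption: the record does not name its table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, mirroring the trailing "-"
    seen at the end of the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last three codons of DPOGS213896-TA.
print(translate("ATGATACGGTTATACCAAACATGG"))  # MIRLYQTW
print(translate("GGGGCGTAG"))                 # GA-
```

Both fragments agree with the start (MIRLYQTW) and end (GA-) of DPOGS213896-PA. Passing the full nucleotide sequence to the same function would extend the comparison to the whole record.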