Monarch geneset OGS2.0

DPOGS213068
TranscriptDPOGS213068-TA2028 bp
ProteinDPOGS213068-PA320 aa
Genomic positionDPSCF300016 - 732650-737352
RNAseq coverage160x (Rank: top 52%)
Annotation
HeliconiusHMEL0135828e-3579.31% 
BombyxBGIBMGA007685-TA2e-1073.68% 
DrosophilaAtf6-PC5e-1337.40% 
EBI UniRef50UniRef50_UPI00015B4D354e-2240.96%UPI00015B4D35 related cluster n=2 Tax=unknown RepID=UPI00015B4D35
NCBI RefSeqXP_001605766.18e-2340.96%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|1565431462e-2140.96%PREDICTED: hypothetical protein LOC100122162 [Nasonia vitripennis]
NCBI nr blastxgi|1565431461e-4530.45%PREDICTED: hypothetical protein LOC100122162 [Nasonia vitripennis]
Group
Gene OntologyGO:00063558.3e-08regulation of transcription, DNA-dependent
GO:00435658.3e-08sequence-specific DNA binding
GO:00037008.3e-08sequence-specific DNA binding transcription factor activity
GO:00469838.3e-08protein dimerization activity
KEGG pathwaydre:7776123e-13 
 K09049 (CREBL1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[197-250] IPR0116168.3e-08bZIP transcription factor, bZIP-1
[196-259] IPR0048275.8e-06Basic-leucine zipper (bZIP) transcription factor
Orthology groupMCL13751 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213068-TA
ATGGATTCCGATATACGCCTCGATAGTAAAGACTGTTTATACGGTGAAGATTTTTTGGAACATCTTACCAATGATTATCATTGGCCCTCTGTATTAGATGTGACTGATGCTACTCCCAGCGAAATATTAGCTGATGCAGCACCAATTCTAGTCCCTGCTATATCACCAGATCTAAGTATCAAATCTTCTCCCAATGAGAGCAGTAATTCCGATTCTAGCAGTGATGATGACTCAAAAAATGGGTATTACAAATTTACTACAAGACCCATGCCCCAATCTCCATCAGATTATTCTATTAATGATGAACCTACAAATCAGCTAAAGCTTGAAGATGTAGAGTTATTTCTTCAAAAATCAGTACCAACAAGTCCACCAGTTCTTGATATATCACCTACAACAAGGCCACCAGAACAAGCCAGTCCTGTTATGTCTATTGAAAATGGTGTGATAAAGGCGCATTCACCAAAAAGTATAGTCATCAATCCCAATTTAGATGTCAATAAAAAAAGTAACAATACAAATAATGTCATTGGAGATGAGACTGACCTTGATTTCATAGATTTCTCAAAGCTTACCGATATTGAGATCAGGGCTATCAAAAAACAACAAAGAATGATCAAAAATAGGGAATCAGCATGTCAGTCACGGCAAAAGAAGAAAGAATATGTGACAGCTTTAGAAAACCAACTGTTAGAAGCACATCAAGAAATAAGAAGATTACAAATAGAGAATAAACAACTACGGGAACAGCTCATACTAAATGGGAGAAGCAGAAAGATACCAAAGCTCGATTCAACAATCTCAATACCTAAAAGGAACATTGCTGTTATTTTTGCAATGGTATTCATGGTGTCATTAAATTTCAATATTTTAGGCCAGAATGTTAAATTTTTGATAACTATAGATCAAGTTCGTAATTTTTTTTTTCAATACACATCTAAACTTTTTGTTTTAGTGTTTTAAATTTTGATATATGACGTCACACGATGTCACACTTCTGTGATAGCCTCCCCCCCCCCCCATCAACGTGTGAAGTACTTTATGGATGGCTCCTTAATACATTTATTATTTTTAGACTATGATCAGTATCTCAACCGTAGTAATGAAGGTATGGACTGTAGGAACACAACATTAAGTGATTTCCTTAAGATTAATCAGACAGAGAGTATAAGGATAGCTGGTGAATTGAAAAGGTGGATAGGAGGAGGGAAAACCTTGAACTTAACAAAGGCACTGAAAAAAAATAAGGTCTATTTAAACGAACAGCATATTTCAGGAGGATTTCTGGACAGCTATGAATTAATCAGCAAATTAAATCTGCATAATTTAATCGACATTCCCATTACACCCAAACAAGCGAGGAATGCCAGAGAAAAGTCACGTTTAAGGAAATTAAGGAGGCATACGTCGAAAGATATAGATTTCGCTGACAGTTCGTTATACTATGAAAAATTATATAACAAACCTATTAGGAAATCGGTAGATTTTAATTTGGACGATTTTGGGGAATGGAACGCGTTGCTTCAAGCTCTTCATAGACGGGATGACACATTTTACGTAGTAGGCGTGGGGAAGGGAGAGCATTTGTTGTTACCGGCTGTCACTCACAACGTCACTCGTCCACCCAAGATGGCGTTAATATTGCCCGCACGTTCCGGAAATGATTCATTGATGAATGGGCATGTGACACTCATGCAAATTGATTGCTCAGTCGTCAACACAACCCTAGTCAAACTGAAGTCGGAAGCGTTGCCGGAGAGTTTAAGGAAAGTCAATTTCGATGGTAATCCCTCAATTGATACGAAACGAGAAAATGCAAAAATACAGAGATTCAAAGTTGATACTAGTCAAGAAGATGTACATAATAATAGCTTAGAGTATAGAAAACCCTATTTAAATGTCGATAAAAATGATTTGTTTACGCAGTATTTGCTGTCGAAGTCGAATATAAAAAATACCTCTAATGAAATAAAATATTCGGAAAAAAATGATTTTAGAGATAGTAAGAAATCTGATAATGAGAGATAA

Protein sequence:

>DPOGS213068-PA
MDSDIRLDSKDCLYGEDFLEHLTNDYHWPSVLDVTDATPSEILADAAPILVPAISPDLSIKSSPNESSNSDSSSDDDSKNGYYKFTTRPMPQSPSDYSINDEPTNQLKLEDVELFLQKSVPTSPPVLDISPTTRPPEQASPVMSIENGVIKAHSPKSIVINPNLDVNKKSNNTNNVIGDETDLDFIDFSKLTDIEIRAIKKQQRMIKNRESACQSRQKKKEYVTALENQLLEAHQEIRRLQIENKQLREQLILNGRSRKIPKLDSTISIPKRNIAVIFAMVFMVSLNFNILGQNVKFLITIDQVRNFFFQYTSKLFVLVF-