Monarch geneset OGS2.0

DPOGS215281
TranscriptDPOGS215281-TA1896 bp
ProteinDPOGS215281-PA631 aa
Genomic positionDPSCF300047 + 804716-817028
RNAseq coverage1811x (Rank: top 7%)
Annotation
HeliconiusHMEL0213247e-7578.46% 
BombyxBGIBMGA001460-TA1e-13263.06% 
DrosophilaCrebA-PB7e-1550.51% 
EBI UniRef50UniRef50_E2BLF61e-6237.57%cAMP-responsive element-binding protein 3-like protein 4 n=7 Tax=Formicidae RepID=E2BLF6_HARSA
NCBI RefSeqXP_001657112.13e-5649.41%hypothetical protein AaeL_AAEL003646 [Aedes aegypti]
NCBI nr blastpgi|3072049695e-6237.57%cAMP-responsive element-binding protein 3-like protein 4 [Harpegnathos saltator]
NCBI nr blastxgi|3287829299e-7636.93%PREDICTED: hypothetical protein LOC726184 [Apis mellifera]
Group
Gene OntologyGO:00063552.4e-18regulation of transcription, DNA-dependent
GO:00435652.4e-18sequence-specific DNA binding
GO:00037002.4e-18sequence-specific DNA binding transcription factor activity
GO:00469833.4e-15protein dimerization activity
GO:00036772.4e-09DNA binding
KEGG pathwayaga:AgaP_AGAP0014645e-52 
 K09048 (CREB3)maps-> Huntington's disease
    Prostate cancer
    Melanogenesis
    Vasopressin-regulated water reabsorption
InterPro domain[338-402] IPR0048272.4e-18Basic-leucine zipper (bZIP) transcription factor
[338-400] IPR0116163.4e-15bZIP transcription factor, bZIP-1
[311-369] IPR0089172.4e-09Eukaryotic transcription factor, Skn-1-like, DNA-binding
Orthology groupMCL17989 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215281-TA
ATGGATAGTTTTCCTCCCGAGCACATGATGTGCGATCCCTGGCAGGGGACGGAAGATCTGGAGAATATTTTCTCTCTGGATCAGAGTTCTTTGGACTTCCTGGAGAATGCTTTGCCGGATTTCAATTTAACAACGGACAATGCACAGCCAGAGGGACTTAGTAGTTCATGTTCTGATAGCGGGTTATCAAGCGACCATGCTGAGCTCGACTTCGAGCAGCAGTTGTCGCCGAACCTCATCCAGAGTACTGATTATGAGGATTTACCAACAACTATTCTAGAGCCGCTGAGTCCTTGTAATACGAGTGACGTCATCATCCAGGACAACCAGACATTAGACATGCTGGACTTCGAACAGAACGTTGTCCCTGGATTCATTAACACAACATTCCAGAGCCCCGGTAAAGGCGGCAGAAAACGTCGTTTCTCATCAACTCAAACAGTCGTTCAACCCAAGGTTCAGAAACAGACGATAAAGTTGCCAGCGCCAGCGGGCAACAACAAACCGCAGCTGGTTGTGAAGGCGCAACCCCAGAAACCATTAAAAGTAACCAACATCCAAGTCATAAACCCTCAGACTAAGGTCTACTCTAAACCAGTGGAAAGTGTAGCGCCCCAACGCAGAGTGATCCGCGTGGCTCCGATGGCCGGAAACCCTAGATCTATATTACTACCCGTAACATTCAAAGATATGAAAGATTTGAAATCGATCAAAATCATAAATGCATCAGATTTGAAGAACTCGCCCAATATAAAGTTAGCTGCGGCTAATCTACTGTCGCAGAGCAAACTGCAAGATCTCAAGATTGAAACACGCGGTGATGATTACGAACACAACGCTAAATACGACGACTCGGCCAGCGATCACAGTGACGACGACGACGAGAAAGAGACGCAAATAAATGACGGTAGGAACGGATATCCCCGTCTCGTTCTAACGGCCGAAGAGCGCCGACTGCTGGCGAAGGAGGGCATCCAGCTGCCGAACAGTTACCCACTAACGAAGCACGAGGAAAGGGAGCTGAAGAGGATCAGACGCAAAATACGCAACAAGATATCAGCACAGGATTCAAGGAAAAGGAAGAAGGAATACGTCGATGGGCTCGAAGACAGGGTAAAACAATGCACAGCCGAAAATCAGACATTGTTGAGACGGATAAAGATGCTGCAGTCGCAGAATCAATCTCTCAGTCAGCAACTGAAGAGGTTACAGAGCGTGTTGACCGGAGCGTCGTCGTCGGGTCGAGCTCAGCCGGCCACGTGTCTGCTGGTGTTGTTACTGTCCGTGGCGTTGGTAGCGCTGCCGTCCGTCAGGGACGAGGTCCGCCGTCGACCAGCCACCACCACCAGCAGCGCCACCACCACACCACCCTCGCCGGCTATCACACGAGCCCTGCTGTCCGCTACACACAAAATGGTGTTCGATGAGACGGTCATAGATGACGGGGAGTTCAATATGGACGAGCTGATAACGTTCAACAAGGCGCACTCCGACCACGACTACCAGGTGGTGAAGAACAGCGACAGACGGACACACAACGGATACATCGACCTCCCGATAGACGAGGACTGGCCGCCCAAGAAGAAGCGGATGAAAAAGATTGAGTTCGACTACGGCGACGGCAAGGATTACATACCGATAGTCAAAGACGAGAACTACGAGAACATACAGCAGACCGGCAGCGCTGTGGGCCACGACGTCCAGATAGGTGACAACTACCTGACGAACACGCTGCTGTCCACTGGCCGGAAACTCGGCGAGCTCTTGGACATATTCCCTCCCATACCCGTCAAGAACGAAGACATACTGGTCGAGGAAGTAGCGGACTTCGACGAGAGGCACAACGTCACCGAAGTCAAAAGTTTCGTAGTAAATGGGACCTTAAATGAATTCTAG

Protein sequence:

>DPOGS215281-PA
MDSFPPEHMMCDPWQGTEDLENIFSLDQSSLDFLENALPDFNLTTDNAQPEGLSSSCSDSGLSSDHAELDFEQQLSPNLIQSTDYEDLPTTILEPLSPCNTSDVIIQDNQTLDMLDFEQNVVPGFINTTFQSPGKGGRKRRFSSTQTVVQPKVQKQTIKLPAPAGNNKPQLVVKAQPQKPLKVTNIQVINPQTKVYSKPVESVAPQRRVIRVAPMAGNPRSILLPVTFKDMKDLKSIKIINASDLKNSPNIKLAAANLLSQSKLQDLKIETRGDDYEHNAKYDDSASDHSDDDDEKETQINDGRNGYPRLVLTAEERRLLAKEGIQLPNSYPLTKHEERELKRIRRKIRNKISAQDSRKRKKEYVDGLEDRVKQCTAENQTLLRRIKMLQSQNQSLSQQLKRLQSVLTGASSSGRAQPATCLLVLLLSVALVALPSVRDEVRRRPATTTSSATTTPPSPAITRALLSATHKMVFDETVIDDGEFNMDELITFNKAHSDHDYQVVKNSDRRTHNGYIDLPIDEDWPPKKKRMKKIEFDYGDGKDYIPIVKDENYENIQQTGSAVGHDVQIGDNYLTNTLLSTGRKLGELLDIFPPIPVKNEDILVEEVADFDERHNVTEVKSFVVNGTLNEF-