Monarch geneset OGS2.0

DPOGS206083
TranscriptDPOGS206083-TA864 bp
ProteinDPOGS206083-PA287 aa
Genomic positionDPSCF300028 - 133316-136074
RNAseq coverage1203x (Rank: top 10%)
Annotation
HeliconiusHMEL0121082e-13185.37% 
BombyxBGIBMGA006865-TA6e-13382.89% 
DrosophilaCrebB-17A-PE8e-3845.88% 
EBI UniRef50UniRef50_E1ZYA72e-7656.31%Annexin-B9 n=21 Tax=Coelomata RepID=E1ZYA7_CAMFO
NCBI RefSeqNP_001040181.12e-11979.58%cAMP responsive element binding protein [Bombyx mori]
NCBI nr blastpgi|3044214304e-13082.89%creb [Bombyx mori]
NCBI nr blastxgi|3044214301e-13182.89%creb [Bombyx mori]
Group
Gene OntologyGO:00056344.2e-37nucleus
GO:00036774.2e-37DNA binding
GO:00063554.2e-37regulation of transcription, DNA-dependent
GO:00037004.2e-37sequence-specific DNA binding transcription factor activity
GO:00435654.8e-19sequence-specific DNA binding
GO:00469834.8e-19protein dimerization activity
GO:00055152.7e-16protein binding
KEGG pathway 
InterPro domain[126-148] IPR0016304.2e-37cAMP response element binding (CREB) protein
[227-286] IPR0116164.8e-19bZIP transcription factor, bZIP-1
[120-155] IPR0031022.7e-16Coactivator CBP, pKID
[224-287] IPR0048274e-07Basic-leucine zipper (bZIP) transcription factor
Orthology groupMCL13684 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206083-TA
ATGGACGGAATGGTGGAGGAGAACGGAACTTCTAGCGCAGCAGGAGTAGCGGATCCCCTGGCGAGCGGAGGATCCGCAGCGAACGCCACCCCTCATGTGGTGGTCACGAGTATAGTGCAGCTCACATTACCAAGTCAAGCACCATCCGCACAGGTCCAATCAGTTATACAACCAAATCAACAATCTGTCATTCAAACGGCATCCAATATACAATCAGTACAACTACAAAAAGGCAATGTGATATTGGTCAGCAAACCCAGTTCTGTCATACATACTACTCAAGGAACTCTCCAAACATTACAGATTAAACCGGAACCTAACACAATATTAAGTACCCAAGGACAATCTTGCAGTGATGACAGTTGTAGTGATGAAGAGAGTCCCAAGAGGAAATACAGAGAAATGTTAACGAGACGTCCATCATATAGGAAAATACTTAATGACCTTGGAGGAACCGAAATTGCTGAAAATCGCATGGGAACTAAAGCAGTATTGGAAAGTGAAGTGTCGCTGTCACCTTCTTTATCGTTTGCGCCAGTTATTCCAGCGAGTTCACTTCAGACTGAAAGTGGATTACACACGTTAGCAGTGTCAGGCACCACAGGCGGTGGGACCTTAGTCCAATATGCAACTAATCAAGATGGTCAATTCTATGTACCGGGGCCAATATTAGAGGACCAAACCAGAAAACGTGAATTAAGGCTGTTAAAAAATCGTGAAGCCGCCAGGGAATGCCGGCGAAAGAAGAAGGAGTATATTAAATGCCTCGAGAACAGGGTTGCTGTACTAGAAAATCAAAACAAAGCTCTAATAGAGGAGCTCAAATCTCTGAAGGAACTATATTGCCAGCAGAAAACTGAATGA

Protein sequence:

>DPOGS206083-PA
MDGMVEENGTSSAAGVADPLASGGSAANATPHVVVTSIVQLTLPSQAPSAQVQSVIQPNQQSVIQTASNIQSVQLQKGNVILVSKPSSVIHTTQGTLQTLQIKPEPNTILSTQGQSCSDDSCSDEESPKRKYREMLTRRPSYRKILNDLGGTEIAENRMGTKAVLESEVSLSPSLSFAPVIPASSLQTESGLHTLAVSGTTGGGTLVQYATNQDGQFYVPGPILEDQTRKRELRLLKNREAARECRRKKKEYIKCLENRVAVLENQNKALIEELKSLKELYCQQKTE-