Monarch geneset OGS2.0

DPOGS206613
TranscriptDPOGS206613-TA1179 bp
ProteinDPOGS206613-PA392 aa
Genomic positionDPSCF300048 - 1134818-1138987
RNAseq coverage1383x (Rank: top 9%)
Annotation
HeliconiusHMEL0065704e-16481.66% 
BombyxBGIBMGA008330-TA3e-4660.96% 
DrosophilaAtf3-PA2e-2637.60% 
EBI UniRef50UniRef50_D2A0E21e-4052.09%Putative uncharacterized protein GLEAN_08201 n=1 Tax=Tribolium castaneum RepID=D2A0E2_TRICA
NCBI RefSeqXP_975619.13e-4152.09%PREDICTED: similar to AGAP001536-PA [Tribolium castaneum]
NCBI nr blastpgi|3320259332e-4048.88%Cyclic AMP-dependent transcription factor ATF-3 [Acromyrmex echinatior]
NCBI nr blastxgi|910812133e-5338.42%PREDICTED: similar to AGAP001536-PA [Tribolium castaneum]
Group
Gene OntologyGO:00063555.6e-14regulation of transcription, DNA-dependent
GO:00435655.6e-14sequence-specific DNA binding
GO:00037005.6e-14sequence-specific DNA binding transcription factor activity
GO:00056346.3e-12nucleus
GO:00036776.3e-12DNA binding
GO:00469837.2e-10protein dimerization activity
KEGG pathway 
InterPro domain[171-235] IPR0048275.6e-14Basic-leucine zipper (bZIP) transcription factor
[166-182] IPR0008376.3e-12Fos transforming protein
[172-229] IPR0116167.2e-10bZIP transcription factor, bZIP-1
Orthology groupMCL15725 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206613-TA
ATGAACTCGACGAGCATTAGCTCAGCAGTGCCGACAATTAAATGTGAGGATACGTCTCCGTCGCCGACAGCCGCCGTTGGTACTGACGGAGAATCAGGGGCATATCTTAGTGTTAACGTCAACCTGAGCACTGCCATGATGAACCTCCTCGCTGCGGAAGGCGCCAACACCACACTGCGCACCCCAGAAATCGTCAACGACCTGATCACCATGTCAAACCCCATGGACCAATATAACTACGACAAAAATTCTAGCTTCAAGAATAGCAATGACTCGAACTCTTCAATGTCAAACAGCTCGTCGGCTACTTCACCAGCCTCCGGGACCCCGCCCAGCATACAAAAGACTTGCTCGGAACTGATTAAGGCCGGTTTGAAGTTGTCCATAGAGTCGAAAAGGAAAATGTCGGGAAGCGACACGGATGTCGGCATCAAGAGAATGAAGAAGGAGGAGAGCGATGATGATTACGACAGCAGTCATACCCAGGTGTCTAGAAACGAGCTGACACCAGAGGACGAGGAGAGGAGGCGTCGGAGAAGAGAAAGAAACAAAATAGCAGCTACCAAGTGCAGGATGAAGAAGAGGGAACGAACAGTGAACCTCGTTAATGAAAGTGAAGTGCTCGAAAACCAGAATATTGACCTCAAGGCGCAACTTAAGGATCTCGAAGTTCAGAAGCGACAGTTGCTAGACATGTTATCGCAACACGCGTCCTCCTGCGTACGGAACAACACTAACGCGAGAACGTCGCCAAACTTCAACATGATGAGGACGTTCGAATCACCGACAACATTCCCCGTCAACTACGACGCCCACTCGCCCTACATACGACCAGAATCAGCCAACATACTAGCCTCGAGCTACACGTGTGCCACCCCACTCAACGAGACCATAGACACTATGTCTTTAGACGCGGCCTACATGACGCCGCAGAACATAGACGTCGAATACAACAGACCCGACAGCGTCATCAGTCTGCCCCCTAATTCCGACAGTTACATCACCACGGATGGATACCTGCCAAAAGCCACAGCGATCCTAGGTCCGATCGAACCGGAAACGGAATACTATGACAATGAGATCAATTACGTCACCCAACAATGTCACAGTTACCCCAACAACATACAGGACTCACAACAGAAGTTAAACAACAGTCTGAACGACGGCTGTCTGGTCTAA

Protein sequence:

>DPOGS206613-PA
MNSTSISSAVPTIKCEDTSPSPTAAVGTDGESGAYLSVNVNLSTAMMNLLAAEGANTTLRTPEIVNDLITMSNPMDQYNYDKNSSFKNSNDSNSSMSNSSSATSPASGTPPSIQKTCSELIKAGLKLSIESKRKMSGSDTDVGIKRMKKEESDDDYDSSHTQVSRNELTPEDEERRRRRRERNKIAATKCRMKKRERTVNLVNESEVLENQNIDLKAQLKDLEVQKRQLLDMLSQHASSCVRNNTNARTSPNFNMMRTFESPTTFPVNYDAHSPYIRPESANILASSYTCATPLNETIDTMSLDAAYMTPQNIDVEYNRPDSVISLPPNSDSYITTDGYLPKATAILGPIEPETEYYDNEINYVTQQCHSYPNNIQDSQQKLNNSLNDGCLV-