Monarch geneset OGS2.0

DPOGS204843
TranscriptDPOGS204843-TA957 bp
ProteinDPOGS204843-PA318 aa
Genomic positionDPSCF300227 - 158539-164789
RNAseq coverage12196x (Rank: top 1%)
Annotation
HeliconiusHMEL0138937e-10463.25% 
Bombyx% 
Drosophilacrc-PA4e-1631.44% 
EBI UniRef50UniRef50_Q9GPH36e-5968.27%Activating transcription factor of chaperone n=1 Tax=Bombyx mori RepID=ATFC_BOMMO
NCBI RefSeqNP_001037041.11e-5968.27%activating transcription factor of chaperone [Bombyx mori]
NCBI nr blastpgi|1129831402e-5868.27%activating transcription factor of chaperone [Bombyx mori]
NCBI nr blastxgi|1129831403e-6669.38%activating transcription factor of chaperone [Bombyx mori]
Group
Gene OntologyGO:00063551.7e-11regulation of transcription, DNA-dependent
GO:00435651.7e-11sequence-specific DNA binding
GO:00037001.7e-11sequence-specific DNA binding transcription factor activity
GO:00469839.6e-08protein dimerization activity
KEGG pathwayame:4102264e-33 
 K04374 (ATF4, CREB2)maps-> GnRH signaling pathway
    Prostate cancer
    MAPK signaling pathway
    Neurotrophin signaling pathway
    Long-term potentiation
    Protein processing in endoplasmic reticulum
InterPro domain[245-309] IPR0048271.7e-11Basic-leucine zipper (bZIP) transcription factor
[247-304] IPR0116169.6e-08bZIP transcription factor, bZIP-1
Orthology groupMCL17726 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204843-TA
ATGTGCTGTTACAAGAATAAAGGAATGTGCGATCTGTGTAGAGATTTTCAGCTATCGTTGTTTCAAAATATTCTTAAGAGCTTTCCGGGCGCTGCAACAAAGATCGAAACGGCTAGCTTCCAAGATGACCTTACCAACACCATGTACCCACCATCTCCTGTGGACATCAAGCCCAGTCAAGCGGAGCGAGCTGAAGATCTGCTGCAGCAGCTGGAAAGTCAATGCAAACAAGAAAACATATACTCTAACTGGTTCGAAGAGAAAGTTGAGAACAGCATCTTCGATAATATCAGTCAGGGACCAGAGCCGGAGTTCCGGCCGGTGGCTGTCGATTACACCGCGCAGACCGTGCCGCGCTCCACCGAGGTTCTTTTGAGGGAGTTCGAGTCGGTGTACAGTGGCGTCCAACTGACTCACCTCACCCCGCCTCAGAGCCCGCCCGGTCCGGCTACCCAACTCCTGCTAAGCTACGCCCAAGCTCAGGCTGCTCCGCCTTTACAACCACTAACTGTCGAGCAATGGCCATTGATCCCGCCCCAAAGCTCAATACCGGAGTACGACTGCGATCCTCAGGCCCTCGAGGAGTTGGTCCGCCATCGTGCCGCTCAATTGGAATCGCCGCAGCCCGCGCACAGCCCTTCACCATCACCGCAATCATCACCGTCCTCATCGCCGCGGTCATCTTCCACTGATGAGGATTGGACATCATCCCGCCCCAAGCCGTACTCCCGGAACGGTGATGATCGCAGGTCTCGTAAGAAGGAGCAGAACAAGAATGCGGCTACCCGTTACCGCCAGAAGAAGAAAGCCGAGATCGAGGTGCTCCTCAACGAGGAACAGGAGCTGCGCAAGCGACACGGTGAGCTCGGGGACAAGTGTTCCGACCTCCAACGCGAGATCCGCTACATCAAGGGCATCCTGCGCGACCTCTTCAAGGCAAAAGGCCTCATCAAATAG

Protein sequence:

>DPOGS204843-PA
MCCYKNKGMCDLCRDFQLSLFQNILKSFPGAATKIETASFQDDLTNTMYPPSPVDIKPSQAERAEDLLQQLESQCKQENIYSNWFEEKVENSIFDNISQGPEPEFRPVAVDYTAQTVPRSTEVLLREFESVYSGVQLTHLTPPQSPPGPATQLLLSYAQAQAAPPLQPLTVEQWPLIPPQSSIPEYDCDPQALEELVRHRAAQLESPQPAHSPSPSPQSSPSSSPRSSSTDEDWTSSRPKPYSRNGDDRRSRKKEQNKNAATRYRQKKKAEIEVLLNEEQELRKRHGELGDKCSDLQREIRYIKGILRDLFKAKGLIK-