Monarch geneset OGS2.0

DPOGS203627
TranscriptDPOGS203627-TA1341 bp
ProteinDPOGS203627-PA446 aa
Genomic positionDPSCF300063 + 1020240-1023195
RNAseq coverage248x (Rank: top 42%)
Annotation
HeliconiusHMEL0158766e-14159.75% 
BombyxBGIBMGA001375-TA5e-11058.00% 
DrosophilaAtf-2-PA1e-1547.31% 
EBI UniRef50UniRef50_E9JEG91e-10758.00%Atf2 n=1 Tax=Bombyx mori RepID=E9JEG9_BOMMO
NCBI RefSeqXP_393896.38e-3552.26%PREDICTED: similar to activating transcription factor 2 isoform 1 [Apis mellifera]
NCBI nr blastpgi|3796988754e-10758.00%atf2 [Bombyx mori]
NCBI nr blastxgi|3796988751e-10457.93%atf2 [Bombyx mori]
Group
Gene OntologyGO:00036768.9e-06nucleic acid binding
KEGG pathwayrno:816474e-22 
 K04450 (ATF2, CREBP1)maps-> MAPK signaling pathway
InterPro domain[5-31] IPR0130878.9e-06Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL26546 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203627-TA
ATGGAGCCAGAAAAACCATTTGCCTGTACCCTACCGGACTGTGGCATGACTTTTACAAACGAGGATCACCTCCATGTACACACCAAGAAGCACGACATGGTGCTGCAGCTGGGCATGGAGCAAAAGGTGGCATTTGTTGCTGATCAAACCCCGACACCCACGAGATTCATCAGGAACTGTGAAGAGGTGGGCCTGTTCCAGGATCTACAGAATGTCAATCCATTCGATGAGGGCTTCAAGAGAGCTATGGAGACTAAACATTCCTTGTCCTTGGAGACGAGTGCCATGGACGATGTTCTTCACACGCCACACCTGGTGCTGCCGCTGGAGGCCGGGGACTGCGCCCTCTACACCAACAACCAGCGAAACATCACTATCAGTCGATCCTCGAGTGACGAGTCCGGAGCTGTCAAAGAATACGAAACAACAACAATATCAAAACTAACGAATGAAGTGACGACTATAAGTAGAGTCGTAGGAAAAGACGTTCTGGACAGAGTCACGACCACCGATGACGTCATAGTGAGACACGAGGAGACCGCGAGGAAAGACGGCTTCAACGAAACATCCGTCTCTTACACGAACAACGTGATAAAAATACACAGCAACGTGGAGATAAGAAAGGACGGGCACATAGACAAGACGCACGAAGCGAAGGACGGGAAGATGGAGAAAATACCGCCCATCATGAGCCAGAAGTCCTTAGACTTTGTTGTGGACAGTTTGAATACAGAAGAGAAGAACATGAAACGGGCCAAGAAGGACAAAGACACAGAGGATTACGAAGTCATCATAAAACTACCCAACGGGAAACATGTTAGAATGAAAACGGTAGAGGAATGCGAACAGAACGCCAAGGAGAAACTGAACAGTGTCATAAGGAACAGAGCCAAGACACCGAACGTCGTGCCTCTCACGGCTGGGACGCTGATACCGGTCACCATAGTGAGTCCGCCCCTCGCACACAACCATATACCGAAAATACCGATCGTACCCATCACTAACGTCAAGACCAATTACAAGCGAGTCAAGAGAAAAGTCGGCGACAAAGACGAGGCGAGGAGCGACCGGGGACCGGACGACAAAATAAAACAAGGACTCGAGAGTCGGAGCGCCGCCTCCAAGAGATACAGAGAGAGATTAAAACAGAGTATGCATCAGCAGAGCGTAGAGATGCAGCAGCTGAGGGAGAGCAACTCCCGCCTGGCAGCAGAACGAGCTGTGCTGCAACACGCCCTGCTGCAACACCTCAGGACCTGCCCCGTGGGACAAGACTTGAGGAACGTGCAGGAGAAGCTACAGAACGCCAGCCAGTTTGTAAAGGATGTCAATAACGGATAG

Protein sequence:

>DPOGS203627-PA
MEPEKPFACTLPDCGMTFTNEDHLHVHTKKHDMVLQLGMEQKVAFVADQTPTPTRFIRNCEEVGLFQDLQNVNPFDEGFKRAMETKHSLSLETSAMDDVLHTPHLVLPLEAGDCALYTNNQRNITISRSSSDESGAVKEYETTTISKLTNEVTTISRVVGKDVLDRVTTTDDVIVRHEETARKDGFNETSVSYTNNVIKIHSNVEIRKDGHIDKTHEAKDGKMEKIPPIMSQKSLDFVVDSLNTEEKNMKRAKKDKDTEDYEVIIKLPNGKHVRMKTVEECEQNAKEKLNSVIRNRAKTPNVVPLTAGTLIPVTIVSPPLAHNHIPKIPIVPITNVKTNYKRVKRKVGDKDEARSDRGPDDKIKQGLESRSAASKRYRERLKQSMHQQSVEMQQLRESNSRLAAERAVLQHALLQHLRTCPVGQDLRNVQEKLQNASQFVKDVNNG-