Monarch geneset OGS2.0

DPOGS208741
Transcript        DPOGS208741-TA   1449 bp
Protein           DPOGS208741-PA   482 aa
Genomic position  DPSCF300043 + 335638-345109
RNAseq coverage   880x (Rank: top 14%)
Annotation
Heliconius      HMEL015237        1e-103  94.42%
Bombyx          BGIBMGA003403-TA  0.0     80.27%
Drosophila      E2f-PA            6e-41   47.96%
EBI UniRef50    UniRef50_B6ZL89   0.0     80.27%  E2F1 n=3 Tax=Obtectomera RepID=B6ZL89_BOMMO
NCBI RefSeq     XP_001607080.1    5e-69   40.31%  PREDICTED: similar to transcriptional activator [Nasonia vitripennis]
NCBI nr blastp  gi|350538465      0.0     80.27%  E2F transcription factor 1 [Bombyx mori]
NCBI nr blastx  gi|350538465      0.0     80.85%  E2F transcription factor 1 [Bombyx mori]
Group
Gene Ontology   GO:0006355  3.5e-21  regulation of transcription, DNA-dependent
                GO:0005667  3.5e-21  transcription factor complex
                GO:0003700  3.5e-21  sequence-specific DNA binding transcription factor activity
KEGG pathway    mdo:100017977  5e-47
 K06620 (E2F1_3) maps-> Prostate cancer
    Glioma
    Melanoma
    Small cell lung cancer
    Pathways in cancer
    Pancreatic cancer
    Bladder cancer
    Non-small cell lung cancer
    Cell cycle
    Chronic myeloid leukemia
InterPro domain  [50-479]   IPR015633  3.9e-78  E2F Family
                 [120-187]  IPR011991  1.8e-26  Winged helix-turn-helix transcription repressor DNA-binding
                 [123-187]  IPR003316  3.5e-21  Transcription factor E2F/dimerisation partner (TDP)
Orthology group  MCL15618  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208741-TA
ATGCCGAGGGGGGTGAAGCGAGGAGCGGCGGAAGGCGAAGCTGAGGTGGTGGTACGCGTCGGGGCTTCTCCGTCGCACACCACGCTCCTGGACGACAGCCCTAGCCAGCCCATAAGCTACCACCTGCTCGACCATGGCTATGGTGCCACACCTCAACACCAAATACGCCGGGAAGCGCCCACCGCACCGCCGAAGACATCTGAAGCGGTGAAACGTAGACTGAACCTGAGCGAGAGCAGCTCAGGCAGCCAGGGTCACGTGGTGCCGATGAAGGCGGACTTCAAGACGCCCAAGCAGAAACGCGTCAAAGTCCTAACCCCGTACAGCCGACCGTCCAGTTCTATGAAAAAATACACAGAACGCTCCAGGTTTGACACGTCATTGGGTCTACTGACGAAAAAGTTTGTAGCTCTCCTCAAGTCGTCGCCGAACGGTGTTTTAGATCTTAATATAGCAGCTGAGCATCTCTCTGTACAGAAACGCCGAATATACGACATCACAAACGTATTAGAGGGTATAGGAATATTAGAGAAGAGATCCAAAAATAATATACAATGGAAATGTGGTGTAGGAGGTGGAGGAGTGAACGAAGAGAACCGTGTGCGTCGTCTGCGGCGCGAGGTGCGGTCGCTGGGCGGGCGGGAGGCGCGGGTCAGTCGAGCGGTGGCCGCGGCCGAGCAGGCGCTGTCTCGACTGTCGGCGGAGCACGGGGCGAGGGCCTACATAACGTACGCGGACCTCAGGTCCATTAAGGACTTTAGAAATCAAACTGTTATACCCATCAAGGCCCCGCCGGACACCAGGCTCAGTGTACCACATCCAGATGAGAAAGGGTATATGATACATCTCAAATCAATTTCTGGAGAAATAGAAGTGTACCTCTGTCCTAAAGAACGTCCGCCCACGCCGCCGCCCTCATCTGGTGTGTTGCCATCGGATCCCTTGTTGGAGGATAACAAAGCTCTCCTGGCTCCGCTCATCGCCCAGCTTCAAACACTACCCTCCAGTTCCATCTCAGCCGCCTTCACAACACCAATAAAGCGTGAGCCGGATGAAGGAGCGTGGTCCCGTAGCCTCGTGGTTCGTACTCCGTGCGTCACGGATCCCACCCTGCCGCTGACGCCGGCGTTATCGACCCCCACAGCCCCCACCACACCAGTCGGACCAGCTGCGCCCACCACGCCCACCACGCCGGCACACGCCACTATGACCACACCTGATACGGGAGGCGCCCGTGGTCGTCTTCGGAACGCGTTGATAGCGGACAGCGACGACTTCGCGCCCATCATGGGCGGTGGGCGGTTCCAGCTGCAAACTGAAGACCAGGAGTCAGAGCAAATGGAGTTGGAGCCGTTCCTGCCTCTCGAGCCGCCGATGTCCGCCAACGACTACGGCTTCTGTCTCGACCACGACGAGGGGCTCTCGGAACTGTTTGACTTTGAATTTTAG

Protein sequence:

>DPOGS208741-PA
MPRGVKRGAAEGEAEVVVRVGASPSHTTLLDDSPSQPISYHLLDHGYGATPQHQIRREAPTAPPKTSEAVKRRLNLSESSSGSQGHVVPMKADFKTPKQKRVKVLTPYSRPSSSMKKYTERSRFDTSLGLLTKKFVALLKSSPNGVLDLNIAAEHLSVQKRRIYDITNVLEGIGILEKRSKNNIQWKCGVGGGGVNEENRVRRLRREVRSLGGREARVSRAVAAAEQALSRLSAEHGARAYITYADLRSIKDFRNQTVIPIKAPPDTRLSVPHPDEKGYMIHLKSISGEIEVYLCPKERPPTPPPSSGVLPSDPLLEDNKALLAPLIAQLQTLPSSSISAAFTTPIKREPDEGAWSRSLVVRTPCVTDPTLPLTPALSTPTAPTTPVGPAAPTTPTTPAHATMTTPDTGGARGRLRNALIADSDDFAPIMGGGRFQLQTEDQESEQMELEPFLPLEPPMSANDYGFCLDHDEGLSELFDFEF-