Monarch geneset OGS2.0

DPOGS202108
TranscriptDPOGS202108-TA759 bp
ProteinDPOGS202108-PA252 aa
Genomic positionDPSCF300150 - 322824-324149
RNAseq coverage263x (Rank: top 41%)
Annotation
HeliconiusHMEL0023878e-10172.33% 
BombyxBGIBMGA006899-TA6e-7959.13% 
DrosophilaE2f-PA1e-2341.45% 
EBI UniRef50UniRef50_Q2F6B11e-7659.13%E2F transcription factor 4-like protein n=2 Tax=Obtectomera RepID=Q2F6B1_BOMMO
NCBI RefSeqNP_001040298.13e-7759.13%E2F transcription factor 4-like protein [Bombyx mori]
NCBI nr blastpgi|1140514515e-7659.13%E2F transcription factor 4-like protein [Bombyx mori]
NCBI nr blastxgi|1140514513e-7659.45%E2F transcription factor 4-like protein [Bombyx mori]
Group
Gene OntologyGO:00063555.6e-22regulation of transcription, DNA-dependent
GO:00056675.6e-22transcription factor complex
GO:00037005.6e-22sequence-specific DNA binding transcription factor activity
KEGG pathwaycin:7785857e-55 
 K04682 (E2F4_5)maps-> TGF-beta signaling pathway
    Cell cycle
InterPro domain[9-185] IPR0156334.4e-72E2F Family
[7-76] IPR0119913.5e-27Winged helix-turn-helix transcription repressor DNA-binding
[9-75] IPR0033165.6e-22Transcription factor E2F/dimerisation partner (TDP)
Orthology groupMCL16480 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202108-TA
ATGGCAGAGCTGTACTCTTATAAAAGGTATGAAAAATCACTTGGATTGCTTACAACGAGGTTCGTGTCTTTACTTAAAAAAGCAAAGGATGGAGTTTTAGATTTAAAAATCGCAACTGATTTATTGGCAGTAAGACAAAAACGGCGAATTTATGATATAACTAACGTTCTAGAGGGAATTGGTTTGATAGAAAAGCGAAGTAAGAATAGTATACAATGGAAGGGTGCCAGTCCAGATGGAAATACATCAGAAATTGGTAAAAAAGTGACACTTCTAAGGAAACAAATAGGTCTTTTGGAGGAACATGAAGAGTTATTAGACAAGCAAATGCACTGGATTGAGCAAAGTATAAAGAATGTCATAGATGATGCTGATAATGATGCTTTGTCTTATGTGACTCAAAATGATGTAAAAAACTGTTTCCATGATAGTCAAGTGCTTGTATTGGAAGCACCACTTGGGGCCAATTTATCAGTTGGGCAATTAGATGAGGGTGCAGGCGAAGATCAGTATTTTCTACATTTAAAATCAAATGAACCTGTTGGTGTTATACTTTTGTGTGATGTTGAAAAGGATAAGATTGTAGATGACGATACTATGGATGAGGAAGTAGAATGGTATGGGGATAGTGGGACTACAAATTCACAAGAATACCTTTTAAGGCTAAGCCCGCCAGTAACAAAACAGGATTTCTCATTTTCTCTATATGGTACTGAAGGACTATGTGACTTATTTGATATTCCATATCCTAATCAATAG

Protein sequence:

>DPOGS202108-PA
MAELYSYKRYEKSLGLLTTRFVSLLKKAKDGVLDLKIATDLLAVRQKRRIYDITNVLEGIGLIEKRSKNSIQWKGASPDGNTSEIGKKVTLLRKQIGLLEEHEELLDKQMHWIEQSIKNVIDDADNDALSYVTQNDVKNCFHDSQVLVLEAPLGANLSVGQLDEGAGEDQYFLHLKSNEPVGVILLCDVEKDKIVDDDTMDEEVEWYGDSGTTNSQEYLLRLSPPVTKQDFSFSLYGTEGLCDLFDIPYPNQ-