Monarch geneset OGS2.0

DPOGS204289
Transcript        DPOGS204289-TA   1005 bp
Protein           DPOGS204289-PA   334 aa
Genomic position  DPSCF300046 + 245488-246643
RNAseq coverage   175x (Rank: top 50%)
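
The genomic span and the transcript length listed above can be compared with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state; the names `span_length`, `genomic_span`, and `extra_bases` are illustrative only. Under that assumption the locus covers more bases than the transcript contains, which would be consistent with one or more introns, though the record itself does not list the gene structure.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


# Values copied from the record: scaffold DPSCF300046, 245488-246643.
genomic_span = span_length(245488, 246643)

# Transcript length from the record (DPOGS204289-TA, 1005 bp).
transcript_length = 1005

# Bases inside the genomic span that are not part of the transcript.
extra_bases = genomic_span - transcript_length

print(genomic_span, extra_bases)  # 1156 151
```
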
Annotation
Heliconius      HMEL015196              7e-178  88.02%
Bombyx          BGIBMGA007534-TA        1e-165  83.28%
Drosophila      E2f-PA                  7e-08   34.44%
EBI UniRef50    UniRef50_UPI0001A595B7  1e-45   40.06%  UPI0001A595B7 related cluster n=1 Tax=unknown RepID=UPI0001A595B7
NCBI RefSeq     NP_001155055.1          2e-46   40.06%  E2F family member 8 [Nasonia vitripennis]
NCBI nr blastp  gi|239049311            4e-45   40.06%  E2F family member 8 [Nasonia vitripennis]
NCBI nr blastx  gi|239049311            6e-43   36.45%  E2F family member 8 [Nasonia vitripennis]
Group
Gene Ontology    GO:0006355  1.9e-13  regulation of transcription, DNA-dependent
                 GO:0005667  1.9e-13  transcription factor complex
                 GO:0003700  1.9e-13  sequence-specific DNA binding transcription factor activity
KEGG pathway
InterPro domain  [52-246]  IPR015633  2.1e-28  E2F Family
                 [50-119]  IPR011991  9.4e-17  Winged helix-turn-helix transcription repressor DNA-binding
                 [52-119]  IPR003316  1.9e-13  Transcription factor E2F/dimerisation partner (TDP)
Orthology group  MCL19547 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204289-TA
ATGTATGAAGATGATAACATTACCTCTACTCCTAAGCGTTACGCACTAACAGAAGTTACGAATTCAACCAGTTACGTCTCACCGACGGCCAATCTAAAGTTGTTGACGAATGTTGCGTTGCAGTATCCCACGCCGCCGCCAAGCACGGTGCATCGAAAAGAAAAGTCTCTTCAAATATTATGTGACAAGTTCCTCAATTTGTATCCATTACACGGCAATGGTACGGTAGAAATACAACTGGACAGTACCGCAGCCCGATTAGGGGTTGAAAAGAGAAGGATGTATGATATAATTAATATATTGGAAGCCATGCAATGTGCTGTACACAAAAGAAAAAACACTTATTTGTGGCATGGCGGTGCAAGGCTGAATTCTTTTCTTAAAATGTTGAAAAGGCAAGGAGAAAATTTAAAGTTATCAGAAGCTCTTAGAGGAAGAGCACCAAAACCACCTGCACCAAAGCATAAAACTTTGGGTGTTTTAGCCCAAAGATTTTTGATGCTTTTTTTAGTGGAACCTCCTAATACACTAATAAATTTAGAAATGGCTGTGAGTGTTCTAATAGACACAACAAACAAAAATAAATCGGTGCTATCTCCTGAGCAGCAGGACCGGCAACACAAGTCCAAAGTCAGGAGGCTTTATGATATTGCTAATGTCTTTATTTCAATTGGTCTCATAGAAAAGGTTTCAGGTAATTTAATATTAAAGAAGCCAGTATTTAAGTATGTTGGTCCTTTTAAGATGAAGGAAACTGAGAAGGTAGTTTGCACTCCATCACCCGTCACACCCCTATCAATATTAGATACACACCAACAGATGACACCATGTCATGTATTTGCTGGTAAATCTAAGCGCAAACTAGAGTTTGCAACTCCTACTACGAGTACAGAGAAGCTCGGACTGACGACACCTCCACACACACCGTCCCACAAGTGGGATGAAATCCTTCTTGTAGCTGACATGGAACTTAATAGAATTAACAGTGGAGTTGTGATTATATAG

Protein sequence:

>DPOGS204289-PA
MYEDDNITSTPKRYALTEVTNSTSYVSPTANLKLLTNVALQYPTPPPSTVHRKEKSLQILCDKFLNLYPLHGNGTVEIQLDSTAARLGVEKRRMYDIINILEAMQCAVHKRKNTYLWHGGARLNSFLKMLKRQGENLKLSEALRGRAPKPPAPKHKTLGVLAQRFLMLFLVEPPNTLINLEMAVSVLIDTTNKNKSVLSPEQQDRQHKSKVRRLYDIANVFISIGLIEKVSGNLILKKPVFKYVGPFKMKETEKVVCTPSPVTPLSILDTHQQMTPCHVFAGKSKRKLEFATPTTSTEKLGLTTPPHTPSHKWDEILLVADMELNRINSGVVII-