Monarch geneset OGS2.0

DPOGS201136
TranscriptDPOGS201136-TA1134 bp
ProteinDPOGS201136-PA377 aa
Genomic positionDPSCF300065 - 563551-575262
RNAseq coverage1513x (Rank: top 8%)
Annotation
HeliconiusHMEL0137365e-11169.38% 
BombyxBGIBMGA003938-TA4e-10962.78% 
DrosophilaEip74EF-PD4e-1640.66% 
EBI UniRef50UniRef50_Q7YT295e-14768.07%Transcription factor BmEts n=4 Tax=Obtectomera RepID=Q7YT29_BOMMO
NCBI RefSeqNP_001036902.11e-14768.07%transcription factor Ets [Bombyx mori]
NCBI nr blastpgi|1129827532e-14668.07%transcription factor Ets [Bombyx mori]
NCBI nr blastxgi|1129827535e-14867.79%transcription factor Ets [Bombyx mori]
Group
Gene OntologyGO:00063551.2e-28regulation of transcription, DNA-dependent
GO:00435651.2e-28sequence-specific DNA binding
GO:00037001.2e-28sequence-specific DNA binding transcription factor activity
GO:00055157.4e-09protein binding
KEGG pathway 
InterPro domain[255-360] IPR0119915.8e-31Winged helix-turn-helix transcription repressor DNA-binding
[266-353] IPR0004181.2e-28Ets
[71-151] IPR0109937.4e-09Sterile alpha motif homology
Orthology groupMCL24958 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201136-TA
ATGATCGACTACGCGCGCCCCTACGAGATGAACATGTGCGACCTGTACCCCCCCACGCCGCCGACCTCCGCACGCCTGGACGACTTCCAGGTCTTCAAGAAGGGCAACGCCTCCTACATCTACGGGAACTACTACAACAACTACGATGTATATCCCGAACCCGAAATGGACACCACGGCGAACGTCTACCACAACCTGGTCAGCTCCCGCGAGTACCGGCACGAGGACTGGAAATCTAAAGTCATCGTGGACTGGACAGATGATGATACAGTATCCTGGCTCATAGAGTGTGCCGTGTCACAGGGCTTCTGTGAGTATGACGTCCCGTTCTATAATTTCAAAGTTTGCGGAGTGGAACTCTCTAATATGAGACGTGAAGAAGTTATCCAAAGAATGGCACATCCAAACGTCGATCTAAATCTGTCGAAGAAAATAGGAGAAATTGTTTACGATAAACTGCAGGCCCGTCTCAGTGAAGAGATCCACAGACAGACTTCGGTATTCAGATACGCGGAGAGTGACCCGTACACACAGCAGGAAACAGTGCAGTCAGTGTTAGATCTTGATAAACATAGGTTATATACGTCGGACTATAAGCCGAACGACAGCAGTCTGCTACCGGCATCTGATTCCAGTGATGACGCGGAGGATGTATTCCGTACCTCGGCGCCGGCATCTCCCTTCTCCTACGGAAGTGATAGTAAGTCAGGAGACGAAGAAGAGAAAAGGAAAATTCCCAAAAGACTCCCAGGGAGGCCTAAGGGAAGCGGCAAGAACAGATTCAAGAGACCCAGAAGTGTGTCCGTCCCAGAGTTTCTGAGGAACCTGTTGTTCGACGAGAGATACTGTCCATCTATTATCAAATGGGAAGACTATTCGCAGGGCAAGTTCAGGTTCGTGAAACCGGACGAGGTGGCCAAGCTGTGGGGCCAGATGAAGCAGAATGACAACATGACCTTCGAGAAGTTCAGTCGGGCGATGCGATACCACTATCGGCAGAACGTGCTGGTCTCCGTGCCGACGGCGCGGCTCGTGTACCAATTCGGACACAAGGGACCGGACTTCAAGACCCAGAACCCGAATTTCGTCAAGGTCAAGTCCGAAATGGACCTCCAAGATATGCCGTACTCGTAG

Protein sequence:

>DPOGS201136-PA
MIDYARPYEMNMCDLYPPTPPTSARLDDFQVFKKGNASYIYGNYYNNYDVYPEPEMDTTANVYHNLVSSREYRHEDWKSKVIVDWTDDDTVSWLIECAVSQGFCEYDVPFYNFKVCGVELSNMRREEVIQRMAHPNVDLNLSKKIGEIVYDKLQARLSEEIHRQTSVFRYAESDPYTQQETVQSVLDLDKHRLYTSDYKPNDSSLLPASDSSDDAEDVFRTSAPASPFSYGSDSKSGDEEEKRKIPKRLPGRPKGSGKNRFKRPRSVSVPEFLRNLLFDERYCPSIIKWEDYSQGKFRFVKPDEVAKLWGQMKQNDNMTFEKFSRAMRYHYRQNVLVSVPTARLVYQFGHKGPDFKTQNPNFVKVKSEMDLQDMPYS-