Monarch geneset OGS2.0

DPOGS211900
Transcript | DPOGS211900-TA | 1392 bp
Protein | DPOGS211900-PA | 463 aa
Genomic position | DPSCF300011 - 172874-174635
RNAseq coverage | 1048x (Rank: top 12%)
Annotation
Heliconius | HMEL016694 | 0.0 | 89.17%
Bombyx | BGIBMGA001167-TA | 0.0 | 86.55%
Drosophila | aop-PA | 5e-58 | 86.78%
EBI UniRef50 | UniRef50_E0VDJ0 | 4e-109 | 51.80% | Transcription factor ETV6, putative n=1 Tax=Pediculus humanus corporis RepID=E0VDJ0_PEDHC
NCBI RefSeq | XP_975017.2 | 2e-135 | 61.30% | PREDICTED: similar to ets [Tribolium castaneum]
NCBI nr blastp | gi|270005727 | 8e-135 | 61.30% | hypothetical protein TcasGA2_TC007831 [Tribolium castaneum]
NCBI nr blastx | gi|270005727 | 2e-137 | 61.30% | hypothetical protein TcasGA2_TC007831 [Tribolium castaneum]
Group
Gene Ontology
GO:0006355 | 8.1e-50 | regulation of transcription, DNA-dependent
GO:0043565 | 8.1e-50 | sequence-specific DNA binding
GO:0003700 | 8.1e-50 | sequence-specific DNA binding transcription factor activity
GO:0005634 | 1.9e-27 | nucleus
GO:0005515 | 1.4e-23 | protein binding
KEGG pathway
tca:663895 | 5e-135
K03211 (ETV6_7, yan) maps to: MAPK signaling pathway - fly; Dorso-ventral axis formation
InterPro domain
[275-363] | IPR000418 | 8.1e-50 | Ets
[250-374] | IPR011991 | 1e-40 | Winged helix-turn-helix transcription repressor DNA-binding
[41-123] | IPR003118 | 1.9e-27 | Sterile alpha motif/pointed
[17-125] | IPR010993 | 1.4e-23 | Sterile alpha motif homology
[44-122] | IPR013761 | 1.7e-23 | Sterile alpha motif-type
Orthology group | MCL14848 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211900-TA
ATGAAAGTAGTTAGTCTTCAATTGCCGTCCGGGCCGAGTATGGAGCGGCTTCCGCTACCGTTCAGTCCGACCGAACTATTGTGGCGCTATCCCTTGCCGTGGGCGCCGCCGCCGCCCTCACCCCTCGGCGATACGAAGGCCCAGCTGCCGGCGGGTCTTCCTCCAGAACCTAGACTCTGGACTCGTGAAGATGTCTCCGTGTTCTTGAAATGGTGCGAAAGAGAATTCGATCTGCCTAACTTCGACATGGATCTCTTCCAAATGAATGGTAAAGCTTTGTGTCTTCTGACCAAAACCGACTTGGGTGAGAGATGTCCGGGGGCTGGAGACGTGCTTCACAACGTTCTCCAGATGTTGGTCCGAGACGCCGCTCTTCTTGGGCGAGTGCCTTCTTCTCCAGTGACGCCCACAGCTCGGGGCGCGCCGTACCCACCGTCGCCGCACTCTCACCCGCCCACTCCGACGTGGACGGTGGACGGATTCCATCACTTCCACACCGCGGCGGCCGCTGCCCAACCGAACTCCGTAACGTTAAGCCCAGCCCCGTCAGTGGACAGTTCAGGGAGTCCGCAGAGAGGGGAAACCCTGAGCTACGCCCCCGCCTACGCGCAACCTGTCCCCGTCAGTACTCAGGCCGTGAGTTCCGGCAGTAATCAGTCGGACTCCGATGAGGAAGCTCAGTTTGCGCCACAGTCGAGATCACCCAAGGACGCTCCTCTAAACAGTGTGACCCCGCAGACACACGCAGCTCCACAACACTCTCATTACCGCGCACAACATAGAGAATTCTTCCCAAATGATATGCCCGAATCAAATACAAATGGAAGACTCTTGTGGGATTTCCTTCAGCAACTTCTAAATGACCCAACTCAACGATACACCAACTACATCGCCTGGAAGAACAGAGAAACTGGTGTATTCAAAATCGTGGACCCAGCTGGGCTGGCCAAGCTATGGGGGATACAGAAGAACCACTTATCGATGAACTACGATAAGATGTCGCGCGCCCTCCGGTACTACTACCGCGTCAATATTCTGCGAAAAGTCCAGGGCGAGCGACACTGCTACCAATTCTTGAGGAACCCAACCGAGCTGAAGAACATCAAGAACATCTCGCTGTTACGGCAGCAAATGAGTCCGACGCGGGTCGCCCAGCAGCCGGCCGTGAAGGCGGAGCTCAAGGACGAGAGGTGCGAAGAGGAGAACGACGACGACATGCCCACCGACCTCAGCATGAACGGCGCCGAGCCCTGGCGCAAGCGGCCGCGCACCGAGCCCGCGCGGGACAAGCACCGCATCAGCGCGCTCATCGGAGACGCCATCATGAAGCGGGAACCCGACTACCCCGAGCACTACGCGCTCAACTTGAAAAGTGAAAAATGCGAACAATGA

Protein sequence:

>DPOGS211900-PA
MKVVSLQLPSGPSMERLPLPFSPTELLWRYPLPWAPPPPSPLGDTKAQLPAGLPPEPRLWTREDVSVFLKWCEREFDLPNFDMDLFQMNGKALCLLTKTDLGERCPGAGDVLHNVLQMLVRDAALLGRVPSSPVTPTARGAPYPPSPHSHPPTPTWTVDGFHHFHTAAAAAQPNSVTLSPAPSVDSSGSPQRGETLSYAPAYAQPVPVSTQAVSSGSNQSDSDEEAQFAPQSRSPKDAPLNSVTPQTHAAPQHSHYRAQHREFFPNDMPESNTNGRLLWDFLQQLLNDPTQRYTNYIAWKNRETGVFKIVDPAGLAKLWGIQKNHLSMNYDKMSRALRYYYRVNILRKVQGERHCYQFLRNPTELKNIKNISLLRQQMSPTRVAQQPAVKAELKDERCEEENDDDMPTDLSMNGAEPWRKRPRTEPARDKHRISALIGDAIMKREPDYPEHYALNLKSEKCEQ-