Monarch geneset OGS2.0

DPOGS213352
TranscriptDPOGS213352-TA1449 bp
ProteinDPOGS213352-PA482 aa
Genomic positionDPSCF300109 - 215162-223826
RNAseq coverage21x (Rank: top 79%)
Annotation
HeliconiusHMEL0163802e-9379.91% 
BombyxBGIBMGA009158-TA6e-10769.49% 
Drosophilanvy-PA1e-6739.53% 
EBI UniRef50UniRef50_Q16H021e-7539.58%Nervy n=4 Tax=Culicidae RepID=Q16H02_AEDAE
NCBI RefSeqXP_975677.11e-9145.19%PREDICTED: similar to nervy [Tribolium castaneum]
NCBI nr blastpgi|910874252e-9045.19%PREDICTED: similar to nervy [Tribolium castaneum]
NCBI nr blastxgi|910874252e-8843.85%PREDICTED: similar to nervy [Tribolium castaneum]
Group
Gene OntologyGO:00063552e-18regulation of transcription, DNA-dependent
GO:00037002e-18sequence-specific DNA binding transcription factor activity
GO:00082707.6e-08zinc ion binding
KEGG pathwaybfo:BRAFLDRAFT_2011734e-65 
 K10053 (RUNX1T1, CBFA2T1)maps-> Pathways in cancer
    Acute myeloid leukemia
InterPro domain[108-136] IPR0132896.3e-30Eight-Twenty-One
[72-163] IPR0038942e-18TAFH/NHR1
[268-299] IPR0148966.8e-11NHR2-like
[386-422] IPR0028937.6e-08Zinc finger, MYND-type
Orthology groupMCL11777 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213352-TA
ATGAAAGGCCTCAGGCGAGGTGTCGCCGCCACGCAGCAGTCGATAACGACTAGGGATGGGGGATCTGTGACGGGGAAGGAGAGGAAGTCCCCGGAGCAGGAGGAGACCAGGTCCTCGCCAACACCTGGACCACACAGCCCACAGAACAACGGAACACCAGCGCTGTCGTCGTCCGAGTCCTCGCCCCCGCCCGACGCCCGCGCCGCTCCAGCGCTGCAGCGCTTGCGTCGCTTCCTTTCAGCGCTGCACACGTTCGCCGCGGATGTCAGCGCTGATGTCGGCGAGAGGGTCCGACAGTTGATATTCAACCTCGTGGCGGGAGTCATCAGTATCGAGGAGTTCCAGAGCGGGGTGCAGGAAGCCACCAACTATCCTCTGAGAGCGTCCGTGCCGGCCTTCCTCAGGGCCTTGTTACCCCTGGCCCTTAGAGACCTGCATGCTAGAGCTCGGAGAACGAAACAGACACCGCTGCAGTACATCAGGTCGCACGAGCACACCGTGCTGGAGGCGGGCGGAGACGCCGGGGACATCTTCGCCCATGAGGCGGGCAAGAGACGTGCCAGCGACCCTTTCTACGAGTGTCACAGTAACGGAACTCACGAGGACTACCCTCCAGCCAAGAGGCCTGTGTTCAATCCGTCCCCGTTCTACCCTCTGCCGTCTAACGCCAGCCTCTTCGACTACCAGCCTTACCACTGCCCCACGCAGGAAGCTGGCTTCGAGAGGAGAGACGGTGGTATCACCGTCCGCGACGTGTCCACCATGAACGCGACCCTGGAGCCGCGGCCCGGCCTGGCCAAGACCGACGACGAGTGGAAGAACATCAACACCATGCTGAACTGCATCCTCAGTATGGTGGAGAAGACGAAACGAGCCCTCGCTATACTGCAGCAGAGAGGAGTGGAACCTCCCGAGAGTTCTGATATAAAGCGCGCGGCGAGTGAGATCATGTGCGCCGCGGTGAGGCAGACGGAGGAGAGGGTCGCCGAAGTCAGGAGACGGGCCGAGGATGCCGTCAACCAGGTGAAGCGCCAGGCGCTGGTGGAGCTCCAGCGCGCGGTGGGCGCTGCCGAGAGCAAGGCCCTGGAGCTGGAGCGGGCACGACATTCGCCGCCCCCCGGCCGAGACCTCAGCCCCGGGGCCGCGCACAGTTGTTGTTGGAACTGTGGTCGCGCGGCGCAGGAGACGTGCTCGGGCTGCGGCGCCGCCAGGTACTGCGGGGCCTTCTGCCAACACAGGGACTGGGAGAACCACCACCAGGTCTGCAGCGGTCGGGACACGAAGCCCTCGTCGATGGTCCGCACCTCCCCTCCCTCCACACAACCCATCCTCCCCAAACCCCTGACTCGCTCCTCGACCCCCATCGTCACACCTATAGTGACCCCCATAGCGGCACCCGCAGCGACCCCCAACCCCGCAGCCGACAGACCCGCGGCGGCGAAGAAATGA

Protein sequence:

>DPOGS213352-PA
MKGLRRGVAATQQSITTRDGGSVTGKERKSPEQEETRSSPTPGPHSPQNNGTPALSSSESSPPPDARAAPALQRLRRFLSALHTFAADVSADVGERVRQLIFNLVAGVISIEEFQSGVQEATNYPLRASVPAFLRALLPLALRDLHARARRTKQTPLQYIRSHEHTVLEAGGDAGDIFAHEAGKRRASDPFYECHSNGTHEDYPPAKRPVFNPSPFYPLPSNASLFDYQPYHCPTQEAGFERRDGGITVRDVSTMNATLEPRPGLAKTDDEWKNINTMLNCILSMVEKTKRALAILQQRGVEPPESSDIKRAASEIMCAAVRQTEERVAEVRRRAEDAVNQVKRQALVELQRAVGAAESKALELERARHSPPPGRDLSPGAAHSCCWNCGRAAQETCSGCGAARYCGAFCQHRDWENHHQVCSGRDTKPSSMVRTSPPSTQPILPKPLTRSSTPIVTPIVTPIAAPAATPNPAADRPAAAKK-