Monarch geneset OGS2.0

DPOGS203933
TranscriptDPOGS203933-TA1392 bp
ProteinDPOGS203933-PA463 aa
Genomic positionDPSCF300005 - 411497-415608
RNAseq coverage533x (Rank: top 24%)
Annotation
HeliconiusHMEL0135170.076.47% 
BombyxBGIBMGA002019-TA0.075.05% 
DrosophilaDeaf1-PA3e-8237.40% 
EBI UniRef50UniRef50_D6WIQ41e-12352.88%Putative uncharacterized protein n=3 Tax=Endopterygota RepID=D6WIQ4_TRICA
NCBI RefSeqXP_966671.11e-12452.88%PREDICTED: similar to Deformed epidermal autoregulatory factor-1 CG8567-PB [Tribolium castaneum]
NCBI nr blastpgi|3323726321e-12353.81%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323726322e-12053.62%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00056341e-35nucleus
GO:00036771e-35DNA binding
GO:00054888.5e-35binding
GO:00082701.3e-08zinc ion binding
KEGG pathwaycfa:4870444e-06 
 K10053 (RUNX1T1, CBFA2T1)maps-> Pathways in cancer
    Acute myeloid leukemia
InterPro domain[56-461] IPR0241193.7e-87Transcription factor DEAF-1
[152-224] IPR0007701e-35SAND domain
[150-239] IPR0109198.5e-35SAND domain-like
[401-437] IPR0028931.3e-08Zinc finger, MYND-type
Orthology groupMCL16043 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203933-TA
ATGGCGGAGAATAGAAGTTCGGAAAACGTCGTTATGCCGGACATTACCGAAGTAAGCGAACCCGATCCTCTGTCGACAGCTGCCGAGCACGATACCGATTGTTCTGATGGCGATACCGTATTGTCGGCAAGTATCAAAGATACAAGACGTTCAAGCGATTCAAACGGTGTGGTTACGGTCCCCGTATCACTGCCAGTTGGCACGCTCATTACTGGCACTACATTTAATGTCATAACTTCTGACCAATTGCAACACTTTAAACCTATGATATGCGTTGATAATGGTTTTATATCAGGTGGTCCTGTACAGGAAGATATAAAGCCTACTCACATAGTAATCCAAAACACTTCATCAACAACTGCTGCTCGTGAACAAAAGAGCCATGAAACTGGTTCTAATTCCACTAGTAATAGATCAATGAGCAATAGATCATGGGTCGAAACTGCCAACATGCCTATTCTCCCTATTAGATGTAAGAACACTTCAGCAGAACTACATAAGCAAAGGTTAGGATCTGGAGGGCGTGGAAGATGCATTAAATATGGAAGTGAATGGTACACTCCCAGTGAATTTGAAGCTCTTTGTGGGCGAGCATCAAGCAAGGACTGGAAGAGATCTATAAGATTTGGTGGGAGAAGTATTCAGGCTTTAATTGATGAAGGAATATTGACACCCCATGCTACTAGCTGCACTTGTGGTGCCTGCTGTGATGATCAAACTGCAATGGGACCTGTAAGACTATTTACCCCATATAAGAGGAAGAGAAAAAACCAAGATGGAACAGATGAGAAGAATGTGAAAGTAAAGCGTGATACTTCATTGAGTGATGCTGAAGTAGACAGCATTCATCAGACAAGTAGCAATAGTCATTCAAAAGAAGCGTGGCAGACAATAGCTGAGGGATTGGACACAAACTCAGATTATCATTTGTTGGCAAGTCCTGAACCGCCTCCAGATATAACTGCAGCTATTCCGGACATGACAAAAGTATTGAAACGGCTCGAAGATATCGGTCAGAACCTTACACGGCTGTCTGGAGAGTTGAAGCAATGTGTTGAAGACGTTAAGATAATGAGCACAAGACAGATGGAGAGATTGGAGCAGGAGCGTGCCTCCGCTTTACTAGCAGCGAGTATGGATGCTCATGTAGAGGCCGAACAGGTCTCATTACATAATGTTGACGAATCCGAAGCTAAGAAGTGCGCAAACTGCAACCGCGAAGCTTCCGCCGAATGTTCACTGTGCCGTCGGACGCCATATTGCTCGACATATTGCCAAAAAAAAGACTGGGCTGCCCATCAGATTGAATGTCTCCGATCGGTGCCTACAATACACACAGACGCACAACAGCACCAGTCCATTATGTTAATTGTAGAAAGCCCGCATCAATAG

Protein sequence:

>DPOGS203933-PA
MAENRSSENVVMPDITEVSEPDPLSTAAEHDTDCSDGDTVLSASIKDTRRSSDSNGVVTVPVSLPVGTLITGTTFNVITSDQLQHFKPMICVDNGFISGGPVQEDIKPTHIVIQNTSSTTAAREQKSHETGSNSTSNRSMSNRSWVETANMPILPIRCKNTSAELHKQRLGSGGRGRCIKYGSEWYTPSEFEALCGRASSKDWKRSIRFGGRSIQALIDEGILTPHATSCTCGACCDDQTAMGPVRLFTPYKRKRKNQDGTDEKNVKVKRDTSLSDAEVDSIHQTSSNSHSKEAWQTIAEGLDTNSDYHLLASPEPPPDITAAIPDMTKVLKRLEDIGQNLTRLSGELKQCVEDVKIMSTRQMERLEQERASALLAASMDAHVEAEQVSLHNVDESEAKKCANCNREASAECSLCRRTPYCSTYCQKKDWAAHQIECLRSVPTIHTDAQQHQSIMLIVESPHQ-