Monarch geneset OGS2.0

DPOGS200621
Transcript: DPOGS200621-TA; 1383 bp
Protein: DPOGS200621-PA; 460 aa
Genomic position: DPSCF300076 + 167970-195615
RNAseq coverage: 113x (Rank: top 59%)
Annotation
Heliconius: HMEL007603; 4e-75; 74.77%
Bombyx: BGIBMGA008908-TA; 2e-69; 72.86%
Drosophila: lz-PA; 4e-56; 74.26%
EBI UniRef50: UniRef50_B6S2Q5; 3e-66; 39.96%; RUNX4 n=1 Tax=Aedes aegypti RepID=B6S2Q5_AEDAE
NCBI RefSeq: XP_001603414.1; 2e-60; 57.08%; PREDICTED: similar to lozenge [Nasonia vitripennis]
NCBI nr blastp: gi|383854629; 4e-66; 40.21%; PREDICTED: uncharacterized protein LOC100875563 [Megachile rotundata]
NCBI nr blastx: gi|194245234; 8e-73; 39.67%; RUNX4 [Aedes aegypti]
Group
Gene Ontology: GO:0005634; 1.3e-96; nucleus
GO:0003677; 1.3e-96; DNA binding
GO:0005524; 1.3e-96; ATP binding
GO:0006355; 1.3e-96; regulation of transcription, DNA-dependent
GO:0003700; 4.9e-70; sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain: [37-157]; IPR000040; 1.3e-96; Acute myeloid leukemia 1 protein (AML 1)/Runt
[27-155]; IPR012346; 4.9e-70; p53/RUNT-type transcription factor, DNA-binding domain
[28-156]; IPR013524; 9.6e-67; Acute myeloid leukemia 1 (AML 1)/Runt
[27-153]; IPR008967; 8.9e-57; p53-like transcription factor, DNA-binding
Orthology group: MCL22197; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200621-TA
ATGCACTTGCCGTACGAGTCTCGAGGCCGTCGGCGGCGCGAGATGGCCGAACTCGAACCCTGGTGGCTGCAGTCCATCATCGACGAGACCTTAGGAGAGCATCCCGACCTAGTGCGAACGGGCAGTCCTGATTACATGTGCTCAATGCTGCCACAACACTGGCGCTCGAACAAAACTCTACCTGGTGGCTTCAAGGTGGTAGCATTGGGGGACGTGGTAGATGGCACACAAGTCACCGTACGAGCAGGCAATGATGAGAACTGTTCTGCCGAACTGAGGAATAATACGGCTGTTATGAAGAATAGGATCGCTAAGTTTAACGATCTCAGATTCGTCGGTAGAAGTGGACGGGGTAAGAGCTTTTCATTGACAATAACGGTCTCGACGTCACCACCACAGGTAGCGACCTACCTGAAGGCTATCAAAGTCACTGTGGACGGACCAAGGGAACCACGATCCAAAACTAGTAAGAGCTTTTCATTGACAATAACGGTCTCGACGTCACCACCACAGGTAGCGACCTACCTGAAGGCTATCAAAGTCACTGTGGACGGACCAAGGGAACCACGATCCAAAACTAGTACAAACACTTCGCCTCATCCGATACGATCGTTGTCCTTCCAGCGTTCTTTCATTCCAAGCTCGAGCATGGCTATGAGAGACATGGAATTCAAGTCCTCAACCAGAAGTCTGCGGTCATTGTCACACGAAGAAAGAGAATATAAGACTAACGCCAATCTACCGACAGAAGAAAATACTGGCAGTATCCTAGGAGCTAGCGAGTGGAACACCGGATATCCGTCAACTGCCACCGTATATCCTAGCTACCCTCCTCTCCAGCCGCCATATTACAAACCCGACCCTACCTTACATATACCCGGCGTTCTCCCCGAAATCCCTATCGGGACTCCCGACTACGGTTGTTATCAGACCAGTTCTAGCGGCTACGATAAAGGAAGTCCGAGTGGAACGAACAGCAGTCTCACAGATCTCAATTCGCCATCGATGTCCACCCAAAGATACGAACCAAATTACTACAATTCATGGCCTAGCAACAGCTACAACTACCAATACAACAACATAAACAATAATCCCGCCTGTCTCCAATCACACACGCCATATATAAACCCAAACCCCCAAATGATCCTTCCAAATCTTTATTCCACGGTAAACCAGAACCAAATACACGTCCACCTTCACAGCTCATCCGACAAATACAATCTAGAACAATACAAAATAAGCGACATAAATGGCGGGATTTCAATAACCACCGAATTACAAGGTGCGGGGGAAGCGAGCGGTCTAGTGCAAACATGTGAAATAACTGATGACGTGAAACACGGGCTGTACGGAGCGAGCCAAGAAGTTTGGCGGCCATATTGA

Protein sequence:

>DPOGS200621-PA
MHLPYESRGRRRREMAELEPWWLQSIIDETLGEHPDLVRTGSPDYMCSMLPQHWRSNKTLPGGFKVVALGDVVDGTQVTVRAGNDENCSAELRNNTAVMKNRIAKFNDLRFVGRSGRGKSFSLTITVSTSPPQVATYLKAIKVTVDGPREPRSKTSKSFSLTITVSTSPPQVATYLKAIKVTVDGPREPRSKTSTNTSPHPIRSLSFQRSFIPSSSMAMRDMEFKSSTRSLRSLSHEEREYKTNANLPTEENTGSILGASEWNTGYPSTATVYPSYPPLQPPYYKPDPTLHIPGVLPEIPIGTPDYGCYQTSSSGYDKGSPSGTNSSLTDLNSPSMSTQRYEPNYYNSWPSNSYNYQYNNINNNPACLQSHTPYINPNPQMILPNLYSTVNQNQIHVHLHSSSDKYNLEQYKISDINGGISITTELQGAGEASGLVQTCEITDDVKHGLYGASQEVWRPY-