Monarch geneset OGS2.0

DPOGS200612
Transcript        DPOGS200612-TA  1299 bp
Protein           DPOGS200612-PA  432 aa
Genomic position  DPSCF300076 - 75883-94966
RNAseq coverage   8x (Rank: top 85%)
Annotation
Heliconius      HMEL009842        7e-135  82.46%
Bombyx          BGIBMGA008971-TA  1e-144  89.97%
Drosophila      RunxA-PA          3e-113  60.87%
EBI UniRef50    UniRef50_G6CUI9   0.0     100.00%  Putative uncharacterized protein n=3 Tax=Endopterygota RepID=G6CUI9_DANPL
NCBI RefSeq     XP_968985.2       3e-157  67.24%   PREDICTED: similar to CG34145 CG34145-PA [Tribolium castaneum]
NCBI nr blastp  gi|189240327      6e-156  67.24%   PREDICTED: similar to CG34145 CG34145-PA [Tribolium castaneum]
NCBI nr blastx  gi|189240327      2e-161  68.90%   PREDICTED: similar to CG34145 CG34145-PA [Tribolium castaneum]
Group
Gene Ontology   GO:0005634  1.5e-119  nucleus
                GO:0003677  1.5e-119  DNA binding
                GO:0005524  1.5e-119  ATP binding
                GO:0006355  1.5e-119  regulation of transcription, DNA-dependent
                GO:0003700  4.2e-79   sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain  [8-169]   IPR000040  1.5e-119  Acute myeloid leukemia 1 protein (AML 1)/Runt
                 [8-144]   IPR012346  4.2e-79   p53/RUNT-type transcription factor, DNA-binding domain
                 [8-141]   IPR013524  1.3e-74   Acute myeloid leukemia 1 (AML 1)/Runt
                 [12-138]  IPR008967  1.3e-62   p53-like transcription factor, DNA-binding
Orthology group  MCL10836  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200612-TA
ATGACATCAGATATACTCGCGGAGAGGACGCTGGGTGATTTTCTATCGGAGCACCCGGGGGAATTGGTGAGGACTGGTAGTCCACACTTCGTATGCACAGTGCTACCTCCTCACTGGCGATCCAATAAAACGCTGCCGGTGGCGTTTAAGGTGGTAGCGCTCGGTGACGTTGGAGACGGAACACTGGTGACTGTCAGGGCTGGTAACGATGAGAACTGCTGCGCTGAACTCCGTAACAGCTCGGCGGTCATGAAGAACCAAGTGGCAAAGTTCAACGATTTGAGATTCGTCGGCCGTAGTGGTCGCGGGAAATCATTCACATTGACAATAACGATATCGACGACGCCTCCGCAAGTCACAACCTACAATAAGGCTATCAAGGTCACCGTGGACGGACCCAGGGAACCGCGGTCGAAAACCAGGCAGCAGCAGCAATTTCATTTCGCATTCGGTCAACGGCCGTTTCCTTTCCCACCAGATCCTCTGGGAGGATTCCGGATGCCGCCGATTACTACATGTCAGAATATGAGTCAATTTGGTTTGAGTTCGAGTAACTCTCATTGGGGCTATGGTGGTGCCGGCGCTTACCCAGCATACCTTCCATCCTGTGCGGCTCCAGCGACACAGTTCAATACACCGACATTAGGCTTCGCTGGTTCCGTCCCTGAACAAACCCCCACTCAGGATTTCACTAATAACACCGTTCTACCGGATACGACGGGAGTGGATCTGGACCAACAGCTGTCCGGTCTAGTGGGATCGTCTCCATCACACCACGGCAGCTTGCTACCTAGATACAACAACAACACAGACTACACGCTATCCACCGGCCCACGCTCCCTCAGCGACAATAGCTCGCAACCGGAATCCCCGGTCCAAGACGACCTTTTAACTTCAAACACAACAACCAACATCGGTCACAACCATTCGAACTCCAACTTCTCGCTAATGAGTACTCAGAATGCATCATACGGAAGCAGCAACTGCAACAATTCCCTCTACCCCGTTCTACCGGCCAGCCTGCTATACAGTCAATTATACACAGCAGCTAATCAAAGTCACAATTTCCATCCGCTCCATTCGAACTCCATCCATTCAACGCAGAATCATCACAACGAACTACAAACTATGATGGACCAGATATCATCAACCACGAACCATAGACAGGGTCACGGGCAAGACTTGTTGGGTGGGAACTCGTGTGCTGCTGCGGCCGCGAGGGGGGAAGATGGAAGGGTTAGTTTGGGACAGCGGGGAAATCCCCAACCAGACAGCAACACCGTTTGGCGGCCCTATTGA

Protein sequence:

>DPOGS200612-PA
MTSDILAERTLGDFLSEHPGELVRTGSPHFVCTVLPPHWRSNKTLPVAFKVVALGDVGDGTLVTVRAGNDENCCAELRNSSAVMKNQVAKFNDLRFVGRSGRGKSFTLTITISTTPPQVTTYNKAIKVTVDGPREPRSKTRQQQQFHFAFGQRPFPFPPDPLGGFRMPPITTCQNMSQFGLSSSNSHWGYGGAGAYPAYLPSCAAPATQFNTPTLGFAGSVPEQTPTQDFTNNTVLPDTTGVDLDQQLSGLVGSSPSHHGSLLPRYNNNTDYTLSTGPRSLSDNSSQPESPVQDDLLTSNTTTNIGHNHSNSNFSLMSTQNASYGSSNCNNSLYPVLPASLLYSQLYTAANQSHNFHPLHSNSIHSTQNHHNELQTMMDQISSTTNHRQGHGQDLLGGNSCAAAAARGEDGRVSLGQRGNPQPDSNTVWRPY-
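
Consistency check (illustrative):

The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic: a coding sequence of N residues plus one stop codon should span 3 × (N + 1) bases. The sketch below is a minimal illustration. The function name `cds_length_from_protein` is hypothetical and is not part of any database tooling, and it assumes the transcript is a complete coding sequence with no untranslated regions, which the ATG start and the trailing stop (shown as "-" in the protein) suggest.

```python
def cds_length_from_protein(n_residues: int, include_stop: bool = True) -> int:
    """Return the coding-sequence length in bp implied by a protein length.

    Each residue is encoded by one 3-base codon; a complete CDS also
    carries one terminal stop codon.
    """
    codons = n_residues + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
transcript_bp = 1299  # DPOGS200612-TA
protein_aa = 432      # DPOGS200612-PA

# 3 * (432 + 1) = 1299, so the two reported lengths agree.
assert cds_length_from_protein(protein_aa) == transcript_bp
```

The same check applied to any other entry would flag a truncated or frame-shifted gene model if the two lengths disagree.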