Monarch geneset OGS2.0

DPOGS211153
Transcript: DPOGS211153-TA; 651 bp
Protein: DPOGS211153-PA; 216 aa
Genomic position: DPSCF300007 + 28623-29273
RNAseq coverage: 111x (Rank: top 59%)
Annotation
Heliconius: HMEL017192; 1e-91; 68.25%
Bombyx: BGIBMGA003139-TA; 1e-78; 59.62%
Drosophila: msta-PA; 2e-34; 33.50%
EBI UniRef50: UniRef50_D6WLG5; 8e-40; 36.94%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WLG5_TRICA
NCBI RefSeq: XP_001122203.1; 4e-38; 37.44%; PREDICTED: similar to Protein msta, isoform A [Apis mellifera]
NCBI nr blastp: gi|350424241; 4e-40; 40.69%; PREDICTED: protein msta, isoform B-like [Bombus impatiens]
NCBI nr blastx: gi|350424241; 4e-40; 40.69%; PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Group
Gene Ontology: GO:0005515; 6.2e-11; protein binding
KEGG pathway:
InterPro domain: [9-72]; IPR001214; 6.2e-11; SET domain
Orthology group: MCL16625; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211153-TA
ATGATCTTAGTAATATGTATTCTTAACACAAATGCCTTTCAGATGGCCACACCGTATGGTAAAAAGGAAATGAGTTTGCGTGGATTATACCCTGTCGCATCCATTTTGAATCACAACTGTGTACCAAACACAAGAAACTGTTTTAATGGTGACTTACAAATGACCGTAAAGGCCACGAAAACCATCAACGCCGGTAGCGAAATATTCACATGTTACTCGGGTATGCTCTGGGGAACACCAGCCCGACGTCTATATCTATATAAAAGCAAGCATTTCTTGTGTGATTGTGAGCGTTGCGCGGATCCAACCGAGAGAGGTACGTTGCTGGCTGCCCTAAAATGTTTCTCAACAGAGTGCCAGGGCTCACTACTACCGATACAACCATTAAAAACAACCACAGCTTGGCGCTGTCTAGAATGTGGGATGAGAGTGCCAAATGATAACATTTGTGTTATACAGAGCGCGTTAGGTAGTCTTATGGGATCATTGGATCTGAAAAACGTCGATGAATTAGAAAATTTCTATTTAAACAGAATCACTAGATACGTTCCAAGAACAAATCAAATAGTATTGGATCTACAATGTCGTCTCGTTTGGGAGCTTGGCGAAACGGATGGACTTCGATGGAATGGTAAGTTTACCGAAACATAA

Protein sequence:

>DPOGS211153-PA
MILVICILNTNAFQMATPYGKKEMSLRGLYPVASILNHNCVPNTRNCFNGDLQMTVKATKTINAGSEIFTCYSGMLWGTPARRLYLYKSKHFLCDCERCADPTERGTLLAALKCFSTECQGSLLPIQPLKTTTAWRCLECGMRVPNDNICVIQSALGSLMGSLDLKNVDELENFYLNRITRYVPRTNQIVLDLQCRLVWELGETDGLRWNGKFTET-
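
The lengths reported in this record can be cross-checked against each other. The sketch below is only an illustration: the helper names are hypothetical, and it assumes that the genomic coordinates are 1-based and inclusive, that the transcript is a single contiguous interval, and that the 651 bp include the terminal stop codon (the nucleotide sequence ends in TAA and the protein sequence ends in "-").

```python
# Consistency check for the lengths in this record.
# Assumptions (not stated in the record itself): coordinates are 1-based and
# inclusive, the transcript is one contiguous interval, and the coding
# sequence length includes the stop codon.

def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based genomic interval."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Number of amino acids encoded by a CDS that includes its stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
GENOMIC_START, GENOMIC_END = 28623, 29273
TRANSCRIPT_BP = 651
PROTEIN_AA = 216

if __name__ == "__main__":
    print(span_length(GENOMIC_START, GENOMIC_END) == TRANSCRIPT_BP)
    print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)
```

Under these assumptions the genomic span, the transcript length, and the protein length agree, which is what a single-exon gene model would look like.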