Monarch geneset OGS2.0

DPOGS205412
TranscriptDPOGS205412-TA1122 bp
ProteinDPOGS205412-PA373 aa
Genomic positionDPSCF300407 + 237492-240875
RNAseq coverage275x (Rank: top 39%)
Annotation
HeliconiusHMEL0215364e-10868.11% 
BombyxBGIBMGA001581-TA2e-16986.49% 
DrosophilaMRG15-PA3e-8142.68% 
EBI UniRef50UniRef50_C0RWX30.092.01%Mrg15-like protein n=14 Tax=Coelomata RepID=C0RWX3_BOMMO
NCBI RefSeqNP_001139536.10.092.01%mortality factor 4-like [Bombyx mori]
NCBI nr blastpgi|2257030880.092.01%mortality factor 4-like [Bombyx mori]
NCBI nr blastxgi|2257030881e-17692.01%mortality factor 4-like [Bombyx mori]
Group
Gene OntologyGO:00056342.9e-140nucleus
KEGG pathway 
InterPro domain[1-334] IPR0086762.9e-140MRG
[1-187] IPR0161971.5e-35Chromo domain-like
[6-75] IPR0009533.5e-09Chromo domain/shadow
Orthology groupMCL13039 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205412-TA
ATGGCACCGAAACTGAAATTTGCTGAAGGGGAAAAAGTGTTGTGTTTTCATGGTCCCCTCATATATGAAGCCAAATGTCTTAAAAGTTCAGTTACAAAGGACAAGCATGTCCGATATTTGATACATTACGCCGGTTGGAACAAAAATTGGGACGAGTGGGTTCCTGAGAGCAGAGTATTGAAATACAATGAAGCAAATGTCCAACGACAAAAGGAAGTACAGAGAGCACATTCTGCACAACCAGCCAAGACGAAGAAAACTCCGGCGAAGGGTCGAAGATCAGAGGCAGCTGCAAATTCTACACCAGCCAGAGAGGAATCCAGAGCTTCAACACCTGCCGGAAAAGATGTTGAATCAACTCCCGCCCCGACTAAAGCCTCAAAAACTCAAAGTAAGGACATTCAAGCTGACTCCGGCTCGGATCAGCCCAAGAAAAAACGAGGACGTCTCGATCTATCAATAGAATCAGAAGAGCAATACCTAGCGAAGGTTGAGGTCAAGATTAAAATACCAGAGGAATTGAAAGTGTGGCTCGTGGACGACTGGGACGTGATAACGAGACAGCAGAAGCTGGCTATATTACCAGCAAAACTGACTGTATCCCAAATAGTAGACAACTATCTAGCGTTTAAGAAGTCAAGTAAATTGCACAATCAGGCAAAAGAATCAGTATTAGTTGATATAACGGAGGGTATCAAGGAATATTTCAATGCAACGATTGGTTCACAATTATTATACAAATTCGAAAGACCTCAGTATAGCGAGATACTACAGGAATATCCGGACACACCGCTGTCCCAAATATACGGATCAATACATTTGTTGAGATTGTTCGCCAAAATGGGACCGATGTTGGCTTACACAGCGCTCGATGAGAAATCCTTGCAACACGTGCTGTCCCATATCCAAGACTTCCTAAAGTACATGGTCACAAACAGATCTACGCTATTCAACTTGCAAGATTACGGCAACGCTACACCCGAGTATCATAGGAAAGTGCACCTGCTTACCTTTAAAGGCAGGGTTATACAAGGTCAGTGGTCGCTGGTCGGTGGTACGCGACAGGGCGGCGGCATTACGACACTACCACTTATGGATGAACTACGAACATATGGCCTCTGA

Protein sequence:

>DPOGS205412-PA
MAPKLKFAEGEKVLCFHGPLIYEAKCLKSSVTKDKHVRYLIHYAGWNKNWDEWVPESRVLKYNEANVQRQKEVQRAHSAQPAKTKKTPAKGRRSEAAANSTPAREESRASTPAGKDVESTPAPTKASKTQSKDIQADSGSDQPKKKRGRLDLSIESEEQYLAKVEVKIKIPEELKVWLVDDWDVITRQQKLAILPAKLTVSQIVDNYLAFKKSSKLHNQAKESVLVDITEGIKEYFNATIGSQLLYKFERPQYSEILQEYPDTPLSQIYGSIHLLRLFAKMGPMLAYTALDEKSLQHVLSHIQDFLKYMVTNRSTLFNLQDYGNATPEYHRKVHLLTFKGRVIQGQWSLVGGTRQGGGITTLPLMDELRTYGL-