Monarch geneset OGS2.0

DPOGS211136
TranscriptDPOGS211136-TA1188 bp
ProteinDPOGS211136-PA395 aa
Genomic positionDPSCF300007 - 266930-268575
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0172162e-14865.22% 
BombyxBGIBMGA003336-TA5e-15165.66% 
Drosophilal(2)35Bd-PA4e-9547.43% 
EBI UniRef50UniRef50_Q9VJQ46e-9347.43%mRNA cap guanine-N7 methyltransferase n=11 Tax=Drosophila RepID=MCES_DROME
NCBI RefSeqXP_001602514.13e-10252.99%PREDICTED: similar to mrna (guanine-7-)methyltransferase [Nasonia vitripennis]
NCBI nr blastpgi|3072059725e-10153.12%mRNA cap guanine-N7 methyltransferase [Harpegnathos saltator]
NCBI nr blastxgi|3072059728e-10153.12%mRNA cap guanine-N7 methyltransferase [Harpegnathos saltator]
Group
Gene OntologyGO:00044823.4e-126mRNA (guanine-N7-)-methyltransferase activity
GO:00063703.7e-88mRNA capping
KEGG pathway 
InterPro domain[1-373] IPR0168993.4e-126mRNA (guanine-N(7))-methyltransferase
[33-371] IPR0049713.7e-88mRNA capping enzyme, large subunit
Orthology groupMCL14475 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211136-TA
ATGAATGACTCTTCGACTAGTTGTGAACAAAAAACAAAAAAGAATGAAGATAATAAGTTAAAAAGGACTTTAAATGAGGGGGATGACTCTGAGATATCTGCCAAGAAGGCTTGTAACCACGCTAATATCGTCGCTAATCATTACAATCATCTTGAACAGAAGGGATTAAGAGAAAGGTTTAATTCTCCAATACTTTATTTAAGAAACTTCAACAATTGGGTCAAAAGTGTTCTTATACAAGAATACTCTGATAAAGTGCGTGAAAAAAATTATTTACGATCTCTAAGAGTTCTTGATATCTGCTGTGGAAAAGGTGGTGATCTTAGTAAGTGGCAGAAAGCTCGAGCTGACCATGTTGTATTTGCAGATATTGCAGAGGTCTCTGTACAGCAATGTAAAGAAAGATATGACGAGATCCATCGTCGTTTTGGAAGGTTGTTTACAGCTGAGTTTATAGCTGCAGATTGTTCTAGAGAAACTTTAAGAGATAAATATAAGGATCCTTCAATAAAGTATGACGTAGTTAGTTGCCAGTTTGGGTTGCATTATAGCTTTGAAAGTTTAAAACAAGCACGTAGAATGCTTTTGAACATATCCGAGTGCCTTAAACCTGGTGGTTATTTCTTTGGGACTATACCTGACGCATATGAAATTGTCTCTAGGTGTAAGAAATCTCCAGATAATTCATTTGAAAATAGAATCTGTAATATCAAATTAATGTTTGACTCTGAAAAAGACGGATTTCCCCTTTTTGGGGCAAAATATGATTTCCATTTGGAAGGTGTTGTTGATTGTCCAGAATTCTTAGTTTATTTTGATATGTTTACCAAGTTAGCTTTAGAGTGTGGCTTGGAATTGGTGTATAAGGCTGGATTTTCAGATTTTTATAAAGAGCATTTAGAAAAATATAAGGATTTATTGCCAAAAATAAAGTGTTTTGAAAATTACCCCATGCCTCAAGGAAAAGAATTAATAGGTGATGAGTTAGAATATGAACATGCAAAAAAATACATTGAAAACATTTCAGATGGCACCAAAAAAGATTACCTAACAATGAGTCGTAGTGAATGGGAGGTAGCAACTGTTTACCTGGCATTTGCGTTTAAAAAATGCAAAAACACTTGGAATGCAGATGGGAAGCCAGTGTACCAAACTTCTTCAAGTAACTGTTCAACGGCAACAGACTAG

Protein sequence:

>DPOGS211136-PA
MNDSSTSCEQKTKKNEDNKLKRTLNEGDDSEISAKKACNHANIVANHYNHLEQKGLRERFNSPILYLRNFNNWVKSVLIQEYSDKVREKNYLRSLRVLDICCGKGGDLSKWQKARADHVVFADIAEVSVQQCKERYDEIHRRFGRLFTAEFIAADCSRETLRDKYKDPSIKYDVVSCQFGLHYSFESLKQARRMLLNISECLKPGGYFFGTIPDAYEIVSRCKKSPDNSFENRICNIKLMFDSEKDGFPLFGAKYDFHLEGVVDCPEFLVYFDMFTKLALECGLELVYKAGFSDFYKEHLEKYKDLLPKIKCFENYPMPQGKELIGDELEYEHAKKYIENISDGTKKDYLTMSRSEWEVATVYLAFAFKKCKNTWNADGKPVYQTSSSNCSTATD-