Monarch geneset OGS2.0

DPOGS207627
TranscriptDPOGS207627-TA1605 bp
ProteinDPOGS207627-PA534 aa
Genomic positionDPSCF300199 - 97313-102447
RNAseq coverage747x (Rank: top 17%)
Annotation
HeliconiusHMEL0061605e-9753.76% 
BombyxBGIBMGA006011-TA2e-12551.03% 
DrosophilamRNA-cap-PA4e-14447.40% 
EBI UniRef50UniRef50_D3TNV74e-14547.42%mRNA capping enzyme guanylyltransferase subunit alpha n=2 Tax=Glossina morsitans morsitans RepID=D3TNV7_GLOMM
NCBI RefSeqXP_972171.17e-15052.51%PREDICTED: similar to mRNA capping enzyme [Tribolium castaneum]
NCBI nr blastpgi|910831711e-14852.51%PREDICTED: similar to mRNA capping enzyme [Tribolium castaneum]
NCBI nr blastxgi|910831718e-15352.53%PREDICTED: similar to mRNA capping enzyme [Tribolium castaneum]
Group
Gene OntologyGO:00063976.9e-63mRNA processing
GO:00063706.9e-63mRNA capping
GO:00044846.9e-63mRNA guanylyltransferase activity
GO:00081384.1e-10protein tyrosine/serine/threonine phosphatase activity
GO:00064704.1e-10protein dephosphorylation
KEGG pathway 
InterPro domain[224-413] IPR0013396.9e-63mRNA capping enzyme
[417-513] IPR0138461.4e-25mRNA capping enzyme, C-terminal
[416-531] IPR0160274.2e-25Nucleic acid-binding, OB-fold-like
[413-506] IPR0123402.5e-24Nucleic acid-binding, OB-fold
[45-133] IPR0003404.1e-10Dual specificity phosphatase, catalytic domain
Orthology groupMCL16128 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207627-TA
ATGTATCGATTTACACCATCAATGGTTTTTGACTACGTTAAGAAGTATAAAAAAAGATTAGGTTTATGGATAGATTTAACGAATACAACAAGATTTTACGACAGAACAGAAGTGGAGAACAGAGGGTGTATATATAAGAAATTATCATGTCGTGGTCATGGGCAAACACCTTCAGAACAACAAACAAAACAGTTTATAGATATTGTGAGTGATTATATTGCACAAAATCCAAACAATTTAATTGGTGTCCATTGTACTCATGGATTTAACAGGACTGGTTTTCTTCTCTGCGCCTATATGATAATACAGGAGGATTGTAGTGTAGATTTTGCAATTTTTAATTTTGCTCAAGAGAGACCGCCAGGTATTTACAAGCAAGACTATATTGATGAACTAATAAAAAGATTCAAAGGTGATTGCGCGTTGGAGGCTCCCACGCTACCCGATTGGTGTGACGAAGAACAAATTGATTATGATGACAATGATAGAGATGGCTCTAGTCACTCACAGAGTAATTCTTCAAGAAAGAGGGAGGGGAAATACATAAACAAAAAGTTCATGATAGAACACGAAAAAGTAACATTATTGACTGACACGAAGAAAATCGATGCAATACGTGAAACGGCGGCCTCATATTTGAAGTGGAAAGTGAATGATTTCCCTGGAGCACAGCCGGTTTCAATGACTAGGAAAAATATAGAAAATTTGCAAAAGTACCCCTATCAAGTGTCTTGGAAAGCTGATGGTGTTAGATACATGATGCTTATTGTAGACGATGACGAAGTTTACATGATAGATAGAGATAATTGTATATTTAAAGTGGACAATTTAAAATTTCCTCATAACACAAAACCGAGGCATCTGCGGAAAACTTTACTAGACGGAGAAATGGTTATAGACAAAGTTGATGGTAGAGAAAAACCGAGATATTTAATTTATGATATAATAAGGTTTGAAGATACGAATGTAGGCAGAGAACACTTTTATCCGGTTAGGCTTCATTGTATAGAAGTGGAAATCGTTAATCCCAGAAATCGAGCTATAGTGAGCGGTCATATAAGAAAAGAATTGGAACCATTCAGTGTTATCATAAAACGTTTCTGGGATGTAAGGATGGCACACAGTTTACTGGAGGATAAGTTTATAAGGACACTGCATCATGAACCTGACGGACTCATTTTTCAACCATCAGAGATGCCCTACTCAGGTGGTCCCTGCGAGTTCATATTAAAATGGAAACCAAGTGATCAAAACAGCATTGATTTTAAACTTGTTCTGGAAAAGGAGACTGGACTAGGACTTGTTTCCGAAACGAAAGGCAACTTATATGTCGGCGGATCGAACGTCCCCTTTGGATGGACAGCATATAATAAGAAAATCAAGCATTTAAACAACAAGATAATTGAATGCAAGCTAGTCAACCGCTGCTGGGTCTTCATGAGGGAACGAACGGATAAGTCGTTTCCAAACTCCTACACAACAGCTAAAGCTGTAATGGAGAGCATCGTTAATCCGGTCACAAAGGAATATCTGTTGGACTTCATTAAATACAACTCTTACAGAAAACCAGACATAAATCAATCAAAACGACCACGGCTCGAATAA

Protein sequence:

>DPOGS207627-PA
MYRFTPSMVFDYVKKYKKRLGLWIDLTNTTRFYDRTEVENRGCIYKKLSCRGHGQTPSEQQTKQFIDIVSDYIAQNPNNLIGVHCTHGFNRTGFLLCAYMIIQEDCSVDFAIFNFAQERPPGIYKQDYIDELIKRFKGDCALEAPTLPDWCDEEQIDYDDNDRDGSSHSQSNSSRKREGKYINKKFMIEHEKVTLLTDTKKIDAIRETAASYLKWKVNDFPGAQPVSMTRKNIENLQKYPYQVSWKADGVRYMMLIVDDDEVYMIDRDNCIFKVDNLKFPHNTKPRHLRKTLLDGEMVIDKVDGREKPRYLIYDIIRFEDTNVGREHFYPVRLHCIEVEIVNPRNRAIVSGHIRKELEPFSVIIKRFWDVRMAHSLLEDKFIRTLHHEPDGLIFQPSEMPYSGGPCEFILKWKPSDQNSIDFKLVLEKETGLGLVSETKGNLYVGGSNVPFGWTAYNKKIKHLNNKIIECKLVNRCWVFMRERTDKSFPNSYTTAKAVMESIVNPVTKEYLLDFIKYNSYRKPDINQSKRPRLE-