Monarch geneset OGS2.0

DPOGS206684
TranscriptDPOGS206684-TA2046 bp
ProteinDPOGS206684-PA681 aa
Genomic positionDPSCF300048 + 1183070-1200879
RNAseq coverage68x (Rank: top 67%)
Annotation
HeliconiusHMEL0065714e-7163.01% 
BombyxBGIBMGA008517-TA3e-7162.32% 
DrosophilaCG32732-PA1e-6431.85% 
EBI UniRef50UniRef50_UPI00021A7B436e-8941.61%UPI00021A7B43 related cluster n=2 Tax=unknown RepID=UPI00021A7B43
NCBI RefSeqXP_393639.24e-9241.07%PREDICTED: similar to SET domain containing 3 [Apis mellifera]
NCBI nr blastpgi|3800152485e-9138.64%PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis florea]
NCBI nr blastxgi|3800152482e-8938.64%PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis florea]
Group
Gene OntologyGO:00055151.1e-05protein binding
KEGG pathway 
InterPro domain[334-404] IPR0153537.7e-11Rubisco LS methyltransferase, substrate-binding domain
Orthology groupMCL10527 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206684-TA
ATGGGACGAAAACTTCAGTCAAAGCTTACATGCAAGAAGAAAAATGTTAAAGAAGGGAATAGATTTTTGCAGCAGAGACGTAAAGAATTAGCAGTTTTAGTAGATACATTACTTAAATTAACCAGCACGTTTCAAAGTACGGGTAAAAGCTTTGAGCACCATTTACAAATCGAAAAAATTATCAAAGAAATTATAAATATTGAGTCCATTTCAAATAAAAGTACAAACAGACAGAGGAAATTATATATAGAAAACTATGTCAGTTGGTTACATGAACATGGAGCTGAATTTGAAGGAGTGGAAATAAGTGAATTTGATGGTTACGGGTTCGGTTTAAAGGCGACCAAAGATTTTTCAGAAGGATCACTTATATTAACTGTACCTGGCAAAGTTATGATGAGTGAGAAAGATCCAAAAGCATCCGACTTATCAGAATTTATCAACATAGATCCACTTTTACAGAATATGCCAAATGTTACCCTAGCGTTGTTTTTGCTTTTAGAAAAGAATAATCCCAACTCTTTCTGGAAGCCATACATTGATGTACTGCCTGAGAAGTATTCCACAGTACTATACTTTAACTCAGAAGAACTAGCCGAGCTGAGGCCTTCACCTGTTTTTGAGTCGTCATTAAAATTGTACAGAAGTATTGTAAGACAATACGCCTACTTCTACAACAAAATTCACACAATAGACTTGCCAGTTCTCAAAAATCTACAAGATATATTCACATTTGATAACTACAGATGGGCGGTGTCCACTGTGATGACCCGCCAGAACAACATAGTTCAGGGGACTGCCTTCACGTTGACGAACGCTTTCATACCGCTCTGGGACATGTGCAATCATAAACACGGCAAGATAACGACCGATTTCAATTTGGAGCTGAACCGCGGCGAGTGTTACGCGTTACAAGACTACAGACGAGACGAACAGATATTCATATTTTACGGAGCGAGACCGAACTCGGATCTCTTCCTGCATAATGGTTTTGTGTATCCGGATAATGATTACGATAGTTTGTCTATCGCGTTGGGTATAAGTCCCAACGACGCTTTGAGGAACGGAAAAGTCAATCTATTGAATAAGCTCGGCCTGTCTGGTGTCACAAACTTCTCGCTATACAAAGGCGCGAGTCCCATCAGCGTGGAACTGCTCGCCTTTATAAGGATTTTCAATATGAACCAAGAGGAATTAGAGAAGTGGTCGGCCGAGAGCATCCCTAGTGATTTGCTGTCTTTTGAGACAGGAACCGAATACAATATGGCGTCGATTGATAAAAGAGGATTTACATACCTTCTGACCAGGTGCGGCCTCATCAGGGGTACTTACAAAGACAGTGGGGGTGATGTCCAGTCTGAGCACAGGAAAAACATAAAACTATTGAAGCAATGCGAAGTACAAATATTAGAAAATGCCATAAAGTACTTAAGGGACATAACGACCGATTTCAATTTGGAGCTGAACCGCGGCGAGTGTTACGCGTTACAAGACTACAGACGAGACGAACAGATATTCATATTTTACGGAGCGAGACCGAACTCGGACCTCTTCCTGCATAATGGTTTTGTGTATCCGGATAATGATTACGATAGTTTGTCTATCGCGTTGGGTATAAGTCCCAACGACGCTTTGAGGAACGGAAAAGTCAATCTATTGAATAAGCTCGGCCTGTCTGGTGTCACAAACTTCTCGCTATACAAAGGCGCGAGTCCCATCAGCGTGGAACTGCTCGCCTTTATAAGGATTTTCAATATGAACCAAGAGGAATTAGAGAAGTGGTCGGCCGAGAGCATCCCTAGTGATTTGCTGTCTTTTGAGACAGGAACCGAATACAATATGGCGTCGATTGATAAAAGAGGATTTACATACCTTCTGACCAGGTGCGGCCTCATCAGGGGTACTTACAAAGACAGTGGGGGTGATGTCCAGTCTGAGCACAGGAAAAACATAAAACTATTGAAGCAATGCGAAGTACAAATATTAGAAAATGCCATAAAGTACTTAAGGGACGTCATAGACAAGATTTCCGGAGACGAGAAATAA

Protein sequence:

>DPOGS206684-PA
MGRKLQSKLTCKKKNVKEGNRFLQQRRKELAVLVDTLLKLTSTFQSTGKSFEHHLQIEKIIKEIINIESISNKSTNRQRKLYIENYVSWLHEHGAEFEGVEISEFDGYGFGLKATKDFSEGSLILTVPGKVMMSEKDPKASDLSEFINIDPLLQNMPNVTLALFLLLEKNNPNSFWKPYIDVLPEKYSTVLYFNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTIDLPVLKNLQDIFTFDNYRWAVSTVMTRQNNIVQGTAFTLTNAFIPLWDMCNHKHGKITTDFNLELNRGECYALQDYRRDEQIFIFYGARPNSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVNLLNKLGLSGVTNFSLYKGASPISVELLAFIRIFNMNQEELEKWSAESIPSDLLSFETGTEYNMASIDKRGFTYLLTRCGLIRGTYKDSGGDVQSEHRKNIKLLKQCEVQILENAIKYLRDITTDFNLELNRGECYALQDYRRDEQIFIFYGARPNSDLFLHNGFVYPDNDYDSLSIALGISPNDALRNGKVNLLNKLGLSGVTNFSLYKGASPISVELLAFIRIFNMNQEELEKWSAESIPSDLLSFETGTEYNMASIDKRGFTYLLTRCGLIRGTYKDSGGDVQSEHRKNIKLLKQCEVQILENAIKYLRDVIDKISGDEK-