Monarch geneset OGS2.0

DPOGS206538
TranscriptDPOGS206538-TA1026 bp
ProteinDPOGS206538-PA341 aa
Genomic positionDPSCF300190 - 137459-143417
RNAseq coverage1008x (Rank: top 13%)
Annotation
HeliconiusHMEL0116311e-5670.06% 
BombyxBGIBMGA004208-TA4e-12563.21% 
DrosophilaCG10621-PA7e-9250.16% 
EBI UniRef50UniRef50_Q2F5Q83e-11863.43%Homocysteine S-methyltransferase n=11 Tax=Endopterygota RepID=Q2F5Q8_BOMMO
NCBI RefSeqXP_966501.15e-11963.53%PREDICTED: similar to homocysteine S-methyltransferase isoform 1 [Tribolium castaneum]
NCBI nr blastpgi|910832139e-11863.53%PREDICTED: similar to homocysteine S-methyltransferase isoform 1 [Tribolium castaneum]
NCBI nr blastxgi|1140525143e-11564.14%homocysteine S-methyltransferase [Bombyx mori]
Group
Gene OntologyGO:00088982e-81homocysteine S-methyltransferase activity
KEGG pathwaytca:6621761e-118 
 K00547 (E2.1.1.10, mmuM)maps-> Cysteine and methionine metabolism
InterPro domain[13-322] IPR0037262e-81Homocysteine S-methyltransferase
Orthology groupMCL12369 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206538-TA
ATGACACCAACAAATGAGGACGGTGTGGAACCGCCGCACATAGTTGTTTTAGACGGAGGATTCTCTACGCAACTTTCCTGTCACGTAGGTCATGTCATTGACGGTGACCCTCTCTGGAGCGCCCGCTTCTTGCACACACACCCCAACGAGGTTGTGAATACTCATCTTGACTTCCTTAGAGCTGGGGCCAATTTTATAATTACGAATACATATCAAGCATCTGTCGAGGGTTTTGTGGAACACCTGGATCTGACACCGGAGCAAGGATATGAGCTCATCACCAGAGCTGTCGAGCTCGCGAAGCAGGCTCGTACATTGTATCTTGAGGAGTATGAGAATTACATACAACACGATCACGTCCCACTAGTTGTAGGATCTGTAGGACCATATGGGGCTCATTTGCACGATGGCTCGGAATACGACGGCAGTTACGCGGACACAACATCTGCTCAGACAATGCGTGAATGGCATAGACCTCGAATTCAAGCGTTAATAGAAGCTGGAGTGGATCTGCTAGCTTTAGAGACGATACCTTGTCAAGAAGAGGCTGAGATGTTGTGTGACTTGTTGCGCGAATTTCCCAATATGAAAGCTTGGCTGTCCTTTAGCTGCAAAGATAATCAAAGCATAGCTCACGGTGAAAGTTTTCAAAAAGTGGCTAAGAAATGTTGGGAGTCGAATTCAGATCAGCTGGTGGCTGTGGGGGTGAACTGCTGCGCCCCTTCGTTTGTGACCAGTCTATTAAAGGGGATCAACGACGATAGGCCGCACGACCCCATACCCCTCATCGTTTACCCCAACTCCGGCGAAAAGTACAACCCGCAAATTGGATGGATAGATCGCGATAAGTGCGAACCCGTGGAAGTATTCATCCAGGAATGGTTGGACTTAGGAGTGCGATACGTGGGCGGGTGCTGTCGTACATACGCAGCAGATGTATCAAGAATACGTAACCAGGTCCACTGCTGGAGAGATCGTTGGCGCTTCCAGCACAAGTTTACATCTAACACTCAGAATAATAATTGA

Protein sequence:

>DPOGS206538-PA
MTPTNEDGVEPPHIVVLDGGFSTQLSCHVGHVIDGDPLWSARFLHTHPNEVVNTHLDFLRAGANFIITNTYQASVEGFVEHLDLTPEQGYELITRAVELAKQARTLYLEEYENYIQHDHVPLVVGSVGPYGAHLHDGSEYDGSYADTTSAQTMREWHRPRIQALIEAGVDLLALETIPCQEEAEMLCDLLREFPNMKAWLSFSCKDNQSIAHGESFQKVAKKCWESNSDQLVAVGVNCCAPSFVTSLLKGINDDRPHDPIPLIVYPNSGEKYNPQIGWIDRDKCEPVEVFIQEWLDLGVRYVGGCCRTYAADVSRIRNQVHCWRDRWRFQHKFTSNTQNNN-