Monarch geneset OGS2.0

DPOGS211994
Transcript: DPOGS211994-TA; 1338 bp
Protein: DPOGS211994-PA; 445 aa
Genomic position: DPSCF300514 + 9472-17343
RNAseq coverage: 468x (Rank: top 27%)
Annotation
Heliconius: HMEL009358; 4e-130; 62.72%
Bombyx: BGIBMGA009808-TA; 0.0; 80.04%
Drosophila: CG9977-PA; 0.0; 69.82%
EBI UniRef50: UniRef50_O43865; 3e-174; 67.48%; Putative adenosylhomocysteinase 2 n=243 Tax=Metazoa RepID=SAHH2_HUMAN
NCBI RefSeq: XP_624152.2; 0.0; 73.88%; PREDICTED: similar to CG9977-PA [Apis mellifera]
NCBI nr blastp: gi|307182708; 0.0; 74.00%; Putative adenosylhomocysteinase 3 [Camponotus floridanus]
NCBI nr blastx: gi|345486611; 0.0; 72.25%; PREDICTED: putative adenosylhomocysteinase 3-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology: GO:0004013; 1.8e-302; adenosylhomocysteinase activity
    GO:0006730; 1.8e-302; one-carbon metabolic process
    GO:0005488; 4.5e-56; binding
KEGG pathway: ame:551762; 0.0
    K01251 (E3.3.1.1, ahcY) maps-> Selenoamino acid metabolism
    Cysteine and methionine metabolism
InterPro domain: [56-445] IPR000043; 1.8e-302; Adenosylhomocysteinase
    [208-364] IPR015878; 4.1e-73; S-adenosyl-L-homocysteine hydrolase, NAD binding
    [209-362] IPR016040; 4.5e-56; NAD(P)-binding domain
Orthology group: MCL10645; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211994-TA
ATGCCGTCCTCTGGTATCCTGAAGAACAGCATGAATGAGGAGTGCGGAGCCTCGCTGAAGAAGCGCCACTCGATAGTGTCCGGGAGAAGTGACTTCTGGTCCTCGTCAGATGAAGACGAGGGAGTCCAACCTCACGAGAAGGCTATCGCTGCCGGCCAGCCGAATCCCCCGTACTGCGTGCGTAACATAGAACAGCATGATTTTGGTAGGAGGGAGATTGAGATCGCTGAACAGGAGATGCCGGGCATAATGGCCTTGAGGGCGAGGGCAAAGGACGATAAGCCGCTAAAAGACGCGAAGATAGTAGGCTGCACACACATTAACGCTCAAACGGCCGTGTTGATCGAGACCCTGGCCGCTTTGGGCGCGACTGTCCGTTGGGCGGCATGCAACATATACAGCACACAGAATGAAGTGGCCGCTGCCTTAGCGGATGCGGGTTACGCTATATTCGCGTGGCGTGGCGAGAGCGAGGAGGCGTTCTGGTGGTGTATAGACCAGTGCTGTACCCCCACCAGCGGCTGGCAACCCAACATGATACTAGACGACGGTGGAGACGCCACGCACCTGATGCTGAAGAAGCACCCGGCCGCCTTCAAACAGATCAAAGGTACGTCAAAGATAAGGGAGAGCATCATAGACGCCCTTAAACGCAGCACGGACCTGATGTTCGGCGGCAAGCAGGCGGCCGTGTGCGGGTACGGGGAGGTGGGCAAGGGCTGCTGCCAGGCGCTCAAGGCGCTCGGCTGTGTGGTGTACGTCACTGAGATCGACCCTATCTGCGCGCTGCAGGCCGCCATGGACGGCTTCAGGGTGGTCAAGCTGAACGAGGTGATAAGACAAGTGGATATAGTCATAACGGCGACGGGGAATAAGGGCGTGGTCACACGGGACCACATGGAGAGAATGAAGAATGGATGTGTGGTGTGCAACATGGGCCACAGTAACACTGAGGTGGATGTACACGCGCTCAGAACACCTGATCTGATGTGGGAGAGGGTCAGGAGTCAGGTGGATCATATAATCTGGGGTAACGGCAAGCGCATCGTGTTGCTGGCGGAGGGCCGGCTCGCCAACCTGTGCTGCTCGTCGCTGCCGTCGTTCGTGGTGTCCGTGACGGCCGCCACGCAGGCGCTCGCACTCATAGAGCTCTACAACGCGCCCGCACACCGATACAAGGCCGACGTGTATCTTCTACCAAAGAAAATGGACGAGTACGTGGCCAGTTTACATCTACCCACGTTCGACGCACATCTCACGGAGCTCACAGACGAACAGGCCAAATATCTAGGTCTTAATAAAGTGGGGCCCTTCAAACCTAACTATTATAGGTACTAG

Protein sequence:

>DPOGS211994-PA
MPSSGILKNSMNEECGASLKKRHSIVSGRSDFWSSSDEDEGVQPHEKAIAAGQPNPPYCVRNIEQHDFGRREIEIAEQEMPGIMALRARAKDDKPLKDAKIVGCTHINAQTAVLIETLAALGATVRWAACNIYSTQNEVAAALADAGYAIFAWRGESEEAFWWCIDQCCTPTSGWQPNMILDDGGDATHLMLKKHPAAFKQIKGTSKIRESIIDALKRSTDLMFGGKQAAVCGYGEVGKGCCQALKALGCVVYVTEIDPICALQAAMDGFRVVKLNEVIRQVDIVITATGNKGVVTRDHMERMKNGCVVCNMGHSNTEVDVHALRTPDLMWERVRSQVDHIIWGNGKRIVLLAEGRLANLCCSSLPSFVVSVTAATQALALIELYNAPAHRYKADVYLLPKKMDEYVASLHLPTFDAHLTELTDEQAKYLGLNKVGPFKPNYYRY-