Monarch geneset OGS2.0

DPOGS211094
TranscriptDPOGS211094-TA1293 bp
ProteinDPOGS211094-PA430 aa
Genomic positionDPSCF300007 - 1111303-1113545
RNAseq coverage3280x (Rank: top 4%)
Annotation
HeliconiusHMEL0124780.092.09% 
BombyxBGIBMGA011168-TA0.093.59% 
DrosophilaAhcy13-PA0.078.37% 
EBI UniRef50UniRef50_Q275800.078.37%Adenosylhomocysteinase n=323 Tax=root RepID=SAHH_DROME
NCBI RefSeqNP_001093271.10.094.19%S-adenosyl-L-homocysteine hydrolase [Bombyx mori]
NCBI nr blastpgi|1537918170.094.19%S-adenosyl-L-homocysteine hydrolase [Bombyx mori]
NCBI nr blastxgi|1537918170.094.19%S-adenosyl-L-homocysteine hydrolase [Bombyx mori]
Group
Gene OntologyGO:00040137.4e-200adenosylhomocysteinase activity
GO:00067307.4e-200one-carbon metabolic process
KEGG pathwayaag:AaeL_AAEL0083410.0 
 K01251 (E3.3.1.1, ahcY)maps-> Selenoamino acid metabolism
    Cysteine and methionine metabolism
InterPro domain[2-430] IPR0000430Adenosylhomocysteinase
[190-350] IPR0158782.7e-81S-adenosyl-L-homocysteine hydrolase, NAD binding
Orthology groupMCL12809 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211094-TA
ATGAAGCCACCATACAAAATTGCTGATGAGAAATTGGCGGAACTCGGCCGTAAAGAAATTATGTTAGCTGAGAAAGAAATGCCGGGACTAATGGCATGCCGTCGCAAATATGCTCCTAGCAAAATTCTCAAGGGAGCCAGAATAGCTGGAAGTCTTCACATGACAGTACAAACAGCTGTACTTATTGAAACTCTAATTGAGTTAGGTGCAGAGGTACAATGGTCTAGCAGTAACATATTCAGTACACAAGATGAGGCAGCCGCTGCCCTAGTGGCCGTCGGAATACCTATTTATGCCTGGAAGGGTGAAACTGATGAAGAATACATTTGGTGTATTGAACAAACTTTGTTCTTCAATGACGGAAAGCCCTTGAACATGATTTTGGATGATGGTGGAGATCTTACAAACCTGGTTCACAAGAAATATCCACAACTACTTGAGAATATCAAGGGTATCTCAGAGGAAACAACCACTGGTGTTCACAACTTATACAAAATGTTCCGAGAAGGTCTTCTTAAAGTGCCTGCTATTAATGTGAACGACTCTGTAACAAAAAGCAAATTTGATAACTTATATGGCTGCCGGGAGTCACTTCTTGATGGAATTAAAAGAGCAACTGACATTATGGTTGCTGGTAAGGTCTGCGTTGTAGGGGGTTATGGAGATGTCGGAAAGGGCTGTGCCCAAGCATTCAAGGGCTTTGGAGGCAGGGTAATTGTCACTGAAATTGACCCCATTAATGCCCTACAAGCAGCTATGGAAGGTTTCCAAGTAACAACAATGGATGAGGCGGCTGAGATTGGACAAATCTTTGTCACCACCACTGGGAATATTGACATAATATGCAAGGAGCACTTCCTTAAGATGAAAGATGATGCTATCGTTTGTAACATTGGACACTTTGACTGTGAGATTGATGTAGCTTGGCTGGAAAAGAATGCCAAGAAAGTCAATATTAAACAGCATGTGGACCGCTATGAACTTGAAAACGGAAATCATATAATTGTCCTAGCCGCTGGAAGGCTAGTTAACTTGGGATGTGCCACCGGTCACTCTTCATTTGTTATGTCCAACTCTTTCACAAATCAAGTCTTAGCGCAAATTGAACTCTGGACAAAACACAACACATACCCTATTGGAGTCCACACCTTGCCTAAGAAGCTTGATGAAGAGGTAGCAGCATTACACTTAGACCATTTAGGAGTGAAACTTACTAAGCTAACTCCTAAACAAGCTCAGTACATTGGTGTGCCAGTCGAAGGTCCCTACAAGCCTGACCACTATAGATACTGA

Protein sequence:

>DPOGS211094-PA
MKPPYKIADEKLAELGRKEIMLAEKEMPGLMACRRKYAPSKILKGARIAGSLHMTVQTAVLIETLIELGAEVQWSSSNIFSTQDEAAAALVAVGIPIYAWKGETDEEYIWCIEQTLFFNDGKPLNMILDDGGDLTNLVHKKYPQLLENIKGISEETTTGVHNLYKMFREGLLKVPAINVNDSVTKSKFDNLYGCRESLLDGIKRATDIMVAGKVCVVGGYGDVGKGCAQAFKGFGGRVIVTEIDPINALQAAMEGFQVTTMDEAAEIGQIFVTTTGNIDIICKEHFLKMKDDAIVCNIGHFDCEIDVAWLEKNAKKVNIKQHVDRYELENGNHIIVLAAGRLVNLGCATGHSSFVMSNSFTNQVLAQIELWTKHNTYPIGVHTLPKKLDEEVAALHLDHLGVKLTKLTPKQAQYIGVPVEGPYKPDHYRY-