Monarch geneset OGS2.0

DPOGS202279
TranscriptDPOGS202279-TA1410 bp
ProteinDPOGS202279-PA469 aa
Genomic positionDPSCF300032 - 316868-329036
RNAseq coverage237x (Rank: top 43%)
Annotation
HeliconiusHMEL0047223e-7278.98% 
BombyxBGIBMGA004935-TA2e-10662.79% 
DrosophilaCG1753-PB5e-8950.14% 
EBI UniRef50UniRef50_P355202e-8950.00%Cystathionine beta-synthase n=149 Tax=root RepID=CBS_HUMAN
NCBI RefSeqXP_002054857.12e-9440.53%GJ24676 [Drosophila virilis]
NCBI nr blastpgi|3264314622e-9551.26%cystathionine-beta-synthase [Salpingoeca sp. ATCC 50818]
NCBI nr blastxgi|3264314624e-9643.79%cystathionine-beta-synthase [Salpingoeca sp. ATCC 50818]
Group
Gene OntologyGO:00081522.8e-55metabolic process
GO:00038242.8e-55catalytic activity
GO:00301702.8e-55pyridoxal phosphate binding
KEGG pathwaydvi:Dvir_GJ246765e-94 
 K01697 (E4.2.1.22, CBS)maps-> Glycine, serine and threonine metabolism
    Selenoamino acid metabolism
    Cysteine and methionine metabolism
InterPro domain[148-374] IPR0019262.8e-55Pyridoxal phosphate-dependent enzyme, beta subunit
Orthology groupMCL25084 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202279-TA
ATGGCTAACGGCATCTGTATCGGATCTCAGGAATTCTTCAATCTCAAAGAAATGAGACACATCGTGAAAAATCTGGACAAAAATTCAAAAACCTATGATCATATTTTAGATGCTATTGGGAAAACACCAATGGTGAAACTATCCAAGATACCAAAGGACGAAGGCTTGAAGTGTGACATGTGCGATAAGGTCTATTCCGGGGCTCAGTCGAAAGATCTTCAATTGTCTAAGTTTCATTTACTTCGTAACTGTTCAATCAGTGGAATATTTAGCGGAGACAACGCTAGACGGAGACTGGCTCATTTGTCAGGATCCACAGAAGAGCCTCTTGATAAATTTACAGTAAAGAAGTTTTACAAAGTGGACCTCTCTGACAATCCAACGCTGGGCTGTGTATCGAGAATACTGGATATAGCACCTTATGTTGTGGTTGTCAAAACAGGATACAGATGTATTATTGTGCTACCGGAAAAGATGTCAGATGAAAAGGTGAACACGCTCCGTGCATTGGGTGCAGAAATCATCAGGACTCCGACCGAGGCAGCTTCTGAGTCCCCCGATAGTAACATCATGGTTGCGAGACGTTTGTCAAACGAAATACCTGATGCTGTACTCTTAGATCAGTACAATAATGTGTGCAACCCGTTAGCTCATTATGATGGCACCGCTGAAGAAATCCTATGGTCGCTTGAGAATGACGTGGATATGGTGGTGATAGGAGCTGGAACTTGTGGGACAATCTCCGGGGTCGCACACAAAATCAAGGAAAAGTGTCCCAAATGTGTAGTCGTTGGAGTGGACCCCTATGGATCAATTCTGGCTCAGCCTGAAGAGTTAAATGAGAGTGATGTAATGGTGGTGGAAGGTATTGGATATGATTTCTTGCCGAAGGCTCTAGACAGGACGGTCATTGACAAATGGGTGAAGACGGAAGACAAGACCTCACTGAACATGGCCAGGAGGTTGATCAAAGAGGAAGGACTGCTATGCGGAGGTAGCAGCGGGTCCGCGATGTGGGGAGCCATCCAAGCGGCGAAATCTCTTAAAGCAGGTCAAAGATGTGTGGTTCTATTACCAGACAATATACGTAACTATATGACGAAATTCATCTCTGACCAGTGGATGGAGGCCCGCGGTTTCAAGCCATACGATAATAACGAAAAACTGTGTTCAATCAGTGGAATATTTAGCGGAGACAACGCTAGACGGACACTGGCTCATTTGTCAGGATCCACAGAAGAGCCTCTTGATAAATTTACAGTAAAGAAGTTTTACAAAGTGGACCTCTCTGACAATCCAACGCTGGGCAGTGTATCGAGAATACTGGATATAGCACCTTATGTTGTGGTTGTCAAAACAGGTAAATCATGTTTATTGTATAGAATTCTCGAAGTGGCCTTAAATTTTTGA

Protein sequence:

>DPOGS202279-PA
MANGICIGSQEFFNLKEMRHIVKNLDKNSKTYDHILDAIGKTPMVKLSKIPKDEGLKCDMCDKVYSGAQSKDLQLSKFHLLRNCSISGIFSGDNARRRLAHLSGSTEEPLDKFTVKKFYKVDLSDNPTLGCVSRILDIAPYVVVVKTGYRCIIVLPEKMSDEKVNTLRALGAEIIRTPTEAASESPDSNIMVARRLSNEIPDAVLLDQYNNVCNPLAHYDGTAEEILWSLENDVDMVVIGAGTCGTISGVAHKIKEKCPKCVVVGVDPYGSILAQPEELNESDVMVVEGIGYDFLPKALDRTVIDKWVKTEDKTSLNMARRLIKEEGLLCGGSSGSAMWGAIQAAKSLKAGQRCVVLLPDNIRNYMTKFISDQWMEARGFKPYDNNEKLCSISGIFSGDNARRTLAHLSGSTEEPLDKFTVKKFYKVDLSDNPTLGSVSRILDIAPYVVVVKTGKSCLLYRILEVALNF-