Monarch geneset OGS2.0

DPOGS202278
Transcript: DPOGS202278-TA, 3090 bp
Protein: DPOGS202278-PA, 1029 aa
Genomic position: DPSCF300032 - 332179-342885
RNAseq coverage: 53x (Rank: top 70%)
Annotation
Heliconius: HMEL014964, 2e-160, 55.63%
Bombyx: BGIBMGA000065-TA, 3e-111, 45.38%
Drosophila: CG1753-PB, 7e-113, 61.38%
EBI UniRef50: UniRef50_H0VAY2, 8e-114, 53.62%, Uncharacterized protein n=12 Tax=Amniota RepID=H0VAY2_CAVPO
NCBI RefSeq: XP_002054857.1, 4e-121, 60.29%, GJ24676 [Drosophila virilis]
NCBI nr blastp: gi|326431462, 5e-120, 62.35%, cystathionine-beta-synthase [Salpingoeca sp. ATCC 50818]
NCBI nr blastx: gi|289740337, 1e-117, 60.76%, cystathionine beta-synthase s [Glossina morsitans morsitans]
Group
Gene Ontology: GO:0008152, 9.2e-84, metabolic process
    GO:0003824, 9.2e-84, catalytic activity
    GO:0030170, 9.2e-84, pyridoxal phosphate binding
KEGG pathway: dvi:Dvir_GJ24676, 1e-120
    K01697 (E4.2.1.22, CBS) maps-> Glycine, serine and threonine metabolism
        Selenoamino acid metabolism
        Cysteine and methionine metabolism
InterPro domain: [29-346] IPR001926, 9.2e-84, Pyridoxal phosphate-dependent enzyme, beta subunit
Orthology group: MCL11242, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202278-TA
ATGTTCAACTTATTTCAGGCTAAGCATAAATACAGTTTAGGTCACAAAGTTGCACACTTAAATCATCGGAGGTCGCCATATGTCAACAAGGTATACAATAGCATATTGGACTTTGTTGGGAACACGCCGCTTATAAAACTGAACAAAATACCAAAGGATTATGGAATCCATTGTGATATTTACGCAAAATGTGAGTTCATGAACCCGAGTGGCTCGGCAAAAGATAGAATAGCGTTGGCGATGCTTCAGGATGCTGAAAAAGGCGGCTTCATTAAGGAAGATTCAAAATTTGTTGAGCCAACTTCCGGAAACACCGGCATTGGGATTGCCTTCAACAGTGCTCTAATGGATAAGAAATGCTACATCGTCATGGGCGAAAAGAACTCAAGAGAAAAACTAACTACTATGCAGGTTCTTGGAGCTGAAACAATCAAAACCAATGGCACATCAATACAAGTTGCTCGAGATATTAAAGATTCTGATCCTGAGAATTATGTTATGTTAGATCAATTCGAAAACAATGTTAATCCGAGAGTTCATTACGAGAACACGGGAGTTGAAATTTTGGAGGCGTTGGGCGATGTAGACATGTTTGTCGTTGGTTGTGGCACCGGTGGCACTTTATCTGGTGCTGGCCAAAGGATCAAAGAGAAGTGTCCGAAGTGCACCATAATTGCTGCGGAGCCAGCTGGATCAACTATGTTTAATGTATCCGGAATACCACATCCGTATTTGGTAGAAGGAATAGGAGGACAGGAAGTTCCTATTGTCCTTGACAAATCTTTAGTAGACGATTTCGAAATTGTGACGGATAAAGAAGCGTTTCTCATGTCGCGGGAAATAATTAGAAAGGAAGGATTACTTTGTGGAGGCAGCAGCGGTGCGGCAATGGTTGCAGCAATAAAAGCAATCCAGAAGCGAAAATTCACTGCCGGTCAGCATGTGGTAGTCGTGCTACCAGACAGCATTCGTAACTACATGACCAAATTCGTCACTGATCAATGGATGGAAGCTCACCTCTTTATTGATCCACCACAACACGCTTCACCGTGGTGGAATGATCCCGTCACTCACTTAACCCTGGGACACACTTACCCTATTCTAAGCAGTGACCAAACGTGCTCTGAAGCCATACTCGAAATGATGAAAGAGAATATTGCTATTGTGGTAGAGACGAACGGAAATTTCTTAGGAGCAGTGACAAAGGATGGCCTGCGTTCAGAAGCAACAAATCCAATGAGACTTCCACACAAAAGTATTCAAGAGCTAAACTTCCAAGACTTTGTATCTGATCATTTGGTCAAAGATTGTTTCACACTGGCCAAGAACAGTGAACGTGGAATGCCAACTATAGGTCTATTATCCCGCATGTTGGACGTAGCACAATTTGTTGTCATCGGAAGAAATGTACATGAACTTGGACAAACTCACTTTGTGGCTGAAAGCGTGGCGACAGCAGACGATGTCCTAAATTATATTTTTGTTAATCGTACACAAAAAAAAAATGCAAAATGTGAATTCGTGAATCCTGGCGGTTCAGTGAAAGACCGCATAGCTTATAGAATGGTTTTGGATGCTGAGAAGAAGGGAATATTGAAACCTGGAAAATCGGTGATTGTTGAACCAACTTCTGGAAACACTGGAATTGGACTAGCTTTAGCAGCGGCTGTGAGAGGCTACAGATGTATAATTGTTTTACCGGAAAAGATGTCAAATGAGAAAGTGCTCACCTTGCATGCTTTGGGAGCCGAGATCATCAGAACACCCACTGAAGTGGCGTGGGACTCCCCTGAGAGTAACATCATGGTCGCTAAACGCTTGTCCACAGAAATCCCGAACGCTGTATTGTTGGATCAGTATAACAACGCTAGCAATCCGTTAGCCCATTATGACGGAACCGCTGAAGAGATACTCTGGTCTCTGGATAATGATGTCGACATGGTGGTATTGGGAGCGGGCACCTGTGGTACCATATCTGGAATTGCTCACAAGATTAAGGA
AAAGTGCCCAAAATGCGTGGTAGTTGGAGTAGACCCACACGGCTCCGTGCTGGCCCCACCAGATAAACTTAATGAAAATGACGTAGAAATATATGAGGTAGAAGGAATCGGCTACGATTTCTTACCACAAGCGTTAGATCTTAAAATAATAGACAAATGGATAAAAACAGAGGACAAGTGGTCCTTCCAGATGGCGAGAAGACTTATCAAGGAGGAAGGACTGCTTTGCGGAGGTAGCAGTGGCGCCGCTATGTGGGGTGCCATACAAGCAGCGAAGTCTTTAGTGGCGGGTCAGAAATGCGTGGTCCTATTACCGGATAACATACGCAACTATATGACAAAATTCCTCACTGACCAATGGATGGAGGTCCGCGGATACAAATCCATCGAAAGTAATGATAATTTATGGTGGTGGAACAAACCTTTGACGGAAGGTTTAGTGCGTATCACAAAGAATATATGTCAGAACAGCACAACCTCGGAGGCGATAAAAGCTTTGAGGGAAAGAGGCAGCAGCGGTGCGGCAATGGTTGCAGCAATAAAAGCAATCCAGAAGCGAAAATTCACTGCCGGTCAGCATGTGGTAGTCGTGCTACCAGACAGCATTCGTAACTACATGACCAAATTCGTCACTCTAGTTACCTACCTTCGAGTGGAGTTTATTGATCCGCCACAACACGCTTCACCGTGGTGGAATGATCCCGTCACTCACTTAACCCTGGGACACACTTACCCTATTCTAAGCAGTGACCAAACGTGCTCTGAAGCCATACTCGAAATGATGAAAGAGAATATTGCTATTGTGGTAGAGACGAACGGAAATTTCTTAGGAGCAGTGACAAAGGATGGCCTGCGTTCAGAAGCAACAAATCCAATGAGACTTCCACACAAAAGTATTCAAGAGCTAAACTTCCAAGACTTTGTATCTGATCATTTGGTCAAAGATTGTTTCACACTGGCCAAAAACAGTGAACGTGGAATGCCAACTATAGGTCTATTATCCCGCATGTTGGACGTAGCACAATTTGTTGTCATCGGAAGAAATGTACATGAACTTGGACAAAGTAAATCCAATATGACACGTAAATAA

Protein sequence:

>DPOGS202278-PA
MFNLFQAKHKYSLGHKVAHLNHRRSPYVNKVYNSILDFVGNTPLIKLNKIPKDYGIHCDIYAKCEFMNPSGSAKDRIALAMLQDAEKGGFIKEDSKFVEPTSGNTGIGIAFNSALMDKKCYIVMGEKNSREKLTTMQVLGAETIKTNGTSIQVARDIKDSDPENYVMLDQFENNVNPRVHYENTGVEILEALGDVDMFVVGCGTGGTLSGAGQRIKEKCPKCTIIAAEPAGSTMFNVSGIPHPYLVEGIGGQEVPIVLDKSLVDDFEIVTDKEAFLMSREIIRKEGLLCGGSSGAAMVAAIKAIQKRKFTAGQHVVVVLPDSIRNYMTKFVTDQWMEAHLFIDPPQHASPWWNDPVTHLTLGHTYPILSSDQTCSEAILEMMKENIAIVVETNGNFLGAVTKDGLRSEATNPMRLPHKSIQELNFQDFVSDHLVKDCFTLAKNSERGMPTIGLLSRMLDVAQFVVIGRNVHELGQTHFVAESVATADDVLNYIFVNRTQKKNAKCEFVNPGGSVKDRIAYRMVLDAEKKGILKPGKSVIVEPTSGNTGIGLALAAAVRGYRCIIVLPEKMSNEKVLTLHALGAEIIRTPTEVAWDSPESNIMVAKRLSTEIPNAVLLDQYNNASNPLAHYDGTAEEILWSLDNDVDMVVLGAGTCGTISGIAHKIKEKCPKCVVVGVDPHGSVLAPPDKLNENDVEIYEVEGIGYDFLPQALDLKIIDKWIKTEDKWSFQMARRLIKEEGLLCGGSSGAAMWGAIQAAKSLVAGQKCVVLLPDNIRNYMTKFLTDQWMEVRGYKSIESNDNLWWWNKPLTEGLVRITKNICQNSTTSEAIKALRERGSSGAAMVAAIKAIQKRKFTAGQHVVVVLPDSIRNYMTKFVTLVTYLRVEFIDPPQHASPWWNDPVTHLTLGHTYPILSSDQTCSEAILEMMKENIAIVVETNGNFLGAVTKDGLRSEATNPMRLPHKSIQELNFQDFVSDHLVKDCFTLAKNSERGMPTIGLLSRMLDVAQFVVIGRNVHELGQSKSNMTRK-
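
The reported transcript length (3090 bp) and protein length (1029 aa) can be cross-checked with a short script. The following is a minimal sketch, not part of the original record. It assumes the transcript is coding sequence only (it starts with ATG and ends with TAA) and that the trailing "-" in the protein sequence marks the stop codon. The helper names read_fasta and cds_matches_protein are hypothetical.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def cds_matches_protein(cds_len, protein_len):
    """True if a CDS of cds_len bases encodes protein_len residues plus one stop codon."""
    return cds_len % 3 == 0 and cds_len // 3 == protein_len + 1


# Lengths as reported in the record: 3090 bp transcript, 1029 aa protein.
# 3 * (1029 + 1) = 3090, so the two values are consistent under the
# CDS-only assumption stated above.
print(cds_matches_protein(3090, 1029))
```

To check the sequences themselves rather than the reported lengths, pass the two FASTA records through read_fasta and compare len() of the nucleotide string with len() of the protein string after stripping the trailing "-".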