Monarch geneset OGS2.0

DPOGS200777
Transcript: DPOGS200777-TA  822 bp
Protein: DPOGS200777-PA  273 aa
Genomic position: DPSCF300370 - 97930-98751
RNAseq coverage: 73x (Rank: top 66%)
Annotation
Heliconius: HMEL002400  2e-126  77.29%
Bombyx: BGIBMGA012123-TA  8e-120  77.29%
Drosophila: CG1753-PB  5e-30  34.42%
EBI UniRef50: UniRef50_B1M078  3e-72  54.95%  Cysteine synthase n=4 Tax=Proteobacteria RepID=B1M078_METRJ
NCBI RefSeq: XP_002162394.1  5e-41  36.67%  PREDICTED: similar to C17G1.7 [Hydra magnipapillata]
NCBI nr blastp: gi|170750412  1e-71  54.95%  pyridoxal-5'-phosphate-dependent protein subunit beta [Methylobacterium radiotolerans JCM 2831]
NCBI nr blastx: gi|170750412  2e-78  54.95%  pyridoxal-5'-phosphate-dependent protein subunit beta [Methylobacterium radiotolerans JCM 2831]
Group
Gene Ontology: GO:0008152  3.5e-72  metabolic process
               GO:0003824  3.5e-72  catalytic activity
               GO:0030170  3.5e-72  pyridoxal phosphate binding
KEGG pathway: mrd:Mrad2831_4017  2e-72
    K01738 (cysK) maps to:
        Selenoamino acid metabolism
        Cysteine and methionine metabolism
        Sulfur metabolism
InterPro domain: [1-272]  IPR001926  3.5e-72  Pyridoxal phosphate-dependent enzyme, beta subunit
Orthology group: MCL18033  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200777-TA
ATGAATCCTGGAGCGTCCATCAAATGCCGGTCCTCGCTTTACATGATCACTAAGGCCTTGGAGTCTGGAGCTTTGAAGCCGGGGGAACCGGTCCTGGAAATCACCTCCGGGAATCAGGGATGTGGTCTGGCGGTGGTGTGCGCGGTGCTGGGACATCCTTTGACCGTGACCATGTCTGCTGGGAACAGCGTGCAGAGAGCGATACACATGGAGGCTCTGGGAGCCAGGTGTGTGAGGGTTCCGCAGGTGGATGGCACATACGGCAAAGTCACATTGGCGGATGTGACCGCAGCTGAGGAGAAAGGTTTGCAATTGGTGGAAGAGACTGGCGCGTACTACGTCAACCAGTTCAACAATGATATGAACTCTCGAGCTCATTACGAAACAACGGGCCCAGAGATCTGGAGACAGACTGGCCAGCGAGTCGACGCGTTCGTCGCCGCAGTCGGCACAGCAGGAACCTTCGCTGGGACCTCCAGATATCTCAAGGAAAAGGATCCGAGTATTGTTTGTGTGGTCGTGGAACCAGAAGGCTCTGAGGCCATCAAAGGCTGCGAAGTAACGAAGCCAATGCATTTAATGCAGGGCTCGGGTTACGGATGTGTTCCAAATTTGTTCAACTATGAACACCTGGACGACACCATAAGCGTCAGTGACGAGGAAGTCCTGGAATACAAGAGGCTGATCGGAGAAAAAGAAGGTCTGTTCGTGGGCTACACGAGTGCAGCAAATGTCCTGGCTGCGGCGAAGCTGCTGAAGGCTGGAAAGTTGAAGGAGGACGCCTGGGTGGTGACCGTGCTCTGTGACACAGGCCTGAAGTAA

Protein sequence:

>DPOGS200777-PA
MNPGASIKCRSSLYMITKALESGALKPGEPVLEITSGNQGCGLAVVCAVLGHPLTVTMSAGNSVQRAIHMEALGARCVRVPQVDGTYGKVTLADVTAAEEKGLQLVEETGAYYVNQFNNDMNSRAHYETTGPEIWRQTGQRVDAFVAAVGTAGTFAGTSRYLKEKDPSIVCVVVEPEGSEAIKGCEVTKPMHLMQGSGYGCVPNLFNYEHLDDTISVSDEEVLEYKRLIGEKEGLFVGYTSAANVLAAAKLLKAGKLKEDAWVVTVLCDTGLK-
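
Consistency check (illustrative sketch):

The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is not part of the original record. It assumes that the genomic coordinates are 1-based and inclusive, and that the transcript is a coding sequence ending in a stop codon (the nucleotide sequence above ends in TAA). The function names span_length and protein_length are hypothetical helpers introduced only for this example.

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Number of amino acids encoded by a CDS of cds_bp bases.

    Assumes the CDS length is a multiple of three and that the final
    codon is a stop codon, which does not contribute an amino acid.
    """
    codons, remainder = divmod(cds_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of three")
    return codons - 1


# Values taken from the record: scaffold DPSCF300370, positions 97930-98751,
# transcript length 822 bp, protein length 273 aa.
genomic_span = span_length(97930, 98751)
encoded_aa = protein_length(822)
print(genomic_span, encoded_aa)
```

Under these assumptions, the genomic span equals the transcript length (822 bp), which would be consistent with a single-exon gene model, and the coding length corresponds to the reported 273 aa.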