Monarch geneset OGS2.0

DPOGS200776
TranscriptDPOGS200776-TA855 bp
ProteinDPOGS200776-PA284 aa
Genomic positionDPSCF300370 - 106728-107582
RNAseq coverage53x (Rank: top 70%)
Annotation
HeliconiusHMEL0024002e-13479.36% 
BombyxBGIBMGA012123-TA3e-12880.14% 
DrosophilaCG1753-PB5e-2933.81% 
EBI UniRef50UniRef50_B1M0783e-7254.01%Cysteine synthase n=4 Tax=Proteobacteria RepID=B1M078_METRJ
NCBI RefSeqXP_002162394.16e-4035.53%PREDICTED: similar to C17G1.7 [Hydra magnipapillata]
NCBI nr blastpgi|1707504121e-7154.01%pyridoxal-5'-phosphate-dependent protein subunit beta [Methylobacterium radiotolerans JCM 2831]
NCBI nr blastxgi|1707504123e-7854.01%pyridoxal-5'-phosphate-dependent protein subunit beta [Methylobacterium radiotolerans JCM 2831]
Group
Gene OntologyGO:00081526.4e-74metabolic process
GO:00038246.4e-74catalytic activity
GO:00301706.4e-74pyridoxal phosphate binding
KEGG pathwaymrd:Mrad2831_40172e-72 
 K01738 (cysK)maps-> Selenoamino acid metabolism
    Cysteine and methionine metabolism
    Sulfur metabolism
InterPro domain[1-283] IPR0019266.4e-74Pyridoxal phosphate-dependent enzyme, beta subunit
Orthology groupMCL18033 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200776-TA
ATGAATCCTGGAGCGTCCATCAAATGCCGGTCCTCGCTTTACATGATCACTAAGGCCTTGGAGTCTGGAGCTTTGAAGCCGGGGGAACCGGTCCTGGAAATCACCTCCGGGAATCAGGGATGTGGTCTGGCGGTGGTGTGCGCGGTGCTGGGACATCCTTTGACCGTGACCATGTCTGCTGGGAACAGCGTGCAGAGAGCGATACACATGGAGGCTCTCGGAGCCAGGTGTGTCAGGGTTCCGCAAGTAGAAGGCACATACGGTAATGTGACATTATCTGATGTCAAAGCAGCGGAGGAGAAAGGTTTGAAGCTGGTGGAAGAGACTGGCGCGTACTACGTCAACCAGTTCAACAATGATATGAACTCTGAAGCTCATTACGAAACAACGGGCCCAGAGATCTGGAGACAAACTGGCCAGCGAGTCGACGCGTTCGTCGCCACAGTTGGCACAGCAGGAACCTTCGCTGGGACCTCCAGATATCTCAAGGAAAAGGATCCGAGTATTGTTTGTGTGGTCGTGGAACCAGAAGGGTCTGAGCCCATCAAAGGCTGCGAAGTAACGAAGCCATTGCATTTACTCCAGGGTTCGGGTTATGGATGTGTTCCAAATTTGTTCAACTATGAACACCTGGACGACACCATAAGCGTCAGTGACGAGGAAGTCCTGGAATACAAGAGGCTGATCGGAGAAAAAGAAGGTCTGTTCGTGGGCTACACGAGTGCAGCAAATGTCCTGGCTGCGGCGAAGCTGCTGAAGGCTGGAAAGTTAAAGGAAGACGCCTGGGTGGTGACCGTGCTCTGTGACACAGGCCTGAAGTACACCCCGGTGCCATCAGAGATCACCAAAGCGTAA

Protein sequence:

>DPOGS200776-PA
MNPGASIKCRSSLYMITKALESGALKPGEPVLEITSGNQGCGLAVVCAVLGHPLTVTMSAGNSVQRAIHMEALGARCVRVPQVEGTYGNVTLSDVKAAEEKGLKLVEETGAYYVNQFNNDMNSEAHYETTGPEIWRQTGQRVDAFVATVGTAGTFAGTSRYLKEKDPSIVCVVVEPEGSEPIKGCEVTKPLHLLQGSGYGCVPNLFNYEHLDDTISVSDEEVLEYKRLIGEKEGLFVGYTSAANVLAAAKLLKAGKLKEDAWVVTVLCDTGLKYTPVPSEITKA-