Monarch geneset OGS2.0

DPOGS215780
TranscriptDPOGS215780-TA1188 bp
ProteinDPOGS215780-PA395 aa
Genomic positionDPSCF300041 + 1841872-1850260
RNAseq coverage524x (Rank: top 24%)
Annotation
Heliconius% 
BombyxBGIBMGA003656-TA1e-15868.53% 
DrosophilaEip55E-PA1e-14762.53% 
EBI UniRef50UniRef50_Q17DR11e-14357.85%Cystathionine beta-lyase n=7 Tax=cellular organisms RepID=Q17DR1_AEDAE
NCBI RefSeqNP_001040113.10.075.38%cystathionine gamma-lyase [Bombyx mori]
NCBI nr blastpgi|1140512390.075.38%cystathionine gamma-lyase [Bombyx mori]
NCBI nr blastxgi|1140512394e-17475.38%cystathionine gamma-lyase [Bombyx mori]
Group
Gene OntologyGO:00065201.1e-234cellular amino acid metabolic process
GO:00301701.1e-234pyridoxal phosphate binding
GO:00038243.3e-91catalytic activity
KEGG pathwayaga:AgaP_AGAP0111722e-153 
 K01758 (E4.4.1.1)maps-> Nitrogen metabolism
    Glycine, serine and threonine metabolism
    Selenoamino acid metabolism
    Cysteine and methionine metabolism
InterPro domain[4-396] IPR0002771.1e-234Cys/Met metabolism, pyridoxal phosphate-dependent enzyme
[13-391] IPR0154248.9e-121Pyridoxal phosphate-dependent transferase, major domain
[13-257] IPR0154213.3e-91Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[258-393] IPR0154222.5e-51Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL12805 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215780-TA
ATGGGTGATCAAGGATTCCTAAAACAGAAGCCAGGTTTTTCCACTCTAGCAATCCATGCGGGCCAGAGTCCTGATAAGTGGAGACATGCTAGCGTAGTGACTCCCATTGTAACATCAACCACATTCAAACAGCCAGCCCCAGCGGAACACACGGGCTATGAATATGGTAGATCTGGGAATCCTACCAGGAACACTTTGGAGGAATGCTTGGCGGCCTTGGACGGAGGCAAGCATGGCTTCACGTTTGCCTCAGGTCTCGGTGCTACAACTACAATATTTTCGCTTCTGAAACAGGGTGATGAAATCATTTGCTGCGACGACGTGTACGGAGGAACAAATAGACTGTTTAGGAGAGTAGCCGCGCCGTTTGGTATAGAAATACATTTTATAGATTTCTCAGATCTGGGTTTATTAGATAGAACAATAAATGGTAAAACAAAGTTAGTATGGATGGAAACACCAACAAATCCTATGCTGAAAGTGTTGGACATAAAGGCTGTATCCAAAATAGTGAAATCCCATAGCGATGATATAATATTGGTGGTTGATAATACTTTCCTGACGCCGTACCTCCAAAGACCCTTGGACTTCGGGGCGGATATCGTCATGTATTCCGTCACCAAATACATGAACGGTCACGCCGATGTTATCATGGGAGCAGCCGTAGTCAATAACGATGATCTAGCCAGCAAGTTACGATTCCTACAAAACTCAATGGGGATCGTACCTTCGCCAATGGATTGCTACCTGGTAATACGTAGTCTGAAAACACTGGCCCTGAGAATGGAACATCACAAGAAATCGTCTCTCAAGATAGCGGAGTGGCTTCTCAAACATCCGAAGGTCGTTGAAGTTATGCATCCAGGTCTCCCCTCCCACCCTCAGCACGAGATCGCCAGACGTCAGAGTACAGGTCACTCGGGCGTGTTCAGCTTCCGTCACTCCGGCGGGCTCCAGGAGTCCAGGAAGTTCTTCAGCGCCATCAAGGTGTTCATACTGGCGGAAAGTCTCGGCGGATATGAGAGCCTGGCGGAGCTGCCGTCTCTAATGACTCACGCTTCCGTCCCAGCTGAACAAAGGGAACAGCTGGGAATCTCCGACTCATTGATAAGGCTGTCCGTCGGTCTGGAGGAGACGGATGACCTGATACAAGACCTGGAACAAGCTCTGGATGCAGCATTCAAATAA

Protein sequence:

>DPOGS215780-PA
MGDQGFLKQKPGFSTLAIHAGQSPDKWRHASVVTPIVTSTTFKQPAPAEHTGYEYGRSGNPTRNTLEECLAALDGGKHGFTFASGLGATTTIFSLLKQGDEIICCDDVYGGTNRLFRRVAAPFGIEIHFIDFSDLGLLDRTINGKTKLVWMETPTNPMLKVLDIKAVSKIVKSHSDDIILVVDNTFLTPYLQRPLDFGADIVMYSVTKYMNGHADVIMGAAVVNNDDLASKLRFLQNSMGIVPSPMDCYLVIRSLKTLALRMEHHKKSSLKIAEWLLKHPKVVEVMHPGLPSHPQHEIARRQSTGHSGVFSFRHSGGLQESRKFFSAIKVFILAESLGGYESLAELPSLMTHASVPAEQREQLGISDSLIRLSVGLEETDDLIQDLEQALDAAFK-