Monarch geneset OGS2.0

DPOGS211078
TranscriptDPOGS211078-TA1365 bp
ProteinDPOGS211078-PA454 aa
Genomic positionDPSCF300007 - 1342991-1345299
RNAseq coverage543x (Rank: top 23%)
Annotation
HeliconiusHMEL0022140.087.89% 
BombyxBGIBMGA003206-TA4e-8886.98% 
DrosophilaCG12264-PA0.079.85% 
EBI UniRef50UniRef50_Q9Y6970.078.30%Cysteine desulfurase, mitochondrial n=632 Tax=root RepID=NFS1_HUMAN
NCBI RefSeqXP_319845.30.081.16%AGAP009094-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1187916080.081.16%AGAP009094-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|910832970.076.59%PREDICTED: similar to cysteine desulfurylase [Tribolium castaneum]
Group
Gene OntologyGO:00065344.3e-195cysteine metabolic process
GO:00301704.3e-195pyridoxal phosphate binding
GO:00310714.3e-195cysteine desulfurase activity
GO:00038241.5e-95catalytic activity
GO:00081524.5e-90metabolic process
KEGG pathwayaga:AgaP_AGAP0090940.0 
 K04487 (iscS, NFS1)maps-> Thiamine metabolism
InterPro domain[54-454] IPR0102404.3e-195Cysteine desulfurase
[52-434] IPR0164549.9e-162Cysteine desulfurase, NifS
[54-431] IPR0154248.7e-108Pyridoxal phosphate-dependent transferase, major domain
[64-302] IPR0154211.5e-95Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[57-418] IPR0001924.5e-90Aminotransferase, class V/Cysteine desulfurase
[303-422] IPR0154222.9e-46Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL12918 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211078-TA
ATGTTTCGATTCTCCAAAATTATTGCGCCCAATCTTATAAAAGGTTTACAGCTTAAACCATTGCAGAATGAAAACGCATTTCTCGTGTTTCTACAAGGAAAGAAATTTTTATCAGATTCCGCAAACGAAAGATTTGCTTTAAAACATGAAGAAATCGGTCGACCACTTTATTTCGACGCACAAGCAACAACTCCTATGGATCCAAGGGTCCTTGATGCGATGTTACCATATCTAGTCAGCCTCCATGGCAATCCACATTCTAGAACACATGCCTATGGTTGGGAGAGTGAAGCAGGAGTTGAAAAAGCTAGAGAACAAGTTGCGAGTTTGATTGGCGCTGACCCCAAGGAAATCGTATTTACATCTGGAGCCACAGAATCAAATAACATTTCTGTCAAAGGGGTTGCTCATTTCTATGCGCCAAGAAAGAAGCATATTGTGACCACACAAATTGAACACAAGTGTGTACTGGATTCTTGTCGAGCTTTAGAAGGAGAAGGTTTCAAAATAACATATTTGCCTGTAGGCCCAAATGGAATTATCAATATGAAAGATCTGGAAAATGCTCTCACACCTGAAACAAGCTTAGTGTCTATTATGACTGTCAATAATGAGATAGGAGTAATGCAACCGATTAGTGAAATTGGAGCTTTATGCAAAAGCAAAAAGATATTTTTCCATACTGATGCAGCACAGGCAGTTGGCAAAGTACCATTAGATGTAAATAGCATGAATATTGACCTCATGTCAATTTCTGGTCATAAAGTCTATGGACCAAAAGGAGTTGGAGCTCTATTTATCAGGCGTCGTCCAAGAGTTCGTGTTGAGCCAATCCAAAGCGGTGGTGGCCAAGAAAGGGGTATGAGGAGTGGAACAGTACCTACACCTTTAGTTGTTGGATTAGGTGCAGCCTGTGAGTTAGCACAGCATGAAATGGCTTATGATCATGCTTGGATGGAGACTTTAGCCCAAAGATTCTTAGATAAAATTTACTCTAAACTATCACATGTCATTAGAAATGGAGATCCAAAGCAGACTTATCCGGGATGCATCAATTTATCATTTGCATATGTTGAAGGTGAATCATTGCTAATGGCATTGAAAGATGTGGCTCTATCAAGTGGTTCTGCTTGCACATCAGCGTCCCTCGAACCGTCATATGTTTTAAGAGCTATCGGTGCAGATGAAGATTTAGCACACAGTTCTATAAGATTTGGACTAGGAAGGTTCACGACGATTGAGGAAGTAGATTACACAGCTGAAAAAACAATAAGACATGTAGAAAGGCTTAGAGAAATGAGTCCTCTCTGGGAGATGGTACAAGAGGGAGTCGACTTAAAAAACATTCAATGGTCTCAGCACTAA

Protein sequence:

>DPOGS211078-PA
MFRFSKIIAPNLIKGLQLKPLQNENAFLVFLQGKKFLSDSANERFALKHEEIGRPLYFDAQATTPMDPRVLDAMLPYLVSLHGNPHSRTHAYGWESEAGVEKAREQVASLIGADPKEIVFTSGATESNNISVKGVAHFYAPRKKHIVTTQIEHKCVLDSCRALEGEGFKITYLPVGPNGIINMKDLENALTPETSLVSIMTVNNEIGVMQPISEIGALCKSKKIFFHTDAAQAVGKVPLDVNSMNIDLMSISGHKVYGPKGVGALFIRRRPRVRVEPIQSGGGQERGMRSGTVPTPLVVGLGAACELAQHEMAYDHAWMETLAQRFLDKIYSKLSHVIRNGDPKQTYPGCINLSFAYVEGESLLMALKDVALSSGSACTSASLEPSYVLRAIGADEDLAHSSIRFGLGRFTTIEEVDYTAEKTIRHVERLREMSPLWEMVQEGVDLKNIQWSQH-