Monarch geneset OGS2.0

DPOGS208630
Transcript        DPOGS208630-TA  993 bp
Protein           DPOGS208630-PA  330 aa
Genomic position  DPSCF300730 + 2053-4147
RNAseq coverage   61x (Rank: top 68%)
Annotation
Heliconius      HMEL020066        1e-82   65.32%
Bombyx          BGIBMGA005042-TA  1e-72   62.87%
Drosophila      CG1665-PA         8e-49   36.05%
EBI UniRef50    UniRef50_Q2F5P5   6e-118  61.49%  Mo-molybdopterin cofactor sulfurase n=4 Tax=Obtectomera RepID=Q2F5P5_BOMMO
NCBI RefSeq     NP_001040259.1    1e-118  61.49%  Mo-molybdopterin cofactor sulfurase [Bombyx mori]
NCBI nr blastp  gi|114052577      2e-117  61.49%  Mo-molybdopterin cofactor sulfurase [Bombyx mori]
NCBI nr blastx  gi|114052577      1e-113  61.49%  Mo-molybdopterin cofactor sulfurase [Bombyx mori]
Group
Gene Ontology  GO:0030151  2.9e-21  molybdenum ion binding
               GO:0003824  2.9e-21  catalytic activity
               GO:0030170  2.9e-21  pyridoxal phosphate binding
KEGG pathway   ame:551437  2e-35
               K01205 (NAGLU)  maps-> Lysosome
                                      Glycosaminoglycan degradation
InterPro domain  [43-162]   IPR005303  1e-23    MOSC, N-terminal beta barrel
                 [195-309]  IPR005302  2.9e-21  Molybdenum cofactor sulfurase, C-terminal
                 [37-330]   IPR011037  5e-09    Pyruvate kinase-like, insert domain
Orthology group  MCL10358  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208630-TA
ATGTCACAGGGGTCGTATCTAAGTGCAGCGGTCGCTACAACAGGCGTCCTCGCAGGTGCATACTGTGCGTATCATCTTTATAATGAAGCTCAGAAACGTAAGCTACCAACAACATGGAAGGAAGTTGGCACTTTGAAGGATATTTATATATATCCTATAAAATCTTGCGGTCCGGTGCAAAAGGACAGAGCTGAATGCACTTTACTCGGTCTAAAAGACGGTTGGTTGCGAGACAGAGTATTGATGGTAATTGACGGTAAAAATAATTTTATCACTGCAAGAGGGTATCCACAATTACTTTCTATTCGTCCAACAGTAAGAAATTCTGTCTTAACTTTACAACATAATGATATGGAAATACTCAACGTTGATCTATCAGAGGTTCCTTTACAATCGGTAGAAACCGCAACTGTTTGGGGTGTTGAGGTACCTGTATACGATTGCGGCTTTGACGCAAGCGAATGGGTTTCAAGATTACTAGACAAGTCTGCAAACAACTTCAGATTAGTACTTTATGCTTCCAACAATTCTCGCAAGTTAAAAAGACCCGCCAATAATGTGTATAAGTTCCGCAAGACAGATACGGGAGCACTTCCAGATGAATTACCATTTCATTTAATGAATGAAACCTCTATAGACGATCTCAATACTAAATTGCAAGGAAACAAAGTTTGCTACAAAAACTTCAGACCGAACTTCTTAATTACCGGCGCTCGGCCTTACGAAGAAGATGATTGGAAATATGTTAAAATCGGAGAAAACATTTTCGAAGTAATCAAGCCATGCACGAGATGTATGTTGACAACGATCGATCCAGAGACAGGTACACGTGATTCTAAATCCGAACCTATTCAGACGTTGAAAAGTTACCGCCAGATAACTGATTCTAGTGCTAGACCTTGGTCCGGCAGTGCTCCACGAATGGGCATACACCTGGCGCTCCGGTCAAAGAATGGCCTTGTCTCGATCAATGATCGTATTTACGTAGATTAA

Protein sequence:

>DPOGS208630-PA
MSQGSYLSAAVATTGVLAGAYCAYHLYNEAQKRKLPTTWKEVGTLKDIYIYPIKSCGPVQKDRAECTLLGLKDGWLRDRVLMVIDGKNNFITARGYPQLLSIRPTVRNSVLTLQHNDMEILNVDLSEVPLQSVETATVWGVEVPVYDCGFDASEWVSRLLDKSANNFRLVLYASNNSRKLKRPANNVYKFRKTDTGALPDELPFHLMNETSIDDLNTKLQGNKVCYKNFRPNFLITGARPYEEDDWKYVKIGENIFEVIKPCTRCMLTTIDPETGTRDSKSEPIQTLKSYRQITDSSARPWSGSAPRMGIHLALRSKNGLVSINDRIYVD-
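
Consistency check (illustrative sketch):

The reported lengths agree with each other if the transcript is taken to be the coding sequence including its stop codon: 993 bp = 3 x (330 aa + 1 stop). The Python sketch below shows one way such a check could be run on FASTA records like the two above. The function names `parse_fasta` and `lengths_consistent` and the toy records are hypothetical and not part of the geneset. Treating a trailing `-` as the stop marker is an assumption based on how the protein record above ends.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict mapping record ID -> sequence."""
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # Header line: the ID is the first whitespace-delimited token.
            current_id = line[1:].split()[0]
            records[current_id] = []
        elif current_id is not None:
            records[current_id].append(line)
    return {rid: "".join(parts) for rid, parts in records.items()}


def lengths_consistent(transcript_bp, protein_aa, has_stop_codon=True):
    """Return True if a coding transcript length matches a protein length.

    Assumes the transcript is CDS only (no UTRs); one extra codon is
    counted for the stop when has_stop_codon is True.
    """
    codons = protein_aa + (1 if has_stop_codon else 0)
    return transcript_bp == 3 * codons


# Toy records (hypothetical), laid out like the entries above.
example = """>toy-TA
ATGGCT
TAA

>toy-PA
MA-
"""

records = parse_fasta(example)
transcript = records["toy-TA"]
# Assumption: a trailing "-" (or "*") marks the stop and is not a residue.
protein = records["toy-PA"].rstrip("-*")

toy_ok = lengths_consistent(len(transcript), len(protein))
# Lengths reported for this gene: 993 bp transcript, 330 aa protein.
reported_ok = lengths_consistent(993, 330)
```

To check the real records, pass the two sequence blocks above to `parse_fasta` and compare the measured lengths rather than the reported ones.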