Monarch geneset OGS2.0

DPOGS204159
Transcript: DPOGS204159-TA, 1323 bp
Protein: DPOGS204159-PA, 440 aa
Genomic position: DPSCF300034 - 554805-563746
RNAseq coverage: 693x (Rank: top 18%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL021610        1e-80    38.20%
Bombyx          BGIBMGA005042-TA  4e-68    57.97%
Drosophila      CG1665-PA         6e-40    32.18%
EBI UniRef50    UniRef50_Q2F5P5   4e-112   58.38%    Mo-molybdopterin cofactor sulfurase n=4 Tax=Obtectomera RepID=Q2F5P5_BOMMO
NCBI RefSeq     NP_001040259.1    7e-113   58.38%    Mo-molybdopterin cofactor sulfurase [Bombyx mori]
NCBI nr blastp  gi|114052577      1e-111   58.38%    Mo-molybdopterin cofactor sulfurase [Bombyx mori]
NCBI nr blastx  gi|114052577      6e-108   58.38%    Mo-molybdopterin cofactor sulfurase [Bombyx mori]
Group
Gene Ontology:
GO:0030151  8.3e-21  molybdenum ion binding
GO:0003824  8.3e-21  catalytic activity
GO:0030170  8.3e-21  pyridoxal phosphate binding
KEGG pathway 
InterPro domain:
[177-297]  IPR005303  1.5e-23  MOSC, N-terminal beta barrel
[307-406]  IPR005302  8.3e-21  Molybdenum cofactor sulfurase, C-terminal
Orthology group: MCL10358 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204159-TA
ATGTCAAAAGTAAACAACGTAAGATTCCTACAACTTGGAAGGAACCCTGTCCACAAGAACAAAGCTGAATGTACATTTCTAGGATTAAGAGATGGGTGGCTAAGAGATAGGGAAATGTTAATAGTGGATGACAAGTATAATTTTATTACAGCAAGAGCATTTCCAAAAATGCTTTTAATTCAATCTAAAATAGAAAAATCTATTTTAACTTTGAGCAACGATGACATGGAACCGCTGAATGTAGATTTAGCCGAGGCGAGTATAGCTTTAAAGGAAACATTTAAAGCAACCGTGTGGGGTGTCAAGGTTCAAGTTTATGATTGCGGTTGGGAAGCTAGCGAGTGGCTGTCAAGTGAAATTACTGAACCCGAGTCGAAACCCCCAACCGAAGATTTTGCACAAAACGGACTACGGTCATATCTAAGTGCAGCGGTCGCTACAACAGGCGTCCTCGCAGGCGCATACTGTGTGTATCATCTTTATAATGAAGCTCGGAAACGTAAGCTACCAACAACATGGAGGGAGGTTGGCACTTTGAAGGATATTTATATATATCCTATAAAATCTTGCGGTCCGGTGCAAAAGGACAGAGCTGAATGCACTTTACTCGGTCTAAAGGACGGTTGGTTGCGAGACAGAACTTTGATGGTAGTTGATAATAATTACAACTTCGTCACTGCAAGGGCATATCCAGAATTGCTATTGGTCCGTCCAACAATAAGAAATTCTGTCTTATCCTTACAACATAATGACATGGAAATACTTAACATGGATCTGTCAGAGATAGTCTCTTTGCAAACCGCAAAAACTGCAACAGTGTGGGGTGTCCAAGTTCCTGTGTATGATTGCGGCTGGGAACCCAGCGAATGGTTTTCAAGATTGTTACACAAATCTGCAGCCGACTTTAGATTGGGAGCGCTTCCTGACGAGGTACCGTTTAATTTAATCAACGAGGCCTCTATAGACGATCTTAATTCAAAATTGCAAGGAAAAAAAGTTTGCTACAAAAACTTCAGACCGAACTTCTTAATTACCGGCGCTCGGCCTTACGAAGAAGACGATTGGAAATATGTTAAAATCGGAGAAAACATCTTCGAAGTGATCAAGCCATGCACGAGATGTATCATGACGACCATCGATCCCGAAACAGGCGTTCGTGATTCAAATGCGGAACCTCTGGAAACGTTGAAGAAATATCGTCAATTAGAAAATCCTGACGCTAGACGCTCGGCCGGGGATTCCCCGCGTATGGGTCTGCAAATGTCACTTCGCTCAGGCATCAACGGCATCGTCTCGATCGACGATCGTGTCTATGTAGCTTAA

Protein sequence:

>DPOGS204159-PA
MSKVNNVRFLQLGRNPVHKNKAECTFLGLRDGWLRDREMLIVDDKYNFITARAFPKMLLIQSKIEKSILTLSNDDMEPLNVDLAEASIALKETFKATVWGVKVQVYDCGWEASEWLSSEITEPESKPPTEDFAQNGLRSYLSAAVATTGVLAGAYCVYHLYNEARKRKLPTTWREVGTLKDIYIYPIKSCGPVQKDRAECTLLGLKDGWLRDRTLMVVDNNYNFVTARAYPELLLVRPTIRNSVLSLQHNDMEILNMDLSEIVSLQTAKTATVWGVQVPVYDCGWEPSEWFSRLLHKSAADFRLGALPDEVPFNLINEASIDDLNSKLQGKKVCYKNFRPNFLITGARPYEEDDWKYVKIGENIFEVIKPCTRCIMTTIDPETGVRDSNAEPLETLKKYRQLENPDARRSAGDSPRMGLQMSLRSGINGIVSIDDRVYVA-
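
The transcript and protein lengths listed above are consistent with each other: 440 codons plus one stop codon give 3 x 441 = 1323 bp. The sketch below is one way to check this and to spot-check the translation at both ends of the coding sequence. It assumes the standard nuclear genetic code; the function names `translate` and `cds_length` are hypothetical and not part of the database. The code writes a stop codon as `*`, whereas the record above writes it as `-`.

```python
# Minimal consistency check for a coding sequence and its protein.
# Assumption: standard genetic code (NCBI translation table 1).
# All names here are illustrative, not part of any database API.

BASES = "TCAG"
# Amino acids in TCAG x TCAG x TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, one codon at a time."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def cds_length(protein_aa: int) -> int:
    """Expected CDS length in bp: one codon per residue plus the stop codon."""
    return 3 * (protein_aa + 1)


# First 18 and last 9 nucleotides of DPOGS204159-TA, copied from the record.
print(translate("ATGTCAAAAGTAAACAAC"))  # MSKVNN
print(translate("GTAGCTTAA"))           # VA*
print(cds_length(440))                  # 1323
```

Passing the full DPOGS204159-TA sequence to `translate` should reproduce DPOGS204159-PA, with `*` in place of the trailing `-`.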