Monarch geneset OGS2.0

DPOGS203140
Transcript        DPOGS203140-TA  1686 bp
Protein           DPOGS203140-PA  561 aa
Genomic position  DPSCF300035 - 1245678-1253298
RNAseq coverage   345x (Rank: top 34%)
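The lengths and coordinates above can be cross-checked with a little arithmetic. The sketch below is only illustrative: the helper names `coding_length` and `genomic_span` are hypothetical, and it assumes that the transcript consists of the coding sequence plus one stop codon and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated in the record.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1686
PROTEIN_AA = 561
GENOMIC_START = 1245678
GENOMIC_END = 1253298


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode `protein_aa` residues, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def genomic_span(start: int, end: int) -> int:
    """Span in bp, ASSUMING 1-based inclusive coordinates (not stated in the record)."""
    return end - start + 1


cds_bp = coding_length(PROTEIN_AA)
span_bp = genomic_span(GENOMIC_START, GENOMIC_END)

# True when the transcript length equals 3 * (protein length + 1 stop codon).
print("transcript matches CDS + stop:", cds_bp == TRANSCRIPT_BP)
# Genomic span of the locus under the coordinate assumption above.
print("genomic span (bp):", span_bp)
# Genomic sequence not present in the transcript (presumably intronic).
print("span minus transcript (bp):", span_bp - TRANSCRIPT_BP)
```

Under these assumptions the 561 aa protein accounts for the full 1686 bp transcript, and the locus spans considerably more genomic sequence than the transcript itself.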
Annotation
Heliconius        HMEL003241        4e-115  64.67%
Bombyx            BGIBMGA009187-TA  3e-65   49.83%
Drosophila        Mocs1-PC          1e-135  49.40%
EBI UniRef50      UniRef50_E2A1G1   2e-136  48.28%  Molybdenum cofactor biosynthesis protein 1 B n=4 Tax=Formicidae RepID=E2A1G1_CAMFO
NCBI RefSeq       XP_001848384.1    5e-137  47.47%  molybdopterin cofactor synthesis protein a [Culex quinquefasciatus]
NCBI nr blastp    gi|270001734      2e-141  50.30%  hypothetical protein TcasGA2_TC000610 [Tribolium castaneum]
NCBI nr blastx    gi|270001734      2e-135  50.30%  hypothetical protein TcasGA2_TC000610 [Tribolium castaneum]
Group
Gene Ontology     GO:0006777  2.9e-103  Mo-molybdopterin cofactor biosynthetic process
                  GO:0046872  2.9e-103  metal ion binding
                  GO:0008152  1e-33     metabolic process
                  GO:0003824  1e-33     catalytic activity
                  GO:0051536  1.4e-27   iron-sulfur cluster binding
                  GO:0051539  1.2e-25   4 iron, 4 sulfur cluster binding
                  GO:0019008  1.2e-25   molybdopterin synthase complex
KEGG pathway 
InterPro domain   [52-360]   IPR013483  2.9e-103  Molybdenum cofactor biosynthesis protein A
                  [369-523]  IPR002820  2.3e-56   Molybdopterin cofactor biosynthesis C (MoaC) domain
                  [369-522]  IPR023045  1.5e-51   Molybdenum cofactor biosynthesis C
                  [55-261]   IPR013785  1e-33     Aldolase-type TIM barrel
                  [66-226]   IPR007197  1.4e-27   Radical SAM
                  [232-360]  IPR010505  1.2e-25   Molybdenum cofactor synthesis C-terminal
                  [61-265]   IPR006638  1.2e-09   Elongator protein 3/MiaB/NifB
Orthology group   MCL13866  Single-copy universal gene
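The InterPro coordinates listed above suggest two separate regions of the protein, one annotated as biosynthesis protein A and one as biosynthesis C. A minimal sketch for checking this is shown below. The helper names (`domain_length`, `within_protein`, `overlaps`) are hypothetical, and the coordinates are assumed to be 1-based inclusive residue positions on the 561 aa protein.

```python
# Protein length and InterPro hits copied from the record above.
PROTEIN_AA = 561
DOMAINS = [
    (52, 360, "IPR013483"),   # Molybdenum cofactor biosynthesis protein A
    (369, 523, "IPR002820"),  # MoaC domain
    (369, 522, "IPR023045"),  # Molybdenum cofactor biosynthesis C
    (55, 261, "IPR013785"),   # Aldolase-type TIM barrel
    (66, 226, "IPR007197"),   # Radical SAM
    (232, 360, "IPR010505"),  # Molybdenum cofactor synthesis C-terminal
    (61, 265, "IPR006638"),   # Elongator protein 3/MiaB/NifB
]


def domain_length(start: int, end: int) -> int:
    """Residues covered, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def within_protein(start: int, end: int, protein_len: int) -> bool:
    """True if the interval lies entirely inside the protein."""
    return 1 <= start <= end <= protein_len


def overlaps(a: tuple, b: tuple) -> bool:
    """True if two (start, end, ...) intervals share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


for start, end, acc in DOMAINS:
    print(acc, domain_length(start, end), within_protein(start, end, PROTEIN_AA))

# Compare the protein A region (first entry) with the MoaC region (second entry).
print("A and C regions overlap:", overlaps(DOMAINS[0], DOMAINS[1]))
```

With the coordinates as listed, every hit falls inside the protein and the A and C regions do not overlap, which is what one would expect if the protein carries both regions in tandem.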
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203140-TA
ATGAAAGCAAAAATGTTTTATCTTTTTTATCCCAAATTACTACAAAACACAAGTAGTTATCAAACATTTAGCTTAAAACGTTTTATAAATTCTGATCTAAAACCTAACATAAAAGAAACTGCAAATTTATTACTTAGAAATGATGTTCCACCACTTGTTGACTTATATGGACGTAAACATGACTATTTACGAATCTCCCTAACTGAAAAATGCAATTTAAGGTGTCAATACTGCATGCCGGCCGAAGGTGTAAAGCTTAGTCCTCGTGACAAAATATTGTCTAATGAAGAAGTTTTACGATTGGCAAGAGTGTTTGCCGCTCTTGGTATAAATAAAATACGGTTAACTGGTGGCGAGCCTACATTGAGAAAAGACCTTGTGAATATTGTCCAGGAATTGACCAACCTCCATGGGATAACGACAGTGGCAATGACAACTAATGGTATAGCCCTAACAAGGAAATTACCTTCGCTACAACGTGCTGGTCTCTCAGCTCTGAATTTGTCTTTGGATTCGCTGAAACCAGAACGTTTTGAGCGCATGGCACGAAGACCGGGCTTACCCCATGTGCTGGCCAGCATGGATTTGGCGCTGCAGCTTGGCTTCAAATCCGTTAAAATTAATACCGTGCTTATGAAAGGTTTTAATGATGACGAAATTTGCGATTTCATTGAACTAACTCGCGACCGTGATATGGATATAAGGTTCATAGAGTTCATGCCTTTCTCTGGGAACCGTTGGGAGAAAGGGGAACGTATGGTTAGTGAGAAAGATGCTATCTCTGCTGCAATCGAGAGATACGGTGATCTGATACCGCTCCCCCCCACTCCGTGTCGCACTGCCACGCTGTGGCAGGTGCCAGGTTATACTGGTCGGGTGGGTTTCATATCGTCCATGACTAAGCCGTTTTGTTCAACTTGCAACCGTCTTCGACTCACAGCAGATGGTAATTTGAAGGTCCAAACAGACAAGCCCCTTAGGGAGACTAACCTCCGTGACGCCATACGAGCTGGTGTCAACGATGATGATATTGAGACCTTAGTCCGTAGCGCTCTTAGGAGGAAGTTACCTCGACATGCTGGTATGATAAAAGTTATTATTGTCATGACGCACTTAGACAAGTCGGGACGTGCTCGCATGGTCGATGTAGGTGAAAAACCGGTGACTGTTCGAACTGCGGAAGCGGAATGCTATCTTGTTGTTGGAGCTCGTCTGCTACGTCTATTGCGCTCGTCTGGCGTACCGAAAGGAGACGCGCTGACTGTCGCTCAGGTAGCTGGTACAATGGCGGCCAAGCGTACATCAGATCTGATACCCATGTGCCATCCGCTAGCTTTGACCTTGGCTCGGGTCCGAGTGATGTTGCCAAAGGAGAGTGAGCGCGGGGGAGGTGGCCGTGTACGAGTAACGTGCGAGGCGCGTGCAACAGCAAAAACTGGTGTGGAAATGGAAGCGCTCACCGGATGTTCCATAGCGGCGCTAACTTTGTATGACATGTGCAAATCGGTGGACAAAAACATGCAGATTACTGATCTCAAGGTGATATCGAAGACTGGTGGCAAAAGTTCCTGGGGTGATGTAGAAGAGAAAGAAGAGGAAGGTTTTGGACCTAAAGTACGCGAACATGATACGTCGCCGCATGCACCTGGAGAGACATACGTCCCTACTAATTTGTTATACTTTTAG

Protein sequence:

>DPOGS203140-PA
MKAKMFYLFYPKLLQNTSSYQTFSLKRFINSDLKPNIKETANLLLRNDVPPLVDLYGRKHDYLRISLTEKCNLRCQYCMPAEGVKLSPRDKILSNEEVLRLARVFAALGINKIRLTGGEPTLRKDLVNIVQELTNLHGITTVAMTTNGIALTRKLPSLQRAGLSALNLSLDSLKPERFERMARRPGLPHVLASMDLALQLGFKSVKINTVLMKGFNDDEICDFIELTRDRDMDIRFIEFMPFSGNRWEKGERMVSEKDAISAAIERYGDLIPLPPTPCRTATLWQVPGYTGRVGFISSMTKPFCSTCNRLRLTADGNLKVQTDKPLRETNLRDAIRAGVNDDDIETLVRSALRRKLPRHAGMIKVIIVMTHLDKSGRARMVDVGEKPVTVRTAEAECYLVVGARLLRLLRSSGVPKGDALTVAQVAGTMAAKRTSDLIPMCHPLALTLARVRVMLPKESERGGGGRVRVTCEARATAKTGVEMEALTGCSIAALTLYDMCKSVDKNMQITDLKVISKTGGKSSWGDVEEKEEEGFGPKVREHDTSPHAPGETYVPTNLLYF-