Monarch geneset OGS2.0

DPOGS206899
TranscriptDPOGS206899-TA1857 bp
ProteinDPOGS206899-PA618 aa
Genomic positionDPSCF300001 - 1780085-1785686
RNAseq coverage252x (Rank: top 41%)
Annotation
HeliconiusHMEL0068550.077.35% 
BombyxBGIBMGA012855-TA0.068.35% 
Drosophilacin-PA3e-11840.81% 
EBI UniRef50UniRef50_A7SKS58e-14647.02%Predicted protein n=8 Tax=cellular organisms RepID=A7SKS5_NEMVE
NCBI RefSeqXP_001627768.12e-14647.02%predicted protein [Nematostella vectensis]
NCBI nr blastpgi|1563685753e-14547.02%predicted protein [Nematostella vectensis]
NCBI nr blastxgi|1563685755e-14346.87%predicted protein [Nematostella vectensis]
Group
Gene OntologyGO:00067771.1e-54Mo-molybdopterin cofactor biosynthetic process
GO:00323242.6e-45molybdopterin cofactor biosynthetic process
KEGG pathway 
InterPro domain[362-525] IPR0014531.1e-54Molybdopterin binding
[193-375] IPR0051102.6e-45MoeA, N-terminal and linker domain
[5-144] IPR0208179.6e-37Molybdenum cofactor synthesis
[534-612] IPR0051112.1e-18MoeA, C-terminal, domain IV
Orthology groupMCL14146 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206899-TA
ATGGTGAAAGCAGTCGCGATAATCACAGTAAGTGATAGTTGTTTTAAGGATAATTCAAAAGATACATCAGGACCTGCTTTGCTAAAATTTATAGAACAACATTTCCCTGAAGCAAACATTCATACGATAATTGTCCCAGATGAAAAAGAGATTATTGAGCGGGAATTAAAATATTTTTGCGATTCTCATATCGACCTTGTTCTTACAACTGGAGGTACAGGATTAAGTCCTCGAGATGTAACACCTGAAGCCACGAAGGCAGTACTTCACAAGGAAGTACCAGCTATTGCTGTTGCAATGACAATTGCAAGTCTTAAGAAGACACCGATGGCTATGCTCTCTAGAGCTGTAGCGGGAATAAGGGATAGAACACTTATCATTAACTTTCCCGGCAGTAAGAAAGCAGTCACTGAATGTATAGAAGTTGTTAAACCTATTCTTGGTCATGGTATATCCCTTATAACAAATAATTTGGCTAATGTGAGAATTGTTCATGATAAATTACAATCGGACCATACTTGTTCCCATATGAGGAACACTAATGTAGATATATCAAAGGTTGCTCTAAGACCTCGCCAATCACCCTTTCCTATGTTGGAAATGGTTGAGGCTTTTAACATTGTTGATGCTGTGATGATGCAATGGGTAGAGCGTACGGAAACTGTATCTATAGAAGATAGTGCTGGCTGTGTTGTGGCTCAAGACATAATAGCTAGGGAACCTATGCCGCCTTTTCCAGCATCAGTTAAGGATGGTTATGCATGCCTCAGTCTGGACGGAGTTGGTAAACGTAGAGTGCAGGCAGTGGTTGCAGCGGGAGATACTCCTCTCAGTCCACTAACACGTGGGCAGTGTGCTCGTGTCAACACAGGGGCTCCCCTACCACTTGGCGCTGATTGTGTTGTACAGGTGGAAGATACAAAACTTATTCAGGCATCAGCGGATAACCAGACGGAGTTAGAGGTGGAGATTCTGGTGGCGCCGAAACCACACCAGGATGTGAGACCTATAGGATTTGATATACCCTTGGGCTCTCTGCTCGTTGAAAAAGGTGACGTCATCGACGCCGCTAAAATTGGTATTTTGGCTGGGGCCGGATATCAGGAGATCACAGTAGTCGAACATCCTAAGGTAGGAATTATGTCTACCGGAAACGAGTTACAGGAGCCGTCGGATTCTTTCCTTCGTCCCTCACATATAAGGGATTCGAACAGAATTATGATTAAATCATTGCTCAAGGAGCACGGATTTGAAAGCATCGACTGCGGGATCGCCCGCGACCATCCTGGCGAACTTTCGCATGCGCTCGAGAACGCGCTCGCCGTCTGCGACGTTCTCGTCTGTACGGGGGGAGTGTCGATGGGTGAGAGGGACCTGCTCAAACCCGTGCTCATTAAAGACTTCAACGCGACCGTGCATTTCGGACGCGTCCGCATGAAGCCTGGTAAACCGAGCACGTTCGCGACATGCAAGTATGAAGGGAGGACCAAATTCATATTCGCACTGCCCGGTAACCCTGTATCAGCGTACGTGTGCTGTCTCCTGTTGGTGGTGCGAGCTCTGCGTCAGTGTACGCGGTACAGCGGCGAGTGGGCGCGGCTTGGAGTGAAACTCGCGCGTGACATCACACTGGATCCTCGCCCGGAGTACGCTCGAGCTCAGCTCAGCTTTCCGGACATACAGGATCTACCCGTCGCTACACTACTCGGAAATCAGTGCAGCAGCCGTCTGCTGAGTGCCTGCGGAGCCAACGTTCTATTGGAATTGCCGGGAGCCACCCCTGAATGCCCCATGCTACCAGCTGGTTCCGTAGTGCCAGCTCTGCTCACGGGACGGATTGATCTCCCTAGACTTTAA

Protein sequence:

>DPOGS206899-PA
MVKAVAIITVSDSCFKDNSKDTSGPALLKFIEQHFPEANIHTIIVPDEKEIIERELKYFCDSHIDLVLTTGGTGLSPRDVTPEATKAVLHKEVPAIAVAMTIASLKKTPMAMLSRAVAGIRDRTLIINFPGSKKAVTECIEVVKPILGHGISLITNNLANVRIVHDKLQSDHTCSHMRNTNVDISKVALRPRQSPFPMLEMVEAFNIVDAVMMQWVERTETVSIEDSAGCVVAQDIIAREPMPPFPASVKDGYACLSLDGVGKRRVQAVVAAGDTPLSPLTRGQCARVNTGAPLPLGADCVVQVEDTKLIQASADNQTELEVEILVAPKPHQDVRPIGFDIPLGSLLVEKGDVIDAAKIGILAGAGYQEITVVEHPKVGIMSTGNELQEPSDSFLRPSHIRDSNRIMIKSLLKEHGFESIDCGIARDHPGELSHALENALAVCDVLVCTGGVSMGERDLLKPVLIKDFNATVHFGRVRMKPGKPSTFATCKYEGRTKFIFALPGNPVSAYVCCLLLVVRALRQCTRYSGEWARLGVKLARDITLDPRPEYARAQLSFPDIQDLPVATLLGNQCSSRLLSACGANVLLELPGATPECPMLPAGSVVPALLTGRIDLPRL-