Monarch geneset OGS2.0

DPOGS204024
Transcript: DPOGS204024-TA, 1035 bp
Protein: DPOGS204024-PA, 344 aa
Genomic position: DPSCF300138 - 88362-92978
RNAseq coverage: 887x (Rank: top 14%)
Annotation
Heliconius        HMEL004957        1e-137   68.42%
Bombyx            BGIBMGA004876-TA  5e-109   62.95%
Drosophila        Mocs2-PA          3e-71    39.57%
EBI UniRef50      UniRef50_D6WT96   2e-75    48.62%   Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WT96_TRICA
NCBI RefSeq       XP_397469.2       9e-79    43.51%   PREDICTED: similar to CG10238-PA [Apis mellifera]
NCBI nr blastp    gi|340714096      5e-78    44.17%   PREDICTED: molybdopterin synthase catalytic subunit-like [Bombus terrestris]
NCBI nr blastx    gi|340714096      3e-78    44.44%   PREDICTED: molybdopterin synthase catalytic subunit-like [Bombus terrestris]
Group
Gene Ontology     GO:0006777           3.3e-68   Mo-molybdopterin cofactor biosynthetic process
KEGG pathway
InterPro domain   [1-174] IPR003448    3.3e-68   Molybdopterin biosynthesis MoaE
Orthology group   MCL11759             Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204024-TA
ATGGATTATTTAAAACTAACGTCGGATAAGCTCTCTGTTGAGGCTGTCTCCGAGATGGTGGTCGATGACAAATGTGGAGCTGTTTCTTTATTCGTTGGTACCACTAGGGACAACTTTGATGGTAAAAAGGTTCTCCATTTAGAATATGAGGCGTACGAAGCTATGGCGATCAACGCACTGAAGGCCATATGCAATGAAGTGCGGGAGAAATGGCCAAACGTACATGGAATAGCCATGTACCATCGGCTCGGTTCCGTTCCCTGTAAGGAGGCGTCAGTGGTGATAGCCGTCAGCAGCCCTCACAGACAGGACAGTCTGAGCGCTGTGTCGTACTGCATAGACCAGCTCAAGGCCACTGTACCGATATGGAAAAAGGAGGTGTACGACGGCAGCGAGCCCATGTGGAAGGAGAACAAGGAATGTCCGTCTTCAGTGTCGCCACATCCGCTAGAGTGTAATACACTAGAAACGCCCATAGACAAGAACTTAGTGCAAATAAATGTTTCAAACGAGGAGTTGGAGAAGAGGATAAGAAATTTCATCGAGAGGAAAAGAGATCAGGTGAACCTGAGTAACATCCAGGACTTCATACCGGTCAAGAGTGAGAACGAGAAGGAAGATACGGACACGTGTGCGAGAGTGAGAACTCTGTTCGTGAAGAGAAGCGACTCGAAGGGACATTTGAAAATTCGTAAAGTCCACAACGAGTGGGGTCCACAGACGGTGCGTCAGGAACCCGCGAAAATTAAAAAGGACGGGGACTTGCCGCCCTCTATATCTGAACGAGTGTGCGCCATCGAGACGTATCTAAACACCGGACCCGTCGCTAAAGACATTTACAAACGTTTGAAGGACATGGAAGACAAAATAAATCATCTTCAGAGCGTATCGCCCGAGTACTCCATGTTCTGGAAAAACAAAAGAGAAACAGAAATAAAGCAGGAGCCGAAAGTTGAACACACGTACACGGCGGAGGACATCGCCAAGAAGATAGAGATGCTGGAGAGACAGAGCGGGGTGGAGTACTCGGACTGA

Protein sequence:

>DPOGS204024-PA
MDYLKLTSDKLSVEAVSEMVVDDKCGAVSLFVGTTRDNFDGKKVLHLEYEAYEAMAINALKAICNEVREKWPNVHGIAMYHRLGSVPCKEASVVIAVSSPHRQDSLSAVSYCIDQLKATVPIWKKEVYDGSEPMWKENKECPSSVSPHPLECNTLETPIDKNLVQINVSNEELEKRIRNFIERKRDQVNLSNIQDFIPVKSENEKEDTDTCARVRTLFVKRSDSKGHLKIRKVHNEWGPQTVRQEPAKIKKDGDLPPSISERVCAIETYLNTGPVAKDIYKRLKDMEDKINHLQSVSPEYSMFWKNKRETEIKQEPKVEHTYTAEDIAKKIEMLERQSGVEYSD-
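
Consistency check (illustrative sketch):

The reported transcript length (1035 bp) and protein length (344 aa) agree if the transcript is a pure coding sequence: 344 codons plus one stop codon is 345 codons, or 1035 bases. The Python sketch below shows one way to check that relationship and to translate a coding sequence with the standard genetic code. The helper names (translate, expected_cds_length) are hypothetical and not part of any database tooling. The sketch also assumes the standard code applies and writes the stop codon as "*", whereas the record above writes it as "-".

```python
# Standard genetic code, built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon.

    Stop codons become '*'; codons with unexpected characters become 'X'.
    Trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


def expected_cds_length(protein_length):
    """Length in bases of a CDS encoding protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First 21 bases and last 12 bases of DPOGS204024-TA, copied from the record above.
    print(translate("ATGGATTATTTAAAACTAACG"))  # MDYLKLT
    print(translate("TACTCGGACTGA"))           # YSD*
    print(expected_cds_length(344))            # 1035
```

Running translate on the full DPOGS204024-TA sequence and comparing the result (with "*" replaced by "-") against DPOGS204024-PA would extend the same check to the whole record.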