Monarch geneset OGS2.0

DPOGS202961
Transcript: DPOGS202961-TA, 1641 bp
Protein: DPOGS202961-PA, 546 aa
Genomic position: DPSCF300195 + 512217-516551
RNAseq coverage: 643x (Rank: top 20%)
Annotation:
  Heliconius       HMEL011123        5e-100  51.84%
  Bombyx           BGIBMGA005750-TA  1e-141  66.85%
  Drosophila       CG7280-PA         2e-129  46.80%
  EBI UniRef50     UniRef50_Q16H84   6e-132  43.34%  Sulfite reductase n=9 Tax=Endopterygota RepID=Q16H84_AEDAE
  NCBI RefSeq      XP_971620.2       8e-136  47.14%  PREDICTED: similar to CG7280 CG7280-PA [Tribolium castaneum]
  NCBI nr blastp   gi|189234936      2e-134  47.14%  PREDICTED: similar to CG7280 CG7280-PA [Tribolium castaneum]
  NCBI nr blastx   gi|189234936      3e-134  47.33%  PREDICTED: similar to CG7280 CG7280-PA [Tribolium castaneum]
Group
Gene Ontology:
  GO:0009055  8.1e-79  electron carrier activity
  GO:0055114  8.1e-79  oxidation-reduction process
  GO:0046872  8.8e-55  metal ion binding
  GO:0016491  8.8e-55  oxidoreductase activity
  GO:0030151  2.6e-37  molybdenum ion binding
  GO:0020037  7.3e-22  heme binding
KEGG pathway:
  dpo:Dpse_GA20233  9e-128
  K00387 (E1.8.3.1, SUOX) maps-> Sulfur metabolism
InterPro domain:
  [179-417]  IPR000572  8.1e-79  Oxidoreductase, molybdopterin-binding domain
  [185-199]  IPR008335  8.8e-55  Eukaryotic molybdopterin oxidoreductase
  [423-546]  IPR014756  2e-37    Immunoglobulin E-set
  [424-545]  IPR005066  2.6e-37  Molybdenum cofactor oxidoreductase, dimerisation
  [84-161]   IPR001199  7.3e-22  Cytochrome b5
Orthology group: MCL12703 Single-copy universal gene

Nucleotide sequence:

>DPOGS202961-TA
ATGTCATTGAGGACTATTTCAATACGGAATTTATTCCGACAAAAGAAAGTTTTTCATTTTACGCCATGTATAATTCAAAGTCAGAAACAATACGAGGACAGGTACTCAAATAGAAGTTGGAAGAATGAAAATATCTCTGTAGGATTACTCGCCTCCCTCATACTAGGCACACAGCTGACAGAGAATAACGATAATGAAAATAAACAAAATGAGAGTGAGCTGCACCAGCAGGCCGGAGCTAAACGGCCGGACTTACCGACATACAGGGCCGAGGAAGTCAGTCAACATAATAATGAGAGGAGTTTCTGGGTGACATACAAGCAGGGAGTGTATGATGTGACGTCATTCCTGCCGTCTCACCCCGGCGGTGAACAAATATACAATGCGGCCGGCCTCAGTGTAGAGCCTTTTTGGAACGTGTATGGCATGCATAAGACTAAGGAGATTTATGAATTACTAGAGAGCTACAGGATAGGCAATCTTCACGAGGACGATTTAGTGGATCACTCGGACGAGGAGATGTGGGCCAAGGAACCGTTCAGAGACAAGCGACTCTGCGTGAAGACAGCCAGACCGTTCAACGCCGAGACGCCGCCCGCTGAACAAGTGCGACACTTCGACACTCCGGCTGAGCTATTCTACGTCCGTCAACACATGCCGGTGCCGGATCTGGACGGCAGCTGTCACAGGCTGACGGTGCTGGTGGAGGGGGAGGGGGAGCGGCGCCTGCAGCTGTCCGTAGACGAACTGAATAGGTTCAAGCGGCAGGAGGTGCGGGCGGCGCTCATGTGTGCCGGGAACAGGAGGTCCGAGATGAACCTGGTGAAGCCGGTGAAGGGTCTCTCGTGGCGGACCGGCGCCATCGGTAACGCGGTGTGGGGCGGAGTGCTGCTGAGGGACGTGTTGCTGGCGGCAGGGGTCTCGGACAAGGACACGGAGGGGAAACACGTCACGCTGATGGGTGCCGACATGGACGCGACGGGTACTTACTTCAGTACGTCCATACCGCTGTCTCACGCCCTGGACGAGCGTGCCCGGGTCCTGTTGGCCACCTCCATGAACGGAGCGCCTCTCACCAAGGACCACGGACATCCCCTGAGGGTGGTGGTCCCGGGGGCGCCGGCCGTCAGGAGCGTCAAGTGGCTGCAGTGCATCAAGGTGTCGTCGGAGGAGAGTCCGTCTCACTGGCACCAGAGAGACTACCGCTCGTTCGGTCCGAGCGTGTCTTGGGAGACGGCGGACTTCCCCTCCGCGCCCCCCGTCTACAGCCTGCCCGTCACCTCGGCCGTGTGCTCGCCCGAGGACGGTGACACCGTACGTCCTCGCCGGGGAGCGCTGCACGTGCAAGGGTACGCGTACTCGGGCGGCGGCGCGAAGATCATCCGCGTAGAGGTCAGTACGGACCGCGGCGCCACATGGCGTGAGGCGCGGCTCAGGAGCGACAGCGCTCCGCCCAGAGAACACTACTCCTGGACACTGTGGGATGTCGATCTGCCGGCCGCCGGACCGCAGATGGAGATATGGGTGAAGGCCACCGACAGCAACTTCAACGCTCAGCCGGAGAACTTCAGAGACATCTGGAACATCCGCGGCATCCTCAGTAACGCCTATCATAAAATAAAAGTTAACGTAGAACAGTGA

Protein sequence:

>DPOGS202961-PA
MSLRTISIRNLFRQKKVFHFTPCIIQSQKQYEDRYSNRSWKNENISVGLLASLILGTQLTENNDNENKQNESELHQQAGAKRPDLPTYRAEEVSQHNNERSFWVTYKQGVYDVTSFLPSHPGGEQIYNAAGLSVEPFWNVYGMHKTKEIYELLESYRIGNLHEDDLVDHSDEEMWAKEPFRDKRLCVKTARPFNAETPPAEQVRHFDTPAELFYVRQHMPVPDLDGSCHRLTVLVEGEGERRLQLSVDELNRFKRQEVRAALMCAGNRRSEMNLVKPVKGLSWRTGAIGNAVWGGVLLRDVLLAAGVSDKDTEGKHVTLMGADMDATGTYFSTSIPLSHALDERARVLLATSMNGAPLTKDHGHPLRVVVPGAPAVRSVKWLQCIKVSSEESPSHWHQRDYRSFGPSVSWETADFPSAPPVYSLPVTSAVCSPEDGDTVRPRRGALHVQGYAYSGGGAKIIRVEVSTDRGATWREARLRSDSAPPREHYSWTLWDVDLPAAGPQMEIWVKATDSNFNAQPENFRDIWNIRGILSNAYHKIKVNVEQ-
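
The transcript and protein lengths in this record are consistent with each other: 1641 bp is 547 codons, which gives 546 residues plus the terminal TGA stop (written as "-" at the end of the protein sequence). The following is a minimal sketch of how that check could be reproduced. It assumes the standard nuclear genetic code and that the transcript sequence is a complete coding sequence in frame 1. The names `translate` and `expected_protein_length` are hypothetical helpers, not part of any database tooling, and only short excerpts from the start and end of the nucleotide sequence are used.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: nuclear code applies).
# "-" marks a stop codon, matching the trailing "-" in the protein record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY--CC-WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, codon by codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon."""
    return cds_length_bp // 3 - 1


# First five codons and last three codons of DPOGS202961-TA.
start_peptide = translate("ATGTCATTGAGGACT")
end_peptide = translate("GAACAGTGA")
protein_length = expected_protein_length(1641)
```

Under these assumptions, the first excerpt translates to the opening residues of DPOGS202961-PA, the last excerpt ends in the stop marker, and the computed length matches the 546 aa listed above.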