Monarch geneset OGS2.0

DPOGS203952
Transcript: DPOGS203952-TA, 1359 bp
Protein: DPOGS203952-PA, 452 aa
Genomic position: DPSCF300005 + 210737-215431
RNAseq coverage: 636x (Rank: top 20%)
Annotation
Heliconius       HMEL012090             3e-176  63.58%
Bombyx           BGIBMGA000481-TA       5e-167  59.38%
Drosophila       CG17843-PA             3e-62   33.64%
EBI UniRef50     UniRef50_UPI00015B4761 2e-72   36.31%  UPI00015B4761 related cluster n=1 Tax=unknown RepID=UPI00015B4761
NCBI RefSeq      XP_001602334.1         4e-73   36.31%  PREDICTED: similar to Quiescin-sulfhydryl oxidase4, putative [Nasonia vitripennis]
NCBI nr blastp   gi|383849934           1e-74   35.06%  PREDICTED: sulfhydryl oxidase 1-like [Megachile rotundata]
NCBI nr blastx   gi|383849934           8e-73   34.46%  PREDICTED: sulfhydryl oxidase 1-like [Megachile rotundata]
Group
Gene Ontology    GO:0055114             5.8e-32  oxidation-reduction process
                 GO:0016972             5.8e-32  thiol oxidase activity
KEGG pathway
InterPro domain  [276-398] IPR006863    5.8e-32  Erv1/Alr
Orthology group  MCL11530               Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203952-TA
ATGGCTTATCCTTCTTTACGTTATTTACACGAAAACTATGTGAAGGGTAATTCCAATGTAGGTGAGAAGTTTCAATCTGCTGAAAGTGCTGCAAAATTAAAAGATCAAATGATATTCAAAATACAAAATGAACAGCAAGCTGGACAATTAAAACATGCCCCTTCTTTGGATATTGATTCTCCAGCCAACATTCAAACAATGCCCACTCCATCTGGAGTTACATACACATTTTTAATTTTTGAATCTCCCAATTCAACTATTGGCTCAGAAATAGTTTTGGATACCAGTGACTATACTAATATATTGATCAAAAGAGTATCTGATTCCAGTAAGTTGGCAGAAAGTATTGGTGTTAAAACTTTTCCTGCGGTGGCAGTTGTTGGACCTTCAAGAACTCCCAATATTCTCAACCCTGGAACTCCAACAAAATCTAATATACTCAAAACTATTAATACATATTTAAGGTCACAGAATTTTGTATTTCCTAAGCACCTTGAATTTGAAGATGTAGATGAATTGAATGCTTTAAAAAATAAGGATTTAAGTTCCATGTCTGCAGATGCTGTTTTCTATAGTGATCTGGAAAAAACTTTAAAAACTAGTTTGCATACTGAAATTACGAGGCATAAAGTTCTAGATGGTGAACCACTTGAGGCCCTATTGGATTTTTTAAATGTTTTAATAACAGCCTTCCCCTTTAGAGCCAATATGGAAGAGTATATACTTGAATTACACAACAAACTTAGCTCAAAAAGTTCATGGAACGGAAATGAAGTATATGAATTAGTAAAAAAGTTAGAAGCTTCTCATGCACCAGTTTTCTCAACCAATGCAGATTATATACGATGCAAGGGTAGTCAAAGTAAATATCGAGGCTATACTTGTGGATTATGGACTTTGTTCCATGTTCTAACTGTAAATGCAGCAAGAAAACCAGGATATGAAGCCCCACATGTTTTAAGAGCTATGCATGGTTATGTCAAACATTTCTTTGGTTGTACTGAATGTTCCCAACACTTCCAGGCCATGGCAGCTAGAAATAGGTTGTTTGATGTCAAAGAAAACGATAAAGCAGTTCTCTGGTTATGGATTTCTCACAATGAAGTCAACTTGAGATTGGCAGGAGATGTAACCGAAGACCCTGCTCATCCAAAAATACAGTATCCAAGTGTCACTAACTGTCCAGACTGTAGACTTTCACGAGGAGCTTGGAATTTGCCTGCAGTTTTTGAATATTTGCAAAAAATATATGGTGCTAACAACATTCATGATGCAAGATCAATAGCCTCAGCAGCCGCCTCTCCTGGGCCATTCTCCGACCTGGACATTGGAATGTTAAGCCTCTTGTACATGTCGTAA

Protein sequence:

>DPOGS203952-PA
MAYPSLRYLHENYVKGNSNVGEKFQSAESAAKLKDQMIFKIQNEQQAGQLKHAPSLDIDSPANIQTMPTPSGVTYTFLIFESPNSTIGSEIVLDTSDYTNILIKRVSDSSKLAESIGVKTFPAVAVVGPSRTPNILNPGTPTKSNILKTINTYLRSQNFVFPKHLEFEDVDELNALKNKDLSSMSADAVFYSDLEKTLKTSLHTEITRHKVLDGEPLEALLDFLNVLITAFPFRANMEEYILELHNKLSSKSSWNGNEVYELVKKLEASHAPVFSTNADYIRCKGSQSKYRGYTCGLWTLFHVLTVNAARKPGYEAPHVLRAMHGYVKHFFGCTECSQHFQAMAARNRLFDVKENDKAVLWLWISHNEVNLRLAGDVTEDPAHPKIQYPSVTNCPDCRLSRGAWNLPAVFEYLQKIYGANNIHDARSIASAAASPGPFSDLDIGMLSLLYMS-