Monarch geneset OGS2.0

DPOGS212108
TranscriptDPOGS212108-TA1806 bp
ProteinDPOGS212108-PA601 aa
Genomic positionDPSCF300038 - 494078-506618
RNAseq coverage719x (Rank: top 18%)
Annotation
HeliconiusHMEL0125290.080.37% 
BombyxBGIBMGA006740-TA3e-17673.75% 
DrosophilaCG42345-PD5e-8632.46% 
EBI UniRef50UniRef50_E2BNY51e-16046.29%Laccase-4 (Fragment) n=1 Tax=Harpegnathos saltator RepID=E2BNY5_HARSA
NCBI RefSeqXP_967121.13e-12840.86%PREDICTED: similar to multicopper oxidase [Tribolium castaneum]
NCBI nr blastpgi|3838633965e-16445.55%PREDICTED: laccase-5-like [Megachile rotundata]
NCBI nr blastxgi|3838633964e-16045.55%PREDICTED: laccase-5-like [Megachile rotundata]
Group
Gene OntologyGO:00055071.4e-26copper ion binding
GO:00551141e-19oxidation-reduction process
GO:00164911e-19oxidoreductase activity
KEGG pathway 
InterPro domain[195-347] IPR0089722.1e-32Cupredoxin
[79-186] IPR0117071.4e-26Multicopper oxidase, type 3
[223-327] IPR0011171e-19Multicopper oxidase, type 1
[450-576] IPR0117069.8e-18Multicopper oxidase, type 2
Orthology groupMCL17317 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212108-TA
ATGTCCTTGCAAAGGATACTAGTGCTGACTTTTATTATAGCTTGTGTGGTTGTTGTTGTTTATTACACACCTATGCCAGAGGAATATTTCGAAAATTGTGATCGTGAATGTCACGAGCTCGACTGGCCTATGATTTGCAGAGTAAAACTCGTAATAGAGGTATACAAAACATTCAGCAAATCTTGTAACAGTTGCGTGGAAAATGGAAGCGAATGTCCGGCGATGTGCATCACAGCTGATGGTAGGGAACGAGGTGTCCTCTCTGCGAACAGAGAATTGCCTGCACCTGCCTTCCATGTATGTCAGAACGACATTCTCGTCGTTGATGTAGTACATCGAGCTCCTGCGCATGCATTATCAATACATTGGAGAGGTCAGCCTCAAAAAGAGACGCCTTTTATGGATGGTGCTCCCATGTTAACACAGTGCCCTCAGCCGGCCTATACAACGTTCCAGTATAAATTTCGAGCCTCTGCTGTAGGAACACATATGTACCATGCCCATTCAGCAGCGGATGCCGCTGATGGATTGGCCGGAGCGTTTATTGTACGACAGTCACCACGTCTGGATCCTTTAGCAAGTCTCTACGATGTAGATGCAACAGATCATACAATTTTTGTTGCTGAATGGGGTCACTCTATGGGACCACTTGCAGGTGTCACATCCAAAATACCAAACGCAGAATCTTTACTCATTAACGGCAAAGGAAAAACAAATGAAACGTTGAGCGGGCCATTATTTAAGTTTAATGTAGAATATGGCAAACGTTATAGATTCAGATTGGCATACGGCGGTGGATTTAAAAGCTGTCCAATAAACTTTTCCATCGACAAACATGCTATTAAGCTGGTTGCTTTGGATGGTCATATAATTCAAACTGAAACTGTAACATCAATTGAATTAGGAAGAGGAGAGCGAGCCGATTTCATTCTGGACGCTAATCAAGCGATTGGCGTATACAAAATCAGAGTTGTGGCAGATAAATCATGTCAGGATGATTTGGAGGGTGAAGCTGAATTAATATATAAAAATCAGGACAATAAGGTTTTACTTAAGAATGAGGGTAGTGAAGATTCCACAATAAATCGCATATTTTCAACAGTGGCTAGTGATAATTGTGTCAGTGATACTGTTTTGTGCTTGGATGAAATACATGGAGCTGAAAAGCTTGCTTCGGAATTAGCCGAGCCAGTTGACGAGGTTCTTTATGTGCCTTTTAATTACTCAACGAGGCAAATGTCAGCGAGACGTTTCGAAAGTTGGGGTCAGACCGACGGTCACCGTTTCACGTATCCGGCCTCCCCGCTTCTGACGCAGGGTTCCGATGTGGCCCCCGAAGCCATGTGTCCCAAAAACACCGGAGAGGGAGGGGAGTGCGTCCACGTCAAGTACATCCCTTTGCACTCAACTGTTGAGTTAATAATGTTTGATCAAGGAGGGGAATCGGATCATATTTTCCATCTCCACGGATACAGTTTCTATGTCACCGATGTTCGTCAAATGGACACGAAACTGGAAAAAGAAACTGTTATGAAAATGAATCAGGATGGTACGCTGTTCCCTTCCAAGAATCTCGATGACCCCGTGAGGAAAGATACGATAGTCATTCCTAAATTCGGTGTCGCCGCCTTGAGGTTCAAGGCGGATAACCCCGGTTACTGGATGATGAGGGACGAAAGATCAGCCCATTGGACTAGAGGCTTGGATTTCGTTTTAAAAGTCGGAGATCAAAGAGATTTCGTTAAGGCTCCGGCAGATTTCCCCAAATGCGGTTCATATGTCGGACCAGAATATTTCTTGATATAG

Protein sequence:

>DPOGS212108-PA
MSLQRILVLTFIIACVVVVVYYTPMPEEYFENCDRECHELDWPMICRVKLVIEVYKTFSKSCNSCVENGSECPAMCITADGRERGVLSANRELPAPAFHVCQNDILVVDVVHRAPAHALSIHWRGQPQKETPFMDGAPMLTQCPQPAYTTFQYKFRASAVGTHMYHAHSAADAADGLAGAFIVRQSPRLDPLASLYDVDATDHTIFVAEWGHSMGPLAGVTSKIPNAESLLINGKGKTNETLSGPLFKFNVEYGKRYRFRLAYGGGFKSCPINFSIDKHAIKLVALDGHIIQTETVTSIELGRGERADFILDANQAIGVYKIRVVADKSCQDDLEGEAELIYKNQDNKVLLKNEGSEDSTINRIFSTVASDNCVSDTVLCLDEIHGAEKLASELAEPVDEVLYVPFNYSTRQMSARRFESWGQTDGHRFTYPASPLLTQGSDVAPEAMCPKNTGEGGECVHVKYIPLHSTVELIMFDQGGESDHIFHLHGYSFYVTDVRQMDTKLEKETVMKMNQDGTLFPSKNLDDPVRKDTIVIPKFGVAALRFKADNPGYWMMRDERSAHWTRGLDFVLKVGDQRDFVKAPADFPKCGSYVGPEYFLI-