Monarch geneset OGS2.0

DPOGS200017
Transcript         DPOGS200017-TA   1833 bp
Protein            DPOGS200017-PA   610 aa
Genomic position   DPSCF300225 + 194542-199629
RNAseq coverage    1164x (Rank: top 11%)
Annotation
Heliconius       HMEL011677         0.0   78.98%
Bombyx           BGIBMGA012763-TA   0.0   79.54%
Drosophila       CG8193-PA          0.0   59.07%
EBI UniRef50     UniRef50_Q9V521    0.0   59.07%   Phenoloxidase subunit A3 n=111 Tax=Endopterygota RepID=PRPA3_DROME
NCBI RefSeq      NP_001037335.1     0.0   79.38%   phenoloxidase subunit 1 precursor [Bombyx mori]
NCBI nr blastp   gi|112983667       0.0   79.38%   phenoloxidase subunit 1 precursor [Bombyx mori]
NCBI nr blastx   gi|112983667       0.0   79.77%   phenoloxidase subunit 1 precursor [Bombyx mori]
Group
Gene Ontology     GO:0006810   6.9e-304   transport
                  GO:0005344   6.9e-304   oxygen transporter activity
KEGG pathway      dme:Dmel_CG42640   0.0
                  K00505 (E1.14.18.1) maps->
                      Riboflavin metabolism
                      Betalain biosynthesis
                      Isoquinoline alkaloid biosynthesis
                      Tyrosine metabolism
                      Melanogenesis
InterPro domain   [1-609]     IPR013788   6.9e-304   Arthropod hemocyanin/insect LSP
                  [74-345]    IPR008922   1.8e-108   Uncharacterised domain, di-copper centre
                  [347-601]   IPR005203   2.2e-87    Hemocyanin, C-terminal
                  [349-602]   IPR014756   2e-85      Immunoglobulin E-set
                  [76-341]    IPR000896   1.4e-61    Hemocyanin, copper-type
                  [3-72]      IPR005204   1.7e-31    Hemocyanin, N-terminal
Orthology group   MCL10066   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200017-TA
ATGGAATTAGGTTACAATGAACAGTTTTCACTCTTTGTGCCCAAGCACAGGAAGATGGCTGGCAACCTCATTGACATTTTTATGAATATGCGTAATGTGGATGACCTAGTATCAGTGTGTTCTTATTGTCAGATGAGGATCAATCCTTATATGTTTAACTACTGCCTTTCAGTCGCCATATTACACAGGGATGACACAAAGGGTTTGAATATCCCAACCTTCGCGGAGACTTTCCCTGACAAGTTCATGGACCCTCGCGTTTTCCGAAAGGCTCGCGAAGTCAGCACCGTCGTACTACCTGGAAATAGGCTGCCAGTTGTTATACCTCAAAACTATACAGCATCGGATTCTGAGCCAGAGCAACGTGTAGCTTACTTCCGTGAAGATATCGGACTGAATTTGCACCATTGGCATTGGCACCTCGTGTACCCCTTCGACGCAGCTGACAGGAGCATCGTTGATAAGGACCGAAGGGGGGAACTGTTCTATTACATGCACCAGCAAATCATCGCTAGGTATAGTGTAGAGCGCATGTGCAATGGTTTATCACGTCCAAAACGATACAACAACTTCCGGGAGCCCATCGCCGAAGGTTATTTCCCCAAACTGGACTCACAAGTAGCCAGTCGTGCGTGGCCTCCACGCTTTGCTGGTTCAACCATCCGTGACCTAGACCGTCCCGTGGACCAAATCAGAGCGGATGTGTCTGAGTTGGAAACATGGAGAGACAGATTCATACAGGCTATTGAAGATATGGCGGTCCTTCTGCCTAATGGCCGTAAAGTGCCATTGGACGAAGAGACAGGTATGGATGTCCTCGGAAACCTGATGGAGTCTTCTATCATTAGCCGAAACCGTGGATTCTATGGAGACCTTCACAATATGGGACATGTTTTCATCAGCTACTCCCACGATCCTGACCACAGAAATCTAGAACAATTTGGTGTGATGGGCGACTCTGCAACGGCTATGCGTGATCCCGTGTTCTATCGTTGGCACGCCTATATCGATGACATCTTCCAACTGTACAAGAACAAACTAACGCCGTACTCCAATGATAAGTTTGACTTCCCTGGCATCCGTGTGCAATCCGTCGGCATTTCTTCCGGCTCGGGTCCGGACCGTCTATCGACGCAGTGGGAGCAGAGCACATTAGAGCTTGGAAGAGGGCTGGATTTCACGCCACGTGGTTCAGTGCTCGCAAAGTTCACGCATCTGCAACATGATGAATTCAATTATGTCATTGAAGTTAATAATACGAGTGGAGCTGGAGTGATGGGAACAGTTCGTCTGTTTATGGCACCCGTCAACGATGAGACAGGAAAGCCTTTGAACTTCGATGAACAGAGGAGGCTAATGGTGGAGATGGACAAGTTTACACATGCCATCCCTGCTGGTTCATCAACCATCCGTCGCGCGAGCACTCAATCATCGGTAACTATTCCATATGAGCGCACATTCCGCGCTCAATCTTCACGCCCTGGAGACCCGGGTTCTGCAGAAGCTGCCGAGTTTGACTTCTGTGGTTGCGGTTGGCCTCACCACCTTCTTATACCCAAGGGTACCACCAGAGGATACCCAGTCGTGCTATTTTGTATGATCTCCAATTGGAATGATGACAGAGTGGTTCAAGACTTAGTTGGTACATGCAACGATGCAGCTTCCTACTGTGGTATCCGAGACCGAAAGTACCCTGATCGCCGACCAATGGGATTCCCCTTCGACCGTCCATCCCGAGCTAGCTCGCTCCAGGACTTTTTAACTCCCAATATGGCCACCAAGCCATGCACTATTGTCTTCAGTGACAACGTCAGGGTTCGTTCCGCCCGGTAA

Protein sequence:

>DPOGS200017-PA
MELGYNEQFSLFVPKHRKMAGNLIDIFMNMRNVDDLVSVCSYCQMRINPYMFNYCLSVAILHRDDTKGLNIPTFAETFPDKFMDPRVFRKAREVSTVVLPGNRLPVVIPQNYTASDSEPEQRVAYFREDIGLNLHHWHWHLVYPFDAADRSIVDKDRRGELFYYMHQQIIARYSVERMCNGLSRPKRYNNFREPIAEGYFPKLDSQVASRAWPPRFAGSTIRDLDRPVDQIRADVSELETWRDRFIQAIEDMAVLLPNGRKVPLDEETGMDVLGNLMESSIISRNRGFYGDLHNMGHVFISYSHDPDHRNLEQFGVMGDSATAMRDPVFYRWHAYIDDIFQLYKNKLTPYSNDKFDFPGIRVQSVGISSGSGPDRLSTQWEQSTLELGRGLDFTPRGSVLAKFTHLQHDEFNYVIEVNNTSGAGVMGTVRLFMAPVNDETGKPLNFDEQRRLMVEMDKFTHAIPAGSSTIRRASTQSSVTIPYERTFRAQSSRPGDPGSAEAAEFDFCGCGWPHHLLIPKGTTRGYPVVLFCMISNWNDDRVVQDLVGTCNDAASYCGIRDRKYPDRRPMGFPFDRPSRASSLQDFLTPNMATKPCTIVFSDNVRVRSAR-
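
Consistency check (illustrative sketch, not part of the database record):

The transcript and protein entries above can be cross-checked against each other. The sketch below assumes the transcript is a pure coding sequence with no UTRs (it starts with ATG and ends with TAA) and that the standard genetic code applies. The helper names `translate` and `lengths_consistent` are hypothetical and were chosen only for this example. Stop codons are written as "-" to match the trailing "-" in the protein record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons become `stop_symbol`; codons with ambiguous bases become "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        residues.append(stop_symbol if residue == "*" else residue)
    return "".join(residues)


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First eight codons and last four codons of DPOGS200017-TA.
print(translate("ATGGAATTAGGTTACAATGAACAG"))  # MELGYNEQ
print(translate("TCCGCCCGGTAA"))              # SAR-
print(lengths_consistent(1833, 610))          # True
```

The two translated fragments match the start (MELGYNEQ) and end (SAR-) of DPOGS200017-PA, and 1833 bp equals 3 x (610 + 1), so the reported lengths are consistent with a 610-residue protein followed by a single stop codon.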