Monarch geneset OGS2.0

DPOGS206820
TranscriptDPOGS206820-TA2952 bp
ProteinDPOGS206820-PA983 aa
Genomic positionDPSCF300001 - 3712757-3722109
RNAseq coverage64x (Rank: top 67%)
Annotation
HeliconiusHMEL0116760.076.39% 
BombyxBGIBMGA012763-TA0.077.47% 
DrosophilaCG8193-PA0.055.82% 
EBI UniRef50UniRef50_Q9V5210.055.82%Phenoloxidase subunit A3 n=111 Tax=Endopterygota RepID=PRPA3_DROME
NCBI RefSeqNP_001037335.10.077.32%phenoloxidase subunit 1 precursor [Bombyx mori]
NCBI nr blastpgi|1129836670.077.32%phenoloxidase subunit 1 precursor [Bombyx mori]
NCBI nr blastxgi|1129836670.077.32%phenoloxidase subunit 1 precursor [Bombyx mori]
Group
Gene OntologyGO:00068102.2e-61transport
GO:00053442.2e-61oxygen transporter activity
KEGG pathwaydme:Dmel_CG426400.0 
 K00505 (E1.14.18.1)maps-> Riboflavin metabolism
    Betalain biosynthesis
    Isoquinoline alkaloid biosynthesis
    Tyrosine metabolism
    Melanogenesis
InterPro domain[1-699] IPR0137880Arthropod hemocyanin/insect LSP
[147-418] IPR0089228.6e-108Uncharacterised domain, di-copper centre
[420-674] IPR0052031.1e-84Hemocyanin, C-terminal
[422-675] IPR0147568.2e-84Immunoglobulin E-set
[149-414] IPR0008962.2e-61Hemocyanin, copper-type
[37-145] IPR0052042.4e-39Hemocyanin, N-terminal
Orthology groupMCL10066 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206820-TA
ATGTCGAACGCCAAAGAAAACCTTCTGCTTTTCTTTGACCGCCCCACCGAGCCCTGCTTTATGCAGAAAGGCGAAGAAAAGGCTACCTTCGATTTGCCTGCTAACTACTACCCCGATAAATACAAGTCTGCGAGTGCGGCTTTGGCCAATCGCTTTGGTTCAGAGTCCAATCGACGTATTCCAGTAAGAAACATAGCACTGCCCAACCTCTCCTTGCCCATGGAACTGCCTTACAACGACCAGTTCTCACTCTTCGTTCCTAAGCATAGGCAGATGGCTGGCAAACTCATTGATATTTTTATGAGTATGCGTAACGTTGAAGACCTCATGTCCATATGCTCGTACTGTCAAATGAGAATAAATCCTTACATGTTCAACTACTGCCTCTCAGTCGCCATATTGCACAGGGATGATACCAAGGGCCTGAACATACCAACCTTCGCGGAGACCTTCCCCGATAAGTTCATGGACCCGCGTGTCTTCAGAAAAGCCCGTGAGGTCAGCACCGTTGTGCAGCCTGGAAACAGACTGCCGGTAGTGATCCCACAGAACTACACCGCAGCTGAGTTTGAGCCGGAACAGCGAGTGGCGTACTTCCGAGAGGATATCGGCCTGAACCTACACCACTGGCACTGGCATCTTGTTTACCCTTTTGATGCAGCAGACAGAAGCATTGTCAACAAGGACCGCAGGGGGGAGCTGTTCTACTATATGCATCAGCAGATTGTTGCAAGGTATAGCGTGGAGCGCATGTGCAATGATCTGTCTCGCCCAAAACGCTACAGTGACTTTCGTGAGCCCATCACTGAAGGATACTTCCCCAAACTGGATTCGCAGGTCGCTAGTAGAGCGTGGCCGCCGCGTTTCGCTGGTTCAAAAATTCGTGACCTTGACCGTCCAGTGGACCTGATCGCAGCGGATGTGTCCCAATTAGAGACGTGGAGGGATAGGTTTCTACAAGCCATTGATGATATGGCCGTTTTACTACCAAACGGTCGTAAAATGACTTTGGATGAGGACACCGGGATTGACGTTCTTGGTAACCTGATGGAGTCATCCATTATCAGTCGCAACCGAGCGTTTTATGGGGACTTTCACAATATGGGACACGTGTTCATCAGTTATTCCCATGATCCAGACCATAGGAATCTGGAACAATTTGGAGTGATGGGCGACTCAGCAACAGCTATGCGTGATCCTGTATTCTATCGCTGGCACGCGTACATCGATGACATCTTCCAGCTCTACAAGAACAAGCTAGCACCATATCCCAATGACAGGCTTGATTTTCCCGGTATCCAAGTGCTATCCGTTAGTACTTCGTCGGGCGCGGGACCGGACCGACTATTAACGCAATGGGAACAGAGCACAATGGAGCTAGGGCGAGGTTTAGATTTTACACCACGTGGCTCCGTTCTCGCAACTTTCACACATCTACAACATGACGAATTTAATTATCTCATCGAGGTTAATAACACAAGTGGGGCGGGCGTAATGGGTACGGTGCGTTTGTTCATGGCGCCTGTCGTCGATGAAAATGGAACTCCCCTGACCTTCGACGAACAGCGGAAGCTGATGATGGAGTTAGACAAATTCACACACGCCATACCATCTGGTTCGTCAACCATCCGTCGCTCGAGCACTCAGTCATCTGTTACCATCCCATACGAGCGTACTTTCCGCTCCCAGGGTTCTCGGCCAGGCGATCCTGGTTCTGTAGAGGCGGCCGAGTTCGACTTCTGCGGCTGCGGCTGGCCGCACCACTTGCTACTGCCGAAAGGAACCACAAGGGGGTACCCAGTTGTACTATTCTGTATGATCTCCAATTGGAATAACGACGGAGTAGCTCAAGACCTAGTGGGTAAATGTAATGATGCAGCCTCGTATTGTGGTATCCGTGACCGCAAGTACCCAGACCGCCGTGCCATGGGCTTTCCCTTCGATCGTCCTTCCCCTGCCAGCTTGCTTCAGGATTTCCTTACTCCTAATATGGCCACCAAACCCTGTACCATCCTTTTCAGTGACAACGTTAGGATACTGGTCTCATTGTTTCTCAGCATAGACGCAAAGGACTCTAAGGTCGTTAAGGGTGTAAAGGATGTTAAGAGTGTTAAGGGTGTAAAAGATGTTAAGGATGCCAAGAAAATAATTCCAGCGGAATCTGCTTCAAATCCGATTTACGGATCAGAGCAAAAGGGCAACTATGGCATTGGAGAAAACAAAGGATATAAATCAAGGGAAAGGTACAGAAGTTCAGGTTTGCTATCCCAAAATTATGGTAATAACTATGGGTATGGCTACAATAACGGTAGAAATAATTACGAATATGGAAATAATTATGGAAATGGCAATAATAATTATGATTATGGAAACAATAATGAATATGGAAATAGTAACTACAGGTCTGGTAATAACAAGAATTATGGAGACGATTATGGATATGGTCAGGATAATATCGGATATGGCATCAACAAAAATGGATACGGTGTAAGTGACTACGGGTACGGTGGTTATAATTATGGATATGGCAATAATAATTATGGATATGGCAATAATAATTATGGATATGGCAATAATAATTATGGATACGGCATTAGAAGTAATGAATATGGCCATAATAACTATGGAAACAGCAATAAAAATAATGGATATGGCAATAATAATTATGGATATGGAAATATTAATTACGGATATGGCAACAATGATTACGGTTATGACAACAACAATTACGGAGACGATGACTATAAAAGGAAATTGAGAGGACCAGATGAGATTGGTTATTTTGCCAGAGACCCAGCAAGTAGACGTCTCTATGGCAGACGAAATTACCAATATAGATTGATCGGCAGGCATGGACTAGGATTTGACAAGCGGACCGTTAACGCCTACAGTTACGTGGCCCCTATAACTCCTGAGCGGTTGGAAATAATTGGAAACCCTTACAGAAATTAG

Protein sequence:

>DPOGS206820-PA
MSNAKENLLLFFDRPTEPCFMQKGEEKATFDLPANYYPDKYKSASAALANRFGSESNRRIPVRNIALPNLSLPMELPYNDQFSLFVPKHRQMAGKLIDIFMSMRNVEDLMSICSYCQMRINPYMFNYCLSVAILHRDDTKGLNIPTFAETFPDKFMDPRVFRKAREVSTVVQPGNRLPVVIPQNYTAAEFEPEQRVAYFREDIGLNLHHWHWHLVYPFDAADRSIVNKDRRGELFYYMHQQIVARYSVERMCNDLSRPKRYSDFREPITEGYFPKLDSQVASRAWPPRFAGSKIRDLDRPVDLIAADVSQLETWRDRFLQAIDDMAVLLPNGRKMTLDEDTGIDVLGNLMESSIISRNRAFYGDFHNMGHVFISYSHDPDHRNLEQFGVMGDSATAMRDPVFYRWHAYIDDIFQLYKNKLAPYPNDRLDFPGIQVLSVSTSSGAGPDRLLTQWEQSTMELGRGLDFTPRGSVLATFTHLQHDEFNYLIEVNNTSGAGVMGTVRLFMAPVVDENGTPLTFDEQRKLMMELDKFTHAIPSGSSTIRRSSTQSSVTIPYERTFRSQGSRPGDPGSVEAAEFDFCGCGWPHHLLLPKGTTRGYPVVLFCMISNWNNDGVAQDLVGKCNDAASYCGIRDRKYPDRRAMGFPFDRPSPASLLQDFLTPNMATKPCTILFSDNVRILVSLFLSIDAKDSKVVKGVKDVKSVKGVKDVKDAKKIIPAESASNPIYGSEQKGNYGIGENKGYKSRERYRSSGLLSQNYGNNYGYGYNNGRNNYEYGNNYGNGNNNYDYGNNNEYGNSNYRSGNNKNYGDDYGYGQDNIGYGINKNGYGVSDYGYGGYNYGYGNNNYGYGNNNYGYGNNNYGYGIRSNEYGHNNYGNSNKNNGYGNNNYGYGNINYGYGNNDYGYDNNNYGDDDYKRKLRGPDEIGYFARDPASRRLYGRRNYQYRLIGRHGLGFDKRTVNAYSYVAPITPERLEIIGNPYRN-