Monarch geneset OGS2.0

DPOGS201819
TranscriptDPOGS201819-TA2079 bp
ProteinDPOGS201819-PA692 aa
Genomic positionDPSCF300145 + 323070-328744
RNAseq coverage1271x (Rank: top 10%)
Annotation
HeliconiusHMEL0083420.077.60% 
BombyxBGIBMGA013115-TA0.070.72% 
DrosophilaCG8193-PA0.050.66% 
EBI UniRef50UniRef50_D9J0440.074.03%Prophenoloxidase 2 n=4 Tax=Neoptera RepID=D9J044_PIERA
NCBI RefSeqNP_001037534.10.070.72%phenoloxidase subunit 2 precursor [Bombyx mori]
NCBI nr blastpgi|3003904880.074.03%prophenoloxidase 2 [Pieris rapae]
NCBI nr blastxgi|3054304870.073.59%prophenoloxidase [Pieris rapae]
Group
Gene OntologyGO:00068101.6e-282transport
GO:00053441.6e-282oxygen transporter activity
KEGG pathwaydme:Dmel_CG426400.0 
 K00505 (E1.14.18.1)maps-> Riboflavin metabolism
    Betalain biosynthesis
    Isoquinoline alkaloid biosynthesis
    Tyrosine metabolism
    Melanogenesis
InterPro domain[1-684] IPR0137881.6e-282Arthropod hemocyanin/insect LSP
[149-417] IPR0089226.5e-97Uncharacterised domain, di-copper centre
[422-680] IPR0052033.3e-90Hemocyanin, C-terminal
[422-681] IPR0147561.7e-84Immunoglobulin E-set
[151-414] IPR0008961e-56Hemocyanin, copper-type
[37-147] IPR0052042.7e-26Hemocyanin, N-terminal
Orthology groupMCL10066 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201819-TA
ATGGCGAACATTGTTACAGCTTTGAAGTTGTTGTTTGACCGTCCTAATGAACCCATGGTTTCACCCAAGGGTGACAATCAAGTTGTCTTTCAACTCACAGAACAGCATCTCGACGATAAATACAAGAGCAATGGCATCGAGATCAATAATCGTTTTGGGAAGGACAAACCAATTATCCCGTTAAAGGAACTAAAGACACTTCCTCAGTTTCCAAAAGCTAAACGGCTGCCAAGCGATGCCGATTTCTCTATTCTTCTGCCTGCCCACCAGGAGATGGCTGATGAGGTCATTGATGCCCTTCTAGCGGTGCCTGAAAACCAACTACCCGAATTTCTATCGACATGCGTTTATGCGCGTGTGAATCTGAATCCTCAGTTATTTAACTACTGCTACTCTGTGGCTTTGTTGCACAGGAAGGACACTAAAAATGTTCCACTTCAAAACTTCGCTGAGACCTTCCCGTCTAAGTTCGTTGATTCGAAGTTTTTCAGTCAAGCACGCGAATCCGCCGCCCTTGCCAAACAAGGAGCTCCGCGTGTGCCAATAATAATCCCGCGCGACTTTACCGCAAACGACTTAGACATTGAACACAGACTCGCTTACTGGCGCGAAGACATCGGAATCAACCTTCACCACTGGCATTGGCATCTGGTGTACCCATTCAGCGCAACTAAAAGAGAAATTGTGGCTAAGGACCGTCGTGGCGAACTCTTCTTCTACATGCACCAGCAAGTCATAGCCCGATACAACACGGAACGCCTGGCTAACCAGCTCGCACGTGCTAAGAAGTTCAGTGACTTCACGGAACCGACTCCTGAGCCGTACTATCCTAAATTGGACAGTCTCACATCGTCCCGCAGCTACCCGCCGCGGCAGGCCAACATGAGGTGGTCGGATCTCAACAGGCCCGTCGATGGTCTCGTGGTCACCATCGCCGATATGAACCGCTGGAAGAGGAACCTCGAAGAGGCCATCGCCACGGGCATGGTCAAACTGCCAAATGGCTCGACCCAGCCCCTGGACATAGACACTCTGGGGAACATGGTGGAGTCGAGCATACTGTCACCGAACAGAGATTACTACGGAACCTTGCACAACAACGGACACAGCTTCGCTGGATACTTGCACGATCCTGACCACAGATATCTGGAATCCTTCAACGTAATAGCTGACGAGGCGGTGAATATGCGAGATCCCTTCTTCTACCGCTGGCACGCGTTCATTGACGACCTTTTCCAGAAGTTCAAAGAGAGCAACAACGTGAGACGATACACGAGATCGGAGCTTTCGAACCCGGGGGTGCAAATCACGAATGCCAAGATCGTGAACAGCAATGGCGCCGCGGACAACACTCTACACACGTACTGGATGCAAAGCGACGTCGATCTGTCGCGCGGACTCGACTTCTCGGACCGCGGGCCGGTGTACGCGAGGTTCACTCACCTCAACTACAGGCCGTTCAGATATGTCATCGACGTAGACAACACGGGCAGCGCTCGCCGGACAACGGTCCGCATCTTCATAGCCCCCAAGTTTGATGAACGTGGCTTGCCGTGGATACTATCCGACCAACGCAAAATGTTCATCGAGATGGACAGATTCGTTGTGCCCTTGAACGCCGGCAAGAATGTCATAACACGTGAGTCTACCGAATCCTCGCTGACTATCCCCTTCGAGCAGACCTTCCGCGACCTCTCCAGCCAGGGAAGCGACCCTCGACGTGAGGACCTCGCTAGCTTCAATTTCTGCGGCTGTGGGTGGCCCCAACACATGCTTGTACCACGTGGCACTGAGAGCGGCATGCTGTTTGACTTTTTCGTTATGCTGTCAAACTACGACCTTGACGCCATAACTCAACCAGAAGGTGTTGCACCGCTGTCCTGTACAGAAGCTTCTAGCTTCTGTGGTCTGAAGGATCGTCTTTACCCCGACAAGCGCAACATGGGCTTCCCATTTGACAGACCTTCCAGCAGCGCTGCAAACATCCAGGACTTCATTCTGCCAAACATGTTCCTTGCTGATGTTAGCATTCGTCTACAAAACACCGTGGAAATAAATCCCAGAAATGCTAAAAACTAA

Protein sequence:

>DPOGS201819-PA
MANIVTALKLLFDRPNEPMVSPKGDNQVVFQLTEQHLDDKYKSNGIEINNRFGKDKPIIPLKELKTLPQFPKAKRLPSDADFSILLPAHQEMADEVIDALLAVPENQLPEFLSTCVYARVNLNPQLFNYCYSVALLHRKDTKNVPLQNFAETFPSKFVDSKFFSQARESAALAKQGAPRVPIIIPRDFTANDLDIEHRLAYWREDIGINLHHWHWHLVYPFSATKREIVAKDRRGELFFYMHQQVIARYNTERLANQLARAKKFSDFTEPTPEPYYPKLDSLTSSRSYPPRQANMRWSDLNRPVDGLVVTIADMNRWKRNLEEAIATGMVKLPNGSTQPLDIDTLGNMVESSILSPNRDYYGTLHNNGHSFAGYLHDPDHRYLESFNVIADEAVNMRDPFFYRWHAFIDDLFQKFKESNNVRRYTRSELSNPGVQITNAKIVNSNGAADNTLHTYWMQSDVDLSRGLDFSDRGPVYARFTHLNYRPFRYVIDVDNTGSARRTTVRIFIAPKFDERGLPWILSDQRKMFIEMDRFVVPLNAGKNVITRESTESSLTIPFEQTFRDLSSQGSDPRREDLASFNFCGCGWPQHMLVPRGTESGMLFDFFVMLSNYDLDAITQPEGVAPLSCTEASSFCGLKDRLYPDKRNMGFPFDRPSSSAANIQDFILPNMFLADVSIRLQNTVEINPRNAKN-