Monarch geneset OGS2.0

DPOGS206561
Transcript: DPOGS206561-TA (1305 bp)
Protein: DPOGS206561-PA (434 aa)
Genomic position: DPSCF300108 - 666022-670677
RNAseq coverage: 582x (Rank: top 22%)
Annotation
Heliconius      HMEL004369        1e-136  52.83%
Bombyx          BGIBMGA013746-TA  3e-83   53.10%
Drosophila      ea-PA             1e-60   40.00%
EBI UniRef50    UniRef50_Q49QW0   1e-127  52.05%  Prophenol oxidase activating enzyme 3 n=5 Tax=Obtectomera RepID=Q49QW0_SPOLT
NCBI RefSeq     NP_001036832.1    1e-102  44.25%  prophenoloxidase activating enzyme [Bombyx mori]
NCBI nr blastp  gi|56718390       5e-127  52.05%  prophenol oxidase activating enzyme 3 [Spodoptera litura]
NCBI nr blastx  gi|56718390       1e-130  52.94%  prophenol oxidase activating enzyme 3 [Spodoptera litura]
Group:
Gene Ontology:
  GO:0003824  5.4e-81  catalytic activity
  GO:0004252  1.1e-76  serine-type endopeptidase activity
  GO:0006508  1.1e-76  proteolysis
KEGG pathway:
InterPro domain:
  [163-433]  IPR009003  5.4e-81  Peptidase cysteine/serine, trypsin-like
  [172-428]  IPR001254  1.1e-76  Peptidase S1/S6, chymotrypsin/Hap
  [23-74]    IPR022700  4.3e-13  Proteinase, regulatory CLIP domain
  [204-219]  IPR001314  2.1e-12  Peptidase S1A, chymotrypsin-type
  [23-75]    IPR006604  7.1e-09  Disulphide knot CLIP
Orthology group: MCL21029 (Lepidoptera specific)

Nucleotide sequence:

>DPOGS206561-TA
ATGTTAATACTATATGTGTTGTGTATATTAGTTTCTCAGTACGGTCAAGTTCATGGACAAAATAATTGCAATAATCCAAATGGAAAAATTGGTACATGTATACCGATCAAGAATTGCCAACCGTTGATAGATCTAGTCACAAAACAGGGTAGAACCGAAGTGGAGACACAGTATTTGATAAATTCAAGGTGCGGATCCCTCGATACTATAAAGGTGTGTTGTGTGACGGAGTCGAGGGAATTGGAAGATCCGAGATGCTTCAACCCTGACGGCAACCAGGGCATCTGCACTGAAGTCACGACCTGCCCGTCCATCACAAAACTCCTTATAACACCGATATCAGCAAGTAATCTAGAGTTTATTAGAAACTCAAGATGCCTAGGACCAGCAAAATATAGTGTATGTTGCGGGCCTGACCACGTGAAGAAAGCTGTAATGAAGAACTGCCAGCCGTCAGCAGCGCCTGCAGACACGCGAAGTGATTGCTGCGGTCTAGACGCTTTCTCGGGAAATAGAATCTACGGTGGGAACGACACCGCAATAGATCAGTATCCATGGATAACTCTGATAGAGTACAGGGATAAACACAATAGAATAAAACTACTCTGTGGAGGTGCTCTTATAAGTTCCAAGTACGTGTTGACGGCTGGGCATTGCGTCACCGGACCTGTTCTGAACATCGGACAACCAGAAAATGTAAGACTCGGTGAATACAACACAACGAATGACGGTCCGGATTGTGTTGAAATACCAATGGAAATGAAAGATTGTACAGACGGAGTGGTTGTGATACCGATCGAGAAGATTATACCACACGAGGGTTACAACCCGGAGTCCGTATTAAAGCGGGATGATATTGCTTTGATAAGAATGGCGAGCTATGCACCGTTTACAGATTTCATTAGACCGATCTGCCTTCCGACTTCGGATGTTACACTGTCTCAAAAGGATTTGGTTTTCTACGCCGCTGGTTGGGGAGCGGTATCGATCGACGAGAGATTCAGCGCTATTAAACTTCACGTTGATTTACCATTTAGACCGTTAGAGGAATGTAAAAAGGCCTACAACGTTTCCAGCAGAAAAATAGAACTATGGAATCGTCAGTTGTGTGCTGGAGGTGTGAAGGGGAAGGACACCTGCAGAGGAGACTCCGGTGGGCCTCTGATGTATGACAACGGCAGGTCTTACTCCGTCATAGGAGTCGTCAGCTTTGGCCCCTCGCCGTGCGGCTTAGAGAACGTGCCGGGAGTTTATACCAAGGTCTACGAATATCTGCCTTGGATAAGGACCAACATTAAACCCTGA

Protein sequence:

>DPOGS206561-PA
MLILYVLCILVSQYGQVHGQNNCNNPNGKIGTCIPIKNCQPLIDLVTKQGRTEVETQYLINSRCGSLDTIKVCCVTESRELEDPRCFNPDGNQGICTEVTTCPSITKLLITPISASNLEFIRNSRCLGPAKYSVCCGPDHVKKAVMKNCQPSAAPADTRSDCCGLDAFSGNRIYGGNDTAIDQYPWITLIEYRDKHNRIKLLCGGALISSKYVLTAGHCVTGPVLNIGQPENVRLGEYNTTNDGPDCVEIPMEMKDCTDGVVVIPIEKIIPHEGYNPESVLKRDDIALIRMASYAPFTDFIRPICLPTSDVTLSQKDLVFYAAGWGAVSIDERFSAIKLHVDLPFRPLEECKKAYNVSSRKIELWNRQLCAGGVKGKDTCRGDSGGPLMYDNGRSYSVIGVVSFGPSPCGLENVPGVYTKVYEYLPWIRTNIKP-
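
The transcript and protein records above can be cross-checked against each other: a 1305 bp coding sequence holds 435 codons, that is 434 amino acids plus one stop codon, which this record writes as a trailing "-". The following is a minimal sketch of that check in Python, assuming the standard genetic code and a coding sequence that starts in frame at its first base. The function names `translate` and `expected_protein_length` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumption:
# the nuclear standard code applies to this transcript).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of cds_bp bases."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - (1 if has_stop else 0)


# First eight codons and last three codons of DPOGS206561-TA.
print(translate("ATGTTAATACTATATGTGTTGTGT"))  # MLILYVLC
print(translate("AAACCCTGA"))                 # KP-
print(expected_protein_length(1305))          # 434
```

Passing the full DPOGS206561-TA sequence to `translate` and comparing the result with DPOGS206561-PA extends the same check to the whole record.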