Monarch geneset OGS2.0

DPOGS204147
TranscriptDPOGS204147-TA1476 bp
ProteinDPOGS204147-PA491 aa
Genomic positionDPSCF300034 - 1149927-1156822
RNAseq coverage766x (Rank: top 17%)
Annotation
HeliconiusHMEL0164812e-9563.37% 
BombyxBGIBMGA005173-TA3e-9356.23% 
Drosophilaea-PA5e-8044.99% 
EBI UniRef50UniRef50_Q8I9241e-11660.18%Prophenoloxidase activating factor 3 n=4 Tax=Obtectomera RepID=Q8I924_BOMMO
NCBI RefSeqNP_001036844.11e-11860.23%BzArgOEtase [Bombyx mori]
NCBI nr blastpgi|564183976e-12462.61%hemolymph proteinase 8 [Manduca sexta]
NCBI nr blastxgi|564183974e-12662.61%hemolymph proteinase 8 [Manduca sexta]
Group
Gene OntologyGO:00038246e-87catalytic activity
GO:00042522.5e-83serine-type endopeptidase activity
GO:00065082.5e-83proteolysis
KEGG pathway 
InterPro domain[228-490] IPR0090036e-87Peptidase cysteine/serine, trypsin-like
[235-485] IPR0012542.5e-83Peptidase S1/S6, chymotrypsin/Hap
[266-281] IPR0013142.4e-13Peptidase S1A, chymotrypsin-type
[166-209] IPR0227004.8e-07Proteinase, regulatory CLIP domain
Orthology groupMCL14995 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204147-TA
ATGAATTCATGCAAATTGTCTTGTATAAGGAAGCAGAATAGACCAGCAACAGATTCAGATGTTAGTGAAGAATGTCAATCCAAAAGGTCGTCCAATCCCGTTCCAGCTCCAGTTCCTGGCGCCTCATCAGAAGCGACTACTCCCGGGGCTTCAAATGTTCGAAATCCTTGTGGTGCACCAACATGTCCTCAAAGTTGTCCCGAGCCTAAAAATTCAGCAAAGGCGAGAGTAAATGCTATTTTGAATTACCTCAAAGAACTGTGGAAAAAAACTGATGATCCCTATAACTGGTCGCTGTTAAAAAGCGTCACCATGTTTATATTGGGCCTTAAACTTTTCGATGACCTCTACAAGCATATGGCGAAAGGTCAAAAGAGATGGGTTATGATTATAGCTGATCAGATAGACAATGAATCAAAACGTATAAGAGTGGAGGCAAAGTTTTGCTACAAGGAATGTCCTGCAGAATTATGGAACGCACTCTGTAAAAATGGGGCGAAATGTGTGACATTAGAAAACTGCTCATGGATTTTTGATAGCCTTCAAAATGGTTCAGATGTATTAATGGCTCTATTGAGAAGACTGCACTGCGGCTTCGACAAACATAATAATCCTAAGATTTGCTGTCCATCACAGTTCGAGATGAGAGGCGGTCTTGACTTGCTGCCAAACACTACTGTCTGTGGAATTCAAACCAATGATAGAATTGTTGGAGGACAACAGGCAGACCTTGACGAACATCCTTGGATGGTACTAATCAAATATGAAATACCTAAAGGAAGTACTTTTGCTTGTGGGGGAGTTTTAATATCACCTAGATATGTCATGACGGCAGCCCATTGTGTCAAAGGCTCAGATCTTCCACTGAACTGGAGACTATCGCAGGTACGACTTGGTGAATGGAACGTGGCTACTAAAACGGATTGCGTAAGAGATGATTGCAGCCCAGATCCATTAGATATTAATATTGAAGAGATAATCCCTCACGAAGACTATGACCCTGATAAGGGTCAGCAAAACGATATCGCGCTTTTGAGACTGGCACGGAATGTTGCATTCAATGATTTCGTGAAACCTATTTGTCTTCCTGTAAACTCAGCATTGAAGACGAGTACATTTGAGAATATTGATATGGAAGTAGCAGGTTGGGGGAAAACGGAAACAAAACTATCATCGGATATCAAATTGAAGGTTAAAGTCACTGTCAGAAATACAAACGATTGTAAAGAAATTTACGAACGGGCGAATCGCATAGTCACTGAGAAGCAACTTTGTGCTGGTGGTTTAGAGGGTCAGGATTCTTGTAGGGGAGATTCCGGTGGAGCACTTATGGGGCGAGTCGATGCTACTAAGAACTGGATGGCAGTTGGTGTAGTCTCCTATGGACCCTCACCTTGTGGTACAGCAGGATGGCCCGGCGTGTACACGAGAGTCACAGCTTTCACTGACTGGATAATGTCAAAATTGCAACCCTAA

Protein sequence:

>DPOGS204147-PA
MNSCKLSCIRKQNRPATDSDVSEECQSKRSSNPVPAPVPGASSEATTPGASNVRNPCGAPTCPQSCPEPKNSAKARVNAILNYLKELWKKTDDPYNWSLLKSVTMFILGLKLFDDLYKHMAKGQKRWVMIIADQIDNESKRIRVEAKFCYKECPAELWNALCKNGAKCVTLENCSWIFDSLQNGSDVLMALLRRLHCGFDKHNNPKICCPSQFEMRGGLDLLPNTTVCGIQTNDRIVGGQQADLDEHPWMVLIKYEIPKGSTFACGGVLISPRYVMTAAHCVKGSDLPLNWRLSQVRLGEWNVATKTDCVRDDCSPDPLDINIEEIIPHEDYDPDKGQQNDIALLRLARNVAFNDFVKPICLPVNSALKTSTFENIDMEVAGWGKTETKLSSDIKLKVKVTVRNTNDCKEIYERANRIVTEKQLCAGGLEGQDSCRGDSGGALMGRVDATKNWMAVGVVSYGPSPCGTAGWPGVYTRVTAFTDWIMSKLQP-