Monarch geneset OGS2.0

DPOGS213975
TranscriptDPOGS213975-TA1314 bp
ProteinDPOGS213975-PA437 aa
Genomic positionDPSCF300306 + 63476-66661
RNAseq coverage116x (Rank: top 58%)
Annotation
HeliconiusHMEL0153754e-14558.39% 
BombyxBGIBMGA013049-TA2e-4138.02% 
DrosophilaCG9649-PA1e-2629.41% 
EBI UniRef50UniRef50_Q5MPB91e-9546.58%Hemolymph proteinase 16 n=1 Tax=Manduca sexta RepID=Q5MPB9_MANSE
NCBI RefSeqXP_318412.47e-5241.20%AGAP003960-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|564184114e-9546.58%hemolymph proteinase 16 [Manduca sexta]
NCBI nr blastxgi|564184111e-9346.94%hemolymph proteinase 16 [Manduca sexta]
Group
Gene OntologyGO:00038242.4e-68catalytic activity
GO:00042521.6e-51serine-type endopeptidase activity
GO:00065081.6e-51proteolysis
KEGG pathway 
InterPro domain[165-436] IPR0090032.4e-68Peptidase cysteine/serine, trypsin-like
[175-431] IPR0012541.6e-51Peptidase S1/S6, chymotrypsin/Hap
[206-221] IPR0013142.8e-12Peptidase S1A, chymotrypsin-type
Orthology groupMCL19073 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213975-TA
ATGAAAGTATTAAATATTAATTCAATAGTAATAATAATTTATTGTGTATTAAATGTAGAAACTATAAAATTTGCTAGACGTGTTCGGAAACGAGCGTCCTACGATGCATTTCCATGTAGTGATAGTGAAAAAATAAAATCGTATTTCTCTGCTGGATTAGTAGATGGTTATAATATAAAAATTGATGAGGAGGTCGTTCCTGGAACACTCATTAAATTAAAATTTGATTCTGAAGCGGCAGTTACGTTGGATAAGGCGTTCGCAAATTTTTATAATAATAAAGATATCATAAACAATATTTATGATATAATATTGTTTCGATTTAATAATACTTTCGTCCTGAATGTTAAAGGACAACCTGCTCCATACCCTCCGCCTTATTTGACGAGCATTAAAATAAATGGCAATGAATTATGTAGCCAGCCTAATTTGACATACTTTGAGGATTATCCGGTTGGTGTTGTGAGACGTCCCAACGTTCCTGATCGATTTTGTGGAAGACGTAAAGTCATTCATAGTGAGCTTATAACAAATGGATTGAAAACTAAACCAGGGGAATGGCCTTTTCATGCTGCACTACACCGTCGGGAAAAAATGGGTCTAAGATACACTTGTGGAGGGACATTAATTTCCAAGTTTTTTGTTTTAACAGCTGCCCATTGCACTACCGTTAGAGGAGTAGCCATTTTACCTGAAATCTTTAGTGTTTTTCTGGGAAAATATAATTTGTTTGGTGGTGACGTGTCAGTACAAGAAAAAGAGGTTTACAAGGTGTATGTTCATGACGAATTTACGTACAGGACTCTGGATAATGACATCTCTCTATTGAAATTGAAAACTGAAGCCGTATATGATAATTACGTGCAACCAGCTTGTTTATGGTTCAACAACGTGTACGATCAGCTACCTTCATCGCAAATTCAGGGCACGGTACCAGGTTGGGGTTTTGATATAACTGACTCCTTGTCTCCGACTCTCCACGCAGCCAGCATGCCTTTAGTTCCTGACAGGACTTGTGAATTAACCAATCCTTTATTTTATGTACAAGCTCTGCGTACTGCTAAAAAATTCTGTGCCGGCTATACAAATGGAACCTCTGCATGTAATGGTGATAGCGGAGGTGGTTTTCACGTCTTTGTTCCTGATTTAGCAAAAAGCAATATCCCAGACGTACCCGGAGCTTGGTATATAAGAGGCATTGTGTCCACCAGCTTATCGAGAACTGATGCTGCTATTTGTAATCCCAAAGCTTACGCCGTATTCACAGACGTTGAAAAATATCTAGATTGGATAAATATTTACGTAAATTCATAG

Protein sequence:

>DPOGS213975-PA
MKVLNINSIVIIIYCVLNVETIKFARRVRKRASYDAFPCSDSEKIKSYFSAGLVDGYNIKIDEEVVPGTLIKLKFDSEAAVTLDKAFANFYNNKDIINNIYDIILFRFNNTFVLNVKGQPAPYPPPYLTSIKINGNELCSQPNLTYFEDYPVGVVRRPNVPDRFCGRRKVIHSELITNGLKTKPGEWPFHAALHRREKMGLRYTCGGTLISKFFVLTAAHCTTVRGVAILPEIFSVFLGKYNLFGGDVSVQEKEVYKVYVHDEFTYRTLDNDISLLKLKTEAVYDNYVQPACLWFNNVYDQLPSSQIQGTVPGWGFDITDSLSPTLHAASMPLVPDRTCELTNPLFYVQALRTAKKFCAGYTNGTSACNGDSGGGFHVFVPDLAKSNIPDVPGAWYIRGIVSTSLSRTDAAICNPKAYAVFTDVEKYLDWINIYVNS-