Monarch geneset OGS2.0

DPOGS201678
Transcript        DPOGS201678-TA  1140 bp
Protein           DPOGS201678-PA  379 aa
Genomic position  DPSCF300103 + 492050-495542
RNAseq coverage   1080x (Rank: top 12%)
Annotation
Heliconius        HMEL011930        1e-126  57.37%
Bombyx            BGIBMGA009610-TA  3e-42   40.78%
Drosophila        CG6361-PB         3e-51   37.30%
EBI UniRef50      UniRef50_Q5MPC8   3e-111  51.73%  Hemolymph proteinase 6 n=1 Tax=Manduca sexta RepID=Q5MPC8_MANSE
NCBI RefSeq       XP_974337.2       6e-66   41.74%  PREDICTED: similar to hemolymph proteinase 6 [Tribolium castaneum]
NCBI nr blastp    gi|56418393       1e-110  51.73%  hemolymph proteinase 6 [Manduca sexta]
NCBI nr blastx    gi|56418393       3e-112  52.27%  hemolymph proteinase 6 [Manduca sexta]
Group
Gene Ontology     GO:0003824  5e-83    catalytic activity
                  GO:0004252  5.6e-73  serine-type endopeptidase activity
                  GO:0006508  5.6e-73  proteolysis
KEGG pathway      dpo:Dpse_GA19543  1e-49
                  K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)  maps-> Neuroactive ligand-receptor interaction
InterPro domain   [113-370]  IPR009003  5e-83    Peptidase cysteine/serine, trypsin-like
                  [127-365]  IPR001254  5.6e-73  Peptidase S1/S6, chymotrypsin/Hap
                  [158-173]  IPR001314  2.3e-12  Peptidase S1A, chymotrypsin-type
Orthology group   MCL16637  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201678-TA
ATGTATATCAAACTAATTGTTTACTGTAATTTATTAATATTAACAGTGTTTGCTGGCGAAGTTGATGACAAATGTACACCCAATGCAGCAATTAAAGAAGGAACTTGTAAACTGATAACTGATTGTGAGGTCGCTCTGAGGTCAATTAAAAAGAACCAAGGTCATCCTTATGCCAGATGCGGTTTTAGCAAGACATTTGAGATAGTCTGTTGTCCGAATGAAGTGGATTTACAACCTGCGGTCACAGTAAACCCTTTTAAAAGAACTACCTCAGTAAAACCAGAGGATAAATTTGGTGAAGTAGACACCAGAACAGTGAGAGTAGCTGATGAGGCCTGTAAACAAATCATCAGGAATCGTCTTCCTCCTCTCGGCTTGCATATAATCGGTGGCATTGAGGCTTCTCCTGGGGAATTCCCACATATGGTGGCTTTAGGGTACGGTGGACCGGATGTGTATGAGTTCAATTGTGGTGCTTCGCTGTTGTCAGAGCTGTATGCATTAACAGCAGCGCATTGCGTCGACACGCTCAATCAAATTAAACCAACTATAGCCCGTATGGGTGTCGTTGAACTCGGTGCGACGACATTTAATCCGAACACAGACTACAGGGTAGCAGATATATTGATACATCCTGATTACTTGAGACGCACTAAATATCATGATTTATCATTAGTAAGGATGGAAAGACCGGCTGAATTTGGTGTGAATATCGGCCCAATATGTTTGTATACAAATTTGCAAGATCCAACAACATCACTAACTGTTACTGGCTGGGGAAAAACCAGTATCACAAAGGAGGACAAGAGTGAGGTTTTGCTGAAAGCTAATGTCACTGTTGTGGCAAGAAGTAAATGTGGTCAATCTTATTCCAACTGGCGGAAGTTGCCTAGTGGCATCTTGAATGAACAAATATGTGCTGGAGATCCACAGGGACTGAGGGATACATGTCAGGGTGATTCTGGCGGTCCATTACAAGGTTTGGATGACCATGACGGCCAGTACCGTCTAGTGGGTGTGACGTCATTCGGCCGTGGTTGTGGGTCACCAGTGCCAGGGGTCTACACACGTGTTGCACACTACCTTGACTGGATAGAAAGTGTAGTATGGCCGCGTGGTTTAGACACATGGTCTAATTAA

Protein sequence:

>DPOGS201678-PA
MYIKLIVYCNLLILTVFAGEVDDKCTPNAAIKEGTCKLITDCEVALRSIKKNQGHPYARCGFSKTFEIVCCPNEVDLQPAVTVNPFKRTTSVKPEDKFGEVDTRTVRVADEACKQIIRNRLPPLGLHIIGGIEASPGEFPHMVALGYGGPDVYEFNCGASLLSELYALTAAHCVDTLNQIKPTIARMGVVELGATTFNPNTDYRVADILIHPDYLRRTKYHDLSLVRMERPAEFGVNIGPICLYTNLQDPTTSLTVTGWGKTSITKEDKSEVLLKANVTVVARSKCGQSYSNWRKLPSGILNEQICAGDPQGLRDTCQGDSGGPLQGLDDHDGQYRLVGVTSFGRGCGSPVPGVYTRVAHYLDWIESVVWPRGLDTWSN-
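
The transcript and protein records above can be cross-checked by translating the coding sequence. The following is a minimal sketch, assuming the standard genetic code and that the transcript is a complete in-frame coding sequence; the function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database. The stop codon is written as "-" to match the trailing character of the protein record.

```python
# Standard genetic code, with codons enumerated over bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, one residue per codon.

    Stop codons are emitted as `stop_symbol` (hypothetical parameter,
    chosen here to mirror the "-" at the end of the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# First six codons of DPOGS201678-TA give the first six residues of DPOGS201678-PA.
print(translate("ATGTATATCAAACTAATT"))  # MYIKLI
# Last three codons of the transcript, ending in the TAA stop codon.
print(translate("TCTAATTAA"))  # SN-
```

The lengths are consistent with this reading: 1140 bp is 380 codons, that is, 379 residues plus one stop codon.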