Monarch geneset OGS2.0

DPOGS204361
TranscriptDPOGS204361-TA1044 bp
ProteinDPOGS204361-PA347 aa
Genomic positionDPSCF300040 + 34143-36988
RNAseq coverage115x (Rank: top 58%)
Annotation
HeliconiusHMEL0107791e-12360.18% 
BombyxBGIBMGA013049-TA4e-4732.34% 
DrosophilaCG31326-PA9e-3231.50% 
EBI UniRef50UniRef50_Q5MPB91e-9647.83%Hemolymph proteinase 16 n=1 Tax=Manduca sexta RepID=Q5MPB9_MANSE
NCBI RefSeqXP_002433014.12e-5840.36%hypothetical protein Phum_PHUM609370 [Pediculus humanus corporis]
NCBI nr blastpgi|564184114e-9647.83%hemolymph proteinase 16 [Manduca sexta]
NCBI nr blastxgi|564184113e-9447.83%hemolymph proteinase 16 [Manduca sexta]
Group
Gene OntologyGO:00038242e-76catalytic activity
GO:00042521.6e-54serine-type endopeptidase activity
GO:00065081.6e-54proteolysis
KEGG pathway 
InterPro domain[68-341] IPR0090032e-76Peptidase cysteine/serine, trypsin-like
[78-333] IPR0012541.6e-54Peptidase S1/S6, chymotrypsin/Hap
[109-124] IPR0013143.1e-14Peptidase S1A, chymotrypsin-type
Orthology groupMCL18565 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204361-TA
ATGGAACCTGGTGTGGTGTTTGGATTTTATTATTTTGCACAAGAAACTGGATTTAATTTTGTTGTAACTTCCTTCGTCCCTGGAATAGTTCCGTATATAACAAGCGTTATAATCAACAATGAGGAATATTGTCAATATCCAAATGTGGGATTCCTAAATCAATATGTTGTGGAAACGTTACTAAATGACGAAGAAAAGAACTGTGGTCGGCGTCAAGTACAAAGCACGCAGTTGATGGTTAATGGCGCTAATACTAAACCCGGAGACTGGCCATGGCACGTCGCTATTTATAAACAAGAAAGAAACATCATTAAGTATATCTGTGGTGGAACTCTTGTGTCTAAGAATTTCGTATTAACAGCCGCTCATTGTGTGTCAGTGAGGGGTTCTGCCTTGTTGCCAGACACGATAAGTGTTGTCCTTGGGAAATACAATTTATTTGGAGGTGATTTTGGGTCTGAAGAAAAGGAGGCTCATAGAGTAATAATTCACCAAAAATTTGAACACAGAACCCTAAACAACGATATAGCACTGATCGAGTTAAATACAGAAGTAACGTTTTCTGATTACATTCAACCTTCATGTCTGTGGTATAAGAAGGCGATCGAAAAATTACCAAGCAATCAAGTTATGGGAACGGTTGTAGGATGGGGTTTTGATAACAGTGGAACTCTTTCGCGTACACTTAAACAAGCTAAGATGCCGATTGTCTCGGATAACGTCTGTATCAGAAGTAAACCCTTATTTTATGCGAACATTTTGAATGGAAATAAATTTTGTGCTGGATTTCATAACGGAACATCTGCTTGCAACGGTGACAGCGGTGGAGCACTTGTGGTATTCGTACCAGATACGGCTGAGGATAATGACATAAGAGCTGAAGGAACTTGGCATGTTAAAGGCATTGTATCGATGACACTCTCTCAAAAAGATGTACCCGTATGCGATCCTGAACAGTACGTTGTGTTTACAGACGTTGAGAAATACAGAGTTTGGATAAAAAGCTACATCAAAGAAAAAGAAAGTGAAGATATTGATCAATAA

Protein sequence:

>DPOGS204361-PA
MEPGVVFGFYYFAQETGFNFVVTSFVPGIVPYITSVIINNEEYCQYPNVGFLNQYVVETLLNDEEKNCGRRQVQSTQLMVNGANTKPGDWPWHVAIYKQERNIIKYICGGTLVSKNFVLTAAHCVSVRGSALLPDTISVVLGKYNLFGGDFGSEEKEAHRVIIHQKFEHRTLNNDIALIELNTEVTFSDYIQPSCLWYKKAIEKLPSNQVMGTVVGWGFDNSGTLSRTLKQAKMPIVSDNVCIRSKPLFYANILNGNKFCAGFHNGTSACNGDSGGALVVFVPDTAEDNDIRAEGTWHVKGIVSMTLSQKDVPVCDPEQYVVFTDVEKYRVWIKSYIKEKESEDIDQ-