Monarch geneset OGS2.0

DPOGS212200
Transcript: DPOGS212200-TA | 1290 bp
Protein: DPOGS212200-PA | 429 aa
Genomic position: DPSCF300323 - 179198-180816
RNAseq coverage: 81x (Rank: top 64%)
Annotation
Heliconius: HMEL006840 | 3e-43 | 37.55%
Bombyx: BGIBMGA000993-TA | 1e-53 | 41.25%
Drosophila: -
EBI UniRef50: -
NCBI RefSeq: NP_001166090.1 | 5e-06 | 23.01% | serine protease 73 [Nasonia vitripennis]
NCBI nr blastp: -
NCBI nr blastx: -
Group
Gene Ontology: GO:0004252 | 2.5e-14 | serine-type endopeptidase activity
Gene Ontology: GO:0006508 | 2.5e-14 | proteolysis
Gene Ontology: GO:0003824 | 9e-14 | catalytic activity
KEGG pathway: -
InterPro domain: [24-223] | IPR001254 | 2.5e-14 | Peptidase S1/S6, chymotrypsin/Hap
InterPro domain: [10-236] | IPR009003 | 9e-14 | Peptidase cysteine/serine, trypsin-like
Orthology group: MCL26768 | Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212200-TA
ATGCGTCTTTTTTTGATCATCGCATTATGTGCCGATTTCACAGCGAAATTCGTGGTCGCCGATGGAGAATTCGAAGTTAAAAGAGGCGTATTCCCTTTTATGGCTTTCGTTTATTACCCGGACAAAACAGTTGTAGATAGCACTGGCATAAGACTTACGAGAGGTGCAGTGCTACTACGGCCCGACTGGCTAATTACTTCTTCTCTAGAAGATGGACAAACGTCGCTTGCTAACATTTTTCCATTACAGATAATTCAAATTGCACGTCCGCAAAACTACAGTGCAATGGAATGGTGGTTCGCTGATATATCGTTGTTGAAAACTCTGCTTCCTTTTAACATAACAACGGCCGTGGCTCCAGCAACTATCGACACTAAACCCAGGGACGCGGATAGGAATTGCCTAATACTAGTCTACGCTGCGCCCAATGGGAACGCAAGCGACGATAGGATGTTGATGCAACTTTCAGTAGAAATATTAGGCTCGTCACCGGAGAACTGCGGCAGCAATTTCATGAGGAGTATGATATGTGGAACCAATACTGATGACACGAGAAAATATCCAGGTTTCTGTGAGGGTAATAGTGGAGGTCCTCTAGTATGTGAAAATGACGTGATTGCTATACAAACGTACATTAATGATTGTAAGCCACCTCATAGATATCAAGTTTTGGGCGCTTGGGAAAATTTAATAACATGCGCCTTAGAAGACAAATGCAAAGAGGAGCAATGTGCTAGAATTTGCAGCATCATACACAAAGATTCAGATGACGTTCTCATTCCGGAGAAAACGTTATCAGATGAGGCAAAAATACATTCTGATGAGGAAAATATCTCTACCACTGACGTGCAAGAAGTAACAAGTACCTCTGTAGAAGGGAGAAGACTGAATCAAGAATATTCGCACTTTTTATCCTTTAACTCCGGCATGGTTCAAACTGATGCTGCGACAGCGAATTCTACGGAAGCGCACAATACTAAGGAGGCAGTTATCAGCAGTGAAGTTGATACTACAGAGAAGTCAACGTCCACTGCGAATGACCATACTGATGAAGAGCAGGATACAAGTACGAAGTCACCCAACGTGGCCGTCCCCATGAGTCCAGAAACAGAAAACATGCCCGTCTATAAAACCATGGAATCAGATAGCAGTGTGCCAGAAAACCTCATAGAAAATTCACAACAGGGGGAGCCACCGCAACGGCCACACAAAGTGAGAAACGAAGCCTCGGGTAACGTCAATTCATTCTATTTCATGTCTTTAATATTGGCTGTGATTAATTTTACATAG

Protein sequence:

>DPOGS212200-PA
MRLFLIIALCADFTAKFVVADGEFEVKRGVFPFMAFVYYPDKTVVDSTGIRLTRGAVLLRPDWLITSSLEDGQTSLANIFPLQIIQIARPQNYSAMEWWFADISLLKTLLPFNITTAVAPATIDTKPRDADRNCLILVYAAPNGNASDDRMLMQLSVEILGSSPENCGSNFMRSMICGTNTDDTRKYPGFCEGNSGGPLVCENDVIAIQTYINDCKPPHRYQVLGAWENLITCALEDKCKEEQCARICSIIHKDSDDVLIPEKTLSDEAKIHSDEENISTTDVQEVTSTSVEGRRLNQEYSHFLSFNSGMVQTDAATANSTEAHNTKEAVISSEVDTTEKSTSTANDHTDEEQDTSTKSPNVAVPMSPETENMPVYKTMESDSSVPENLIENSQQGEPPQRPHKVRNEASGNVNSFYFMSLILAVINFT-
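
The lengths stated in this record can be cross-checked with simple arithmetic: a 429 aa protein plus one stop codon corresponds to (429 + 1) x 3 = 1290 bp of coding sequence, which matches the listed transcript length. The Python sketch below illustrates that check. It assumes the transcript consists of coding sequence only (no UTRs), and the helper names `cds_length_matches` and `strip_stop` are hypothetical, introduced here for illustration only.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if transcript length equals 3 * (protein length + 1 stop codon).

    Assumption: the transcript is coding sequence only, with no UTRs.
    """
    return transcript_bp == 3 * (protein_aa + 1)


def strip_stop(protein: str) -> str:
    """Remove a trailing stop marker ('-' or '*') from a protein string."""
    return protein.rstrip("-*")


# Lengths as listed in the record: 1290 bp transcript, 429 aa protein.
print(cds_length_matches(1290, 429))  # True
```

The same check can be applied to the sequences themselves by passing `len(nucleotide_sequence)` and `len(strip_stop(protein_sequence))`, since the protein above ends with a `-` stop marker.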