Monarch geneset OGS2.0

DPOGS206620
TranscriptDPOGS206620-TA669 bp
ProteinDPOGS206620-PA222 aa
Genomic positionDPSCF300048 - 813623-822414
RNAseq coverage368x (Rank: top 32%)
Annotation
HeliconiusHMEL0020547e-2748.76% 
BombyxBGIBMGA008515-TA3e-3956.15% 
DrosophilaJon74E-PA7e-3336.22% 
EBI UniRef50UniRef50_C9W8F62e-3255.38%Serine protease 17 n=1 Tax=Mamestra configurata RepID=C9W8F6_9NEOP
NCBI RefSeqNP_648994.12e-3136.22%jonah 74E [Drosophila melanogaster]
NCBI nr blastpgi|1717408912e-3557.25%chymotrypsin [Helicoverpa armigera]
NCBI nr blastxgi|2973407682e-3657.25%chymotrypsin [Helicoverpa armigera]
Group
Gene OntologyGO:00038241.2e-32catalytic activity
GO:00042527.8e-20serine-type endopeptidase activity
GO:00065087.8e-20proteolysis
KEGG pathway 
InterPro domain[92-222] IPR0090031.2e-32Peptidase cysteine/serine, trypsin-like
[108-218] IPR0012547.8e-20Peptidase S1/S6, chymotrypsin/Hap
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206620-TA
ATGGCTGTCATCACTAACGAAATATGCAACGTCGCTGTCTTCGGTGACATTCAATCATCGAACATTTGCACGAACACCCTCGGTGATAAGAGCACTTGCCGTGGTGACTCTGGTGGCCCACTTGTTGTCCAAAGACGTGTCAGGACTGTTCTGACCCCTAAAGTACAAAGAGCTAAGTTAATTATTAGACTATCAGAATATTTGAGAGGCCACGCTTATGAAGACGGGGAGGATCGAGTTCTAAACGGTTGTTTGATTGACACTCCCTTGTATACCATCGGAACCGTTTCTTTGCCCACTGGATTAGAAATCTTTGACGACTTCAACGGTCAAGTCGCTATTGCCTCTGGCTATGGACTTACACAAGACGGCGGCAGTATCAGCAACAGCCAATTCTTAAGTTACGTCAATATGTCAGTCATTACAAATGAAGTTTGCAACATCGCTTTCTTCGGTAACATTCGGCCTTCGAACATTTGCACCAGCACCCAAGGTGGTAAGAGCACTTGCCGTGGTGACTCTGGTGGTCCACTTGTTGTCCAAAGACGTGACAGAACTGTTCTGGTTGGCGTTACCTCATTTGGAATTGCTTTTGGTTGCGAAATCGGATGGCCAGCAGCCTTTTCAAGAATTACATCATTCCTTGGATTTATTAATGACAATTTATAA

Protein sequence:

>DPOGS206620-PA
MAVITNEICNVAVFGDIQSSNICTNTLGDKSTCRGDSGGPLVVQRRVRTVLTPKVQRAKLIIRLSEYLRGHAYEDGEDRVLNGCLIDTPLYTIGTVSLPTGLEIFDDFNGQVAIASGYGLTQDGGSISNSQFLSYVNMSVITNEVCNIAFFGNIRPSNICTSTQGGKSTCRGDSGGPLVVQRRDRTVLVGVTSFGIAFGCEIGWPAAFSRITSFLGFINDNL-