Monarch geneset OGS2.0

DPOGS209724
TranscriptDPOGS209724-TA885 bp
ProteinDPOGS209724-PA294 aa
Genomic positionDPSCF300105 + 12191-14980
RNAseq coverage11x (Rank: top 84%)
Annotation
HeliconiusHMEL0080163e-6382.71% 
BombyxBGIBMGA008938-TA9e-12371.53% 
DrosophilaCG6865-PA3e-5140.70% 
EBI UniRef50UniRef50_G6D0U86e-173100.00%Serine protease H42 n=2 Tax=Obtectomera RepID=G6D0U8_DANPL
NCBI RefSeqXP_966366.15e-6345.52%PREDICTED: similar to GA19914-PA [Tribolium castaneum]
NCBI nr blastpgi|3454945654e-6546.55%PREDICTED: transmembrane protease serine 9 [Nasonia vitripennis]
NCBI nr blastxgi|3454945657e-6546.82%PREDICTED: transmembrane protease serine 9 [Nasonia vitripennis]
Group
Gene OntologyGO:00038245.9e-84catalytic activity
GO:00042522.6e-75serine-type endopeptidase activity
GO:00065082.6e-75proteolysis
KEGG pathway 
InterPro domain[28-279] IPR0090035.9e-84Peptidase cysteine/serine, trypsin-like
[37-274] IPR0012542.6e-75Peptidase S1/S6, chymotrypsin/Hap
[64-79] IPR0013148.8e-12Peptidase S1A, chymotrypsin-type
Orthology groupMCL15905 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209724-TA
ATGGAACGCGTTTGTGTGTTCATTGTTATAGTTTGTTGTTGTGAGCTTGGATTGTGTTATATTAATCTGGCTGATGTTGATTGTGGTCTGCTGAACGCACGTAGTGGTCGAATAGTCGGTGGTACAAACAGTCTGCCTGCTGAGTTCCCCTGGGCAGCCAGTTTGTGGAGACAGGGGACCCATCAGTGTGGAGCTACTATCATCAATAATAGATGGCTGGTTACCGCTGGACACTGTGTTTGCAGTGTATTTGACGAGTTCTACAAATCAAAACAGTTAACAGTCGTAGCAGGTTACACTGATATATCAGCATCGGACAAAAATCAAAGGCTCTCCAAAATTATACCTCATCCGGATTACAGATGTAAAAAAAAAACGAACGACGTAGCTCTCCTTAAAACAGAACAACAACTGGTATGGACAAACGAACTGCGACCCGCATGTCTTCCTCGGGCTAAATCATCTGACTTTACTGGTAAAAGTGCCACCGTTGCTGGTTGGGGTTTCACCAATGAAGACAGAGGAATTGGTGAGAGACCGAACGTTTTGCAAAAAACCGAAGTTACTGTAGTAGAAAACGGCGAATGTAATAGTTGGTACGAATCCCAGGGTAGTAAAGTAAGAATTATTGCAACTCAGATGTGCGCTGGTTATAAACAAGGAGGACGGGATTCATGCTGGGCTGACAGTGGAGGTCCACTTATGCTGCAAGGCGAAAAGGGTCATACTATGCTTATTGGAGTGGTTTCAACGGGCAGTGGTTGTGCCAGAGCGAAGATGCCAGGAATTTATACAAGGGTTTCAAAATTCACTGATTGGATTGTATCCAGTGTCAATAGTGATAATGCTAGAAAAGGTTTAAGTTGGTACCTGAGGGGTGGTTAG

Protein sequence:

>DPOGS209724-PA
MERVCVFIVIVCCCELGLCYINLADVDCGLLNARSGRIVGGTNSLPAEFPWAASLWRQGTHQCGATIINNRWLVTAGHCVCSVFDEFYKSKQLTVVAGYTDISASDKNQRLSKIIPHPDYRCKKKTNDVALLKTEQQLVWTNELRPACLPRAKSSDFTGKSATVAGWGFTNEDRGIGERPNVLQKTEVTVVENGECNSWYESQGSKVRIIATQMCAGYKQGGRDSCWADSGGPLMLQGEKGHTMLIGVVSTGSGCARAKMPGIYTRVSKFTDWIVSSVNSDNARKGLSWYLRGG-