Monarch geneset OGS2.0

DPOGS211355
Transcript: DPOGS211355-TA, 1089 bp
Protein: DPOGS211355-PA, 362 aa
Genomic position: DPSCF300173 - 620242-623843
RNAseq coverage: 90x (Rank: top 63%)
Annotation
Heliconius: HMEL004401 | 7e-168 | 86.67%
Bombyx: BGIBMGA008471-TA | 1e-157 | 83.28%
Drosophila: CG17572-PA | 1e-77 | 45.87%
EBI UniRef50: UniRef50_G6DLK9 | 0.0 | 100.00% | Putative uncharacterized protein n=2 Tax=Obtectomera RepID=G6DLK9_DANPL
NCBI RefSeq: XP_975692.2 | 4e-87 | 47.71% | PREDICTED: similar to CLIP-domain serine protease subfamily B (AGAP009263-PA) [Tribolium castaneum]
NCBI nr blastp: gi|364023597 | 2e-108 | 71.11% | seminal fluid protein CSSFP023 [Chilo suppressalis]
NCBI nr blastx: gi|364023597 | 6e-110 | 70.63% | seminal fluid protein CSSFP023 [Chilo suppressalis]
Group:
Gene Ontology: GO:0003824 | 5.1e-72 | catalytic activity
               GO:0004252 | 3.1e-50 | serine-type endopeptidase activity
               GO:0006508 | 3.1e-50 | proteolysis
KEGG pathway:
InterPro domain: [98-360] IPR009003 | 5.1e-72 | Peptidase cysteine/serine, trypsin-like
                 [112-356] IPR001254 | 3.1e-50 | Peptidase S1/S6, chymotrypsin/Hap
                 [146-161] IPR001314 | 7.8e-08 | Peptidase S1A, chymotrypsin-type
Orthology group: MCL17394 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211355-TA
ATGTTTAAACTGTATGCGTGTTTATGTTTAAGTATTTTAAAATTAACAGTTAACGCACAGTTTGGGACCGTATCCATAAGTGTCAATATGGATCCACGAGCAGATATTGATATGAGTACTCACAATCGCTGTCCAGTTAATATGTCGTGTGTCCCAATAAGTTCGTGTCCATTGTTAGAAGATCTATTAGATTTCTCGTGTTTTTCATCCGATCGGTATTTCCATCGTCTGAACTCGTTAACATGTGGCAATGTTAACAATGAAGACTATGTGTGCTGTCCGTCGTGCGAGTGCGGGCGAGTTTACGCACCAGGAACTGAATCCTGCGGGAAGAGCATGGTCCAAGGGATTGATTACAGCGGCATTGGAGCTCATCCCTGGGTTGCCAGAATAGGATTCGCAAATAAAGACACGGGCAACGTAAGATTTGCTTGCAGTGGCTCCATTATTGCGAAGCGGGTTATTTTGACAGCGGCGCATTGTGCTTTGGCGAAACCTGAAGGATACAAATTGTCTACGATAGTAGTCGGTGAGTGGGACATCAGTAGTAGTCCGGATTGCAGCGATTATTTCTGTGCTCCTGCCACACAAGCCATCAAAGTGGAGAGCGTGTCTGTGCATCCAGGATACGAACAGAAGATATTCAGACATGACATAGCGATGATTATATTAAAGGATGAGATAAAATTTTCTGTGACAGCTGCTCCGATCTGTTTGAATGATAAGCCGGAAGTGGTGATCAACGAACGCGCTTCGCTTGTCGGATGGGGGAAACTGTCCGGACAAAACAACTTGATTGGTCGCCAACAACAGTTAGAAGTACCGTTGGTGTCGCTGGAGATTTGTGAGAAGGTTTTTGGTGAATCCGTGCCTATTCATGAAGGGCAGCTTTGTGCGGGCGGCGAAGAGGGCAAGGACGCATGTTCGGGCTTTGGAGGAGCTCCTTTGATTCTTAATAGAGACGGCCAATTTGTACAGATTGGCATTGTATCCTTCGGGTCGGAGAACTGTGGCAGTGAAGGCATCCCCAGCGTGTACACAAACATCGCACATTATTATAGGTGGATTGTTGACAACATGCCTTCTTGA

Protein sequence:

>DPOGS211355-PA
MFKLYACLCLSILKLTVNAQFGTVSISVNMDPRADIDMSTHNRCPVNMSCVPISSCPLLEDLLDFSCFSSDRYFHRLNSLTCGNVNNEDYVCCPSCECGRVYAPGTESCGKSMVQGIDYSGIGAHPWVARIGFANKDTGNVRFACSGSIIAKRVILTAAHCALAKPEGYKLSTIVVGEWDISSSPDCSDYFCAPATQAIKVESVSVHPGYEQKIFRHDIAMIILKDEIKFSVTAAPICLNDKPEVVINERASLVGWGKLSGQNNLIGRQQQLEVPLVSLEICEKVFGESVPIHEGQLCAGGEEGKDACSGFGGAPLILNRDGQFVQIGIVSFGSENCGSEGIPSVYTNIAHYYRWIVDNMPS-
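
Consistency check (illustrative):

The transcript and protein records above can be checked against each other by translating the coding sequence. The following is a minimal Python sketch, not part of the OGS2.0 release. The function name `translate` and its `stop_symbol` parameter are hypothetical. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein record stands for the stop codon. Only the first 18 and last 15 nucleotides of DPOGS211355-TA are used here to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the record was translated with this table.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, codon by codon.

    Stop codons are written as `stop_symbol` (assumed to be "-" to match
    the protein record above). Codons containing characters other than
    A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 18 and last 15 nucleotides of DPOGS211355-TA, copied from the record.
first = translate("ATGTTTAAACTGTATGCG")
last = translate("AACATGCCTTCTTGA")
print(first)  # MFKLYA
print(last)   # NMPS-

# Length check: 1089 bp is 363 codons, i.e. 362 residues plus one stop codon.
expected_aa = 1089 // 3 - 1
print(expected_aa)  # 362
```

The two translated fragments match the start (MFKLYA) and end (NMPS-) of DPOGS211355-PA, and the length arithmetic agrees with the listed 362 aa. Passing the full nucleotide sequence to the same function would extend the check to the whole protein.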