Monarch geneset OGS2.0

DPOGS205206
TranscriptDPOGS205206-TA948 bp
ProteinDPOGS205206-PA315 aa
Genomic positionDPSCF300265 - 224570-227880
RNAseq coverage3503x (Rank: top 4%)
Annotation
HeliconiusHMEL0147551e-9563.67% 
BombyxBGIBMGA014404-TA7e-11766.91% 
DrosophilaCG5390-PA2e-8450.32% 
EBI UniRef50UniRef50_B7SVM21e-12265.09%Serine proteinase-like protein 1 n=5 Tax=Obtectomera RepID=B7SVM2_HELAM
NCBI RefSeqNP_001037053.11e-12263.69%clip domain serine protease 11 [Bombyx mori]
NCBI nr blastpgi|2702981848e-12566.25%masquerade-like serine proteinase [Pieris rapae]
NCBI nr blastxgi|2702981841e-12766.46%masquerade-like serine proteinase [Pieris rapae]
Group
Gene OntologyGO:00038241.4e-81catalytic activity
GO:00042522.8e-72serine-type endopeptidase activity
GO:00065082.8e-72proteolysis
KEGG pathway 
InterPro domain[37-303] IPR0090031.4e-81Peptidase cysteine/serine, trypsin-like
[48-299] IPR0012542.8e-72Peptidase S1/S6, chymotrypsin/Hap
[90-105] IPR0013143.8e-09Peptidase S1A, chymotrypsin-type
Orthology groupMCL12309 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205206-TA
ATGAAGACCGGGCAATGCGAGTTTTATTTGGACGTGTGCTGTGAGCTACCGAACAAGAAGAATCCAGACGAGGTTATCACACCTCCGCCACCTCCAGCGCGTAAAGATTGCGGCTGGAGAAACCCGGACGGCGTTGGGTTCCGCATCACAGGAGATGACAACCATGAGACAAACTTTGGAGAGTTTCCTTGGATGATCGCCCTTTTGAAACGTGAACCAGTCGATCCAAACGATCCGAACAGTGAGACCCTAAACATATATCTTGGGGGCGGTTCTCTCATACATCCGAGCGTGGTTCTGACAGCAGCGCACTACGTCGACAAGCCTCAGAAGCTACGAGTGCGAGCCGGTGAATGGGACACGCAGACCAGGCAGGAGATATACCCCTACCAGGAAAGGGATGTGGCCAAGGTCAAAATTCACAAGGACTACAACAAACATACGTTGTTCTACGACGTAGCTCTGTTATTCTTGTCGGTGCCAATGCAGCTGGCGCCCAACGTGGGACTGGTTTGCCTTCCCGTGGAGAGACAGCTCCCGCGGGCCGGCACTAATTGCTTCGCCACCGGCTGGGGCAAAGACCAGTTCGGGAGGGATGGAAAATACCAGGTTATATTGAAAAAGAAAGAACTGCCGGTAGTTGATCGTAACGCTTGTCAGAAAGCTCTCCGTAAGACTCGTCTCGGTGGACTGTTCGAGCTGCACTCGTCCTTCATGTGTGCTGGAGGGCAGGGTTCAGACACGTGCACGGGCGACGGCGGCTCACCACTGGTCTGTCCAGTTGAGTACGAGAAGGATCGCTACGAGCAAGTGGGGATTGTATCCTGGGGTATCGGATGCGGTCAGGATGGTACTCCGGGGGTGTACACGGATGTCAGCAAGATGAGAGCCTGGATCGATGATAAAATTGTAGCAGAGGGTTACGAACCCAGGGCGTACGTGGCCTAG

Protein sequence:

>DPOGS205206-PA
MKTGQCEFYLDVCCELPNKKNPDEVITPPPPPARKDCGWRNPDGVGFRITGDDNHETNFGEFPWMIALLKREPVDPNDPNSETLNIYLGGGSLIHPSVVLTAAHYVDKPQKLRVRAGEWDTQTRQEIYPYQERDVAKVKIHKDYNKHTLFYDVALLFLSVPMQLAPNVGLVCLPVERQLPRAGTNCFATGWGKDQFGRDGKYQVILKKKELPVVDRNACQKALRKTRLGGLFELHSSFMCAGGQGSDTCTGDGGSPLVCPVEYEKDRYEQVGIVSWGIGCGQDGTPGVYTDVSKMRAWIDDKIVAEGYEPRAYVA-