Monarch geneset OGS2.0

DPOGS205208
Transcript: DPOGS205208-TA (1620 bp)
Protein: DPOGS205208-PA (539 aa)
Genomic position: DPSCF300265 - 166495-205039
RNAseq coverage: 620x (Rank: top 21%)
Annotation
Heliconius: HMEL013468; E-value 7e-67; identity 39.52%
Bombyx: BGIBMGA014404-TA; E-value 5e-61; identity 37.91%
Drosophila: CG5390-PA; E-value 1e-54; identity 40.46%
EBI UniRef50: UniRef50_B7SVM2; E-value 3e-70; identity 37.91%; Serine proteinase-like protein 1 n=5 Tax=Obtectomera RepID=B7SVM2_HELAM
NCBI RefSeq: NP_001155060.1; E-value 4e-58; identity 35.09%; serine protease homolog 21 [Nasonia vitripennis]
NCBI nr blastp: gi|242351233; E-value 1e-65; identity 39.84%; serine proteinase-like protein 1b [Manduca sexta]
NCBI nr blastx: gi|208972549; E-value 1e-70; identity 38.00%; serine proteinase-like protein 1 [Helicoverpa armigera]
Group:
Gene Ontology:
  GO:0003824; E-value 5.3e-55; catalytic activity
  GO:0004252; E-value 2.1e-33; serine-type endopeptidase activity
  GO:0006508; E-value 2.1e-33; proteolysis
KEGG pathway:
InterPro domain:
  [260-528] IPR009003; E-value 5.3e-55; Peptidase cysteine/serine, trypsin-like
  [284-523] IPR001254; E-value 2.1e-33; Peptidase S1/S6, chymotrypsin/Hap
  [313-328] IPR001314; E-value 4.8e-07; Peptidase S1A, chymotrypsin-type
Orthology group: MCL23320 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205208-TA
ATGGAAACGGCAACGGAAGTTCAACGCGAACGGATGTACGGGCAATTAACGGATGCGGTTGATCTGTCTATAGATAAAGATATCTATAGACAGATGTTAAAACCAAGGCCAGAGATAAACAGAGAGAGTCAGGAAAGAGAGAGAGTCGATATCCAGCATGCATGGCATGACGCTCAGAAGCGATACTCAGCCACCAACCACGCCCGCGAGCTATTAGTGATATTCCCACGGCACTCTCCTTTGCGTGAAAGAAATTCAAATGGGAACAGAAAAAATGTTAACTTGAATTCAGACAGCGAACTCAATTTGGACCCTATCCCAAAACCTTTCGTGACACGTCAGAAAATTCAACCGACTACAGATAAGGAAGAAAGTGAAATAAATTTGGATCCAGATGCGAAAGTCATAGCTGAATACTTCTCCTCCGGGCAGTCTACAGCCAGTCAATCACACACGGTCGCTGCAGACGGCCTCAGTAATAAACGTGACTCACCAAGTAACGCGACAGATTCCTCAGACGATGTTAATTTGGATCAGTCGGATATAAATATAAATTGCACGAGGGACGATGGCACGCCGGGTGTGTGTGTCTTGTATTATCAATGCAACACCGACGACGGTCAAGTTATCACTGATGGCAACTCGCTCATAGATGTCAGGCTAAAAGACGGCGCCTGTTCACACTACCTACAAGTCTGCTGCGTGTTGAACGTAGTCCAGGAGAAACCTGTCATATTGGAAGATAAAGTGCCTGATGTTCCCGCAAAATCCAAAAAATGTGGTTGGAATAATCCAGCATTAAATATATTCCAAACGAGAGAAACAGCTGGTCTGGATGACGGCATTTATGCCAATTACGGCGACTTTCCCTGGATGATAGCCGTTATTAGAAATCGAAACGAGACTGAAGAATGGTCGAAAAATGACTACCTGGGTGGCGGATTCCTGATACATCCAGCTGTAGTAGTCACAGCAGCTCATAAAGTGGAGCAGTACAATCCTCACCAGAAACTTTTTAATGGTTCCTCTTTGTTAAGATCAAATGCCGTGCCGGGGAGTGGGACACTCAGACCAACTCATAATTTTCGCATGTTTTCATCAATAGCATCGCTGTACAACGATATTGCTGTGCTTTTCCTGAAGAGCCCATTCTCACTGAACGGATCACCCAATATAAACGTCGCCTGCGTCGGTGCCAGCATGCCTCCGCCAGGGACCTTGTGTTTCAGTATGGGATGGGGCGCAAATTTCAAGAAGAAAAATGTTTACGCAGTCATACTTAAAAAAGTACGTTTGCCGCTCGTGGACCCTGAGAGATGCGAAACACTTTTGCGCAAAACACGTCTGGGACCTTTCTTCAGGCTGGACAAGTCGCTGACGTGTGCTGGAGGTGAAGACGGCGTGGACACGTGTCGTGGAGACGGAGGGTCCTCTCTGGTGTGTCCTGTACAGATGGCTGACGAGAGCACACGTTATGAAGTATTTGGGATGGTTGCGTACGGGATTGGTTGCGGGAGTAAGGATGTGCCAAGTGTTTACGTCAACATTCCCTATTTGAAGCCCTGGCTTGACGCTATTATTGCTAGTGAAGGGCTCTCCACGGAAACATATATCTCATAA

Protein sequence:

>DPOGS205208-PA
METATEVQRERMYGQLTDAVDLSIDKDIYRQMLKPRPEINRESQERERVDIQHAWHDAQKRYSATNHARELLVIFPRHSPLRERNSNGNRKNVNLNSDSELNLDPIPKPFVTRQKIQPTTDKEESEINLDPDAKVIAEYFSSGQSTASQSHTVAADGLSNKRDSPSNATDSSDDVNLDQSDININCTRDDGTPGVCVLYYQCNTDDGQVITDGNSLIDVRLKDGACSHYLQVCCVLNVVQEKPVILEDKVPDVPAKSKKCGWNNPALNIFQTRETAGLDDGIYANYGDFPWMIAVIRNRNETEEWSKNDYLGGGFLIHPAVVVTAAHKVEQYNPHQKLFNGSSLLRSNAVPGSGTLRPTHNFRMFSSIASLYNDIAVLFLKSPFSLNGSPNINVACVGASMPPPGTLCFSMGWGANFKKKNVYAVILKKVRLPLVDPERCETLLRKTRLGPFFRLDKSLTCAGGEDGVDTCRGDGGSSLVCPVQMADESTRYEVFGMVAYGIGCGSKDVPSVYVNIPYLKPWLDAIIASEGLSTETYIS-