Monarch geneset OGS2.0

DPOGS206217
TranscriptDPOGS206217-TA1290 bp
ProteinDPOGS206217-PA429 aa
Genomic positionDPSCF300334 - 224030-227330
RNAseq coverage17x (Rank: top 81%)
Annotation
HeliconiusHMEL0112235e-14159.63% 
BombyxBGIBMGA009694-TA1e-13589.33% 
DrosophilaCG13744-PA1e-11054.32% 
EBI UniRef50UniRef50_A1Z7M52e-10854.32%CG13744 n=11 Tax=Diptera RepID=A1Z7M5_DROME
NCBI RefSeqXP_002089723.13e-11155.31%GE22661 [Drosophila yakuba]
NCBI nr blastpgi|3123762383e-11254.81%hypothetical protein AND_12968 [Anopheles darlingi]
NCBI nr blastxgi|3123762388e-11355.41%hypothetical protein AND_12968 [Anopheles darlingi]
Group
Gene OntologyGO:00038244.2e-85catalytic activity
GO:00042529.1e-72serine-type endopeptidase activity
GO:00065089.1e-72proteolysis
KEGG pathway 
InterPro domain[174-424] IPR0090034.2e-85Peptidase cysteine/serine, trypsin-like
[185-419] IPR0012549.1e-72Peptidase S1/S6, chymotrypsin/Hap
[211-226] IPR0013141.4e-13Peptidase S1A, chymotrypsin-type
Orthology groupMCL17134 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206217-TA
ATGTTCACGGTGAAGAGGCAAGCGAAGACATGGATACCGATACAGGACAGGAAAATAAAAAACTGTTTTATCACCAAAAAACTATCGAACAGTCCCAAGAATGTATTTAAAAGGCGAGCCTTTAAATGGCGGAAACCTAAATGCCAGACGGTTAACATAATCATACCTCTCATGATTTTGAACTTTGCTGGACACACCAGCTCGGAGAGTCTTAGTAACAGAGTCCTGGCCTCACTCCTGGGATACCCGACCACGTGTACTGTTGGTTCTCAAGTGCGAGCCTGTTCTCTGTCGCTGACTTGTTGGCTCCGCGGTGGTATCAGGGTGAAGGGTTGCGGAGGAAGCTGGTTGTTCTCATGCTGTTACATAGCCCGGGACAGCTATGACTATGATAACTCAATCCCCTCTTCCGACTGGAAATACAAAATACCGCCGAAGTTACGTCAAGTACCTCAGAGGAATGTGGTGCCAACTAACGTGTTCCGACGGAGAGTCGACGACGACATTAGTCAGATGGAGTGCGGCCTCTCCTCAAGTCGCATGCTCCAGAAGCGTATCATCGGCGGTCGGGAGGCCAGGGTCGCGGAGTTCCCCTGGCAGGCTCACGTCAGGATCTCAGAGTTCCAGTGCGGCGGAGTCTTAATATCTCGTTGGTACGTGGCGACGGCAGCTCACTGCGTGTCCCGAGCTCGTCCTAGGGATGTGGCCGTGTGGCTCGGAGCACTTGACACCACCTCTGGGGATAAAAGCGCGAGAAAAATTGGGGTCGTCCAGAAAATCCTCCACCCCCTCTTCCAGTTTCGCATGACCCAACCTGACCGGTACGACATAGCGTTGCTAAAACTCTCCCGACCTGTGACCTACACTAGTCACATCCTCCCGATCTGTCTGCCCGACGGAGATTTCGAACTCCGCGGCAAGTCAGGGGTCATCGCCGGCTGGGGCAAGACCGATACCAGCAACGGCCACACTGGCACTAACTTACTACGGTCCGCTACTGTACCGATTTTGAGCACCGAACAATGTATCAACTGGCACCAGAGTAAGCAGATCTCTGTTGAAATACATTCGGAGATGATCTGCGCCGGACATTCAGACGGACACCAAGATGCGTGTCTAGGTGACTCTGGAGGTCCCCTAATTGTGTTGGACAGGGGTCGTTACTACCTGGCCGGTATCACCTCGGCCGGGTTCGGCTGCGGCGTCGACCACCAGCCAGGGATCTATCACAACGTGCGGGTCACCGCTGGCTGGATCAGAGACGTCATCACCAGATATGGTGACCTCTAG

Protein sequence:

>DPOGS206217-PA
MFTVKRQAKTWIPIQDRKIKNCFITKKLSNSPKNVFKRRAFKWRKPKCQTVNIIIPLMILNFAGHTSSESLSNRVLASLLGYPTTCTVGSQVRACSLSLTCWLRGGIRVKGCGGSWLFSCCYIARDSYDYDNSIPSSDWKYKIPPKLRQVPQRNVVPTNVFRRRVDDDISQMECGLSSSRMLQKRIIGGREARVAEFPWQAHVRISEFQCGGVLISRWYVATAAHCVSRARPRDVAVWLGALDTTSGDKSARKIGVVQKILHPLFQFRMTQPDRYDIALLKLSRPVTYTSHILPICLPDGDFELRGKSGVIAGWGKTDTSNGHTGTNLLRSATVPILSTEQCINWHQSKQISVEIHSEMICAGHSDGHQDACLGDSGGPLIVLDRGRYYLAGITSAGFGCGVDHQPGIYHNVRVTAGWIRDVITRYGDL-