Monarch geneset OGS2.0

DPOGS215100
TranscriptDPOGS215100-TA1302 bp
ProteinDPOGS215100-PA433 aa
Genomic positionDPSCF300139 - 327594-334562
RNAseq coverage778x (Rank: top 17%)
Annotation
HeliconiusHMEL0142334e-12451.84% 
BombyxBGIBMGA009610-TA2e-4029.61% 
DrosophilaCG3700-PB2e-3537.02% 
EBI UniRef50UniRef50_Q5TNC51e-4231.10%AGAP008835-PA n=2 Tax=Anopheles RepID=Q5TNC5_ANOGA
NCBI RefSeqXP_001655815.16e-4530.95%serine protease [Aedes aegypti]
NCBI nr blastpgi|1571311281e-4330.95%serine protease [Aedes aegypti]
NCBI nr blastxgi|3838480361e-4334.80%PREDICTED: serine protease snake-like [Megachile rotundata]
Group
Gene OntologyGO:00038249e-71catalytic activity
GO:00042523.3e-51serine-type endopeptidase activity
GO:00065083.3e-51proteolysis
KEGG pathway 
InterPro domain[174-425] IPR0090039e-71Peptidase cysteine/serine, trypsin-like
[182-420] IPR0012543.3e-51Peptidase S1/S6, chymotrypsin/Hap
[217-232] IPR0013141.5e-12Peptidase S1A, chymotrypsin-type
Orthology groupMCL33381 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215100-TA
ATGTTAAAATTATATATATTTCTATGTGCCTGTTGTATGTGTGTCTGTGAAAACATACCCTTTCTGGAAATCGGCGACCCATGCTTCATCAAGAACGTAAGCGGAGTGTGCAGAACGATAAGCGAATGCCGATACGCCAAGAAATTAGTGAGAGAAGACCAGGTCAAACCTCCTACATGTAAATTTCAAGGTGATCAAAGAATAGTCTGCTGTCCTGAGACCGACCTGTTCCATGAAACGGGAGTCTTCAAGCATCACTTCATTGGGCTTGCCAAAGGAGTCAAGAAATCTAAGTACATGACCTGTCGCTACGATGGCTACCAGCCCTTGCAATGTTGTGAGAACGCTAAACCCGTCACCATTCCACCAGAACCGGCAACTTGTCCAAGCCTCCCAAGACCCCTGCTGGCCAAAAACCACATCGCCTGGACTAAATGCGTTGACTACCAGCGTTACATCCACAAGTGCGTGCCAGTTGATCCAATCAACCAACCATACAAAATGCAAAGGGTAAACACTTGCGGCATCAGCAACTCCAATTTTAGGATATCCGGTGGCGTTGAAGCTAAACCCAGAGAGTTTCCGTTCATGGCCGTCATCGGCTGCCACAATTCCCTGGACGTGGACGCCGACATCAAGTGGGTAGGCGGAGGCTCGCTGATCAGTGAGAAGTTTATACTCACGGCTACTCACATATTGAGTGAACCGACTTATGGCCGCGTACGGTACGCCTTGCTTGGCACTTTGAATAAGACAGACATAAGGTCCGGAGTCCTTTACAATATCGTGTCTATGATCGCGCACCCTGAATACGACATTCCCGTTAAAGCGAATGACATAGCGCTCCTGGAGCTAGACAGACAGGTCTTTTTCAATGAATTCATTCACCCCGTCTGTCTCCCGGTGCCGGGCAGATATATTACAAATGACTATATTGTTGCCGGTTGGGGCGAAAACAACAACAGATACAGCAGTGACGTGCTGTTAACTGCGAGACTGCGACCCAGCGATGAATGCAAGAGCAGAATAGTAAGAAAAGACTTCGTTTATTCGAATGAGAAGTATATCTGTGCTAAAGGAGAGCTGGAAAAGGGCGTCTATCAAGACACCTGCAAGGGCGACAGCGGAGGCCCGTTGTTGGCTCTGATGTTTAATATAAACTGCTCCTACTCCTTGGAGGGTATCGTCAGTTTTGGACCCGAATGCGGCAAAGGCTTTCCAGCGGTTTACACCAAAGTATCCAACTATTTGGATTGGATAGTTGAAAACGTATGGCCCGATAAGGTCAACAAAAAGCAATAA

Protein sequence:

>DPOGS215100-PA
MLKLYIFLCACCMCVCENIPFLEIGDPCFIKNVSGVCRTISECRYAKKLVREDQVKPPTCKFQGDQRIVCCPETDLFHETGVFKHHFIGLAKGVKKSKYMTCRYDGYQPLQCCENAKPVTIPPEPATCPSLPRPLLAKNHIAWTKCVDYQRYIHKCVPVDPINQPYKMQRVNTCGISNSNFRISGGVEAKPREFPFMAVIGCHNSLDVDADIKWVGGGSLISEKFILTATHILSEPTYGRVRYALLGTLNKTDIRSGVLYNIVSMIAHPEYDIPVKANDIALLELDRQVFFNEFIHPVCLPVPGRYITNDYIVAGWGENNNRYSSDVLLTARLRPSDECKSRIVRKDFVYSNEKYICAKGELEKGVYQDTCKGDSGGPLLALMFNINCSYSLEGIVSFGPECGKGFPAVYTKVSNYLDWIVENVWPDKVNKKQ-