Monarch geneset OGS2.0

DPOGS214906
TranscriptDPOGS214906-TA1380 bp
ProteinDPOGS214906-PA459 aa
Genomic positionDPSCF300135 + 360144-364128
RNAseq coverage103x (Rank: top 61%)
Annotation
HeliconiusHMEL0171888e-7255.30% 
BombyxBGIBMGA003025-TA2e-4943.59% 
DrosophilaepsilonTry-PA3e-2636.70% 
EBI UniRef50UniRef50_G3I5K34e-3326.32%Transmembrane protease, serine 11E n=6 Tax=Amniota RepID=G3I5K3_CRIGR
NCBI RefSeqXP_002081034.13e-4332.26%GD25907 [Drosophila simulans]
NCBI nr blastpgi|1955824365e-4232.26%GD25907 [Drosophila simulans]
NCBI nr blastxgi|1955824362e-4732.26%GD25907 [Drosophila simulans]
Group
Gene OntologyGO:00038243.8e-61catalytic activity
GO:00042525.1e-42serine-type endopeptidase activity
GO:00065085.1e-42proteolysis
KEGG pathway 
InterPro domain[210-446] IPR0090033.8e-61Peptidase cysteine/serine, trypsin-like
[222-443] IPR0012545.1e-42Peptidase S1/S6, chymotrypsin/Hap
[30-45] IPR0013141.5e-09Peptidase S1A, chymotrypsin-type
Orthology groupMCL26809 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214906-TA
ATGGAAAAGAGGATAGTCGGTGGCTCTGAAGCGCATCTCGACCAGTACCCTTACAATGTACAGTTTTATAATCTCGGCGGACTCTGCGGCGGTTCCATTCTCACTAAGAAAATAGTACTAACAGCCGCCCACTGTTTCGACTCAAACAAAAATTTGGCAGATATGAAAATATTTTCAAAACCGCGTTACCTGTTCGATCTCCACGCAAATAGTCATGATATATGGGATTTTCGTATTCACGAGGACTATGGGATGGAATCTGTTTTCGATAACGACATCGCCGTTATTATAATTCACAACGAGTTCGTCTTCGGTAAACAAGTTCGAAATATTAAAATAATTAATAATCAGCATTGGGAGAAAGAAAACGAGACCTTCACTGTCACTGGATGGGGGGAAACTAAGTATGGGGATGGCATTGATAAAACGGTTTTTAGACAGGTTCAACTGCGTTACATTAGCAAAGAGATGTGTATGAAAATGAACCAAATCACTCTGGACACTGAGATGTTTTGTCTTTATGGGGATGGAAAGAGAGATTCCTGTAGAGGTGACTCAGGAGGCGGGGTGCTGTGGAAGGGACAATTGGTTGGTATCGTGTCCCACGGCAAGGGATGTGGTGGTTTGCCGGGTACGAGGGATGTGTTAAATGAAGTCGAAGGCAGAATAATTGGCGGTCGACCGATACCTATAGAGATCTCGCCATTCTCGGTGCTCTTTTTTAATTTCAAATCTCTCTGTTCCGGTGTCATCATTAACAGCTTTGTTATACTTACGGCAGCGCACTGCTTTAATTCAAACACTAACAAACATCACATGCGCGTTGAAATAAATTGTCGTTATTTGTTTGATATCAATGCTGAAACATTTAGAGTTGCCGATTTTATTATACACGAAGATTTTAATAAAGTTATACCTTTTGAAGCCGACCTTGCGTTGATTATGGTTACGGCCAAAATGCTTCTGGATAGACGGAAGCAGAAAGCTTTATTAATGAAAAACGATGACTGGATGAACGAGAAGGCTAACGTCAGCGCCTCTGGTTGGGGTTGGACCAAGTACGGCGGCGGAGTATCCGAGCTGGGTTTAATGCGGACGTATTTGGGGTACGTGTCTAGACGGCGATGCGAGAGTCTGCATAGTTTGCGGCTCACGGAGGACATGTTCTGTTTGTACGGGAATGGGGTGAGGGATACTTGTCTGGGAGACTCCGGGGGAGGCGTTACTGTGAATGGAACGGTGGTAGGCATCGTATCTCATGGAGATGGTTGTGCAAAAAAAGGTAAGCCGAGCGTGTATATAAGCGTTGCTTATCACAGAAAATGGATCGAAAAGAAAAATCTCGAGTTATTAAGAAAAAATTGCTTTAAAACTTTATAA

Protein sequence:

>DPOGS214906-PA
MEKRIVGGSEAHLDQYPYNVQFYNLGGLCGGSILTKKIVLTAAHCFDSNKNLADMKIFSKPRYLFDLHANSHDIWDFRIHEDYGMESVFDNDIAVIIIHNEFVFGKQVRNIKIINNQHWEKENETFTVTGWGETKYGDGIDKTVFRQVQLRYISKEMCMKMNQITLDTEMFCLYGDGKRDSCRGDSGGGVLWKGQLVGIVSHGKGCGGLPGTRDVLNEVEGRIIGGRPIPIEISPFSVLFFNFKSLCSGVIINSFVILTAAHCFNSNTNKHHMRVEINCRYLFDINAETFRVADFIIHEDFNKVIPFEADLALIMVTAKMLLDRRKQKALLMKNDDWMNEKANVSASGWGWTKYGGGVSELGLMRTYLGYVSRRRCESLHSLRLTEDMFCLYGNGVRDTCLGDSGGGVTVNGTVVGIVSHGDGCAKKGKPSVYISVAYHRKWIEKKNLELLRKNCFKTL-