Monarch geneset OGS2.0

DPOGS208169
Transcript: DPOGS208169-TA (1164 bp)
Protein: DPOGS208169-PA (387 aa)
Genomic position: DPSCF300207 - 222587-226498
RNAseq coverage: 1516x (Rank: top 8%)
Annotation
Source          Hit                E-value  Identity  Description
Heliconius      HMEL015713         0.0      78.59%
Bombyx          BGIBMGA010257-TA   6e-168   69.53%
Drosophila      CG9372-PA          5e-92    43.75%
EBI UniRef50    UniRef50_Q589Y5    9e-151   65.41%    Serine protease n=5 Tax=Obtectomera RepID=Q589Y5_BOMMO
NCBI RefSeq     NP_001040415.1     6e-165   71.82%    clip domain serine protease 3 [Bombyx mori]
NCBI nr blastp  gi|198041261       2e-164   72.09%    hemocyte protease-1 [Bombyx mori]
NCBI nr blastx  gi|198041261       4e-168   70.39%    hemocyte protease-1 [Bombyx mori]
Group:
Gene Ontology:
  GO:0003824  1.2e-86  catalytic activity
  GO:0004252  5e-84    serine-type endopeptidase activity
  GO:0006508  5e-84    proteolysis
KEGG pathway:
InterPro domain:
  [144-384]  IPR009003  1.2e-86  Peptidase cysteine/serine, trypsin-like
  [153-381]  IPR001254  5e-84    Peptidase S1/S6, chymotrypsin/Hap
  [181-196]  IPR001314  4.6e-14  Peptidase S1A, chymotrypsin-type
  [61-106]   IPR006604  3.4e-07  Disulphide knot CLIP
Orthology group: MCL15938 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208169-TA
ATGTTGTGTCTACTTATTTTGACGTTCCTTGTGACATTTTCCAATGGACTTCTAAGCTCGGAGCTGACTGAGCCAGAATGGTTGGACGTAGTTTCCGCTGGGGGAATGCATATAGGCCATGGTCGTACTAAGAGATTCGTAGAGCTTAACGAAAATCAGCCAAACATACCGCATCAGGCATGCCTACTACCGAGTGGCAGAGCGGGACATTGTCGCCATTTACATTACTGTATCCAAGAAGACTTTAAGAGGGACTTCATGAAATTCATGGACTATCTTTGTATTATACAGCATTCATCTATCGGTGTGTGTTGTCCTGATGATCGTACCCCGGATGCTATAGACGCGGTCGCCGGAGACTTACCAGCCACAGCACCTAGAGACGAAAACGAAGTCACTCTGAAGATAGATCGCGCTGAGAATAGAGGCTGTGGGTTGAGCACTCGCGCGCAGGCCCGAGTGACTGGAGCCAGTCCTGCCAATCCACGGGAATGGCCTTGGATGGCTTCCGTGACTCCCGAGGGAAGGGACCAGTGGTGCGGAGGTTCACTCATAACCGATCGACATGTACTCAGTGCGGCGCATTGCACTTATGGGTACGAACCCAGCGAATTGTTCGTCAGGCTCGGAGAATACGACTTTAAGAGAACCAACGATTCCCGTTCATATAACTTTAGGGTGATCGAGAAGAGGGAACATGAAATGTTTGACAGCGCTACCTACCACCACGACGTCGTCATACTTAAATTACACAGAGCAGCGGTGTTCAATACATATGTGTGGCCAATATGTCTCCCTCCCCGGGGATTGGAGTTGGACAACGAAATCGCTACAGTGATCGGTTGGGGCACTCAATGGTACGGGGGTCCAGCGAGTCACGTCCTCATGGAAGTTTCTGTACCTATATGGACGAGAGAGAAATGCACCCCAGCGTTTAGTGATTCAGTCTTCAACGAAACGTTATGTGCCGGCGGTCCGAATGGAGGGAAGGATGCTTGTCAGGGTGACAGCGGTGGTCCTCTGATGTACCAGATGTCTAGTGGTCGTTGGACAGTGGTAGGGGTGGTGTCGTGGGGTTTACGCTGCGGCGAGGCCGAGCACCCTGGTCTATATGCCCGCGTCGATAGATACCTCGAGTGGATACTAAGGAATTCTATCTTCTAG

Protein sequence:

>DPOGS208169-PA
MLCLLILTFLVTFSNGLLSSELTEPEWLDVVSAGGMHIGHGRTKRFVELNENQPNIPHQACLLPSGRAGHCRHLHYCIQEDFKRDFMKFMDYLCIIQHSSIGVCCPDDRTPDAIDAVAGDLPATAPRDENEVTLKIDRAENRGCGLSTRAQARVTGASPANPREWPWMASVTPEGRDQWCGGSLITDRHVLSAAHCTYGYEPSELFVRLGEYDFKRTNDSRSYNFRVIEKREHEMFDSATYHHDVVILKLHRAAVFNTYVWPICLPPRGLELDNEIATVIGWGTQWYGGPASHVLMEVSVPIWTREKCTPAFSDSVFNETLCAGGPNGGKDACQGDSGGPLMYQMSSGRWTVVGVVSWGLRCGEAEHPGLYARVDRYLEWILRNSIF-
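
The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the transcript is coding sequence only, running from the ATG start codon to the TAG stop codon with no untranslated regions, as the nucleotide sequence above suggests. The function name `expected_protein_length` is hypothetical and is not part of any MonarchBase tooling.

```python
def expected_protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes the sequence is CDS only (no UTRs). If the CDS ends in a
    stop codon, that codon does not contribute an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Transcript DPOGS208169-TA is listed as 1164 bp: 388 codons, one of them the stop.
print(expected_protein_length(1164))  # prints 387
```

The result agrees with the 387 aa listed for DPOGS208169-PA; the trailing `-` in the protein sequence marks the stop codon.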