Monarch geneset OGS2.0

DPOGS204148
TranscriptDPOGS204148-TA1008 bp
ProteinDPOGS204148-PA335 aa
Genomic positionDPSCF300034 - 1136771-1139676
RNAseq coverage307x (Rank: top 37%)
Annotation
HeliconiusHMEL0164814e-6750.00% 
BombyxBGIBMGA005173-TA1e-7146.62% 
Drosophilaea-PA2e-6239.53% 
EBI UniRef50UniRef50_Q8I9241e-8348.05%Prophenoloxidase activating factor 3 n=4 Tax=Obtectomera RepID=Q8I924_BOMMO
NCBI RefSeqNP_001036844.13e-8548.06%BzArgOEtase [Bombyx mori]
NCBI nr blastpgi|564183973e-8546.92%hemolymph proteinase 8 [Manduca sexta]
NCBI nr blastxgi|564183972e-8447.20%hemolymph proteinase 8 [Manduca sexta]
Group
Gene OntologyGO:00038241.5e-86catalytic activity
GO:00042521.6e-77serine-type endopeptidase activity
GO:00065081.6e-77proteolysis
KEGG pathway 
InterPro domain[70-334] IPR0090031.5e-86Peptidase cysteine/serine, trypsin-like
[79-329] IPR0012541.6e-77Peptidase S1/S6, chymotrypsin/Hap
[110-125] IPR0013145e-14Peptidase S1A, chymotrypsin-type
Orthology groupMCL14995 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204148-TA
ATGATTCTAGAAGATTGCGTTGACCTTTACGAAAGATTCAATGAAGGGACTTCCGCTATTTATATTAAGTTATTTCAAAAATTACAATGCGGTTTCAATGGTAACCAATCAAAGATTTGCTGTCCACCAAATTTTCTAACCTCTGTTGGTGATACGTCAAATATAAATCAAACAAATAAATTAAAGATCTTGCCTAACAACACTGTTTGTGGCATTGACACTAAAATTATAAATAGGATTTCCGGGGGGGAGGAAACTGAAATCGGTGAACATCCTTGGCTGGCGTTATTGAATTATGGTCCACCTTCGACTAATAGTTTTTATTGTAGTGGAGTCCTGATATCATCAAGATATGTCATGACCGCAGCACACTGCGTGAAGCGCACTTTGGAGGATGTCACAGTTTCTCAGGTGCGACTTGGTGAATGGGACTTGTTAAGGAATACGGACTGCTCGAAGAATTACTGCAGTTCTGATGCAATAGACGTCGATGTAGAAGAAATTGTGGTCCACGAGAACTTCATCATCGGAGATCCCTCATTTCACCATGATATTGCTCTTCTGAGATTAGCCCAAGATGTAACTTTCAGTGATTTCATCAGGCCGATCTGTCTTCCTATTGATACGGAAATAAGGGAAAATAATTTTGAACATTCAGTTCATGCGGAAATAGCGGGTTGGGGTCAAAATGAATACAGTTCATTTTCAGAGAAGAAACTCAAGGCTAAAGTTTCTGTCGTAAACTTAGAAACATGCAAAAAAGCATATGCCTATGGTAAGCACGTCATAACTAATAATCATATCTGTGCCGGTGGCGAGAGAGGCAAAGATATCTGTGATGGAGATTCTGGTGGTCCACTCATGGTTCAAGTTCAGGATAAGAGAATTTGGATGGCTGTTGGTGTGTCATCCTTTGGCCCAGCGACTTGTGGTGTAGAAGGATGGCCTAGCGTGTTCACCAGAGTGACGTCTTATGTACCCTGGATATTATCTAAGATACGACCTTGA

Protein sequence:

>DPOGS204148-PA
MILEDCVDLYERFNEGTSAIYIKLFQKLQCGFNGNQSKICCPPNFLTSVGDTSNINQTNKLKILPNNTVCGIDTKIINRISGGEETEIGEHPWLALLNYGPPSTNSFYCSGVLISSRYVMTAAHCVKRTLEDVTVSQVRLGEWDLLRNTDCSKNYCSSDAIDVDVEEIVVHENFIIGDPSFHHDIALLRLAQDVTFSDFIRPICLPIDTEIRENNFEHSVHAEIAGWGQNEYSSFSEKKLKAKVSVVNLETCKKAYAYGKHVITNNHICAGGERGKDICDGDSGGPLMVQVQDKRIWMAVGVSSFGPATCGVEGWPSVFTRVTSYVPWILSKIRP-