Monarch geneset OGS2.0

DPOGS211824
TranscriptDPOGS211824-TA1500 bp
ProteinDPOGS211824-PA499 aa
Genomic positionDPSCF300031 + 164734-174718
RNAseq coverage70x (Rank: top 66%)
Annotation
HeliconiusHMEL0133993e-6031.74% 
BombyxBGIBMGA008167-TA2e-11066.07% 
DrosophilaCG18493-PA4e-7033.63% 
EBI UniRef50UniRef50_E2BVA86e-8437.21%Putative serine protease K12H4.7 n=2 Tax=Formicidae RepID=E2BVA8_HARSA
NCBI RefSeqXP_623676.25e-8438.00%PREDICTED: similar to CG3734-PA [Apis mellifera]
NCBI nr blastpgi|3072000542e-8337.21%Putative serine protease K12H4.7 [Harpegnathos saltator]
NCBI nr blastxgi|3072000549e-8137.21%Putative serine protease K12H4.7 [Harpegnathos saltator]
Group
Gene OntologyGO:00065084.8e-121proteolysis
GO:00082364.8e-121serine-type peptidase activity
KEGG pathway 
InterPro domain[28-463] IPR0087584.8e-121Peptidase S28
Orthology groupMCL30553 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211824-TA
ATGACCAACATGTGGCTTCTCCTGTGCCTCGTAACCTCAGCTTTTTCCCTGAAAACCTTAGAACCTCCTCCCCCTGAAGCATCAGCCCGGAGCTCGACCAACATCACAGAGGGATGGCTTCCCGTCAGACTCAATCACTTCGATGCTTCCAATACAGACACCTTCCAAATGCGTTACTACTACAACTCTCAATTCTCTCGCGGTCCATACATCGTGATCTTCGTAGGAGGTGAGTGGTCTATATCTCCAGGGTGGGTGAGGAGTGGACTCGCGTACGAGCTGGCCGAGAGGATAGGGGCTGGTCTATTCTACACAGAGCACAGATACTACGGACTAACAAGACCCACCAACGGAACTACTGTAGCAGAGATGAGATATCTGAGTGTGGACCAGGCCCTCGGAGACTTGGCTCAGTTCATAGAGTATGTGAGGAGTGATGACTTTGAGGGTGGGAGGTTTAGGAACGCGAGAGTGGCTTTATTCGGTTGTTCGTACGCGGGGTCGATGGCCACGTGGATGAAGCTTGGCTACCCTCACCTGGTCAGGACGTCCCTGTCGGACAGCGGACCGCTACACGCCCAGCAAGACTTCCCAGAATACTTGGAAGTCATAGCGACGGCTCTCCGAGTCCAAGGCAGTCAACAATGTGTCGATGATATTGAAAGTGCGATGAAGAGGATCAATGAACTGATAGAAACTGAAGCTGGACTCGACACCGTGTCCACACTGTTTAATACATGTTCCCGACTCCGCCGATCTCATCTGGATCTCTCGACCTTCTTCTGGTACGGCATCACGGAGACCTTCGCGTACCTCGTTCAATACGCCACCCCTGGGGACATACCACGAGCCTGTGACCACATCACCAATAAAACATTAGGTGATCCTATCGAACGTCTTTCTTCCTGGGTGACCTCTCAACCCTACACCCAGCCCTGCATCGAGTCAAGGTACTTTGAGAAGGTGGCCTCCCACACCAACACCAGCTATGACTCACCGGACGCCACAATGCGTCTGTGGACTTATCAGACGTGCACAGAGTACGGATGGTATCAGACCACCACCAGCTCCAGACAGCCGTTCCTCAACACTGTTCCACTGGAATACTTCCATCAGATGTGCAAGGACTTCTTTAACGACAGTATCGACGAGAATCTTCTCCGTTCAGCCATCGTTAGAACCAACCGTCTGTTCGCCGGCCTAGAGCACCTTCCTGACGGGGTGCTGTCAGTGGGGGGAGGACATGACCCCTGGTCTCCTGTTGGACCTAACAAGACGCACGAGACTCATTTAGCCCCCGTGTACGTAGTAGATGGGGTGTCTCACTGCAGAGCTATCAGGCCCACGGGCAGCAGTGAGACCGAACAGCTGGAGATAACCAAACAGTCGAGTCTGTTATTCATGGAGGGGCTCATGACGGACACGAGGTCTTCCAGCGCTCCCTTGCTCGCGTCACGACTGCTAGTACTCGCTCTCGTCATACTCTATGCTACGTTTTAA

Protein sequence:

>DPOGS211824-PA
MTNMWLLLCLVTSAFSLKTLEPPPPEASARSSTNITEGWLPVRLNHFDASNTDTFQMRYYYNSQFSRGPYIVIFVGGEWSISPGWVRSGLAYELAERIGAGLFYTEHRYYGLTRPTNGTTVAEMRYLSVDQALGDLAQFIEYVRSDDFEGGRFRNARVALFGCSYAGSMATWMKLGYPHLVRTSLSDSGPLHAQQDFPEYLEVIATALRVQGSQQCVDDIESAMKRINELIETEAGLDTVSTLFNTCSRLRRSHLDLSTFFWYGITETFAYLVQYATPGDIPRACDHITNKTLGDPIERLSSWVTSQPYTQPCIESRYFEKVASHTNTSYDSPDATMRLWTYQTCTEYGWYQTTTSSRQPFLNTVPLEYFHQMCKDFFNDSIDENLLRSAIVRTNRLFAGLEHLPDGVLSVGGGHDPWSPVGPNKTHETHLAPVYVVDGVSHCRAIRPTGSSETEQLEITKQSSLLFMEGLMTDTRSSSAPLLASRLLVLALVILYATF-