Monarch geneset OGS2.0

DPOGS205586
Transcript        DPOGS205586-TA  2883 bp
Protein           DPOGS205586-PA  960 aa
Genomic position  DPSCF300237 - 27054-41031
RNAseq coverage   140x (Rank: top 55%)
Annotation
Heliconius        HMEL011244              6e-152  48.19%
Bombyx            BGIBMGA009750-TA        2e-151  87.31%
Drosophila        CG34350-PA              7e-145  79.00%
EBI UniRef50      UniRef50_UPI0002063387  7e-149  59.35%  UPI0002063387 related cluster n=3 Tax=unknown RepID=UPI0002063387
NCBI RefSeq       XP_973911.2             6e-144  59.95%  PREDICTED: similar to serine proteinase stubble [Tribolium castaneum]
NCBI nr blastp    gi|328778359            3e-148  59.35%  PREDICTED: hypothetical protein LOC409827 [Apis mellifera]
NCBI nr blastx    gi|161076432            0.0     41.72%  CG34350, isoform A [Drosophila melanogaster]
Group
Gene Ontology     GO:0003824  4.1e-91  catalytic activity
                  GO:0004252  8e-88    serine-type endopeptidase activity
                  GO:0006508  8e-88    proteolysis
KEGG pathway 
InterPro domain   [707-959]  IPR009003  4.1e-91  Peptidase cysteine/serine, trypsin-like
                  [716-954]  IPR001254  8e-88    Peptidase S1/S6, chymotrypsin/Hap
                  [747-762]  IPR001314  7.7e-13  Peptidase S1A, chymotrypsin-type
Orthology group   MCL12722  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205586-TA
ATGGGTCCTCCCCTCAACACCTGGCCCTACAGCCACCATCGAGAACTTAAAAGCGCTAGTGATAAAAGTAGTTTAGTTAATATTGTAGACAATTTACGTAATTTAAGTATAAATTTTACTTTTATATTTATTTTTATACAAGTGTTTATAAATCTGAGTGCGGTTAGCGCGGGACCCGTGATACTCAGCTTGGACCATTTGCGCGGCTCGGTGCCAATAGCTAGAAACATAAGGCATTTACCATGCATATCAAGGAAAACTGCTCAAGAGGGACTCTGCATGTTCGCGATAGACTGTCTTAAAGCAAACGGAACTCATTTGGGAACATGTATAGATAGGTTTTACTTCGGTTCCTGCTGCCAGCTGACAGATAAATCTGCTATACCAAATATAGCTGCGAACAATATTGAAGATAACGCCATAGACGGCGCTAATTTCGTACATCCGCTCATAGATCACAAAATCCATAGTCAAACGAGTAAAAAACCCGATAAAATAAACGTGAACACAGATAAAGAGAATGCAAAACCAATACAAGACGATATCGCTACTAAGAAACCGTCAACAGTTCAGGATGTGACTAGCTCTGACAAAATGCAGTCGAAGTCTGATGAAACGGTATCTATTAACCATATAACAAGTGATATTAGGACTACAGAAGCTGTCACAGCTGTCAAAGAAGCTACTACACAATCTATGAAAGTATCCGACAATGTAGTGACCGAGATTCCTGTTAAACTGTCCACATTCCAAACAGTATCCGCTGCTGGTGACACAGCAACTGCAGCTCCGGAAGCCAAACCCACAAAACCACAAACACCAGAAGAACCAGTCAAACCAACAAGGAAACCGGTGAAACCAACGTATAAACCTAGACCATATAGACCCACGAATTTCACGAGACCTCCAATAAGTCCTAAACCGAAACCGACGAAGCCTGTCGCATTATTCAATACAACAAGGAAGCCGCCTTACCGACCGCCCCCGAAACGTAACTCAACCAAAAAACCCCTGCCATCTCCACCGAGACTTAATATAACCATCATACCTCAATCCACGAGTTCACGACCGACATTTACAAGACCTTCCTCTAGTGTCATTACATATATCAACTCAACATCTCCTAAAGTTGAAGAGCCTGCAGAAACAACCATAAAAACAACCCTATCAACAGATACAACAACATTAAAGACCACAACTACGACAACACCTCCTCCACCACCACCAACAACAACAATTGCATCTACTACTACATCTACAACTACAACTACAACAACTACTACTACTACCACCACAACCCCCCCACCATCCACAACAACTACACCAATACCAACAACAATGATCATAACAACAGCAGAATCAATCCCAACTGCAACAGAACGCGCCACTATAACAACAGAAAATATCGTAACCGAAACTTTACCTATAGAACCATCAACGGAGAAAGCAACAGAGATACCAACAGAAAAAAATACCGAGCGAATTACTACAGAACTCTTACCAGAACCAACAGAAGAGAAGGAGAAAGAAACTGTCACTGAAAACGTCACGACAGTTGTCACTGAAAAGGTGACATTACAGGATGTCATTGAAAAAGATACCGTAAAACCAGCCACTGAAGGTGATAATCCTGTGACAACGAAGCCTACCACTGACTACCCTCCCTTTGTAACTTGGACCAACGAGGCAAGTTCAAAAGCACCGGCTACTGTCAGCGACGACTGGTCACCAATCACACCTCCTGACGGCTGGGTCTTAATATCTACCATGTCTCCCAAACCGGAAACAACAGTGAAACCACAAACAACAGAAACTGAAACAACACTAAAACCAACTTCGGTTCTAACTGAAGCGACTTCAATTTTAACATCAACTTCAACCACGGCCTCGCCAACTTCAGAAATTGAGTTTGTTGTGAACGTGACATTGTCTCCTACAACACCCACTCCCACCTCGAGCATGGCGCCAACAACAAATGTCACCTCGGACGAAACACAAAC
AACAACAACAACAACACTAGCGGCTCTGACTACTATCGCGAACGTGACAACCACAGAGGCGACAACCACTATAACAACGACCACAGAATCTTACAATATGTCGAATTACAAAGAAGTATGCGGTAGGCGCATGTGGCCTCAGGCGAGGATCGTTGGTGGGGCGAAGTCCGGCTTCGGGCAGTGGCCCTGGCAGATATCGCTCCGACAGTACAGGACTTCGACCTACCTTCATAAGTGTGGGGCCGCTTTATTGAACGAGAACTGGGCGATCACTGCCGCTCATTGTGTTGACAGGGTTCCTCCATCGGAGTTGTTGGTGCGTCTCGGTGAATATGATCTCGCGAACGAGGACGAGCCCTACGGCTTCGCTGAGAGACGAGTGCAGATAGTAGCCAGCCATCCTCACTTCGATCCGGCTACCTTTGAATATGATCTAGCTTTACTGAGGTTCTACGAGCCGGTTACATTCCAGCCGAACATTCTTCCTGTGTGTGTCCCTGATGATGACGATTCTTACGTCGGACGAACAGCCTACGTCACGGGCTGGGGACGTCTCTATGATGAGGGTCCCCTCCCGAGTGTGTTGCAGGAGGTGGAGGTGCCTGTGATCAATAACACAGCCTGTGAGAGCATGTACCTCGCGGCTGGTTACAACGAGCACATACCGAACATATTCATTTGTGCCGGATGGAAGAAGGGAGGCTCGGACAGCTGTGAAGGCGACAGTGGTGGACCGATGGTGGTTCAGAGAGCGAAAGACGATCGCTTCGTACTGAGCGGAGTTATCTCGTGGGGTATCGGATGTGCGGAACCCAACCAGCCCGGGGTCTACACAAGGATATCCGAGTTCAGGGATTGGATCAACCAGATACTACGCTTCTAA

Protein sequence:

>DPOGS205586-PA
MGPPLNTWPYSHHRELKSASDKSSLVNIVDNLRNLSINFTFIFIFIQVFINLSAVSAGPVILSLDHLRGSVPIARNIRHLPCISRKTAQEGLCMFAIDCLKANGTHLGTCIDRFYFGSCCQLTDKSAIPNIAANNIEDNAIDGANFVHPLIDHKIHSQTSKKPDKINVNTDKENAKPIQDDIATKKPSTVQDVTSSDKMQSKSDETVSINHITSDIRTTEAVTAVKEATTQSMKVSDNVVTEIPVKLSTFQTVSAAGDTATAAPEAKPTKPQTPEEPVKPTRKPVKPTYKPRPYRPTNFTRPPISPKPKPTKPVALFNTTRKPPYRPPPKRNSTKKPLPSPPRLNITIIPQSTSSRPTFTRPSSSVITYINSTSPKVEEPAETTIKTTLSTDTTTLKTTTTTTPPPPPPTTTIASTTTSTTTTTTTTTTTTTTPPPSTTTTPIPTTMIITTAESIPTATERATITTENIVTETLPIEPSTEKATEIPTEKNTERITTELLPEPTEEKEKETVTENVTTVVTEKVTLQDVIEKDTVKPATEGDNPVTTKPTTDYPPFVTWTNEASSKAPATVSDDWSPITPPDGWVLISTMSPKPETTVKPQTTETETTLKPTSVLTEATSILTSTSTTASPTSEIEFVVNVTLSPTTPTPTSSMAPTTNVTSDETQTTTTTTLAALTTIANVTTTEATTTITTTTESYNMSNYKEVCGRRMWPQARIVGGAKSGFGQWPWQISLRQYRTSTYLHKCGAALLNENWAITAAHCVDRVPPSELLVRLGEYDLANEDEPYGFAERRVQIVASHPHFDPATFEYDLALLRFYEPVTFQPNILPVCVPDDDDSYVGRTAYVTGWGRLYDEGPLPSVLQEVEVPVINNTACESMYLAAGYNEHIPNIFICAGWKKGGSDSCEGDSGGPMVVQRAKDDRFVLSGVISWGIGCAEPNQPGVYTRISEFRDWINQILRF-
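
The transcript and protein entries in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes the transcript is a pure coding sequence (start codon through stop codon, no UTRs) translated with the standard genetic code; the helper names `translate` and `lengths_consistent` are hypothetical. It translates the first 30 and last 12 bases of DPOGS205586-TA and checks that 2883 bp corresponds to 960 codons plus one stop codon.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 30 and last 12 bases of DPOGS205586-TA, copied from the nucleotide sequence.
head = "ATGGGTCCTCCCCTCAACACCTGGCCCTAC"
tail = "CTACGCTTCTAA"

print(translate(head))                 # MGPPLNTWPY
print(translate(tail))                 # LRF*
print(lengths_consistent(2883, 960))   # True
```

The translated ends agree with the start (MGPPLNTWPY) and end (LRF) of DPOGS205586-PA; the record writes the terminal stop as "-" where this sketch prints "*".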