Monarch geneset OGS2.0

DPOGS215619
TranscriptDPOGS215619-TA4851 bp
ProteinDPOGS215619-PA1616 aa
Genomic positionDPSCF300041 - 2144672-2153415
RNAseq coverage138x (Rank: top 55%)
Annotation
HeliconiusHMEL0059240.067.24% 
BombyxBGIBMGA003679-TA0.080.76% 
Drosophilandl-PA4e-5629.25% 
EBI UniRef50UniRef50_D6W6H70.042.08%Serine protease P54 n=4 Tax=Coelomata RepID=D6W6H7_TRICA
NCBI RefSeqXP_002430856.13e-15642.44%Acrosin precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700151220.042.08%serine protease P54 [Tribolium castaneum]
NCBI nr blastxgi|2700151220.036.96%serine protease P54 [Tribolium castaneum]
Group
Gene OntologyGO:00038245.5e-88catalytic activity
GO:00042528.9e-86serine-type endopeptidase activity
GO:00065088.9e-86proteolysis
GO:00055152.2e-10protein binding
KEGG pathway 
InterPro domain[1271-1531] IPR0090035.5e-88Peptidase cysteine/serine, trypsin-like
[1294-1526] IPR0012548.9e-86Peptidase S1/S6, chymotrypsin/Hap
[1113-1163] IPR0021722.2e-10Low-density lipoprotein (LDL) receptor class A repeat
Orthology groupMCL17763 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215619-TA
ATGAGCAGCTTCCACAGGAGTGAATATAGGTTCGCTGGTGACTATGGCACCGGGTATAGACGGGCTGAAAGCAGTCGGGGCGGTGGTATGTGTTCTGCTGCTTTGGTCGGAGGAGCGCTATTGGCTGCAGTCGCGGTTCTGGCAGTCGCCGCATTAGCTTTCTACATGGGTGCCTTAAGGCCTGATAATGGAGAACCAATAATGACATTTGAGGGAACATTTCGCGTTACTCGCGGGGACGTTTACGGTGGTGTCCCAGGAAGTCCGTCCTGGCGAGAGCGTGCTCGCAGATACAGCGCGTCCCTTAAACAGGTGTACGCAGCTCCATCGCCCTTAAGACAGGCCTTCGCCGGAGCAATAGTAACAGGGTTCGGTGACAGACGCCTTGACGTTCACTTCAAACTATACTTAGATAGAAGAAAAATACCAAGTTCTCTTACAAATATAGAGGAATCTTTGAAAAAAATATTAATACAAGATTTGATATCCAAACATAGTGCATTTGGACAAATAAAAATAGATGCATCTAGTATAATTATAAAAAGGGACTTAGAACACACATACCACTCGGAACAGTATGTCAAGGAAGCCATGAATGAAACTGTAACTACACCTAATCCCAAGGTGTTATCACCTCAAAACGCGAAAGACAAAACTCTCCAAAGCCGAATTGGTGTTGTTCGAAAAACGACAGTTAAACCTAAACAAACTTTGAGAAAAGATGATCCAGATGAACCCGATATAGATACAGAGAACATTCCAGTAGTACAAGGCTCTTTCCAAATAACAAAGACAGAGGCGGATATAACGGAGAATAAACATAATCCAAGCAAAACTAATCCCAGTAGAGGCGATGAAAAAAACAACCATAAAACACCATCTACACCAAAAACAGCGACATCTACAAACACGAAATCGCCTACAACTTATACAACAACAACCACCAGCACAAGAAAGTTCCCACCAAGTACACTGAAACCAAAACCAATCACAGTAAAGTCTTCTTTGAATATGAAGCCAAAAGTGGATATAAATAACAGTTTCAGAGAAGTAAGTACTGCCAAGCCTTCAACAACAACGAGTACAATGAAAACAACAACTACATCTAAAAGAACAACAACTACTTCTATGGCAACAACTACTCAAAATGTGTCTCAAATCTTATATGATCTTCTTACAAATGAAAATCACGATAAAGATTTGCCAAAAATTGATACTTTATTCACAGTACCTCATGTCATTGATAATGAACCGTGGAGGCCTATTACAAGACCTTATTATGAAACAACCAGTAAACAATCTACTTTACCTATAATTGAACAAAATGCAGAAGATCGAATAGGTGTTGCAGAAGTGGTTGAAGATATTTCTCTTTTAGAATCTATGTTAACTCCCAGTCCACCGGTGAAACACAAAGATATAACAACACGTAGACCATCAGGGCTGTACAACGTAGATCCCCATCTAGCGGCAGATGTATACATCCCCAATCCAGTTTATACTAGTTTTACTATTCCAGCATTCATTCCACCGCTCAAAGATATGGAAACATTGGGATCAAGCTATCCAAAACCACATCCTTTACCAGTTGACAAAATAAGTGGTGCAATAGAAGTGGTACCTGAATCAAATTTAAACATGGATGACAATGACGGTAGACCTATAATAAGGCCTCCCAAAGAAAAAACTTCTTCCGTTAGTATAAATGTATTTCAATTAGATAGTAATACAGAAAAAGTTTCAATTGAGGGTGCATCAATTGTAAAGAAACAAAATATTACTTCCACAAGCACAACAATGAAAACAACTACAATTGTTGATCGAAATACAAGAATTTCTACAACAGATATTCCTTTGAGTACACCTTCTAGCACTACTACAATAGGACATAAAAAGGAAAGTACTACGAAGAGACCAAATAATAAAGTATCAATAATACCATCGACCGGAACTCCTCACCATACATGGGAATTAGTCAACACCTCAACAAATAATAACGACACTTCTAATAAAGTATCTCCCCAAAAGTATTACAATGATACATTGCAAGCTATAATTGTTAAAAACGATGCTTCTCTCAATACGACAACAAGATTCCCGAGCAAATTTTCTATTTTAAGAAATTTAACTGACCTAATTAAACGATATTCTCAAAATAGTACGTTAAAGCCGTCTGAAATCAAAATTGAAACTACTACCGTTCATTCCAAAACAACAACTTCAGTAAAATTGGAGGACATTGGAGAAATTGTTCGACACACACCAGTAGAAATGACTGGATCAGTGGAAGTTATTTCCGAGGAAGACCTAGAGACAACAACAGCTAGAATCATAACATTGATGCCTGCTAAATCCAATCTAGGAGTGAATCGACCTCTACGACCCCGTCCAAAAATTGATCCTCAAGTAACAGAGGATAGTGAACGTAGTTTCAATGATCCATCAGATGTAAATAATTCTGCAGATGATCTTGAATTGAGTTCTTATAATTACACAGAATTATTGTCAGAAGCGTCTGAAATGATATCGAGCTCAGCAAGTATGTACAATGATACAAATATTGATGTAGTTGAAACTAATGAAGATCCCAAAGCTCTTCGATCATCGGGTATACCGGCCAATCCTGTGAGTGGAAATAGACTGCCTAAGTCAAACGATTTAAAAAATCCAAATGTTGAAAACTTTAAAGAGAGTAATATTCCTGAAGGGACGTACAAGGTATCTTATCATGTAACGGGCAGCGTTAGCAGTAAACAGGCTAATAAAACTAAACACCTTCCTGCCTATGAGCTAGCTCTAGAACCGGATGTAGTGCTAGAAATACCATCAAATCAAAGTAGTACATTAACCCTAGATAAATTAAAGCAACTAGCTAGCCTTGCTACAATAACAAATTTTAACAACAGCACATTTTTCCGTGCTCCTGGTGGTGTAATTTCGACCAAAGCGATCCCGTCAAGTTATACATTAAATCAAGCTGGATTTAAAATACTCACAAAAACATTTAACAAAGCAACGCCCGCGAAACAAGAGGAAAACAGCTTTAATCAACCAGAAAAACCTATTAGTAAACCAATTCTTACAAAAAAACAAAATAAACCAGAATTTGAAAAAGAAATAAAAGTTGAAGAATTCTGTGATAACTCCACGTCGTTTTCATGCTCCAGCGGAGCTTGCATTCCCCTGACGTCTCGGTGTAATAGATTGATAGATTGTCCCTCCGGGGAGGACGAGAAGGCATGCTCGTGTGCAGATTACTTACGAGCTGATTTTTCACAATCTAAGATTTGTGACGGTTTCGTTGATTGTTGGGATTACTCGGATGAAAATAAATGTGACTGGTGTAAAGAAGGCCAATTTGTATGTGCGAACGCTCGTCAATGTATAGAAATGAATAAAGTTTGCGATGGAAATCCTGACTGTCCACTCGGCGACGACGAGAAAAGCTGTGTAGCTTTAGGTGACGACATTGACAGCAACGAAGTCATTCCGTATAACGAGGAAGGTTTTGTTATGGTTCGTAAGCGCGGTGTTTGGGGCAGACTCTGCGTGGAGAGTTTCAATGATGTGGTCACTCAAGCACATAGTTCACTTAAGTTACCAGACCTTGGTAGGGCCGTCTGTCGTGCAATGACCTTTCAAGATTCGCCGTGGGTTCGCGAGGCACGTGAGGGTAGAAAAGTGAGCACGATAGGTTACTGGGAAGTTTGGCACAATGTACACGCTCGAGCCGCGGACACGCGGTTGACTTTCAAACGATCTAGTTGTACGAGACATCGCGCTCTGCGTGTCAGATGCGAGGACTTGGACTGCGGAATACGACCTCACGCTGATGCACAGCAACCCAGGGGTGTAACTTACGAGCGAGTGCGGTGGGGCAGGGTGGTAGGTGGTGGAGGAGCGGCGGCAGGCGCCTGGCCCTGGCAGGCAGCTTTATACCGCGATGGAGACTTCCAGTGTGGCGCTACCCTTATCTCAACGCAGTGGCTTCTATCAGCAAGTCATTGTTTCTATCAAGCTACTGAAGCCCATTGGGTTGCACGACTCGGAGCGTTGCGGAGAGGAGCCTGGCCTCGTGGTCCTTGGGAGCGAGTGACACGCGTTCGTCAAGTAGTGTTACATCCGAAGTATGCACCACGTGGATTTAAAAATGACATAGCGTTATTGCGAGTTGACCCTCTGCCTCTGCACGCTCGTCTGCGGCCGGCCTGTCTGCCACCGTCGCGTTCACAACCGCCAGCCGGACACCATTGTACCGTGGTTGGTTGGGGACAATTGTATGAACATGAACGGGTATTCCCGGACACGCTCCAGGAGGTGGAGTTGCCGGTGATATCCACAGCAGAGTGTCGTCGCCGCACTCGTCTGCTGCCCCTCTACAGGATCACTGAAGATATGTTCTGTGCCGGCTATGAACGCGGCGGACGCGACGCTTGTCTTGGAGACTCGGGAGGGCCGCTTATGTGCCAAGAGGACGATAGATGGTATATTTACGGTGTAACCAGCAATGGCTATGGATGTGCCAGGGCGAACCGACCTGGCGTTTACACGAAGGTCTCCAACTACATCGAGTGGATTGACAGCGTCATGACGACTCACACGACGACAACGAACAAAACTATATCGAACAGCGAAGAAAACTCCAAAGATTTCTACGCGGATTTAGAAACGGCAGAGAACAAGAGAGTTCTTCATAGGACCTACGATACTTGTAGAGGTTTCCGATGCCCTCTTGGGGAATGCCTACCACAGTCTAGCGTCTGCAATGGCTTCCTTGAATGTTCGGACGGCAGTGACGAATGGCAATGCGATAATTTTATGACGAATTCAAGCTGGTACAGTCCCGTCTAA

Protein sequence:

>DPOGS215619-PA
MSSFHRSEYRFAGDYGTGYRRAESSRGGGMCSAALVGGALLAAVAVLAVAALAFYMGALRPDNGEPIMTFEGTFRVTRGDVYGGVPGSPSWRERARRYSASLKQVYAAPSPLRQAFAGAIVTGFGDRRLDVHFKLYLDRRKIPSSLTNIEESLKKILIQDLISKHSAFGQIKIDASSIIIKRDLEHTYHSEQYVKEAMNETVTTPNPKVLSPQNAKDKTLQSRIGVVRKTTVKPKQTLRKDDPDEPDIDTENIPVVQGSFQITKTEADITENKHNPSKTNPSRGDEKNNHKTPSTPKTATSTNTKSPTTYTTTTTSTRKFPPSTLKPKPITVKSSLNMKPKVDINNSFREVSTAKPSTTTSTMKTTTTSKRTTTTSMATTTQNVSQILYDLLTNENHDKDLPKIDTLFTVPHVIDNEPWRPITRPYYETTSKQSTLPIIEQNAEDRIGVAEVVEDISLLESMLTPSPPVKHKDITTRRPSGLYNVDPHLAADVYIPNPVYTSFTIPAFIPPLKDMETLGSSYPKPHPLPVDKISGAIEVVPESNLNMDDNDGRPIIRPPKEKTSSVSINVFQLDSNTEKVSIEGASIVKKQNITSTSTTMKTTTIVDRNTRISTTDIPLSTPSSTTTIGHKKESTTKRPNNKVSIIPSTGTPHHTWELVNTSTNNNDTSNKVSPQKYYNDTLQAIIVKNDASLNTTTRFPSKFSILRNLTDLIKRYSQNSTLKPSEIKIETTTVHSKTTTSVKLEDIGEIVRHTPVEMTGSVEVISEEDLETTTARIITLMPAKSNLGVNRPLRPRPKIDPQVTEDSERSFNDPSDVNNSADDLELSSYNYTELLSEASEMISSSASMYNDTNIDVVETNEDPKALRSSGIPANPVSGNRLPKSNDLKNPNVENFKESNIPEGTYKVSYHVTGSVSSKQANKTKHLPAYELALEPDVVLEIPSNQSSTLTLDKLKQLASLATITNFNNSTFFRAPGGVISTKAIPSSYTLNQAGFKILTKTFNKATPAKQEENSFNQPEKPISKPILTKKQNKPEFEKEIKVEEFCDNSTSFSCSSGACIPLTSRCNRLIDCPSGEDEKACSCADYLRADFSQSKICDGFVDCWDYSDENKCDWCKEGQFVCANARQCIEMNKVCDGNPDCPLGDDEKSCVALGDDIDSNEVIPYNEEGFVMVRKRGVWGRLCVESFNDVVTQAHSSLKLPDLGRAVCRAMTFQDSPWVREAREGRKVSTIGYWEVWHNVHARAADTRLTFKRSSCTRHRALRVRCEDLDCGIRPHADAQQPRGVTYERVRWGRVVGGGGAAAGAWPWQAALYRDGDFQCGATLISTQWLLSASHCFYQATEAHWVARLGALRRGAWPRGPWERVTRVRQVVLHPKYAPRGFKNDIALLRVDPLPLHARLRPACLPPSRSQPPAGHHCTVVGWGQLYEHERVFPDTLQEVELPVISTAECRRRTRLLPLYRITEDMFCAGYERGGRDACLGDSGGPLMCQEDDRWYIYGVTSNGYGCARANRPGVYTKVSNYIEWIDSVMTTHTTTTNKTISNSEENSKDFYADLETAENKRVLHRTYDTCRGFRCPLGECLPQSSVCNGFLECSDGSDEWQCDNFMTNSSWYSPV-