Monarch geneset OGS2.0

DPOGS215392
TranscriptDPOGS215392-TA1992 bp
ProteinDPOGS215392-PA663 aa
Genomic positionDPSCF300088 - 276169-288747
RNAseq coverage7x (Rank: top 87%)
Annotation
HeliconiusHMEL0036740.069.80% 
BombyxBGIBMGA012439-TA4e-15773.77% 
DrosophilaFur2-PC2e-14143.93% 
EBI UniRef50UniRef50_D6X5680.054.81%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6X568_TRICA
NCBI RefSeqXP_001809621.10.054.41%PREDICTED: similar to prohormone convertase 1 [Tribolium castaneum]
NCBI nr blastpgi|2700007640.054.81%hypothetical protein TcasGA2_TC004402 [Tribolium castaneum]
NCBI nr blastxgi|2700007640.054.72%hypothetical protein TcasGA2_TC004402 [Tribolium castaneum]
Group
Gene OntologyGO:00042521.5e-97serine-type endopeptidase activity
GO:00065081.5e-97proteolysis
KEGG pathway 
InterPro domain[11-621] IPR0155003.5e-188Peptidase S8, subtilisin-related
[125-460] IPR0002091.5e-97Peptidase S8/S53, subtilisin/kexin/sedolisin
[456-599] IPR0089797.1e-36Galactose-binding domain-like
[504-588] IPR0028843.9e-26Proprotein convertase, P
[11-81] IPR0090206.7e-15Proteinase inhibitor, propeptide
Orthology groupMCL17035 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215392-TA
ATGCACGATGGAAGAGAGACCGCGAACAGTTATGATGGTGAGTGGATCGTTGAAGTGGTAGGTGGTGAGGAGGTGGCGCAGTTGGTGGCGCTGGAACACGGATATAAATACGAAGGACCGGTGCTGGGTTTGGCAAACATGTACGCGTTCCACGCACACGAGCGCAAGGAGCGTCGCACCCCGAGCAAGCACACATCCACACTGCGCAAGGACAGGAGGATTCGATGGGCGGAACAACTCTTTGCAAAAAGTCGCGTGAAGCGATATCCGTACCCTGACCTCGACGGCACATTAAAACGAGTAAAAAGAATAGATGAATACACCAGGGATGCAGACTTTACGAGGAGTTCAACCGTGGAACACGGACGGAGGGAGGTCTTCAATGACGAGCTCTGGGCCTACGAATGGTATTTGCAAGACACTCGTGACAATCCAAACGTACCTCGCCTGGACCTCAATGTGTTATCGGTGTATAATATGGGCTACAACGGACGTGGTGTTCGCGTGTCTATACTCGACGACGGAGTCGAACACAATCACACGGACTTACAGAACAACTACGATCCGGAAATCAGTTGGGATTGCAATGATGGAGACTCGGATCCATATCCGAGGCATGACGATAAAAACCGGAATTCTCACGGCACGAGATGTGCCGGTGAGATAGCGATGACGGCTAACAATAAGAAGTGCGGAGTGGGCGTGGCCTGGGGCGCCAAAGTGGGTGGAGTCAGAATGCTCGATGGACGAATCACTGATCATGTTGAAGGCGAAGCAATAGGATTCGCGTGGGACAAAGTGGACATATACAGCGCTTCATGGGGCCCCAACGATGACGGAGAGACCGTGGAGGGTCCAGGGCGACTCGCCATGGAGGCCTTCAAGAGAGGAGTGCAAATGGGCCGGAACGGTAAAGGGAATATATTCGTGTGGGCCAACGGCAATGGTGGAACACACGACGATAACTGTAACTGCGACGGCTACTCTTCCAGTATGTACACGATATCTATTGCTAGCGCTTCCCAACAAGGCCTGTTTCCTTGGTACGGAGAGATCTGCTCCTCGACTCTAGCAACCGCATACTCCTCTGGTGCTTACAGTGATCAGAAAATTGCCACTACAGACGTAAACGACTCGTGTACACTTGGGCACACGGGCACCTCTGCAGCGGCGCCATTGGCGGCCGGTATTATTGCTTTAATGCTAGATGCCAACCCAAATTTAACTTGGAGAGATGTCCAACATCTGATTGTATGGACTTCGGAATATACACCGCTATCTGATAACCCCGGTTGGCAAGTCAACGGCGCGGGTCTTTATTTCGACGTACGTTTCGGCTTTGGTCTTTTGAACGCCGGATCTCTTGTCAACGCCGCACTCAACTGGACTACAGTACCAAGTGCACTATCGTGTAGAATCGATGCTTCTCCGATCAAAGGCAAAGTCGCCATTTCAGCAATGGAAACTGTAGATATAACAGTAAAAGTATCGGACTGTGAAGTAAATTACTTAGAACACGTCGAACTGTATGTTAATATCGAGTATACGCGAAGAGGTGCTTTGGAAATACACCTAATTTCTCCTCAAGGTACGATGGTTCAACTACTCAGTCCTCGTCCGAGAGATACGTCCAAGGTCGGCTTTGTTAACTGGCCTTTAACCTCAGTAGCGACGTGGGGAGAGAGAGCTAATGGACTTTGGAGGGTCATCGTACAAGACAAGGGGAATAAATGGAACACGGGTTATGTCGGTGAACTGGTTCTCATAGTCCACGGTACAAAGGAAATGCCCGCTCACATGAGGAGTGGTCCGAGGAGATACGACGACACCTTCAGTCGGTACGAGATCGAGTCGTATGAGGATGAGCCGGCGGTACCAGGAGACCATGAGCACGGAGGAGTCGCCAGCGCGCTACTGGACCAGGCGGACACCGAGCTACAGAGGAACTACCACAGCAGGGGGCAGCAGGCTGGCGAGCGACACCGCGATTGA

Protein sequence:

>DPOGS215392-PA
MHDGRETANSYDGEWIVEVVGGEEVAQLVALEHGYKYEGPVLGLANMYAFHAHERKERRTPSKHTSTLRKDRRIRWAEQLFAKSRVKRYPYPDLDGTLKRVKRIDEYTRDADFTRSSTVEHGRREVFNDELWAYEWYLQDTRDNPNVPRLDLNVLSVYNMGYNGRGVRVSILDDGVEHNHTDLQNNYDPEISWDCNDGDSDPYPRHDDKNRNSHGTRCAGEIAMTANNKKCGVGVAWGAKVGGVRMLDGRITDHVEGEAIGFAWDKVDIYSASWGPNDDGETVEGPGRLAMEAFKRGVQMGRNGKGNIFVWANGNGGTHDDNCNCDGYSSSMYTISIASASQQGLFPWYGEICSSTLATAYSSGAYSDQKIATTDVNDSCTLGHTGTSAAAPLAAGIIALMLDANPNLTWRDVQHLIVWTSEYTPLSDNPGWQVNGAGLYFDVRFGFGLLNAGSLVNAALNWTTVPSALSCRIDASPIKGKVAISAMETVDITVKVSDCEVNYLEHVELYVNIEYTRRGALEIHLISPQGTMVQLLSPRPRDTSKVGFVNWPLTSVATWGERANGLWRVIVQDKGNKWNTGYVGELVLIVHGTKEMPAHMRSGPRRYDDTFSRYEIESYEDEPAVPGDHEHGGVASALLDQADTELQRNYHSRGQQAGERHRD-