Monarch geneset OGS2.0

DPOGS204619
Transcript          DPOGS204619-TA    2064 bp
Protein             DPOGS204619-PA    687 aa
Genomic position    DPSCF300432 + 81-8141
RNAseq coverage     1x (Rank: top 93%)
Annotation
Heliconius        HMEL010958          0.0       52.62%
Bombyx            BGIBMGA012217-TA    4e-75     53.70%
Drosophila        modSP-PA            9e-35     32.20%
EBI UniRef50      UniRef50_Q69BL0     1e-116    53.03%    Pattern recognition serine proteinase n=2 Tax=Obtectomera RepID=Q69BL0_MANSE
NCBI RefSeq       XP_001607879.1      4e-69     28.21%    PREDICTED: similar to ENSANGP00000018359 [Nasonia vitripennis]
NCBI nr blastp    gi|39655053         4e-116    53.03%    pattern recognition serine proteinase precursor [Manduca sexta]
NCBI nr blastx    gi|39655053         3e-123    55.00%    pattern recognition serine proteinase precursor [Manduca sexta]
Group
Gene Ontology     GO:0003824    8e-48      catalytic activity
                  GO:0004252    5e-18      serine-type endopeptidase activity
                  GO:0006508    5e-18      proteolysis
                  GO:0005515    1.7e-09    protein binding
KEGG pathway      bta:533181    1e-24
                  K06233 (LRP2) maps-> Hedgehog signaling pathway
InterPro domain   [471-687]    IPR009003    8e-48      Peptidase cysteine/serine, trypsin-like
                  [528-676]    IPR001254    5e-18      Peptidase S1/S6, chymotrypsin/Hap
                  [230-301]    IPR016060    4e-10      Complement control module
                  [178-227]    IPR002172    1.7e-09    Low-density lipoprotein (LDL) receptor class A repeat
                  [241-298]    IPR000436    2.9e-06    Sushi/SCR/CCP
Orthology group   MCL17556     Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204619-TA
AGTCCCGACGAGTTCACGTGTTCAGACGATGTCTGTATCAGCCAGGGTCTGGTGTGTGATGGTCACAGTGACTGCTGGAACGCAGCTGATGAAATGGCTTGCAACGGACTATCGGACCCGCTCTCCGATTTGATGATCCGCAGACCCAAACGTCAGACGCAAAATTGTCGCAAGAACCAGTGGCAGTGTCGTGACGGCACCTGCATAGGGTTCGACGGTAAATGTGACGGTGTGGTCGACTGTCCCGACTTCAGCGACGAGACCTTCGCGCTGTGCAGGGACATGCAATGCCAGAGCAATTGGTTCCGCTGTACTTACGGCGCCTGCGTCGACGGCAGCGCCCCTTGTAATGGTGTGCAAGAGTGCGCTGATAACTCCGACGAGTTGCTGCCTAGGTGCCGCAATCAAACAATTGGTTCCAGGGGTAAGCACACGTGCGACAATGGTCAGGTGATATCCTCGGTGGACATATGCGATGGGAAGAAGGACTGCGCTGATGGCTCTGACGAGACCCTCGCCACCTGCGCCGGGAACAGCTGTCCGTCATACGTGTTCCAATGTGCGTATGGAGCCTGTGTGGACCAGAACGCGAAGTGCAACAAGGTGGAAGAGTGTGCTGATGGTTCTGACGAAACAGACGAGCTCTGCAACAGGCTGGCGCCGGGTCAGCCGGTGACTCCAGCCACGAGACCACCACCTCAGGGGGGTAATTGTCTGTTGCCTCCATACCCTCAGTATGGGTCGTACAAGGTCAGACAGTACCCCAACGCGGTCCCCGGCCAGAGGTATCCCAACGTGAGGCTGGACGTCACCTGTAACCCTGGCTTCCAGACTGAAAACAATAACAGCATCTTCTGCGATAACGGAGAGTGGTCAGGACCTATGCCAGCGTGTCTCCGTTTCTGCAGGCTTAACAAACACCCGAGCGTGGAGTACCGCTGTCTGTTGTCTGGCAACTCGGTGACAGGGTCCAGAGAGTGTGGCTCATTGGAGCCGTCTGGGACCGTCGTCACCCCCATCTGCCGCTCCCCCAATTACTACTCCTCGGGGGTAATGTCCAACATGCACTGCGTTGAAGGCAGTTGGGACTATATAGCTGTGTGCAAACCAGGTTTGACCAACGTTACAATAAGTATAGATAGTTTAGAAATTATCATAACATCGGATAACGCCCACGTAATAATTAACAATTACGGGAACAAGGAGGTTAAGGTCGTCAACAATATTAGTAACGCTGATAGGATTGTGTTTGAAGACAGTAGAACGACCACCAGTAGACCAACCGCTAGTAGAACGACTACCAGTGGACCGACTAGCGCTAATTATGATAATGAAATCGATGAGGGTGACTGGAGAATGGCCTCCGTTGACACAATAGGTTTCCAAGCTCAGCCCGTCCGGCCCAAAAAGTGCGGTACAATAACTCCTGAGGGTATCCAGCTGGTGATCGGCGGGCGGTCTGCCAAGCGCGGGGAACTCCCGTGGCACGCGGGGATTTACAGCAAATTATTCACACCTTACATGCAGATATGTGGCGGGTCGCTCATCAGTACAACCACTATTATATCCGGGTCTGGTGCCAACTTCCAGGATGACATCGCGCTGGTTTTGGTCGTGACGCCCTTCATATACCAGGTCTTCATTAGACCTGTCTGTCTGGACTTCGACGTCAACTTCGACAGAACCCAGCTCTCGGAAGGGAATATGGGCAAGGTAGCCGGCTGGGGTCTGACTGACAAAAACGGTAAAGCGTCCCAAGTGCTGAAGGTGGTAGATCTTCCTTACGTCAAAATTGAAGACTGCTACGCCATGTCCCCGCCGACGTTCCGCGCTTACATCACAAGCGACAAGATCTGCGCCGGTTACACTAACGGCACGACGCTCTGCCAGGGCGACAGCGGCGGCGGCCTGGCGTTCCCCGCCTACGAACTCAACACCCAGAGGTACTACCTGCGAGGCATCGTGTCCACAGCTCCCAGGAACGACGATCTTTGCAACGC
CCACACCCTCACCACGTTTACGGCTGTATCGAAACACGAGCATTTCATCAAACAGTACCTCTAG

Protein sequence:

>DPOGS204619-PA
SPDEFTCSDDVCISQGLVCDGHSDCWNAADEMACNGLSDPLSDLMIRRPKRQTQNCRKNQWQCRDGTCIGFDGKCDGVVDCPDFSDETFALCRDMQCQSNWFRCTYGACVDGSAPCNGVQECADNSDELLPRCRNQTIGSRGKHTCDNGQVISSVDICDGKKDCADGSDETLATCAGNSCPSYVFQCAYGACVDQNAKCNKVEECADGSDETDELCNRLAPGQPVTPATRPPPQGGNCLLPPYPQYGSYKVRQYPNAVPGQRYPNVRLDVTCNPGFQTENNNSIFCDNGEWSGPMPACLRFCRLNKHPSVEYRCLLSGNSVTGSRECGSLEPSGTVVTPICRSPNYYSSGVMSNMHCVEGSWDYIAVCKPGLTNVTISIDSLEIIITSDNAHVIINNYGNKEVKVVNNISNADRIVFEDSRTTTSRPTASRTTTSGPTSANYDNEIDEGDWRMASVDTIGFQAQPVRPKKCGTITPEGIQLVIGGRSAKRGELPWHAGIYSKLFTPYMQICGGSLISTTTIISGSGANFQDDIALVLVVTPFIYQVFIRPVCLDFDVNFDRTQLSEGNMGKVAGWGLTDKNGKASQVLKVVDLPYVKIEDCYAMSPPTFRAYITSDKICAGYTNGTTLCQGDSGGGLAFPAYELNTQRYYLRGIVSTAPRNDDLCNAHTLTTFTAVSKHEHFIKQYL-
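
The transcript and protein lengths in this record agree with each other: 2064 bp is 688 codons, which is 687 amino acids plus one stop codon (the trailing "-" in the protein sequence). The short Python sketch below checks that arithmetic and translates only the first seven and last four codons copied from the nucleotide sequence above. The partial codon table and the `translate` helper are illustrative assumptions and are not part of the gene set or its tooling.

```python
# Partial standard genetic code: only the codons needed for this check.
CODON_TABLE = {
    "AGT": "S", "CCC": "P", "GAC": "D", "GAG": "E", "TTC": "F",
    "ACG": "T", "TGT": "C", "CAG": "Q", "TAC": "Y", "CTC": "L",
    "TAG": "-",  # stop codon, written "-" as in the record
}


def translate(nt):
    """Translate a nucleotide string in frame 1 using CODON_TABLE (hypothetical helper)."""
    if len(nt) % 3 != 0:
        raise ValueError("length must be a multiple of 3")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))


TRANSCRIPT_BP = 2064  # from the Transcript field
PROTEIN_AA = 687      # from the Protein field

# 687 residues plus one stop codon should account for the full transcript.
length_consistent = (PROTEIN_AA + 1) * 3 == TRANSCRIPT_BP

first_codons = "AGTCCCGACGAGTTCACGTGT"  # first 21 nt of DPOGS204619-TA
last_codons = "CAGTACCTCTAG"            # last 12 nt of DPOGS204619-TA

n_term = translate(first_codons)  # expected to match the start of DPOGS204619-PA
c_term = translate(last_codons)   # expected to match the end of DPOGS204619-PA

print(length_consistent, n_term, c_term)
```

A full translation would need the complete 64-codon table (for example from a library such as Biopython); the fragment here is kept small so that it can be checked by eye against the sequences above.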