Monarch geneset OGS2.0

DPOGS203981
Transcript: DPOGS203981-TA (1893 bp)
Protein: DPOGS203981-PA (630 aa)
Genomic position: DPSCF300005 + 1080075-1086331
RNAseq coverage: 134x (Rank: top 56%)
Annotation
Heliconius      HMEL013527        0.0     86.26%
Bombyx          BGIBMGA002120-TA  0.0     56.49%
Drosophila      CalpC-PA          3e-160  45.84%
EBI UniRef50    UniRef50_B0WDU6   0.0     49.63%  Calpain-c n=6 Tax=Diptera RepID=B0WDU6_CULQU
NCBI RefSeq     XP_001120458.1    0.0     58.26%  PREDICTED: similar to Calpain C (Calcium-activated neutral proteinase homolog C) (CANP C) [Apis mellifera]
NCBI nr blastp  gi|307174211      0.0     59.06%  Calpain-C [Camponotus floridanus]
NCBI nr blastx  gi|307174211      0.0     59.06%  Calpain-C [Camponotus floridanus]
Group
Gene Ontology
GO:0004198  4.5e-73  calcium-dependent cysteine-type endopeptidase activity
GO:0006508  4.5e-73  proteolysis
GO:0005622  4.5e-73  intracellular
GO:0005509  1.4e-20  calcium ion binding
KEGG pathway 
InterPro domain
[17-304]   IPR001300  4.5e-73  Peptidase C2, calpain, catalytic domain
[2-25]     IPR022684  2.2e-70  Peptidase C2, calpain family
[320-453]  IPR022683  1.3e-49  Peptidase C2, calpain, domain III
[320-452]  IPR022682  1.6e-38  Peptidase C2, calpain, large subunit, domain III
[472-618]  IPR011992  1.4e-20  EF-hand-like domain
Orthology group: MCL16122 (Insect specific)
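
The InterPro coordinates listed above can be checked against the 630 aa protein length. The Python sketch below is an illustration only: it assumes the bracketed ranges are 1-based and inclusive, and the helper names span_length and overlaps are hypothetical, not part of the record.

```python
PROTEIN_LENGTH = 630  # aa, as reported for DPOGS203981-PA

# (InterPro ID, start, end); coordinates assumed to be 1-based and inclusive.
DOMAINS = [
    ("IPR001300", 17, 304),
    ("IPR022684", 2, 25),
    ("IPR022683", 320, 453),
    ("IPR022682", 320, 452),
    ("IPR011992", 472, 618),
]


def span_length(start, end):
    """Number of residues covered by an inclusive range (hypothetical helper)."""
    return end - start + 1


def overlaps(a, b):
    """True if two (id, start, end) entries share at least one residue."""
    return a[1] <= b[2] and b[1] <= a[2]


for ipr, start, end in DOMAINS:
    # Every domain must lie within the protein.
    assert 1 <= start <= end <= PROTEIN_LENGTH
    print(ipr, span_length(start, end))
```

Under the inclusive-range assumption, the catalytic domain entry (IPR001300) covers 288 residues and the EF-hand-like entry (IPR011992) covers 147.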

Nucleotide sequence:

>DPOGS203981-TA
ATGACGGATTACGAGCGCATCAAGGCTAGCTGCCTCCAGCGCGGCCAGCTATGGGAGGACCCTGACTTCCCCGCTATCCAGCCATCCGTGTTCTACCATCAGGTGCCACCCTTCAAATTTGAGTGGAAGCGCGCCAAAGAACTATACCAGTATCCAAAGTTCATATTGGACAACAGTGACAGCTTCGACATAGTTACAGGCAGACTTGGAGATAAATGGCTTTTATCGTGTATAGGAGTGCTATACCTGTGTAAAGGGCTATTCTACAGAGTGGTTCCAGCCGATCAACAAATCGATTCTAACTACGCAGGTATTTTCAGGTTTCGTCTTTGGTGGTGTGGACAATGGGTTGAAGTTCTTGTTGACGATAGATTACCTACGGTTAACGGTAAATTAGCTTTCATGCATTGCAGTCACTCTGAGCAATTGTGGCCGGCTCTACTTGAAAAGGCATATGCAAAAATGCATGGTTCGTACGAAGCTCTAAAATACGGTAACTTACTGGATGGACTGGCAGATCTTACTGGAGGAATCACAGAATCCCTGAATATTTCTGACCTTGCCGATGCCACAGCTCTACACAACTTAATGAAGACTACGAGCGTTGTTACAGCTTACCGTTTACCTAATGCCGCCACACATTCTGTGAAAAGCATTGAATCTGGAATGAATTACAGACTTTACAACGTGGAAAGGGTAGATACTTCTGATGGTCCAGTGTATTTAGTGCGATTGGGACGACCATTAACACCTGGTGATACGCACATTACTCATTTTGTTTTAGACCAAGCCACATGGACCCATTCTATTCCTCTACACGAACGTCAACGTTTGACATCTATAACAAAAGGTTTCTGGATGCTTTATAATGACTTCACTTCTATGTTTTCGCGTGTGGAAATAGTCCATCTTGATCTAGAAACGAGCAAAGCGGAAGCATCTCTCTCAGATAAAAACAAATGGCTAGTAAAAAGTCACCAAGGAAGGTGGAGAAAGGGCGTCACTGCTGGTGGTTGTAGAAATCACGTTAATCTGTTTCATATGAACCCACAAATACAAATTGTATTAAATGATCCTGACACGGTGATTATATCACTCAATCAGCATAGTATTATGGAACCTAAAGTTATAGGATTCAGCATTTACAAAATACCTAAGAGCTTAACAGAAACAGCATCATCACTTTTTTTTAAGAAGACTAAAAGTTCGATTAATTCCCAATACACAAACAGTAGGCAAGTTAGTGAAAGATGTCACTTAGAACCAGGTGCATACTTAGTAATACCCACGACTTTTGAACCTAGACAGGAAGCAAATTTTTCATTAAGAGTGTACTCTGTAAAGCAACTCAAACTGAAAGTATTAGATTGTGCCCCACAGATGTTAAAAGCAGCTATTTTAAAAGCACCTCCTGGGTTCGAAACTAGTAGTTTCACACAATATGAGTCACAATTTCTACAGCTGGCTGATGAACACAAAACTATAAATGCCTTCGAACTACAAGAATTATTAGAAAAGTGCTTGCCAAATGATTACATAAAGAGCTGTGCAACAATCGAAACATGTAGACAAATCGTCTTATCATTGGAAAAAGATGGCTCTGGTCGTATAACATTATCTGATTTCAAAGATCTCATATGCAGCCTGAAGCACTGGCAGATTGTATTCCGAGCTCACGCTCCAGAGAAAATGAGCGTCCTCAAGATTGAAAGGTTTCGAGATGCACTTCGCGATGTCGGCTTTGTAATTCCAGAACGGGCATTGTCATTACTTGTATTGAAGTACATGAGAAAAGATGGCATGCTGAGATTTGGGGACTTTGTATCTGCAGTAGTTCTTCTCCATAGAGCGTTTCTGGCTGAAGTCGGCTTTGACATGTTGATATTGGGCTGA

Protein sequence:

>DPOGS203981-PA
MTDYERIKASCLQRGQLWEDPDFPAIQPSVFYHQVPPFKFEWKRAKELYQYPKFILDNSDSFDIVTGRLGDKWLLSCIGVLYLCKGLFYRVVPADQQIDSNYAGIFRFRLWWCGQWVEVLVDDRLPTVNGKLAFMHCSHSEQLWPALLEKAYAKMHGSYEALKYGNLLDGLADLTGGITESLNISDLADATALHNLMKTTSVVTAYRLPNAATHSVKSIESGMNYRLYNVERVDTSDGPVYLVRLGRPLTPGDTHITHFVLDQATWTHSIPLHERQRLTSITKGFWMLYNDFTSMFSRVEIVHLDLETSKAEASLSDKNKWLVKSHQGRWRKGVTAGGCRNHVNLFHMNPQIQIVLNDPDTVIISLNQHSIMEPKVIGFSIYKIPKSLTETASSLFFKKTKSSINSQYTNSRQVSERCHLEPGAYLVIPTTFEPRQEANFSLRVYSVKQLKLKVLDCAPQMLKAAILKAPPGFETSSFTQYESQFLQLADEHKTINAFELQELLEKCLPNDYIKSCATIETCRQIVLSLEKDGSGRITLSDFKDLICSLKHWQIVFRAHAPEKMSVLKIERFRDALRDVGFVIPERALSLLVLKYMRKDGMLRFGDFVSAVVLLHRAFLAEVGFDMLILG-
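
The reported lengths are consistent with a complete coding sequence: 1893 bp is 631 codons, which is 630 amino acids plus one stop codon (written as "-" at the end of the protein sequence). The Python sketch below illustrates that check and translates the first and last codons of the transcript with the standard genetic code. It is not part of the original record. The function name translate and the default stop symbol are assumptions made for illustration.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        # The record writes the stop codon as "-", so mirror that here.
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# Lengths taken from the record: 1893 bp transcript, 630 aa protein.
transcript_bp = 1893
protein_aa = 630
codons = transcript_bp // 3
assert transcript_bp % 3 == 0
assert codons - 1 == protein_aa  # one codon is the stop

# First 30 and last 12 nucleotides of DPOGS203981-TA, copied from the record.
print(translate("ATGACGGATTACGAGCGCATCAAGGCTAGC"))  # MTDYERIKAS
print(translate("ATATTGGGCTGA"))                    # ILG-
```

Both fragments match the start (MTDYERIKAS) and end (ILG-) of the DPOGS203981-PA sequence.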