Monarch geneset OGS2.0

DPOGS202493
Transcript        DPOGS202493-TA  2727 bp
Protein           DPOGS202493-PA  908 aa
Genomic position  DPSCF300131 - 641167-653734
RNAseq coverage   765x (Rank: top 17%)
Annotation
Heliconius        HMEL002971        1e-73   64.55%
Bombyx            BGIBMGA000097-TA  2e-180  63.11%
Drosophila        CalpA-PB          1e-98   40.50%
EBI UniRef50      UniRef50_G3HC86   7e-101  43.29%  Calpain 11 n=1 Tax=Cricetulus griseus RepID=G3HC86_CRIGR
NCBI RefSeq       NP_001153877.1    3e-101  67.78%  calpain-B [Apis mellifera]
NCBI nr blastp    gi|350413596      1e-110  45.66%  PREDICTED: calpain-A-like isoform 4 [Bombus impatiens]
NCBI nr blastx    gi|322792300      7e-107  44.34%  hypothetical protein SINV_03488 [Solenopsis invicta]
Group
Gene Ontology     GO:0004198  9.4e-102  calcium-dependent cysteine-type endopeptidase activity
                  GO:0006508  9.4e-102  proteolysis
                  GO:0005622  9.4e-102  intracellular
                  GO:0005509  1e-13     calcium ion binding
KEGG pathway 
InterPro domain   [174-684]  IPR001300  9.4e-102  Peptidase C2, calpain, catalytic domain
                  [410-561]  IPR022683  6.4e-82   Peptidase C2, calpain, domain III
                  [177-200]  IPR022684  2.8e-68   Peptidase C2, calpain family
                  [410-559]  IPR022682  6.4e-57   Peptidase C2, calpain, large subunit, domain III
                  [725-771]  IPR011992  1e-13     EF-hand-like domain
Orthology group   MCL10157   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
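
The InterPro hits listed above are residue intervals on the 908 aa protein, so their lengths and overlaps can be worked out directly. The following is a minimal sketch, not part of the original record. It assumes the bracketed coordinates are 1-based and inclusive, and the names `DOMAINS`, `span_length`, `overlaps`, and `summarize` are hypothetical helpers introduced only for illustration.

```python
# InterPro hits copied from the record: (start, end, accession, name).
# Assumption: coordinates are 1-based, inclusive residue positions.
PROTEIN_LENGTH = 908

DOMAINS = [
    (174, 684, "IPR001300", "Peptidase C2, calpain, catalytic domain"),
    (410, 561, "IPR022683", "Peptidase C2, calpain, domain III"),
    (177, 200, "IPR022684", "Peptidase C2, calpain family"),
    (410, 559, "IPR022682", "Peptidase C2, calpain, large subunit, domain III"),
    (725, 771, "IPR011992", "EF-hand-like domain"),
]


def span_length(start, end):
    """Number of residues in an inclusive interval."""
    return end - start + 1


def overlaps(a, b):
    """True if two (start, end, ...) hits share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


def summarize(domains, protein_length):
    """Return (accession, start, end, length) rows sorted by position."""
    rows = []
    for start, end, accession, _name in sorted(domains):
        if not (1 <= start <= end <= protein_length):
            raise ValueError(f"{accession}: interval outside the protein")
        rows.append((accession, start, end, span_length(start, end)))
    return rows


if __name__ == "__main__":
    for accession, start, end, length in summarize(DOMAINS, PROTEIN_LENGTH):
        print(f"{accession}\t{start}-{end}\t{length} aa")
```

Under these assumptions, the EF-hand-like hit (725-771) does not overlap the catalytic-domain hit (174-684), while the two domain III hits fall inside it.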

Nucleotide sequence:

>DPOGS202493-TA
ATGGAAGAAGAATTTGTTAAATTGAATAAAAATAATTCCATGAAAGTGACCACCGGTTTCCAGAATGTGATTGTACGAACGGCAGTTGAGGCTTTCAATAAACCTGAATCCATACCTAAAACAGATCGAGATGTTAAAAACGCTGGGTTTTTTGCGAAATTACTAAGGAAATCGAATGTGTCGGATATCGATAAGACAAAAGGAAAACCTCTTTGGGCCAACAAAGTCGATTCAGATGAAAGAATTGATATCGAGAAGCGAAAAATAGTATCGACTACAGACAGATCAAACATCTCGAATAGATTTCATTTCGCCAAACCGGAAGTTACGTCGGAAGTTACATCGGAAGTTAATAACGATGAAATCAAAAATAAATCACCACGTAAAAAATCGATCGTCGTACCAAGTAGGGTAAATCGATTCAAAACGCCAGCAAATTACAACGGCAGTCACATGTTCATGCCTACAGGCGAACGGTTGTTCTGGCTCGGTGAGACCCGTCCATCATCATTCGGTCCAGCCACGTACCAGGATTTCAAGGAGATCAGATCTCGCTGTCTCTCCGAAGGCAGGCTGTTCGAGGATCCGGAATTCCCGGCCACCGATCGCAGTTTGTACTACAAGGAACGTCTGGATAGACCCTTAACATGGCTAAGACCTGGGGAAATCAGCGAAGATCCGCAGCTATTCGTGGAGGGCTACAGTCGCTTCGACGTGCAACAGGGCGAGTTAGGAGACTGTTGGTTGCTGGCTGCCGTCGCCAATCTGACGCTCCATAGAAAACTCTTCTTCCAAGTAGTGCCGGACGACCAGAGCTTCGATGAAGAATACGCTGGTGTCTTCCACTTCCGGTTCTGGCAGTATGGTCGCTGGGTGGACGTTGTCGTCGACGACCGCCTGCCGACCTACCGCGGAAAACTGGTCTTTCTTCACTCATCAGAGAGAAATGAGTTTTGGAGTGCCTTATTGGAGAAGGCCTATGCTAAACTCCACGGTTCCTATGAAGCCTTAAAGGGAGGTTCTACCTGTGAAGCCATGGAAGATTTCACGGGCGGTGTGACCGAAATGTACGAAATGACGGAACTACCGCCCAACTTCTATACTATACTACTGAAAGCATACGAACGTAACTCACTCATGGGATGCAGTATTGAGGTAGGAGTCGAAATCTGCAACTTGAACCCCGACTCCCTGGACCCCGAAGAATGTCCTGAGGGCTGCACCAAGAAGTGGGAGATGTCTGTGTTTGAAGGGGAGTGGGTCAGAGGTGTAACCGCTGGCGGCTGTAGGAATTACCTAGAATCGTTTTGGAAGAATCCTCAATACACCGTTACACTGAAAGACCCCGACGAAGATGACGCGGAGAACAAGTGTACCATAATAGTGGCGTTGATGCAAAAGAACCGTCGCTCTCAGCGTCACCAGGGGCTCGAGTGCCTCACCATAGGGTTCGCGGTGTACCGCCTGCCCGACTACGGCCATGTGCCCAAGCCCTTAGATGTCAACTTCTTCAAATACAACGCCAGTGTGGGCAGGTCGCAGGCCTTCATCAATCTGAGGGAGGTCAGCGCCAGATTCAAATTCGAGCCCGGAAGCTACGTCATAGTGCCGTCCACCTTCGAACCTGACGAGGAAGGGGAGTTCCTGTTGCGTGTGTTCTCCGAGAAAACGAATAATATGACAGAGAACGACGAAGAAGTAGGGATGGGAGACGTGGATGACAGAAAAATGTTAGATTATGTCATTACGGCGGTCAAACGGACGCTGGGAAGACACGCAGGGGACCTGGCATTCCGTGGAATTTATCGCTATAAGAATGGACAAAATGGCCATCATGACCAAAACGCGGATGGGATTGTTAGAACTAGAGTGAAGGAAATAACTCCCAACCCGGAGCCCGCGGATCCTGTTAGAGAGTTCTTCACCCGCCTGGCTGGGAGCGACGGGGAGGTGGACTGGCAGGAACTGAAGGAGATACTGGACTACGCCATGAGAGAGGA
CCTAATGCCGCTTTGTAATTGTCCGCCAAACGAACCGATGACATCTAAATGGCTATGCGGCATGGCGCTTATGACGGGGGCGGGGGATCCCCGCCCGATCTGCAAGGAGATAGGAATAGACCTACAGCAGATGCAAACACAGCCGGAGCAGGTCCACCTCAACCAGCCGCAGGCGCAAGAACTAAAAGGACAAGGTTTCTCTAAGGAAGTCTGTAGGAGTATGGTTGCTATGTTGGACAAAGACAACTCTGGCGGACTCGGCTTCGAAGAGTTCAAATCTCTTTGGATCGATTTGCGCAACTGGAGGAGCTGTTTGGAACAGCTGCTTTGCGCGCTCTGCACGCCGATCTGCAAGGAGATAGGAATAGACCTACAGCAGATGCAAACACAGCCGGAGCAGGTCCACCTCAACCAGCCGCAGGCGCAAGAACTAAAAGGACAAGGTTTCTCTAAGGAAGTCTGTAGGAGTATGGTTGCTATGTTGGACAAAGACAACTCTGGCGGACTCGGCTTCGAAGAGTTCAAATCTCTTTGGATCGATTTGCGCAACTGGAGGGTAGGTGACACGCTATACGGGTCAAGCGATGGCTACATACAGTTTGATGACTTCATCATGTGTTCGGTGCGGCTGAAGACCATGATCGACGCTTTCCAAGGCAGGTCGTCGGGCGGCGACTACGCCACGTTTTCCCTGGACGAATGGCTGAATCGCACAGTCTACTCCTAA

Protein sequence:

>DPOGS202493-PA
MEEEFVKLNKNNSMKVTTGFQNVIVRTAVEAFNKPESIPKTDRDVKNAGFFAKLLRKSNVSDIDKTKGKPLWANKVDSDERIDIEKRKIVSTTDRSNISNRFHFAKPEVTSEVTSEVNNDEIKNKSPRKKSIVVPSRVNRFKTPANYNGSHMFMPTGERLFWLGETRPSSFGPATYQDFKEIRSRCLSEGRLFEDPEFPATDRSLYYKERLDRPLTWLRPGEISEDPQLFVEGYSRFDVQQGELGDCWLLAAVANLTLHRKLFFQVVPDDQSFDEEYAGVFHFRFWQYGRWVDVVVDDRLPTYRGKLVFLHSSERNEFWSALLEKAYAKLHGSYEALKGGSTCEAMEDFTGGVTEMYEMTELPPNFYTILLKAYERNSLMGCSIEVGVEICNLNPDSLDPEECPEGCTKKWEMSVFEGEWVRGVTAGGCRNYLESFWKNPQYTVTLKDPDEDDAENKCTIIVALMQKNRRSQRHQGLECLTIGFAVYRLPDYGHVPKPLDVNFFKYNASVGRSQAFINLREVSARFKFEPGSYVIVPSTFEPDEEGEFLLRVFSEKTNNMTENDEEVGMGDVDDRKMLDYVITAVKRTLGRHAGDLAFRGIYRYKNGQNGHHDQNADGIVRTRVKEITPNPEPADPVREFFTRLAGSDGEVDWQELKEILDYAMREDLMPLCNCPPNEPMTSKWLCGMALMTGAGDPRPICKEIGIDLQQMQTQPEQVHLNQPQAQELKGQGFSKEVCRSMVAMLDKDNSGGLGFEEFKSLWIDLRNWRSCLEQLLCALCTPICKEIGIDLQQMQTQPEQVHLNQPQAQELKGQGFSKEVCRSMVAMLDKDNSGGLGFEEFKSLWIDLRNWRVGDTLYGSSDGYIQFDDFIMCSVRLKTMIDAFQGRSSGGDYATFSLDEWLNRTVYS-
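
The transcript and protein records above can be cross-checked against each other: 2727 bp corresponds to 908 codons plus one stop codon, and the protein record marks the stop with `-`. The sketch below is an illustration only, not the pipeline used to build the geneset. It assumes the standard genetic code and that the transcript record is the coding sequence in frame; `parse_fasta` and `translate` are hypothetical helper names.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def parse_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def translate(cds, stop="*"):
    """Translate an in-frame coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First seven codons and last four codons of DPOGS202493-TA.
    print(translate("ATGGAAGAAGAATTTGTTAAA"))
    print(translate("GTCTACTCCTAA", stop="-"))
```

To compare the full records, one could pass the complete page text to `parse_fasta`, call `translate(records["DPOGS202493-TA"], stop="-")`, and test the result for equality with the `DPOGS202493-PA` entry.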