Monarch geneset OGS2.0

DPOGS206355
TranscriptDPOGS206355-TA2307 bp
ProteinDPOGS206355-PA768 aa
Genomic positionDPSCF300082 + 730352-736931
RNAseq coverage384x (Rank: top 31%)
Annotation
HeliconiusHMEL0126222e-14967.23% 
BombyxBGIBMGA014111-TA0.071.72% 
Drosophilasol-PB8e-2229.21% 
EBI UniRef50UniRef50_F4WIK80.055.47%Calpain-7 n=12 Tax=Bilateria RepID=F4WIK8_ACREC
NCBI RefSeqXP_967682.10.057.40%PREDICTED: similar to calpain [Tribolium castaneum]
NCBI nr blastpgi|910894410.057.40%PREDICTED: similar to calpain [Tribolium castaneum]
NCBI nr blastxgi|910894410.057.40%PREDICTED: similar to calpain [Tribolium castaneum]
Group
Gene OntologyGO:00041986e-51calcium-dependent cysteine-type endopeptidase activity
GO:00065086e-51proteolysis
GO:00056226e-51intracellular
KEGG pathway 
InterPro domain[195-503] IPR0013006e-51Peptidase C2, calpain, catalytic domain
[638-768] IPR0226821.9e-24Peptidase C2, calpain, large subunit, domain III
[637-764] IPR0226838.7e-18Peptidase C2, calpain, domain III
[238-260] IPR0226848.7e-12Peptidase C2, calpain family
[5-68] IPR0073302.6e-09MIT
Orthology groupMCL16033 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206355-TA
ATGAGTGATTTCGTTAACGCAACGGAAGCAGCGTCTCTGGCTGTGCAGTTTGATCAAACAGGCCAGACTGATAAAGCAGTAGAATGTTACCGGACTGCTGCAAGATTGTTGGATCGTGTGTGTCATCAAGTTTCTTCAGAGAAGCAAATAGAATTCCGACGTAAAGTAAAAGAATACTTAGAGCGGGCTACACGCCTACAGGAGCAGAAAGATGGAGCCCAACAAGAGGAAAGTATAAGAAATCTACAAAAATGTTATTTTCTGATGCAGCAAGCGTTAGACGCAGATGAGGCCAATTTGAAGGATTTAGCTCTGCAATTATACACACAGGCAGTAGAGTTGGCTGTCAGTGTTAAATCAAGTGACCCTGATGTAATGACAAAGGTGAAAGGTTTAGCAGCACAAGCTCTGGATAGAGCCGAAGAGATTAAAGGTATCAAGAAAATGGGGGCTTTGTCGATTACACCAGGTGTATCACCAAATCGTGTTCATCTCCAACGAGAACAGAGTGTCCATTTAAAAGTGAGTGGAAAACAAGTGTACACGGAAGAGGAAAAGGAAGTACTGCGAACGACATCCAACATCAACAACAACTGTTTCCTTCCGTTTATGGAGGTGGATTTGTCAGAGAAATTCCAATATGCATTGCCGTTTGAAGACAGAGCCAGCGAATTGAAACTGTCATCGAAACAGGCTAAGGAGTTCCAGTGCTGGGTTCGGCCTCATGAAGTATCAAGTGATCCCAAGATGATCGCTGGTGATGACCTCGATTTCTTCAGCATCAAACAGACTATAGTATCAGATTGTTCGTTCGTGGCGTCGTTAGCTGTAAGTGCCTTGTACGAAAGGAGGTTCAATAAGAAGATAATAACCTCAATAATATACCCAAAGAATAAGAACAAACAGCCCGTGTACAATCCCTTCGGCAAGTACATGGTTAAGCTGCACTTAAATGGTGTCAGGAGAAAGGTCATCATAGATGATCGTCTGCCATACAGCAAGTACGGGCGTCTGCTGTGTTCGTATTCCAGCAATAAGAACGAGTTCTGGAACATCGACCTCCACGCGCTAACCGGTTGGATCCCGGAGCGGTGCGCGATCCGGTCCGAGGCGGACTTCAACGCGGATGGTTTGTATGAGATAGTGCGGGCGAGGCTGGAGGCGGGACACGTGCTGGCCAGCGTCGCCACCGGGGACCTGTCCGACGACGACGCCGAGCGGACCGGCCTCGTCGCGTCGCACGCTTACGCCGTGCTCGACGTGCGGCTGGTCAATGGCGTAAAGCTTCTGAAGCTCAAGAACCCGTGGTCTCACCTCCGCTGGCGAGGCAACTACAGCGAGCTGGACACTGTCCACTGGTCGCCGAACCTCTGCTCCGCACTGGACTACGACCCGGACAGCGCCGCTCAGTACGACAACGGAGTGTTCTGGATAGATTACGCGAGTATACTGAAGTTCTTTGATGTTTTTTATCTCAATTGGAACCCAGAGCTGTTTAAGTTCACTTATTGCATACATCAGAAATGGGAAGCCGGTAACGGTCCTATTAAAGACGCGTATACGATATCAGAGAATCCTCAGTACTCTTTGAAGGTGAACGGCACTGGCGCCGTCTGGTTGTTACTGACGAGACACATCACTAAGATAGAAGACTTCAGGAACAACCAGGAGTACATAACACTACTCGTTTATAAGAACGGGAAGCGAGTATACTATCCACACGACCCACCTCCTTATATAGACGGAATACGTATCAACAGTCCCCACTACCTCGTGAAGATAATAGTGGGGGAGAACAGTTCGGACAAATACACACTGGTCGTGTCCCAGTATGAGAAGACTCGCACCATATACTACACGCTGAGGGCCTACGCCACGTGTCCGTTCGCATTGGCAAAGCTGGACCCATATCCCTATACCAAAACTATCAGAGGTGAATGGTCGGGCAGAACAGCCGGCGGTTGTGAAAATCACAGACAAACTTATCAGAATAACCCAAAATATATAATAACGGTCCCAGAAAGCAGGAACCCGTGCCACGTCACCATAGAACTGAAAGGTCCCAAAGAATACCAGATAGGAGTAGACGCGAGGGTTGAATCCTTGGACGATCCAAATATAACCGCGCCGTTCTTGAGGGAATCCTCAGGAGCGTACAGATCTGGTTTCGTTGTGCTGGAGTTAAATAATTTACCAGGCGGACGGTATCTGCTCACACCATCTACCTTCTATCCGGGACAGGAAGGGCCATTTTTCCTTGAACTGAGATCTACTTGCAGCATCACAGCCGAGAGGAAGAATGAATGA

Protein sequence:

>DPOGS206355-PA
MSDFVNATEAASLAVQFDQTGQTDKAVECYRTAARLLDRVCHQVSSEKQIEFRRKVKEYLERATRLQEQKDGAQQEESIRNLQKCYFLMQQALDADEANLKDLALQLYTQAVELAVSVKSSDPDVMTKVKGLAAQALDRAEEIKGIKKMGALSITPGVSPNRVHLQREQSVHLKVSGKQVYTEEEKEVLRTTSNINNNCFLPFMEVDLSEKFQYALPFEDRASELKLSSKQAKEFQCWVRPHEVSSDPKMIAGDDLDFFSIKQTIVSDCSFVASLAVSALYERRFNKKIITSIIYPKNKNKQPVYNPFGKYMVKLHLNGVRRKVIIDDRLPYSKYGRLLCSYSSNKNEFWNIDLHALTGWIPERCAIRSEADFNADGLYEIVRARLEAGHVLASVATGDLSDDDAERTGLVASHAYAVLDVRLVNGVKLLKLKNPWSHLRWRGNYSELDTVHWSPNLCSALDYDPDSAAQYDNGVFWIDYASILKFFDVFYLNWNPELFKFTYCIHQKWEAGNGPIKDAYTISENPQYSLKVNGTGAVWLLLTRHITKIEDFRNNQEYITLLVYKNGKRVYYPHDPPPYIDGIRINSPHYLVKIIVGENSSDKYTLVVSQYEKTRTIYYTLRAYATCPFALAKLDPYPYTKTIRGEWSGRTAGGCENHRQTYQNNPKYIITVPESRNPCHVTIELKGPKEYQIGVDARVESLDDPNITAPFLRESSGAYRSGFVVLELNNLPGGRYLLTPSTFYPGQEGPFFLELRSTCSITAERKNE-