Monarch geneset OGS2.0

DPOGS213187
Transcript: DPOGS213187-TA; 2067 bp
Protein: DPOGS213187-PA; 688 aa
Genomic position: DPSCF300114 - 11372-23452
RNAseq coverage: 281x (Rank: top 39%)
Annotation
Heliconius: HMEL021891; 3e-55; 42.16%
Bombyx: BGIBMGA007376-TA; 9e-68; 51.58%
Drosophila: CalpA-PB; 1e-59; 32.64%
EBI UniRef50: UniRef50_UPI0002061A41; 9e-142; 43.48%; UPI0002061A41 related cluster n=1 Tax=unknown RepID=UPI0002061A41
NCBI RefSeq: XP_973375.1; 1e-150; 45.54%; PREDICTED: similar to calpain [Tribolium castaneum]
NCBI nr blastp: gi|91087981; 3e-149; 45.54%; PREDICTED: similar to calpain [Tribolium castaneum]
NCBI nr blastx: gi|91087981; 5e-145; 44.69%; PREDICTED: similar to calpain [Tribolium castaneum]
Group
Gene Ontology: GO:0004198; 1.5e-50; calcium-dependent cysteine-type endopeptidase activity
  GO:0006508; 1.5e-50; proteolysis
  GO:0005622; 1.5e-50; intracellular
KEGG pathway:
InterPro domain: [51-324]; IPR001300; 1.5e-50; Peptidase C2, calpain, catalytic domain
  [341-488]; IPR022682; 1e-30; Peptidase C2, calpain, large subunit, domain III
  [46-68]; IPR022684; 4e-21; Peptidase C2, calpain family
  [340-485]; IPR022683; 2.1e-14; Peptidase C2, calpain, domain III
Orthology group: MCL18857; Insect specific

Nucleotide sequence:

>DPOGS213187-TA
ATGAGGAAGTTCATGCGAACGCGTCCCGGACACCCTGGTCACCCCGGACACACCACACTACCCGGGCATGTCGGACATCCCGCGCTCATCCCGCACGCAGCGATCGGGCACAAGACCAACGGTGTTACCAGTAGGACCGCCAACCGTCAGGAGATCTGTCCTCGGCCGCGGTTCCTGGGAGACGCAGCGTTGGAAGCCGGGCTGTCGAACGACCAGCTCGAGGACGAACCCGCAGAGCTTCAGGCTGCTCGCTGGGACGTGCGGCCCGGGGCGGCCGGGGACGCATTGTCCCTGGCGACCGCCGCACTCTCGCACACGCCGCGTCTGCTTGCACGCGTCGCGCCACCACACTCCTTCCGCACCCGATACACTGGGAAATTCAGGTTCCGGTTTTGGGTATTCGGATCGTGGCGTGAGGTGGTGGTCGATGACCTGTTGCCGACACGCGGTGGCGTGTTGCTCACTGCGCGTGGTGGTCTGGTAGACGACTTCACATTGCCGCTTCTCGAGAAGGCTTACGCTAAACTACAAGGGTCGCTGGCATCACTGCGCGGGTGCGGCGCGGCTCAAGTACTACAAGACCTCACCGGTGCAGTAGTGCAGAGCTTCTCACCGCCGCGACAGCCCCGCTCGCTTTTGTTACAGGTGCTGCACTCGGCGGTGCCACGATCTACTCTGTTAGTGGCGTCTACGGAGCGCGGGACTTCTGGCCTTGTTCCTGGCCGCGCCTATCTGGTGACCGGTTTAGCCCGAGTACGTGAGACGGGAGGTGAGGGGGCTCTAGTGAGGCTGCTGGCTGCGGGTGGGCCTGCCGCGTGGTCGGGTGCATGGTCGCGAGGCTCTCCCGAGTGGCGAGCGTTGCCCCCTGCTGATCTGGACCTGCTGGCCGGCCGCCTCACACACCCTGGACACTTTTGGATGTCGTTCCAGGAGTTCGCGCGGTTATTTTCGCGATTAGAGCTGGTGCATATAGGGCCGGATGACTGGCTCCTGGAGCCAGCGCTGCACGCTAGACGGCCGTGGCGCGCGGTGTTGGCTCGTCGTCGCTGGAGACGCGGTTACAACGCGGGTGGGCCGCCGGCTTGCACTCGTACTGCTCATGCCAATCCCCAGTTTCACGTACATGTGCCACGATCTGAATCCGGCAAGTGCCACGTAGTGGTGTCTGTCACCCAGCAGTACTCTCCCGCTGGTTCGCCTGACCGCCTTCACGGCATCGGATTCGCGGTATATGAACTGCCCCCGGGTGCTCCTCCCCCTCGCGCCCCCCGAGCCCCCGCTGCTTTGGCTGACCTCCGAGCACTGGACGTGACACATTGGTCTCGAGCACGCGAGGTGGCCACGTTCTTCACGTTGCCGGCAGGACAGTACTTGGTGGTGCCTCACACGCACCGACCACACCTCGAGACTACCTTCCTGCTGCGCATCCTGACTGATGAGCACACGGACGTATGGGAGGTCAACGATGACAATGTGATCGTCCGTGACGTCGCGACCGAGTTCTTAGACGAAGGATGCCCTTTGGAGCCCGAGGTTCAGGCCGCGATCGCGAAAACGATCGGAAAAAGAGGCGTCGAAGAGGTGAGGCTCGACGTGAGAGGGATGAGGGGGGGACTGCTGAGAGACCGGGGTATGACGAGGACGGTGTGGTGTTCGCAGATGGACGCGCGCGCGCTCAGGAACTTGCTGCGGCGCGTATGGCGGCGCGTGCTGCCGGCGCGGCCGTCGCGGGCGCTGTGCGGTGCGCTGGTGGCGCTGGGCGACCCTGCCGCGGCGGGGAGGCTGGAACGGGGTGCGGTGCTGGGCGCCGCCCCAGGCTGCCGGGCCGCCGTCAGCGCCTACTGCTTGCGCGCGCTGCTGTGGGCGTGCGGCGTGCGAGCCTCCAACAAGGTGCTGGAGTGCCTCGTGCTGAGGTTCGCACGCGGGACTCGCCTCTCGCCCGACGCCTGCGTGTTGGCGCTGGCCCGTCTGCATCTCGCCCACGAGAGATTCCGAAGCCT
CGACAACAAACTCAAATCTAATCCCATTTCGCTGGAGGAGATGCTCCTCATGACCATCTACTCCTGA

Protein sequence:

>DPOGS213187-PA
MRKFMRTRPGHPGHPGHTTLPGHVGHPALIPHAAIGHKTNGVTSRTANRQEICPRPRFLGDAALEAGLSNDQLEDEPAELQAARWDVRPGAAGDALSLATAALSHTPRLLARVAPPHSFRTRYTGKFRFRFWVFGSWREVVVDDLLPTRGGVLLTARGGLVDDFTLPLLEKAYAKLQGSLASLRGCGAAQVLQDLTGAVVQSFSPPRQPRSLLLQVLHSAVPRSTLLVASTERGTSGLVPGRAYLVTGLARVRETGGEGALVRLLAAGGPAAWSGAWSRGSPEWRALPPADLDLLAGRLTHPGHFWMSFQEFARLFSRLELVHIGPDDWLLEPALHARRPWRAVLARRRWRRGYNAGGPPACTRTAHANPQFHVHVPRSESGKCHVVVSVTQQYSPAGSPDRLHGIGFAVYELPPGAPPPRAPRAPAALADLRALDVTHWSRAREVATFFTLPAGQYLVVPHTHRPHLETTFLLRILTDEHTDVWEVNDDNVIVRDVATEFLDEGCPLEPEVQAAIAKTIGKRGVEEVRLDVRGMRGGLLRDRGMTRTVWCSQMDARALRNLLRRVWRRVLPARPSRALCGALVALGDPAAAGRLERGAVLGAAPGCRAAVSAYCLRALLWACGVRASNKVLECLVLRFARGTRLSPDACVLALARLHLAHERFRSLDNKLKSNPISLEEMLLMTIYS-
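
The transcript and protein lengths in this record agree with a complete coding sequence: 2067 bp is 689 codons, which is 688 amino acids plus one terminal stop codon (the nucleotide sequence ends in TGA, and the protein sequence marks the stop with a trailing "-"). The short Python sketch below shows that arithmetic check. It assumes the two lengths are taken as printed in the Transcript and Protein rows; the function name `codon_count` is hypothetical and not part of any MonarchBase tool.

```python
def codon_count(transcript_bp: int) -> int:
    """Return the number of complete codons in a coding sequence of this length."""
    if transcript_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return transcript_bp // 3


TRANSCRIPT_BP = 2067  # from the Transcript row of this record
PROTEIN_AA = 688      # from the Protein row of this record

codons = codon_count(TRANSCRIPT_BP)

# One codon is the terminal stop codon, which is not counted as an amino acid.
consistent = (codons - 1 == PROTEIN_AA)

print(codons, consistent)
```

The same check can be applied to other records to flag transcripts whose length is not a multiple of three or whose protein length does not match.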