Monarch geneset OGS2.0

DPOGS211547
TranscriptDPOGS211547-TA4182 bp
ProteinDPOGS211547-PA1393 aa
Genomic positionDPSCF300159 + 11227-23884
RNAseq coverage209x (Rank: top 46%)
Annotation
HeliconiusHMEL0176710.084.84% 
BombyxBGIBMGA009491-TA0.069.02% 
Drosophilasol-PB0.063.69% 
EBI UniRef50UniRef50_E2A3Y10.054.97%Calpain-D n=9 Tax=Coelomata RepID=E2A3Y1_CAMFO
NCBI RefSeqXP_001606848.10.054.53%PREDICTED: similar to small optic lobes protein [Nasonia vitripennis]
NCBI nr blastpgi|3071861080.054.97%Calpain-D [Camponotus floridanus]
NCBI nr blastxgi|3320173480.055.36%Calpain-D [Acromyrmex echinatior]
Group
Gene OntologyGO:00041987.4e-101calcium-dependent cysteine-type endopeptidase activity
GO:00065087.4e-101proteolysis
GO:00056227.4e-101intracellular
GO:00082701.1e-06zinc ion binding
KEGG pathway 
InterPro domain[815-1120] IPR0013007.4e-101Peptidase C2, calpain, catalytic domain
[749-773] IPR0018761.1e-06Zinc finger, RanBP2-type
Orthology groupMCL11680 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211547-TA
ATGGGTTCAATCGCATCGGTGCTGCAATGGCATTGTCAGACATGCGGACAAATCAATCCTACAGAGAGTGTGAAATGTTTAAAATGTGGTATAAAGAGGATATCTGGTCATGACAGTGATATACAAAAACGTTATCATACCGGTGATTCGTCCTCGGAATACGCTTCGCGGACTGAAAAAAGCGAAGGGACGACTCCAGGTTCTGAAGCCATCATTATACCAACAACAAACAATTTTAATAGCGGTTGGTCATGTGTGGGCTCAGCTCCGCTGGCCAGCAGATCGTGGCGTTGTGGCTGCGGTCTTAGGAATGTATCAGCTTCATGGCGATGTGCCGCTTGCGACAAAATCGCCTCACATGCTCCCGTTTACAGATTATCCGATGACGAGGATCAATCTTCAGGGATGACTGACAAGAATAAGTCTGATCAAGATCACAGTATGATAAGATCCAACACTCTAGCAGTTCCATCGTATAAGCCTTCGTCGAGTTATTCCGGTGGTCCTTTGAATTTGGACGTTCAGTCTCTTAGGTTGTCTCCGACTCAGATCACCGATCACGCCCACGGAGTCACCAGGTCCCTGTCGCACGGTTCCGTGATCAATTCACAGCACAAATGGTGGAATGTTGATGAAACCAGAAGTACCATCAACAGACCCACGAGCTTATTGGTCTCGGAGAGATATGGCACGTCTTCGTATAACCCAAGGGAGTCGTTTCTGAGAAGCCTTCACACAGTGACGCGTCGTCCCAAAAAGTGTAACGAGAGCAAATCCAATTGGGAGTTGAATTATGTCAAGGAATACCAAAGGGAACAAACGAGGGGATTGCATCTGAAGAAGTGGGCCTGCAGTAAATGTACTCTCGAAAATTCAGGGATCAGAACCCACTGTGAAGCCTGCTTGTCTCCAAGAATCTCACCCCTGGCACGTGTTTCTAATACAAGAGGTCTTAGTGTCACAACTCTGGGAAGACATGAGACTTTGAGTGGTAAAGATGGTGCTACTTCTTTGCCAACGTCGGGAGGCATTATGATAACAGTACCAGATTGGCCGCAGACTGAAAATAACTTGGGAGACTCCTTTCAGAGGTCAATCAGTGCTCAAAATTCACCGGAAACCAGACCGACTTATCGACGCTCGTTTTCTGAACAGACCAGCGAAACTAGACCCGTCGGTAAAGTGATATCTAGTAGGAGAAGTCTAAACGATTATCAGAAATCGCTCGCCAGCTATTGCAACACACTGACGAAAAGCGACGAAGTCTCAGAGAAAAAAATCGATGAAAGCAACGCTGAATTAATAGACAAGGAATGCAGTTGGGATACTAACGCGGAAGGTGTCATATACGCGCTACCCAACAAGGGAAAGTACAAAGATCTCAACCTGCAACTACAGGTCAACAACAATGGCACCAGATATTCGTACGTTTCGGTCCAAGACACGAAAACTGTTATGAGCAATTCACAAAACGAAGTTTTATACTCCAACGACATGGGTGACGGCGACTATGCCAGAATCGATGAGCTGTTGGGAGCCGAAAATTGTATTTCTCCAGTTGGTGATAGTAATATAAGGACTTCTCATATCGGTGCGGGCAAAGAAGCAGGCAAAGTGTATAATATGTTGCCATCCGTCGGCAAAGTGTCTCACAACCCGCGCGAATCAGCCAAAGTGCCTACCGCTCAACAAGCTTCTCGCGAGGGTGTGGGAGCGGGGGCCACTCGTATGTGGCAGTGCGGGGAGTGTTGGTTCGCGTACAACGCGTGGGGCGCGAGGTGCGATGTGTGTCGCAGTGCTCGAGCTCCGCACGCCGTCACGCTGGCGCCCGGACTGGAGCGAGATGCGCGGAGCAGACACAATTCATCATCGGAAGCCAGCGGCAAACAGGAAACGAACCGAAACGAACCGCCGAAGAAGTTAACAGTCCCTATAGCGGCGCTCGACCACGACCTCAACAGTGACGAACTACTGTTCGCTGTTGAATCTACTCCACCTCCTGAGACCTCGTGGACCTGTGTCCGTTGTACTCTGGACAATGAACCGGAAGACTCGTCGTGTGCAGCGTGCGCCGCCTCCAGACCCACCGATAACTACTGGTCGTGCAGTTCGTGCACTCTCCGCAACCCGATCTCGTGGCGCGTGTGTCGCGCGTGTAAGACCCCGCCCGTCCCGCGCCACGGCCCGCCCGCCGCCGCCGATAATAACATTCCTTTCCAACGTATCCTCAGGCCGGAGCCGGCTGAGTGGTCGTGTTCTGAGTGTACGTATGTTAACTCCGGAGCCGCGCTGGCCTGCGACATGTGTCAGTCACCGCGAGCCAGGCTGCTGCCCGCCGCGCCCGACCAGGCCTCGCTGGACGACGACGACTCGCCGGAGGGCTCGGACGGCGAGCGCCAGGACAGCACGCCCATGGAGATACTGAGGCTGAGGGAGGAGAGCCACGCCTGGGCACAGTGGCAGGAGGTCATGGCGCAGTGTGCCGCGACTGGTGAGATGTACGTAGACGAGTCCTTCCCCGCGGCCGCTCGCTCGCTGTACTACGGCGGCGGGCACATAAGGGGAGGACGGGCGGAACTGGACGGCGTGCTGGGTAACTGCTGGCTGCTGTCGGCGCTGGCGGTGCTGGCGGAGCGCTCGTCGCTGGTGCGCGGCGTGCTGGTGAGGGCGGAGCCGGCGCGGGGGGCCTACCAGCTGCGGCTGTGTAAGGACGGCCGCTGGGTCACCGTCACCCTGGACGACATGCTGCCCTGCAACAGGAAGGGACACCTCGTGTACTCGCAGGCTAAGAGGAAACAGCTTTGGGTGCCGTTGATAGAAAAAGCTGTTGCTAAACTCCACGGCTGCTACGAGGCGCTGGTCTCCGGGCGAGCTATAGAGGGTCTGTGTACGTTGACTGGAGCCCCGTGCGAGTCCGTGTCCCTGCAGGCGGGGGGGGGCGCGCCTCTGGAACAGCTGGACAGGGACCTGGTGTGGGCCCAGCTGTTGTCGTCACGACAAGCCTGCTTCCTTATGGGGGCCAGCTGTGGAGGAGGGAATATGAAGGTGGATGAAGAGGAGTACCAGCGTCTGGGTCTCCGTCCTCGTCACGCGTACTCGGTGCTGGACGTGGTGGAGGTGGCGGGGTACAGTCCGCCGCTCAGGTTGCTGCGGCTGCGGAACCCCTGGGGCCACTACACCTGGAGGGGCGCCTGGGCCGCCAACTGTCCGCGGTGGACCGACCAGCTGAGGAGAGCCCTGCCGGCTAACAACGCCGATAGAGATCAAGGGGTCTTCTGGATCAGTTTCGACGACGTCCTCAAATACTTCGACTGTATAGACATCTGTAAGGTCCGCGTGGGGTGGCACGAGGTCCGGCTGGCGGGCATCCTGCCGCCGCTGTCCTCCACCCGCCACCTCACCTGCCTGCTGCTCACCGCCGCACAGCCCACGGAGGTCGACTTCACACTCTTCCAGGAAGGTCAGAGGAACTCCGCTAAGAGCCAGCGTTCTCAGTTGGACCTGTGCGTGGTCGTGTTCCGGACCAAGTCGGGGTCCAACGCGCAGGTCGGCAAGCTGGTCGCTCACAGCAAGAGACAGGTGCGCGGCTTCGTCGGTTGCCACAAGATGCTGGAGAAGGGTTTCTACCTGGTCGTGTGTCTGGCCTTCAACCACTGGCACACCGGCCTGGAGGCGGAGCGAGCTCTGTGGCCTCGGCACGTGTTGGTCGCTCACTCGTCCAAGCCGCTGGGAGTGTCCCGCCCGAGCCTCCACCCTCACCTCCTGGCCGACGCGATCATAGGACTCACCCTGGCCAGGGGACAGCGACACGAGGGCCGGCAGGGGATGACGGCGTACTACCTCACTAAGGGCTGGGCGGGGCTGGTCGTGATGGTGGAGAACAGACACACAGACAAGTGGATCCACGTGAAGTGTGACTGTCAAGAGAGTTACAACGTAGTGTCGACACGCGGGGAACTCAAGACGATAGACTCCGTCCCTCCTCTGCACAGGCAAGTGATCATAGTCCTGACGCAGTTGGAGGGTAGTGGGGGGTTTTCTATCGCGCACCGCCTCACTCACCGCCTGGCCGCCGCCGCGCGCCTTCACGACTGGGCCCCGCGACCGGACGACGCCCCGCGACACAGACCGCCACTCGCACGGAGACTAAGCGGACTGCACGCGCCTAGACTCATCACATAG

Protein sequence:

>DPOGS211547-PA
MGSIASVLQWHCQTCGQINPTESVKCLKCGIKRISGHDSDIQKRYHTGDSSSEYASRTEKSEGTTPGSEAIIIPTTNNFNSGWSCVGSAPLASRSWRCGCGLRNVSASWRCAACDKIASHAPVYRLSDDEDQSSGMTDKNKSDQDHSMIRSNTLAVPSYKPSSSYSGGPLNLDVQSLRLSPTQITDHAHGVTRSLSHGSVINSQHKWWNVDETRSTINRPTSLLVSERYGTSSYNPRESFLRSLHTVTRRPKKCNESKSNWELNYVKEYQREQTRGLHLKKWACSKCTLENSGIRTHCEACLSPRISPLARVSNTRGLSVTTLGRHETLSGKDGATSLPTSGGIMITVPDWPQTENNLGDSFQRSISAQNSPETRPTYRRSFSEQTSETRPVGKVISSRRSLNDYQKSLASYCNTLTKSDEVSEKKIDESNAELIDKECSWDTNAEGVIYALPNKGKYKDLNLQLQVNNNGTRYSYVSVQDTKTVMSNSQNEVLYSNDMGDGDYARIDELLGAENCISPVGDSNIRTSHIGAGKEAGKVYNMLPSVGKVSHNPRESAKVPTAQQASREGVGAGATRMWQCGECWFAYNAWGARCDVCRSARAPHAVTLAPGLERDARSRHNSSSEASGKQETNRNEPPKKLTVPIAALDHDLNSDELLFAVESTPPPETSWTCVRCTLDNEPEDSSCAACAASRPTDNYWSCSSCTLRNPISWRVCRACKTPPVPRHGPPAAADNNIPFQRILRPEPAEWSCSECTYVNSGAALACDMCQSPRARLLPAAPDQASLDDDDSPEGSDGERQDSTPMEILRLREESHAWAQWQEVMAQCAATGEMYVDESFPAAARSLYYGGGHIRGGRAELDGVLGNCWLLSALAVLAERSSLVRGVLVRAEPARGAYQLRLCKDGRWVTVTLDDMLPCNRKGHLVYSQAKRKQLWVPLIEKAVAKLHGCYEALVSGRAIEGLCTLTGAPCESVSLQAGGGAPLEQLDRDLVWAQLLSSRQACFLMGASCGGGNMKVDEEEYQRLGLRPRHAYSVLDVVEVAGYSPPLRLLRLRNPWGHYTWRGAWAANCPRWTDQLRRALPANNADRDQGVFWISFDDVLKYFDCIDICKVRVGWHEVRLAGILPPLSSTRHLTCLLLTAAQPTEVDFTLFQEGQRNSAKSQRSQLDLCVVVFRTKSGSNAQVGKLVAHSKRQVRGFVGCHKMLEKGFYLVVCLAFNHWHTGLEAERALWPRHVLVAHSSKPLGVSRPSLHPHLLADAIIGLTLARGQRHEGRQGMTAYYLTKGWAGLVVMVENRHTDKWIHVKCDCQESYNVVSTRGELKTIDSVPPLHRQVIIVLTQLEGSGGFSIAHRLTHRLAAAARLHDWAPRPDDAPRHRPPLARRLSGLHAPRLIT-