Monarch geneset OGS2.0

DPOGS214555
TranscriptDPOGS214555-TA3039 bp
ProteinDPOGS214555-PA1012 aa
Genomic positionDPSCF300266 - 64297-67335
RNAseq coverage431x (Rank: top 28%)
Annotation
HeliconiusHMEL0031620.077.50% 
BombyxBGIBMGA003279-TA0.072.78% 
DrosophilaCG3107-PA0.045.60% 
EBI UniRef50UniRef50_B0WCZ90.050.90%Presequence protease, mitochondrial n=1 Tax=Culex quinquefasciatus RepID=B0WCZ9_CULQU
NCBI RefSeqXP_001662373.10.052.19%metalloprotease [Aedes aegypti]
NCBI nr blastpgi|1571319440.052.19%metalloprotease [Aedes aegypti]
NCBI nr blastxgi|1571319440.052.19%metalloprotease [Aedes aegypti]
Group
Gene OntologyGO:00082371.8e-66metallopeptidase activity
GO:00065081.8e-66proteolysis
GO:00082701.8e-66zinc ion binding
GO:00468724.9e-58metal ion binding
GO:00038244.9e-58catalytic activity
GO:00042221.4e-11metalloendopeptidase activity
KEGG pathway 
InterPro domain[501-747] IPR0135781.8e-66Peptidase M16C associated
[297-559] IPR0112494.9e-58Metalloenzyme, LuxS/M16 peptidase-like, metal-binding
[74-280] IPR0112371.3e-21Peptidase M16, core
[242-425] IPR0078631.4e-11Peptidase M16, C-terminal
Orthology groupMCL11946 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214555-TA
ATGTATTCAAGACTTCACAAGCTGTCGGGCTTACAAAAAAGCCTCATTATAGCCGGCCAGAGAAATTACGCTGGAGGGATTCTGAAAACGAAGAAGAATCTGTCGAGCTTGCAACCAGGAAAGGTCTACCATGGATTTATGTGCTGTGAAGTGGAACCAATAAATGAGTACAACATGACCGCGTACCTGTTAAGACACGAGAAAACTCAGACTGAATACCTGCATTTAGAGAGAGACGACACAAATAATGTTTTTTCTGTCGGTTTCCGTACAACTCCACTAGATTCTATGGGAACTCCACACATTTTGGAGCACACGGTGTTATGTGGCTCAGAAAAGTACCCAGTCAGAGATCCCTTCTTTAAAATGCTCAACAGATCTCTAGCTACATTCATGAATGCATTGACGGGTCCCGATTATACATTTTACCCGTTTTCATCCCAAAATGAAGTTGACTATAGGAATCTGCAGAAAGTATATCTAGATGCTGTGTTTAAACCTAATCTATCTAGATTAGACTTTCTACAAGAAGGTTGGAGGTTGGAACATTCTAATCTAGATGATAAATCTTCAAATTTAGTGTTCAAAGGTGTAGTGTATAATGAAATGAAAGGTGCATTCTCCGAGACAAGCTCACTGTTCGGCCAAAAGTTCATTAACACAATACTGCCGCAAGGCACATATGGCTTTGTGTCTGGCGGTGACCCTTTACACATTCCGGAACTAACCCATGAACATTTGAAAAAATTCCACGCCACCTACTATCACCCGAGTAATTCCAGAATATACTCCTATGGTAGTTTTCCTTTGGAACATAATCTAAAATTTCTTAATGAGACATATCTTAGTAAATATGAGTACCTTGATCCTAGTGCTACCGTTGTTGCACCACAGGAAAGGTGGAAAATGCTCAAAAGATCAAATATCCCCTGTAGATTTGACCAGTATGGTGGTCCCATAGAAAAACAGAACCAAATAGCCATGGGTATGGTTATGTCTGACATAACAGACATATATGAGACATTCATGCTGACGGCACTAGCTGAATTAATGATTATAGGTCCAAATTCAGCCTTCTATAAAAGCCTTATCGAGAAAAACATTTCTGGTGGTTACAACTCCTTGACGGGCTATGACAATCAGATACGTGATACACTATTTGTGGTCGGCTTACGTGATGTGGAAGAGTCAAAGTTTGACCTGGTTGAGAAGATTGCAAATCAGACCATACAAGATATATATGAGAAAGGCTTTGAAAAAGACCATATCGAAAGTGTACTCCACGGTTTTGAGTTGTCTATAAAACACCAGTCGCCCAAATTTGGCCTCAATATGCTTTTCAATCTAATGCCTCTATGGAATCACAACGGGCCGATATTAAGCGCTCTTAAAGTAAATAATCTACTGGAGCAAATGAAAAAGAATCTCAAGAATCCAAATTATGTGAAGAATGTCATTGAGAAGTACTTCATCAGGAACAACCATAAGCTGATAATGACTATGACACCGGATCCTAAATTTGATGACGTTTTCAACAACGCCGAGGCAGATCTATTAAGGGCCAAAGTCAGTAAATTGACATCAGAACAGAAAGAAAGCATTTACAAAGACGGGTTAGAACTTTCCAAAGCACAGAAGGAAATACAAAACCTCGATGTCCTGCCGTGTTTGAAAATTGATGAGATAACATTGAACAAAACAGCACCTCCCTTGAAACATACTATTTCTGGGACGGTACCCTTGCAATTATGTAGGGCTAACACCAATGGTGTGACGTATTTCAAGGGTGTCCTAGGCACTGAGTGCTTGAATGAGCAGCAGAGACAGTTTTTACCATTTTTCAACTACATTTTGGACAAATTCGACACCAAGTCATATAATTACAGGGATTTTGATAAATTTGTCAGCAAGTCAACTTCAGGATTATCATTCCTGACTCATATAACTGAGCACATAGACCAGAGAGAGCAGTACGAACAGGGTGTCATATTGAGCAGTCACTGCTTGGACCACAATCTGCCAAAAATGCTCGACATATGGAAAGAAATATTCAGCAAACCAAACTTTTCCAACAGCGAAAGAATGACTATGCTCCTAAATAACTACGCTTCATCATTGACAAGCGGTATCATAGACAGCGGTCACACGTACGCCATGCAGAGCGCCCGGTCGCTGGTGTCTCCTGTGGACGAGTGTAAGGAATGTCTGTTAGGAATCAAGCACGTCATGAATATGCAAGAAGCCCAGAAACAGTACAAGATAGAAAACGTCCAGGAGATAGTTGATCAAATCGGCAAAACTATATTACACGGCACAAATCTACGGGCAGCGTTTCATTACTCAGACGATAATGTACAAAGCACTATAGAACAGTTCTGCATGGATTTGTGTAAAGACGATCAGTCGGATGTCAATAGGATAAATTGGACGGATTGCAAGGCACCGAAGAAACAGAATCGAGGGGTTCACATAGCTATGAACATTCCTGTAAATTTCTGCTCCAAGGTCATACCGACAGTACCATACACTGACCCCGATTACCCTAAACTGAGGGTATTATCCAGATTTATAACGTCGAAATACCTACATCCCATAGTCCGCGAACAAAACGGCGCTTACGGCGGCGGTGCAATGTTAACTATCGATGGGATATTCAATTTCTACTCATACAGAGACCCAAATTCAAGGGTCACTTTGGATGTGTTCGACGATACAACCAATTGGATGTCCAAAAATAAGGACTTGGTTGATGACCAGAATCTGTTTGAAGCTAAGCTGTCCATACTTCAACAAATGGATCAACCGATAGCTGAATATATGAGAGGAATTGAGCTGTTCCTGTATGGGCTGTCATATGACATTTGGCAGACACAAAGGGAACGAGTGCTAGCTGTCACCAAGGAAGATCTCGTAGAAGTCTGCCAGAAGTATCTGAAAGGAGGCGAGTGGTCCGGAAAATGTGTGATTGGTAACGGTGCAAACCAGCAAATTAAAAAGGACAGTGAAAACTGGGACACAATTAACGGACCACAAGATTGA

Protein sequence:

>DPOGS214555-PA
MYSRLHKLSGLQKSLIIAGQRNYAGGILKTKKNLSSLQPGKVYHGFMCCEVEPINEYNMTAYLLRHEKTQTEYLHLERDDTNNVFSVGFRTTPLDSMGTPHILEHTVLCGSEKYPVRDPFFKMLNRSLATFMNALTGPDYTFYPFSSQNEVDYRNLQKVYLDAVFKPNLSRLDFLQEGWRLEHSNLDDKSSNLVFKGVVYNEMKGAFSETSSLFGQKFINTILPQGTYGFVSGGDPLHIPELTHEHLKKFHATYYHPSNSRIYSYGSFPLEHNLKFLNETYLSKYEYLDPSATVVAPQERWKMLKRSNIPCRFDQYGGPIEKQNQIAMGMVMSDITDIYETFMLTALAELMIIGPNSAFYKSLIEKNISGGYNSLTGYDNQIRDTLFVVGLRDVEESKFDLVEKIANQTIQDIYEKGFEKDHIESVLHGFELSIKHQSPKFGLNMLFNLMPLWNHNGPILSALKVNNLLEQMKKNLKNPNYVKNVIEKYFIRNNHKLIMTMTPDPKFDDVFNNAEADLLRAKVSKLTSEQKESIYKDGLELSKAQKEIQNLDVLPCLKIDEITLNKTAPPLKHTISGTVPLQLCRANTNGVTYFKGVLGTECLNEQQRQFLPFFNYILDKFDTKSYNYRDFDKFVSKSTSGLSFLTHITEHIDQREQYEQGVILSSHCLDHNLPKMLDIWKEIFSKPNFSNSERMTMLLNNYASSLTSGIIDSGHTYAMQSARSLVSPVDECKECLLGIKHVMNMQEAQKQYKIENVQEIVDQIGKTILHGTNLRAAFHYSDDNVQSTIEQFCMDLCKDDQSDVNRINWTDCKAPKKQNRGVHIAMNIPVNFCSKVIPTVPYTDPDYPKLRVLSRFITSKYLHPIVREQNGAYGGGAMLTIDGIFNFYSYRDPNSRVTLDVFDDTTNWMSKNKDLVDDQNLFEAKLSILQQMDQPIAEYMRGIELFLYGLSYDIWQTQRERVLAVTKEDLVEVCQKYLKGGEWSGKCVIGNGANQQIKKDSENWDTINGPQD-