Monarch geneset OGS2.0

DPOGS211055
Transcript: DPOGS211055-TA, 3771 bp
Protein: DPOGS211055-PA, 1256 aa
Genomic position: DPSCF300446 + 2906-18384
RNAseq coverage: 174x (Rank: top 50%)
Annotation
Heliconius      HMEL007798        0.0  79.32%
Bombyx          BGIBMGA009590-TA  0.0  65.51%
Drosophila      Ide-PB            0.0  46.78%
EBI UniRef50    UniRef50_Q16P73   0.0  51.31%  Metalloprotease n=19 Tax=Endopterygota RepID=Q16P73_AEDAE
NCBI RefSeq     XP_971897.1       0.0  55.06%  PREDICTED: similar to metalloprotease [Tribolium castaneum]
NCBI nr blastp  gi|91077850       0.0  55.06%  PREDICTED: similar to metalloprotease [Tribolium castaneum]
NCBI nr blastx  gi|307165858      0.0  54.38%  Insulin-degrading enzyme [Camponotus floridanus]
Group
Gene Ontology
GO:0046872  2.3e-77  metal ion binding
GO:0003824  2.3e-77  catalytic activity
GO:0006508  2e-42    proteolysis
GO:0004222  2e-42    metalloendopeptidase activity
GO:0008270  9.2e-25  zinc ion binding
KEGG pathway
tca:660586  0.0
K01408 (IDE, ide)  maps-> Alzheimer's disease
InterPro domain
[253-497]  IPR011249  2.3e-77  Metalloenzyme, LuxS/M16 peptidase-like, metal-binding
[41-248]   IPR011237  1.3e-64  Peptidase M16, core
[41-176]   IPR011765  2e-42    Peptidase M16, N-terminal
[204-379]  IPR007863  9.2e-25  Peptidase M16, C-terminal
Orthology group: MCL10900  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS211055-TA
ATGGAGGAATGGGAAAAGTTTATAGATCACTACGAAATAAATAGTGATTGTAACAAGTTTCGTCTGGCATTAAAAAGTGCGAAAGTTGCTTTGGACGAAGACTTGGCCCCATTCGAATCAGTGTTGCTCGTGAGCGACCCGACCACGGACAAGTCGGCCGCGGCGTTAGATGTCAATGTTGGTTATTTGAGTGACCCGGACGAGGTGCCCGGACTCGCTCATTTCTGTGAACACATGCTGTTCCTGGGCACACAGAAATATCCAGAGGAAAATGAGTATAACAAGTTCCTCTCTGAACACGGGGGCTCCTCAAACGCCTCTACTTCATCAGACCACACCACTTACTACTTCGACGTGTTGCCGCAACACCTCGGCAGGGCTCTGGACATTTTCGCTCAGTTCTTCATCTCGCCGCTCTTCACGGAGGGCGCCACCGGTCGCGAACTCTCCGCCGTGAACTCGGAACACGAGAAGAACACCTCCTCGGACACGTGGCGCCTCGACCAGCTCAATAAGAGCACAGCGGATGACAATCACCCCTACCACAAGTTTGGCACCGGTAACCGCGACACGTTAGAGAGGATACCGAGGGAGAGGGGCATCGACGTGAGGCAGGAACTGCTGAAGTTCCACCAGAAATGGTACTCCGCTAATATCATGACCCTGATCGTTGTGGGGAAAGAAAGCCTGGATGATCTAGAAGGTATTGTTGTGAAACTTTTCTCCGAAGTGGAGGACCGGGGCGTGACCGCGCCCACCTGGCCGGAGCACCCCTTCCCCCCGCACCTGAGGAAGAAGAGAGCGTACTGTTGCCCCGTTAAGGATCTAAGGTCGCTGTCCATAGACTTCCCCATACCAGATACGAGGAAACACTACAAGAGTGGACCGGGCCATTATTTATCACATCTTCTGGGACACGAGGGTCCCGGCAGCTTGTTGGCAGCTCTCAAACAGCGAGGCTGGTGCAACAGCCTGGTGGGTGGAACACGTATTGGAGCTCGTGGGTTCGGTTTCTTCGGAGTGCAGGTGGACCTGACTGAGGAGGGGGTGAAGCATATCGATGAGATCGTTGAGCTGGTCTTCCAGTACATCAGTATGTTACGCGAGTCCGGAACCCAGCGCTGGGTGTGGGAAGAGCAGCGAGATCTGATGGCTTTGGAGTTCCGCTTCAAAGACGCGCAGGACCCTCGAACGATGGCGGCGGGTCACGTGCACCTGCTGCAGGAGTTCCCTATGGAGGACGTGCTGTCCGCGTACTACCTCATGACGGACTGGCGCCCGGACCTCGTGGACGAAATGCTGAAGATGCTGACGCCGGAGAACGTGCGTGTGGGCGTGGTCGCCAAATGTTTCGAGAAGAAATGCACTCAGATTGAGCCGTGGTACGGAACCAAATATCTCCAGGAAGATATTGAAGAGAGTCTGCTGAAGGACACGCCGCTGATGAGACTCTGGTACAAGCGGGACGGTGAGTTCCAACTGCCAAAGTCATTCGTCACTTTGGATCTCGTGAGCCCGCTCGCGTACTCGGACCCTGTCTGCTGCAACCTAACGTCAATATGGGTTCTGTTACTGCGGGACAGTTTGCAGCAGTTCGCTTACTCTGCAGAACTCGCAGGTCTGAGGTGGAGCGTGGGCAACGCTAAATATGGACTCAGTATAGCGATAGACGGCTATGATGAGAAACAGCACGTGTTGCTGGAGAAGATCATGGAACACCTGGTGAACTTCCACGTGGACCCCGCGAGGTTCAAGGTCATGAAAGAGAGTCACATAAGAGCCATCAGGAACTTCGAAGCTGAACAACCATATCAGCACGCCGTCTACCAGCAGGCCATGTGTCTCTCCGACTTAGTGTGGACGAGGTGTCAGTTACTGGAAGCAGCACACAGCTTAACACCGGAACAGTTAACCGAGTTCACCATGCTGCTGATGAGGCGTGTCCACGTCGAGGGCCTCATGTTCGGGAACCTGACGAGGGAGCGAGCCCTGGAGGTCGC
GGACAGTATAGAAGACAAATTACCGAAAGACGCGACGCCGCTGCTGGCCCAGCAACTGTTATTGTACAGAGAAATCGAAATCGAAAAAGGCTCGTGGTTTCTCCGCGAGATCGAGAACAGTGTTCACAAGTCGTCGTGCGCGTCCGTGTACTACGCGTGCGGCGTGCGGCGGGTCAGGCAGAACGTTGTTCTGGAGCTGCTGGCCCAGGCTCTGAGCGAACCCTGCTTCCATGTGCTGAGGACACAGGAACAGTTGGGTTACATAGTGTTTAGTGGTATCCGCCGCTCTAACGGGGTCCAAGGTCTGCGCGTGATCGTTCAGAGCGACCGACATCCGGCGTACTTGGAAGACAGGATTGAAAATTTCATACGGAGGTCGCAGGACACGCCGCTGATGAGACTCTGGTACAAGCGGGACGGTGAGTTCCAGCTGCCAAAGTCATTCGTCACTTTGGATCTCGTGAGCCCGCTCGCGTACTCGGACCCTGTCTGCTGCAACCTAACGTCAATATGGGTTCTGTTACTGCGGGACAGTTTGCAGCAGTTCGCTTACTCTGCAGAACTCGCAGGTCTGAGGTGGAGCGTGGGCAACGCTAAATATGGACTCAGTATAGCGATAGACGGCTATGATGAGAAGCAGCACGTGTTGCTGGAGAAGATCATGGAACACCTGGTGAACTTCCACGTGGACCCCGCGAGGTTCAAGGTCATGAAAGAGAGTCATATAAGAGCCATCAGGAACTTCGAGGCTGAACAACCATATCAGCACGCCGTCTACCAGCAGGCCATGTGTCTCTCCGACTTAGTGTGGACGAGGTGTCAGTTACTGGAAGCAGCACACAGCTTAACACCGGAACAGTTAACCGAGTTCACCATGCTGCTGATGAGGCGTGTCCACGTCGAGGGCCTCATGTTCGGGAACCTGACGAGGGAGCGAGCCCTGGAGGTCGCGGACAGTATAGAAGACAAATTACCGAAAGACGCGACGCCGCTGCTGGCCCAGCAACTGTTATTGTACAGAGAAATCGAAATCGAAAAAGGCTCGTGGTTTCTCCGCGAGATCGAGAACAGTGTTCACAAGTCGTCGTGCGCGTCCGTGTACTACGCGTGCGGCGTGCGGCGGGTCAGGCAGAACGTTGTTCTGGAGCTGCTGGCCCAGGCTCTGAGCGAGCCCTGCTTCCATGTACTGAGGACACAGGAACAGTTGGGTTACATAGTGTTTAGTGGTATCCGCCGCTCTAACGGGGTCCAAGGTCTGCGCGTGATCGTTCAGAGCGACCGACATCCGGCGTACTTGGAAGACAGGATTGAAAATTTCATACGGAGGTCGCAGGAATATCTGGAGAATATGACGGACGAGGAGTTCCTCAAACATCGGTCGTCCTTAGCGGCTCAGAAGCTCGAGAAGCCAAAGAGATTGGCGACCAGAGCCTCGCAGATGTGGAGCGAGATAACAGCTCAGGTGTACAACTTTGACAGGATGCACGTAGAGGTTGAGGAGTTGAACACTGTTACCAAGGACGAACTACTGGAATTCTATATGAAGCACATAAGCCCCAAGTCTCTGGAGCGTCAGAAGCTGTCAGTGTATGTAGTCTCTACAGCTGAAGGTGGCGCTGGGAACAAAGAGGCGAGCGAAGAAAATGATAACGAGACTCTACAACCAACGAAGATCACGGACCTAGTGGACTTCAAGTCTAGGAGGAGATTGTATCCAAATCCGCTGCCTTTCATTAATATACCGCGCAAGGGGGCCCACTGCAAACTGTAG

Protein sequence:

>DPOGS211055-PA
MEEWEKFIDHYEINSDCNKFRLALKSAKVALDEDLAPFESVLLVSDPTTDKSAAALDVNVGYLSDPDEVPGLAHFCEHMLFLGTQKYPEENEYNKFLSEHGGSSNASTSSDHTTYYFDVLPQHLGRALDIFAQFFISPLFTEGATGRELSAVNSEHEKNTSSDTWRLDQLNKSTADDNHPYHKFGTGNRDTLERIPRERGIDVRQELLKFHQKWYSANIMTLIVVGKESLDDLEGIVVKLFSEVEDRGVTAPTWPEHPFPPHLRKKRAYCCPVKDLRSLSIDFPIPDTRKHYKSGPGHYLSHLLGHEGPGSLLAALKQRGWCNSLVGGTRIGARGFGFFGVQVDLTEEGVKHIDEIVELVFQYISMLRESGTQRWVWEEQRDLMALEFRFKDAQDPRTMAAGHVHLLQEFPMEDVLSAYYLMTDWRPDLVDEMLKMLTPENVRVGVVAKCFEKKCTQIEPWYGTKYLQEDIEESLLKDTPLMRLWYKRDGEFQLPKSFVTLDLVSPLAYSDPVCCNLTSIWVLLLRDSLQQFAYSAELAGLRWSVGNAKYGLSIAIDGYDEKQHVLLEKIMEHLVNFHVDPARFKVMKESHIRAIRNFEAEQPYQHAVYQQAMCLSDLVWTRCQLLEAAHSLTPEQLTEFTMLLMRRVHVEGLMFGNLTRERALEVADSIEDKLPKDATPLLAQQLLLYREIEIEKGSWFLREIENSVHKSSCASVYYACGVRRVRQNVVLELLAQALSEPCFHVLRTQEQLGYIVFSGIRRSNGVQGLRVIVQSDRHPAYLEDRIENFIRRSQDTPLMRLWYKRDGEFQLPKSFVTLDLVSPLAYSDPVCCNLTSIWVLLLRDSLQQFAYSAELAGLRWSVGNAKYGLSIAIDGYDEKQHVLLEKIMEHLVNFHVDPARFKVMKESHIRAIRNFEAEQPYQHAVYQQAMCLSDLVWTRCQLLEAAHSLTPEQLTEFTMLLMRRVHVEGLMFGNLTRERALEVADSIEDKLPKDATPLLAQQLLLYREIEIEKGSWFLREIENSVHKSSCASVYYACGVRRVRQNVVLELLAQALSEPCFHVLRTQEQLGYIVFSGIRRSNGVQGLRVIVQSDRHPAYLEDRIENFIRRSQEYLENMTDEEFLKHRSSLAAQKLEKPKRLATRASQMWSEITAQVYNFDRMHVEVEELNTVTKDELLEFYMKHISPKSLERQKLSVYVVSTAEGGAGNKEASEENDNETLQPTKITDLVDFKSRRRLYPNPLPFINIPRKGAHCKL-