Monarch geneset OGS2.0

DPOGS214088
TranscriptDPOGS214088-TA1899 bp
ProteinDPOGS214088-PA632 aa
Genomic positionDPSCF300014 - 2341788-2344632
RNAseq coverage141x (Rank: top 55%)
Annotation
HeliconiusHMEL0114393e-6842.95% 
BombyxBGIBMGA002518-TA4e-1027.00% 
DrosophilaCG15255-PA5e-1329.70% 
EBI UniRef50UniRef50_A7RVC77e-1428.86%Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RVC7_NEMVE
NCBI RefSeqXP_001636612.11e-1428.86%predicted protein [Nematostella vectensis]
NCBI nr blastpgi|1563939952e-1328.86%predicted protein [Nematostella vectensis]
NCBI nr blastxgi|3583395695e-2329.41%hypothetical protein CLF_100574 [Clonorchis sinensis]
Group
Gene OntologyGO:00065082.3e-22proteolysis
GO:00042222.3e-22metalloendopeptidase activity
GO:00082372.8e-06metallopeptidase activity
GO:00082702.8e-06zinc ion binding
KEGG pathway 
InterPro domain[51-248] IPR0240792.2e-27Metallopeptidase, catalytic domain
[53-247] IPR0015062.3e-22Peptidase M12A, astacin
[50-202] IPR0060262.8e-06Peptidase, metallopeptidase
Orthology groupMCL35008 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214088-TA
ATGCTGAAGAACGATAACCTACGGCTTATATCTTTAGCACTCTGTTTGTTTTATACGAACGCTATATTCGCTTCAAGTATAAAACCAGATATTCTAGAGAGTTTGGATCAAATTCGTGGTTTGGATTTATCTGAAAAGGACAAGAGTAGATATCATATATGGGAACAAGGAATGGTTCCTTATTATATAGACGACTTTTCTTTTGACAAAGTTCTACGGGATAGAATACGGTTTTACTTAGATCAGTTAAATAAGGCGACTGGTTTGCACTTCATGGAGATTCCGCAGCCACCCGAAGATGATAAACAGCGTTGGGTCTTCTTCCTTAACAGGAAAAGTCAGCTGGGATGTGATGATGTTTCTTATAACAATTATACCAATAAAGGAGTGCAGAGGGTAGTTTTGGGATACGACTGTCTTACGCCCGAAGATTTGGATGAGATAATCCTGTCTTTAGTAGGAGTTCCACCTCAGCACAATGCACCAGATCGTGATGACTATGTCACCGTAAATTGGGATAATATTTTGCCAGATAAACATTATTTGTTTGAGAAATTGAAAACCGATGAATGGGCCTTTGGTAAATTAAAATATGATTTTTCTAGTGCTGGCCATTATCGCACACATAAATATACTTCAAATGGAGGTGAAACTATTACACCAAAGAATTTAGAACACTATATGGAAGGAAGAAAAGGCCTGAGTTCCATAGACATTAAGAAAATAAGAATGTTTTATAATAATATTTCTTTTAAAGCCAAAAAACCTATGCAAGTACCAGAATGTAGCAAATTATTTGTACCTGGAAAAAATTTCAGTAAATATATTACTCCTAAAGTAAGTGTAAGTTCAGAAAAACCAGAAATAATATGTGATTCAGAAAATGCAAGCAATCATGACTGTCATAAAAATAAAGATAAGAAGGATGTTGGGGATGACGACAACGAAGATGTCAATAATGATAAAGAAAAAAATAATACAGATATTAAAGATGAAAAAAATGATAACAAGTCTGGTGAAAATAATGACAAGCAAGATAGCAAAGTAAAAATCGAGTCTACAGAGGAAAAAAAAATACTAAGTAAAGATGAAAAATTACCAAATAGTAAAGAAAAAGCTCTTTTGTTACTAAAATATCACGAGGTTGAAAAAATTAAATCTCCAAATATTAAAAAAAAACAAGTAGAAGTTACCAAATCCAAGTATAAAGCTATTAAAAACCTTCAAAATTCACAAACAACGGAAATGAAATCAGATGAAGATGAAAATACTAAGTATCCTAAAAAACTGAAAGGGGAAACTGGAGAGAACAAATCCGTAAATAGCGAAGAAAAACATTTCTTGCTCGCAAAGTACCAAAGTATTAAAAATAATAAGTATGCAAAAAACACCAAGGATTCTAAAAAAATTAAAGGGGAAACTGATGATAATAAATCGGTAAATAGCGAAGAAAAAACATTGTTGCTCGCAAAGCACCAAAGTATTAAAAATGATAAATATGCAAAAAATACCAAGTATTCTAAAAAAATTAAAGGGGAAACTGATGATAATAAATCGGTAAATAGCGAAGAAAAAAATTTCTTGCTCGCAAAGTACCAAAGTATTAAAAAGAATAAATATGCAGAAAACACTAAGTATTCTCAAAAAATCAAAGGGGAAACTGACAACAGCAAATCGTTAGACAGCGAAGAAAAACATTTTTTGTCCCAAAAAAATTACAAGTTTGGGAAAAGTAGACTTTCAAAAACTCAAAACGCACATAAAGACAAAAAATCTCAAAATAATAATAAGAAAAAACGAGTTTTATTAAAAGGTAATTCTAATAAATTGCCACGTAAAATTTTGGGGAAATCCGATTATGAAACTGGAGAGGAAAATGTTAATTTTCATTTTAAATAA

Protein sequence:

>DPOGS214088-PA
MLKNDNLRLISLALCLFYTNAIFASSIKPDILESLDQIRGLDLSEKDKSRYHIWEQGMVPYYIDDFSFDKVLRDRIRFYLDQLNKATGLHFMEIPQPPEDDKQRWVFFLNRKSQLGCDDVSYNNYTNKGVQRVVLGYDCLTPEDLDEIILSLVGVPPQHNAPDRDDYVTVNWDNILPDKHYLFEKLKTDEWAFGKLKYDFSSAGHYRTHKYTSNGGETITPKNLEHYMEGRKGLSSIDIKKIRMFYNNISFKAKKPMQVPECSKLFVPGKNFSKYITPKVSVSSEKPEIICDSENASNHDCHKNKDKKDVGDDDNEDVNNDKEKNNTDIKDEKNDNKSGENNDKQDSKVKIESTEEKKILSKDEKLPNSKEKALLLLKYHEVEKIKSPNIKKKQVEVTKSKYKAIKNLQNSQTTEMKSDEDENTKYPKKLKGETGENKSVNSEEKHFLLAKYQSIKNNKYAKNTKDSKKIKGETDDNKSVNSEEKTLLLAKHQSIKNDKYAKNTKYSKKIKGETDDNKSVNSEEKNFLLAKYQSIKKNKYAENTKYSQKIKGETDNSKSLDSEEKHFLSQKNYKFGKSRLSKTQNAHKDKKSQNNNKKKRVLLKGNSNKLPRKILGKSDYETGEENVNFHFK-