Monarch geneset OGS2.0

DPOGS204463
TranscriptDPOGS204463-TA1530 bp
ProteinDPOGS204463-PA509 aa
Genomic positionDPSCF300002 + 445988-452782
RNAseq coverage527x (Rank: top 24%)
Annotation
HeliconiusHMEL0062480.087.86% 
BombyxBGIBMGA007804-TA0.087.66% 
DrosophilaMmp1-PC0.062.40% 
EBI UniRef50UniRef50_B5X0I80.062.40%FI01410p n=84 Tax=Coelomata RepID=B5X0I8_DROME
NCBI RefSeqNP_001116499.10.087.29%matrix metalloproteinase 1 isoform 1 [Bombyx mori]
NCBI nr blastpgi|1723561130.087.29%matrix metalloproteinase 1 isoform 1 [Bombyx mori]
NCBI nr blastxgi|1723561130.087.29%matrix metalloproteinase 1 isoform 1 [Bombyx mori]
Group
Gene OntologyGO:00065081.9e-183proteolysis
GO:00042221.9e-183metalloendopeptidase activity
GO:00055091.9e-183calcium ion binding
GO:00082701.9e-183zinc ion binding
GO:00310124.7e-57extracellular matrix
GO:00082374.4e-51metallopeptidase activity
GO:00081521.5e-12metabolic process
KEGG pathwaydpo:Dpse_GA184840.0 
 K07763 (MMP14)maps-> GnRH signaling pathway
InterPro domain[1-441] IPR0162931.9e-183Peptidase M10A, matrix metallopeptidase, stromelysin type
[3-217] IPR0240791.8e-82Metallopeptidase, catalytic domain
[233-439] IPR0005852.7e-78Hemopexin/matrixin
[60-213] IPR0018184.7e-57Peptidase M10, metallopeptidase
[57-214] IPR0060264.4e-51Peptidase, metallopeptidase
[33-46] IPR0211901e-40Peptidase M10A, matrix metallopeptidase
[296-339] IPR0184871.1e-17Hemopexin/matrixin, repeat
[1-43] IPR0024771.5e-12Peptidoglycan binding-like
Orthology groupMCL10469 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204463-TA
ATGGATGAAAGCTCCTGGAAGAAAGCTATAGCAGAGTTTCAAAGCTTCGCCGGATTGAACACTACAGGTGAACTTGACGAAGAGACAAAAAATTTAATGTCACTGCCAAGGTGTGGAGTTAAAGATAAAGTAGGTTTCGGCGAAAGCAGGTCCAAGAGGTACGCCCTGCAAGGTTCAAGATGGCGTGTAAAAAATCTGACGTACAAAATATCGAAGTACCCCTCTCGACTAAACCGCGATGAAGTTGATACAGAGCTGGCGAAAGCTTTTTCAGTGTGGTCCGATTATACCGACCTGACATTTACGCAAAAGAGATCCGGACAAGTTCATATCGAAATTAGGTTTGAAAAAGGCGAGCATGGTGACGGCGATCCGTTTGACGGTCCCGGTGGCACCCTCGCACACGCTTACTTCCCTGTTTACGGTGGTGATGCCCATTTCGATGACGCTGAGATGTGGTCTATAAATTCTCTGCGTGGGACCAATCTCTTCCAGGTCGCGGCTCATGAATTCGGTCACTCATTAGGGTTGTCGCACAGCGATGTGCGAACTGCACTCATGGCCCCATTCTATCGCGGCTACAACAAAGCCTTCCAGTTAGACCAAGATGATATACAAGGAATCCAGGCTCTGTATGGACACAAAACTCAACTGGATGTCGGTGGATCCTTCCCAAGTCCATCTGCACCTCGTATAACCACCCCTCAGCCTTCTGCTGAAGATCCAGCGTTGTGTGCCGATCCTAATATCGATACCATTTTCAATTCCGCTGATGGCGCTACCTTCGTATTCAAAGGTGAACACTACTGGCGTCTAACGGAGAACGGTGTAGCAGCTGGATATCCTCGTCTGATCTCCCGATCATGGCCTGGTTTAACCGGTAACATCGATGCTGCATTCACATACAAGAACGGGAAAACGTATTTCTTCAAGGGTTCTAAATACTGGAGGTATAATGGACAGAAGATGGACGGACAGTATCCGAAAGAAATTAGTGAAGGATTCACTGGTATCCCTGATAACTTAGACGCTGCACTTGTTTGGTCTGGCAATGGTAAAATATATTTCTATAAGGGCTCCAAATTCTGGAGGTTCGACCCTGCTCAGCGTCCTCCCGTGAAATCAACCTATCCCAAACCTCTCTCGAATTGGGAAGGAATTCCCGATAATATAGATGCGGCTCTGCAATACACGAACGGTTACACGTACTTCTTCAAAGGCGGATCCTATTGGCGATTTAATGACAGAACGTTCAGCGTGGACACAGATAACCCAGCATTCCCAAGGTCAACAGGTTATTGGTGGCTCGGCTGCAGTAGCGCACCGAAAGGCACGGTCGGAGGTAACGCCAAATACATCCACCACGATCACGATGACGATGATGAAGTGGGGGACATCACTTTCGATGCAGATGCCGGGGGCTCTGAGGCACGAGGGGGCGGCGCAGGCAGCGGGGCGTTTAACGTTGAATCGTCCGTATTGACGTTGCTGCTAGCGAGCTCACTGCTGTTCCTATCGCCTCTGGCGTAA

Protein sequence:

>DPOGS204463-PA
MDESSWKKAIAEFQSFAGLNTTGELDEETKNLMSLPRCGVKDKVGFGESRSKRYALQGSRWRVKNLTYKISKYPSRLNRDEVDTELAKAFSVWSDYTDLTFTQKRSGQVHIEIRFEKGEHGDGDPFDGPGGTLAHAYFPVYGGDAHFDDAEMWSINSLRGTNLFQVAAHEFGHSLGLSHSDVRTALMAPFYRGYNKAFQLDQDDIQGIQALYGHKTQLDVGGSFPSPSAPRITTPQPSAEDPALCADPNIDTIFNSADGATFVFKGEHYWRLTENGVAAGYPRLISRSWPGLTGNIDAAFTYKNGKTYFFKGSKYWRYNGQKMDGQYPKEISEGFTGIPDNLDAALVWSGNGKIYFYKGSKFWRFDPAQRPPVKSTYPKPLSNWEGIPDNIDAALQYTNGYTYFFKGGSYWRFNDRTFSVDTDNPAFPRSTGYWWLGCSSAPKGTVGGNAKYIHHDHDDDDEVGDITFDADAGGSEARGGGAGSGAFNVESSVLTLLLASSLLFLSPLA-