Monarch geneset OGS2.0

DPOGS212727
TranscriptDPOGS212727-TA921 bp
ProteinDPOGS212727-PA306 aa
Genomic positionDPSCF300012 - 317751-319978
RNAseq coverage118x (Rank: top 58%)
Annotation
HeliconiusHMEL0083231e-3457.48% 
BombyxBGIBMGA013174-TA1e-7156.36% 
DrosophilaCG6763-PA4e-7045.45% 
EBI UniRef50UniRef50_E2BJ263e-7247.84%Zinc metalloproteinase nas-4 n=11 Tax=Formicidae RepID=E2BJ26_HARSA
NCBI RefSeqXP_001946740.15e-8254.41%PREDICTED: similar to GA19845-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|1937047531e-8054.41%PREDICTED: zinc metalloproteinase nas-13-like [Acyrthosiphon pisum]
NCBI nr blastxgi|1937047532e-8052.48%PREDICTED: zinc metalloproteinase nas-13-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00065088.9e-67proteolysis
GO:00042228.9e-67metalloendopeptidase activity
GO:00082375.1e-47metallopeptidase activity
GO:00082705.1e-47zinc ion binding
KEGG pathway 
InterPro domain[92-288] IPR0240799.7e-79Metallopeptidase, catalytic domain
[101-289] IPR0015068.9e-67Peptidase M12A, astacin
[98-241] IPR0060265.1e-47Peptidase, metallopeptidase
Orthology groupMCL12415 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212727-TA
ATGATGGCTGTTTTACAGTTAATGTTATATATGTGTGTTGCCGGTTCCATCACCGCTCTGCCCGTCGACGATGATGTCGTGATGGTCGGTGATGACTCCAATGCTATTGAAGATTCGGATGTCATTGATCTGTCGGGTTTGGGTCCAGAAGCGTTCACCTTCCCCAAAAACGAAAGTGGGGACGCATTAGCGACGTGGACGGAGTCATCGCTGATGAATCCCGAGGAGATGAGCTTCTATGGCGAAGGTGACATCCTGATACCAACATACGGGAGAAACGCTGTCCGCGACCAGACCTCCCGATGGCCGAACGGTCACATCCCTTACCTTATAGACGGCAGCTTCAATCAAAACCAGCGTGACAACATCATGAAGGCCATAGCTGACTACCATCGCCTGACCTGCCTCCGTTTCATCCCTTACAACGGCCAGAGAGATTACATCGTGTTCAAGAGTGCCAACACCGGCTGCTGGTCCAGTGTAGGTCGATTAGGTGGTCGCCAAGAGGTGAACCTCCAAACACCAGGCTGTGTGTCCAAAAAGGGAACCGTCCTCCACGAAATATTACACGCGGTAGGATTTATTCACGAGCAAAGTCGACCGGAGAGAGACGACTTCGTCAAAATCAATTACAATAACATCAGAAGTGGTTCTGAAGGAAATTTTAAGAAGTCTGATTCGAAACGCGTCGCCGACCTCGGCATCCCCTACGACTACAACAGCGTAATGCATTACTCGGAATATGCGTTCTCAAAAAATTCAAAGAAAACCATCGAACCGAAGACGGGAGGGATGAAGGTTGGACAAAGGGAAGGCTTGAGTCGTGGAGATGTGAAGAAAGTCAATGCCATGTACAACTGTAAAAAAGAGGAGCCTCAGACGGGCTGGGTCGGCTCGGTCTGGCAGTCAATATTTGGATGA

Protein sequence:

>DPOGS212727-PA
MMAVLQLMLYMCVAGSITALPVDDDVVMVGDDSNAIEDSDVIDLSGLGPEAFTFPKNESGDALATWTESSLMNPEEMSFYGEGDILIPTYGRNAVRDQTSRWPNGHIPYLIDGSFNQNQRDNIMKAIADYHRLTCLRFIPYNGQRDYIVFKSANTGCWSSVGRLGGRQEVNLQTPGCVSKKGTVLHEILHAVGFIHEQSRPERDDFVKINYNNIRSGSEGNFKKSDSKRVADLGIPYDYNSVMHYSEYAFSKNSKKTIEPKTGGMKVGQREGLSRGDVKKVNAMYNCKKEEPQTGWVGSVWQSIFG-