Monarch geneset OGS2.0

DPOGS207006
TranscriptDPOGS207006-TA2151 bp
ProteinDPOGS207006-PA716 aa
Genomic positionDPSCF300001 + 1093342-1098878
RNAseq coverage252x (Rank: top 41%)
Annotation
HeliconiusHMEL0143774e-3676.83% 
BombyxBGIBMGA012930-TA5e-4144.13% 
DrosophilaCG6696-PA1e-2633.47% 
EBI UniRef50UniRef50_A8QBY83e-2939.51%NAS-15 protein, putative n=2 Tax=Onchocercidae RepID=A8QBY8_BRUMA
NCBI RefSeqXP_001901260.15e-3039.51%NAS-15 protein [Brugia malayi]
NCBI nr blastpgi|1705930151e-2839.51%NAS-15 protein [Brugia malayi]
NCBI nr blastxgi|1705930151e-2734.08%NAS-15 protein [Brugia malayi]
Group
Gene OntologyGO:00065081.3e-40proteolysis
GO:00042221.3e-40metalloendopeptidase activity
GO:00082372.7e-24metallopeptidase activity
GO:00082702.7e-24zinc ion binding
KEGG pathway 
InterPro domain[170-367] IPR0240791.2e-46Metallopeptidase, catalytic domain
[170-365] IPR0015061.3e-40Peptidase M12A, astacin
[168-321] IPR0060262.7e-24Peptidase, metallopeptidase
Orthology groupMCL26019 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207006-TA
ATGGAGTCTCAGAGAAAAATGATGTGTGCTTGGATAGCGTTAGTATTTTGTACGAATTGCTTTTATTTGGCAGAATCTTTAATACCGATGCTAACTGTCCGATATGTGGACCAAAAACTTATTGATAGAAAAAGGCAGGACACTTTAGAAATTTGTCGATTATTTAACCAAGAAATAAAGTATCTTGAAGCGGTCAACGCCACGAACTATACATTAAATAATTCAAGTAATAATAGAGCTAAAAGAGAGGTAAGGGCAGATGTCGATAACCCTTTATTAAAAGATGAGGATATAGAAAGAGTTGAAAGTATAGTCGACAAAATATACGACGACATGCAACAAAATGGCAGCCCCGTGACCTACAGGCGGAGATTCCTTAATGAAGAGGATAAGAAAAAAATACAAAGTAGAAGATTTAATTTTGCACCCGCCGAACTACTTGGTCCCAAGGAAGATAAGGATGACCATAAAATTTTGAATGATGATGAGGTTAAAATACCTTGGATTAAGAAATGGAAGCATGGCATTGTACCGTTTTTTATTGATCCTAATACTTACGATTCTATATTAGCCGAAACAATTCTGAAGGCTTTTGAATATATCGAAAAAGTGACTTGCATCCGTCTCCAACGACTCCGTGAGAGACCAACTGATGTACAGTCTTTGCAGAATGTCGAATGGTTGTACATCAGCAACCCTTTAGGGTTAAAACAATGCGTTCACAGCAATGAACGCAAGCCGAACTCAGGAGTTCAGATGGTTGTTTTTGGTTACGACTGCTTGTCGCAGGGTGAAATATCCCACGAAATAATGCACGTTTTAGGTTTTTCTCACGAACACACGCGGTCTGATAGAGACAAACATATCGACATCTTATGGGATAACATTAAACCTGGGTACAAGAAATATTTCGAAATAAGAAAAGATGATCCACTACTTGTTTTGCCGTATGACTACAAAAGCGTCTTACATTATCCATCGAGGGCATTTTCAAAAAACGGTCAACAGACCGTTAAAGCGCAAGCGGCAGTTAAAATTGGCCAAAGAGAAGCTCTCAGCGCATTGGATGTTGAAAAAATTGGAATGATATATGGTGCGGAATGTGTAGATCGTAATAAAGATTATTTGACGAAAACATGCCCAAGTGCAATGAGCACAAATCTCAACTCTGTTGTTGCAACTCAAGACGAAATTGATAAATACTTCGAAAATAGAATCTGGCCGTATGCACTTGTTAATTACAAGATAAGAAATAATTTGGAATTTACCGCTGAGGAAAAAGATAACATAAAAGCTGTTCTAAAACATATTGAAAAGGAAACTTGCATCGAGTTCAGAGACATTACTCAATCCGATGAAAGTTTAGACAAGGACGACGACAATAATGTAGATCATGTGCCTATTAATGAAACAGAAGATAGGAACAAAAAAAATGAGTCCGTAACAGTGCAGAATAACGAAAAATCAACAGTTCTTACTGAAGATGAAGAGACTACGACCAATTCTGAACATAAAATGACACATGTAACTGAAGACACTCCAAGCCATTTGCCCGGTCTGAGACATAAAAGCGATGGGATTGTAAAACAAGCCCATAAAAAAACACCATCTGAAATAACAAAAGAAACTGTGAAACGACATGGCAAAAAGATGTCAAATAGGAAACCATCGAGAAGACATTCAGACAATATGTTGATCTTCCAACGCTCTTCTCAACCGGGATGCCCTTGCCCTCCTTCTGGACGGCCTAATGGAAAGACGGTACTGAATTTGAATGCAGATTGTTTCAACTCCGTCAACGATTTACTGCATGTATTTGTGCACGTATTGGGTTTGGACCACCAACACAATATGTACGACAGGGACTCCTACTTACACATTCTATGGAATGACCTCACTCCAGAGGTAAAAAAAGATATGAAAGAAAAATTACCTCCTGCAGCATCAGTGGGATTCCCATACGACTATCAAAGTGTTATGCATTACCCATGGTTACAAATAAAAAATGGCTCCACCAACATCATGTACCCTGTCTGGAACGACGGTTGGGCGATGGGACATTGGCAAGGTTTGAGTTTGACAGATGTTAACAAAATAAATTTTCTGTACAAGTACGAGTGCAGGAAAAGACGAGAAGAAGCTCAAAAGTAA

Protein sequence:

>DPOGS207006-PA
MESQRKMMCAWIALVFCTNCFYLAESLIPMLTVRYVDQKLIDRKRQDTLEICRLFNQEIKYLEAVNATNYTLNNSSNNRAKREVRADVDNPLLKDEDIERVESIVDKIYDDMQQNGSPVTYRRRFLNEEDKKKIQSRRFNFAPAELLGPKEDKDDHKILNDDEVKIPWIKKWKHGIVPFFIDPNTYDSILAETILKAFEYIEKVTCIRLQRLRERPTDVQSLQNVEWLYISNPLGLKQCVHSNERKPNSGVQMVVFGYDCLSQGEISHEIMHVLGFSHEHTRSDRDKHIDILWDNIKPGYKKYFEIRKDDPLLVLPYDYKSVLHYPSRAFSKNGQQTVKAQAAVKIGQREALSALDVEKIGMIYGAECVDRNKDYLTKTCPSAMSTNLNSVVATQDEIDKYFENRIWPYALVNYKIRNNLEFTAEEKDNIKAVLKHIEKETCIEFRDITQSDESLDKDDDNNVDHVPINETEDRNKKNESVTVQNNEKSTVLTEDEETTTNSEHKMTHVTEDTPSHLPGLRHKSDGIVKQAHKKTPSEITKETVKRHGKKMSNRKPSRRHSDNMLIFQRSSQPGCPCPPSGRPNGKTVLNLNADCFNSVNDLLHVFVHVLGLDHQHNMYDRDSYLHILWNDLTPEVKKDMKEKLPPAASVGFPYDYQSVMHYPWLQIKNGSTNIMYPVWNDGWAMGHWQGLSLTDVNKINFLYKYECRKRREEAQK-