Monarch geneset OGS2.0

DPOGS206930
TranscriptDPOGS206930-TA1392 bp
ProteinDPOGS206930-PA463 aa
Genomic positionDPSCF300001 - 1099214-1102235
RNAseq coverage139x (Rank: top 55%)
Annotation
HeliconiusHMEL0150782e-1229.55% 
BombyxBGIBMGA012906-TA6e-2746.04% 
DrosophilaCG6696-PA1e-1929.08% 
EBI UniRef50UniRef50_E2B7R89e-1927.09%Zinc metalloproteinase nas-14 n=5 Tax=Formicidae RepID=E2B7R8_HARSA
NCBI RefSeqXP_001844559.11e-2029.03%high choriolytic enzyme 1 [Culex quinquefasciatus]
NCBI nr blastpgi|3071906061e-2027.18%Zinc metalloproteinase nas-14 [Camponotus floridanus]
NCBI nr blastxgi|3071906063e-2127.18%Zinc metalloproteinase nas-14 [Camponotus floridanus]
Group
Gene OntologyGO:00065081.9e-25proteolysis
GO:00042221.9e-25metalloendopeptidase activity
KEGG pathway 
InterPro domain[60-253] IPR0240796.8e-34Metallopeptidase, catalytic domain
[61-250] IPR0015061.9e-25Peptidase M12A, astacin
Orthology groupMCL30970 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206930-TA
ATGACTTTTTTATTGAATTTCTTCGTGTTGAATATATTAAATCTAGAATTATTTGCTTATGATTTGGAGCCTCCATTAAAAAATTATAAAGATGGAAGTAAATCTGCTTTTATTCAAGCTCCATACGTAAGCATTGCACGAGATCAAAAAGTAAAGAAAATACAAGAAGAGATTACAAGCAGTTGGCCCGAAGGAATAATAAAATACTATGTGGAAGAAAAGAGTTATGATTCATCTATCATTACTCTTATACGCGCTGCGATGAGTGTTTTGGAATCGTCAGCTTGCATACGTTTCAAGGCAGTCAAGGATAAGCCAGAGGGCAATGACACATGGCTACACATCACCAATCCAAAAAAGAAAAGGGAATGCGTGCATGAACCCGAGGTTCTGGAAAGCGGAGAAATTGTTTTAGTTCTTGGTTATGACTGCCTTAAATCTAGAGACTTGATACATTCTTTGCTCCATGGTATTGGATTAAAGGACGAAGTGACGCATCCTCACAGAGACAACTATGTCAAAGTTGTGTGGGATAATATACAACCTGCTTACAGACATCTATATCGTACCCAACCAGTAGAGAATTCTAGAAGCATAGTTGAGTACGATCCATTAAGTATTATGCATTTCCACGATCGGGCTTTCAGTATGAATGGCAAAGCAACAATCCTACCATTGGAAACTGGTTTAAGGATTTCGCCATCAGACGGCTTATCACAGTTGGATAAAATGAAGTTACATATATATTTTGGACACGAATGTAATAAGAGGAAATTCGTTTCCCTCATGGAAACATGTAAAATGTCTTTAAAGAGTAAAAAAGAATCGGCTAGTGATGAAAATCGTGAGAAAGGAAAGGATCGAGATAATGTTACAGGAGAAAAAGGTGATAGTAAAAATGAAAATGAAGACCACGGAGGAAAAGGTGGTACGGAGAATGCTAATAAACTTGAAAAGGGTGAAACAGATGAAAATGAAGGAGAAGAAAATGGAGTAGAAGAAGAAAATGGAGTAGAAGAAGAAAAATTTACTGAAGAAGCAAATAATTCTGAAAATAAAGAAGAGGTAGATGAAAATACTACATGGAGAACCTTACATGGAAATTTAACGGAACTCGAGAAAAATGGCGAAACTGAAGACGCTAAACAAAATACTGAAGAGGATAACTCTTCAAAGATTACAGAAAAGGTTCAAGATGATGATGAAAATAACGATGAATCGAAAGAAAAATCCAAAAAACGATACATTCCTGCAATAATCGGAGTAATAGCTACGGCAAACTCTGATATGAGTTCCGGAAACGTAAATCAGTTGACGGAATCGGAGTCTGCAACAGAAAAGAAACTGAATTCTGGAATCAACTTAAATTATGATAATTATACCGACAAATAA

Protein sequence:

>DPOGS206930-PA
MTFLLNFFVLNILNLELFAYDLEPPLKNYKDGSKSAFIQAPYVSIARDQKVKKIQEEITSSWPEGIIKYYVEEKSYDSSIITLIRAAMSVLESSACIRFKAVKDKPEGNDTWLHITNPKKKRECVHEPEVLESGEIVLVLGYDCLKSRDLIHSLLHGIGLKDEVTHPHRDNYVKVVWDNIQPAYRHLYRTQPVENSRSIVEYDPLSIMHFHDRAFSMNGKATILPLETGLRISPSDGLSQLDKMKLHIYFGHECNKRKFVSLMETCKMSLKSKKESASDENREKGKDRDNVTGEKGDSKNENEDHGGKGGTENANKLEKGETDENEGEENGVEEENGVEEEKFTEEANNSENKEEVDENTTWRTLHGNLTELEKNGETEDAKQNTEEDNSSKITEKVQDDDENNDESKEKSKKRYIPAIIGVIATANSDMSSGNVNQLTESESATEKKLNSGINLNYDNYTDK-