DPGLEAN20651 in OGS1.0

New model in OGS2.0DPOGS204406 
Genomic Positionscaffold272:+ 74299-77254
See gene structure
CDS Length1647
Paired RNAseq reads  43
Single RNAseq reads  109
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007915 (2e-37)
Best Drosophila hit  CG15255 (2e-46)
Best Human hitmeprin A subunit beta precursor (3e-24)
Best NR hit (blastp)  AGAP010764-PA [Anopheles gambiae str. PEST] (2e-54)
Best NR hit (blastx)  AGAP010764-PA [Anopheles gambiae str. PEST] (2e-53)
GeneOntology terms

  
GO:0004222 metalloendopeptidase activity
GO:0006508 proteolysis
GO:0008270 zinc ion binding
InterPro families
  
IPR001506 Peptidase M12A, astacin
IPR006026 Peptidase, metallopeptidase
Orthology groupMCL13037

Nucleotide sequence:

ATGTCGCAGAGCGAAATTGAAGATTTTAGTAATTTTCTAAGACAAACATCAAGCCTGGAT
CAAAATTTAAAAAGTCTACCAGAAGGAGATTATGAGGAAGATGTATCTCTGGATGACGAT
CATCACGCTTGGGAGAAGAGCGGAAAGTTTGAAGGGGACCTCATTCTAAACGAACGTCAG
AGAAGGATGATTGTTAACAACGTCGTGGAAGGACTTGCTCGGAACGGTCTAACTGACAGC
ACTAAGCGTTGGCCGAATAACGAAGTGATTTATTTTATACAGCCTGACCACTTTTCCGAC
GATCAAGTACGTTCAATACAAAATGGTATCGAAGATTTGGCGAGAGCGTCGTGTGTTAAA
TTCAGACCTTACGTGAAAGGAGACGCTGATGCGGTAGTCATACAGGGAAGTAAGCGTGGT
TGCTTCTCACAAGTGGGTTACCAAGGGGGTTATCAAATTCTCAACTTATCTCGTCGCCAT
CCAGCCGACCGAGGTTGCTTCCGCCTTGGGACTGTAGTCCATGAACTACTCCATACTCTT
GGCTTCTTCCACATGCAGAGTAGTCCTGACCGCGACGAGTTCATTGACGTATTATGGGAT
AACATAATAAGACAGGCTAGGCACAATTTCCGCAAGTATGACTCACTTTCGGTTTCGGAT
TTTGGAGTTGGCTACGACTATGACAGCGTTCTGCATTATAGCCGTAAAGCTTTCTCTTCA
AATGGTCAAGACACGCTTGTACCTAAGAGAATCGGTCTTTCGGAAAAGGATATTGTTAAA
TTGAACAAAATGTATTGCGATGTAGATGCAGGCGTTATATCTCAAGATAGTATTTCATCG
TTCGATATGGAGAAGAAAAGGAAAGGTGCTAAAAATAAACCATTCGTGGGTCAAGGGCTA
GGATATCAAAAGGGGAAAACTGTTATTATAAAACTACCTAAAGCCGATGAACAGAACAGT
CCCAAAAATCCCGTACGTGGTTATTTTAGTGAAACAACGCAGACCATACACCCAACATTG
AATCTAGAAACTGGCCCTAAAGAAGACACAATTTTATATGACTATCAGTTTCCTGGACAT
AATATAAATGAATATATGTCACTATTACCGAGAAAGAAAGAAAACCAAGACGACTATAAA
GAATTAAAAATAATCGATGCGAATAATGACGACAAAAGAAATTCCTACTTCTCAAACGAA
GGCTCAATAGGTGAATCAGATAACAGAGATGTTATACGATTTTCACAATTACCAGCTGTT
CTTCAAGAGGATAAAGATGAAGTTGAAGATTCAGCACGAATATATTATTATGGTCAAAGT
GACAGTTTGTCACCACATAGCGTACCTATTATAGAAAAGCAACAGATTAAAGCAAAAAAT
GCCAAACATTTAGCTTACCCTCACTATGATACCTTAAATTTTCAACCCAATAGATTCTCT
GAAAAGGATTCAGATTTTTTATACGGAAAATCGAAACAAGATTCACCTCGAATTCTTCTT
TATAATCCATCAAATGAGTTTGCTGACAGCGAATGGCATTTGAATGAAAATTATAAACCT
AATTTTTATATACAAGAAGACGGACAAATTAAACATGATAAATATGATTTACCAAAAATA
GGGTATAATTATGAGTTGTTTCAATAA

Protein sequence:

MSQSEIEDFSNFLRQTSSLDQNLKSLPEGDYEEDVSLDDDHHAWEKSGKFEGDLILNERQ
RRMIVNNVVEGLARNGLTDSTKRWPNNEVIYFIQPDHFSDDQVRSIQNGIEDLARASCVK
FRPYVKGDADAVVIQGSKRGCFSQVGYQGGYQILNLSRRHPADRGCFRLGTVVHELLHTL
GFFHMQSSPDRDEFIDVLWDNIIRQARHNFRKYDSLSVSDFGVGYDYDSVLHYSRKAFSS
NGQDTLVPKRIGLSEKDIVKLNKMYCDVDAGVISQDSISSFDMEKKRKGAKNKPFVGQGL
GYQKGKTVIIKLPKADEQNSPKNPVRGYFSETTQTIHPTLNLETGPKEDTILYDYQFPGH
NINEYMSLLPRKKENQDDYKELKIIDANNDDKRNSYFSNEGSIGESDNRDVIRFSQLPAV
LQEDKDEVEDSARIYYYGQSDSLSPHSVPIIEKQQIKAKNAKHLAYPHYDTLNFQPNRFS
EKDSDFLYGKSKQDSPRILLYNPSNEFADSEWHLNENYKPNFYIQEDGQIKHDKYDLPKI
GYNYELFQ