DPGLEAN20331 in OGS1.0

New model in OGS2.0DPOGS205560 
Genomic Positionscaffold280:- 25285-42841
See gene structure
CDS Length1974
Paired RNAseq reads  1070
Single RNAseq reads  2730
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002526 (1e-120)
Best Drosophila hit  Ance-3 (2e-180)
Best Human hitangiotensin-converting enzyme isoform 3 precursor (7e-111)
Best NR hit (blastp)  angiotensin-converting enzyme 7 (AGAP009757-PA) [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  angiotensin-converting enzyme 7 (AGAP009757-PA) [Anopheles gambiae str. PEST] (0.0)
GeneOntology terms


  
GO:0008241 peptidyl-dipeptidase activity
GO:0008237 metallopeptidase activity
GO:0006508 proteolysis
GO:0016020 membrane
InterPro families  IPR001548 Peptidase M2, peptidyl-dipeptidase A
Orthology groupMCL16932

Nucleotide sequence:

ATGGCCTGTGCTATAGGACGGATACTTATAAGAACTCTATTTCAATTTACAATAGTTCAT
ATTTTATATGCTGACCCTCAGCTGGACCTTCCCCAACTGCCCCAAGTGTCTTCACAGCGT
ACTCCTTATGGCTATGGATTTTCCAGTACACAGTCCAATTTACCCACAAGCCAGTTTGAT
TTAAACCGTAATTTTCAATATAGTACCGCCAGAGCACCGAGTGCTGTCACTCCGGGTGTG
TATCCTGTATCTTCTACCACGCATCAGATAAACCCGGACATAAATAATAACGGATATCCC
AGTTTAGATTACGGGAACGGACGGAATCCAAGTTCCACTCTTCGTCCATTCGATGATAGA
AATCGTGATGATAGAATAGACATCACGAATGACGCGAATTTTCGTCGGAATGATCCAAAT
TTCGCTAGAAACGATCCCAGCTATGTTGGGACCAACCCGTATAATTTCAACGGTGATGTG
AACTCTAATCGTATAGATGAAAGGCTTAATCACGCCAGCCTGCAGCAAATTAAGGATTTT
CTTCACCAAGCTGATGCACAGGCTTCTAAGGAGTGTACGAACAACGTAGCAGCTCAGTGG
AATTTTGAAACCGATGTTAATGATGCAACTCAGCATGCTGCTTTGGAAGCACAACAACGC
TACACGTTATTCCAGCGAGGGTTGTGGGAAGCGGCTCAGGGCCTGCCTCGTGGTGCTATC
AGGGATTTCTCTACCTTCAGACAACTGAAGTTACTATCTACCATCGGTCCGGCAGCTCTG
CCACCAGATCAATTGGACAGGTATAACAGAATCATTAACGATATGCTGGCTGTCTACAAT
TCAGCGGAGATCTGTGCCTATAATGAACCCTTCAAGTGCGGGCTTCACCTCCAGCCAGAC
CTACAGTTCACCATGTCACATTCCAGAGACTGGGATGAGTTACAACATGTCTGGACAGAA
TGGAGGAGGAATACTGGAAGACGGATTAGAGATTTGTACGAACAACTCGTTGATCTCACC
AATCAAGCAGCAAGGTTGAATAATTTCACTGATGCTTCTGCTTATTGGATGTTTCCATAC
GAAACCTTCAACATGAGACAAGAAGTGGACGAAGTTTGGGAACAGGTCAAGCCGCTTTAT
GATGTACTACATGCATATGTCCGTCGTCGTCTTCGTGAAGCGTATGGACCTGAAAGAATC
TCACGATCCGCACCTATCCCAGCTCATGTACTTGGTGATATGTGGGGGCAGAGCTGGTCT
GGGATAGTCCCCTTCACTCTACCATACCCCGGGAAAAAACTCGTCGATGTCACTCCCGAA
ATGGTGCAACAGGGTTACACACCCCTTACGATTTTCCAACTGGCGGAGGAGTTCTACGTT
TCCATGAACATGTCTGCGATGCCTCCAGACTTCTGGGCACTGAGCGTGTTTGAGCAGCCT
GCTGACCGACACGTGCACTGTCAACCGTCTGCTTGGGACTTTTGTAATGGACACGATTAC
AGAATAAAGATGTGCACTCATCCAGATCAGAAAGATTTGATAACGGCTCACCACGAGATG
GCACACATTGAATATTTCCTGTCATACAGAAATCAACCGAAGGTCTTCCGCGACGGAGCC
AACCCAGGATTCCACGAAGCGATCGGCGAGGCAATCGCGCTTTCCGTGTCATCTCCCCGC
CACCTCCAAACCCTGGGTCTCATCCAGAAGTCTGTAGATGACACGGCCCACGACATCAAT
TATCTCTTCACACAGGCGATGGATAAACTGGCTTTCCTTCCATTCGCCCTGGTGATGGAT
AAATGGCGCTGGGATGTCTTCACAGGCGACGTTAGGAAAGAGCAGTACAATTGTCATTGG
TGGAGATTAAGAGAACAGTACGAGGGCATTAAGCCGCCAGTGCTACGTTCTGAATTGGAC
TTCGATCCCGGCTCCAAATATCACATACCAGCAAACATTCCCTATATAAGGTGA

Protein sequence:

MACAIGRILIRTLFQFTIVHILYADPQLDLPQLPQVSSQRTPYGYGFSSTQSNLPTSQFD
LNRNFQYSTARAPSAVTPGVYPVSSTTHQINPDINNNGYPSLDYGNGRNPSSTLRPFDDR
NRDDRIDITNDANFRRNDPNFARNDPSYVGTNPYNFNGDVNSNRIDERLNHASLQQIKDF
LHQADAQASKECTNNVAAQWNFETDVNDATQHAALEAQQRYTLFQRGLWEAAQGLPRGAI
RDFSTFRQLKLLSTIGPAALPPDQLDRYNRIINDMLAVYNSAEICAYNEPFKCGLHLQPD
LQFTMSHSRDWDELQHVWTEWRRNTGRRIRDLYEQLVDLTNQAARLNNFTDASAYWMFPY
ETFNMRQEVDEVWEQVKPLYDVLHAYVRRRLREAYGPERISRSAPIPAHVLGDMWGQSWS
GIVPFTLPYPGKKLVDVTPEMVQQGYTPLTIFQLAEEFYVSMNMSAMPPDFWALSVFEQP
ADRHVHCQPSAWDFCNGHDYRIKMCTHPDQKDLITAHHEMAHIEYFLSYRNQPKVFRDGA
NPGFHEAIGEAIALSVSSPRHLQTLGLIQKSVDDTAHDINYLFTQAMDKLAFLPFALVMD
KWRWDVFTGDVRKEQYNCHWWRLREQYEGIKPPVLRSELDFDPGSKYHIPANIPYIR