DPGLEAN10773 in OGS1.0

New model in OGS2.0DPOGS205913 
Genomic Positionscaffold4652:- 5213-19591
See gene structure
CDS Length1515
Paired RNAseq reads  1172
Single RNAseq reads  3174
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007012 (0.0)
Best Drosophila hit  CG31145, isoform C (1e-160)
Best Human hitdentin matrix protein 4 (1e-111)
Best NR hit (blastp)  FAM20C [Culex quinquefasciatus] (5e-179)
Best NR hit (blastx)  PREDICTED: similar to AGAP002913-PA [Tribolium castaneum] (1e-170)
GeneOntology terms  GO:0005576 extracellular region
InterPro families  IPR009581 Protein of unknown function DUF1193
Orthology groupMCL14200

Nucleotide sequence:

ATGAATATGAAGCTCAAGGAGCGGCTCGCGCTCGCTCTTAGCGCATTTCTTGTACTGTTC
ACACTGATGCTGATTGTTGACATACAAATGGACTACGGCATCTCCGGACACAGGGTACCA
TTGCATGGACGTGTCAAAATAGGCGATGACACGGATAAAGGAAGATCGGCGTATATAGAG
TTTAGAAAAAGATTTCTACAGAAAAGCAATGGTAGCAATGGTTCACGAGAGTACGAGCAG
TCGGCTCCGAGTGAGACGAGCGGTGGCGACGCTGAACCTAGGACGCCGGCGACACCTGCC
GATAGGTTCGAGGACCTGCAGAGGATACTGGTCTTCCAGTTGCAGGGGAAGTCTGATGAC
AACCCTGTGGTGGTGCCACCCCATCGAGATCTGAACGTCTTAAGACCCGAGAACCCTACC
ATCGGTGAAATGGAGGACTTGGAACCAAGTGTAAATGCCTCAAATCTGGAGAAATTTCAA
TTGAAAATAGCGCAACACGAGCTCTATGAAGACGGAGAGCCGCTGGTCAGCGCCATTCTT
CGCGACATGACCTTTGAACCCATCCTACATGTTGAACAAAAGGAAGGAGGAACGCAACTG
AAGCTCATCATCGACTATCCAAATGGCGTGCAGGCTTTATTCAAACCGATGAGGTTCGCC
CGGGATGTACAAACTCTACCTAATCACTTCTACTTCTCAGACTATGAGCGGCATAACGCT
GAAATTGCTGCTTTTCATCTTGACAGGATACTCGGTTTCCGTCGAGCGATGCCGGTGGTG
GGTCGAGTTGTGAATATGACCACTGAGATCTATGACGTCACTGAGGGAGACATCTTGAAG
ACGTTCTTCGTATCTCCAGCGAACAACTTCTGTTTCCACGGCAAGTGTTCGTACTATTGC
GACACAGGACACGCGATATGCGGCAATCCGGACATGTTAGAAGGCAGCTTTGCGGCTTTC
CTACCAACCTCGGATCTAGCGGAGCGTAAGGTGTGGAGACATCCCTGGCGAAGATCTTAT
CACAAACGAAGGAAAGCTCAGTGGGAGCTGCAGTCCGACTACTGTGATACGGTCCGCAGT
ACTCCGCCCTACGACTCCGGCCGTCGTCTGTTAGACCTCATAGACATGTCGATATTCGAC
TTCCTCACTGGGAACATGGACAGACATCACTATGAGACATTCAAAATGTTCGGCAACGAG
ACTTTTACGTTGCACCTGGACCAGGGACGAGCTTTCGGTAAGGCGTTCCACGACGAGCTC
AGCATACTCGCGCCACTGCTACAGTGCTGCACCGTTAGACACACTACGCTTGCAGTCCTG
CTTAAATTCCATAACGGCGTGCCATTATCGAAAGTGCTCCGAGATTCTATGAAAGCTGAC
CCCGTGAATCCCGTGCTTTGGGAGCCTCACCTGGCCGCGTTAGACAGGCGTATAGTTACA
GTACTGGACGCGATCAGGAAATGCATAGATAAATTAGAAAATCCTCTACCGAATGAACTG
AACTCTGTCGTGTGA

Protein sequence:

MNMKLKERLALALSAFLVLFTLMLIVDIQMDYGISGHRVPLHGRVKIGDDTDKGRSAYIE
FRKRFLQKSNGSNGSREYEQSAPSETSGGDAEPRTPATPADRFEDLQRILVFQLQGKSDD
NPVVVPPHRDLNVLRPENPTIGEMEDLEPSVNASNLEKFQLKIAQHELYEDGEPLVSAIL
RDMTFEPILHVEQKEGGTQLKLIIDYPNGVQALFKPMRFARDVQTLPNHFYFSDYERHNA
EIAAFHLDRILGFRRAMPVVGRVVNMTTEIYDVTEGDILKTFFVSPANNFCFHGKCSYYC
DTGHAICGNPDMLEGSFAAFLPTSDLAERKVWRHPWRRSYHKRRKAQWELQSDYCDTVRS
TPPYDSGRRLLDLIDMSIFDFLTGNMDRHHYETFKMFGNETFTLHLDQGRAFGKAFHDEL
SILAPLLQCCTVRHTTLAVLLKFHNGVPLSKVLRDSMKADPVNPVLWEPHLAALDRRIVT
VLDAIRKCIDKLENPLPNELNSVV