DPGLEAN04178 in OGS1.0

New model in OGS2.0DPOGS209775 
Genomic Positionscaffold6669:+ 4463-10517
See gene structure
CDS Length1356
Paired RNAseq reads  185
Single RNAseq reads  633
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010291 (1e-33)
Best Drosophila hit  CG42249, isoform C (8e-67)
Best Human hit5'-nucleotidase precursor (8e-43)
Best NR hit (blastp)  5' nucleotidase [Culex quinquefasciatus] (5e-93)
Best NR hit (blastx)  5' nucleotidase [Culex quinquefasciatus] (1e-90)
GeneOntology terms


  
GO:0008253 5'-nucleotidase activity
GO:0046872 metal ion binding
GO:0009166 nucleotide catabolic process
GO:0000166 nucleotide binding
InterPro families


  
IPR008334 5'-Nucleotidase, C-terminal
IPR006179 5'-Nucleotidase/apyrase
IPR004843 Metallo-dependent phosphatase
IPR006146 5'-Nucleotidase, conserved site
Orthology groupMCL10367

Nucleotide sequence:

ATGTACGCTCTGTGTATATTGTTGTTGTGCGGAATTTCTGTACGCGGTGAAAAGTTTTAC
GAGTTAAACATTATTCATTATAATGACTTTCATGCAAGATTCGTAGAAACTAGTCCATCT
GGATCTGTCTGCAATCCGACAGCAGCTCCTTGTATCGGAGGCTTCGCCCGACTCGCAACC
CTTATTAGAGATGGACTCGAGAGGAATCCAGAATCCTTAGTATTAAACGGAGGGGACTCT
TTTCAAGGAACTATCTGGTATAATTTATTGAGATGGAATGTCACTCAGGATTTTATGAAC
ATGGTCCATCACGATGCTCATGTGTTAGGAAATCATGAGTTTGATAATGGCATTGAAGGT
ATTGTTCCATACCTTCAACATCTCCAATCTCAAGTTGTTACCGCTAATATTATCGACGAC
GATGAACCAACGATACAAGGACTGTACAAGCCTAGCATTGTAGTCGAAAAGGGAGGTCGG
AAGATTGGTATAATAGGTGTTATAATATCGAGTACTGACGAACTCGCAAGTACAGGAAAT
TTAAAGTTTACGGACGAAGTTGAAGCTGTTAGACGAGAAGCTGAAAAGTTAAACGAGCAA
GGAATAAATATCATTGTGGTTCTATCACACTGTGGGATAGATATAGATCGTAAAATAGCC
CTTAATGCTGGTCCCCATATAGACATAATTGTCGGAGGTCATAGTCATACACTTTTATCA
AACAGTGACCCTCCAGAAGGTTCGACTTGGACTCCATTGGGACCTTATCCCGTTGTCGTA
GAACAGACAGCAAGATCTGTATTGATCGTACAAGCTGGAGCTCATACAGCATTTTTAGGA
GAAATTAAACTTAATTTTAATGATAACGGAGATCTTATCAGTTGGGTTGGTGATCCTCAT
TATATAGGCAACAATGTTCTTCCAGCTCCTGACGTCTTAGAAAAGATTAATGAGTATCTG
CCAGTTATAACTGAACAAGCAACTGAATTGATTGGAGTCTCTAAAGTCCACATGTCCTCA
AGATGTAATTGCGGAGAATGCAATCTTGGTATGCGAGTTATATTTGATGGAGCCCGTCCC
GTTAACAATAGAGTTGTAAATGCTACAATAAGGTGTAATTATTGTGATATACCAACATAC
GCGCCTCTGGATCCGAACAAATATTACAAAGTCGTTTCACAATCCTTCATCGGAGGTGGT
GGCGATGGATTTAGTATGATATCGAATAATCGCCAGAACGTAGAAGTGTTGGGTGTGGAT
TACGACATACTGTTACGTTACGTACGGCATCAGTCGCCGATCATGAAGGACTTGGACGGA
CGGATACTTATAAAAGATCCATGTTTAGAGAACTGA

Protein sequence:

MYALCILLLCGISVRGEKFYELNIIHYNDFHARFVETSPSGSVCNPTAAPCIGGFARLAT
LIRDGLERNPESLVLNGGDSFQGTIWYNLLRWNVTQDFMNMVHHDAHVLGNHEFDNGIEG
IVPYLQHLQSQVVTANIIDDDEPTIQGLYKPSIVVEKGGRKIGIIGVIISSTDELASTGN
LKFTDEVEAVRREAEKLNEQGINIIVVLSHCGIDIDRKIALNAGPHIDIIVGGHSHTLLS
NSDPPEGSTWTPLGPYPVVVEQTARSVLIVQAGAHTAFLGEIKLNFNDNGDLISWVGDPH
YIGNNVLPAPDVLEKINEYLPVITEQATELIGVSKVHMSSRCNCGECNLGMRVIFDGARP
VNNRVVNATIRCNYCDIPTYAPLDPNKYYKVVSQSFIGGGGDGFSMISNNRQNVEVLGVD
YDILLRYVRHQSPIMKDLDGRILIKDPCLEN