DPGLEAN18426 in OGS1.0

New model in OGS2.0DPOGS202277 
Genomic Positionscaffold3542:- 11501-23796
See gene structure
CDS Length1884
Paired RNAseq reads  155
Single RNAseq reads  434
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004934 (2e-43)
Best Drosophila hit  CG10175, isoform C (7e-135)
Best Human hitPREDICTED: bile salt-activated lipase-like isoform 2 (2e-57)
Best NR hit (blastp)  alpha-esterase [Aedes aegypti] (3e-163)
Best NR hit (blastx)  alpha-esterase [Aedes aegypti] (3e-156)
GeneOntology terms
  
GO:0004091 carboxylesterase activity
GO:0008152 metabolic process
InterPro families


  
IPR002018 Carboxylesterase, type B
IPR019826 Carboxylesterase type B, active site
IPR019819 Carboxylesterase type B, conserved site
IPR002168 Lipase, GDXG, active site
Orthology groupMCL10058

Nucleotide sequence:

ATGCGTTTGTTGTTGTTAGTTTGTGCTTGTGTGATTGTGGCTGTTGAGGCACTCCCGCCA
GCTGCCCTTGGCGATGTTGTTGCAAACGCGGCAAGCAGTTTGAATTCTATGAAGGTAACA
GGGGCATGGCGTCGGCTGTCAAGCACTGTCAGCGAATCAGTCGGTCTGAAGTATGCTCTA
CGTCGCCTTCAGAACGTGCCAGCCAGAGCAGCGAGACTGGTGCACGCGTTGGTAGACCGC
ATGAGGGTCGGGGGTGGGCCCGTGGTTGATACCAAACTGGGCTCCCTAAGGGGACGCCGT
GTTATAACCAGGACCTCGGCTCAAATACCCTACTACAGCTTCAAGGGTATCAGGTACGCT
CAACCTCCACGCGGTTCTCTCCGGTTCCGTCCACCAGCACCCCTGAGCCCCTGGTCGGGA
GTCAGGGATGCGTTGGAAGAAGGTGCGGTTTGCCCACACAGGTTCATGCTGTTCGACACG
TACAAAGGAGACGAAGATTGCCTCTTCCTCAACGTCTACACGCCCGCCCTACCGGATAAG
TTGTCCGGATACAACCCGGAGCTGGCGGTGATGGTGTGGATCCACGGTGGAGCGTTCGCC
GTTGGATCAGGTAACGCGTTCCTGTACGGCCCCGACCACCTCGTTGGGGCCGGCGTGGTG
CTCGTCACACTCAACTACAGGCTGGGAGCGCTGGGCTTCCTTAGCCTGGAGAACGACGAG
GTCCCCGGCAATATGGGACTCAAGGACCAAGTGATGGCCTTAAAGTGGGTGAGAGATAAC
ATCCAAGTGTTCGGAGGTGATCCAAGCCGCGTCACTATCTTCGGAGAGTCCGCTGGAGCG
GCCTCCGTTCACCTGCACATGCTGTCCCCAGCCTCGAAGGGCTTGTTCCATGGTGTGATA
GCTCAGAGCGGGGTGTCTCTATCTCCGTGGGCGCTGGCGTCGTCTCCTCGTGAGCGAGCT
TTCCATCTTGGCAGGGAGCTCGGCATAGACACCAACAACACAGCCGAACTACTCGGGTAC
CTTCGAGCGACTCCGAGCGAATTGCTAGTTAAGGCTGGAGCTCGTCTCGTATCAGTCCCG
GGGGCTGCAGACCTGCACAGTACAGTGGCGCTGCCGTTCCTACCGGCGGTGGAGCCTCCC
GGCCCTGAAGCCTTCCTCACGAAACGACCCCAGGACTTACTACCTGGTGCTGACGTACCC
CTGATGACTGGGTACAACGCGCAAGAAGGCATCATATTATTCCGACGTCTGCAAAGATAT
CCAAAACTGTTGAGTGAGCTGGAAAGTGAATTTAGAAGAGTTGTTCCCCCCGAGCTGATA
ACCTCGGATGAGGAACGGTCCAAGAAGGTCGCGGATCACATCAGAACATTCTACTTTCAA
CAGCGGAAGATAGACATACGGAGCATAGACAGCCTAATCGACCTGTTCACAGACGTAATG
TTCTTGCGGCCGGCTTTGGAAACTCTGCGGTTGAATGCAAGGACGAACAGAACTAGTCCC
ACTTACATGTATCGATTCGGCTTCGACGGAGCGCTGGGGCTCTTCAAGCGGATGCTGGGC
ATCACACACCCAGGCGTCTGCCACGGTGATGAAATGGGCTACCTGTTCTACTTCTCAAGA
CTCAATTACAGGCTTGATGACGATTCCCCCGAATTGGCCGTCTCAAGAAAAATGGTCCAA
CTTTGGACTAACTTCGCTAAGACTGGCAACCCCACACCACCGATCGATTACGAGTCTGTG
CTAGACTTCAAATGGCCGCCGGTGAATGACAGTGATCACGTGACGTACCTCGACATCACG
CGCCAATTCGCCGTCAAGAGCGATCCGGAACCGAAAAGAGTTCGCTTCTGGGATTGGCTG
TATGATAACTACGAGAACGAATGA

Protein sequence:

MRLLLLVCACVIVAVEALPPAALGDVVANAASSLNSMKVTGAWRRLSSTVSESVGLKYAL
RRLQNVPARAARLVHALVDRMRVGGGPVVDTKLGSLRGRRVITRTSAQIPYYSFKGIRYA
QPPRGSLRFRPPAPLSPWSGVRDALEEGAVCPHRFMLFDTYKGDEDCLFLNVYTPALPDK
LSGYNPELAVMVWIHGGAFAVGSGNAFLYGPDHLVGAGVVLVTLNYRLGALGFLSLENDE
VPGNMGLKDQVMALKWVRDNIQVFGGDPSRVTIFGESAGAASVHLHMLSPASKGLFHGVI
AQSGVSLSPWALASSPRERAFHLGRELGIDTNNTAELLGYLRATPSELLVKAGARLVSVP
GAADLHSTVALPFLPAVEPPGPEAFLTKRPQDLLPGADVPLMTGYNAQEGIILFRRLQRY
PKLLSELESEFRRVVPPELITSDEERSKKVADHIRTFYFQQRKIDIRSIDSLIDLFTDVM
FLRPALETLRLNARTNRTSPTYMYRFGFDGALGLFKRMLGITHPGVCHGDEMGYLFYFSR
LNYRLDDDSPELAVSRKMVQLWTNFAKTGNPTPPIDYESVLDFKWPPVNDSDHVTYLDIT
RQFAVKSDPEPKRVRFWDWLYDNYENE