DPGLEAN14440 in OGS1.0

New model in OGS2.0  DPOGS201807
Genomic Position  scaffold12:+ 108603-121877
CDS Length  1536
Paired RNAseq reads  1288
Single RNAseq reads  3272
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013235 (0.0)
Best Drosophila hit  sluggish A, isoform G (0.0)
Best Human hit  proline dehydrogenase, mitochondrial isoform 1 precursor (4e-125)
Best NR hit (blastp)  sluggish A, isoform A [Drosophila melanogaster] (0.0)
Best NR hit (blastx)  proline oxidase [Drosophila melanogaster] (0.0)
Gene Ontology terms
GO:0007626 locomotory behavior
GO:0042331 phototaxis
GO:0004657 proline dehydrogenase activity
GO:0006562 proline catabolic process
GO:0005759 mitochondrial matrix
GO:0006537 glutamate biosynthetic process
GO:0055114 oxidation reduction
InterPro families
IPR002872 Proline dehydrogenase
IPR015659 Proline oxidase
Orthology group  MCL12762

Nucleotide sequence:

ATGCTGATGAAACGCCTCCGCCAGCTGGTCGGTCAGAGGCTGTTCGAAGCCATCATGAAG
GCCACCTTCTACGGCCAGTTCGTCGCCGGCGAGGACCAGATCAAGATACAACCGACGCTT
GACAGGCTGCGGTCGTTCGGTGTAAAGCCGATCCTCGATTATTCCGTGGAGGAAGATCTC
TCCCAGGAGGAGGCTGAGAAGCGCGAAGTGAGCGCTTCGATATCGACGTGCGGCGACACG
CAGGAGGAGGGTCAACTGAAGCAGTACCACGTGGAGCAGAGATTCGCTGATCGCCGGTAC
AAGGTCACCAGCGCTAGAACATACTTCTACCTGAACGAGGCCTCATGCGAGAAGAACATG
GAAGCGTTTATGAACAGCATCGACACCGTCGCCAAAATAACCAAGAGCACTGGACTTATG
GCCGTGAAACTAACAGCCCTTGGCAGACCACAGTTACTTCTCCAACTGTCCGAGGTGATA
ATGCGCGCCCGTAGCTATATGCAGCAGATAGCTGGCGGTACTGGGAACGTACTCGCCCAT
CATAAGACCATCGAAGACCTGCAGAGATACTTAGGGGATTACAGCGCTCGGCCCGAAGTA
CAGGACTTTATGAACAAAGTCACCTCCGACACGGAAGGTATCGTCCATCTTTTCCCGTGG
TCGAACATTCTGGATAAGGATATGGGTTTGTCAGATTCATTCCGCGTCCCTGACCCGAAG
ACCGGTCAGATGCGACGCCTCATCTCCCAGATATCGCCCAAGGAGGAGGAAATGTTCAGG
AACATGCTGCGGCGTCTCAACAATATAATACAGGTGGCCAACGAGCATGACGTCAGGATT
ATGATAGACGCCGAACAGACATACTTTCAGCCGGCCATCTCGAGGATCTGTCTCGAAATG
ATGAGGAGGTATAACAAGAACAAATTCCTCGTATTCAATACATACCAGACCTATCTGAAG
AACACGTACAACGAGATAGTGACTGATCTCGAACAGGCGCAGCGTCAGAACTTCTACTGG
GGTGCCAAGCTGGTCCGGGGGGCCTACATAGAGCAGGAGCGTGCCCGTTCAGCCGCTATG
GGCTACGAGGATCCCACGTGTGAGAGCGTCGACGCTACGACAGCATCATTCCACCGCTGT
CTCAAGGAAATACTCAGCCGGGTTAAGAACGAGCAAAACGATCGTCTCGGTATAATGGTG
GCCTCTCACAATGAGGACACCGTCCGTTATGCCATCCAGTTAATGAAGGAACACGGCATC
GGGCCGGGGGATAAGGTGGTGTGCTTCGGGCAACTGCTGGGGATGTGTGATCACATCACA
TTCCCATTGGGTCAAGCTGGTTATTCGGCTTATAAGTATGTTCCTTACGGTCCTGTGCTG
GAAGTGCTGCCATACTTGTCCCGTCGAGCAAATGAGAACAGAGGCTTCCTCCAGAAGATA
AAGAAGGAGAAGGGTCTGCTTCTAAAAGAGATATTCCGTAGAATGTTCAGCGGACAGCTG
TTCTACAAACCGTCTGGGAACTATACACCGGTTTAA

Protein sequence:

MLMKRLRQLVGQRLFEAIMKATFYGQFVAGEDQIKIQPTLDRLRSFGVKPILDYSVEEDL
SQEEAEKREVSASISTCGDTQEEGQLKQYHVEQRFADRRYKVTSARTYFYLNEASCEKNM
EAFMNSIDTVAKITKSTGLMAVKLTALGRPQLLLQLSEVIMRARSYMQQIAGGTGNVLAH
HKTIEDLQRYLGDYSARPEVQDFMNKVTSDTEGIVHLFPWSNILDKDMGLSDSFRVPDPK
TGQMRRLISQISPKEEEMFRNMLRRLNNIIQVANEHDVRIMIDAEQTYFQPAISRICLEM
MRRYNKNKFLVFNTYQTYLKNTYNEIVTDLEQAQRQNFYWGAKLVRGAYIEQERARSAAM
GYEDPTCESVDATTASFHRCLKEILSRVKNEQNDRLGIMVASHNEDTVRYAIQLMKEHGI
GPGDKVVCFGQLLGMCDHITFPLGQAGYSAYKYVPYGPVLEVLPYLSRRANENRGFLQKI
KKEKGLLLKEIFRRMFSGQLFYKPSGNYTPV
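
Consistency check (illustrative):

The nucleotide and protein sequences above can be cross-checked against the stated CDS length with a short script. The following is a minimal sketch, not part of the original record. It assumes the standard nuclear genetic code (the record does not name a translation table), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here. Only the first and last 60-column blocks of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed here), written in TCAG order with the
# third base varying fastest: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last blocks of the nucleotide sequence in this record.
first_block = "ATGCTGATGAAACGCCTCCGCCAGCTGGTCGGTCAGAGGCTGTTCGAAGCCATCATGAAG"
last_block = "TTCTACAAACCGTCTGGGAACTATACACCGGTTTAA"

print(translate(first_block))  # MLMKRLRQLVGQRLFEAIMK
print(translate(last_block))   # FYKPSGNYTPV*

# A 1536 nt CDS holds 512 codons: 511 residues plus one stop codon.
print(1536 // 3 - 1)           # 511
```

Both translated blocks match the start and end of the protein sequence listed above, and 511 residues agrees with the length of that protein (eight full lines of 60 plus a final line of 31).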