DPGLEAN20978 in OGS1.0

New model in OGS2.0DPOGS203622 
Genomic Positionscaffold728:+ 26078-38008
See gene structure
CDS Length3708
Paired RNAseq reads  5177
Single RNAseq reads  13357
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007293 (0.0)
Best Drosophila hit  CG17018, isoform C (7e-51)
Best Human hitlimkain-b1 isoform 3 (5e-59)
Best NR hit (blastp)  PREDICTED: similar to limkain b1 [Tribolium castaneum] (2e-147)
Best NR hit (blastx)  PREDICTED: similar to limkain b1 [Nasonia vitripennis] (4e-93)
GeneOntology terms





  
GO:0005777 peroxisome
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
GO:0003723 RNA binding
GO:0003676 nucleic acid binding
GO:0000166 nucleotide binding
InterPro families

  
IPR000504 RNA recognition motif domain
IPR012677 Nucleotide-binding, alpha-beta plait
IPR021139 Domain of unknown function DUF88
Orthology groupMCL12913

Nucleotide sequence:

ATGACAATGAGCAAAATGTATCCAACGAGAGGTGTAGACATTTATGTGAGAAATAAAAGT
GCAAGAAGTCATTCCGTTCAGGGTCTAGCCACCAATATTACTAGCCCTGCCATGAAAATA
CCATTACCTCCTCGTTTATGGCTAACAGATGTAGAAGATGAGTCGTCAGATGATTTAAGC
CATAGTTCACTAGAAGATGGGACCATTCCAGTTGAAAGGACGAGAATAAGATCACGTTTT
CGTCACAAACCGAGGGCATCTGTTAATGTGCCTATTGGAATATTTTGGGATATAGAGAAT
TGTCCGGTGCCCAGAGGCTGTTCAGCGATTAATGTAGTAGCAGCGATCAGATCTAAGTTC
CTCACAGGCCGTCGGGAAGCCGATTTTGTAGTTGTGTGTGATGTGAGAAAGGAAACACCC
CAAAAGCTTCAGGAACTGAATGATGCTCAAGTATGTAGCAATACCTTGACATTTAAAATA
TTAGCTGCCGATGAGAAGCTAAGACAATGTATGCGTCGCTTCGGAGAGCTTCACTCAAGT
CCGGCAGCCCTATTGCTTATATCGGGAGACATCAATTTTGCAGCGGATCTCAGCGATTTC
AGACATAGGAAGGGTATGGAAGTTATATTGGTGCATAGACAGAATACCTCATCTGCTTTA
ATAGCCTGCGCCTCCTCTCATTACTCATACAACGAACTTACAGTGAATATACCGAGGAAT
GTGAATGTAAGTGGGGTGGAGGAAGAGGATCCTAGCCGTGAGATGGAGGTCATTAACCTG
CCTATGGACCAGCCGCCGCAGAGGGTTTCCAGGCGTCTGCTGCGACTGGCCGACAACTGT
GGGGGCAAGGTGATGAGGGTTGGCGCTGGGACGGCTCTCCTCAGGTTCCCTACACCTGAT
CACGCTTCACGAGCTCTAAAACGTATGGACGGCGAGGATGTTTTCGGTCGTAAAATTAGT
ACGCGTTACGCTCGAACGTTCCAATCGACGTATTCCAGCGACGAGGGTTACAGCACCGGC
CAGAGGCCGCAGTTTGTTGCACCTGTCGTTCAGCCCCAGCCTCCTCCCCCTCCCCCTCCG
TCCGAGCCTCCCCGCAGCGCTTCTAGCGACTGGGCTCTGGCGCTGCAGCAGCTGCCAGCC
CCAACACCACCACCACTGCAGTTCTGCCCACCACCAGCACCAAAACCGCGCAAGATTAGA
GGAACTCATGGGTCAGCCAGCCTGGACCGGTCTGGTTGCTCATCCACCAACAGTGATGAC
CGCCGTGAGAACAGTCATAGTCACAGTCACAGCCGAGCAGTGTCTCCTTGGAACTCCTCG
GTCAGCGAGCACAGCGAGGATGACACTGACCTCACCGTCTCCAACCTACCACCTTATGAA
CCACCAGTATTACAGGAAATGCTGACGAAACTGTTCAATCAGTACGTACCGGTGGTTCGG
GTGTCAGTGTGGGCGGCGGGCGAGGGTCCTCTGGCTACGGTCGTAGTACGATCTGAATGG
GACGCCCGTCTAGCTATAGCCCGCGTCCACAAACGAAGATTGGACAACAATTGGGCTGGG
AGACGATTGGAATTATCCCTAGGAAGACCATCGCCGGCTCCCAATCTAGACGTCCTCAGG
TCTAGACTGCGAGCTATACTCCTAGACCAGAAAAATTTCAGTCTACCGCTATTGAGACTG
CGAGATGCGTACGCTAGTAGACATTGCTGTGCGCTGACTACATCAGACATAGCTAAAGTC
AAGGATACGGTTGTCATCCATGAAGGTTTCGGACGTATGGTCCAATTGGTCGATCACACG
CCGGTCGTGAACACGGAAACGGAAGAGGCGCCATGGAAGTGTCATATACACGCTGTACTG
AATACTAGCCACGAGGACGGAAGCCGTATCTTGCAACCCGTCTACATGGATATAGCGGTT
CTGGCTAAGAATACTCAAGTGCTGTTGGAGAATCACGGGGGTATACTGCCCTTGTTGAGT
TTCGTGGAATGCTACGAGGCAATGTTCCCGTCGTTAGTAACAGACAATCGTCGCGGCGTC
GCCCTGGAACTGCTACTGCGAAGTATACAGACATTGGAAGTTAAAGACAGTCCCTCGAGA
CATTTAACCTGGAAAACATCCACAGACTCGCCCTCGCAGAGTTATATCAGCGACACTTCT
CGCAGCAGCGACCGCGACCGTCCACGCACAGCGCCCGCCTTGGAACCGATGCTGGCGTTG
TTTGAGAGAGAGTTGGTGGATTTGTTGAGAACAGCCCCAAGATGCTCAATACCGTTCAGC
AAGTTGATACCCGCGTTCCATCATCATTTCGGTCGCCAGTGTCGTGTGGCTGATTACGGC
TTCACCAAACTACCAGATCTGTTATCAGCTCTGGGTAACACTATCGTGGTTCTGGGCTCA
GGGTCATATCGTGTCATAACGATCTCATCCGCCGCCCAGGGGAGGCGTTGGACGTCGGAT
TTGTTGAAAATATTGAAGGCCCAACCCGGCCGGGTCATTCACATCCACGATCTGCCGCAG
TTATACCAGTCGACCATCGGCAGACCTTTCAGCACCGTCGATTATGGAGTGTGCACCATG
GACGAGCTGATGGAGAAGGTGTCCCCTCAGAGCGTCATAGTCTCACCAGAGGGCACCATC
TCACTCCCACGGAGAACCCCGACGCCGGAAGAACGAACGAGAACTGTACAGTTCGCGGTG
CAAGCGGTGGAGCTGCTTTGCTACACTCCGAACCTCAGAATGGAGTTTTCGCGTTTCGTG
CCGGCTTATCACGCGCACTTCGGGAGGCAGTTACGTGTTGCGCATTACGGATGTGTTAAA
TTGGTGGAACTGTTCGAATTGATACCGGAAGCAGTGAGTGTGTACTGCGAGTCTTCCGGT
GAGAGGAGCGTGCGGCTGGGGCTGCAGACTGCTACGGCCGTGATGGCGCAGAGATTGAAG
AGCCTGGCGCCCGTGTCGATCACCACCTTCCCCTCGCGCTACGCCGCCCAATTCGGCGCC
CCGCCCCTACCTGATTGCTTAGACGCTCCTAATCTCGAATCCCTTGTATACGCAGCCGGT
GGTTTCATTGAAGGTGATATAATTCACGTCGGTGATTCATCTCAATGGGCCAATTCGGCG
CTGTCAGCGTGCGCCGTGCTGTCGGCCGACCGCAGCGTCGCCAGGGGATCCACCGAGGAA
TATTTCATGACGGCATTCCGCTCGCTGTATGGCATCGAGCCGGACGTGAGCAATCTAATG
ATGTCCGGTGTCCTCGACGTGTCGGAGCGTCACGTGTGCCTGACTAACACCTGGCGTACC
GTGTGGCGTGTCGCACAAATCTTGTCCGATTACCCGGGGAACGTGGCGGCAGTCGAGATA
TTCATCGAGTACTCAAAGAGATACGGCCCGTCGTTCCCTAACGCGGAGCTCGGTATGGAT
GCTATGAACACGTTTCTGAGTAAGCATCGCAGTGTTTTCAATACTAGTGATGGCCGCTGG
GGTCTCTCGGCCGGCGTGACATTACCCAGACCGGAGTATTCACTACGCGCTGAAGATTAC
TCACTTCACGATACACCACCGGGGCAGAAGGGATCCCGCGTCTTTGAATCTCCGAAGACG
AATATTTGGAGTTCGCCGCCGGCCAGCGCCCTACCAACACCGACTGCTCTACTCAACCAC
GATAATAGACGTCGCACCCGTCTGGCAGCACAGTTTGATGCAGCGTAA

Protein sequence:

MTMSKMYPTRGVDIYVRNKSARSHSVQGLATNITSPAMKIPLPPRLWLTDVEDESSDDLS
HSSLEDGTIPVERTRIRSRFRHKPRASVNVPIGIFWDIENCPVPRGCSAINVVAAIRSKF
LTGRREADFVVVCDVRKETPQKLQELNDAQVCSNTLTFKILAADEKLRQCMRRFGELHSS
PAALLLISGDINFAADLSDFRHRKGMEVILVHRQNTSSALIACASSHYSYNELTVNIPRN
VNVSGVEEEDPSREMEVINLPMDQPPQRVSRRLLRLADNCGGKVMRVGAGTALLRFPTPD
HASRALKRMDGEDVFGRKISTRYARTFQSTYSSDEGYSTGQRPQFVAPVVQPQPPPPPPP
SEPPRSASSDWALALQQLPAPTPPPLQFCPPPAPKPRKIRGTHGSASLDRSGCSSTNSDD
RRENSHSHSHSRAVSPWNSSVSEHSEDDTDLTVSNLPPYEPPVLQEMLTKLFNQYVPVVR
VSVWAAGEGPLATVVVRSEWDARLAIARVHKRRLDNNWAGRRLELSLGRPSPAPNLDVLR
SRLRAILLDQKNFSLPLLRLRDAYASRHCCALTTSDIAKVKDTVVIHEGFGRMVQLVDHT
PVVNTETEEAPWKCHIHAVLNTSHEDGSRILQPVYMDIAVLAKNTQVLLENHGGILPLLS
FVECYEAMFPSLVTDNRRGVALELLLRSIQTLEVKDSPSRHLTWKTSTDSPSQSYISDTS
RSSDRDRPRTAPALEPMLALFERELVDLLRTAPRCSIPFSKLIPAFHHHFGRQCRVADYG
FTKLPDLLSALGNTIVVLGSGSYRVITISSAAQGRRWTSDLLKILKAQPGRVIHIHDLPQ
LYQSTIGRPFSTVDYGVCTMDELMEKVSPQSVIVSPEGTISLPRRTPTPEERTRTVQFAV
QAVELLCYTPNLRMEFSRFVPAYHAHFGRQLRVAHYGCVKLVELFELIPEAVSVYCESSG
ERSVRLGLQTATAVMAQRLKSLAPVSITTFPSRYAAQFGAPPLPDCLDAPNLESLVYAAG
GFIEGDIIHVGDSSQWANSALSACAVLSADRSVARGSTEEYFMTAFRSLYGIEPDVSNLM
MSGVLDVSERHVCLTNTWRTVWRVAQILSDYPGNVAAVEIFIEYSKRYGPSFPNAELGMD
AMNTFLSKHRSVFNTSDGRWGLSAGVTLPRPEYSLRAEDYSLHDTPPGQKGSRVFESPKT
NIWSSPPASALPTPTALLNHDNRRRTRLAAQFDAA