New model in OGS2.0 | DPOGS203622  |
---|---|
Genomic Position | scaffold728:+ 26078-38008 |
See gene structure | |
CDS Length | 3708 |
Paired RNAseq reads   | 5177 |
Single RNAseq reads   | 13357 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007293 (0.0) |
Best Drosophila hit   | CG17018, isoform C (7e-51) |
Best Human hit | limkain-b1 isoform 3 (5e-59) |
Best NR hit (blastp)   | PREDICTED: similar to limkain b1 [Tribolium castaneum] (2e-147) |
Best NR hit (blastx)   | PREDICTED: similar to limkain b1 [Nasonia vitripennis] (4e-93) |
GeneOntology terms    | GO:0005777 peroxisome GO:0005575 cellular_component GO:0003674 molecular_function GO:0008150 biological_process GO:0003723 RNA binding GO:0003676 nucleic acid binding GO:0000166 nucleotide binding |
InterPro families    | IPR000504 RNA recognition motif domain IPR012677 Nucleotide-binding, alpha-beta plait IPR021139 Domain of unknown function DUF88 |
Orthology group | MCL12913 |
Nucleotide sequence:
ATGACAATGAGCAAAATGTATCCAACGAGAGGTGTAGACATTTATGTGAGAAATAAAAGT
GCAAGAAGTCATTCCGTTCAGGGTCTAGCCACCAATATTACTAGCCCTGCCATGAAAATA
CCATTACCTCCTCGTTTATGGCTAACAGATGTAGAAGATGAGTCGTCAGATGATTTAAGC
CATAGTTCACTAGAAGATGGGACCATTCCAGTTGAAAGGACGAGAATAAGATCACGTTTT
CGTCACAAACCGAGGGCATCTGTTAATGTGCCTATTGGAATATTTTGGGATATAGAGAAT
TGTCCGGTGCCCAGAGGCTGTTCAGCGATTAATGTAGTAGCAGCGATCAGATCTAAGTTC
CTCACAGGCCGTCGGGAAGCCGATTTTGTAGTTGTGTGTGATGTGAGAAAGGAAACACCC
CAAAAGCTTCAGGAACTGAATGATGCTCAAGTATGTAGCAATACCTTGACATTTAAAATA
TTAGCTGCCGATGAGAAGCTAAGACAATGTATGCGTCGCTTCGGAGAGCTTCACTCAAGT
CCGGCAGCCCTATTGCTTATATCGGGAGACATCAATTTTGCAGCGGATCTCAGCGATTTC
AGACATAGGAAGGGTATGGAAGTTATATTGGTGCATAGACAGAATACCTCATCTGCTTTA
ATAGCCTGCGCCTCCTCTCATTACTCATACAACGAACTTACAGTGAATATACCGAGGAAT
GTGAATGTAAGTGGGGTGGAGGAAGAGGATCCTAGCCGTGAGATGGAGGTCATTAACCTG
CCTATGGACCAGCCGCCGCAGAGGGTTTCCAGGCGTCTGCTGCGACTGGCCGACAACTGT
GGGGGCAAGGTGATGAGGGTTGGCGCTGGGACGGCTCTCCTCAGGTTCCCTACACCTGAT
CACGCTTCACGAGCTCTAAAACGTATGGACGGCGAGGATGTTTTCGGTCGTAAAATTAGT
ACGCGTTACGCTCGAACGTTCCAATCGACGTATTCCAGCGACGAGGGTTACAGCACCGGC
CAGAGGCCGCAGTTTGTTGCACCTGTCGTTCAGCCCCAGCCTCCTCCCCCTCCCCCTCCG
TCCGAGCCTCCCCGCAGCGCTTCTAGCGACTGGGCTCTGGCGCTGCAGCAGCTGCCAGCC
CCAACACCACCACCACTGCAGTTCTGCCCACCACCAGCACCAAAACCGCGCAAGATTAGA
GGAACTCATGGGTCAGCCAGCCTGGACCGGTCTGGTTGCTCATCCACCAACAGTGATGAC
CGCCGTGAGAACAGTCATAGTCACAGTCACAGCCGAGCAGTGTCTCCTTGGAACTCCTCG
GTCAGCGAGCACAGCGAGGATGACACTGACCTCACCGTCTCCAACCTACCACCTTATGAA
CCACCAGTATTACAGGAAATGCTGACGAAACTGTTCAATCAGTACGTACCGGTGGTTCGG
GTGTCAGTGTGGGCGGCGGGCGAGGGTCCTCTGGCTACGGTCGTAGTACGATCTGAATGG
GACGCCCGTCTAGCTATAGCCCGCGTCCACAAACGAAGATTGGACAACAATTGGGCTGGG
AGACGATTGGAATTATCCCTAGGAAGACCATCGCCGGCTCCCAATCTAGACGTCCTCAGG
TCTAGACTGCGAGCTATACTCCTAGACCAGAAAAATTTCAGTCTACCGCTATTGAGACTG
CGAGATGCGTACGCTAGTAGACATTGCTGTGCGCTGACTACATCAGACATAGCTAAAGTC
AAGGATACGGTTGTCATCCATGAAGGTTTCGGACGTATGGTCCAATTGGTCGATCACACG
CCGGTCGTGAACACGGAAACGGAAGAGGCGCCATGGAAGTGTCATATACACGCTGTACTG
AATACTAGCCACGAGGACGGAAGCCGTATCTTGCAACCCGTCTACATGGATATAGCGGTT
CTGGCTAAGAATACTCAAGTGCTGTTGGAGAATCACGGGGGTATACTGCCCTTGTTGAGT
TTCGTGGAATGCTACGAGGCAATGTTCCCGTCGTTAGTAACAGACAATCGTCGCGGCGTC
GCCCTGGAACTGCTACTGCGAAGTATACAGACATTGGAAGTTAAAGACAGTCCCTCGAGA
CATTTAACCTGGAAAACATCCACAGACTCGCCCTCGCAGAGTTATATCAGCGACACTTCT
CGCAGCAGCGACCGCGACCGTCCACGCACAGCGCCCGCCTTGGAACCGATGCTGGCGTTG
TTTGAGAGAGAGTTGGTGGATTTGTTGAGAACAGCCCCAAGATGCTCAATACCGTTCAGC
AAGTTGATACCCGCGTTCCATCATCATTTCGGTCGCCAGTGTCGTGTGGCTGATTACGGC
TTCACCAAACTACCAGATCTGTTATCAGCTCTGGGTAACACTATCGTGGTTCTGGGCTCA
GGGTCATATCGTGTCATAACGATCTCATCCGCCGCCCAGGGGAGGCGTTGGACGTCGGAT
TTGTTGAAAATATTGAAGGCCCAACCCGGCCGGGTCATTCACATCCACGATCTGCCGCAG
TTATACCAGTCGACCATCGGCAGACCTTTCAGCACCGTCGATTATGGAGTGTGCACCATG
GACGAGCTGATGGAGAAGGTGTCCCCTCAGAGCGTCATAGTCTCACCAGAGGGCACCATC
TCACTCCCACGGAGAACCCCGACGCCGGAAGAACGAACGAGAACTGTACAGTTCGCGGTG
CAAGCGGTGGAGCTGCTTTGCTACACTCCGAACCTCAGAATGGAGTTTTCGCGTTTCGTG
CCGGCTTATCACGCGCACTTCGGGAGGCAGTTACGTGTTGCGCATTACGGATGTGTTAAA
TTGGTGGAACTGTTCGAATTGATACCGGAAGCAGTGAGTGTGTACTGCGAGTCTTCCGGT
GAGAGGAGCGTGCGGCTGGGGCTGCAGACTGCTACGGCCGTGATGGCGCAGAGATTGAAG
AGCCTGGCGCCCGTGTCGATCACCACCTTCCCCTCGCGCTACGCCGCCCAATTCGGCGCC
CCGCCCCTACCTGATTGCTTAGACGCTCCTAATCTCGAATCCCTTGTATACGCAGCCGGT
GGTTTCATTGAAGGTGATATAATTCACGTCGGTGATTCATCTCAATGGGCCAATTCGGCG
CTGTCAGCGTGCGCCGTGCTGTCGGCCGACCGCAGCGTCGCCAGGGGATCCACCGAGGAA
TATTTCATGACGGCATTCCGCTCGCTGTATGGCATCGAGCCGGACGTGAGCAATCTAATG
ATGTCCGGTGTCCTCGACGTGTCGGAGCGTCACGTGTGCCTGACTAACACCTGGCGTACC
GTGTGGCGTGTCGCACAAATCTTGTCCGATTACCCGGGGAACGTGGCGGCAGTCGAGATA
TTCATCGAGTACTCAAAGAGATACGGCCCGTCGTTCCCTAACGCGGAGCTCGGTATGGAT
GCTATGAACACGTTTCTGAGTAAGCATCGCAGTGTTTTCAATACTAGTGATGGCCGCTGG
GGTCTCTCGGCCGGCGTGACATTACCCAGACCGGAGTATTCACTACGCGCTGAAGATTAC
TCACTTCACGATACACCACCGGGGCAGAAGGGATCCCGCGTCTTTGAATCTCCGAAGACG
AATATTTGGAGTTCGCCGCCGGCCAGCGCCCTACCAACACCGACTGCTCTACTCAACCAC
GATAATAGACGTCGCACCCGTCTGGCAGCACAGTTTGATGCAGCGTAA
Protein sequence:
MTMSKMYPTRGVDIYVRNKSARSHSVQGLATNITSPAMKIPLPPRLWLTDVEDESSDDLS
HSSLEDGTIPVERTRIRSRFRHKPRASVNVPIGIFWDIENCPVPRGCSAINVVAAIRSKF
LTGRREADFVVVCDVRKETPQKLQELNDAQVCSNTLTFKILAADEKLRQCMRRFGELHSS
PAALLLISGDINFAADLSDFRHRKGMEVILVHRQNTSSALIACASSHYSYNELTVNIPRN
VNVSGVEEEDPSREMEVINLPMDQPPQRVSRRLLRLADNCGGKVMRVGAGTALLRFPTPD
HASRALKRMDGEDVFGRKISTRYARTFQSTYSSDEGYSTGQRPQFVAPVVQPQPPPPPPP
SEPPRSASSDWALALQQLPAPTPPPLQFCPPPAPKPRKIRGTHGSASLDRSGCSSTNSDD
RRENSHSHSHSRAVSPWNSSVSEHSEDDTDLTVSNLPPYEPPVLQEMLTKLFNQYVPVVR
VSVWAAGEGPLATVVVRSEWDARLAIARVHKRRLDNNWAGRRLELSLGRPSPAPNLDVLR
SRLRAILLDQKNFSLPLLRLRDAYASRHCCALTTSDIAKVKDTVVIHEGFGRMVQLVDHT
PVVNTETEEAPWKCHIHAVLNTSHEDGSRILQPVYMDIAVLAKNTQVLLENHGGILPLLS
FVECYEAMFPSLVTDNRRGVALELLLRSIQTLEVKDSPSRHLTWKTSTDSPSQSYISDTS
RSSDRDRPRTAPALEPMLALFERELVDLLRTAPRCSIPFSKLIPAFHHHFGRQCRVADYG
FTKLPDLLSALGNTIVVLGSGSYRVITISSAAQGRRWTSDLLKILKAQPGRVIHIHDLPQ
LYQSTIGRPFSTVDYGVCTMDELMEKVSPQSVIVSPEGTISLPRRTPTPEERTRTVQFAV
QAVELLCYTPNLRMEFSRFVPAYHAHFGRQLRVAHYGCVKLVELFELIPEAVSVYCESSG
ERSVRLGLQTATAVMAQRLKSLAPVSITTFPSRYAAQFGAPPLPDCLDAPNLESLVYAAG
GFIEGDIIHVGDSSQWANSALSACAVLSADRSVARGSTEEYFMTAFRSLYGIEPDVSNLM
MSGVLDVSERHVCLTNTWRTVWRVAQILSDYPGNVAAVEIFIEYSKRYGPSFPNAELGMD
AMNTFLSKHRSVFNTSDGRWGLSAGVTLPRPEYSLRAEDYSLHDTPPGQKGSRVFESPKT
NIWSSPPASALPTPTALLNHDNRRRTRLAAQFDAA