New model in OGS2.0 | DPOGS207108  |
---|---|
Genomic Position | scaffold1:+ 2777220-2781815 |
CDS Length | 1590 |
Paired RNAseq reads   | 267 |
Single RNAseq reads   | 630 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012785 (6e-134) |
Best Drosophila hit   | ND |
Best Human hit | selenoprotein O (7e-81) |
Best NR hit (blastp)   | PREDICTED: hypothetical protein [Saccoglossus kowalevskii] (9e-128) |
Best NR hit (blastx)   | PREDICTED: hypothetical protein [Saccoglossus kowalevskii] (2e-125) |
GeneOntology terms    | GO:0003674 molecular_function; GO:0008150 biological_process; GO:0009507 chloroplast |
InterPro families   | IPR003846 Uncharacterised protein family UPF0061 |
Orthology group | ND |
Nucleotide sequence:
ATGGCCAAACCAATAAGTGATTTCAAGGAATGGAAGTTTCGACTCCCACCGAGCTATGTT
CAGTTGCCTATAACCGGATACCCTGATTACAATATTCCCAGAGCCGTCAAAGATGCTGTT
TTTGTTAAAGTACCAACGGAGCCCCTTACTGGAAAAATTGATCTTGTCTGTGTGTCCAAT
GATGCATTGACTGACATTCTCGATCTTGATCCTGTAGTAGCGGAGAGTGAGGAATTTGTC
GAATTCATAAATGGCAAATATTTACCTCAAGGTGCTCTGAGTGTATGTCATGGATATGGT
GGGTATCAATTTGGTTTTTGGGCTGACCAGCTTGGAGATGGTAGGGCACATATTCTTGGT
GAATATGTTAATAGTAAAGGAGAGTTATGGCAGTTGCAGTTAAAGGGTTCCGGTGAGACT
CCCTTTTCTCGTTTCGGTGACGGTCGTGCAGTACTCAGATCATCGCTGCGTGAGATGGTA
GCTAGCGAGGCATGCCACCACCTTGGCATTCCTACCACCAGGGCCGCAGGATTGGTGGCA
AGTGATTCCCACAAGGTCCTGCGAGATAGGAGTTACAGTGGTCTGGCCCGTCCCGAGCGG
GCGGCTGTGCTACTCCGACTGGCACCTAGTTGGATGAGGATTGGAAGTTTCGAGTTAATG
CACCGCAGGCAACAGACAGACATGTTGGTCGAGCTTGCTGATCATGTTATAAAGCATTTC
TTCAGCCACATTGATCTTAATGATAAAGACAAATATGTGAAGTTTTTCACAGAGGTGGCT
CACAAGAACCTGGACATGGTTGCCACCTGGCAAGGGCTTGGCTTCACTCATGGGGTTCTT
AACACAGACAACATCAGTATATTGGGTCTGACCATTGACTATGGACCGTTTGGGTTCATA
GAACACTATTATGAGAATTATGTTCCAAACTCGTCCGACGATATGGGAAGATATGCATTT
AACAAACAGCCAGAGATCTTGCTGTGGAACTTGGGAAAATTGGCTGAAGCGTTACAGTTA
ATACTTTGTGATGAGAGTAAAAAGAAAATTAAAGATGTTATCGATACTTTGGAACTGTAT
GTGAAAGATAAAATACTTCATACTTACATTCTAAAACTTGGTCTCACTGAAGTCAGAAAA
GGTGACGATAAATTGGTGAAAGATTTCCTTGAGATGATGCAACAAACGTCATCCGACTTC
ACTGGCAGTTTTAGACAAATTTCCGAGATTAGTTTGAATCAGCTACTAGATAAGGAAACG
TTGGAGTCCAAGTGGGCTTTAGCAAGGCTGAGTAAATCGAAGAATTGGGATAAGTGGATT
CAGCGATACAAGGATAGATGTTGTCAAGAAAATGTGAATGAAGACGAGAGAGTGAAACAT
ATGCTCAAAGTAAATCCACTATACGTCCCCCGTAATTGGATGTTACAAGAGGCCATTAAA
GATGCAGAGAACAATGATTTCAACAAGGTAAGATTGTTACTTGAAATCTTTACCAAACCG
TATGAAGCGAATGAAGAAGCTGAAAAATTGGGATACTCGTCACAACCACCGAGCTGGTCC
TTTGGCCTTAAGTTGAGTTGTTCCAGTTAA
Protein sequence:
MAKPISDFKEWKFRLPPSYVQLPITGYPDYNIPRAVKDAVFVKVPTEPLTGKIDLVCVSN
DALTDILDLDPVVAESEEFVEFINGKYLPQGALSVCHGYGGYQFGFWADQLGDGRAHILG
EYVNSKGELWQLQLKGSGETPFSRFGDGRAVLRSSLREMVASEACHHLGIPTTRAAGLVA
SDSHKVLRDRSYSGLARPERAAVLLRLAPSWMRIGSFELMHRRQQTDMLVELADHVIKHF
FSHIDLNDKDKYVKFFTEVAHKNLDMVATWQGLGFTHGVLNTDNISILGLTIDYGPFGFI
EHYYENYVPNSSDDMGRYAFNKQPEILLWNLGKLAEALQLILCDESKKKIKDVIDTLELY
VKDKILHTYILKLGLTEVRKGDDKLVKDFLEMMQQTSSDFTGSFRQISEISLNQLLDKET
LESKWALARLSKSKNWDKWIQRYKDRCCQENVNEDERVKHMLKVNPLYVPRNWMLQEAIK
DAENNDFNKVRLLLEIFTKPYEANEEAEKLGYSSQPPSWSFGLKLSCSS
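
Consistency check (sketch): the record lists a CDS length of 1590, which is 530 codons, so a translation of 529 residues plus a terminal stop codon is expected. The following is a minimal Python sketch of how the nucleotide sequence could be checked against the protein sequence. It assumes the standard genetic code applies to this nuclear gene model; the names `CODON_TABLE` and `translate_cds` are hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: the standard code is appropriate for this gene model.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acids ('*' marks a stop)."""
    # Drop whitespace and line breaks so the sequence block can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    # First and last codons copied from the nucleotide sequence above.
    print(translate_cds("ATGGCCAAACCAATAAGT"))  # start of the CDS
    print(translate_cds("AGTTGTTCCAGTTAA"))     # end of the CDS, including the stop
```

Pasting the full nucleotide block into `translate_cds` and removing the trailing `*` should give a string that can be compared directly with the protein sequence listed above.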