New model in OGS2.0 | DPOGS203592  |
---|---|
Genomic Position | scaffold783:64428-69864 (- strand) |
CDS Length | 1737 |
Paired RNAseq reads   | 1259 |
Single RNAseq reads   | 3121 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA001350 (5e-139) |
Best Drosophila hit   | CG11593 (8e-53) |
Best Human hit | protein prune homolog 2 (7e-43) |
Best NR hit (blastp)   | GF24411 [Drosophila ananassae] (4e-65) |
Best NR hit (blastx)   | GD13300 [Drosophila simulans] (2e-56) |
GeneOntology terms    | GO:0005575 cellular_component; GO:0008150 biological_process; GO:0003674 molecular_function |
InterPro families    | IPR001251 Cellular retinaldehyde-binding/triple function, C-terminal; IPR022181 Bcl2-/adenovirus E1B 19kDa-interacting protein 2 |
Orthology group | MCL18983 |
Nucleotide sequence:
ATGAGCTGCCTCCAGTCACCGAGGCTGAGTGACGACGACCTCCTCAGTTCACCATCCTCA
GATGAAAAGATGGACACAGCATCGGAGTCGTTCCGCTCCACACAGCACACACCCAACTCG
ATACCCGACGTGGACCACGAGAATGATGTGCTGAGGCTACGGAACCCCGACATCATAGAC
ACCGACCAAATCCTAGACAATAAAGTCACCAAGTTGAAGTCCGAACTGAACGATAGCTTT
ATATCACGCTTCACCACACTCACGCTGAGCTCTCCCGACAACAAGGCCAGGAATATAGGT
ATAAGCACTCCCAAATATAGCTCAACGCTAACTTTAACGACCAACCACCCTCTCATGGGC
TTGGCCAGTCCGGACGAGAGCTGTCCCGTCACGAAGAAGACTCACAGGACAGAAGAAGAG
AAAGACTTGAGCTTATCCAGTATAGAAGATGATAGAGGAACAACGAACAGATACGAGTTC
TACACGCCACAGAGGAATACTTCGAAGAATAATCTTTACTTTTCCGAAAACTACGTCACA
GCGGACAGTCAGTTCCTGAGCGAGTCTTTAAACATGACAGCGGAGGCTAAAGATCAGCAC
CAGAAAAGAATTATAATACCAAGGTTCCTGGACACCCCTGTCAAAGATACCACCCAGCAG
AATCTAAGCGTGTTCCTGAATTCGGGGAGGGAGGGGAGAGACATAAAGAAGTACACAGAA
GAAAGGGGAGAGAGAACAGCCTTGGATGTTATAGATAAAGAAGATTTTCATTCCGGGGTT
GAATATCGAGACTCCCCTCCGCGGACAGTTCCTAGCTTCGACCTACCTATTGAAATGTCC
AGTAACATAGGCATGCATAATGGCGTGGAATCCCCCGACGTGGAGTCATTATCGACCCAG
GAAGACAGGTCCGCGTACCAAGCCTTACTGGACCCCTACACCGGGTCAGTGGCCTTGAGA
CACACCACGCACAGGAGCAACCCGCCGAGAAGAAAAGTACAATTGCCGCCGGAGGATGAC
GAGTGTAGCCTGGACAGCGTCAGCGGCGGTTCCCTGGAGTCCGAGGACGAGCCGCCGCCC
GTAGAGAGTGCGCCCGACCCTCACAGCGAGGACGACACCAAGAGGTCCAAGTCCACGAAC
ACGGTGAGCGAGTGCAGCGACCCCATACCTGAATACTCAGCGGCCGAGGAGTTCCGCGAG
GAGCGCTCCTGGCTCAGCGTCACACACGGGGGAGGCCGCGCCGTCTGTGATATGAAGGTC
ATAGAGCCGTTCAAGCGCGTGGTGTCCCACGGCGGATACGAGGAGGGAGGCGCCGCCCTC
ATCGTGTTCAGCGCTTGCCACCTCCCGGACACCAGGCGCCCCGACTACAGATACGTCATG
GACAACCTGTTCTTGTATGTGATGTGGAGCCTGGAGCGGTTGGTGACGGACGAGTACGTG
CTAGTGTACCTGCACGGGAGCGCCGGCAGACGGAGGATGCCCACCTTCGCCTGGCTGCAC
GAGTGTTACAAGCTGGTGGACAGACGGTTGAGGAAGAGTCTGAAGCGCCTGTACCTGGTG
CACCCCACGTTCTGGTTGAAGTCGTTCGTCGTCATCACCAAGCCTTTCGTCAGTTACAAG
TTCTTCCGGAAGCTGTCCTACGTGGAGAGTCTGAAGGAGCTGTTCCGCCTGGTGCCGGTG
GAGCCCAACGCGATACCCGACCTCGTGAAGGAGTACGACGACCACAGGAAGAAATAA
Protein sequence:
MSCLQSPRLSDDDLLSSPSSDEKMDTASESFRSTQHTPNSIPDVDHENDVLRLRNPDIID
TDQILDNKVTKLKSELNDSFISRFTTLTLSSPDNKARNIGISTPKYSSTLTLTTNHPLMG
LASPDESCPVTKKTHRTEEEKDLSLSSIEDDRGTTNRYEFYTPQRNTSKNNLYFSENYVT
ADSQFLSESLNMTAEAKDQHQKRIIIPRFLDTPVKDTTQQNLSVFLNSGREGRDIKKYTE
ERGERTALDVIDKEDFHSGVEYRDSPPRTVPSFDLPIEMSSNIGMHNGVESPDVESLSTQ
EDRSAYQALLDPYTGSVALRHTTHRSNPPRRKVQLPPEDDECSLDSVSGGSLESEDEPPP
VESAPDPHSEDDTKRSKSTNTVSECSDPIPEYSAAEEFREERSWLSVTHGGGRAVCDMKV
IEPFKRVVSHGGYEEGGAALIVFSACHLPDTRRPDYRYVMDNLFLYVMWSLERLVTDEYV
LVYLHGSAGRRRMPTFAWLHECYKLVDRRLRKSLKRLYLVHPTFWLKSFVVITKPFVSYK
FFRKLSYVESLKELFRLVPVEPNAIPDLVKEYDDHRKK
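
The nucleotide and protein listings above can be cross-checked with a short script. What follows is a minimal sketch, not part of the original record: it assumes the standard genetic code (NCBI translation table 1), the helper name `translate` is hypothetical, and the protein length of 578 residues is counted from the listing rather than stated in the record. Only the first and last lines of the nucleotide sequence are translated here; the same function should apply to the full CDS.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record does not state which translation table was used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS length from the record; protein length counted from the listing.
CDS_LENGTH = 1737
PROTEIN_LENGTH = 578
# 578 residues plus one stop codon account for all 579 codons.
assert CDS_LENGTH == 3 * (PROTEIN_LENGTH + 1)

# First and last lines of the nucleotide listing (both start in frame,
# since every preceding line holds 60 bases).
first_line = "ATGAGCTGCCTCCAGTCACCGAGGCTGAGTGACGACGACCTCCTCAGTTCACCATCCTCA"
last_line = "GAGCCCAACGCGATACCCGACCTCGTGAAGGAGTACGACGACCACAGGAAGAAATAA"

print(translate(first_line))  # MSCLQSPRLSDDDLLSSPSS
print(translate(last_line))   # EPNAIPDLVKEYDDHRKK*
```

Both translated fragments agree with the start and end of the protein listing, and the final `TAA` codon accounts for the three bases by which the CDS length exceeds three times the protein length.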