New model in OGS2.0 | DPOGS206808  |
---|---|
Genomic Position | scaffold1:- 3613523-3615647 |
See gene structure | |
CDS Length | 1626 |
Paired RNAseq reads   | 321 |
Single RNAseq reads   | 1252 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000630 (0.0) |
Best Drosophila hit   | ND |
Best Human hit | WD repeat-containing protein 47 isoform 3 (9e-74) |
Best NR hit (blastp)   | PREDICTED: similar to AGAP006111-PA [Tribolium castaneum] (4e-163) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP006111-PA [Tribolium castaneum] (1e-160) |
GeneOntology terms   | GO:0005515 protein binding |
InterPro families   | IPR006595 CTLH, C-terminal LisH motif |
Orthology group | MCL17560 |
Nucleotide sequence:
ATGCCATCGACACACCTTAGCCTCCGCGAAGACGACGTAGTGCGTCTAGCATTGGAGTTC
CTAAACTTGCGCGACTTGCATATCACCCAGCTGTCACTAGAGCGAGAAACCGGTGTCATC
AACGGCAATTACGCTGACGACGTACTGTTTCTGCGGCAACTGATTCTAGATGGACAGTGG
GATGATGTACTCGAGTTCATACAACCACTCAGCGCTCTGAAAGCATTTGAAGCTGATAAA
TTCAATTACGCAATTCTTAGACATAAATATATAGAACTTCTTTGCATTAGATCCGAGATA
AAGGCTTACAACAATGTTGAAAACATCGTTGATGAAGTCGTCAAAGTTCTTAGCGACTTA
GAAAAATTAGCTCCTTCTAAAGAAGAATATTCTAATTTGTGTCTTCTGCTTACACTTCCT
AGTATAACCGACCATACTCAATTTAAAAAATGGAACCCGAGTAATGCGAGGGTCCAATGT
TTCAGAGAAGTATATCCTCTGGTAGAACAATTTTTGCCTTGCGATAAAAAATCCTTTTCA
AGTGGAACAGCGCCAAAAAGTGCGAAAAACGATCGACTCATGCAATTACTTATAAAGGGG
ATTTTATACGAATCTTGCGTTAATTACTGCCAGGCCAAAGCCACTGGATCCAAAGAAGCA
CAGACAAATGAAGTAAACTTTTCCCGCCTATTAGATGGTTCCGGATTTGACGAGTCGGAC
TTGAGTTTGTTGTCGTGGCTTCAGTCAATCCCGGCGGAGACTTTCAGTGTACCTTTCGAG
CAACGAATGTTAAATGTTGACGTAGAAAAGTTAGAGCGACCGTCATTGCATACTTCTTGG
ACTGAACATATGCTTGTAACACCAATTAAACCGCGCACCTTTCCGCACTTGGCCATGCCA
TTTAGAAGACCTCGGTCTGCAGCAGATGCCATGACAAGATCTCTCCGACCTCTACCTGAC
GGAGCACCTCGGCATCCCGCGGTCATGGCTCTATCTGCAATCGACTACAGTCCCTCTTCC
TCTTATTTTGCTGGTTTCCATCTAACTGGAATTAAAGGCAACAAGCTAATGAACACGTCT
GTGGACAGATTGTTCGATAACAAAAACTGTGACAGTATTTACAACGGGCTAAATGAATCC
TTGAAACCCATCGAGGAGCAATTGAAACCCATCGGTGAATATACTAAGACCAACGAGAGC
AGTAGTATAGGAGCTTCCAACGCTACTACACCAGAGCATAGAGCACACGATTACTCAGCA
TCTACGTCCATATCAACGGCTGCAGTTTCAGAGCGACCTTCAACTGACGTCTTGCCCGGT
CCAGACGGGGACTTCCGTAGGGAATTACTCAAAGAATATCAGATGCAGAAGACGCGTGAG
TGCGATGATTACCTCCGTAATCTATGTTCTTCTGAAATCAGGACACAGAAACCGCAGGAG
CAAGTAGGTGACAGTGCATCCGCTTTTCCCTCTGCTGTGAAGTATCCTGCGCCTTCTAAT
GTGGAACAAAGTCAACCAATGCCAGACCATTCTAAGTACTTAAGAAAAAATGAGATCAAC
GAGTCTATTGAAGTGCCGGAAACGCCTAATTTACAGTTGAGTGACTGCTCTCCAAATATT
TTCTAA
Protein sequence:
MPSTHLSLREDDVVRLALEFLNLRDLHITQLSLERETGVINGNYADDVLFLRQLILDGQW
DDVLEFIQPLSALKAFEADKFNYAILRHKYIELLCIRSEIKAYNNVENIVDEVVKVLSDL
EKLAPSKEEYSNLCLLLTLPSITDHTQFKKWNPSNARVQCFREVYPLVEQFLPCDKKSFS
SGTAPKSAKNDRLMQLLIKGILYESCVNYCQAKATGSKEAQTNEVNFSRLLDGSGFDESD
LSLLSWLQSIPAETFSVPFEQRMLNVDVEKLERPSLHTSWTEHMLVTPIKPRTFPHLAMP
FRRPRSAADAMTRSLRPLPDGAPRHPAVMALSAIDYSPSSSYFAGFHLTGIKGNKLMNTS
VDRLFDNKNCDSIYNGLNESLKPIEEQLKPIGEYTKTNESSSIGASNATTPEHRAHDYSA
STSISTAAVSERPSTDVLPGPDGDFRRELLKEYQMQKTRECDDYLRNLCSSEIRTQKPQE
QVGDSASAFPSAVKYPAPSNVEQSQPMPDHSKYLRKNEINESIEVPETPNLQLSDCSPNI
F