New model in OGS2.0 | DPOGS200790  |
---|---|
Genomic Position | scaffold1325:- 22311-23480 |
See gene structure | |
CDS Length | 1170 |
Paired RNAseq reads   | 564 |
Single RNAseq reads   | 1314 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011898 (2e-103) |
Best Drosophila hit   | CG33523, isoform D (1e-45) |
Best Human hit | motile sperm domain-containing protein 2 isoform 1 (8e-32) |
Best NR hit (blastp)   | PREDICTED: similar to AGAP005556-PA [Tribolium castaneum] (5e-80) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP005556-PA [Tribolium castaneum] (2e-69) |
GeneOntology terms    | GO:0008150 biological_process GO:0005575 cellular_component GO:0005198 structural molecule activity |
InterPro families    | IPR001251 Cellular retinaldehyde-binding/triple function, C-terminal IPR008962 PapD-like IPR000535 Major sperm protein |
Orthology group | MCL13531 |
Nucleotide sequence:
ATGCTTTACGAGATCATGGTCTGGAGAAAAAAGATGAACGCCAACGAAATCAACGAGAAC
ACCGTTAATTTGGATTACCTGAAAGAAGGTATCTTCTTCCCGCAAGGCAGAGACATCGAT
AGCTGTTTGCTGTTCATCATGAAGTCGAAACTGTACCATAAAGGACAGAAAAACGTAGAC
GAAGTCAAGAAAATTATCATATACTGGTTGGAGAGAATCGAAAGAGAGGAAGACGGCAAG
AAAATTACTCTCTTCTTTGATATGGACGGCTGTGGTCTCAACAACATGGATATAGAGATT
ATCATGTACATGGTTACGTTATTAAAGAATTATTATCCTAATCTTATTAATTACATCATC
ATATTCCAACTGCCCTGGATGCTGTCGGCCGGGTTTAAGATTGTCAAGGGCATCCTTCCC
GCCGAAGCCATCGAGAGACTGAGAACAGTGAATAAAGATAAGCTGAAAGAGTTAGTGGCT
CCGGAACAGGCGTTAGTCAGTTGGGGCGGCAAAAACGAGTATGTATTCAATTTCTTTCCA
GAAAATAGGATCAGTGTTGATAACACCAGCAAATCCTCGACCCTTGATAGTCAACATTCT
TTGGGTGAAATGTTGAGCTTGAACCCGGGTAAGTTATTAATATTTAAGGTCGAAAATGAC
AGGATATGTGCTCAATTAACGATAACAAACATGGATGACAGTGTTATATCATTTAAAATA
AGAACAACTGCACCAGAAAAGTATGTCGTTAAACCAAGCTCAGGTATTTTAACGAGCAAA
GCATCACAGACTATTCAAATACAAGTAAACTCGGGGTTCCAAATCAACTCGGTGGAAAAG
GACAGGTTTCTGGTGGTGTCGATGCAGATACCGAGTGCTGATATATCGGCCAAAGAGATC
AGTGAAATGTGGAAAACCATTGGCTGCAAGGCCGACGAGTACAGACTGAAGTGTTCAACA
GTCAATATGTTGAAGTCGGAGCCAATACAGGAGAAACCGAGCCACGAGCATGACTCTATA
ATGTATAAATTGAACAATCTTCAAAACAATCACAAGATGCTGGTGAAGAACATCAAAACC
CTGAGGATGTACCAGTATGCGACCTTATTCCTGACATTTCTCAGCCTGAGTCTGTGTTAT
GTAACATACAACAAGGATTGCCAGCTATAA
Protein sequence:
MLYEIMVWRKKMNANEINENTVNLDYLKEGIFFPQGRDIDSCLLFIMKSKLYHKGQKNVD
EVKKIIIYWLERIEREEDGKKITLFFDMDGCGLNNMDIEIIMYMVTLLKNYYPNLINYII
IFQLPWMLSAGFKIVKGILPAEAIERLRTVNKDKLKELVAPEQALVSWGGKNEYVFNFFP
ENRISVDNTSKSSTLDSQHSLGEMLSLNPGKLLIFKVENDRICAQLTITNMDDSVISFKI
RTTAPEKYVVKPSSGILTSKASQTIQIQVNSGFQINSVEKDRFLVVSMQIPSADISAKEI
SEMWKTIGCKADEYRLKCSTVNMLKSEPIQEKPSHEHDSIMYKLNNLQNNHKMLVKNIKT
LRMYQYATLFLTFLSLSLCYVTYNKDCQL