DPGLEAN22305 in OGS1.0

New model in OGS2.0DPOGS200790 
Genomic Positionscaffold1325:- 22311-23480
See gene structure
CDS Length1170
Paired RNAseq reads  564
Single RNAseq reads  1314
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011898 (2e-103)
Best Drosophila hit  CG33523, isoform D (1e-45)
Best Human hitmotile sperm domain-containing protein 2 isoform 1 (8e-32)
Best NR hit (blastp)  PREDICTED: similar to AGAP005556-PA [Tribolium castaneum] (5e-80)
Best NR hit (blastx)  PREDICTED: similar to AGAP005556-PA [Tribolium castaneum] (2e-69)
GeneOntology terms

  
GO:0008150 biological_process
GO:0005575 cellular_component
GO:0005198 structural molecule activity
InterPro families

  
IPR001251 Cellular retinaldehyde-binding/triple function, C-terminal
IPR008962 PapD-like
IPR000535 Major sperm protein
Orthology groupMCL13531

Nucleotide sequence:

ATGCTTTACGAGATCATGGTCTGGAGAAAAAAGATGAACGCCAACGAAATCAACGAGAAC
ACCGTTAATTTGGATTACCTGAAAGAAGGTATCTTCTTCCCGCAAGGCAGAGACATCGAT
AGCTGTTTGCTGTTCATCATGAAGTCGAAACTGTACCATAAAGGACAGAAAAACGTAGAC
GAAGTCAAGAAAATTATCATATACTGGTTGGAGAGAATCGAAAGAGAGGAAGACGGCAAG
AAAATTACTCTCTTCTTTGATATGGACGGCTGTGGTCTCAACAACATGGATATAGAGATT
ATCATGTACATGGTTACGTTATTAAAGAATTATTATCCTAATCTTATTAATTACATCATC
ATATTCCAACTGCCCTGGATGCTGTCGGCCGGGTTTAAGATTGTCAAGGGCATCCTTCCC
GCCGAAGCCATCGAGAGACTGAGAACAGTGAATAAAGATAAGCTGAAAGAGTTAGTGGCT
CCGGAACAGGCGTTAGTCAGTTGGGGCGGCAAAAACGAGTATGTATTCAATTTCTTTCCA
GAAAATAGGATCAGTGTTGATAACACCAGCAAATCCTCGACCCTTGATAGTCAACATTCT
TTGGGTGAAATGTTGAGCTTGAACCCGGGTAAGTTATTAATATTTAAGGTCGAAAATGAC
AGGATATGTGCTCAATTAACGATAACAAACATGGATGACAGTGTTATATCATTTAAAATA
AGAACAACTGCACCAGAAAAGTATGTCGTTAAACCAAGCTCAGGTATTTTAACGAGCAAA
GCATCACAGACTATTCAAATACAAGTAAACTCGGGGTTCCAAATCAACTCGGTGGAAAAG
GACAGGTTTCTGGTGGTGTCGATGCAGATACCGAGTGCTGATATATCGGCCAAAGAGATC
AGTGAAATGTGGAAAACCATTGGCTGCAAGGCCGACGAGTACAGACTGAAGTGTTCAACA
GTCAATATGTTGAAGTCGGAGCCAATACAGGAGAAACCGAGCCACGAGCATGACTCTATA
ATGTATAAATTGAACAATCTTCAAAACAATCACAAGATGCTGGTGAAGAACATCAAAACC
CTGAGGATGTACCAGTATGCGACCTTATTCCTGACATTTCTCAGCCTGAGTCTGTGTTAT
GTAACATACAACAAGGATTGCCAGCTATAA

Protein sequence:

MLYEIMVWRKKMNANEINENTVNLDYLKEGIFFPQGRDIDSCLLFIMKSKLYHKGQKNVD
EVKKIIIYWLERIEREEDGKKITLFFDMDGCGLNNMDIEIIMYMVTLLKNYYPNLINYII
IFQLPWMLSAGFKIVKGILPAEAIERLRTVNKDKLKELVAPEQALVSWGGKNEYVFNFFP
ENRISVDNTSKSSTLDSQHSLGEMLSLNPGKLLIFKVENDRICAQLTITNMDDSVISFKI
RTTAPEKYVVKPSSGILTSKASQTIQIQVNSGFQINSVEKDRFLVVSMQIPSADISAKEI
SEMWKTIGCKADEYRLKCSTVNMLKSEPIQEKPSHEHDSIMYKLNNLQNNHKMLVKNIKT
LRMYQYATLFLTFLSLSLCYVTYNKDCQL