DPGLEAN17101 in OGS1.0

New model in OGS2.0DPOGS202463 
Genomic Positionscaffold4192:+ 5391-10558
See gene structure
CDS Length1191
Paired RNAseq reads  3103
Single RNAseq reads  7914
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009981 (8e-142)
Best Drosophila hit  CG8963, isoform C (3e-16)
Best Human hitpolyadenylate-binding protein-interacting protein 1 isoform 1 (5e-11)
Best NR hit (blastp)  GA24791 [Drosophila pseudoobscura pseudoobscura] (1e-23)
Best NR hit (blastx)  GA24791 [Drosophila pseudoobscura pseudoobscura] (4e-23)
GeneOntology terms





  
GO:0016070 RNA metabolic process
GO:0005515 protein binding
GO:0042048 olfactory behavior
GO:0048854 brain morphogenesis
GO:0031987 locomotion involved in locomotory behavior
GO:0001964 startle response
GO:0002121 inter-male aggressive behavior
InterPro families

  
IPR016024 Armadillo-type fold
IPR016021 MIF4-like, type 1/2/3
IPR003890 MIF4G-like, type 3
Orthology groupMCL15825

Nucleotide sequence:

ATGAACAACGGGGACGTCGGAGGCTCTAGAGGCAGAGGCCGTGGTTGGAATTCCGACAAC
CAACCTCGCGAATTACGCCGGCCCAAAACTGTAACTGAGGAAGTGAAAGAACCTCCAAAG
TCAATACTGTCAGCTGAAGCGAAAGAATGGTATCCGCGGAACTACGTGCCCCAGAACCAA
GTGTCGTATGGGCAGGAGCACTACCGAGTGCCTCGCTATTCCGCTCAAGATAGAATCCGA
CAGGCTCAGGAACAGGATCCGTACAACTTTGAGGATATGTCATATTCTTTGGATGAACCC
GAAAGTGCCTTACGGGAAAATATAGCAAACCTGATTACTGTTATGTGTGAGATAACATTT
GACCCAGGCAAGTTTGACACTCTCTGTGGACCTCTGGTAGATTCATTTTATGCCACTCTA
CATGATGCTAACTACACCAGGCCTCTTGTCGAAGCTATCGTGAATCAGTCAATATTCGAG
GCCAACTTCCGCTACAATGGCGCCCGTCTTTGCTCGATGTACGACTCCGTCTCGCCTCCC
GAAGACTCAACATTCCGAGCCTGTCTGTTGGAACGTTGTACTGCCGAGGAGAACAAAATC
ATAAGTGGGGCAGAAACATCGGAGGAGAACGTCAGAGGTTTTGCTATGTTCCTGGCTGAG
ATATACACACAGCTGGAGGACAATCAGGGAGGAAGAATAAGGACTCTGGGTGAAAGTCTC
TGTAAAGTGTTCTTGCATCTTTTGGACACCGACAAAGAGGTCAACATAAAAGCGGTATGC
CAGTTGTTGAAATTGTCCGGTATAGCGCTGGACGCGGATTGTCCGTCTAGCATGCAGCAG
CTGTTCGATCGCTTGAAGCAACGTTCGGATCTGGCGAGCGTGCGTCACGTGGTGTCGCTG
AGGGCAACCCGTTGGGGTCTGGCTGACCCCGACCCGCCGGCCCCGCCCGCTGACAGACGA
CGAAACGCTAACTCCGAAGCCGACGGTGTTGGAGGTTATCTCGCAGACGGACATTCGCTA
ACTGCCGAGGAATGCGCCTTCTTGCAAAGCAACCTACCCCCAAAACCCGCGGCTATAGAG
GACGACATACTTGAGGAATTGGAAAATGATGCATGGGATACTGGCATGGATCCGGAAATG
CAGGCGGGCTTCCTAGAGTTCCTCAAGATATCCAATCAAATCAAACGATAG

Protein sequence:

MNNGDVGGSRGRGRGWNSDNQPRELRRPKTVTEEVKEPPKSILSAEAKEWYPRNYVPQNQ
VSYGQEHYRVPRYSAQDRIRQAQEQDPYNFEDMSYSLDEPESALRENIANLITVMCEITF
DPGKFDTLCGPLVDSFYATLHDANYTRPLVEAIVNQSIFEANFRYNGARLCSMYDSVSPP
EDSTFRACLLERCTAEENKIISGAETSEENVRGFAMFLAEIYTQLEDNQGGRIRTLGESL
CKVFLHLLDTDKEVNIKAVCQLLKLSGIALDADCPSSMQQLFDRLKQRSDLASVRHVVSL
RATRWGLADPDPPAPPADRRRNANSEADGVGGYLADGHSLTAEECAFLQSNLPPKPAAIE
DDILEELENDAWDTGMDPEMQAGFLEFLKISNQIKR