DPGLEAN10956 in OGS1.0

New model in OGS2.0DPOGS210829 
Genomic Positionscaffold944:+ 1391-3457
See gene structure
CDS Length2067
Paired RNAseq reads  1730
Single RNAseq reads  4628
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014395 (1e-103)
Best Drosophila hit  ND
Best Human hitpiggyBac transposable element-derived protein 1 (1e-32)
Best NR hit (blastp)  PREDICTED: similar to predicted protein [Hydra magnipapillata] (2e-153)
Best NR hit (blastx)  PREDICTED: similar to predicted protein [Hydra magnipapillata] (5e-135)
GeneOntology terms



  
GO:0005044 scavenger receptor activity
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0016020 membrane
InterPro families  ND
Orthology groupMCL19305

Nucleotide sequence:

ATGGCAAACCAAATGCGTAAAGATTTCATAAAATTATCTGAAATGGGTGAAGAGGAGATC
TGGCAATTGCTAGATAATATTCCTACTGATGATGAAGGGACAGACGATGACAACGATGAC
GACGTTGATAGCGATCAAGGCGCCCCAAATCTTGATTTCATGCATACTGAAGATGAATTG
GAACCACTCACTGACGAATCTCAGAAAACGGAAATACCTACTAATACAACAGAATCGGGC
AGTGCAGTGTCAACAATTTGTCCGATTGTTCCAGAGCAACTTGATCAAAATGAAGAAATA
ATTTCGCTTATGCCTGACCACACAGAAGAAATTGCTGTACAGGAATCCGCGAAACCCTAT
CGACGCCGCAAACGTCCTCGTACCCCGGAGCCAACAGAAGAAGAAGACGGTCCCGTTGTT
CAAGCTCTCGGTGTAGTGGATGATGTTGCAGACATGAAAAATGATTCTCCACAATTCAAA
TCTATTGTTTGGAAAAAAAAGAATCTTCACCTTCATGTAAATGAAGTGGTTTTTAGAGGT
CAAAAAGAATTACCGGAGACGATTACCAGATTGGACACACCTTATAAATGTTTCCGTTAC
TTTATGAACGACGCCCTGTTTGACCATCTTGTTGAACAATCAAATTTGTATGCAAGGCAA
AAGAACATAAGAACAAACTTCAGTGTCCAATCCGTTGATTTGCGAAAATTTGTTGGCATT
CTATTGTATATGTCGGTTTATCGCTATCCAAATGTGCGATCATATTGGGGAAATAATTCC
TTTGAGGCAATTCGCCAAACAATGCCCGTTTTGCGATTTGAAGCAATACGCCGGTACCTC
CATTATAACGACAATGCAGCAGTAGTTACACGAGGTGACCCAGGATATGACCGTCTTTAT
AAAGTTCGTCCCTTGGTAAAACATTTCAATGAAAGATTTCTATCAGTGCCCATGCCTTCT
AGGCTGTGCGTAGATGAACAAATGTGTGCCACAAAAATGACGGGATCCCATTTGCGCCAA
TATATGCCCAATAAGCCACATAAGTGGGGCTTCAAATTTTTTTGTCTTTGTGATACTTCC
GGATTTTCGTACTCTTTCGAAGTATACACTGGTGCCGGAGATAACGTGATTTTTGATGGT
ATGCCAGATCTTGGGGCTGCGTCAAATGTTGTAGTTCGCTTGTCAAAACAAATACCAAAT
TTCGTAAATCACATCCTATACTTCGATAATTTCTACACGTCCCTTGGCCTGCTTACGTAT
CTCCGAAGTAGAGGAATTTACAGTTTGGGAACTGTGCGAGTAAACAGAGTACCCAACTGT
AAATTGTCTAGCGATGCAATTTTGCAACAGAAAAAGGTTGATCGTGGTTACTCAGAAGAG
TTTGTAGGTACTGCATATGGTATTGATATATCCTCTGTGCTATGGAATGATACGAAAACT
GTGCGCCTATTGTCTACCTACGTTGGAGTAAAACCATTTGCGTCTAAAAACATAAACAAA
CAGATTTCAAAAGTAACACGTTGGGATAGAAAAAAGAAAACCCACTATGACATTGACTGT
CCACAAATCATCAAAGAATATAATCGGCATATGGGGGGTGTCGATTTGATGGATGGCTTA
TTAGGCCGTTATCATATTCGTATGAAAACCCGGAAATGGACCAACCGAATTTTTTATCAT
ATGGTCGACGTGGCAATGGTGAATGCTTATATACTTTATCATCGGTTGCATCCCCATGCA
GATAAAATTGAGTTGCCAACGTTCAGAACACAAGTCGCAGAATCACTCTGCGTGTGCGGC
ACTATTCCAGTAAAACGAAGCGTTGGCCGACCATCCAATACGACACCGCCACCAAAGATA
CCAACAGCGAAACGAGCCTATCTGCCAACCGATGATATTCGTTATGACCAAATTGGCCAC
TGGTGCGTTTTTAGGGATCGGTCTGGCAAGAAGCAGTGCAAATACCCTAAATGTAAATCG
GAAACTCAAGCATACTGCACTAAATGCAATCTATCTTTGTGCAGTTCAACAACAAAGACA
TGCTTTTATGATTTTCATAACAAATAG

Protein sequence:

MANQMRKDFIKLSEMGEEEIWQLLDNIPTDDEGTDDDNDDDVDSDQGAPNLDFMHTEDEL
EPLTDESQKTEIPTNTTESGSAVSTICPIVPEQLDQNEEIISLMPDHTEEIAVQESAKPY
RRRKRPRTPEPTEEEDGPVVQALGVVDDVADMKNDSPQFKSIVWKKKNLHLHVNEVVFRG
QKELPETITRLDTPYKCFRYFMNDALFDHLVEQSNLYARQKNIRTNFSVQSVDLRKFVGI
LLYMSVYRYPNVRSYWGNNSFEAIRQTMPVLRFEAIRRYLHYNDNAAVVTRGDPGYDRLY
KVRPLVKHFNERFLSVPMPSRLCVDEQMCATKMTGSHLRQYMPNKPHKWGFKFFCLCDTS
GFSYSFEVYTGAGDNVIFDGMPDLGAASNVVVRLSKQIPNFVNHILYFDNFYTSLGLLTY
LRSRGIYSLGTVRVNRVPNCKLSSDAILQQKKVDRGYSEEFVGTAYGIDISSVLWNDTKT
VRLLSTYVGVKPFASKNINKQISKVTRWDRKKKTHYDIDCPQIIKEYNRHMGGVDLMDGL
LGRYHIRMKTRKWTNRIFYHMVDVAMVNAYILYHRLHPHADKIELPTFRTQVAESLCVCG
TIPVKRSVGRPSNTTPPPKIPTAKRAYLPTDDIRYDQIGHWCVFRDRSGKKQCKYPKCKS
ETQAYCTKCNLSLCSSTTKTCFYDFHNK