DPGLEAN20235 in OGS1.0

New model in OGS2.0DPOGS203178 
Genomic Positionscaffold72:- 8554-12394
See gene structure
CDS Length1998
Paired RNAseq reads  171
Single RNAseq reads  530
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014395 (7e-55)
Best Drosophila hit  ND
Best Human hitpiggyBac transposable element-derived protein 3 (1e-36)
Best NR hit (blastp)  PREDICTED: similar to hCG32740, partial [Hydra magnipapillata] (1e-70)
Best NR hit (blastx)  PREDICTED: similar to hCG32740 [Hydra magnipapillata] (2e-67)
GeneOntology terms



  
GO:0005044 scavenger receptor activity
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0016020 membrane
InterPro families  ND
Orthology groupMCL10014

Nucleotide sequence:

ATGGCTTCGAAACAACAAAGTGTTGGCGGGACCCGGAACTTACAACTAACTCGCGCCCAT
AAGATCTTGGCTTTGGTACCCGCTCAAAACGCATGCTCTGACGATAGTTCCACATCTGAT
GAGGAAGCAAATGCTTATATACCGCCTTCACCAGATACCTCTGATCCACCTTCAATAGCA
TCATCACTATTAGCATCATCACTAGAAGCCCTTAATTTGCTTGATACTTCAACTGACTCC
AAAAACGAACAGCATGATATACCCATTACTTTACCACTCACACCAACATTTGATGAAGTT
TTTCCATCGCCGAATACAAGTAACTACAATGTTATACCCACATTGAACTCCATCCAATCT
TTGCCATCTCCTTCTGTTGAACTATCATCTCCCCAGCCTAGCACCAGCGGACAGCAGTTA
CTACTTACCAGGTCCAAGCGTAAAAAGACAACAAAGACTGTAGTGACCAAAAAAGTAAAG
GTTCAGAAATTCGCCCTAAATTATCAATGGTTAAAAGCAGTATTTCGTCACAATGTATCT
CTTGAAGAAAACTTATACAATCTCCATCAAACACAAATAGAGACAGTACTGGATTATTTT
TATTTCTTTTTTTCTCCTGACTTAATTACTGACATTGTTTATAATACAAATTTATATGCA
GTTGAACAACTAGGTCGATCTATTCAGCTGACAGAAGATGAGATTAAAAGCTTCCTGGCC
ATTCAAATCATGATGGGTATAGTACAGATGCCAGCTTATACTGACTATTGGGCAAGAAAA
ACAAGATACCCTCTCATTGCTGATCTTATGCCTCTCAAAAAGTATCAACAAATACGTCGA
TATATTCATTTTGTTGATAATACTTTGCAAGATTCAGACCGTTATTTCAAGATTCGCCCC
CTAATGGAGAAAATACGCAAAAATTTTCTGAAAATTGAAGAAGAAGGAAAATATTCCATT
GACGAGATGATGATACCTTATAAAGGTCGTAAAGCAGGTAAACGAAAGCAGTACATAAAA
ATGAAACCTAAGAAATGGGGGTTTAAGAATTTTGTCCGTGCGGGTGTTTCGGGTATCATC
TACGATTTTATTCTGTATGGTGGCGACGATACCTTTCGTGGACTGACTTTTTCAGAGAAA
GAAGCTACAATTGGTTTAGGAGGTATGGTAGTGCTTGCATTGTGTCAAACTATAAAGAAA
AAGCCGGCCATTGTGTATGCTGATAACTTTTTTATGTCGCCTGAACTAACATATATTCTG
CGGGAAGAATACGGGATCCTTAGTCTAGGAACAATAAGGACTAATCGTCTCAGAGGCTGT
CAAGAGTTATTGCCAACTGACAAACAATTAAAAAAGAAGAAACGCGGTTCTAGCGCCCAG
GTGGTTTGCAATAAGAATAAGTTGGCAGTCGTAAAGTGGAACGACAATAAAGTGGTTACA
CTTATTAGCACCTACATAGACTCGTACCCCTTAGAAACAATCAAACGATACGATAAGGAT
GAGAAAAAGAAAGTAGATGTAGAATGTCCTCAAGTGGTCAAACATTACAACAAACATATG
GGAGGGGTCGATTTAGCAGATATGTTGATATCGTTATATAGAACTCCCTTCAAAAGTCAC
CGTTGGTACTTGGGAATATTTTCACAACTTGTTGATATGTGTATAAATAACGCTTGGCTC
CTACATAGAAGAGATGGGAAGAAGACTTCATTGAAAGATTTCAGATTTGAATTGTTTGAT
GGGTTGTCTAAGTCTAATAGAATAGGAACAAACCAAAACGTTACAGACGATATAGGCGAG
AATCTGAAAATCCATAAACCAGTCTCAGTCCGACCAACTGATAGCGTCAGATTTGATAAC
ACAGGTCATCTTCCAGAAGCAGGTTGTCACCATAGCCACGTCCTGCAAACTTTTCGTCTA
GATGACAGCGTTATCTGCCAATCGTACATGTGGGAGACACTGGCAAACAGATCACTATGC
CTTAGGGACATAGAATAA

Protein sequence:

MASKQQSVGGTRNLQLTRAHKILALVPAQNACSDDSSTSDEEANAYIPPSPDTSDPPSIA
SSLLASSLEALNLLDTSTDSKNEQHDIPITLPLTPTFDEVFPSPNTSNYNVIPTLNSIQS
LPSPSVELSSPQPSTSGQQLLLTRSKRKKTTKTVVTKKVKVQKFALNYQWLKAVFRHNVS
LEENLYNLHQTQIETVLDYFYFFFSPDLITDIVYNTNLYAVEQLGRSIQLTEDEIKSFLA
IQIMMGIVQMPAYTDYWARKTRYPLIADLMPLKKYQQIRRYIHFVDNTLQDSDRYFKIRP
LMEKIRKNFLKIEEEGKYSIDEMMIPYKGRKAGKRKQYIKMKPKKWGFKNFVRAGVSGII
YDFILYGGDDTFRGLTFSEKEATIGLGGMVVLALCQTIKKKPAIVYADNFFMSPELTYIL
REEYGILSLGTIRTNRLRGCQELLPTDKQLKKKKRGSSAQVVCNKNKLAVVKWNDNKVVT
LISTYIDSYPLETIKRYDKDEKKKVDVECPQVVKHYNKHMGGVDLADMLISLYRTPFKSH
RWYLGIFSQLVDMCINNAWLLHRRDGKKTSLKDFRFELFDGLSKSNRIGTNQNVTDDIGE
NLKIHKPVSVRPTDSVRFDNTGHLPEAGCHHSHVLQTFRLDDSVICQSYMWETLANRSLC
LRDIE