DPGLEAN09794 in OGS1.0

Genomic Positionscaffold5158:- 7942-14193
See gene structure
CDS Length3042
Paired RNAseq reads  809
Single RNAseq reads  3195
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014395 (2e-44)
Best Drosophila hit  ND
Best Human hitpiggyBac transposable element-derived protein 3 (3e-31)
Best NR hit (blastp)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (2e-67)
Best NR hit (blastx)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (7e-69)
GeneOntology terms



  
GO:0005044 scavenger receptor activity
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0016020 membrane
InterPro families  IPR000477 Reverse transcriptase
Orthology groupMCL10014

Nucleotide sequence:

ATGGTCAGTTTCGATGTACAGTCATTATTCACTAGTATACCTGTTCTTGACTGCATTGAG
ATTGTAAGAGGTAAGTTAAAGGATAACAATATGCCTATAGAATATGCAGAGCTATTAAAG
CATTGCCTAACATCTGGCTACCTCATGTGGAAGGATGAATTCTACATACAAGTAGATGGA
GTTGCAATGGGTTCACCGGTTTCCCCCGTTGTCGCTGACATATTCATGGAGGACTTCGAG
GTGCGAGCCCTTTGCTCTCCTCCTATAAGACCTTTAATTTATAAACGGTATGTAGATGAC
ACCTTCACAATATTAAATAAAAATAAAACATCTGCTTTTCTGAACCATCTCAATTCTATC
AATAGTAAGATTCAGTGTACTATAGAATTGGAGGCAAATAATTCTTTAGCTTTCCTTGAT
ATACTTGTTGTTAGGAATCCTGACAATACTTTGGGACATACTGTTTATAGGAAACCCACA
CATACGGACAGGTACCTCAATGGTTACTCACACCACCACCCTATCCAGTTAGCTACCGTT
GGCAAATCTTTGTTACAGAGAGCCCAACATCTTTGTGATGCTGACCACCTAGAGGCCGAG
CTGCAGCATGTAAAACATGCTCTCACTATCAACAACCTGCCCGTGCCTCGCCAGCATCGC
AAGAAGCACCTGAAGCCACCCACAGTTGAACGACAACCTGCGATACTACCATATGTGAAG
GGAGTTACTGACAGAATAGGCAACATCTTGAAGAAGGTTTCCATTAAAACTATTTACAAA
CCACATAAGAAAGTGAGCCAATTCTTGAGACCAATCAAGAGTAACATTCCTTTACAACAA
GCGGGTGTATACAAACTCGACTGTGACTGTGTCTTGTCATACATTGGACAGACGAAGAGG
AGCATCGGTACAAGGGTTAAGGAACACATCTCAGATATCAAAAACAGGCGCGCGTCGAAG
TCAGCAGTGTGTGAACACACAATGGACAAACCAGGCCACTACATTCGTTTTGATAAACCT
CAAATCCTCGCTCGGGAAGACAAGTATATACCGAGATTAATTCGCGAGGCTATTGAAATT
AAAAAACATCCCAATTTCAATAGAGAAGATGGCTGGAATCTATCAAACACCTGGGACCCC
GTTCTTAAAAATATAAAATCCCATGTCCGAAACCACACCGCAGGACCTCAAGACACCGTG
AGCGCATTCTGCCGGCATCCAGAGCGGTACGCCAGAAAATTAAGAAATCGATGGCGTATA
GTCTTGACATCAACGAAGCTAGTGGCCCGGGTGGCATTCCAGTTCCATTTCAATTGCCTA
CGTATGAGCGAGGGGTGGGAATTAGGAGAAGGTAGGAAAGGGTGGGAACAGGGAAAGGAC
AACGACGACGAAACTAGAGGGTGGTACTTCCCCGGTCGAGCCAGAATATGGTCACAGTGT
AGTTACTTTACCACAGCTAGGCTCTACCACCTCAGAAATGATGAGTTAAATTGCAGGTGC
AGCGACGAGAAGACTTCGCCTAATCGCGCAAGTAAAATTATTGCTTTAGTGCCGAATGAA
CATGCACCTTCCGATGCTAGCGAAATATCAGATTCGGAAACTGAATTGCATGATGCACGT
ACTAATTCTTCCACCCCCCTGTCTTCTCCTGCACCATCTATTGCATCTTCCCTTGCTAAC
ATAACAATAGATACATCCGACGATGAAGAATTACACTCCGAAGATGCTGATATTATACCA
GATTCTATCGTGCGTGAAATTGGCGAGTCCATTTATGAAAATGTTCCATCCCTATCTGCT
ATTCCTTCATTGCCTGCAACGCCTATAACACCAATAGCTTCTTCATTAGCACCCCGAAAA
ACAAGATCTAGGAAACCACCCACAGTCAAATCCAAAAGACCAAAAAAAAACAAAAAAATT
CAGCTGACTTACAATTGGAGAGTAGCTCAGTTCCGTCACCAAGCGACAATAGAAGAGGAT
GAGGAGTATGATTACATAGACCTGCCAGAAGATGATTCACCTTTGAGTTTCATTCACCTT
TTTTTTTCTCAAGATATGTTTACTAGTATCGTAGAGCAGACAAACTTATATTCAGTTCAA
CAAACTGGCAAGTCAATTCAACTAACAGACGAAGAATTTAGAGACTTTTTAGCCATCCAT
ATACTTATGGGTATTGTTGTAATGCCTTCATATCTTGATTATTGGTCAGAAAAATTTAGA
TATGGTAATGTCGCAGATATAATGTCTTTGAAAAGATATCAGCTGATACGAAGACATTTA
CATTTTATTGACAATACAATGGACGATGGAGACAAATATTTTAAAGTTCGTCCAATCATA
GAGAATTTACGACAAAATTGTCTGAAACAACAGAAAAAGGAGACAAAATTTAGTATAGAT
GAAATGATGATAGCATATAAAGGGACTAAGGCCGGTAAAAGAAAACAATACATGAAAGAT
AAACCTAACAAATGGGGATTTAAAAATTATGTAAGGGCCGGAGCTTCCGGAATAATTTAT
GATTTTGTGCTGTATGGAGGTGACGATACTTTTCGCAACCACAAGTTCACTGAAGAAGAA
TTGTCTCTAGGTTTTGGTGCTCAGGTGGTTATAGCATTATGTCAAAGTATTTGTCAAAAA
CCATCTTTCGTGTTTTGCGATAACTTTTTTTCTTCACCTGAGTTGCTATTTATTCTCCGA
GAAAACTATGGTGTTTTTGCCTTAGGCACTATTCGAGGCAATCGCCTTCGAGGAGCAGAA
AAAGTTTTACCTACTGAAAAAGCCATGAAAAAGAAACCTCGAGGACATTTTGTAGAAGCT
ATATGTGATAAAAACCGCTTAGCTGTTGTTCGTTGGAATGATAATAAGGCTGTCACATTT
ATTAGTTCCTTTGTAGCAAGTGAGCCTATAGAAAAAATACGCCGATATTCCAAAGACGCA
AAAGCCAAAATCGATGTCCATGCTTATGCTTTTTGGGTGGGCGATGGAAACGATGACCAC
TTTGGCGAGGAAACCACACAGAACACGACCATGGGGATCTAA

Protein sequence:

MVSFDVQSLFTSIPVLDCIEIVRGKLKDNNMPIEYAELLKHCLTSGYLMWKDEFYIQVDG
VAMGSPVSPVVADIFMEDFEVRALCSPPIRPLIYKRYVDDTFTILNKNKTSAFLNHLNSI
NSKIQCTIELEANNSLAFLDILVVRNPDNTLGHTVYRKPTHTDRYLNGYSHHHPIQLATV
GKSLLQRAQHLCDADHLEAELQHVKHALTINNLPVPRQHRKKHLKPPTVERQPAILPYVK
GVTDRIGNILKKVSIKTIYKPHKKVSQFLRPIKSNIPLQQAGVYKLDCDCVLSYIGQTKR
SIGTRVKEHISDIKNRRASKSAVCEHTMDKPGHYIRFDKPQILAREDKYIPRLIREAIEI
KKHPNFNREDGWNLSNTWDPVLKNIKSHVRNHTAGPQDTVSAFCRHPERYARKLRNRWRI
VLTSTKLVARVAFQFHFNCLRMSEGWELGEGRKGWEQGKDNDDETRGWYFPGRARIWSQC
SYFTTARLYHLRNDELNCRCSDEKTSPNRASKIIALVPNEHAPSDASEISDSETELHDAR
TNSSTPLSSPAPSIASSLANITIDTSDDEELHSEDADIIPDSIVREIGESIYENVPSLSA
IPSLPATPITPIASSLAPRKTRSRKPPTVKSKRPKKNKKIQLTYNWRVAQFRHQATIEED
EEYDYIDLPEDDSPLSFIHLFFSQDMFTSIVEQTNLYSVQQTGKSIQLTDEEFRDFLAIH
ILMGIVVMPSYLDYWSEKFRYGNVADIMSLKRYQLIRRHLHFIDNTMDDGDKYFKVRPII
ENLRQNCLKQQKKETKFSIDEMMIAYKGTKAGKRKQYMKDKPNKWGFKNYVRAGASGIIY
DFVLYGGDDTFRNHKFTEEELSLGFGAQVVIALCQSICQKPSFVFCDNFFSSPELLFILR
ENYGVFALGTIRGNRLRGAEKVLPTEKAMKKKPRGHFVEAICDKNRLAVVRWNDNKAVTF
ISSFVASEPIEKIRRYSKDAKAKIDVHAYAFWVGDGNDDHFGEETTQNTTMGI