DPGLEAN19546 in OGS1.0

Genomic Positionscaffold4921:- 152-2697
See gene structure
CDS Length1701
Paired RNAseq reads  720
Single RNAseq reads  3437
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001072 (3e-53)
Best Drosophila hit  CG33291 (3e-25)
Best Human hitankyrin repeat and BTB/POZ domain-containing protein 2 (1e-15)
Best NR hit (blastp)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (2e-67)
Best NR hit (blastx)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (2e-68)
GeneOntology terms


  
GO:0005575 cellular_component
GO:0008150 biological_process
GO:0003677 DNA binding
GO:0005515 protein binding
InterPro families  IPR000477 Reverse transcriptase
Orthology groupMCL10014

Nucleotide sequence:

ATGGTCAGTTTCGATGTACAGTCATTATTCACTAGTATACCTGTTCTTGACTGCATTGAG
ATTGTAAGAGGTAAGTTAAAGGATAACAATATGCCTATAGAATATGCAGAGCTATTAAAG
CATTGCCTAACATCTGGCTACCTCATGTGGAAGGATGAATTCTACATACAAGTAGATGGA
GTTGCAATGGGTTCACCGGTTTCCCCCGTTGTCGCTGACATATTCATGGAGGACTTCGAG
GTGCGAGCCCTTTGCTCTCCTCCTATAAGACCTTTAATTTATAAACGGTATGTAGATGAC
ACCTTCACAATATTAAATAAAAATAAAACATCTGCTTTTCTGAACCATCTCAATTCTATC
AATAGTAAGATTCAGTGTACTATAGAATTGGAGGCAAATAATTCTTTAGCTTTCCTTGAT
ATACTTGTTGTTAGGAATCCTGACAATACTTTGGGACATACTGTTTATAGGAAACCCACA
CATACGGACAGGTACCTCAATGGTTACTCACACCACCACCCTATCCAGTTAGCTACCGTT
GGCAAATCTTTGTTACAGAGAGCCCAACATCTTTGTGATGCTGACCACCTAGAGGCCGAG
CTGCAGCATGTAAAACATGCTCTCACTATCAACAACCTGCCCGTGCCTCGCCAGCATCGC
AAGAAGCACCTGAAGCCACCCACAGTTGAACGACAACCTGCGATACTACCATATGTGAAG
GGAGTTACTGACAGAATAGGCAACATCTTGAAGAAGGTTTCCATTAAAACTATTTACAAA
CCACATAAGAAAGTGAGCCAATTCTTGAGACCAATCAAGAGTAACATTCCTTTACAACAA
GCGGGTGTATACAAACTCGACTGTGACTGTGTCTTGTCATACATTGGACAGACGAAGAGG
AGCATCGGTACAAGGGTTAAGGAACACATCTCAGATATCAAAAACAGGCGCGCGTCGAAG
TCAGCAGTGTGTGAACACACAATGGACAAACCAGGCCACTACATTCGTTTTGATAAACCT
CAAATCCTCGCTCGGGAAGACAAGTATATACCGAGATTAATTCGCGAGGCTATTGAAATT
AAAAAACATCCCAATTTCAATAGAGAAGATGGCTGGAATCTATCAAACACCTGGGACCCC
GTTCTTAAAAATATAAAATCCCATGTCCGTAACCACACCGCAGGACCTCAAGACACCGTG
AGCGCATTCTGCCGGCATCCAGAGCGGTACGCCAGAAAATTAAGAAATCGATGGCGACTA
GTTGAACTTCGTTGCGTGCCGTGGACGGCTGGCGAGGTATCCCGGGCGATACAAGCAGGT
CGATGTCGGGATATAGCACCCCGTATGGCTCCAGACTCACCTCCTAGACTGGCCTACTTA
TTACAGAGAGCTTTGGTTCGGATAGGGCGGGAAGCTCAGCGTCTGTCACAGAACTTCGGC
TTCTGTTCCAAGCACGAGGTGGCTGGCGCTTTCCGAATCGTACTGAGTACACCATTAGCT
GATTCTTGTATAAAGGGTTGTCAACGTGCAGCGACTATGTATGCAACGTCAGTAAGCGCT
GCAAGAAGACTAGGCTCGGCGGCTCGAGCGCGCACAGGTCTAGCACCTGGACGTTTTCAG
CGCTGGATGTTGGACGTGCGCGTCGCGGCATTCGTACATGAGTTGGTATTTTATTATGGC
TATAGTTTTCAGAATAGTTAA

Protein sequence:

MVSFDVQSLFTSIPVLDCIEIVRGKLKDNNMPIEYAELLKHCLTSGYLMWKDEFYIQVDG
VAMGSPVSPVVADIFMEDFEVRALCSPPIRPLIYKRYVDDTFTILNKNKTSAFLNHLNSI
NSKIQCTIELEANNSLAFLDILVVRNPDNTLGHTVYRKPTHTDRYLNGYSHHHPIQLATV
GKSLLQRAQHLCDADHLEAELQHVKHALTINNLPVPRQHRKKHLKPPTVERQPAILPYVK
GVTDRIGNILKKVSIKTIYKPHKKVSQFLRPIKSNIPLQQAGVYKLDCDCVLSYIGQTKR
SIGTRVKEHISDIKNRRASKSAVCEHTMDKPGHYIRFDKPQILAREDKYIPRLIREAIEI
KKHPNFNREDGWNLSNTWDPVLKNIKSHVRNHTAGPQDTVSAFCRHPERYARKLRNRWRL
VELRCVPWTAGEVSRAIQAGRCRDIAPRMAPDSPPRLAYLLQRALVRIGREAQRLSQNFG
FCSKHEVAGAFRIVLSTPLADSCIKGCQRAATMYATSVSAARRLGSAARARTGLAPGRFQ
RWMLDVRVAAFVHELVFYYGYSFQNS