DPGLEAN21555 in OGS1.0

Genomic Positionscaffold8343:+ 1071-3522
See gene structure
CDS Length2154
Paired RNAseq reads  1159
Single RNAseq reads  4122
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitND
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (4e-75)
Best NR hit (blastx)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (6e-77)
GeneOntology terms  ND
InterPro families  IPR000477 Reverse transcriptase
Orthology groupMCL10014

Nucleotide sequence:

ATGAATACTCGCGAAAATCTTAGATCTCATTATGTGAACCAGTACGGACTGGATATCGCA
AATAGAATACGCCGGTTGGAGTACCTCAAGACGAAGAAAGCCAAACTATTACCATCCTTG
AATTTCCTCAAGAGATGCCGGGATCACGACATCATACCAACATGTGTCAAAATATCACCA
AAAAAAGACATCAAACATAGCGGCAGGATACTCCATCAGGCAAGTAAACGGTTACTTTGT
CATCTTATTAGACAACATCGGTCGGATGCACGGCAGTTAGATTTTGAAATTGATGTTCTG
AGTTACGAAATCGAAAAATGTTTGTCAAAGACAGATTGGCAAAAATGTTTTGATATTATT
AATAAACGTGAACAATTCACGTATAATCAATTGAAAAATAATAAAATTAATAAATTTAAA
AAATTAACGGAAAAAATTAAAAGTTGTAATAGTAATGGCAATAATTATAATATTAATTTA
AGTCGGGCAACACAAACATCGGTAACTGTTGTCAATCTGTCAAAACAAACTTTAGATAAA
GAAACCGTTGAGATCCTTAAAAAAGGATTGAATTTTGCACCAGTGCCATCAAAATTACCG
TTCGAAGACATTATCTGTAACGTTGAGGAATGTCTTCATAAAAATCATGTCACGAGAGAA
GAGTCTGAAGCAATCCGGCAGGATGTTTCGTATGTTCTGCGCCGGAGCAAATTGCCTCGA
CAAAATATCTCTCGTCAGGAATCGGTTGCACTCAAACAACTTCGCAACAATGAAGACCTG
ACTGTCCTCCGTGCAGATAAAGGGAACGCGACCATATTATCACCACTCCGGGGCCACACA
AAGTCATTTGTTAAGGATTCCTACCAATTTGTCAAAGATTTAAAACACCTAAAATTAAGC
GACAATGACAGTATGGTCAGTTTCGATGTACAGTCATTATTCACTAGTATACCTGTTCTT
GACTGCATTGAGATTGTAAGAGGTAAGTTAAAGGATAACAATATGCCTATAGAATATGCA
GAACTATTAAAGCATTGCCTAACATCTGGCTACCTCATGTGGAAGGATGAATTCTACATA
CAAGTAGATGGAGTTGCAATGGGTTCGCCGGTTTCCCCCGTTGTCGCTGACATATTCATG
GAAGACTTCGAGGTGCGAGCCCTTTGCTCTCCTCCTATAAGACCTTTAATATATAAACGG
TATGTAGATGACACCTTCACAATATTAAATAAAAATAAAACATCTGCTTTTCTGAACCAT
CTTAATTCTATCAATAGTAAGATTCAGTTTACTATAGAATTGGAGGCAAATAATTCTTTA
GCTTTCCTTGATATACTTGTCATTAGGAATCCTGACAATACTTTGGGACATACTGTTTAT
AGGAAACCCACACACACGGACAGGTACCTCAACGGTAACTCGCACCACCACCCTATACAG
TTAGCTACCGTTGGCAAATCTTTGTTACAGAGAGCCCAACACCTTTGTGATGCTGACCAC
CTAGAGGCCGAGCTGCAGCATGTAAAACATGCTCTCACCATCAACAACCTGCCCGTGCCT
CGCCAGCATCGCAAGAACCACCTGAAGCCACCCACAGTTGAACGACAACCTGCGATACTA
CCATATGTGAAGGGAGTTACTGACAGAATAGGCAACATCTTGAAGAAGGTTTCCATTAAA
ACTATTTACAAACCACATAAGAAAGTGAGCCAATTCTTGAGACCAATCAAGAGTAACATT
CCTTTACAACAAGCGGGTGTATACAAACTCGACTGTGACTGTGGCTTGTCATACATTGGA
CAGACGAAGAGGAGCATCGGTACAAGGGTTAAGGAACACATCTCAGACATCAAAAACAGG
CGCGCGTCGAAGTCAGCAGTGTGTGAACACACAATGGACAAACCAGGCCACTACATTCGT
TTTGATAAACCTCAAATCCTCGCTCGGGAAGACAAGTATATACCGAGATTAATTCGCGAG
GCTATTGAAATTAAAAAACATCCCAATTTCAATAGAGAAGATGGCTGGAATCTATCAAAC
ACCTGGGACCCCGTTCTTAAAAATATAAAATCCCATGTCCGAAACCACACCGCAGGACCT
CAAGACACCGTGAGCGCATTCTGCCGGCATCCAGAGCGGTACGCCAGAAAATAA

Protein sequence:

MNTRENLRSHYVNQYGLDIANRIRRLEYLKTKKAKLLPSLNFLKRCRDHDIIPTCVKISP
KKDIKHSGRILHQASKRLLCHLIRQHRSDARQLDFEIDVLSYEIEKCLSKTDWQKCFDII
NKREQFTYNQLKNNKINKFKKLTEKIKSCNSNGNNYNINLSRATQTSVTVVNLSKQTLDK
ETVEILKKGLNFAPVPSKLPFEDIICNVEECLHKNHVTREESEAIRQDVSYVLRRSKLPR
QNISRQESVALKQLRNNEDLTVLRADKGNATILSPLRGHTKSFVKDSYQFVKDLKHLKLS
DNDSMVSFDVQSLFTSIPVLDCIEIVRGKLKDNNMPIEYAELLKHCLTSGYLMWKDEFYI
QVDGVAMGSPVSPVVADIFMEDFEVRALCSPPIRPLIYKRYVDDTFTILNKNKTSAFLNH
LNSINSKIQFTIELEANNSLAFLDILVIRNPDNTLGHTVYRKPTHTDRYLNGNSHHHPIQ
LATVGKSLLQRAQHLCDADHLEAELQHVKHALTINNLPVPRQHRKNHLKPPTVERQPAIL
PYVKGVTDRIGNILKKVSIKTIYKPHKKVSQFLRPIKSNIPLQQAGVYKLDCDCGLSYIG
QTKRSIGTRVKEHISDIKNRRASKSAVCEHTMDKPGHYIRFDKPQILAREDKYIPRLIRE
AIEIKKHPNFNREDGWNLSNTWDPVLKNIKSHVRNHTAGPQDTVSAFCRHPERYARK