DPGLEAN04250 in OGS1.0

Genomic Positionscaffold2881:+ 1130-6353
See gene structure
CDS Length1806
Paired RNAseq reads  975
Single RNAseq reads  3644
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008009 (2e-52)
Best Drosophila hit  CG18812, isoform B (9e-48)
Best Human hitganglioside-induced differentiation-associated protein 2 isoform a (5e-19)
Best NR hit (blastp)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (0.0)
Best NR hit (blastx)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (0.0)
GeneOntology terms

  
GO:0008150 biological_process
GO:0005575 cellular_component
GO:0003674 molecular_function
InterPro families
  
IPR001251 Cellular retinaldehyde-binding/triple function, C-terminal
IPR000477 Reverse transcriptase
Orthology groupMCL10012

Nucleotide sequence:

ATGCCACATCAGTGGCGTTATAGTTACATTACCCCTATATACAAAGGCAGGGGCAGTGTT
CAAGATTGTGGTAGTTATAGGGGCGTTAAGATCATGAGTCACACCATGAAGCTCTTTGAG
CGTATGATCGACCTCAGGCTCCGCCGAGAGTGTACTGTCTCGGAATGTCAATATGGATTT
CAGCCAGGATCGGGCACCTTGGACGCCATCTTTGCCATCAGAACTCTGATGGAGGCATAC
AGGGAAAAAAGGAGAGCTCTGCATGTCGCATTCCTAGATCTGCAGAAGGCCTTTGACTGC
GTGCCTCGTCAATGTATCTGGTGGGCATTGCGATTCAAAGGGATCCCTGAGGCCTATATT
GACATCATCAGAGACATGTACCGCGATTCCGTTTCAATGGTTAGGACTGCTGTTGGCGAT
ACAAAACCCTTTCCGATCTCAGTAGGGCTTCACCAAGGCTCGGCTCTTAGCCCCTTCTTG
TTCAATGTAGTGCTGGACACTGTCTCGGCTAACATCCAGGACCAGCCTCCATGGCTGATG
ATGTATGCCGATGACATAGCGCTCATTGATGAGAGCAGGTTGACGCTAGAGCGAAGAGTG
AACCTCTGGAAGGGTACGCTTGAGAACGGTGGTCTTAAACTAAATGTGACGAAGACCGAG
TACATGGCTTGCGGAAGCCCGGACTCTTGCACTATCCATATAGGTCCTGAACCAGCCGTT
AAGTCGGAAAAGTTCAGGTACCTTGGATCTATTCTGCATGAGTCCGGAGGCATCGATCAC
GATGTCCAAGCCCGGATCAGCGCTGCTTGGGCGAAATGGCGTGACGTCACAGGTGTGGTC
TGCGATCGCAGAATACCTACCAAGCTCAAGGGAATAATATACAAGAGCATAATCCGACCG
GTTCTCTTATATGGAAGCGAATGTTGGCCAACACTGTCCAGGCACACTCAGGAGCTTCAC
GTCACGGAGATGAAGATGCTGAGGTGGATGTGTGGCGTAACGCGGGCTGACCGTATACGT
AACACATTTATCCGAGGTAGTCTTGGAGTCCGTGACGTAGCGGATAAGCTTCAAGAGAGT
CGCCTGAGATGGTATGGCCACGTTGCACGCCGGCCTGAGAATTACGTCGGAAAAATTTGC
CTTGATATGTCGGTCCCTGGAGCAAGACCCCCAGGACGCCCAAGAAAGCGATGGCTGGAC
ACCGTGAAGCAGGATATGAGAGCCAATGGACTTACCACCGCGGATGCTAAAGACCGTGCA
AAATACGAGCGTCTTCTCCGTCGTGCTCGAAGCGAAGACCTGAGCGAAGTGTCCGGTATA
GGATGCTTGTATCAGAGCGGCGTTGACAGACTCGGAAGACCGGTGGTCGTGTTCATTGGC
AAATGGTTCCCCATCACGGAGATAGATCTCGATAAGGCTTTATTATATCTCATCAAGCTG
CTAGACCCCATCGTCCGTGGGGATTACGTCATAGCCTACTTCCACACACTGGCCTCGTCG
AACAACCATCCACCCTTCTCGTGGCTGAAGGAGGTTTACACCGACGACGGAATATTCATT
CCATATAAGAAGAATTTGAAAGCCTTTTACATAGTACATCCTACGTTTTGGACGAAGATG
ATGACATGGTGGTTCACAACATTCATGGCGCCCGCTATCAAGGCGAAAGTCCACACGCTG
CCTGGTGTTGAGTATTTATACTCTGTGATGGCGAGAGACCAGCTCCTCCACGGGCGGAAG
CAGTGTCTGTACGATATGACGATAAACGGCCTGCACTACTTCCAGCCGGACACGAGCAAC
ACGTAG

Protein sequence:

MPHQWRYSYITPIYKGRGSVQDCGSYRGVKIMSHTMKLFERMIDLRLRRECTVSECQYGF
QPGSGTLDAIFAIRTLMEAYREKRRALHVAFLDLQKAFDCVPRQCIWWALRFKGIPEAYI
DIIRDMYRDSVSMVRTAVGDTKPFPISVGLHQGSALSPFLFNVVLDTVSANIQDQPPWLM
MYADDIALIDESRLTLERRVNLWKGTLENGGLKLNVTKTEYMACGSPDSCTIHIGPEPAV
KSEKFRYLGSILHESGGIDHDVQARISAAWAKWRDVTGVVCDRRIPTKLKGIIYKSIIRP
VLLYGSECWPTLSRHTQELHVTEMKMLRWMCGVTRADRIRNTFIRGSLGVRDVADKLQES
RLRWYGHVARRPENYVGKICLDMSVPGARPPGRPRKRWLDTVKQDMRANGLTTADAKDRA
KYERLLRRARSEDLSEVSGIGCLYQSGVDRLGRPVVVFIGKWFPITEIDLDKALLYLIKL
LDPIVRGDYVIAYFHTLASSNNHPPFSWLKEVYTDDGIFIPYKKNLKAFYIVHPTFWTKM
MTWWFTTFMAPAIKAKVHTLPGVEYLYSVMARDQLLHGRKQCLYDMTINGLHYFQPDTSN
T