Genomic Position | scaffold2881:+ 1130-6353 |
---|---|
See gene structure | |
CDS Length | 1806 |
Paired RNAseq reads   | 975 |
Single RNAseq reads   | 3644 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008009 (2e-52) |
Best Drosophila hit   | CG18812, isoform B (9e-48) |
Best Human hit | ganglioside-induced differentiation-associated protein 2 isoform a (5e-19) |
Best NR hit (blastp)   | endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (0.0) |
Best NR hit (blastx)   | endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (0.0) |
GeneOntology terms    | GO:0008150 biological_process GO:0005575 cellular_component GO:0003674 molecular_function |
InterPro families    | IPR001251 Cellular retinaldehyde-binding/triple function, C-terminal IPR000477 Reverse transcriptase |
Orthology group | MCL10012 |
Nucleotide sequence:
ATGCCACATCAGTGGCGTTATAGTTACATTACCCCTATATACAAAGGCAGGGGCAGTGTT
CAAGATTGTGGTAGTTATAGGGGCGTTAAGATCATGAGTCACACCATGAAGCTCTTTGAG
CGTATGATCGACCTCAGGCTCCGCCGAGAGTGTACTGTCTCGGAATGTCAATATGGATTT
CAGCCAGGATCGGGCACCTTGGACGCCATCTTTGCCATCAGAACTCTGATGGAGGCATAC
AGGGAAAAAAGGAGAGCTCTGCATGTCGCATTCCTAGATCTGCAGAAGGCCTTTGACTGC
GTGCCTCGTCAATGTATCTGGTGGGCATTGCGATTCAAAGGGATCCCTGAGGCCTATATT
GACATCATCAGAGACATGTACCGCGATTCCGTTTCAATGGTTAGGACTGCTGTTGGCGAT
ACAAAACCCTTTCCGATCTCAGTAGGGCTTCACCAAGGCTCGGCTCTTAGCCCCTTCTTG
TTCAATGTAGTGCTGGACACTGTCTCGGCTAACATCCAGGACCAGCCTCCATGGCTGATG
ATGTATGCCGATGACATAGCGCTCATTGATGAGAGCAGGTTGACGCTAGAGCGAAGAGTG
AACCTCTGGAAGGGTACGCTTGAGAACGGTGGTCTTAAACTAAATGTGACGAAGACCGAG
TACATGGCTTGCGGAAGCCCGGACTCTTGCACTATCCATATAGGTCCTGAACCAGCCGTT
AAGTCGGAAAAGTTCAGGTACCTTGGATCTATTCTGCATGAGTCCGGAGGCATCGATCAC
GATGTCCAAGCCCGGATCAGCGCTGCTTGGGCGAAATGGCGTGACGTCACAGGTGTGGTC
TGCGATCGCAGAATACCTACCAAGCTCAAGGGAATAATATACAAGAGCATAATCCGACCG
GTTCTCTTATATGGAAGCGAATGTTGGCCAACACTGTCCAGGCACACTCAGGAGCTTCAC
GTCACGGAGATGAAGATGCTGAGGTGGATGTGTGGCGTAACGCGGGCTGACCGTATACGT
AACACATTTATCCGAGGTAGTCTTGGAGTCCGTGACGTAGCGGATAAGCTTCAAGAGAGT
CGCCTGAGATGGTATGGCCACGTTGCACGCCGGCCTGAGAATTACGTCGGAAAAATTTGC
CTTGATATGTCGGTCCCTGGAGCAAGACCCCCAGGACGCCCAAGAAAGCGATGGCTGGAC
ACCGTGAAGCAGGATATGAGAGCCAATGGACTTACCACCGCGGATGCTAAAGACCGTGCA
AAATACGAGCGTCTTCTCCGTCGTGCTCGAAGCGAAGACCTGAGCGAAGTGTCCGGTATA
GGATGCTTGTATCAGAGCGGCGTTGACAGACTCGGAAGACCGGTGGTCGTGTTCATTGGC
AAATGGTTCCCCATCACGGAGATAGATCTCGATAAGGCTTTATTATATCTCATCAAGCTG
CTAGACCCCATCGTCCGTGGGGATTACGTCATAGCCTACTTCCACACACTGGCCTCGTCG
AACAACCATCCACCCTTCTCGTGGCTGAAGGAGGTTTACACCGACGACGGAATATTCATT
CCATATAAGAAGAATTTGAAAGCCTTTTACATAGTACATCCTACGTTTTGGACGAAGATG
ATGACATGGTGGTTCACAACATTCATGGCGCCCGCTATCAAGGCGAAAGTCCACACGCTG
CCTGGTGTTGAGTATTTATACTCTGTGATGGCGAGAGACCAGCTCCTCCACGGGCGGAAG
CAGTGTCTGTACGATATGACGATAAACGGCCTGCACTACTTCCAGCCGGACACGAGCAAC
ACGTAG
Protein sequence:
MPHQWRYSYITPIYKGRGSVQDCGSYRGVKIMSHTMKLFERMIDLRLRRECTVSECQYGF
QPGSGTLDAIFAIRTLMEAYREKRRALHVAFLDLQKAFDCVPRQCIWWALRFKGIPEAYI
DIIRDMYRDSVSMVRTAVGDTKPFPISVGLHQGSALSPFLFNVVLDTVSANIQDQPPWLM
MYADDIALIDESRLTLERRVNLWKGTLENGGLKLNVTKTEYMACGSPDSCTIHIGPEPAV
KSEKFRYLGSILHESGGIDHDVQARISAAWAKWRDVTGVVCDRRIPTKLKGIIYKSIIRP
VLLYGSECWPTLSRHTQELHVTEMKMLRWMCGVTRADRIRNTFIRGSLGVRDVADKLQES
RLRWYGHVARRPENYVGKICLDMSVPGARPPGRPRKRWLDTVKQDMRANGLTTADAKDRA
KYERLLRRARSEDLSEVSGIGCLYQSGVDRLGRPVVVFIGKWFPITEIDLDKALLYLIKL
LDPIVRGDYVIAYFHTLASSNNHPPFSWLKEVYTDDGIFIPYKKNLKAFYIVHPTFWTKM
MTWWFTTFMAPAIKAKVHTLPGVEYLYSVMARDQLLHGRKQCLYDMTINGLHYFQPDTSN
T