New model in OGS2.0 | DPOGS211416  |
---|---|
Genomic Position | scaffold337:+ 138572-140179 |
CDS Length | 1224 |
Paired RNAseq reads   | 144456 |
Single RNAseq reads   | 421187 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010897 (0.0) |
Best Drosophila hit   | arrestin 2 (4e-162) |
Best Human hit | beta-arrestin-1 isoform B (1e-82) |
Best NR hit (blastp)   | arrestin, Arr2-like (AGAP006263-PA) [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | arrestin, Arr2-like (AGAP006263-PA) [Anopheles gambiae str. PEST] (2e-179) |
GeneOntology terms    | GO:0016059 deactivation of rhodopsin mediated signaling; GO:0016028 rhabdomere; GO:0016060 metarhodopsin inactivation; GO:0007602 phototransduction; GO:0016030 metarhodopsin binding; GO:0005624 membrane fraction; GO:0005625 soluble fraction; GO:0016062 adaptation of rhodopsin mediated signaling; GO:0005515 protein binding; GO:0005737 cytoplasm |
InterPro families    | IPR014753 Arrestin, N-terminal; IPR014752 Arrestin, C-terminal; IPR014756 Immunoglobulin E-set; IPR000698 Arrestin; IPR011021 Arrestin-like, N-terminal; IPR011022 Arrestin-like, C-terminal |
Orthology group | MCL16066 |
Nucleotide sequence:
ATGCTTGACTTCCTATCCTTAGCTTTCGTGGTGGCGGTGAAGGTTTTCAAGAAAACCACA
CCCAATGGAAAGGTTACGGTTTACCTGGGCAAACGGGACTTCATTGATCACGTTGACTAC
TGCGATCCTGTCGATGGAGTCGTAGTGGTTGACACCGAGTACCTGAAAGGACGAAAAGTT
TACAGCCAGCTGGTCACGACCTACCGATTCGGTCGCGAAGAAGATGAGGTCATGGGAGTC
AAATTCTCGAAGGAACTGGTCATCGGCCAGGACCAAGTGGTACCAATGGTCAACGCGAAA
ATGGAACTGACACCTGTCCAAGAAAAGCTCCTGAAAAAGCTCGGCCCAAATGCCTTCCCA
TTCACATTTACCTTCCCCGAAATGTCGCCCAGCTCGGTCACTCTGCAACCATCTGATGAG
GATCAGGGCAAACCCATGGGTGTGGACTACTGCGTGCGAACCTACGTAGCTGACAACGAG
GATGACAAGGGCCACAAAAGGAGCTCCGTTACCCTTGCTATCAAGAAGCTGCAACATGCC
CCAGCTTCTCGCGGACGACGCCTACCTAGCTCCCTTGTCAGCAAGGGCTTCACTTTCAGC
AACGGCAAGATCAGTTTGGAAGTGACCCTCGACAAGGAAATCTACTATCATGGAGAGAAG
GTTGCCGCCAACATCATCGTTTCCAACAACTCCAGGAAATCCGTTCGCAACATCCGCTGC
ATGGTTGTACAGCATGTTGAGATTACCATGATCAACTCTCAATTCAGCCGCCATGTTGCA
TCTCTGGAAAGCCGCGAGGGTTGCCCAGTAACACCCGGAGCTAGCCTGTCTAAGACCTTC
TACTTGGTGCCTCTGGCTCGCAGCAACAAGGATATTCGAGGCGTCGCCCTGGACGGCCAC
CTTAAGGAGGATGACGTCAACCTCGCAAGCTCTACCCTGGTGTCGGAGGGCAAGTGCCCA
GCTGATGCTATTGGTATCGTGGTATCTTACTCCGTACGAGTGAAGCTGAACTGCGGAACT
CTGGGAGGCGAGCTTGTTACGGACGTGCCATTCAAACTGCTGCATCCTGCTGAGGGAAGC
GTAGAACGCCAACGTTTCAACGCAATGAAGAAGATGCAATCCATTGAGCGTCACCGCTAC
GAAAATTCTCTGTATGCCAACGAGGAGGAAGACAACATCGTTTTTGAGGACTTCGCCCGC
CTTAGGATGAACGAACCGGAATAA
Protein sequence:
MLDFLSLAFVVAVKVFKKTTPNGKVTVYLGKRDFIDHVDYCDPVDGVVVVDTEYLKGRKV
YSQLVTTYRFGREEDEVMGVKFSKELVIGQDQVVPMVNAKMELTPVQEKLLKKLGPNAFP
FTFTFPEMSPSSVTLQPSDEDQGKPMGVDYCVRTYVADNEDDKGHKRSSVTLAIKKLQHA
PASRGRRLPSSLVSKGFTFSNGKISLEVTLDKEIYYHGEKVAANIIVSNNSRKSVRNIRC
MVVQHVEITMINSQFSRHVASLESREGCPVTPGASLSKTFYLVPLARSNKDIRGVALDGH
LKEDDVNLASSTLVSEGKCPADAIGIVVSYSVRVKLNCGTLGGELVTDVPFKLLHPAEGS
VERQRFNAMKKMQSIERHRYENSLYANEEEDNIVFEDFARLRMNEPE
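
The record lists a CDS length of 1224 nt, which corresponds to 408 codons: 407 residues plus a stop codon. A minimal sketch of how the nucleotide and protein sequences above could be checked against each other is shown below. It assumes the standard genetic code; the names `CODON_TABLE`, `translate`, `first_line`, and `last_line` are illustrative and not part of the record. Only the first and last lines of the nucleotide sequence are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: this gene uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGCTTGACTTCCTATCCTTAGCTTTCGTGGTGGCGGTGAAGGTTTTCAAGAAAACCACA"
last_line = "CTTAGGATGAACGAACCGGAATAA"

print(translate(first_line))  # MLDFLSLAFVVAVKVFKKTT
print(translate(last_line))   # LRMNEPE*
```

Both translated fragments agree with the start and end of the protein sequence listed above. Passing the full 1224 nt sequence to `translate` in the same way would allow the whole protein to be compared.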