DPGLEAN15910 in OGS1.0

New model in OGS2.0: DPOGS211416
Genomic Position: scaffold337:+ 138572-140179
CDS Length: 1224
Paired RNAseq reads: 144456
Single RNAseq reads: 421187
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA010897 (0.0)
Best Drosophila hit: arrestin 2 (4e-162)
Best Human hit: beta-arrestin-1 isoform B (1e-82)
Best NR hit (blastp): arrestin, Arr2-like (AGAP006263-PA) [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx): arrestin, Arr2-like (AGAP006263-PA) [Anopheles gambiae str. PEST] (2e-179)
GeneOntology terms:

GO:0016059 deactivation of rhodopsin mediated signaling
GO:0016028 rhabdomere
GO:0016060 metarhodopsin inactivation
GO:0007602 phototransduction
GO:0016030 metarhodopsin binding
GO:0005624 membrane fraction
GO:0005625 soluble fraction
GO:0016062 adaptation of rhodopsin mediated signaling
GO:0005515 protein binding
GO:0005737 cytoplasm
InterPro families:

IPR014753 Arrestin, N-terminal
IPR014752 Arrestin, C-terminal
IPR014756 Immunoglobulin E-set
IPR000698 Arrestin
IPR011021 Arrestin-like, N-terminal
IPR011022 Arrestin-like, C-terminal
Orthology group: MCL16066

Nucleotide sequence:

ATGCTTGACTTCCTATCCTTAGCTTTCGTGGTGGCGGTGAAGGTTTTCAAGAAAACCACA
CCCAATGGAAAGGTTACGGTTTACCTGGGCAAACGGGACTTCATTGATCACGTTGACTAC
TGCGATCCTGTCGATGGAGTCGTAGTGGTTGACACCGAGTACCTGAAAGGACGAAAAGTT
TACAGCCAGCTGGTCACGACCTACCGATTCGGTCGCGAAGAAGATGAGGTCATGGGAGTC
AAATTCTCGAAGGAACTGGTCATCGGCCAGGACCAAGTGGTACCAATGGTCAACGCGAAA
ATGGAACTGACACCTGTCCAAGAAAAGCTCCTGAAAAAGCTCGGCCCAAATGCCTTCCCA
TTCACATTTACCTTCCCCGAAATGTCGCCCAGCTCGGTCACTCTGCAACCATCTGATGAG
GATCAGGGCAAACCCATGGGTGTGGACTACTGCGTGCGAACCTACGTAGCTGACAACGAG
GATGACAAGGGCCACAAAAGGAGCTCCGTTACCCTTGCTATCAAGAAGCTGCAACATGCC
CCAGCTTCTCGCGGACGACGCCTACCTAGCTCCCTTGTCAGCAAGGGCTTCACTTTCAGC
AACGGCAAGATCAGTTTGGAAGTGACCCTCGACAAGGAAATCTACTATCATGGAGAGAAG
GTTGCCGCCAACATCATCGTTTCCAACAACTCCAGGAAATCCGTTCGCAACATCCGCTGC
ATGGTTGTACAGCATGTTGAGATTACCATGATCAACTCTCAATTCAGCCGCCATGTTGCA
TCTCTGGAAAGCCGCGAGGGTTGCCCAGTAACACCCGGAGCTAGCCTGTCTAAGACCTTC
TACTTGGTGCCTCTGGCTCGCAGCAACAAGGATATTCGAGGCGTCGCCCTGGACGGCCAC
CTTAAGGAGGATGACGTCAACCTCGCAAGCTCTACCCTGGTGTCGGAGGGCAAGTGCCCA
GCTGATGCTATTGGTATCGTGGTATCTTACTCCGTACGAGTGAAGCTGAACTGCGGAACT
CTGGGAGGCGAGCTTGTTACGGACGTGCCATTCAAACTGCTGCATCCTGCTGAGGGAAGC
GTAGAACGCCAACGTTTCAACGCAATGAAGAAGATGCAATCCATTGAGCGTCACCGCTAC
GAAAATTCTCTGTATGCCAACGAGGAGGAAGACAACATCGTTTTTGAGGACTTCGCCCGC
CTTAGGATGAACGAACCGGAATAA

Protein sequence:

MLDFLSLAFVVAVKVFKKTTPNGKVTVYLGKRDFIDHVDYCDPVDGVVVVDTEYLKGRKV
YSQLVTTYRFGREEDEVMGVKFSKELVIGQDQVVPMVNAKMELTPVQEKLLKKLGPNAFP
FTFTFPEMSPSSVTLQPSDEDQGKPMGVDYCVRTYVADNEDDKGHKRSSVTLAIKKLQHA
PASRGRRLPSSLVSKGFTFSNGKISLEVTLDKEIYYHGEKVAANIIVSNNSRKSVRNIRC
MVVQHVEITMINSQFSRHVASLESREGCPVTPGASLSKTFYLVPLARSNKDIRGVALDGH
LKEDDVNLASSTLVSEGKCPADAIGIVVSYSVRVKLNCGTLGGELVTDVPFKLLHPAEGS
VERQRFNAMKKMQSIERHRYENSLYANEEEDNIVFEDFARLRMNEPE