DPGLEAN20231 in OGS1.0

New model in OGS2.0  DPOGS215383
Genomic Position  scaffold211:+ 183534-186906
CDS Length  2025
Paired RNAseq reads  2170
Single RNAseq reads  9264
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012396 (3e-77)
Best Drosophila hit  CG1105 (5e-56)
Best Human hit  arrestin domain-containing protein 3 (3e-20)
Best NR hit (blastp)  arrestin domain containing 4 [Culex quinquefasciatus] (5e-64)
Best NR hit (blastx)  arrestin domain containing 4 [Culex quinquefasciatus] (4e-64)
GeneOntology terms  GO:0006911 phagocytosis, engulfment
InterPro families
  IPR014756 Immunoglobulin E-set
  IPR011021 Arrestin-like, N-terminal
  IPR011022 Arrestin-like, C-terminal
Orthology group  MCL31163
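
The genomic position and CDS length listed above can be compared with a short Python sketch. It assumes the coordinates are 1-based and inclusive, which the record does not state, and `parse_position` is a hypothetical helper name. Under that assumption the genomic span is longer than the CDS; the extra bases may be introns and/or untranslated regions, but the record itself does not say which.

```python
import re

def parse_position(text):
    """Split a position string such as 'scaffold211:+ 183534-186906'
    into (scaffold, strand, start, end). Hypothetical helper."""
    match = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)

# Values copied from the record above.
scaffold, strand, start, end = parse_position("scaffold211:+ 183534-186906")
cds_length = 2025

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1
non_coding = genomic_span - cds_length

print(scaffold, strand, genomic_span, non_coding)
```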

Nucleotide sequence:

ATGGGTTTTGAAGATGGTCAAATAGTTCTAGACAGTCCCAATGGAGCGTACTACTCAGGA
CAGGCGGTCTGTGGAACATTACATTTTATACAGACAAAAGAAAAAACATTCAGAGGTATA
TATGTACAGTTTAAAGGATATTGTAAAGTTCATTGGACCACTTCTCACACTAGGACGGTA
AATGGTAAAAGTGAATCATATACAACATCTCATGATTCACATGAAGAATATGTGAATGTG
AAGACTTATCTAGTTGGAGGAGAGTCAGGCGAACATAGCATCGACCCTGGAACTTATGAG
TACAATTTCAGGTTCAATATTCCTGTCAATTGCCCATCATCCTTTGAGGGTCACTTGGGT
CATGTGAGATATGAGATTAAAGCCGTAGTAGACAGGGCATTTAAATTTGATCAGGAGAAG
AAAGTTGCTGTACGTGTTATGGCACCCCTGGATCTCAATCAAAATCCTTATTGCAAGGAT
CCTTTGGAGCTGGAGTTCAATGATTCCTACTGCTGCTTCTGTATGGGATCAGGCTCAGCG
GACACAATGGTGAAGCTGCCTGTAGCAGGTTATTGCCCTGGCCAGACTATACCCATCGAA
CTTAAATGTTCTAATCAAGGCAGTGTTGAGATTGATTATATAAAATTGGAAATAACTAAG
AAACTAACCTTTACTGCAACCCACGAGCCTGGAACTAGGACAGAGAAAGAAACTGTAGCA
GAGATCAAGAAGAATTCTATACCCACAAACACCACCAGAGATTGGACCGTGGAGATGATG
GTCCCAGCCCTGGATGTATACAATCTGGACAACTGTCGGTACATTGATGTGGAATACAAA
TTCAAGGTGACAGTTAACCCTTCTGGATGTCACTCATCCACTGATGGAAGTCGGAAGATT
ATAATTGGTACAGTTCCACTAGTCGGTTTCCAAGATGACGTACAGAATCCACTCGAAAGT
CAAATGCCGCAGCAAACGATTACAGCCGTTACCCAGCAGCCAGTATCAGGAATAAGTTCC
TACCCTGGATCCCCATACCCTCCTGTGGTTAATGTTCAACCTTACCCTAATACTGCCTCA
CCGTACCCGCCAGCTCCGTCACCCTATCCTCAAACCACATCACCCTATCCACAAGCCACA
TCACCCTATCCTCCAACCACATCACCCTATCCTCCAACCACATCACCCTATCCTCCAACC
ACATCACCCTATCCTCCAACCACATCACCCTATCCTCCAACCACATCACCCTATCCTCCA
ACCACATCACCCTATCCTCCAACCACATCACCCTATCCTCCAACCACATCACCCTATCCT
CCAACCACATCACCCTATCCTCCAACCACATCACCCTATCCTCCAACCACATCACCCTAT
CCTCCAACCACATCACCCTATCCTCCAACCACATCACCCTATCCTCCAACCACATCACCC
TATCCTCCAACCACGTTTCCTTACCCTGAAACCACGTCACCATATCCAAACCCTGCATCA
CCCTATGCTCCCACTTCTTCACCTTACCCAAATAAATCTCCATACCCTACTGGCAATTCC
CCATACCCTCCTGCGAGCCCATCAAGTTCTCCATATCCAGACAACCCTCCCCCATACCCT
GGTAACAATCAAGCAAATAATTCCCCATATCCAGCCAGCCCTCACCCCTCTACTAATTCC
CCCTATCCGGCTGCTCCCTACCCTGCAACTAATGCACCCTACCCTGATTCTTCCCCTTAC
CCTCCTAAACAAGACAAGACAAACAAGCCCCTGGGTTTCTCGGTTCCAAGTGGTAATGAA
GTCAGTACACCACTTTTGCAGCCGAACCTCGATCCCAGCCCTTACCCAACCATGTCGCCT
GGTATACACCACCATCAGCTTTCTGCCGCCGTTGATAACTTCGTTTCTGTCGTTGGCATG
TATTCACTGCTCACTGTTTTCCACACACAGCCTACATCGAGCCCCAACCCGTTCGCAGCT
GCCAGCGCGCCCGCACTGGACTCGCCCGATACCCGTGTGTATTGA

Protein sequence:

MGFEDGQIVLDSPNGAYYSGQAVCGTLHFIQTKEKTFRGIYVQFKGYCKVHWTTSHTRTV
NGKSESYTTSHDSHEEYVNVKTYLVGGESGEHSIDPGTYEYNFRFNIPVNCPSSFEGHLG
HVRYEIKAVVDRAFKFDQEKKVAVRVMAPLDLNQNPYCKDPLELEFNDSYCCFCMGSGSA
DTMVKLPVAGYCPGQTIPIELKCSNQGSVEIDYIKLEITKKLTFTATHEPGTRTEKETVA
EIKKNSIPTNTTRDWTVEMMVPALDVYNLDNCRYIDVEYKFKVTVNPSGCHSSTDGSRKI
IIGTVPLVGFQDDVQNPLESQMPQQTITAVTQQPVSGISSYPGSPYPPVVNVQPYPNTAS
PYPPAPSPYPQTTSPYPQATSPYPPTTSPYPPTTSPYPPTTSPYPPTTSPYPPTTSPYPP
TTSPYPPTTSPYPPTTSPYPPTTSPYPPTTSPYPPTTSPYPPTTSPYPPTTSPYPPTTSP
YPPTTFPYPETTSPYPNPASPYAPTSSPYPNKSPYPTGNSPYPPASPSSSPYPDNPPPYP
GNNQANNSPYPASPHPSTNSPYPAAPYPATNAPYPDSSPYPPKQDKTNKPLGFSVPSGNE
VSTPLLQPNLDPSPYPTMSPGIHHHQLSAAVDNFVSVVGMYSLLTVFHTQPTSSPNPFAA
ASAPALDSPDTRVY
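
The protein sequence above appears to be the conceptual translation of the nucleotide sequence. The following Python sketch checks that on the first and last lines of the CDS. It assumes the standard genetic code, which the record does not state explicitly, and `translate` is a hypothetical helper rather than part of any database tool.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last lines of the nucleotide sequence in the record.
first_line = "ATGGGTTTTGAAGATGGTCAAATAGTTCTAGACAGTCCCAATGGAGCGTACTACTCAGGA"
last_line = "GCCAGCGCGCCCGCACTGGACTCGCCCGATACCCGTGTGTATTGA"

print(translate(first_line))
print(translate(last_line))
```

Both translated fragments agree with the start and end of the listed protein, and the terminal TGA codon accounts for the difference between the CDS length of 2025 and three times the 674 listed residues.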