DPGLEAN07799 in OGS1.0

New model in OGS2.0DPOGS212441 
Genomic Positionscaffold871:+ 76379-81646
See gene structure
CDS Length1848
Paired RNAseq reads  11
Single RNAseq reads  38
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002904 (8e-76)
Best Drosophila hit  BBS1 (2e-64)
Best Human hitBardet-Biedl syndrome 1 protein (1e-68)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL009211 [Aedes aegypti] (6e-107)
Best NR hit (blastx)  PREDICTED: similar to BBS1 [Tribolium castaneum] (7e-104)
GeneOntology terms







  
GO:0003674 molecular_function
GO:0005575 cellular_component
GO:0060026 convergent extension
GO:0070121 Kupffer's vesicle development
GO:0060027 convergent extension involved in gastrulation
GO:0033339 pectoral fin development
GO:0007368 determination of left/right symmetry
GO:0032402 melanosome transport
GO:0042384 cilium assembly
InterPro families
  
IPR015943 WD40/YVTN repeat-like-containing domain
IPR011046 WD40 repeat-like-containing domain
Orthology groupMCL11557

Nucleotide sequence:

ATGAGCAGATGGTTAGACGTAGAAGCAGGAGCGGATGATATAGATATTAATACATTGCCT
TCCAATGTTACCTTTTCGGATCTGCAAAATGATAATGAGGCTAAACTTATTATTGGAGAT
TTCGGAAGGAGTGATGATGGCCCTAGACTTAAAATTTTTAAAGGCGCCATGCAGATATCA
GACTTGACTTTACCAGATCTACCTTTGGGGGTAGTCAGTTTTTATGCTGTTGAAACAAAT
CCCCGTCCTCAACCTGTAATTGCTGTTGCTTTCAGTTCCTCCGTATACTTTTATAGGAAT
CTGAAATTGTTTTATAAATATTACCTACCTCGTGTCGAACTCAATGCTGGCGAATTAGAT
ACTTGGAAACAGCTTACAAATCCATCTAACCATAAAGAAGAAACAATCCTAAAACTCACA
GAGAGCCTACACAACATACCGCATAAAGTGTTAAGCATACAATCTAGAAACTTTCTATCT
CTCACATTGGACGAACAGCTGGAATACTTGGAAAATACTCAGGAGTTACCAAAGAAGAAA
AACGGAGAAATTGTTTGCATTTCAACCATCAGACTCTCTTCAGTTGATAAACGTGCAGTA
AGCTGTCTGGTTCTAGGTACAGAGGATGGAGAGATTATCGTGTTGGACTCCCAGACGTTT
ACACAAACAAATGTGGTTAATATAAGTCCTGTTAAGAAAACACCGTTTCAAATAGTAACG
ACCGGAGTATACAATGTGGATTACAGAATAACTGTCGCTACCAGGGAGAGAAGTGTTTGT
TTGCTAAAAAGAGACTGGAAAGAGGGACGTACTTTATTCAATACTGACGACCACATTATT
GCTATTGAAGTTTTTGCCACAGATAATAGTATTTTGGTTATATGCGCCGACAAAACCTTC
TCTTGTTATAACCGTAAGGGTAGGAAACAATGGTCCTTAAGCCTAGACCATCGCCCTATT
TGTCTTTCTCTTGTCCCAATATCTCACTTGGGCATGACGCTATCCGCGGTCGCGCTCGTA
TCCGGCCACGTGACACTCTATGACGGCAAATATCCTAGAGATAATATATTTATTAGAGAC
GTTGTCTCCGTCATGAAGTTTGGTCAGCTCGGTCAAGAGGAGCATGTTTTTACTATCATA
ACAACAAATGGCCATCTGCTGTTGAAAATATTAAAACGAACGGCTGACTTCAATTCGAAC
TCCACGGGGATGGAAACCTCTGAATCTAATATCGGTCAGAGACCGTGGCTCATACCTAAG
AAATCGAAACTGTTCCTGGAACAGGCTGTGAGGGAAAGAGAGAATCCTAAAGCTATGCAT
GAGGCGTTCCAGTATGAATTGAACCGTCTACGTTTATTAACGGCTCAGACCCTCTTGGAG
GCGTATAGGAAGTCGGATAACTTCGTTGGAACCGGCAATATGGAACCCATAAGATTATCC
GCCGAGGTGGAAGGACTAGGTCCCGTGTTTGTGGTGACCTTGATAGTACACAACACGTGC
TCCGAGCGCGCCGTCTCCGGCCTCGCCGTGTTGTTCCACGTCATCTCCACTGGATACAGA
GTCCATACACCATATACCAAGGTTCCACTGATCGCACCGGGAAATCAGTTGAAATTTCCA
ATTAAAGTTGAGGAAATATTCAGTGAGAACGTGAACCCGGATGTATTTTTTCGTAATGTG
ACGGGTCAGGCGGGGGAGGGGTCAGTCATTAAAGTGTTGCTTTTGAAGGAGGGAAAAGTA
AGCCCAGTTCTCGCAGCGACAGTGACCATGCCTCCCACTGACCCCATGATGATACCGTAC
GATAAGCTTCAGACCTCCAACTTTGGACAGAACTCATCGCAAAATTAA

Protein sequence:

MSRWLDVEAGADDIDINTLPSNVTFSDLQNDNEAKLIIGDFGRSDDGPRLKIFKGAMQIS
DLTLPDLPLGVVSFYAVETNPRPQPVIAVAFSSSVYFYRNLKLFYKYYLPRVELNAGELD
TWKQLTNPSNHKEETILKLTESLHNIPHKVLSIQSRNFLSLTLDEQLEYLENTQELPKKK
NGEIVCISTIRLSSVDKRAVSCLVLGTEDGEIIVLDSQTFTQTNVVNISPVKKTPFQIVT
TGVYNVDYRITVATRERSVCLLKRDWKEGRTLFNTDDHIIAIEVFATDNSILVICADKTF
SCYNRKGRKQWSLSLDHRPICLSLVPISHLGMTLSAVALVSGHVTLYDGKYPRDNIFIRD
VVSVMKFGQLGQEEHVFTIITTNGHLLLKILKRTADFNSNSTGMETSESNIGQRPWLIPK
KSKLFLEQAVRERENPKAMHEAFQYELNRLRLLTAQTLLEAYRKSDNFVGTGNMEPIRLS
AEVEGLGPVFVVTLIVHNTCSERAVSGLAVLFHVISTGYRVHTPYTKVPLIAPGNQLKFP
IKVEEIFSENVNPDVFFRNVTGQAGEGSVIKVLLLKEGKVSPVLAATVTMPPTDPMMIPY
DKLQTSNFGQNSSQN