DPGLEAN08786 in OGS1.0

New model in OGS2.0DPOGS203367 
Genomic Positionscaffold6:+ 42662-54360
See gene structure
CDS Length2853
Paired RNAseq reads  5144
Single RNAseq reads  12013
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003892 (0.0)
Best Drosophila hit  beta'-coatomer protein (0.0)
Best Human hitcoatomer subunit beta' (0.0)
Best NR hit (blastp)  coatomer protein complex subunit beta 2 [Bombyx mori] (0.0)
Best NR hit (blastx)  coatomer protein complex subunit beta 2 [Bombyx mori] (0.0)
GeneOntology terms




  
GO:0030903 notochord development
GO:0005198 structural molecule activity
GO:0016192 vesicle-mediated transport
GO:0005515 protein binding
GO:0030117 membrane coat
GO:0006886 intracellular protein transport
InterPro families








  
IPR020472 G-protein beta WD-40 repeat
IPR011046 WD40 repeat-like-containing domain
IPR011048 Cytochrome cd1-nitrite reductase-like, C-terminal haem d1
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR016453 Coatomer, beta' subunit
IPR001680 WD40 repeat
IPR006692 Coatomer, WD associated region
IPR019781 WD40 repeat, subgroup
Orthology groupMCL14280

Nucleotide sequence:

ATGTTACAAAAGCCGTTAAGATTGGAAATCAAGAGGAAGCTGACAGCGCGATCTGATCGC
GTCAAGTGCGTCGACCAGCACCCAACAGAACCGTGGCTGCTTTGTTCGCTGTACAGCGGC
GACGTCAATATATGGAACTATGAAACACATACACAGATCAAAAGATTCGAGGTGTGCGAC
TTACCAGTGAGGGCTGCTAAATTCGTGATGCGGAAGAACTGGGTCGTGACAGGATCTGAC
GATATGCAGATCAGAGTGTTTAATTACAATACTCTAGAAAGGGTGCACAACTTTGAGGCT
CACTCGGACTACATAAGATGCATCGTCATACATCCCACACAGCCTTACATACTGACAAGC
AGCGACGATCTCCTCATCAAGCTGTGGAACTGGGACCGTAACTGGGCTTGCCAGCAAGTG
TTCGAGGGCCACACCCATTACGTGATGCAAATTGTTATCAACCCTAAAGATAACAATACA
TTTGCCAGCGCAAGTCTCGATACCACTGTTAAGGTTTGGCAGCTCGGTTCATCAATCTCA
AACTTCACATTGGAAGGTCACGAGAAAGGTGTCAACTGTGTGGATTACTACCACGGTGGT
GAAAAGCCCTACCTCATCAGCGGTGCCGATGACCGTCTTGTCAAGATATGGGATTACCAG
AACAAGACATGTGTTCAGACGCTAGAAAGTCACGCCCAGAATGTCACAGCGGTTTCCTTC
CACCCGGAACTTCCCATCCTGCTTACGGGCTCCGAGGACGGCACCGTGAGGATCTGGCAC
GCCGGGACATACAGACTGGAAGCGGCCCTCAACTACGGCTTTGAAAGAGTATGGACTTTA
TCATCACTCCACAGATCCAACAATGTGGCTATCGGATATGACGAAGGTACCATAATGATC
AAAGTTGGAAGAGAAGAGCCCGCTATATCCATGGATGTGAACGGTGGGAAAATAATTTGG
GCCAAGCATTCTGATATGCAGCAAGTTAATTTGAAAGCTCTACCCGAAGGTACAGATATA
AAAGATGGCGAACGGGTCCCAGTGGTTGCTAAAGATATGGGTTCCTGTGAGATATATCCC
CAGACGATAGCCCACAATCCAAACGGACGTTTCGTGGTTGTGTGCGGTGATGGGGAATAC
ATAATATACACAGCCATGGCCTTGAGGAATAAGGCCTTCGGAACAGCCCAGGAGTTTGTG
TGGGCTTTGGATAGCTCGGAGTACGCTACACTGGAGAATTCTAGCACAGTGAAAGTCTTT
AAGAACTTCAAGGAGAGGAAGAGCTTTAAACCTGAATATGGCGCTGAAGGTATCTTCGGT
GGATTCATGCTGGGCGTTAAGTCTATCAGTGGCATGGCCTTCTCGTTCTACGACTGGGAA
CAATTGGAGCTCATTAGACGTATCGAGATTCAGCCTCGTCATGTTTTCTGGTCTGAGAGC
GGAAGCCTAGTGTGTCTGGCCAGCGAGGAGGCCTACTACGTGCTGAAGTACAACGCTTCT
GTCGTAGCTAAATCAAGAGAAAATAATACTAACGTAACCGAGGACGGCATCGAGGATGCT
TTCGAGGTTGTGGGCGAAGTAAATGAGTCGGTGAAGACGGGCTTGTGGGTAGGCGACTGC
TTCATATACACCAACTCGTTGAACAGAATCAACTATTACGTCGGCGGTGAGATTGTGACC
ATAGCGCACTTGGACCACACGATGTATATCCTGGGATACGTCGCTAAAGAAAACAGGCTG
TACCTCAACGACAAGGAGTTGAACATAGTGTCGTATTCCCTCCTGCTGCCGGTTCTGGAG
TATCAGACGGCGGTGATGAGAGGTGACTTCGAAACAGCTGATCGCGTCCTGCCGACCATA
CCTCACGATCATCGCACCAGGGTCGCACATTTTCTCGAGAAACAGGGCTTCAAACAACAA
GCTCTGGCTGTGTCAACGGAGCCCGAACACCAGTTCGAGCTGGCCCTGTCGCTGGGCGAG
CTGAAGAAGGCCAGCCAGTTGGCAGAGGAGTCAGATAAGGCCGAGGGCCGCGAGGACAAC
CAGCCCTCGAGGCCTTCAGCTGCCAGGTGGTCCAGATTGGGAGCAGCAGCTGCAGCAGCT
GCAGACACTGATCTCACCAAGTTCTGCTACCAGAAGGCCCGCGACTACAGCGCCCTGCTA
CTATTCTCCGTCAGCACTGGCGATCGTGAGTTGCTGGAAGAGGTGGCTCATATGTCCGAT
CTGGCCGGTGAAGATAACATAGCCTTCACATCCTATCTTACTCTGAATGACCTGGACTCT
TGTCTGGCGCTGCTTCTCAAACGAAACAAACTACCAGAGGCTGCGTTCTTCTGCAGGTCA
TACTATCCTTCAATGATGAGCGATGTCCTCAAACGTTGGAGGGATTCCGTCTCTATGACC
AATCCCAAGTGCGGCCAGGCCTTGGCCGATCCCAACAAATACGACAACCTGTTCCCGGAA
TACATGGATACCCTGGCGATGGAGTTCTACCAGAAGCACTTTGGTTATCCGTACTACAAT
CAGTTGGAGCATATCAAAGAGAACACTGATTTATGCAATGTTGACCGAGACATGGCTCAC
GAAAGGCTGGTCGCTATCCACATGGGCGCCTGGGACCCTAGGGTCATAACCCCACCATCC
GGTGCTTCAGGTCTCTCCAGTCTACAGGACAGTCCGAGACGAGATCCCAGAAATCCAGAT
AGTTCAGATGAAGCTTCCTATTCTGATGAAAAGATCAGACGTAGAGACTCCATGGACATC
CTCGAAGAGATTGAACGTGAGATAGACAACATTGTGCTGGACAACAACGAAGAGGATCTG
GATTCGTCAGACGAGACCATGTATCTTGAATAA

Protein sequence:

MLQKPLRLEIKRKLTARSDRVKCVDQHPTEPWLLCSLYSGDVNIWNYETHTQIKRFEVCD
LPVRAAKFVMRKNWVVTGSDDMQIRVFNYNTLERVHNFEAHSDYIRCIVIHPTQPYILTS
SDDLLIKLWNWDRNWACQQVFEGHTHYVMQIVINPKDNNTFASASLDTTVKVWQLGSSIS
NFTLEGHEKGVNCVDYYHGGEKPYLISGADDRLVKIWDYQNKTCVQTLESHAQNVTAVSF
HPELPILLTGSEDGTVRIWHAGTYRLEAALNYGFERVWTLSSLHRSNNVAIGYDEGTIMI
KVGREEPAISMDVNGGKIIWAKHSDMQQVNLKALPEGTDIKDGERVPVVAKDMGSCEIYP
QTIAHNPNGRFVVVCGDGEYIIYTAMALRNKAFGTAQEFVWALDSSEYATLENSSTVKVF
KNFKERKSFKPEYGAEGIFGGFMLGVKSISGMAFSFYDWEQLELIRRIEIQPRHVFWSES
GSLVCLASEEAYYVLKYNASVVAKSRENNTNVTEDGIEDAFEVVGEVNESVKTGLWVGDC
FIYTNSLNRINYYVGGEIVTIAHLDHTMYILGYVAKENRLYLNDKELNIVSYSLLLPVLE
YQTAVMRGDFETADRVLPTIPHDHRTRVAHFLEKQGFKQQALAVSTEPEHQFELALSLGE
LKKASQLAEESDKAEGREDNQPSRPSAARWSRLGAAAAAAADTDLTKFCYQKARDYSALL
LFSVSTGDRELLEEVAHMSDLAGEDNIAFTSYLTLNDLDSCLALLLKRNKLPEAAFFCRS
YYPSMMSDVLKRWRDSVSMTNPKCGQALADPNKYDNLFPEYMDTLAMEFYQKHFGYPYYN
QLEHIKENTDLCNVDRDMAHERLVAIHMGAWDPRVITPPSGASGLSSLQDSPRRDPRNPD
SSDEASYSDEKIRRRDSMDILEEIEREIDNIVLDNNEEDLDSSDETMYLE