DPGLEAN15559 in OGS1.0

New model in OGS2.0DPOGS207010 
Genomic Positionscaffold1:+ 727245-744719
See gene structure
CDS Length5046
Paired RNAseq reads  13433
Single RNAseq reads  31215
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012935 (0.0)
Best Drosophila hit  clathrin heavy chain, isoform B (0.0)
Best Human hitclathrin heavy chain 1 (0.0)
Best NR hit (blastp)  clathrin heavy chain [Bombyx mori] (0.0)
Best NR hit (blastx)  clathrin heavy chain [Bombyx mori] (0.0)
GeneOntology terms



















  
GO:0007291 sperm individualization
GO:0016183 synaptic vesicle coating
GO:0008021 synaptic vesicle
GO:0030125 clathrin vesicle coat
GO:0005905 coated pit
GO:0030135 coated vesicle
GO:0007269 neurotransmitter secretion
GO:0030129 clathrin coat of synaptic vesicle
GO:0030132 clathrin coat of coated pit
GO:0006886 intracellular protein transport
GO:0005198 structural molecule activity
GO:0016192 vesicle-mediated transport
GO:0005515 protein binding
GO:0030130 clathrin coat of trans-Golgi network vesicle
GO:0033227 dsRNA transport
GO:0005811 lipid particle
GO:0035159 regulation of tube length, open tracheal system
GO:0035002 liquid clearance, open tracheal system
GO:0030198 extracellular matrix organization
GO:0005886 plasma membrane
GO:0031410 cytoplasmic vesicle
InterPro families







  
IPR000547 Clathrin, heavy chain/VPS, 7-fold repeat
IPR016341 Clathrin, heavy chain
IPR022365 Clathrin, heavy chain, propeller repeat
IPR015348 Clathrin, heavy chain, linker, core motif
IPR001473 Clathrin, heavy chain, propeller, N-terminal
IPR016024 Armadillo-type fold
IPR016025 Clathrin, heavy chain, linker/propeller domain
IPR012331 Clathrin, heavy chain, linker
IPR011990 Tetratricopeptide-like helical
Orthology groupMCL12457

Nucleotide sequence:

ATGGCTCAGGTGTTGCCAATACGCTTCCAGGAGCATTTACAGCTCACAAATATAGGTATA
AATCCCGCTTCAATATCATTCAACACCCTGACCATGGAGTCAGATAAGTTCATCTGTGTC
CGTGAGAAGGTGGGTGACACCTCGGAGGTTGTCATCATTGACATGGCGGATCCCACCAAC
CCCATAAGGAGACCCATCAGCGCTGACTCTGCCATCATGAACCCAGCTAGCAAAGTCATC
GCTCTCAAGGGAAAGGCTGGAGTGGAAGCGCAAAAGACCCTCCAAATATTCAACATTGAA
ATGAAATCCAAGATGAAGGCGCACACTATGACCGAGGATGTAGTTTTCTGGAAGTGGATC
TCGCCTAACACTTTGGCCCTGGTGACCAAAATATCAGTATACCACTGGTCCATGGAGGGG
GATTCGACACCAGTCAAGATGTTCGATAGACATTCATCTCTCGCTGAGTGTCAGATTATC
AACTACAGAACCGATCCTAAGCAGCAGTGGCTGCTACTTGTCGGTATCTCGGCGCAACAG
AACCGTGTTGTGGGCGCGATGCAGTTGTACTCAGTTGAGCGGAAGTGTTCTCAGCCGATC
GAAGGTCATGCTGCTTCGTTCGCGACCTTCAAGGCTGAGGGTAACGCTGAGCTGTCTACG
CTGTTTTGTTTCGCTGTGAGGACAGCACAGGGCGGGAAGCTGCACATCATCGAGGTTGGT
CAGACCCCAGCCGGTAACCAGCAGTTCCCTAAGAAAGCGGTGGACGTTTTCTTCCCGGCT
GAAGCCCAGAACGATTTCCCGGTCGCCATGCAAGTGTCGCCCAAATATGACGTCATCTAC
CTGATCACCAAATACGGTTACATCCATATGTACGACATCGAAACCGGCACATGCATTTAT
ATGAATCGCATCTCCTCTGACACTATATTCGTGACAGCACCCCACGAATCGACCGGCGGA
ATTATTGGTGTGAACCGCAAGGGACAAGTTCTGTCTGTGACGGTGGAAGAGGAGTCCATA
GTGCCGTACATCAACACGGTGCTGCAGAACCCTGAACTGGCGCTCCGGCTGGCTGTGAGG
AATAACCTGGCCGGTGCCGAGGAGTTGTTCGTCAGGAAATTCAACATGCTGTTCACCAAC
GGACAGTACGGAGAGGCAGCTAAGGTAGCGGCTATGGCTCCGCGCGGGATCCTCCGTACG
CCGCAGACGATCCAGCGGTTCCAGCAGGTGCCCACCCAGCCCGGCCAGACCTCCCCGCTG
TTGCAGTACTTCGGCATCCTGTTGGACCAAGCACAGCTCAACAAGTTCGAATCGCTGGAG
TTGTGCCGACCTGTACTTCTGCAAGGTCGCAAGCAACTATTGGAGAAATGGCTGAAGGAA
GAGAAATTGGAATGTTCAGAGGAACTGGGAGACCTTGTGAAGCAGGTCGATCCCACTCTG
GCACTATCAGTTTATTTAAGGGCGAATGTAGCTGCCAAAGTGATCCAATGTTTCGCCGAA
ACCGGCCAGTTCCAGAAGATCGTGTTATACGCTAAGAAGGTGGGCTATACGCCGGATTAT
ATCTATCTCCTGAGATCTGTGATGCGTACGAATCCCGAGCAAGGCGCAGGTTTCGCGGGT
ATGCTGGTCGCCGAGGACCCGCCGCTGGCTGACATCAATCAGATCGTGGACGTGTTCATG
GAACAGAACATGGTACAGCAGTGCACAGCCTTCTTACTCGATGCCTTGAAGAACAACCGT
CCCGAGGAAGGAGCCCTACAGACCAGATTGTTAGAGATGAATCTGATGTCAGCGCCTCAA
GTGGCAGACGCGATTCTGGGCAATGGTATGTTCACGCACTACGACCGCGCCCATGTCGCT
CAGCTCTGCGAGAAGGCCGGCCTACTGCAACGTGCTCTAGAGCATTACACAGACTTGTAC
GACATTAAGAGAGCTGTGGTTCACACACACTTGCTGTCCGCCGATTGGTTGGTGAGTTAT
TTCGGCACCCTATCAGTCGAAGACTCCCTCGAGTGTCTTAAGGCGATGCTACAAGCGAAC
ATTCGCCAAAACCTTCAGATCTGCGTACAGATCGCAACCAAATACCACGAACAACTAACA
ACCAAGGCTCTCATTGAATTATTCGAGGGTTTCAAGACTTATGAAGGTCTATTCTACTTC
CTCGGCTCCATTGTGAACTTCAGTCAGGATTCAGAAGTACATTTCAAGTACATCCAGGCT
GCATGCAAGACTGGTCAGATCAAAGAAGTGGAACGCATCTGTCGCGAGTCGAACTGCTAC
AACGCGGAGCGTGTGAAAAATTTCCTTAAGGAAGCCAAACTTCCCGATCAGTTGCCTCTA
ATCATTGTGTGCGATAGATTCGACTTCGTACACGACCTCGTCTTGTATTTGTATAGAAAC
AGCCTCCAAAAGTACATCGAGATTTACGTACAGAAGGTAAATCCGTCAAGGCTGCCTGTA
GTTGTCGGTGGTCTGTTGGATGTAGACTGCGCTGAGGATATAATCAAAAACCTCATACTC
GTAGTCCGAGGACAGTTCTCCACAGACGAGCTCGTAGCTGAAGTTGAGAAGAGAAACAGA
CTAAAGTTGCTCCTACCATGGTTGGAGACGCGGGTCCACGAGGGCTGCAACGAGCCAGCG
ACGCACAACGCTCTAGCCAAGATTTACATTGATTCTAACAATAATCCCGAGAGATTCTTG
AAGGAGAACCAATGGTACGATTCCCGTGTTGTGGGTCGCTACTGTGAGAAGCGCGATCCC
CACCTCGCTTGTGTGGCGTACGAGCGTGGGCAGTGTGACCGCGAGCTGATCGCCGTATGC
AATGATAACTCGCTGTTCAAGACTCAAGCGCGGTACCTCGTGAGGAGACGGGACCAGGAC
CTCTGGCTGGAAGTACTGGCCGAGTCAAACCCTTACAAGAGGCAGCTTATAGATCAGGTT
GTACAAACGGCTCTGTCGGAAACCCAAGACCCTGAGGACATTTCGGTGACGGTGAAGGCA
TTCATGACAGCTGATTTGCCGAATGAGCTGATCGAGCTGTTAGAGAAGATTGTCCTAGAT
AACTCTGTGTTCTCTGATCACAGGAACCTACAGAATCTGCTTATTTTGACAGCTATCAAG
GCCGATCGCACCCGTGTTATGGAATACATCAATCGCCTGGACAACTACGACGCACCGGAC
ATCGCTAACATAGCCATCAATAACGAGCTATATGAGGAAGCTTTTGCTATCTTCAAGAAG
TTCGATGTTAATACATCGGCCATTCAAGTCCTGATAGACCAAGTGAAGGATCTACAACGC
GCTTATGAATTCGCCGAGCGTTGCAACGAGCCGGGCGTTTGGTCACAACTGGCTAAGGCT
CAGTTACAGCAGGGATTGGTGAAGGAAGCCATTGATTCTTACATAAAGGCAGACGATCCA
TCCGCCTATATGGACGTAGTTGATACAGCCACCAAACAACAGTCCTGGGAGGATCTCGTC
AGATACCTACAGATGGCTCGCAAGAAGGCTCGTGAATCGTACATAGAATCCGAATTGATT
TACGCTTACGCCCGCACTGGGAGGCTGGCTGATCTCGAAGAGTTCATCTCTGGTCCGAAC
CACGCCGACATACAGAAGATAGGGGACAGGTGTTTCGACGATAAGATGTACAACGCTGCT
AAACTGCTCTACAATAACGTGAGCAACTTTGCTCGTTTGGCCATCACTCTGGTGCATCTC
AAGGAATTCCAAGGCGCGGTGGACAGTGCCCGCAAGGCGAACTCCACTCGTACATGGAAG
GAGGTTTGCTTCGCCTGTGTCGACGCCGGTGAATTCCGTCTCGCTCAGATGTGCGGACTA
CATATAGTTGTGCACGCTGACGAGTTGGAGGACCTCATTAATTACTACCAGGATCGTGGT
CATTTCGACGAGCTGATCAGTCTGCTCGAGGCTGCTCTCGGTCTCGAACGTGCTCATATG
GGAATGTTCACAGAACTGGCCATACTTTACTCCAAGTACAAACCAGCTAAGATGCGCGAA
CATTTGGAACTATTCTGGTCTCGCGTTAACATTCCGAAGGTCCTTCGCGCCGCGGAACAA
GCTCATCTGTGGTCCGAACTAGTGTTCCTGTACGATAAATACGAGGAGTACGACAACGCT
GCTCTCACCATGATGCAACACCCCACAGAGGCATGGAGGGAGGGCCACTTCAAGGATATC
ATCACTAAGGTGGCGAATATGGAGCTGTACTACAAGGCTATCCAGTTTTACTTGGACTAC
AAACCTCTTCTTCTGAACGATCTTCTGCTAGTGCTGGCTCCACGTATGGATCACACTCGT
GCTGTGGGATTCTTCACCAAGGCGGGCCACCTACAGCTGGTTAAGGCCTACCTGAGGTCC
GTACAGAGCCTCAACAATAAAGCTGTCAATGAAGCACTCAATTCCCTGCTCATTGATGAA
GAGGATTATCAGGGCTTGAGGACATCGATTGACGCTTTCGATAACTTTGACACGATCGCA
CTGGCGCAGCAACTGGAGAAACACGAACTCACCGAGTTTAGAAGAATTGCTGCCTATTTG
TACAAAGGCAACAATAGATGGAAACAGAGCGTCGAGCTTTGCAAGAAGGACGCTTTATAC
GCTGATGCTATGGAATACGCCGCTGAGTCCCGTCAGGCAGATGTCGCTGAGGAACTGCTA
GACTGGTTCCTTGAAAGACGCAACTACGAGTGCTTCTCGGCTACTTTGTACCAGTGTTAC
GACCTCTTGAAACCCGATGTAGTTATTGAACTGGCGTGGAGACATAATATCATGGATTTC
GCAATGCCGTATCTCATCCAAACTGTACGCGAACTGACAACTAAAGTTGAAAAGTTGGAG
GAGGCTGACGCCAAACGTAGCACAGAGAGCGCTGAACAAGAAGCCAAACCAGCAATGATT
ATGGAACCACAGCTTATGCTTACTGCCGGACCTTCAATGGCTTATCCGGGTGTACCGGCC
CAGTCACCGTACGCTTACGCGGCGCAGGCACCGTCCCCGGCGCCCTACCACGGCTACGGC
ATGTAG

Protein sequence:

MAQVLPIRFQEHLQLTNIGINPASISFNTLTMESDKFICVREKVGDTSEVVIIDMADPTN
PIRRPISADSAIMNPASKVIALKGKAGVEAQKTLQIFNIEMKSKMKAHTMTEDVVFWKWI
SPNTLALVTKISVYHWSMEGDSTPVKMFDRHSSLAECQIINYRTDPKQQWLLLVGISAQQ
NRVVGAMQLYSVERKCSQPIEGHAASFATFKAEGNAELSTLFCFAVRTAQGGKLHIIEVG
QTPAGNQQFPKKAVDVFFPAEAQNDFPVAMQVSPKYDVIYLITKYGYIHMYDIETGTCIY
MNRISSDTIFVTAPHESTGGIIGVNRKGQVLSVTVEEESIVPYINTVLQNPELALRLAVR
NNLAGAEELFVRKFNMLFTNGQYGEAAKVAAMAPRGILRTPQTIQRFQQVPTQPGQTSPL
LQYFGILLDQAQLNKFESLELCRPVLLQGRKQLLEKWLKEEKLECSEELGDLVKQVDPTL
ALSVYLRANVAAKVIQCFAETGQFQKIVLYAKKVGYTPDYIYLLRSVMRTNPEQGAGFAG
MLVAEDPPLADINQIVDVFMEQNMVQQCTAFLLDALKNNRPEEGALQTRLLEMNLMSAPQ
VADAILGNGMFTHYDRAHVAQLCEKAGLLQRALEHYTDLYDIKRAVVHTHLLSADWLVSY
FGTLSVEDSLECLKAMLQANIRQNLQICVQIATKYHEQLTTKALIELFEGFKTYEGLFYF
LGSIVNFSQDSEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEAKLPDQLPL
IIVCDRFDFVHDLVLYLYRNSLQKYIEIYVQKVNPSRLPVVVGGLLDVDCAEDIIKNLIL
VVRGQFSTDELVAEVEKRNRLKLLLPWLETRVHEGCNEPATHNALAKIYIDSNNNPERFL
KENQWYDSRVVGRYCEKRDPHLACVAYERGQCDRELIAVCNDNSLFKTQARYLVRRRDQD
LWLEVLAESNPYKRQLIDQVVQTALSETQDPEDISVTVKAFMTADLPNELIELLEKIVLD
NSVFSDHRNLQNLLILTAIKADRTRVMEYINRLDNYDAPDIANIAINNELYEEAFAIFKK
FDVNTSAIQVLIDQVKDLQRAYEFAERCNEPGVWSQLAKAQLQQGLVKEAIDSYIKADDP
SAYMDVVDTATKQQSWEDLVRYLQMARKKARESYIESELIYAYARTGRLADLEEFISGPN
HADIQKIGDRCFDDKMYNAAKLLYNNVSNFARLAITLVHLKEFQGAVDSARKANSTRTWK
EVCFACVDAGEFRLAQMCGLHIVVHADELEDLINYYQDRGHFDELISLLEAALGLERAHM
GMFTELAILYSKYKPAKMREHLELFWSRVNIPKVLRAAEQAHLWSELVFLYDKYEEYDNA
ALTMMQHPTEAWREGHFKDIITKVANMELYYKAIQFYLDYKPLLLNDLLLVLAPRMDHTR
AVGFFTKAGHLQLVKAYLRSVQSLNNKAVNEALNSLLIDEEDYQGLRTSIDAFDNFDTIA
LAQQLEKHELTEFRRIAAYLYKGNNRWKQSVELCKKDALYADAMEYAAESRQADVAEELL
DWFLERRNYECFSATLYQCYDLLKPDVVIELAWRHNIMDFAMPYLIQTVRELTTKVEKLE
EADAKRSTESAEQEAKPAMIMEPQLMLTAGPSMAYPGVPAQSPYAYAAQAPSPAPYHGYG
M