New model in OGS2.0 | DPOGS207010  |
---|---|
Genomic Position | scaffold1:+ 727245-744719 |
See gene structure | |
CDS Length | 5046 |
Paired RNAseq reads   | 13433 |
Single RNAseq reads   | 31215 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012935 (0.0) |
Best Drosophila hit   | clathrin heavy chain, isoform B (0.0) |
Best Human hit | clathrin heavy chain 1 (0.0) |
Best NR hit (blastp)   | clathrin heavy chain [Bombyx mori] (0.0) |
Best NR hit (blastx)   | clathrin heavy chain [Bombyx mori] (0.0) |
GeneOntology terms    | GO:0007291 sperm individualization GO:0016183 synaptic vesicle coating GO:0008021 synaptic vesicle GO:0030125 clathrin vesicle coat GO:0005905 coated pit GO:0030135 coated vesicle GO:0007269 neurotransmitter secretion GO:0030129 clathrin coat of synaptic vesicle GO:0030132 clathrin coat of coated pit GO:0006886 intracellular protein transport GO:0005198 structural molecule activity GO:0016192 vesicle-mediated transport GO:0005515 protein binding GO:0030130 clathrin coat of trans-Golgi network vesicle GO:0033227 dsRNA transport GO:0005811 lipid particle GO:0035159 regulation of tube length, open tracheal system GO:0035002 liquid clearance, open tracheal system GO:0030198 extracellular matrix organization GO:0005886 plasma membrane GO:0031410 cytoplasmic vesicle |
InterPro families    | IPR000547 Clathrin, heavy chain/VPS, 7-fold repeat IPR016341 Clathrin, heavy chain IPR022365 Clathrin, heavy chain, propeller repeat IPR015348 Clathrin, heavy chain, linker, core motif IPR001473 Clathrin, heavy chain, propeller, N-terminal IPR016024 Armadillo-type fold IPR016025 Clathrin, heavy chain, linker/propeller domain IPR012331 Clathrin, heavy chain, linker IPR011990 Tetratricopeptide-like helical |
Orthology group | MCL12457 |
Nucleotide sequence:
ATGGCTCAGGTGTTGCCAATACGCTTCCAGGAGCATTTACAGCTCACAAATATAGGTATA
AATCCCGCTTCAATATCATTCAACACCCTGACCATGGAGTCAGATAAGTTCATCTGTGTC
CGTGAGAAGGTGGGTGACACCTCGGAGGTTGTCATCATTGACATGGCGGATCCCACCAAC
CCCATAAGGAGACCCATCAGCGCTGACTCTGCCATCATGAACCCAGCTAGCAAAGTCATC
GCTCTCAAGGGAAAGGCTGGAGTGGAAGCGCAAAAGACCCTCCAAATATTCAACATTGAA
ATGAAATCCAAGATGAAGGCGCACACTATGACCGAGGATGTAGTTTTCTGGAAGTGGATC
TCGCCTAACACTTTGGCCCTGGTGACCAAAATATCAGTATACCACTGGTCCATGGAGGGG
GATTCGACACCAGTCAAGATGTTCGATAGACATTCATCTCTCGCTGAGTGTCAGATTATC
AACTACAGAACCGATCCTAAGCAGCAGTGGCTGCTACTTGTCGGTATCTCGGCGCAACAG
AACCGTGTTGTGGGCGCGATGCAGTTGTACTCAGTTGAGCGGAAGTGTTCTCAGCCGATC
GAAGGTCATGCTGCTTCGTTCGCGACCTTCAAGGCTGAGGGTAACGCTGAGCTGTCTACG
CTGTTTTGTTTCGCTGTGAGGACAGCACAGGGCGGGAAGCTGCACATCATCGAGGTTGGT
CAGACCCCAGCCGGTAACCAGCAGTTCCCTAAGAAAGCGGTGGACGTTTTCTTCCCGGCT
GAAGCCCAGAACGATTTCCCGGTCGCCATGCAAGTGTCGCCCAAATATGACGTCATCTAC
CTGATCACCAAATACGGTTACATCCATATGTACGACATCGAAACCGGCACATGCATTTAT
ATGAATCGCATCTCCTCTGACACTATATTCGTGACAGCACCCCACGAATCGACCGGCGGA
ATTATTGGTGTGAACCGCAAGGGACAAGTTCTGTCTGTGACGGTGGAAGAGGAGTCCATA
GTGCCGTACATCAACACGGTGCTGCAGAACCCTGAACTGGCGCTCCGGCTGGCTGTGAGG
AATAACCTGGCCGGTGCCGAGGAGTTGTTCGTCAGGAAATTCAACATGCTGTTCACCAAC
GGACAGTACGGAGAGGCAGCTAAGGTAGCGGCTATGGCTCCGCGCGGGATCCTCCGTACG
CCGCAGACGATCCAGCGGTTCCAGCAGGTGCCCACCCAGCCCGGCCAGACCTCCCCGCTG
TTGCAGTACTTCGGCATCCTGTTGGACCAAGCACAGCTCAACAAGTTCGAATCGCTGGAG
TTGTGCCGACCTGTACTTCTGCAAGGTCGCAAGCAACTATTGGAGAAATGGCTGAAGGAA
GAGAAATTGGAATGTTCAGAGGAACTGGGAGACCTTGTGAAGCAGGTCGATCCCACTCTG
GCACTATCAGTTTATTTAAGGGCGAATGTAGCTGCCAAAGTGATCCAATGTTTCGCCGAA
ACCGGCCAGTTCCAGAAGATCGTGTTATACGCTAAGAAGGTGGGCTATACGCCGGATTAT
ATCTATCTCCTGAGATCTGTGATGCGTACGAATCCCGAGCAAGGCGCAGGTTTCGCGGGT
ATGCTGGTCGCCGAGGACCCGCCGCTGGCTGACATCAATCAGATCGTGGACGTGTTCATG
GAACAGAACATGGTACAGCAGTGCACAGCCTTCTTACTCGATGCCTTGAAGAACAACCGT
CCCGAGGAAGGAGCCCTACAGACCAGATTGTTAGAGATGAATCTGATGTCAGCGCCTCAA
GTGGCAGACGCGATTCTGGGCAATGGTATGTTCACGCACTACGACCGCGCCCATGTCGCT
CAGCTCTGCGAGAAGGCCGGCCTACTGCAACGTGCTCTAGAGCATTACACAGACTTGTAC
GACATTAAGAGAGCTGTGGTTCACACACACTTGCTGTCCGCCGATTGGTTGGTGAGTTAT
TTCGGCACCCTATCAGTCGAAGACTCCCTCGAGTGTCTTAAGGCGATGCTACAAGCGAAC
ATTCGCCAAAACCTTCAGATCTGCGTACAGATCGCAACCAAATACCACGAACAACTAACA
ACCAAGGCTCTCATTGAATTATTCGAGGGTTTCAAGACTTATGAAGGTCTATTCTACTTC
CTCGGCTCCATTGTGAACTTCAGTCAGGATTCAGAAGTACATTTCAAGTACATCCAGGCT
GCATGCAAGACTGGTCAGATCAAAGAAGTGGAACGCATCTGTCGCGAGTCGAACTGCTAC
AACGCGGAGCGTGTGAAAAATTTCCTTAAGGAAGCCAAACTTCCCGATCAGTTGCCTCTA
ATCATTGTGTGCGATAGATTCGACTTCGTACACGACCTCGTCTTGTATTTGTATAGAAAC
AGCCTCCAAAAGTACATCGAGATTTACGTACAGAAGGTAAATCCGTCAAGGCTGCCTGTA
GTTGTCGGTGGTCTGTTGGATGTAGACTGCGCTGAGGATATAATCAAAAACCTCATACTC
GTAGTCCGAGGACAGTTCTCCACAGACGAGCTCGTAGCTGAAGTTGAGAAGAGAAACAGA
CTAAAGTTGCTCCTACCATGGTTGGAGACGCGGGTCCACGAGGGCTGCAACGAGCCAGCG
ACGCACAACGCTCTAGCCAAGATTTACATTGATTCTAACAATAATCCCGAGAGATTCTTG
AAGGAGAACCAATGGTACGATTCCCGTGTTGTGGGTCGCTACTGTGAGAAGCGCGATCCC
CACCTCGCTTGTGTGGCGTACGAGCGTGGGCAGTGTGACCGCGAGCTGATCGCCGTATGC
AATGATAACTCGCTGTTCAAGACTCAAGCGCGGTACCTCGTGAGGAGACGGGACCAGGAC
CTCTGGCTGGAAGTACTGGCCGAGTCAAACCCTTACAAGAGGCAGCTTATAGATCAGGTT
GTACAAACGGCTCTGTCGGAAACCCAAGACCCTGAGGACATTTCGGTGACGGTGAAGGCA
TTCATGACAGCTGATTTGCCGAATGAGCTGATCGAGCTGTTAGAGAAGATTGTCCTAGAT
AACTCTGTGTTCTCTGATCACAGGAACCTACAGAATCTGCTTATTTTGACAGCTATCAAG
GCCGATCGCACCCGTGTTATGGAATACATCAATCGCCTGGACAACTACGACGCACCGGAC
ATCGCTAACATAGCCATCAATAACGAGCTATATGAGGAAGCTTTTGCTATCTTCAAGAAG
TTCGATGTTAATACATCGGCCATTCAAGTCCTGATAGACCAAGTGAAGGATCTACAACGC
GCTTATGAATTCGCCGAGCGTTGCAACGAGCCGGGCGTTTGGTCACAACTGGCTAAGGCT
CAGTTACAGCAGGGATTGGTGAAGGAAGCCATTGATTCTTACATAAAGGCAGACGATCCA
TCCGCCTATATGGACGTAGTTGATACAGCCACCAAACAACAGTCCTGGGAGGATCTCGTC
AGATACCTACAGATGGCTCGCAAGAAGGCTCGTGAATCGTACATAGAATCCGAATTGATT
TACGCTTACGCCCGCACTGGGAGGCTGGCTGATCTCGAAGAGTTCATCTCTGGTCCGAAC
CACGCCGACATACAGAAGATAGGGGACAGGTGTTTCGACGATAAGATGTACAACGCTGCT
AAACTGCTCTACAATAACGTGAGCAACTTTGCTCGTTTGGCCATCACTCTGGTGCATCTC
AAGGAATTCCAAGGCGCGGTGGACAGTGCCCGCAAGGCGAACTCCACTCGTACATGGAAG
GAGGTTTGCTTCGCCTGTGTCGACGCCGGTGAATTCCGTCTCGCTCAGATGTGCGGACTA
CATATAGTTGTGCACGCTGACGAGTTGGAGGACCTCATTAATTACTACCAGGATCGTGGT
CATTTCGACGAGCTGATCAGTCTGCTCGAGGCTGCTCTCGGTCTCGAACGTGCTCATATG
GGAATGTTCACAGAACTGGCCATACTTTACTCCAAGTACAAACCAGCTAAGATGCGCGAA
CATTTGGAACTATTCTGGTCTCGCGTTAACATTCCGAAGGTCCTTCGCGCCGCGGAACAA
GCTCATCTGTGGTCCGAACTAGTGTTCCTGTACGATAAATACGAGGAGTACGACAACGCT
GCTCTCACCATGATGCAACACCCCACAGAGGCATGGAGGGAGGGCCACTTCAAGGATATC
ATCACTAAGGTGGCGAATATGGAGCTGTACTACAAGGCTATCCAGTTTTACTTGGACTAC
AAACCTCTTCTTCTGAACGATCTTCTGCTAGTGCTGGCTCCACGTATGGATCACACTCGT
GCTGTGGGATTCTTCACCAAGGCGGGCCACCTACAGCTGGTTAAGGCCTACCTGAGGTCC
GTACAGAGCCTCAACAATAAAGCTGTCAATGAAGCACTCAATTCCCTGCTCATTGATGAA
GAGGATTATCAGGGCTTGAGGACATCGATTGACGCTTTCGATAACTTTGACACGATCGCA
CTGGCGCAGCAACTGGAGAAACACGAACTCACCGAGTTTAGAAGAATTGCTGCCTATTTG
TACAAAGGCAACAATAGATGGAAACAGAGCGTCGAGCTTTGCAAGAAGGACGCTTTATAC
GCTGATGCTATGGAATACGCCGCTGAGTCCCGTCAGGCAGATGTCGCTGAGGAACTGCTA
GACTGGTTCCTTGAAAGACGCAACTACGAGTGCTTCTCGGCTACTTTGTACCAGTGTTAC
GACCTCTTGAAACCCGATGTAGTTATTGAACTGGCGTGGAGACATAATATCATGGATTTC
GCAATGCCGTATCTCATCCAAACTGTACGCGAACTGACAACTAAAGTTGAAAAGTTGGAG
GAGGCTGACGCCAAACGTAGCACAGAGAGCGCTGAACAAGAAGCCAAACCAGCAATGATT
ATGGAACCACAGCTTATGCTTACTGCCGGACCTTCAATGGCTTATCCGGGTGTACCGGCC
CAGTCACCGTACGCTTACGCGGCGCAGGCACCGTCCCCGGCGCCCTACCACGGCTACGGC
ATGTAG
Protein sequence:
MAQVLPIRFQEHLQLTNIGINPASISFNTLTMESDKFICVREKVGDTSEVVIIDMADPTN
PIRRPISADSAIMNPASKVIALKGKAGVEAQKTLQIFNIEMKSKMKAHTMTEDVVFWKWI
SPNTLALVTKISVYHWSMEGDSTPVKMFDRHSSLAECQIINYRTDPKQQWLLLVGISAQQ
NRVVGAMQLYSVERKCSQPIEGHAASFATFKAEGNAELSTLFCFAVRTAQGGKLHIIEVG
QTPAGNQQFPKKAVDVFFPAEAQNDFPVAMQVSPKYDVIYLITKYGYIHMYDIETGTCIY
MNRISSDTIFVTAPHESTGGIIGVNRKGQVLSVTVEEESIVPYINTVLQNPELALRLAVR
NNLAGAEELFVRKFNMLFTNGQYGEAAKVAAMAPRGILRTPQTIQRFQQVPTQPGQTSPL
LQYFGILLDQAQLNKFESLELCRPVLLQGRKQLLEKWLKEEKLECSEELGDLVKQVDPTL
ALSVYLRANVAAKVIQCFAETGQFQKIVLYAKKVGYTPDYIYLLRSVMRTNPEQGAGFAG
MLVAEDPPLADINQIVDVFMEQNMVQQCTAFLLDALKNNRPEEGALQTRLLEMNLMSAPQ
VADAILGNGMFTHYDRAHVAQLCEKAGLLQRALEHYTDLYDIKRAVVHTHLLSADWLVSY
FGTLSVEDSLECLKAMLQANIRQNLQICVQIATKYHEQLTTKALIELFEGFKTYEGLFYF
LGSIVNFSQDSEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEAKLPDQLPL
IIVCDRFDFVHDLVLYLYRNSLQKYIEIYVQKVNPSRLPVVVGGLLDVDCAEDIIKNLIL
VVRGQFSTDELVAEVEKRNRLKLLLPWLETRVHEGCNEPATHNALAKIYIDSNNNPERFL
KENQWYDSRVVGRYCEKRDPHLACVAYERGQCDRELIAVCNDNSLFKTQARYLVRRRDQD
LWLEVLAESNPYKRQLIDQVVQTALSETQDPEDISVTVKAFMTADLPNELIELLEKIVLD
NSVFSDHRNLQNLLILTAIKADRTRVMEYINRLDNYDAPDIANIAINNELYEEAFAIFKK
FDVNTSAIQVLIDQVKDLQRAYEFAERCNEPGVWSQLAKAQLQQGLVKEAIDSYIKADDP
SAYMDVVDTATKQQSWEDLVRYLQMARKKARESYIESELIYAYARTGRLADLEEFISGPN
HADIQKIGDRCFDDKMYNAAKLLYNNVSNFARLAITLVHLKEFQGAVDSARKANSTRTWK
EVCFACVDAGEFRLAQMCGLHIVVHADELEDLINYYQDRGHFDELISLLEAALGLERAHM
GMFTELAILYSKYKPAKMREHLELFWSRVNIPKVLRAAEQAHLWSELVFLYDKYEEYDNA
ALTMMQHPTEAWREGHFKDIITKVANMELYYKAIQFYLDYKPLLLNDLLLVLAPRMDHTR
AVGFFTKAGHLQLVKAYLRSVQSLNNKAVNEALNSLLIDEEDYQGLRTSIDAFDNFDTIA
LAQQLEKHELTEFRRIAAYLYKGNNRWKQSVELCKKDALYADAMEYAAESRQADVAEELL
DWFLERRNYECFSATLYQCYDLLKPDVVIELAWRHNIMDFAMPYLIQTVRELTTKVEKLE
EADAKRSTESAEQEAKPAMIMEPQLMLTAGPSMAYPGVPAQSPYAYAAQAPSPAPYHGYG
M