Monarch geneset OGS2.0

DPOGS207010
Transcript: DPOGS207010-TA (5088 bp)
Protein: DPOGS207010-PA (1695 aa)
Genomic position: DPSCF300001 + 1127856-1145330
RNAseq coverage: 1693x (Rank: top 8%)
Annotation
Heliconius       HMEL015464         0.0   89.39%
Bombyx           BGIBMGA012935-TA   0.0   95.58%
Drosophila       Chc-PD             0.0   84.79%
EBI UniRef50     UniRef50_P29742    0.0   84.79%   Clathrin heavy chain n=181 Tax=root RepID=CLH_DROME
NCBI RefSeq      NP_001136443.1     0.0   95.58%   clathrin heavy chain [Bombyx mori]
NCBI nr blastp   gi|219362829       0.0   95.58%   clathrin heavy chain [Bombyx mori]
NCBI nr blastx   gi|219362829       0.0   95.58%   clathrin heavy chain [Bombyx mori]
Group
Gene Ontology
GO:0006886   3.9e-184   intracellular protein transport
GO:0030132   3.9e-184   clathrin coat of coated pit
GO:0030130   3.9e-184   clathrin coat of trans-Golgi network vesicle
GO:0005198   3.9e-184   structural molecule activity
GO:0016192   3.9e-184   vesicle-mediated transport
GO:0005488   2.7e-147   binding
KEGG pathway
tca:656193   0.0
K04646 (CLTC) maps to:
    Huntington's disease
    Endocytosis
    Lysosome
    Bacterial invasion of epithelial cells
InterPro domain
[1-1692]      IPR016341   0          Clathrin, heavy chain
[2-354]       IPR016025   3.9e-184   Clathrin, heavy chain, linker/propeller domain
[1203-1542]   IPR011990   2.7e-147   Tetratricopeptide-like helical
[3-336]       IPR001473   1.9e-144   Clathrin, heavy chain, propeller, N-terminal
[1202-1537]   IPR016024   8.9e-121   Armadillo-type fold
[355-500]     IPR012331   2.4e-77    Clathrin, heavy chain, linker
[985-1130]    IPR000547   2.4e-43    Clathrin, heavy chain/VPS, 7-fold repeat
[337-360]     IPR015348   5.8e-11    Clathrin, heavy chain, linker, core motif
[155-193]     IPR022365   1.2e-06    Clathrin, heavy chain, propeller repeat
Orthology group: MCL12071 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207010-TA
ATGGCTCAGGTGTTGCCAATACGCTTCCAGGAGCATTTACAGCTCACAAATATAGGTATAAATCCCGCTTCAATATCATTCAACACCCTGACCATGGAGTCAGATAAGTTCATCTGTGTCCGTGAGAAGGTGGGTGACACCTCGGAGGTTGTCATCATTGACATGGCGGATCCCACCAACCCCATAAGGAGACCCATCAGCGCTGACTCTGCCATCATGAACCCAGCTAGCAAAGTCATCGCTCTCAAGGGAAAGGCTGGAGTGGAAGCGCAAAAGACCCTCCAAATATTCAACATTGAAATGAAATCCAAGATGAAGGCGCACACTATGACCGAGGATGTAGTTTTCTGGAAGTGGATCTCGCCTAACACTTTGGCCCTGGTGACCAAAATATCAGTATACCACTGGTCCATGGAGGGGGATTCGACACCAGTCAAGATGTTCGATAGACATTCATCTCTCGCTGAGTGTCAGATTATCAACTACAGAACCGATCCTAAGCAGCAGTGGCTGCTACTTGTCGGTATCTCGGCGCAACAGAACCGTGTTGTGGGCGCGATGCAGTTGTACTCAGTTGAGCGGAAGTGTTCTCAGCCGATCGAAGGTCATGCTGCTTCGTTCGCGACCTTCAAGGCTGAGGGTAACGCTGAGCTGTCTACGCTGTTTTGTTTCGCTGTGAGGACAGCACAGGGCGGGAAGCTGCACATCATCGAGGTTGGTCAGACCCCAGCCGGTAACCAGCAGTTCCCTAAGAAAGCGGTGGACGTTTTCTTCCCGGCTGAAGCCCAGAACGATTTCCCGGTCGCCATGCAAGTGTCGCCCAAATATGACGTCATCTACCTGATCACCAAATACGGTTACATCCATATGTACGACATCGAAACCGGCACATGCATTTATATGAATCGCATCTCCTCTGACACTATATTCGTGACAGCACCCCACGAATCGACCGGCGGAATTATTGGTGTGAACCGCAAGGGACAAGTTCTGTCTGTGACGGTGGAAGAGGAGTCCATAGTGCCGTACATCAACACGGTGCTGCAGAACCCTGAACTGGCGCTCCGGCTGGCTGTGAGGAATAACCTGGCCGGTGCCGAGGAGTTGTTCGTCAGGAAATTCAACATGCTGTTCACCAACGGACAGTACGGAGAGGCAGCTAAGGTAGCGGCTATGGCTCCGCGCGGGATCCTCCGTACGCCGCAGACGATCCAGCGGTTCCAGCAGGTGCCCACCCAGCCCGGCCAGACCTCCCCGCTGTTGCAGTACTTCGGCATCCTGTTGGACCAAGCACAGCTCAACAAGTTCGAATCGCTGGAGTTGTGCCGACCTGTACTTCTGCAAGGTCGCAAGCAACTATTGGAGAAATGGCTGAAGGAAGAGAAATTGGAATGTTCAGAGGAACTGGGAGACCTTGTGAAGCAGGTCGATCCCACTCTGGCACTATCAGTTTATTTAAGGGCGAATGTAGCTGCCAAAGTGATCCAATGTTTCGCCGAAACCGGCCAGTTCCAGAAGATCGTGTTATACGCTAAGAAGGTGGGCTATACGCCGGATTATATCTATCTCCTGAGATCTGTGATGCGTACGAATCCCGAGCAAGGCGCAGGTTTCGCGGGTATGCTGGTCGCCGAGGACCCGCCGCTGGCTGACATCAATCAGATCGTGGACGTGTTCATGGAACAGAACATGGTACAGCAGTGCACAGCCTTCTTACTCGATGCCTTGAAGAACAACCGTCCCGAGGAAGGAGCCCTACAGACCAGATTGTTAGAGATGAATCTGATGTCAGCGCCTCAAGTGGCAGACGCGATTCTGGGCAATGGTATGTTCACGCACTACGACCGCGCCCATGTCGCTCAGCTCTGCGAGAAGGCCGGCCTACTGCAACGTGCTCTAGAGCATTACACAGACTTGTACGACATTAAGAGAGCTGTGGTTCACACACACTTGCTGTCCGCCGATTGGTTGGTGAGTTATTTCGGCACCCTATCAGTCGA
AGACTCCCTCGAGTGTCTTAAGGCGATGCTACAAGCGAACATTCGCCAAAACCTTCAGATCTGCGTACAGATCGCAACCAAATACCACGAACAACTAACAACCAAGGCTCTCATTGAATTATTCGAGGGTTTCAAGACTTATGAAGGTCTATTCTACTTCCTCGGCTCCATTGTGAACTTCAGTCAGGATTCAGAAGTACATTTCAAGTACATCCAGGCTGCATGCAAGACTGGTCAGATCAAAGAAGTGGAACGCATCTGTCGCGAGTCGAACTGCTACAACGCGGAGCGTGTGAAAAATTTCCTTAAGGAAGCCAAACTTCCCGATCAGTTGCCTCTAATCATTGTGTGCGATAGATTCGACTTCGTACACGACCTCGTCTTGTATTTGTATAGAAACAGCCTCCAAAAGTACATCGAGATTTACGTACAGAAGGTAAATCCGTCAAGGCTGCCTGTAGTTGTCGGTGGTCTGTTGGATGTAGACTGCGCTGAGGATATAATCAAAAACCTCATACTCGTAGTCCGAGGACAGTTCTCCACAGACGAGCTCGTAGCTGAAGTTGAGAAGAGAAACAGACTAAAGTTGCTCCTACCATGGTTGGAGACGCGGGTCCACGAGGGCTGCAACGAGCCAGCGACGCACAACGCTCTAGCCAAGATTTACATTGATTCTAACAATAATCCCGAGAGATTCTTGAAGGAGAACCAATGGTACGATTCCCGTGTTGTGGGTCGCTACTGTGAGAAGCGCGATCCCCACCTCGCTTGTGTGGCGTACGAGCGTGGGCAGTGTGACCGCGAGCTGATCGCCGTATGCAATGATAACTCGCTGTTCAAGACTCAAGCGCGGTACCTCGTGAGGAGACGGGACCAGGACCTCTGGCTGGAAGTACTGGCCGAGTCAAACCCTTACAAGAGGCAGCTTATAGATCAGGTTGTACAAACGGCTCTGTCGGAAACCCAAGACCCTGAGGACATTTCGGTGACGGTGAAGGCATTCATGACAGCTGATTTGCCGAATGAGCTGATCGAGCTGTTAGAGAAGATTGTCCTAGATAACTCTGTGTTCTCTGATCACAGGAACCTACAGAATCTGCTTATTTTGACAGCTATCAAGGCCGATCGCACCCGTGTTATGGAATACATCAATCGCCTGGACAACTACGACGCACCGGACATCGCTAACATAGCCATCAATAACGAGCTATATGAGGAAGCTTTTGCTATCTTCAAGAAGTTCGATGTTAATACATCGGCCATTCAAGTCCTGATAGACCAAGTGAAGGATCTACAACGCGCTTATGAATTCGCCGAGCGTTGCAACGAGCCGGGCGTTTGGTCACAACTGGCTAAGGCTCAGTTACAGCAGGGATTGGTGAAGGAAGCCATTGATTCTTACATAAAGGCAGACGATCCATCCGCCTATATGGACGTAGTTGATACAGCCACCAAACAACAGTCCTGGGAGGATCTCGTCAGATACCTACAGGCTAGTTCTGGTCTACTTATACGTTATATAAATGACTTAATAATGGCTCGCAAGAAGGCTCGTGAATCGTACATAGAATCCGAATTGATTTACGCTTACGCCCGCACTGGGAGGCTGGCTGATCTCGAAGAGTTCATCTCTGGTCCGAACCACGCCGACATACAGAAGATAGGGGACAGGTGTTTCGACGATAAGATGTACAACGCTGCTAAACTGCTCTACAATAACGTGAGCAACTTTGCTCGTTTGGCCATCACTCTGGTGCATCTCAAGGAATTCCAAGGCGCGGTGGACAGTGCCCGCAAGGCGAACTCCACTCGTACATGGAAGGAGGTTTGCTTCGCCTGTGTCGACGCCGGTGAATTCCGTCTCGCTCAGATGTGCGGACTACATATAGTTGTGCACGCTGACGAGTTGGAGGACCTCATTAATTACTACCAGGATCGTGGTCATTTCGACGAGCTGATCAGTCTGCTCGAGGCTGCTCTCGGTCTCGAACGTGCTCATA
TGGGAATGTTCACAGAACTGGCCATACTTTACTCCAAGTACAAACCAGCTAAGATGCGCGAACATTTGGAACTATTCTGGTCTCGCGTTAACATTCCGAAGGTCCTTCGCGCCGCGGAACAAGCTCATCTGTGGTCCGAACTAGTGTTCCTGTACGATAAATACGAGGAGTACGACAACGCTGCTCTCACCATGATGCAACACCCCACAGAGGCATGGAGGGAGGGCCACTTCAAGGATATCATCACTAAGGTGGCGAATATGGAGCTGTACTACAAGGCTATCCAGTTTTACTTGGACTACAAACCTCTTCTTCTGAACGATCTTCTGCTAGTGCTGGCTCCACGTATGGATCACACTCGTGCTGTGGGATTCTTCACCAAGGCGGGCCACCTACAGCTGGTTAAGGCCTACCTGAGGTCCGTACAGAGCCTCAACAATAAAGCTGTCAATGAAGCACTCAATTCCCTGCTCATTGATGAAGAGGATTATCAGGGCTTGAGGACATCGATTGACGCTTTCGATAACTTTGACACGATCGCACTGGCGCAGCAACTGGAGAAACACGAACTCACCGAGTTTAGAAGAATTGCTGCCTATTTGTACAAAGGCAACAATAGATGGAAACAGAGCGTCGAGCTTTGCAAGAAGGACGCTTTATACGCTGATGCTATGGAATACGCCGCTGAGTCCCGTCAGGCAGATGTCGCTGAGGAACTGCTAGACTGGTTCCTTGAAAGACGCAACTACGAGTGCTTCTCGGCTACTTTGTACCAGTGTTACGACCTCTTGAAACCCGATGTAGTTATTGAACTGGCGTGGAGACATAATATCATGGATTTCGCAATGCCGTATCTCATCCAAACTGTACGCGAACTGACAACTAAAGTTGAAAAGTTGGAGGAGGCTGACGCCAAACGTAGCACAGAGAGCGCTGAACAAGAAGCCAAACCAGCAATGATTATGGAACCACAGCTTATGCTTACTGCCGGACCTTCAATGGCTTATCCGGGTGTACCGGCCCAGTCACCGTACGCTTACGCGGCGCAGGCACCGTCCCCGGCGCCCTACCACGGCTACGGCATGTAG

Protein sequence:

>DPOGS207010-PA
MAQVLPIRFQEHLQLTNIGINPASISFNTLTMESDKFICVREKVGDTSEVVIIDMADPTNPIRRPISADSAIMNPASKVIALKGKAGVEAQKTLQIFNIEMKSKMKAHTMTEDVVFWKWISPNTLALVTKISVYHWSMEGDSTPVKMFDRHSSLAECQIINYRTDPKQQWLLLVGISAQQNRVVGAMQLYSVERKCSQPIEGHAASFATFKAEGNAELSTLFCFAVRTAQGGKLHIIEVGQTPAGNQQFPKKAVDVFFPAEAQNDFPVAMQVSPKYDVIYLITKYGYIHMYDIETGTCIYMNRISSDTIFVTAPHESTGGIIGVNRKGQVLSVTVEEESIVPYINTVLQNPELALRLAVRNNLAGAEELFVRKFNMLFTNGQYGEAAKVAAMAPRGILRTPQTIQRFQQVPTQPGQTSPLLQYFGILLDQAQLNKFESLELCRPVLLQGRKQLLEKWLKEEKLECSEELGDLVKQVDPTLALSVYLRANVAAKVIQCFAETGQFQKIVLYAKKVGYTPDYIYLLRSVMRTNPEQGAGFAGMLVAEDPPLADINQIVDVFMEQNMVQQCTAFLLDALKNNRPEEGALQTRLLEMNLMSAPQVADAILGNGMFTHYDRAHVAQLCEKAGLLQRALEHYTDLYDIKRAVVHTHLLSADWLVSYFGTLSVEDSLECLKAMLQANIRQNLQICVQIATKYHEQLTTKALIELFEGFKTYEGLFYFLGSIVNFSQDSEVHFKYIQAACKTGQIKEVERICRESNCYNAERVKNFLKEAKLPDQLPLIIVCDRFDFVHDLVLYLYRNSLQKYIEIYVQKVNPSRLPVVVGGLLDVDCAEDIIKNLILVVRGQFSTDELVAEVEKRNRLKLLLPWLETRVHEGCNEPATHNALAKIYIDSNNNPERFLKENQWYDSRVVGRYCEKRDPHLACVAYERGQCDRELIAVCNDNSLFKTQARYLVRRRDQDLWLEVLAESNPYKRQLIDQVVQTALSETQDPEDISVTVKAFMTADLPNELIELLEKIVLDNSVFSDHRNLQNLLILTAIKADRTRVMEYINRLDNYDAPDIANIAINNELYEEAFAIFKKFDVNTSAIQVLIDQVKDLQRAYEFAERCNEPGVWSQLAKAQLQQGLVKEAIDSYIKADDPSAYMDVVDTATKQQSWEDLVRYLQASSGLLIRYINDLIMARKKARESYIESELIYAYARTGRLADLEEFISGPNHADIQKIGDRCFDDKMYNAAKLLYNNVSNFARLAITLVHLKEFQGAVDSARKANSTRTWKEVCFACVDAGEFRLAQMCGLHIVVHADELEDLINYYQDRGHFDELISLLEAALGLERAHMGMFTELAILYSKYKPAKMREHLELFWSRVNIPKVLRAAEQAHLWSELVFLYDKYEEYDNAALTMMQHPTEAWREGHFKDIITKVANMELYYKAIQFYLDYKPLLLNDLLLVLAPRMDHTRAVGFFTKAGHLQLVKAYLRSVQSLNNKAVNEALNSLLIDEEDYQGLRTSIDAFDNFDTIALAQQLEKHELTEFRRIAAYLYKGNNRWKQSVELCKKDALYADAMEYAAESRQADVAEELLDWFLERRNYECFSATLYQCYDLLKPDVVIELAWRHNIMDFAMPYLIQTVRELTTKVEKLEEADAKRSTESAEQEAKPAMIMEPQLMLTAGPSMAYPGVPAQSPYAYAAQAPSPAPYHGYGM-
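
The lengths reported in this record can be cross-checked against each other. The following is a minimal sketch, not part of the database: the function names are hypothetical, it assumes the 5088 bp transcript is coding sequence only (start codon through stop codon, no UTRs), and it assumes the genomic coordinates are 1-based and inclusive.

```python
def expected_protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of cds_bp nucleotides.

    Assumes the sequence starts at the first codon; if has_stop is True,
    the final codon is a stop codon and is not counted as a residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop else codons


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values copied from the record above.
transcript_bp = 5088
protein_aa = 1695
locus_start, locus_end = 1127856, 1145330

# 5088 / 3 = 1696 codons, minus one stop codon = 1695 residues.
assert expected_protein_length(transcript_bp) == protein_aa

# Span of the gene model on scaffold DPSCF300001.
print(genomic_span(locus_start, locus_end))
```

Under these assumptions the transcript and protein lengths agree, which is also consistent with the nucleotide sequence starting with ATG and ending with TAG, and with the trailing "-" in the protein sequence marking the stop.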