Monarch geneset OGS2.0

DPOGS211917
Transcript        DPOGS211917-TA  747 bp
Protein           DPOGS211917-PA  248 aa
Genomic position  DPSCF300011 + 120939-122638
RNAseq coverage   1843x (Rank: top 7%)
Annotation
Heliconius      HMEL017712        1e-76  68.65%
Bombyx          BGIBMGA001060-TA  8e-88  67.34%
Drosophila      Clc-PC            1e-26  38.17%
EBI UniRef50    UniRef50_B0XF42   2e-29  40.22%  Clathrin light chain n=10 Tax=Endopterygota RepID=B0XF42_CULQU
NCBI RefSeq     XP_001944260.1    2e-44  45.59%  PREDICTED: similar to clathrin light chain [Acyrthosiphon pisum]
NCBI nr blastp  gi|328725616      2e-43  46.74%  PREDICTED: clathrin light chain-like [Acyrthosiphon pisum]
NCBI nr blastx  gi|328725616      8e-45  45.38%  PREDICTED: clathrin light chain-like [Acyrthosiphon pisum]
Group
Gene Ontology   GO:0006886  1.6e-39  intracellular protein transport
                GO:0030132  1.6e-39  clathrin coat of coated pit
                GO:0030130  1.6e-39  clathrin coat of trans-Golgi network vesicle
                GO:0005198  1.6e-39  structural molecule activity
                GO:0016192  1.6e-39  vesicle-mediated transport
KEGG pathway    rno:83800  8e-23
                K04644 (CLTA, LCA) maps-> Huntington's disease
                                          Endocytosis
                                          Lysosome
                                          Bacterial invasion of epithelial cells
InterPro domain  [2-249]  IPR000996  1.6e-39  Clathrin light chain
Orthology group  MCL12627  Single-copy universal gene
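
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the transcript is coding sequence only (one stop codon, no UTRs) and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 747
PROTEIN_AA = 248
GENOMIC_START, GENOMIC_END = 120939, 122638


def cds_length(n_residues: int) -> int:
    """Length in bp of a coding sequence: one codon per residue plus one stop codon."""
    return 3 * (n_residues + 1)


def span_length(start: int, end: int) -> int:
    """Length of a genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # 3 * (248 + 1) = 747, so the transcript and protein lengths agree.
    print(cds_length(PROTEIN_AA) == TRANSCRIPT_BP)   # True
    # Genomic span covered by the gene model.
    genomic_bp = span_length(GENOMIC_START, GENOMIC_END)
    print(genomic_bp)                                # 1700
    # Genomic bases not present in the transcript.
    print(genomic_bp - TRANSCRIPT_BP)                # 953
```

Under these assumptions the genomic span is 953 bp longer than the transcript, which would be consistent with one or more introns. The record itself does not list the exon structure.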

Nucleotide sequence:

>DPOGS211917-TA
ATGGATGATTTTGGAGACAACTTTGTACAAACTGAAGTAGATCCAGCAGCAGAATTCTTAGCTCGGGAACAAAACCAGCTAGCAGGCCTCGAAGATGAACTGGAAACCGGTGCTCCGCCCCCACTTGCTGTCTCATCGGGCAGCAATGGACTTGACGATTTCGTTGAAATACCAAGTTCTGCAGTATTTGAAGCTAATGACTTGATGGATGAAGAGCCATCCGCACCTGCCGTGGCTCCTGTGACACCAGTGTTCCGTCAGGAAAGAGAAGAACCAGAAAAGATAAAATTGTGGAGGGAAGAACAGAAAAAAAGGTTGGAGGAAAAAGATGCCGAGGAAGAAGAAAAGAAGCAGGAAATGTTAAAAGTAGCTAAAAAAGAGCTGGAGGATTGGTATAAAACTCACGAAGAACAAATATCTAAGACCAAAGCAGCTAACAGGCATCACGTTGAGACTGTACTAATAAAAAATTTCCTTTTCAATGCTGGCAGGGAGTCCGCTAATTCAACTGATACATGTGTGACTGTGATATTAAACAACATTTGTCCAAACAGAAATGCCGAGAAGGCTTTGGCAAGGGGTTCGGAAGACGGACTCGAGGATTCCAATGAATGGGCACGCGTTTCCGAACTGTGTGACTTTGGACCGAGACGTGGCCGCGATGTAGCACGTCTACGGTCTATTGTACTGCAGTTGAAGCAGTCAGGAGTTCGTCCCAAATATCCACCACGCACCACCAAGGTGTAA

Protein sequence:

>DPOGS211917-PA
MDDFGDNFVQTEVDPAAEFLAREQNQLAGLEDELETGAPPPLAVSSGSNGLDDFVEIPSSAVFEANDLMDEEPSAPAVAPVTPVFRQEREEPEKIKLWREEQKKRLEEKDAEEEEKKQEMLKVAKKELEDWYKTHEEQISKTKAANRHHVETVLIKNFLFNAGRESANSTDTCVTVILNNICPNRNAEKALARGSEDGLEDSNEWARVSELCDFGPRRGRDVARLRSIVLQLKQSGVRPKYPPRTTKV-
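
The protein sequence above appears to be the conceptual translation of the nucleotide sequence, with the stop codon written as "-". The following is a minimal sketch of that translation. It assumes the standard genetic code and reading frame 1, and `translate` is a hypothetical helper, not a tool used by the database.

```python
# Standard genetic code, built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1, writing stop codons as stop_symbol."""
    seq = nucleotides.upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        # Raises KeyError on ambiguous bases such as N; this sketch does not handle them.
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 and last 15 bases of DPOGS211917-TA, copied from the record.
    print(translate("ATGGATGATTTTGGAGACAACTTTGTACAA"))  # MDDFGDNFVQ
    print(translate("ACCACCAAGGTGTAA"))                 # TTKV-
```

Both fragments match the start (MDDFGDNFVQ) and end (TTKV-) of DPOGS211917-PA. Passing the full nucleotide sequence to the same function would allow a check of the whole protein.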