Monarch geneset OGS2.0

DPOGS203907
TranscriptDPOGS203907-TA2595 bp
ProteinDPOGS203907-PA864 aa
Genomic positionDPSCF300005 - 975788-979158
RNAseq coverage711x (Rank: top 18%)
Annotation
HeliconiusHMEL0104140.072.80% 
BombyxBGIBMGA011180-TA0.073.38% 
DrosophilagammaCop-PB0.056.36% 
EBI UniRef50UniRef50_Q9Y6780.054.41%Coatomer subunit gamma n=161 Tax=Eukaryota RepID=COPG_HUMAN
NCBI RefSeqNP_001036846.10.082.93%nonclathrin coat protein gamma1-COP [Bombyx mori]
NCBI nr blastpgi|1129839360.082.93%nonclathrin coat protein gamma1-COP [Bombyx mori]
NCBI nr blastxgi|1129839360.082.93%nonclathrin coat protein gamma1-COP [Bombyx mori]
Group
Gene OntologyGO:00054881.4e-107binding
GO:00068861.2e-103intracellular protein transport
GO:00301171.2e-103membrane coat
GO:00161921.2e-103vesicle-mediated transport
GO:00057981.3e-72Golgi-associated vesicle
GO:00051981.3e-72structural molecule activity
GO:00301262.2e-51COPI vesicle coat
KEGG pathway 
InterPro domain[5-864] IPR0171060Coatomer, gamma subunit
[19-592] IPR0119891.4e-107Armadillo-like helical
[31-537] IPR0025531.2e-103Clathrin/coatomer adaptor, adaptin-like, N-terminal
[16-589] IPR0160248.5e-90Armadillo-type fold
[605-863] IPR0148631.3e-72Coatomer, gamma subunit , appendage
[595-756] IPR0130418.5e-54Clathrin/coatomer adaptor, adaptin-like, appendage, Ig-like subdomain
[602-752] IPR0130402.2e-51Coatomer, gamma subunit, appendage, Ig-like subdomain
[758-863] IPR0090281.1e-21Clathrin/coatomer adaptor, adaptin-like, appendage, C-terminal subdomain
[762-864] IPR0158732.8e-15Clathrin alpha-adaptin/coatomer adaptor, appendage, C-terminal subdomain
Orthology groupMCL11188 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203907-TA
ATGAGCTCTTTTCGTCGTGACAAGAAGGAGGAAGAAGATACGGGAGGAAATGTCTTCCAGAATCTTGATAAGACTGCTCTTTTGCAAGAGGCTCGTTATTTCAACTCCACTCCTGTAATTCCAAAAAAATGTGTCCAGATCCTTACGAAAATTCTATACCTACTCAACCAAGGAGAGAAACTAACTACTCAAGAAGCCACAGATATATTTTTTGCGACAACTAAATTGTTTCAATCTAAGGATGTTGTTTTACGTCGTCTTGTGTATTTGTGCATTAAAGAATTAAGCTCAATGGCACAAGATGTAATCATCGTAACGTCCTCTTTGACCAAAGATATGACTGGCAAGGAGGATCTTTATCGAGCTGCAGCAATTCGTGCACTCTGCAGTATAACCGACAGTACTATGCTTCAAGCTATAGAACGGTACATGAAGCAGGCAATTGTGGATAAAAATCCTGCCGTCAGTTCAGCTGCTCTTGTGTCAGCTCTTCATCTATCTGCAACTGTTCCTGAACTGGTTAGGAGATGGGTGAATGATGCGCAGGAGGCAATTATGTCTGATAATGCAATGGTGTCATATCACGCCTTGGCAGTAGTAGCAGGTGCCAGACGGAATGATCGTTTATCAACTGTGAAATTGGTAACAAAACTATCTCGTTCACCTTTACGATCTCCATTTGCTTTATGCCTGTTAGTTAGATATGCTGCCAAATTAGCAGAAGAGGATCAAACTGAAGCATCAGAACCATACTTGGAATTTATAGAATGTTGCTTGCGTCACAAATCTGAGATTGTTGTTTATGAAGCTGCTCATGCTATTGTTAATTTGCGAAAGTCTGCTAGAGATCTTGCCCAAGCTGTGAGCGTTTTACAAATTTTCTGTGGTTCATCCAAGGCAACCCTTCGCCTTGCTGGAGCTCGTACCTTAGCGAAGTTAACAACAAAACATCCAAATGCTGTTGCAGCTTGTGCTGTTGACTTGGAAAACTTGATTTCAGATCCAAATCGCTCTGTTGCTACTTTAGCTGTGACAACTTTGCTTGCCACCGGAGCAGAAAGCTCAATTGATCGCCTAATGAAGCAAATTTCTACTTTCATGTCTGAAATATCAGATGAATTTAAAATTGTAGTTGTTCGTGCCATAAGGCGTCTTTGTACCAAATATCCAAGAAAGCATCAATCGCTGGCCTCATTTTTAGCTGGGATGTTGCGTGATGAAGGTGGCGTACAGTACAAGACAGCAATTGCCGATGCGATTATTGCCTTGATAGAAGAAAATCCAGATGCAAAAGAGACTGGTCTGGCTCATCTTTGTGAATTTATAGAAGATTGCGAACATACATCACTGGCAGTTAGGATTTTGTATGTTTTGGGTCGCGAAGGCCCTAAAGCTCGTCAGCCTTCTCGCTACATTAGATATATCTATAATCGAGTTATTTTGGAATCTGGCCCAGTTCGAGCTGCTGCTGTATCTGCAGTTGCTCGCTTTGGCGCTACTTGCGAGGATCTACTTCCAAATATTAGTGTATTGTTAGCTCGTTGTCAGCTTGACGATGATGATGAAGTTCGCGACAGAGCTATATTTTTTAATGCTATCTTGAATTCCAATGACCCTCAACTAATTAGCGATTATATCACAAACGTACCGACTCCAAACCCAGTTCTCTTGGAAAAAGCCCTTCATGATCACCTTAAAATGAATCCTGATAAACCATTCAACATTCTGTCGGTTCCATCATCAGAACCCACCAAAGAACCTGAAGAAGCCCCTGTTCAAATTGAAACTCGTAAACAGCCAGTTGTTTCTCTTGAAGAATTGTATTCTGAACAATTGAAAGTGGTTCCCGGCATTGAAAAACTCGGTCCACCATTTAAGACATGCAAAGCTATTGACCTTACTGAGCCAGAAACCGAATATCGTGTACGCTGTGTTAAACACATCTTTGCACGACATTTAATTTTGCAGTTTGAATGTTTGAACACACTTAGCGATCAGCTTTTGGAAAAAGTCCGCGTCCGCCTGGAAACGGCTCCTGGGTATAAAATCCTGTGTGAAGTTCCATGCGAGCAACTACCCTATGATAAACAAGGTAGTGTGTTTTGTCTCTTAGAATTCCCTCATGGTCCGATAGAAACCTTGGGCACCTTTGGGGCTACCCTTGAATTTTCAGTTCGAGATTGCGATCCTACCACTGGACTGCCTGATGGCGGTGATGGATATTCCGATACGTATCCTTTAGAGGAATTAGATATTGGATGCGCTGAGCAGTTCCGCGCTCACGTGGCTACTGATGATTGGGAAGCTTCATGGGAAAGAACAGCGTCAGCCGCCGAAGCTTCCGATACCTTTGTACTATCACAGTCAGATATTAATGAAGCAGCATCGGCAGTGTGCACCCATTTAGGTTTACCTAAAGCTGCCATATCAGGTGACGCTGTAAAGGAAATTCGTGGTGGAGGTTTGTGGCGCGAAGGCACACCGATGTTGGTACGTGCACGTCTGGTGGCATCTCAAGGGTCTGTGACTATGAAACTCACGGCTCGCTCTCCTCGTGAAGATGTTGCTACTCTACTTTTAGCTGCAGTTGGCTAA

Protein sequence:

>DPOGS203907-PA
MSSFRRDKKEEEDTGGNVFQNLDKTALLQEARYFNSTPVIPKKCVQILTKILYLLNQGEKLTTQEATDIFFATTKLFQSKDVVLRRLVYLCIKELSSMAQDVIIVTSSLTKDMTGKEDLYRAAAIRALCSITDSTMLQAIERYMKQAIVDKNPAVSSAALVSALHLSATVPELVRRWVNDAQEAIMSDNAMVSYHALAVVAGARRNDRLSTVKLVTKLSRSPLRSPFALCLLVRYAAKLAEEDQTEASEPYLEFIECCLRHKSEIVVYEAAHAIVNLRKSARDLAQAVSVLQIFCGSSKATLRLAGARTLAKLTTKHPNAVAACAVDLENLISDPNRSVATLAVTTLLATGAESSIDRLMKQISTFMSEISDEFKIVVVRAIRRLCTKYPRKHQSLASFLAGMLRDEGGVQYKTAIADAIIALIEENPDAKETGLAHLCEFIEDCEHTSLAVRILYVLGREGPKARQPSRYIRYIYNRVILESGPVRAAAVSAVARFGATCEDLLPNISVLLARCQLDDDDEVRDRAIFFNAILNSNDPQLISDYITNVPTPNPVLLEKALHDHLKMNPDKPFNILSVPSSEPTKEPEEAPVQIETRKQPVVSLEELYSEQLKVVPGIEKLGPPFKTCKAIDLTEPETEYRVRCVKHIFARHLILQFECLNTLSDQLLEKVRVRLETAPGYKILCEVPCEQLPYDKQGSVFCLLEFPHGPIETLGTFGATLEFSVRDCDPTTGLPDGGDGYSDTYPLEELDIGCAEQFRAHVATDDWEASWERTASAAEASDTFVLSQSDINEAASAVCTHLGLPKAAISGDAVKEIRGGGLWREGTPMLVRARLVASQGSVTMKLTARSPREDVATLLLAAVG-