Monarch geneset OGS2.0

DPOGS200469
TranscriptDPOGS200469-TA2592 bp
ProteinDPOGS200469-PA863 aa
Genomic positionDPSCF300260 + 302718-305934
RNAseq coverage304x (Rank: top 37%)
Annotation
HeliconiusHMEL0104140.075.46% 
BombyxBGIBMGA011180-TA0.072.95% 
DrosophilagammaCop-PB0.054.12% 
EBI UniRef50UniRef50_Q9Y6780.052.00%Coatomer subunit gamma n=161 Tax=Eukaryota RepID=COPG_HUMAN
NCBI RefSeqNP_001036846.10.066.47%nonclathrin coat protein gamma1-COP [Bombyx mori]
NCBI nr blastpgi|76374100.073.09%nonclathrin coat protein gamma2-COP [Bombyx mori]
NCBI nr blastxgi|76374100.073.17%nonclathrin coat protein gamma2-COP [Bombyx mori]
Group
Gene OntologyGO:00054887.6e-107binding
GO:00068862.9e-102intracellular protein transport
GO:00301172.9e-102membrane coat
GO:00161922.9e-102vesicle-mediated transport
GO:00057981.1e-61Golgi-associated vesicle
GO:00051981.1e-61structural molecule activity
GO:00301265.6e-43COPI vesicle coat
KEGG pathway 
InterPro domain[5-863] IPR0171060Coatomer, gamma subunit
[19-584] IPR0119897.6e-107Armadillo-like helical
[25-537] IPR0025532.9e-102Clathrin/coatomer adaptor, adaptin-like, N-terminal
[1-579] IPR0160241.4e-88Armadillo-type fold
[603-862] IPR0148631.1e-61Coatomer, gamma subunit , appendage
[593-754] IPR0130412.5e-47Clathrin/coatomer adaptor, adaptin-like, appendage, Ig-like subdomain
[596-750] IPR0130405.6e-43Coatomer, gamma subunit, appendage, Ig-like subdomain
[756-862] IPR0090286e-16Clathrin/coatomer adaptor, adaptin-like, appendage, C-terminal subdomain
[753-863] IPR0158732.7e-10Clathrin alpha-adaptin/coatomer adaptor, appendage, C-terminal subdomain
Orthology groupMCL11188 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200469-TA
ATGAGCTCCTTTAAGCGTGATAAGAAGGAAGAAGAAGAGGGTGGGTGTTTTTCATTCCAGAACTTAGATAAAACCATCGTTTTGCAAGAATCGAGATATTTCAACGAGACGCCGGTGAACCCTCGGAAATGTACACAAATTCTAACGAAAATACTATTTCTGCTCAACCAAGGAGAGACATTCTCCACTCAGGAAGCTACAGAGGCTTTCTTCGCCATCACGAAGCTTTTCCAATCCAATGATGTTATTTTGCGGCGTATGGTGTATCTTTGCATCAAGGAATTGAGCAAACTGGCTCAGGATGTGATTATAGTTACATCTTCCTTAACCAAAGACATGACGGGCAAGGAGGATCTGTATAGGGCTGCGGCTATCCGAGCGCTGTGCAGTGTCACCGATAGCACCATGATTCAAGCTATTGAAAGATACATGAAGCAAGCTATCGTTGATAGAAACCCCGCGGTCAGTTCTGCGGCTCTGGTGTCGTCTTTGCATCTATCATCTACTTCACCGGATCTGGTGAAGCGATGGACGAGCGAGGCACAGGAAGCGTTGAACAGCGAAAAGTCACTCGTCTCGTACCATGCACTCGGAATACTTGTGAATATTCGTAGAACTGACAAGCTGTCAACCATGAAGCTAGTGACACGTCTCACCAAATCCTCAATAAAATCACCATACACTCTGTGCTTATTGATTCGTCTCGCTGCTCAACTAGTTGAAGATGATGCTTCGGAGACATCCCAAGCTTACATCGAGTTCATTGATGGCTGTCTCCGGCACAAATCAGAAATGGTGATTTATGAAGCTGCTCACGCGATTGTCAACTTGCGTAAAACAACACGAGATTTAGCTCCGGCTGTGTCAGTGCTACAGCTGTTCTGTGGTTCCTCCAAAGCGACCTTGAGACTCGCTGGAGCTAGAACCCTCGCCAAACTGACGACTAAACATCCAGCTGCAGTATCAGCCTGCACCATCGACCTGGAAAATCTGATCTCCGATCCAAACAGATCAGTTGCTACGCTGGCAGTCACGACTCTGCTGGCTACAGGCGCTGAGAGCTCCATCGATCGTCTCATGAAGCAGATCTCTAGCTTCGTTTCGGAGATATCGGACGAGTTTAAAATTATCGTAGTCAAGGCTATCAAACGCCTGTGCCTCAAGTTTCCTAGGAAGCACCAATCGTTGGCTACATTTTTGGCCGGCATGCTCCGCGATGAAGGTGGTTTAGATTATAAAGCGGCCATTGCTGACGCGATCATCGCTCTGGTGGAAGAAAACCCCGATGCCAAAGAGACAGGACTGGCTCATCTGTGCGAATTTATCGAAGATTGCGAGCACCAAGTCCTGTCGGTTCGGATTTTGCATTTATTGGGACGCGAAGGCCCTAAAACTCGCCATCCCACGAGATACATCAGATTCATCTACAATAGGGTCATCCTGGAGACGGGGCCGGTCAGGGCGGCTGCGGTCTCTGCTGTGGCGCAGTTCGGAGCCCACATTCCTGAACTCCTGCCAAATATAAGAGTTCTGTTAAGTCGCTGTGAAACCGACGAGGAAGATGAAGTGAGGGATCGCGCGATCTTCTTCAACGCGATCTTCAATTCGGGCAACGAGAAATTGATAAGAGATTATATCACCCACGTACCGAGAGTGAACCCGGTTCTGTTGGAAAAAGCCCTCCATGATCATGCCAAGAACAGGCCGAACGAACCCTTCGACATCCTGAGCGTCCCGGAGATGGAAAAACCGAAGAGAGAAGAAGTGGTAGAGATCGATGTGAAGCAACCGAAACAGATAACAATCGAGGAGATTTACAGCCAGCAGCTCGCGAAAATACCGGGCATCGAGAAATTGGGGACTATTTTCAAAACCAATAATCCGGTCGAACTCACGGAAGAGGACACAGAGTTCCAAGTCCGTCTGATTAAACACATCTACGTTCGTCACGTCGTCCTGCAGTTTGAATGCACGAGCACTATCAACTTCCATGTCTTCGAGAATGTGACCGTAAAATTGGATCTGCCCAACGAGTTCGAAGTGAAGAACATGGTGCCCATCAAGTCGTTGGCTTTCAACAGACCCGAGAGCATTTTTGTAATCGTGGAATTCCCCTGCTCGTTTCTGGACAGCATGAACCCCTTCGGTGCCATCCTGGAGTTTGTGACACGCGAATGTCACCCAATCACTTGTATGCCAAACCCCGGCCCAGGGTACATAGACACTTATCCTATCGAGGACTTCTACATCAGCTGCGCCGATCAAATACGCACGCGAGTCACCGGCGATGACTGGGAGCAGACTTGGGAGAGCGCTTTCAACGTCATCGAGATTTCAGATACTTTCTCTCTCCCTCAGCGAGACGCTGCGGCCGCTGCTAAGTCGGTTTGCGAATATCTCGGTCTACCGAAAGGTTCCATCACCGGGGACACGGTTAAGGAGATAAGGGGGGCCGGTATTTTTAGGGGCGGAGCGCCTTTCTTGGTCAGGGCTCGCATAGCGCCGACGAGCGCTGGAACTGCTACTATGCTGATCGCAGCACGATCTCCGAGAGAAGACGTGGCACAACTACTACTCAACGCTGTAGGTTAA

Protein sequence:

>DPOGS200469-PA
MSSFKRDKKEEEEGGCFSFQNLDKTIVLQESRYFNETPVNPRKCTQILTKILFLLNQGETFSTQEATEAFFAITKLFQSNDVILRRMVYLCIKELSKLAQDVIIVTSSLTKDMTGKEDLYRAAAIRALCSVTDSTMIQAIERYMKQAIVDRNPAVSSAALVSSLHLSSTSPDLVKRWTSEAQEALNSEKSLVSYHALGILVNIRRTDKLSTMKLVTRLTKSSIKSPYTLCLLIRLAAQLVEDDASETSQAYIEFIDGCLRHKSEMVIYEAAHAIVNLRKTTRDLAPAVSVLQLFCGSSKATLRLAGARTLAKLTTKHPAAVSACTIDLENLISDPNRSVATLAVTTLLATGAESSIDRLMKQISSFVSEISDEFKIIVVKAIKRLCLKFPRKHQSLATFLAGMLRDEGGLDYKAAIADAIIALVEENPDAKETGLAHLCEFIEDCEHQVLSVRILHLLGREGPKTRHPTRYIRFIYNRVILETGPVRAAAVSAVAQFGAHIPELLPNIRVLLSRCETDEEDEVRDRAIFFNAIFNSGNEKLIRDYITHVPRVNPVLLEKALHDHAKNRPNEPFDILSVPEMEKPKREEVVEIDVKQPKQITIEEIYSQQLAKIPGIEKLGTIFKTNNPVELTEEDTEFQVRLIKHIYVRHVVLQFECTSTINFHVFENVTVKLDLPNEFEVKNMVPIKSLAFNRPESIFVIVEFPCSFLDSMNPFGAILEFVTRECHPITCMPNPGPGYIDTYPIEDFYISCADQIRTRVTGDDWEQTWESAFNVIEISDTFSLPQRDAAAAAKSVCEYLGLPKGSITGDTVKEIRGAGIFRGGAPFLVRARIAPTSAGTATMLIAARSPREDVAQLLLNAVG-