Monarch geneset OGS2.0

DPOGS204923
TranscriptDPOGS204923-TA1794 bp
ProteinDPOGS204923-PA597 aa
Genomic positionDPSCF300766 + 252-7620
RNAseq coverage429x (Rank: top 28%)
Annotation
HeliconiusHMEL0131840.061.65% 
BombyxBGIBMGA010725-TA0.071.68% 
DrosophilaalphaCop-PB0.063.84% 
EBI UniRef50UniRef50_F4WHH30.066.03%Coatomer subunit alpha n=16 Tax=Coelomata RepID=F4WHH3_ACREC
NCBI RefSeqNP_001166192.10.080.00%coatomer protein complex subunit alpha [Bombyx mori]
NCBI nr blastpgi|2896292160.080.00%coatomer protein complex subunit alpha [Bombyx mori]
NCBI nr blastxgi|2896292160.080.00%coatomer protein complex subunit alpha [Bombyx mori]
Group
Gene OntologyGO:00068862.7e-56intracellular protein transport
GO:00301172.7e-56membrane coat
GO:00051982.7e-56structural molecule activity
GO:00161922.7e-56vesicle-mediated transport
GO:00055151.9e-37protein binding
KEGG pathwayame:5508060.0 
 K05236 (COPA)maps-> Neuroactive ligand-receptor interaction
InterPro domain[204-467] IPR0066922.7e-56Coatomer, WD associated region
[60-192] IPR0159431.9e-37WD40/YVTN repeat-like-containing domain
[2-285] IPR0110464.8e-30WD40 repeat-like-containing domain
[61-96] IPR0197811.8e-08WD40 repeat, subgroup
[58-97] IPR0016801.9e-06WD40 repeat
[12-26] IPR0204722.4e-06G-protein beta WD-40 repeat
Orthology groupMCL11194 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204923-TA
ATGTGTGCACAGTTTCATCCCACCGAAGATCTTCTAGTGTCAGCTTCGTTGGACCAATCTGTTAGAGTCTGGGACTTCTCTGGGCTGAGGAAGAAGAGTGTGGCACCGGGACCCACAGGGCTAGCCGATCATCTCAGGAATCCACAAGCTACGGACCTCTTTGGACAGGCGGATGCAGTTGTAAAGCATGTGTTGGAAGGGCATGATCGCGGAGTGAACTGGGCGTCTTTCCATCCAAACCTCCCGCTCATAGTGTCCGCAGCTGATGATAGACAAGTCAAACTGTGGCGGATGAACGATGCCAAGGCTTGGGAGGTAGACACGTGTCGCGGGCACTACAACAACGTGTCATGTGCGCTGTTCCACGCTAGACACGAGCTGATACTGTCGAATAGTGAGGATAAATCCATCAGGGTCTGGGATATGACGAAGCGTGTCTGTTTGCACACCTTCAGGAGAGAACACGAACGCTATTGGGTGCTATCATCACATCCGACCCTCAACCTATTTGCCGCTGGTCACGACGCTGGTATGATATTGTTCAAACTTCAACGAGAAAGACCAGCATATGCCATACATAATAATATGCTCTTCTATATTAAGGACAGACAGCTTCGTAAACTGGATATGTCAACTAATAGAGATGCCCCGGTTATGCAGATCAAGGGTGGCGGAAGACATCAACCTTACAGTATGTCGCTGAATCACGCTGAGTGGTGCGTGCTTGTGTCGTGGCGTGTTGGTGACACGCATACGTACGAACTATACAACGTTTCGAGAGACGGTGAGGCTGCGAGTACCGCTGAGCCAATGAGGGGACACGCTACTACGGCTGTGTGGGTCGCTAGGAATAGATTTGCTGTATTGGAGAAGAACAATCAGCTGATAATAAAGAATCTGAAGAACGAGGTGTCCAAGAAGATAGCGACTCCGACGTGTGAGGAGATCATGTACGCCGGTACTGGGATGCTGTTACTCAGGGAGGTCGATGCTGTGCAGCTCCTGGACGTGCAGCAGAAGAGGACCGTGGCCAGTGTGAAAGTATCCAAATGCCGTTACGCTATTTGGAACTCGGATATGTCGCTAGTGGCGCTCCTTGGGAAGCATACGGTGACGATATGTACCAAGAAGCTAGAACAGCTGTGCTCTATCACCGAAGGGGCGCGGGTCAAGTCGGGGGCGTTTGATGATTCAAATTCACACCCAGTCTTCATATACACGACGGCCAATCACATCAAATATTGCTGCAAAGAAGGAGATCACGGGATTATCCGTACGTTGGATGTGCCGGTGTATGCGGTGAAGGTGATAGCGAACGAAGCTGGGGCGAGAGTTGTATGTCTCGATAGAGAGGCCCGTCCCAAAGTACTCAACATTGACCCCACAGAATACAGATTATCAAAAAGATTCGGCTATCCAGAAGTAGCGCTTCACTTCGTGAAGGATGCCCGGACGAGGTTGGAATTGTCACTACAGTGTGGTAACATAGAAGTGGCTTTGGAAGCTGCTAAGAGTTTGGACGATCCTGATGCATGGGAGAAACTCGCTGATGCGGCGAGAAACGCCGGAAACCATCAGATTGTTGAGATGTGCTACCAGCGTACAAAGAACTTTGACAAGCTCTCGTTCCTGTACCTCATCACCGGCAACCTTGACAAGTTGAGGAAGATGATGAAAATAGCGGAGATAAGAAAAGATGTCTCCTCACAGTTCCAAGGAGCGCTGCTGTTAGGAGATGTGGCTGAGAGAATAAGGCTGTTGAAAAACGCTGGACAGGTCTATGGAACTAAATAA

Protein sequence:

>DPOGS204923-PA
MCAQFHPTEDLLVSASLDQSVRVWDFSGLRKKSVAPGPTGLADHLRNPQATDLFGQADAVVKHVLEGHDRGVNWASFHPNLPLIVSAADDRQVKLWRMNDAKAWEVDTCRGHYNNVSCALFHARHELILSNSEDKSIRVWDMTKRVCLHTFRREHERYWVLSSHPTLNLFAAGHDAGMILFKLQRERPAYAIHNNMLFYIKDRQLRKLDMSTNRDAPVMQIKGGGRHQPYSMSLNHAEWCVLVSWRVGDTHTYELYNVSRDGEAASTAEPMRGHATTAVWVARNRFAVLEKNNQLIIKNLKNEVSKKIATPTCEEIMYAGTGMLLLREVDAVQLLDVQQKRTVASVKVSKCRYAIWNSDMSLVALLGKHTVTICTKKLEQLCSITEGARVKSGAFDDSNSHPVFIYTTANHIKYCCKEGDHGIIRTLDVPVYAVKVIANEAGARVVCLDREARPKVLNIDPTEYRLSKRFGYPEVALHFVKDARTRLELSLQCGNIEVALEAAKSLDDPDAWEKLADAARNAGNHQIVEMCYQRTKNFDKLSFLYLITGNLDKLRKMMKIAEIRKDVSSQFQGALLLGDVAERIRLLKNAGQVYGTK-