Monarch geneset OGS2.0

DPOGS204802
TranscriptDPOGS204802-TA3525 bp
ProteinDPOGS204802-PA1174 aa
Genomic positionDPSCF300460 + 94516-108198
RNAseq coverage431x (Rank: top 28%)
Annotation
HeliconiusHMEL0131841e-16557.30% 
BombyxBGIBMGA010725-TA0.067.74% 
DrosophilaalphaCop-PB0.065.30% 
EBI UniRef50UniRef50_F4WHH30.055.54%Coatomer subunit alpha n=16 Tax=Coelomata RepID=F4WHH3_ACREC
NCBI RefSeqNP_001166192.10.072.02%coatomer protein complex subunit alpha [Bombyx mori]
NCBI nr blastpgi|2896292160.072.02%coatomer protein complex subunit alpha [Bombyx mori]
NCBI nr blastxgi|2896292160.071.96%coatomer protein complex subunit alpha [Bombyx mori]
Group
Gene OntologyGO:00068867.9e-148intracellular protein transport
GO:00301267.9e-148COPI vesicle coat
GO:00055157.9e-148protein binding
GO:00051987.9e-148structural molecule activity
GO:00161927.9e-148vesicle-mediated transport
GO:00301171.1e-61membrane coat
KEGG pathwaybta:1001260410.0 
 K05236 (COPA)maps-> Neuroactive ligand-receptor interaction
InterPro domain[1-1169] IPR0163910Coatomer, alpha subunit
[784-1172] IPR0107147.9e-148Coatomer, alpha subunit, C-terminal
[4-316] IPR0159433.5e-74WD40/YVTN repeat-like-containing domain
[329-612] IPR0066921.1e-61Coatomer, WD associated region
[9-305] IPR0110468.7e-58WD40 repeat-like-containing domain
[227-535] IPR0110482.8e-08Cytochrome cd1-nitrite reductase-like, C-terminal haem d1
[185-220] IPR0197814e-08WD40 repeat, subgroup
[182-221] IPR0016801.9e-06WD40 repeat
Orthology groupMCL11194 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204802-TA
ATGCTGACAAAATTCGAAACAAAGTCAGCTAGAGTGAAAGGGATATCGTTTCACGCGAAGCGACCGTGGGTTTTGACGAGTCTTCATAATGGTGTAATTCAACTTTGGGACTATCGTATGTGTACATTGCTGGAGAAATTTGATGAACATGATGGTCCCGTCCGCGGTATTTGCTTCCACATTCAACAGCCTTTGTTCGTTTCCGGCGGTGACGATTATAAAATAAAGGTTTGGAATTACAAGCAAAGAAGATGCCTTTTCACCCTACTCGGTCACTTGGACTACATTCGCACAACTTTCTTTCATCATGAGTACCCTTGGATCTTAAGTGCTTCAGATGATCAGACAATAAGAATATGGAACTGGCAATCCCGTCAATGCATCAGTGTCCTGACTGGGCACAACCACTATGTAATGTGTGCACAGTTTCATCCCACCGAAGATCTTCTAGTGTCAGCTTCCTGTGATCACGGGCAGGTGAGGAAAATGTTGCCCTTTTCATCTTTTTTTGTGAAATTAAAAAGTGTGAGCCGCCATTGTGCGGATGCAGTTGTAAAGCATGTGTTGGAAGGGCATGATCGCGGAGTGAACTGGGCGTCTTTCCATCCAAACCTCCCGCTCATAGTGTCCGCAGCTGATGATAGACAAGTCAAACTGTGGCGGATGAACGATGCCAAGGCTTGGGAGGTAGACACGTGTCGCGGGCACTACAACAACGTGTCATGTGCGCTGTTCCACGCTAGACACGAGCTGATACTGTCGAATAGTGAGGATAAATCCATCAGGGTCTGGGATATGACGAAGCGTGTCTGTTTGCACACCTTCAGGAGAGAACACGAACGCTATTGGGTGCTATCATCACATCCGACCCTCAACCTATTTGCCGCTGGTCACGACGCTGGTATGATATTGTTCAAACTTCAACGAGAAAGACCAGCATATGCCATACATAATAATATGCTCTTCTATATTAAGGACAGACAGCTTCGTAAACTGGATATGTCAACTAATAGAGATGCCCCGGTTATGCAGATCAAGGGTGGCGGAAGACATCAACCTTACAGTATGTCGCTGAATCACGCTGAGTGGTGCGTGCTTGTGTCGTGGCGTGTTGGTGACACGCATACGTACGAACTATACAACGTTTCGAGAGACGGTGAGGCTGCGAGTACCGCTGAGCCAATGAGGGGACACGCTACTACGGCTGTGTGGGTCGCTAGGAATAGATTTGCTGTATTGGAGAAGAACAATCAGCTGATAATAAAGAATCTGAAGAACGAGGTGTCCAAGAAGATAGCGACTCCGACGTGTGAGGAGATCATGTACGCCGGTACTGGGATGCTGTTACTCAGGGAGGTCGATGCTGTGCAGCTCCTGGACGTGCAGCAGAAGAGGACCGTGGCCAGTGTGAAAGTATCCAAATGCCGTTACGCTATTTGGAACTCGGATATGTCGCTAGTGGCGCTCCTTGGGAAGCATACGGTGACGATATGTACCAAGAAGCTAGAACAGCTGTGCTCTATCACCGAAGGGGCGCGGGTCAAGTCGGGGGCGTTTGATGATTCAAATTCACACCCAGTCTTCATATACACGACGGCCAATCACATCAAATATTGCTGCAAAGAAGGAGATCACGGGATTATCCGTACGTTGGATGTGCCGGTGTATGCGGTGAAGGTGATAGCGAACGAAGCTGGGGCGAGAGTTGTATGTCTCGATAGAGAGGCCCGTCCCAAAGTACTCAACATTGACCCCACAGAATACAGGTTCAAACTGGCTCTGGTGACCCGTCAGTACGACCAGGTTCTTCATATGCCGGTTCCCGTGCGTCAACGTAACGCGAGATCTGCCGAGAATATCGCCGCTGTGCGCGACAGTGTCCTCGAAAACCCGCGGCAGTCAATTCCGCGTCGCGCACAGGAACTCGGCCTTTCGCAGACGACAACTTGGCGAATTTTGCGTTGTGACTTGAGCCTGCACCCGTACAAGATCCAGCTGACCCAAGAGCTCAAGGTTAATGACCATAGACAGCGCCGTGTGTTCGCTGACTGGGCATTAGAGCAGTTGGAAGTTGACGCCGATTTTGGCAAAAAAATCATCTTCAGCGACGAGGCGCATTTTTGGATGAATGGCTATCTCTCCCTGGCATACCTGACAGCGATCAATCACAAGCAGACAAGTGAGGCTGAGCAATTGAAGGTGGCGTTGGAAGCGGCCAATCTCCCCGTACCCAACAAGAACCCTAAAGCTGTCGTTCTGCGACCACCCGTACCTGTACAGAAAGCTCAATCTAACTGGCCGCTTCTATCTATATCTAAGAGCTTCTTCGAAGTGGCGGGTGCTCGGTCGGAGGGTCCGAGGGTTGCGGGCCAGGAGGCTATACACGAGGACGAACCTCTGGAAGCAACAGGCGCCTGGGGCGATGATGATTTAGGCGATAACAAGGAGGTTGATGGTGATGTGGTCATTGACCAAGAAGACGCTTGTGAGGACGGAGGCTGGGATGTTGGTGACGAGGACTTGGACCTGCCCGAACTACCACCTGTTGCTGCCGAAGAATCATCTGAGTCCTCGTTCTTCGTCGCTCCGCCCCGCGGGACGCGAGCCCCCGCGGTCTTGAGAACAGCGCACGACCATGTCGCTACAGGAAACATAGAAGGAGCTATGAGGTTACTGAACGAGCAAGTCGGGATTGTGAACTTCGAGCCCTACCTAGAGACATTCCTCTCGATGTATTCCACTGCGAGGGTCATGTTCGCTGGTCTGCCACAGCTGCCGCCGCTGGTGACTCTCTTACACAGAAACTGGAAGGAGGCCAGCGGGAAAGACCTGCTGCCCGTTATCACCGCCAAGCTGACGGATCTGGTGAACCAGCTGCAGCAGTGCTACCAACTGACCACTAGCGGCAAGTTCCCGGAGGCGCTCGTCCGTCTGGAGAGGGTGGTGAGGCTGGTGCCGCTGCTGGTGGTCGACACCCGCCAGGAGCTCGTGGAGGCGCAGCAGCTGATGACCATCAGCAGGGAGTACCTGCTCGGACTCAGGATGGAAACGGCGAGGAAGGCCATGCCGAAGAACACGCTCGAGGAACAAATAAGGACGTGCGAGATGGCAGCTTACTTCACTCACTGCAAGCTGCAGCCGGTCCATCAGATCCTGACGCTGCGGACGGCGCTGAACATGTTCTTCAAGCTGAAGAACTACAAGACAGCTGCGTCCTTCGCGCGGCGGCTGCTGGAGCTGGGACCGAGACCGGAAGTGGCCCAACAGGCGAGGAAAATACTACAGGCCTGCGAAAAAACGCCCACCGACGAACACCAGCTGTCGTACGACGAGCACAATCCGTTCAATATATGCGGGATAAGCTATAAGCCGATATACAGAGGCAAGCCGGAGGAGAAGTGCTCGCTCTGCAGCGCCAGCTTCATGCCCGAGCATAAAGGGAAGCTGTGCCCCGTGTGCGGCGTCGCTGAAATAGGCAAAGACGTCCTCGGACTGAGAATCTGTGCTGTCCAGTTTCAGAGATAA

Protein sequence:

>DPOGS204802-PA
MLTKFETKSARVKGISFHAKRPWVLTSLHNGVIQLWDYRMCTLLEKFDEHDGPVRGICFHIQQPLFVSGGDDYKIKVWNYKQRRCLFTLLGHLDYIRTTFFHHEYPWILSASDDQTIRIWNWQSRQCISVLTGHNHYVMCAQFHPTEDLLVSASCDHGQVRKMLPFSSFFVKLKSVSRHCADAVVKHVLEGHDRGVNWASFHPNLPLIVSAADDRQVKLWRMNDAKAWEVDTCRGHYNNVSCALFHARHELILSNSEDKSIRVWDMTKRVCLHTFRREHERYWVLSSHPTLNLFAAGHDAGMILFKLQRERPAYAIHNNMLFYIKDRQLRKLDMSTNRDAPVMQIKGGGRHQPYSMSLNHAEWCVLVSWRVGDTHTYELYNVSRDGEAASTAEPMRGHATTAVWVARNRFAVLEKNNQLIIKNLKNEVSKKIATPTCEEIMYAGTGMLLLREVDAVQLLDVQQKRTVASVKVSKCRYAIWNSDMSLVALLGKHTVTICTKKLEQLCSITEGARVKSGAFDDSNSHPVFIYTTANHIKYCCKEGDHGIIRTLDVPVYAVKVIANEAGARVVCLDREARPKVLNIDPTEYRFKLALVTRQYDQVLHMPVPVRQRNARSAENIAAVRDSVLENPRQSIPRRAQELGLSQTTTWRILRCDLSLHPYKIQLTQELKVNDHRQRRVFADWALEQLEVDADFGKKIIFSDEAHFWMNGYLSLAYLTAINHKQTSEAEQLKVALEAANLPVPNKNPKAVVLRPPVPVQKAQSNWPLLSISKSFFEVAGARSEGPRVAGQEAIHEDEPLEATGAWGDDDLGDNKEVDGDVVIDQEDACEDGGWDVGDEDLDLPELPPVAAEESSESSFFVAPPRGTRAPAVLRTAHDHVATGNIEGAMRLLNEQVGIVNFEPYLETFLSMYSTARVMFAGLPQLPPLVTLLHRNWKEASGKDLLPVITAKLTDLVNQLQQCYQLTTSGKFPEALVRLERVVRLVPLLVVDTRQELVEAQQLMTISREYLLGLRMETARKAMPKNTLEEQIRTCEMAAYFTHCKLQPVHQILTLRTALNMFFKLKNYKTAASFARRLLELGPRPEVAQQARKILQACEKTPTDEHQLSYDEHNPFNICGISYKPIYRGKPEEKCSLCSASFMPEHKGKLCPVCGVAEIGKDVLGLRICAVQFQR-