Monarch geneset OGS2.0

DPOGS204779
Transcript: DPOGS204779-TA, 2700 bp
Protein: DPOGS204779-PA, 899 aa
Genomic position: DPSCF300231 + 579132-590477
RNAseq coverage: 1166x (Rank: top 11%)
Annotation
Heliconius        HMEL017691        0.0     78.34%
Bombyx            BGIBMGA013717-TA  6e-175  87.86%
Drosophila        betaCop-PA        0.0     60.30%
EBI UniRef50      UniRef50_B0W8M2   0.0     56.28%  Coatomer subunit beta n=12 Tax=Opisthokonta RepID=B0W8M2_CULQU
NCBI RefSeq       XP_001816488.1    0.0     58.46%  PREDICTED: similar to coatomer subunit beta [Tribolium castaneum]
NCBI nr blastp    gi|383856100      0.0     60.95%  PREDICTED: coatomer subunit beta-like isoform 1 [Megachile rotundata]
NCBI nr blastx    gi|350413321      0.0     60.57%  PREDICTED: coatomer subunit beta-like [Bombus impatiens]
Group
Gene Ontology     GO:0005737  0         cytoplasm
                  GO:0006886  5.6e-110  intracellular protein transport
                  GO:0030126  5.6e-110  COPI vesicle coat
                  GO:0005198  5.6e-110  structural molecule activity
                  GO:0016192  5.6e-110  vesicle-mediated transport
                  GO:0005488  4.5e-92   binding
                  GO:0030117  1.5e-87   membrane coat
KEGG pathway      vvi:100241708  1e-13
                  K12392 (AP1B1) maps-> Lysosome
InterPro domain   [1-891]    IPR016460  0         Coatomer, beta subunit
                  [624-883]  IPR011710  5.6e-110  Coatomer, beta subunit, C-terminal
                  [24-484]   IPR011989  4.5e-92   Armadillo-like helical
                  [20-473]   IPR002553  1.5e-87   Clathrin/coatomer adaptor, adaptin-like, N-terminal
                  [12-490]   IPR016024  1.9e-64   Armadillo-type fold
Orthology group   MCL14209  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204779-TA
ATGGGGGGTGTTGAGCAACCATGCTACACTCTCATTAACTTTCCCACTGATACCGAGCCTTATAGTGAACAGCAGCTGAAGACAGACCTTGAAAAGGGTGATATAAAGAAGAAAATTGAGGCACTTAAGAAAACCATTGGTATCATCCTGTCCGGAGAAAAGATTCCAGGGTTGCTGATGATAATCATCAGGTTTGTGCTGCCGTTACAGGATCACACCATCAAGAAGCTCCTTCTCATCTTCTGGGAAATTGTTCCCAAGACCACACCGGATGGGAAGTTGTTGCAGGAAATGATACTAGTCTGTGATGCATACAGAAAGGATCTCCAACACCCCAATGAGTTCCTCCGAGGTTCTACGTTGAGGTTCCTCTGCAAGTTGAAGGAGCCAGAGCTTCTAGAGCCCCTGATGCCGGCCATCCGAGCCTGCCTGGAACACAGACATTCATATGTTCGGAGAAATGCTGTGCTCGCAATCTTCACCATCTACCGAAACTTCGAGTTCCTGATCCCGGACGCGCCGGAGGTGATCGGCTCGTTCCTGGAGTCGGAGCAGGACATGTCGTGTAAGAGGAACGCCTTCCTCATGCTGCTGCACGCGGACCAGGAGACGGCGCTGGCCTACCTCTCGCAGCGCCTCGACCAGGTGCACGGCTTCGGAGACATCCTGCAGCTCGTCATAGTGGAACTCATATACAAAGTGTGTCACGCCAACCCGGCAGAGCGGTCTCGTTTCATCCGCACGGTGTACGGGCTACTGAACGCGCCGAGCGCCGCTGTGCGCTACGAGGCGGCCGGCACGCTGGTCACGCTGTCCACGGCACCCGCCGCCATCAAGGCGGCGGCGGCGTGCTACATAGACCTGATAGTGAAGGAGAGCGACAACAACGTCAAGCTGATAGTGGTGGAGAAGCTGTCCGCGCTCAGGGACGTGAGCTGCGACGCCACCTCGCGCGCCCTGCCCGAGCTGGCCATGGACGTGCTCAGGGTGCTGGCCTCCTCCGACCTGGACGTCCGCAGGCACACGCTGCATCTCGCTCTGGAGCTGGCCACTCCTCGCCACGCGGACGAGCTGGTGGGCGCTCTGAGGAAGGAGGCGTCGCGGGCTCAGCTCGCGGATCACGACGACGCGGCCAAGTACCGGCAGCTGCTGGTGAGGGCGATGCATCGAGCGGCCATCAAGTTCCCGGAGGTGGCGTGCTCGGTGGCGCCCGGATTGTTGGAGCTACTAGGGGACGGCGGGGAGGTGGCGGCGCAGGATGTGCTCATGTTCACCAGGCACGCGCTGCACGCCTTCCATGACCTGAAACCTGGCATATACGCGAAACTACTGGAGAGTCTGGGCAGCATCCGGGTGGGGAAGATCGCGAGGGCCGCCCTGTGGCTGGTGGCCGAGTTCGCCGACAACGAGGACAACGTGAAGGCGGCCATCGACGTCATCGCGGCTGCCATGCCGACACACAAAGACAATGAGGAGGACGGCGACAAGGACGGCGCTGAGGCTCCGCCTAAGGAAAAGGAGAAGGAGGCGCCCACTCGACAGCTGGTCACCAGTGATGGAGCGTACGTCACGCAGTCCGCCTTCAACCAGCCCAAGACTCCTGTGACTGACAGCGGTCCGACGGCGGACGACCTGGAGCACGGCGTGCGCTGCGTGCGCGCCGGCGCAGAGCGACCCGACGTGCTCAGCGAGGCGCTCACGGCCGGATCCCGGAAAGCGCTCGCCTCGCTACTGACGCTACCACATCGCTCGGCACCGACCCTGCTCCCTGAGGGCTCCCCCGAGCGTCCCGAGGCGTCTTCTCGTCCGACGTCTGTTCCCATCGAGCGCGGGATATCGTTCACCGCCCTGGCGCCGCTCGCCGCCGCCGGCAACAGAGACGTGTTCGAGCTGGCCTTGGATAGAGCTTTACAAGGTCGCACCAAGCCGGCCAGTGACGACGGCGGTCGTTTGTCCAAGGTGACCCAGCTGACGGGCTTCTCGGACCCGGTGTACGCGGA
GGCGGTCGTGTCCGTCAATCAGTACGACATAGTGCTGGACGTGTTCGTCGTCAACCAGACAGACGATACCCTCCAGAACTGTACGGTGGAGTTGGCGACGCTGGGCGAGCTGCGGCTGGTGGAGCGGCCGGCGGGCATCGTACTGGGGCCGCGGGATTACGCCTCCATCAGGGCGCACGTCAAGGTCGCCTCCACAGAGAACGGCATCATCTTCGGAAACATCGTGTACGAGGTGTCCGGCGCGTCCATGGACCGCGGCGTGGTCGTGCTCAACGACATCCACATAGACATCGTGGACTACATACAGCCCGCCGCCTGCAGCGACGCCGACTTCCGCACCATGTGGGCAGAGTTCGAGTGGGAGAACAAGGTGTCCGTGAACACGAACATCACGGAGCTGAACGAGTACCTGGAGCACCTGCTGGCGTCCACCAACATGAAGTGCCTCACTCCCGATAAGGCCCTGTCCGGTCAGTGCGGGTTCATGGCGGCCAACCTGTACGCGCGGTCCATATTCGGCGAGGACGCGCTCGCCAACCTCAGCATCGAGATCCCCATGAACAAACAGAACGCTCCCGTCGTGGGACACGTGCGGATACGGGCTAAGACACAGGGCATGGCTCTCAGTCTGGGCGACAAGATCAACATGATGCAGAAGACTCGCCCCAAGAAGCCCAAAGATCCCACGCCGGCTGCCTAA

Protein sequence:

>DPOGS204779-PA
MGGVEQPCYTLINFPTDTEPYSEQQLKTDLEKGDIKKKIEALKKTIGIILSGEKIPGLLMIIIRFVLPLQDHTIKKLLLIFWEIVPKTTPDGKLLQEMILVCDAYRKDLQHPNEFLRGSTLRFLCKLKEPELLEPLMPAIRACLEHRHSYVRRNAVLAIFTIYRNFEFLIPDAPEVIGSFLESEQDMSCKRNAFLMLLHADQETALAYLSQRLDQVHGFGDILQLVIVELIYKVCHANPAERSRFIRTVYGLLNAPSAAVRYEAAGTLVTLSTAPAAIKAAAACYIDLIVKESDNNVKLIVVEKLSALRDVSCDATSRALPELAMDVLRVLASSDLDVRRHTLHLALELATPRHADELVGALRKEASRAQLADHDDAAKYRQLLVRAMHRAAIKFPEVACSVAPGLLELLGDGGEVAAQDVLMFTRHALHAFHDLKPGIYAKLLESLGSIRVGKIARAALWLVAEFADNEDNVKAAIDVIAAAMPTHKDNEEDGDKDGAEAPPKEKEKEAPTRQLVTSDGAYVTQSAFNQPKTPVTDSGPTADDLEHGVRCVRAGAERPDVLSEALTAGSRKALASLLTLPHRSAPTLLPEGSPERPEASSRPTSVPIERGISFTALAPLAAAGNRDVFELALDRALQGRTKPASDDGGRLSKVTQLTGFSDPVYAEAVVSVNQYDIVLDVFVVNQTDDTLQNCTVELATLGELRLVERPAGIVLGPRDYASIRAHVKVASTENGIIFGNIVYEVSGASMDRGVVVLNDIHIDIVDYIQPAACSDADFRTMWAEFEWENKVSVNTNITELNEYLEHLLASTNMKCLTPDKALSGQCGFMAANLYARSIFGEDALANLSIEIPMNKQNAPVVGHVRIRAKTQGMALSLGDKINMMQKTRPKKPKDPTPAA-
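
The transcript and protein lengths in this record are consistent with each other: 2700 bp is 900 codons, which gives 899 amino acids plus one stop codon. The sketch below illustrates that check and translates the first and last few codons of DPOGS204779-TA. It assumes the standard genetic code (NCBI translation table 1) and follows this record's convention of writing the stop codon as "-". The names `CODON_TABLE` and `translate` are illustrative and are not part of any MonarchBase tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3)]
    return "".join(residues).replace("*", stop_symbol)


# Length check using the values stated in the record.
transcript_bp = 2700
protein_aa = 899
assert transcript_bp == 3 * (protein_aa + 1)  # 899 residues + 1 stop codon

# First ten and last three codons of DPOGS204779-TA, copied from the record.
print(translate("ATGGGGGGTGTTGAGCAACCATGCTACACT"))  # MGGVEQPCYT
print(translate("GCTGCCTAA"))                       # AA-
```

Passing the full nucleotide sequence (with the line break removed) to `translate` should reproduce the protein sequence listed above, assuming the record uses the standard code throughout.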