Monarch geneset OGS2.0

DPOGS215391
TranscriptDPOGS215391-TA909 bp
ProteinDPOGS215391-PA302 aa
Genomic positionDPSCF300088 - 301801-304458
RNAseq coverage1073x (Rank: top 12%)
Annotation
HeliconiusHMEL0036721e-14785.43% 
BombyxBGIBMGA012437-TA2e-13281.46% 
DrosophilaepsilonCOP-PA1e-3934.65% 
EBI UniRef50UniRef50_D2Y4R64e-12880.13%Coatomer protein complex subunit epsilon n=2 Tax=Obtectomera RepID=D2Y4R6_BOMMO
NCBI RefSeqNP_001166195.18e-12980.13%coatomer protein complex subunit epsilon [Bombyx mori]
NCBI nr blastpgi|2896292221e-12780.13%coatomer protein complex subunit epsilon [Bombyx mori]
NCBI nr blastxgi|2896292225e-13680.13%coatomer protein complex subunit epsilon [Bombyx mori]
Group
Gene OntologyGO:00301264.5e-103COPI vesicle coat
GO:00068904.5e-103retrograde vesicle-mediated transport, Golgi to ER
GO:00051984.5e-103structural molecule activity
GO:00054883.7e-11binding
KEGG pathway 
InterPro domain[1-300] IPR0068224.5e-103Coatomer, epsilon subunit
[214-285] IPR0119903.7e-11Tetratricopeptide-like helical
Orthology groupMCL15208 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215391-TA
ATGGCACGTCAGCAACAAGATGTAGATGAATTGTTCGATGTTAAAAATGCATTCTACGTTGGGAATTATCAACAAGCAATAAACGAGGCCCAAAATGTTAAGCCCTCATCTCCACAAGTGGCCTTCCAGAGAGACACATTCCTCTACAGATCGTACATAGCTCAGAACAATTTCCGTATTGTTCTACAAGAATTGAAAAATGCTGATCCAATGCTTCAGCCGCTGCAAACGTTGGTTGAATATTTGTCCCCTGATTCCAATAAGCCTAATATTGTAGCTGACATTGATGCAAGGGTGCAAAAAGGAGTGGAACTAACAAATGAAGTGTTTCTGATAGTCGCCGCAACAATTTACTATCATGAGGATAATTATGAAGCTGCCCTCAAAATATTGCACGAGGCGGAGTCGTTAGAGCTACGTGCATTCAGTCTCCAGTGCCTATTGGCTATGAACCGACCCGACCTGGCCAGGAAGCAACTCAAGAAGTTACAGGATATAGAAGACGACAGTACACTGACACAGCTCGCTCAGGCCTGGTTGAATTTGTCAGAGGGTGGTCCAGGTGTACAGGATGCTCATTTCAGTATAATGGAACTGTCGGAGCGTCTCGGCGCGCTGGGGGCCGGGCCGGCAGCCGCAGGGGCCGCCGCGGCCGCCTCGAGAGGTATGTGGGATGAGGCGGAGCAGATGTTGACGGAAGCTCAGACCCGCCTGCCCCAGCAGCCCGAGTTGTTGTTGGGCCTGGGGGTCGCGGCCGCGCACGTCGGCAAACCTCCCGAGGTATCGGCTCGTTACTTCGCCCAGCTCTTGGACTCGCATCCTGACCATCCTTTCTCTAAGGAGTACAACGCTAAGACCAACGAGTTCAAACGACTGGCGGCGCAGTACCAACCCTCTGTTGCAAGCTAA

Protein sequence:

>DPOGS215391-PA
MARQQQDVDELFDVKNAFYVGNYQQAINEAQNVKPSSPQVAFQRDTFLYRSYIAQNNFRIVLQELKNADPMLQPLQTLVEYLSPDSNKPNIVADIDARVQKGVELTNEVFLIVAATIYYHEDNYEAALKILHEAESLELRAFSLQCLLAMNRPDLARKQLKKLQDIEDDSTLTQLAQAWLNLSEGGPGVQDAHFSIMELSERLGALGAGPAAAGAAAAASRGMWDEAEQMLTEAQTRLPQQPELLLGLGVAAAHVGKPPEVSARYFAQLLDSHPDHPFSKEYNAKTNEFKRLAAQYQPSVAS-