Monarch geneset OGS2.0

DPOGS215687
TranscriptDPOGS215687-TA663 bp
ProteinDPOGS215687-PA220 aa
Genomic positionDPSCF300041 - 639651-641718
RNAseq coverage0x (Rank: top 99%)
Annotation
HeliconiusHMEL0040704e-10482.41% 
BombyxBGIBMGA003587-TA3e-7474.59% 
DrosophilaERp60-PA4e-7360.19% 
EBI UniRef50UniRef50_Q9TWZ11e-7059.72%Protein disulphide isomerase isoform/multifunctional endoplasmic reticulum luminal polypeptide (D-ERp60) n=49 Tax=Bilateria RepID=Q9TWZ1_DROME
NCBI RefSeqNP_001036997.18e-10485.38%protein disulfide-isomerase like protein ERp57 [Bombyx mori]
NCBI nr blastpgi|622412901e-10285.38%protein disulfide-isomerase like protein ERp57 [Bombyx mori]
NCBI nr blastxgi|622412902e-10484.79%protein disulfide-isomerase like protein ERp57 [Bombyx mori]
Group
Gene OntologyGO:00168533.2e-35isomerase activity
GO:00454546.6e-28cell redox homeostasis
GO:00150354.9e-05protein disulfide oxidoreductase activity
GO:00090554.9e-05electron carrier activity
GO:00066624.9e-05glycerol ether metabolic process
KEGG pathwayaag:AaeL_AAEL0014324e-75 
 K08056 (PDIA3, GRP58)maps-> Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[101-201] IPR0057883.2e-35Disulphide isomerase
[63-204] IPR0123363.9e-34Thioredoxin-like fold
[96-200] IPR0137666.6e-28Thioredoxin domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215687-TA
ATGGAGATAGACTTCCGGGAGTCCAAAGTGGCCAAAGAGATGAAGGACGTTAACTTCGCGGTCAGTGACAAGGATGACTTCACCCACGAGCTGAACGACTTCGGCATCGACTTCGCCAAAGGCGACAAACCCGTGGTAGGAGGCAAGGATGCTGATGGGAACAAGTTCGTTATGTCTTCCGAATTCAGCATTGAAAACCTTTTGGCCTTCGCCAAGGATCTCTTGGATGGTAAATTGGAGCCGTTCATCAAGTCCGAGCCAGTCCCTGAGAATAACGATGGACCGGTCAAGGTAGCTGTTGGCAAGAACTTCAAGGAGTTAGTCACAGACAGCGGAAGGGACGCTCTAGTCGAATTCTACGCCCCCTGGTGCGGCCATTGCCAAAAACTGGCACCTGTTTGGGAAGAACTTGGTGAAAAGCTCAAAGACGAAGAGGTTGATATAGTGAAGATCGACGCTACAGCCAACGACTGGCCCAAGTCGCTGTACGACGTCTCCGGTTTCCCGACCATCTTCTGGAAACCGAAGGATAACAGCAAGAAGCCTGTCAGATATAACGGTGGGCGAGCCCTCGAGGACTTCGTGAAGTACGTGTCCGACAATGCTTCCAACGAGCTGAAAGGCTTCGACAGGAAAGGGAACGCCAAGAAAGACGAGTTGTAG

Protein sequence:

>DPOGS215687-PA
MEIDFRESKVAKEMKDVNFAVSDKDDFTHELNDFGIDFAKGDKPVVGGKDADGNKFVMSSEFSIENLLAFAKDLLDGKLEPFIKSEPVPENNDGPVKVAVGKNFKELVTDSGRDALVEFYAPWCGHCQKLAPVWEELGEKLKDEEVDIVKIDATANDWPKSLYDVSGFPTIFWKPKDNSKKPVRYNGGRALEDFVKYVSDNASNELKGFDRKGNAKKDEL-