Monarch geneset OGS2.0

DPOGS215686
TranscriptDPOGS215686-TA1464 bp
ProteinDPOGS215686-PA487 aa
Genomic positionDPSCF300041 - 648226-652384
RNAseq coverage2262x (Rank: top 5%)
Annotation
HeliconiusHMEL0040700.081.39% 
BombyxBGIBMGA003587-TA0.079.78% 
DrosophilaERp60-PA5e-17560.08% 
EBI UniRef50UniRef50_Q9TWZ13e-17259.88%Protein disulphide isomerase isoform/multifunctional endoplasmic reticulum luminal polypeptide (D-ERp60) n=49 Tax=Bilateria RepID=Q9TWZ1_DROME
NCBI RefSeqNP_001036997.10.084.19%protein disulfide-isomerase like protein ERp57 [Bombyx mori]
NCBI nr blastpgi|622412900.084.39%protein disulfide-isomerase like protein ERp57 [Bombyx mori]
NCBI nr blastxgi|622412900.084.15%protein disulfide-isomerase like protein ERp57 [Bombyx mori]
Group
Gene OntologyGO:00057832.4e-158endoplasmic reticulum
GO:00168532.4e-158isomerase activity
GO:00454545.2e-29cell redox homeostasis
GO:00150352.2e-06protein disulfide oxidoreductase activity
GO:00090552.2e-06electron carrier activity
GO:00066622.2e-06glycerol ether metabolic process
KEGG pathwayphu:Phum_PHUM3804800.0 
 K08056 (PDIA3, GRP58)maps-> Antigen processing and presentation
    Protein processing in endoplasmic reticulum
InterPro domain[22-486] IPR0057922.4e-158Protein disulphide isomerase
[20-133] IPR0123361.9e-36Thioredoxin-like fold
[368-468] IPR0057881.5e-34Disulphide isomerase
[24-128] IPR0137665.2e-29Thioredoxin domain
[42-50] IPR0057462.2e-06Thioredoxin
Orthology groupMCL13282 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215686-TA
ATGTTGGGATCAATAAAAATCTTTTTATTATTAGGAGTTATTTATTTATGTAAAGCAAGCGAAGAAGATGTTTTAGAGCTTACAGATTCCGATTTTGAAAGTGCGATCGGTCAACATGAAACCGCTTTGGTCATGTTTTACGCACCTTGGTGTGGACATTGTAAAAGATTGAAACCGGAATATGCGGTTGCGGCCGGCATTCTCAAGGATGATGATCCGCCGGTAGCGTTGGCCAAAGTAGACTGCACGGAAGCTGGAAAGAGCACTTGTGAAAAGTTCTCCGTCTCAGGATATCCAACCCTTAAAATCTTCAGGAAAGGGGAGCTGTCACAGGAGTACAATGGTCCCAGGGAATCTAACGGGATTGTGAAGTACATGAGAGCTCAAGTCGGACCAAGTTCCAAGGAATTACTCAATGTTGAGAGCTTCGAGAACATGATATCCAAGGATGAGGTGGTTGTTATTGGTTTCTTTGAAAAGGAGGATGACCTCAAGGGAGAATTCTTGAAAACCGCTGACAAAATGAGAGAAGAAGTTTCCTTTGCTCACTCGTCTGCTAAGGATGTCCTCGAAAAATCTGGATACAAGAACAATGTAGTCCTGTACCGTCCCAAGCGTTTACAAAACAAGTTTGAAGACTCATTTGTTGTGTACAAGAGTGGCGTCTCGCTTAAAGGCTTCATCAAAGAAAACTACCACGGTCTGGTTGGTATCCGTCAGAAGGACAATATGAATGACTTCAGCAATCCCCTTGTGGTTGCCTACTATGATGTGGACTATGTCAAGAACCCCAAGGGTACCAACTACTGGAGGAACCGCGTGCTCAAAGTGGCCAAAGAGATGAAGGACGTTAACTTCGCGGTCAGTGACAAGGATGACTTCACCCACGAGCTGAACGACTTCGGCATCGACTTCGCCAAAGGCGACAAACCCGTGGTAGGAGGCAAGGATGCTGATGGGAACAAGTTCGTTATGTCTTCCGAATTCAGCATTGAAAACCTTTTGGCCTTCGCCAAGGATCTCTTGGATGGTAAATTGGAGCCGTTCATCAAGTCCGAGCCAGTCCCTGAGAATAACGATGGACCGGTCAAGGTAGCTGTTGGCAAGAACTTCAAGGAGTTAGTCACAGACAGCGGAAGGGACGCTCTAGTCGAATTCTACGCCCCCTGGTGCGGCCATTGCCAAAAACTGGCACCTGTTTGGGAAGAACTTGGTGAAAAGCTCAAAGACGAAGATGTTGATATAGTGAAGATCGACGCTACAGCCAACGACTGGCCCAAGTCGCTGTACGACGTCTCCGGTTTCCCGACCATCTTCTGGAAACCGAAGGATAACAGCAAGAAGCCTGTCAGATATAACGGTGGGCGAGCCCTCGAGGACTTCGTGAAGTACGTGTCCGACAATGCTTCCAACGAGCTGAAAGGCTTCGACAGGAAAGGGAACGCCAAGAAAGACGAGTTGTAG

Protein sequence:

>DPOGS215686-PA
MLGSIKIFLLLGVIYLCKASEEDVLELTDSDFESAIGQHETALVMFYAPWCGHCKRLKPEYAVAAGILKDDDPPVALAKVDCTEAGKSTCEKFSVSGYPTLKIFRKGELSQEYNGPRESNGIVKYMRAQVGPSSKELLNVESFENMISKDEVVVIGFFEKEDDLKGEFLKTADKMREEVSFAHSSAKDVLEKSGYKNNVVLYRPKRLQNKFEDSFVVYKSGVSLKGFIKENYHGLVGIRQKDNMNDFSNPLVVAYYDVDYVKNPKGTNYWRNRVLKVAKEMKDVNFAVSDKDDFTHELNDFGIDFAKGDKPVVGGKDADGNKFVMSSEFSIENLLAFAKDLLDGKLEPFIKSEPVPENNDGPVKVAVGKNFKELVTDSGRDALVEFYAPWCGHCQKLAPVWEELGEKLKDEDVDIVKIDATANDWPKSLYDVSGFPTIFWKPKDNSKKPVRYNGGRALEDFVKYVSDNASNELKGFDRKGNAKKDEL-