Monarch geneset OGS2.0

DPOGS213564
TranscriptDPOGS213564-TA1488 bp
ProteinDPOGS213564-PA495 aa
Genomic positionDPSCF300033 - 34186-40012
RNAseq coverage5080x (Rank: top 2%)
Annotation
HeliconiusHMEL0108640.082.83% 
BombyxBGIBMGA011844-TA0.078.18% 
DrosophilaPdi-PA2e-15555.84% 
EBI UniRef50UniRef50_P543994e-15355.84%Protein disulfide-isomerase n=65 Tax=Eukaryota RepID=PDI_DROME
NCBI RefSeqNP_001037171.10.078.18%protein disulfide isomerase [Bombyx mori]
NCBI nr blastpgi|3584431120.084.02%control protein HCTL033 [Heliconius erato]
NCBI nr blastxgi|3584431120.084.02%control protein HCTL033 [Heliconius erato]
Group
Gene OntologyGO:00057832.3e-150endoplasmic reticulum
GO:00168532.3e-150isomerase activity
GO:00454543.9e-32cell redox homeostasis
GO:00150354e-08protein disulfide oxidoreductase activity
GO:00090554e-08electron carrier activity
GO:00066624e-08glycerol ether metabolic process
KEGG pathwaycqu:CpipJ_CPIJ0052195e-162 
 K09580 (PDIA1, P4HB)maps-> Protein processing in endoplasmic reticulum
InterPro domain[26-495] IPR0057922.3e-150Protein disulphide isomerase
[24-134] IPR0123361.6e-42Thioredoxin-like fold
[34-130] IPR0057881.1e-37Disulphide isomerase
[27-129] IPR0137663.9e-32Thioredoxin domain
[45-53] IPR0057464e-08Thioredoxin
Orthology groupMCL12361 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213564-TA
ATGCGTGCCGTTTTATTAACAGTAGCGATAGCCCTTCTGGGTTCCGCTTATGGAGACGAAATACCCACTGAAGACAATGTACTTGTTCTAAGCAAACCTCTTTTTGATTCTGTTATTTCAAGCAACGACTACGTTTTAGTGGAATTCTATGCGCCATGGTGCGGCCACTGCAAGTCTCTCGCGCCGGAATACGCAAAAGCCGCCACAAAACTTGCCGAAGAAGATTCTCCTATAAAGTTGGCTAAAGTAGATGCTACTCAAGAACAAGATTTGGCTGAGTACTACAAAGTTAAGGGATACCCAACCCTTATTTTCTTCAAGAAAGGCAGCAGTATTGACTACACAGGCGGACGACAGGCTGATGACATCATTGCATGGCTAAAGAAGAAGACTGGTCCTCCGGCAGTCGAAGTTGCCTCGGCAGAACAAGCTAAAGAACTCACTGTTGCCAACCTCGTTGTTGTATTTGGTTTCTTCCCAGACCAATCATCTGAACGGGCATTAGCTTTCCTTAACACCGCTGGAGTTGTCGACGACCAGATCTTTGCTATTGTATCTGATGAGAAAGTTATCGAAGAGATGGAAGCTAAAGCTGGCGACATTGTTTTATACAAGAAATTCGAAGATCCCCAAGTCAAGTATGATGCTGAAGAGTTGAATGAAGACCTCCTCAAGAACTGGGTGTTCATGCAGAGCATGCCCACAATCGTCGAATTCTCTCATGAAACAGCGTCCAAGATCTTCGGTGGTCAGATCAAATACCACCTCCTTCTATTCCTGTCCAAGAAAGACGGTCACTTCGAGAAATACATCGATGAGTTGAAACCTGTTGCCAAGAACTACCGGGACAAGATCATGACCGTCTCCATCGACACAGACGAAGATGACCATCAGAGAATCCTGGAGTTCTTTGGTATGAAGAAGGATGAGGTCCCATCCGTACGTCTCATAGCCCTGGAACAAGACATGGCCAAGTACAAGCCAGCGGCCGATGAACTTAATGCCAACACTGTTGAGGAATTCGTTCAGTCTTTCTTCGCCGGCACTCTGAAGCAGCATTTGTTGAGCGAGTCTCTCCCCGCGGACTGGGCCGACAAACCCGTGAAAGTGCTAGTCGCTTCCAACTTCGATGAAGTCGTCTTTGATAATGAAAAGACTGTGCTCGTGGAGTTCTACGCGCCGTGGTGCGGCCACTGCAAGCAACTGGTGCCTATCTACGACAAACTCGGTGAGCACTTCGAGAAGGACAGCGACATCGTGATCGCCAAAATTGACGCCACCGCCAACGAGCTGGAACACACCAAGATCACCTCCTTCCCGACCATCAAGCTCTACACCAAGGACAATCAGGTTCGTGAGTACAACGGTGAGCGTACTCTGAGCGCGCTCACAAAGTTCGTGGAAACCGGCGGGGAGGGCGCCGAGCCCGTGCCGGTGGACGAGGAGTCCGACAGCGACGACCACGAACAACCCCGAGACGAGCTATAA

Protein sequence:

>DPOGS213564-PA
MRAVLLTVAIALLGSAYGDEIPTEDNVLVLSKPLFDSVISSNDYVLVEFYAPWCGHCKSLAPEYAKAATKLAEEDSPIKLAKVDATQEQDLAEYYKVKGYPTLIFFKKGSSIDYTGGRQADDIIAWLKKKTGPPAVEVASAEQAKELTVANLVVVFGFFPDQSSERALAFLNTAGVVDDQIFAIVSDEKVIEEMEAKAGDIVLYKKFEDPQVKYDAEELNEDLLKNWVFMQSMPTIVEFSHETASKIFGGQIKYHLLLFLSKKDGHFEKYIDELKPVAKNYRDKIMTVSIDTDEDDHQRILEFFGMKKDEVPSVRLIALEQDMAKYKPAADELNANTVEEFVQSFFAGTLKQHLLSESLPADWADKPVKVLVASNFDEVVFDNEKTVLVEFYAPWCGHCKQLVPIYDKLGEHFEKDSDIVIAKIDATANELEHTKITSFPTIKLYTKDNQVREYNGERTLSALTKFVETGGEGAEPVPVDEESDSDDHEQPRDEL-