Monarch geneset OGS2.0

DPOGS210734
Transcript: DPOGS210734-TA  1374 bp
Protein: DPOGS210734-PA  457 aa
Genomic position: DPSCF300013 + 195810-197793
RNAseq coverage: 186x (Rank: top 49%)
Annotation
Heliconius: HMEL007079  3e-158  65.39%
Bombyx: BGIBMGA006265-TA  4e-132  56.07%
Drosophila: prtp-PB  6e-60  34.03%
EBI UniRef50: UniRef50_B7PHA0  6e-65  32.91%  Protein disulfide isomerase, putative n=1 Tax=Ixodes scapularis RepID=B7PHA0_IXOSC
NCBI RefSeq: XP_971669.2  3e-71  36.29%  PREDICTED: similar to protein disulfide isomerase [Tribolium castaneum]
NCBI nr blastp: gi|270002648  1e-70  36.22%  hypothetical protein TcasGA2_TC004980 [Tribolium castaneum]
NCBI nr blastx: gi|270002648  3e-70  36.14%  hypothetical protein TcasGA2_TC004980 [Tribolium castaneum]
Group
Gene Ontology: GO:0045454  8.4e-20  cell redox homeostasis
KEGG pathway: tca:660338  8e-71
    K13984 (TXNDC5, ERP46)  maps-> Protein processing in endoplasmic reticulum
InterPro domain: [22-136]  IPR012336  1e-26  Thioredoxin-like fold
    [275-376]  IPR013766  8.4e-20  Thioredoxin domain
Orthology group: MCL12752  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210734-TA
ATGAATATGTTCAAGTTTATGTTTACTTTTTGTCTGTTATTCCCTCAATATGTGACTCCTGATGATAAAAATGTATTTGAATACGATCCAGATGAATTTTATAAACAAATCGTAAACAAGGATGGAAATTTTATAATGTTCTATACGCCATGGTGTCGGCATTGCAAAGAATTTCATCCAATTTGGTCAGAATTAGGGAATCTTATAAATTCCAGAAAGTATGACATAGCAATTGCTCAAGTAGATTGCATGAAACATTCAAAACTATGCAAGGAAAATGATATTACAGGATATCCTACATTATTGTATTACCACAAGAATTCATTTACACCCGTAGAATACAAAAGCACAAGAGACCTACCTTCTCTCACTCTTTTCGTAAGTGCAGTGTACACTAAAAATAAAAAACCAAAGCCAAAAGAGAGGCCATTGCCTAATGTGGAGATATATAGCGGTATGGCTTCTTTGGATGATTACAATATAGAGAAATTTGTGTCAAAGGGACAGCACTTTGTATTGTTCTATGTTCCGTGGTGTACAGCATCCCAGAAATTGGCAATCGTCTGGGCGGAGATGGCAAAGATATATGAAAATAATGAATACCTACAGATTGGAAGGATTAATTGTTACCACAATGAAATAACCTGTCAGAACTTTGACATAAAACAGTATCCGCTTCTACTGTGGATTGTTAACGGAAGGATTATGGGACAAACTGATATGAAAACATTACCTCAACTCCAAGAGTATGTAAAAAAGATGTTGCTGACTGAAAATCATGATCCAGAGAAATTTGTCAAGAAAAAGAAAGCTTTACCAGTGGCCAGGATATCGGAGGAAACATTTGAAACCTTTTTGGAAAACGAATTGGTTTATGTTAACTATTTTGCTCCATGGTGTGCCCATTGCATGCAATTAAGCCCCCTGTGGCTGAAGCTGGGTGAACGGTTTCAAAATGAAAGCAGAGTTATCATAGCTGATATAGACTGTGCTCAGTCCAAAACAATCTGTGAAGTTGAAAAGATAAATGGCTTGCCAACACTGATTCTATACAAGAACAAAAACATAGTAAATGTAGAACATGGCAGCAAACCTCTGGAAAGTCTGATAACTCTGGTCAATGAACACCTGCATGATAACAAAACGTTAGAGGAAAATGAAAATAAGGAAAATGAAAATAAGGGAGATGAAAATAAGGGAAATGAAAATAAGGAAAATGAAAATAAGGGAGATGAAAATAAGGGAAATGAAAATAAGGAAACTGAAAATAAGGGAAATGAAAATAAGGAAACTGAAAATAAGGAAAATGAAAATTTGGACAATGAAAATAATGAGTCAACCGAAAAACCTTTGCCAAATAAAGATGAGCTATAA

Protein sequence:

>DPOGS210734-PA
MNMFKFMFTFCLLFPQYVTPDDKNVFEYDPDEFYKQIVNKDGNFIMFYTPWCRHCKEFHPIWSELGNLINSRKYDIAIAQVDCMKHSKLCKENDITGYPTLLYYHKNSFTPVEYKSTRDLPSLTLFVSAVYTKNKKPKPKERPLPNVEIYSGMASLDDYNIEKFVSKGQHFVLFYVPWCTASQKLAIVWAEMAKIYENNEYLQIGRINCYHNEITCQNFDIKQYPLLLWIVNGRIMGQTDMKTLPQLQEYVKKMLLTENHDPEKFVKKKKALPVARISEETFETFLENELVYVNYFAPWCAHCMQLSPLWLKLGERFQNESRVIIADIDCAQSKTICEVEKINGLPTLILYKNKNIVNVEHGSKPLESLITLVNEHLHDNKTLEENENKENENKGDENKGNENKENENKGDENKGNENKETENKGNENKETENKENENLDNENNESTEKPLPNKDEL-
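
Consistency check (sketch):

The transcript and protein entries above can be cross-checked by translating the coding sequence. The Python sketch below is only an illustration and is not part of the database record. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the parameter `stop_symbol` are hypothetical names chosen for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to match the
    trailing "-" used in the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First seven and last five codons of DPOGS210734-TA, copied from the record.
first_codons = translate("ATGAATATGTTCAAGTTTATG")
last_codons = translate("AAAGATGAGCTATAA")
print(first_codons)
print(last_codons)

# Length check: 457 residues plus one stop codon should account for 1374 bp.
transcript_bp, protein_aa = 1374, 457
print(transcript_bp == 3 * (protein_aa + 1))
```

Passing the full DPOGS210734-TA sequence to `translate` and comparing the result with DPOGS210734-PA would extend the same check to the whole record.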