Monarch geneset OGS2.0

DPOGS201415
Transcript        DPOGS201415-TA   1263 bp
Protein           DPOGS201415-PA   420 aa
Genomic position  DPSCF300006 - 1692796-1698047
RNAseq coverage   1252x (Rank: top 10%)
Annotation
Heliconius        HMEL009060         2e-83   79.70%
Bombyx            BGIBMGA002569-TA   0.0     79.47%
Drosophila        CaBP1-PA           7e-147  61.43%
EBI UniRef50      UniRef50_Q15084    2e-145  58.37%  Protein disulfide-isomerase A6 n=154 Tax=Eukaryota RepID=PDIA6_HUMAN
NCBI RefSeq       XP_972053.2        2e-165  66.59%  PREDICTED: similar to protein disulfide-isomerase A6 [Tribolium castaneum]
NCBI nr blastp    gi|328670881       0.0     73.65%  protein disulfide isomerase [Helicoverpa armigera]
NCBI nr blastx    gi|328670881       0.0     73.65%  protein disulfide isomerase [Helicoverpa armigera]
Group
Gene Ontology     GO:0016853  1.4e-36  isomerase activity
                  GO:0045454  1.3e-32  cell redox homeostasis
                  GO:0015035  3.9e-06  protein disulfide oxidoreductase activity
                  GO:0009055  3.9e-06  electron carrier activity
                  GO:0006662  3.9e-06  glycerol ether metabolic process
KEGG pathway      tca:660754  5e-165
                  K09584 (PDIA6, TXNDC7) maps-> Protein processing in endoplasmic reticulum
InterPro domain   [140-254]  IPR012336  6.6e-39  Thioredoxin-like fold
                  [151-252]  IPR005788  1.4e-36  Disulphide isomerase
                  [148-249]  IPR013766  1.3e-32  Thioredoxin domain
                  [45-53]    IPR005746  3.9e-06  Thioredoxin
Orthology group   MCL14360  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201415-TA
ATGTTGCATTTACGCTTATTAGGTGTTATTTTATTTTTTACGGGAACATATGCATTATATGACGCACATTCAGATGTCGTGGAATTAACTCCCAATAATTTTGAAAGACTTGTTACAAAATCTGATGAAGTGTGGATCGTGGAGTTCTTTGCCCCTTGGTGCGGTCACTGTAAGAACCTAGTTCCCGAATATTCCAAAGCTGCACGTGCTTTAAAGGGTATAGTGAAAGTGGGAGCTTTAGATGCAGATAGCTATAAAGAGTTTGCTCAGAAATATGGTGTCACTGGTTTTCCTACTATCAAAGTGTTCACCGGCTCTAAGCACACTCCATATCAGGGTCAGAGAACTGCTGAAGCCTTTGTTGATGCCGCTCTGAAGGCGGCCAAGGACAAGGCATATGACAGCCTCGGCAAGAAAGCGAAGAGTTCTGATAAGTCTGACGTCATAACCTTGACGGATGAAAACTTCAATAAGTTGGTTTTGGAGAGCGATGACATGTGGCTCGTTGAGTTCTTCGCCCCTTGGTGTGGACACTGCAAGAATCTCGAACCTCACTGGGCTAAGGCTGCCACTGAACTAAAGGGCAAGATCAAACTGGGTGCAGTGGACGCGACAGTACACCAGGTGTTGGCCTCCCGGTACCAAGTACAAGGATACCCTACAATCAAATACTTCCCGTCCGGCAAGAAGGATAATGCTGAAGAATACAATGGAGGCAGAACTTCCAGTGACATTGTGTCATGGGCGCTTGAGAAACTCGCTGAGAATATTGCACCACCAGAAGTTGTACAGGTCATTGACCCAGCCACCATGAGTGAATGTAGCGAGAAGCCTCTATGTGTGGTGTCAGTACTACCTCATATATTGGACTGTGACGCCGCCTGCAGGAACTCATACATCGATATCCTCAGAAGACTCGGAGAGAAATACAAGAACAAAATGTGGGGATGGGTGTGGACTGAAGCGGGAGCTCAGTCGTCACTGGAAGATGCTCTTGAAATAGGAGGCTTTGGCTACCCCGCCATGGCTGTAGTTAACGCTAAGAAACTGAAGTTCAGTACACTGCGAGGGTCCTTCTCTGAGACAGGAATCAACGAGTTCCTTAGGGACCTATCGTTTGGCCGAGGTCAAACCGCTCCAGTCCGAGGTGCAGAAATGCCAAAAATTGTTACACAGGATCCTTGGGATGGTAAAGACGGTGAATTACCCCCGGAAGAGGACATCGACCTCTCAGATATAGATCTCGAGAAAGATGAATTGTAA

Protein sequence:

>DPOGS201415-PA
MLHLRLLGVILFFTGTYALYDAHSDVVELTPNNFERLVTKSDEVWIVEFFAPWCGHCKNLVPEYSKAARALKGIVKVGALDADSYKEFAQKYGVTGFPTIKVFTGSKHTPYQGQRTAEAFVDAALKAAKDKAYDSLGKKAKSSDKSDVITLTDENFNKLVLESDDMWLVEFFAPWCGHCKNLEPHWAKAATELKGKIKLGAVDATVHQVLASRYQVQGYPTIKYFPSGKKDNAEEYNGGRTSSDIVSWALEKLAENIAPPEVVQVIDPATMSECSEKPLCVVSVLPHILDCDAACRNSYIDILRRLGEKYKNKMWGWVWTEAGAQSSLEDALEIGGFGYPAMAVVNAKKLKFSTLRGSFSETGINEFLRDLSFGRGQTAPVRGAEMPKIVTQDPWDGKDGELPPEEDIDLSDIDLEKDEL-