Monarch geneset OGS2.0

DPOGS204328
Transcript: DPOGS204328-TA (1653 bp)
Protein: DPOGS204328-PA (550 aa)
Genomic position: DPSCF300142 - 108497-111537
RNAseq coverage: 144x (Rank: top 54%)
Annotation
Heliconius: HMEL002319  0.0  76.99%
Bombyx: BGIBMGA007226-TA  8e-122  43.96%
Drosophila: CG13671-PA  2e-10  24.07%
EBI UniRef50: UniRef50_E1ZW11  7e-76  33.57%  Thioredoxin domain-containing protein 11 n=6 Tax=Formicidae RepID=E1ZW11_CAMFO
NCBI RefSeq: XP_001602801.1  7e-70  31.59%  PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|307190688  2e-75  33.57%  Thioredoxin domain-containing protein 11 [Camponotus floridanus]
NCBI nr blastx: gi|307190688  1e-76  33.09%  Thioredoxin domain-containing protein 11 [Camponotus floridanus]
Group
Gene Ontology: GO:0045454  6.7e-05  cell redox homeostasis
KEGG pathway: phu:Phum_PHUM380480  8e-11
    K08056 (PDIA3, GRP58) maps-> Antigen processing and presentation
        Protein processing in endoplasmic reticulum
InterPro domain: [36-147]  IPR012336  1.5e-12  Thioredoxin-like fold
Orthology group: MCL15849  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS204328-TA
ATGTTGGTTAAAGAGGCGGTATTTTGTATTGCATTAGCACTTACTACATATGGAGCCTTACACAATACGCCTTCTAAAACGTCCAAAGTTCCTCAGGCAGTAAGATTTTTTAATCACGACTCTATAGTATCTGATTGGTACAGGGGTCAGTTAAGCAATGCCCTATCTCTTATAAATTCTGAAGATATTTCATTTGTTATGTACTATGCACCCTGGGATGCTGAGTCACAATATGTGAGAGGAGAGTTTGAAAAGGCTGCTAATATTTTAAGTGATAGAGTTCATTTTTCTGCTATTAACTGTTGGAATCCAGGCAGTGAATGCAGACTGCAACATAATAAAATACCATCATGGCCAATATTGATGGCTTACACTGTCACCTCTAGAGGTGTTTTATATAAAGGACCTAGGAATGCAGAGAGTATGGTAAATTTTTTGGAATTAATCATGAGACCTTTGCAAAGAGTATCAAACACTGAAGATTTGGTTAATTTATTGTCTAAATGTGATGCTGTTGCTGTAGGGTTTACGCCACTGACTGAGACTTCTAGGTATTACAATGTGTGGTACAGTGTGGCATTAAAGTCGCGAGAATTTGACACTATTGGTGAAATATGTTTTGCGACAGTTACATCCAACGAATTAGCAATGGACCTCGGAGTTGAAAGTGTACCTAATGCCAGACTCATGTTATGGAATGATACTAAGGAGTATAGACCAGAAGATGGCAATCAGTCATGGAATGAGACGTATTTGATGCACTGGGTTCTAGAGAATTTTTCTCAACCTGTTGCCAGAATCATTCCATTGTGGAAGAAGTCTTTCAATTTTGAAAGATATGCTGATGGTAACCCAATGTTAATATTATTCACACCGTTAAATCCATTGTATGAACAGCTGCCATTGTATTCATTACTTCGCGAAGTTGCTATGGAGTATAACAATTGTAAAGACAAAGAAAGTCATCAATGGACATCGGAACTAATAAAGTTGCAACAGGTACAAAGACTGTACCAGTTATATCAGCAGAAGAACTTTTTTTTCTGTAGGGAATACAAATTTAAAAAACCTGTCAAAAAAATGTCTCCGATACACAAAAAGGAGGTAATATCACAAAATAACAAATACCCTTGGAGCAATGTGACCCAAAAGAATCAAAAGAATGGAATATTTAATTATCTTCTCAAGCAAGGACTGGCTTTATCCAAATTAATTGAGACATCTAATGAAAATTCAGCATTGTGGTCAACATTAGGTTTTCTGGAGCAGTGTGAATCAAGCGCTATCAAATCATTGCCAGCTGAGAAAAGCTTTTATGAATACTTTGAAAAATGTCAAACTCTCGAGGAGCAGTTAAGCGAAATTGAGACTGACAATCAAGAAACAGAAACAACAATGCAACCGTCAGAAGATGATCCCTACTCATCAGAAAATCTTGTACAGGACAATATGAAGCATTTCTGTAATATATTGCAGTTTGCTAATGATGTTAGTCCACTAATAATGCCAAGTAAATCCAACGGTAAAATAACACATTTGCATGGACTGGGATGTAAAACTAACTTCACTATGTACATGATAGCCGTTGACAGCATTCGGAATTATCACTTTGCTGAGGCTCTTGGTATTGATATTAAAATAAAAAGGACATGA

Protein sequence:

>DPOGS204328-PA
MLVKEAVFCIALALTTYGALHNTPSKTSKVPQAVRFFNHDSIVSDWYRGQLSNALSLINSEDISFVMYYAPWDAESQYVRGEFEKAANILSDRVHFSAINCWNPGSECRLQHNKIPSWPILMAYTVTSRGVLYKGPRNAESMVNFLELIMRPLQRVSNTEDLVNLLSKCDAVAVGFTPLTETSRYYNVWYSVALKSREFDTIGEICFATVTSNELAMDLGVESVPNARLMLWNDTKEYRPEDGNQSWNETYLMHWVLENFSQPVARIIPLWKKSFNFERYADGNPMLILFTPLNPLYEQLPLYSLLREVAMEYNNCKDKESHQWTSELIKLQQVQRLYQLYQQKNFFFCREYKFKKPVKKMSPIHKKEVISQNNKYPWSNVTQKNQKNGIFNYLLKQGLALSKLIETSNENSALWSTLGFLEQCESSAIKSLPAEKSFYEYFEKCQTLEEQLSEIETDNQETETTMQPSEDDPYSSENLVQDNMKHFCNILQFANDVSPLIMPSKSNGKITHLHGLGCKTNFTMYMIAVDSIRNYHFAEALGIDIKIKRT-