Monarch geneset OGS2.0

DPOGS213021
TranscriptDPOGS213021-TA1176 bp
ProteinDPOGS213021-PA391 aa
Genomic positionDPSCF300024 + 267249-270149
RNAseq coverage520x (Rank: top 24%)
Annotation
HeliconiusHMEL0122930.077.83% 
BombyxBGIBMGA006941-TA1e-17786.09% 
DrosophilaCG9911-PA4e-15568.46% 
EBI UniRef50UniRef50_E2AFC41e-15366.41%Thioredoxin domain-containing protein 4 n=29 Tax=cellular organisms RepID=E2AFC4_CAMFO
NCBI RefSeqXP_971182.18e-16769.49%PREDICTED: similar to CG9911 CG9911-PA [Tribolium castaneum]
NCBI nr blastpgi|910925202e-16569.49%PREDICTED: similar to CG9911 CG9911-PA [Tribolium castaneum]
NCBI nr blastxgi|910925202e-16370.03%PREDICTED: similar to CG9911 CG9911-PA [Tribolium castaneum]
Group
Gene OntologyGO:00454541.3e-18cell redox homeostasis
KEGG pathwayuma:UM02443.11e-26 
 K09580 (PDIA1, P4HB)maps-> Protein processing in endoplasmic reticulum
InterPro domain[7-131] IPR0123361.6e-31Thioredoxin-like fold
[21-127] IPR0137661.3e-18Thioredoxin domain
Orthology groupMCL13269 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213021-TA
ATGACGATGTGTCTAGTTATCTGCCATAGTTTTTACAATCCTATCGATAGCGGCGCTGTACAAATTACACAGAGCAATTTAGATATGGTACTTGCTTCCAATGAGATTGTTTTTATAAATTTTTATGCTGAATGGTGTAAGTTCAGTAATATATTGATGCCCATCTTTGATGATGCGGCTGTTGAAGTAGCTAAAGCAGGATATGACCCTGGAAAGGTTGTTATGGGTAAAGTAGATTGTGACCAAGAAGGTGCTATTGCTACAAGGTTTCATATTACCAAATATCCTACTTTGAAACTATTCCGGAATGGGTTTCCAGCTAAGAAAGAGTATAGAGGTCAAAGATCAGTTGAGGCCTTTGCTGAGTTTATTAAGAAGCAGTTGACGGACCCAATTGTTCAATTTGGTTCCTTGAAAGAGTTGCATGATTTGAGTGAAGATAAAAGACATATAATTGGATATATGGACAGACGGGATCAGCCGGAGTATGAAGTATTAAGAAAAGTAGCAGCCAGTTTGAAGGATGAGTGTTTATTCCATGCAGGATTTGGAGATGCTTCCCAGCAGATGCATCCTCCGGGTCAGCCGATCATTGTATTCAGGACTGATAAGCGAACCTCCATTGAACCCGATGAAACCTATCACGGCTCTATGCTTAACTTTGATGAGTTATACACTTGGGTACAACAGAAGTGTATACCAATTGTACGTGAAATAACATTTGAAAATGCTGAAGAACTAACAGAAGAAGGCCTGCCGTTCCTCATCTTGTTCCATCACCCCTCTGACACTGAGAGTGTTAAGAAATACAAAGAAATTATTATGAACGAATTGGAATCCGAAAAACAAAATATTAACTTCTTGACTGCTGATGGCGTACGTTTTGAACATCCTCTTCATCACCTCGGGAAGTCTGTGAGCGACCTGCCTTTGATTGCTATCGACTCATTCAGGCACATGTACCTATTTCCCAAATACAGTGATATGGAAATACCCGGAAAACTCAAACAGTTCTTACAAGATTTGTATTCAGGAAAATTACACAGGGAATTCCACTATGGTACCGAAGCTCCTGCTAGTGACAATGATATTAAAGTAACCACACCTCCCGAGTCCACATTTAAAAAACTAGCGCCATCTAAGAACAGATACACTCTACTAAGAGATGAGTTATAA

Protein sequence:

>DPOGS213021-PA
MTMCLVICHSFYNPIDSGAVQITQSNLDMVLASNEIVFINFYAEWCKFSNILMPIFDDAAVEVAKAGYDPGKVVMGKVDCDQEGAIATRFHITKYPTLKLFRNGFPAKKEYRGQRSVEAFAEFIKKQLTDPIVQFGSLKELHDLSEDKRHIIGYMDRRDQPEYEVLRKVAASLKDECLFHAGFGDASQQMHPPGQPIIVFRTDKRTSIEPDETYHGSMLNFDELYTWVQQKCIPIVREITFENAEELTEEGLPFLILFHHPSDTESVKKYKEIIMNELESEKQNINFLTADGVRFEHPLHHLGKSVSDLPLIAIDSFRHMYLFPKYSDMEIPGKLKQFLQDLYSGKLHREFHYGTEAPASDNDIKVTTPPESTFKKLAPSKNRYTLLRDEL-