Monarch geneset OGS2.0

DPOGS213520
TranscriptDPOGS213520-TA1278 bp
ProteinDPOGS213520-PA425 aa
Genomic positionDPSCF300033 - 833283-835328
RNAseq coverage231x (Rank: top 44%)
Annotation
HeliconiusHMEL0136840.080.00% 
BombyxBGIBMGA011672-TA2e-17973.61% 
DrosophilaCG5027-PA1e-11648.15% 
EBI UniRef50UniRef50_E2BCN62e-13051.88%Protein disulfide-isomerase TXNDC10 n=9 Tax=Endopterygota RepID=E2BCN6_HARSA
NCBI RefSeqXP_001870826.13e-13856.69%disulfide-isomerase TXNDC10 [Culex quinquefasciatus]
NCBI nr blastpgi|3123839081e-13756.93%hypothetical protein AND_02801 [Anopheles darlingi]
NCBI nr blastxgi|3123839081e-13856.56%hypothetical protein AND_02801 [Anopheles darlingi]
Group
Gene OntologyGO:00454548.4e-23cell redox homeostasis
GO:00150355.5e-05protein disulfide oxidoreductase activity
GO:00090555.5e-05electron carrier activity
GO:00066625.5e-05glycerol ether metabolic process
KEGG pathway 
InterPro domain[20-125] IPR0123361.8e-27Thioredoxin-like fold
[34-119] IPR0137668.4e-23Thioredoxin domain
Orthology groupMCL14050 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213520-TA
ATGAACTTCTTATTTGGATTTATCTGTATATTATTTTTGGGTTTTAATAGTGTTTCATCTTCAAGAGTGTTAGAGTTAAGTGATAAGTTTATTGACATCAGCAAAGATGGAGTTTGGTTTGTTAAGTTTTATGCTCCTTGGTGTGCTCACTGCCGAAGAATTGAACCAATATGGGCTCATGTAGCACAGGCTTTATATAATAGTCCTATAAAAGTTGCTAAAGTTGATTGTACTCGGTTTAGTGCTGTGGCTACTCACTTTAAAGTGAGAGCCTTTCCAACCATCATGTTTATAAAAGGACCATTCTGGCATGAATATTCTGGAGAAAGAGAAAAAGAAGACATGGTAAATTATGCCATGAGGATGGCTCAGCCAGCTGTTCAAGTTGTATCTCATACTGAAAGCTTTGCATATTTGAAAGATTCACACAATGTTTTCTTTGGATATATTGGAAAACAGCAAGGACCTCTATGGGAAATGTTTACGACCAATGCTGAGAAGTATCAAGCTCATAGTTGGTTTTATGCCATGTCTCATGAGGTAGTCAAAAATGACTTGAAGCCACCAAATGATACTGCTGTCTTTGTCCACAAAGACAATGAGATAATATATTTTACAGCCACACCAGAAATAATCAAAGACAGGGAGACATTAAATGTTACTCTCAAGAGATGGGTAAATTCAGAGAGATTTGGGTTTTTCCCAAAGATATCAAGATCTAACATAAATGATTTAATGGATACTAAAAAATATATTGTGATAGCTGTTGTATCAGAAAATAAATTAAATGAAATCACACAAACTGAACGAGACTTCAAGGATATGGTTGAGAGCATTATAAGATCAAAGAAACATGAGTTGCATCATCACTTCCAGTTTGGTTGGATGGGAAATCCGGAATTAGCCAATTCGATAGCGATGTCAGAGTTGGCGGTTCCATATTTGATAGTACTCAACTCTACCACCAACCACCACCACATTCCTGACGATGACCCAGTTCTGATGACTCCCGAAGCTGTTATTTTGTTTCTTGAACAGATACATGAACAAACTGCACCTACTTATGGCGGAAATGGTTGGATGGTGAGATTCTATAGAGGATTCTTTGAAGCCAGAACAACATTGATAAACATGTGGAGGGGGAACCCAGTCCTCACTGCTCTCGTCTTTGGACTACCGCTTGGATTTCTTTCCCTCATTTGTTATTCTATATGCTGTGCTGACATTTTGGATGCTGATGATGAGGAAATTACTGATACCCATGAGAAGAAAGATTAA

Protein sequence:

>DPOGS213520-PA
MNFLFGFICILFLGFNSVSSSRVLELSDKFIDISKDGVWFVKFYAPWCAHCRRIEPIWAHVAQALYNSPIKVAKVDCTRFSAVATHFKVRAFPTIMFIKGPFWHEYSGEREKEDMVNYAMRMAQPAVQVVSHTESFAYLKDSHNVFFGYIGKQQGPLWEMFTTNAEKYQAHSWFYAMSHEVVKNDLKPPNDTAVFVHKDNEIIYFTATPEIIKDRETLNVTLKRWVNSERFGFFPKISRSNINDLMDTKKYIVIAVVSENKLNEITQTERDFKDMVESIIRSKKHELHHHFQFGWMGNPELANSIAMSELAVPYLIVLNSTTNHHHIPDDDPVLMTPEAVILFLEQIHEQTAPTYGGNGWMVRFYRGFFEARTTLINMWRGNPVLTALVFGLPLGFLSLICYSICCADILDADDEEITDTHEKKD-