Monarch geneset OGS2.0

DPOGS213506
TranscriptDPOGS213506-TA1002 bp
ProteinDPOGS213506-PA333 aa
Genomic positionDPSCF300033 - 1082141-1083491
RNAseq coverage151x (Rank: top 53%)
Annotation
HeliconiusHMEL0035555e-4478.57% 
BombyxBGIBMGA011786-TA1e-12874.14% 
DrosophilaCG4623-PA1e-5637.67% 
EBI UniRef50UniRef50_E7E2751e-14273.19%Ganglioside-induced differentiation-associated-protein n=1 Tax=Bombyx mori RepID=E7E275_BOMMO
NCBI RefSeqNP_001186866.12e-14373.19%ganglioside-induced differentiation-associated-protein [Bombyx mori]
NCBI nr blastpgi|3156332094e-14273.19%ganglioside-induced differentiation-associated-protein [Bombyx mori]
NCBI nr blastxgi|3156332091e-14073.19%ganglioside-induced differentiation-associated-protein [Bombyx mori]
Group
Gene OntologyGO:00055152.6e-09protein binding
KEGG pathwayecb:1000521902e-07 
 K01800 (E5.2.1.2, maiA)maps-> Styrene degradation
    Tyrosine metabolism
InterPro domain[29-108] IPR0123366.3e-20Thioredoxin-like fold
[179-294] IPR0109873.1e-14Glutathione S-transferase, C-terminal-like
[36-101] IPR0040452.6e-09Glutathione S-transferase, N-terminal
[207-284] IPR0040462.3e-07Glutathione S-transferase, C-terminal
Orthology groupMCL16248 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213506-TA
ATGCATTATGTACAAAAATATTTGGATAAATTGCAGGCACCAAAGTTAAATAACACATCTTTAAGTAATGGATACAAAACAAATATATTTTTATATTGCAATTATTACAGCTTTTACTCTCAAAAGGTTTTGATGGCACTATACGAAAAAAATATAGACTTCGAACCAATAGTTATAGATATAACTAAAGGTGAACAGTATTCTCAATGGTTTCTGGAACTTAATCCTCGGGGAGAAATCCCAGTTCTTAAAGTAAACAAATCCATTATTCCGGATTCCACCAGAATTTTAGATTATTTGGAGATGTACCTGGATCAAGAGAACCCACCATTACTGGAGGTTTCTCAAGATCCGAAAGTAATGATGAACATTGTTAAGTTTCGGGAACTAATTGAAGCCCTGCCTGCTGGTGTAATTACTGTGGGATCATTCTTCCATCCACATCTTTCTGGACGGCCCAAATTACCATTCATTTTGCCAGTTAGAGAAGTGCTCAAAAGTGGTGATTTAAGTAATTCTAAAAATCTAAGAAGGTTAGCTGAAGAAAATCCAAAGGCCAAGAGTGTTCTTCTATACAAAGCAGAGATACAGGATCGAAAACAAGAAATACTTACTAATGAAGAAGAATATCTTAAAATCATTAATATAGTTGATGATGTACTGTCACAAGTTGAAGAGCAGTTGAAAAAACAAAATGATGATAGTTGGCTTTGCTGTGATAAATTTAGTATTGCTGATATTAATTTGGCTGTGCTTTTACAACGTTTGTGGGAGTTGGGGCTGGATGAGCGCTTTTGGGCATTCGGCAAACGCCCCTACATTGAGAATTACTTCGTCCGTGTCAAACAAAGAGATTCTTTCCAAAAGACCATTCCTGGCCTACCGGTCCATGTTAAAATGATTTTAACATCACAACCGCCTATATATGTTGCTTCGGCGGGAATTGTGTCCATTTCTCTTGTGATAACATTGGCATATCTTTTCAAAAAATTAATATGGTAA

Protein sequence:

>DPOGS213506-PA
MHYVQKYLDKLQAPKLNNTSLSNGYKTNIFLYCNYYSFYSQKVLMALYEKNIDFEPIVIDITKGEQYSQWFLELNPRGEIPVLKVNKSIIPDSTRILDYLEMYLDQENPPLLEVSQDPKVMMNIVKFRELIEALPAGVITVGSFFHPHLSGRPKLPFILPVREVLKSGDLSNSKNLRRLAEENPKAKSVLLYKAEIQDRKQEILTNEEEYLKIINIVDDVLSQVEEQLKKQNDDSWLCCDKFSIADINLAVLLQRLWELGLDERFWAFGKRPYIENYFVRVKQRDSFQKTIPGLPVHVKMILTSQPPIYVASAGIVSISLVITLAYLFKKLIW-