Monarch geneset OGS2.0

DPOGS209647
Transcript        DPOGS209647-TA  381 bp
Protein           DPOGS209647-PA  126 aa
Genomic position  DPSCF300015 + 1162411-1164871
RNAseq coverage   1116x (Rank: top 11%)
Annotation
Heliconius        HMEL017057        2e-57  76.98%
Bombyx            BGIBMGA006709-TA  6e-57  77.78%
Drosophila        cl-PA             1e-33  48.41%
EBI UniRef50      UniRef50_Q9VMQ9   2e-31  48.41%  Clot n=15 Tax=Schizophora RepID=Q9VMQ9_DROME
NCBI RefSeq       XP_002089215.1    2e-32  47.62%  cl [Drosophila yakuba]
NCBI nr blastp    gi|383861296      3e-31  48.41%  PREDICTED: thioredoxin domain-containing protein 17-like [Megachile rotundata]
NCBI nr blastx    gi|383861296      6e-32  49.17%  PREDICTED: thioredoxin domain-containing protein 17-like [Megachile rotundata]
Group
KEGG pathway      smm:Smp_128310.1  5e-11
                  K00771 (XYLT) maps-> Glycosaminoglycan biosynthesis - heparan sulfate
                                       Glycosaminoglycan biosynthesis - chondroitin sulfate
InterPro domain   [7-124]  IPR010357  2.8e-43  Protein of unknown function DUF953, thioredoxin-like
                  [2-124]  IPR012336  4.1e-31  Thioredoxin-like fold
Orthology group   MCL10901  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209647-TA
ATGGTCAACTACGTGGATATAAAGGGATTTCAAGATTTTACTAAATATACAGAGCAAATAAACTCAAATGATCCACCTGTATTATTCTTCTTTAGTGGATCAAAACTTCCCAATGGCAATAGTTGGTGTCCCGACTGTGTTGAAGCGGAACCTGTGGTCAAAGCATTTCTAAGCGAATTGAAGAAAGATATAACCTTTGTTTATGTCGATGTTGGCGACAGAGATTATTGGAAAGACCGAGCCTGCCCGTTCCGTACCGACAGTCGCACTAAGCTGATGGTAATACCGACGATAATAAAATGGAAAGGAGTTCAGAGGCTAGAGGGCAGTCAGTGCAACAACAGGGAGTTGCTGCAAATGTTGATGGAAGACGAGGATTGA

Protein sequence:

>DPOGS209647-PA
MVNYVDIKGFQDFTKYTEQINSNDPPVLFFFSGSKLPNGNSWCPDCVEAEPVVKAFLSELKKDITFVYVDVGDRDYWKDRACPFRTDSRTKLMVIPTIIKWKGVQRLEGSQCNNRELLQMLMEDED-
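
The transcript and protein lengths in this record are related by simple arithmetic: 381 bp is 127 codons, that is, 126 amino acids plus one stop codon. The sketch below is one way to check that the two sequences agree. It assumes the standard genetic code (NCBI translation table 1) and follows the record's convention of writing the stop codon as "-". The names `translate`, `TRANSCRIPT` and `PROTEIN` are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` to mirror the trailing "-"
    used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# Sequences copied from the record above.
TRANSCRIPT = "ATGGTCAACTACGTGGATATAAAGGGATTTCAAGATTTTACTAAATATACAGAGCAAATAAACTCAAATGATCCACCTGTATTATTCTTCTTTAGTGGATCAAAACTTCCCAATGGCAATAGTTGGTGTCCCGACTGTGTTGAAGCGGAACCTGTGGTCAAAGCATTTCTAAGCGAATTGAAGAAAGATATAACCTTTGTTTATGTCGATGTTGGCGACAGAGATTATTGGAAAGACCGAGCCTGCCCGTTCCGTACCGACAGTCGCACTAAGCTGATGGTAATACCGACGATAATAAAATGGAAAGGAGTTCAGAGGCTAGAGGGCAGTCAGTGCAACAACAGGGAGTTGCTGCAAATGTTGATGGAAGACGAGGATTGA"
PROTEIN = "MVNYVDIKGFQDFTKYTEQINSNDPPVLFFFSGSKLPNGNSWCPDCVEAEPVVKAFLSELKKDITFVYVDVGDRDYWKDRACPFRTDSRTKLMVIPTIIKWKGVQRLEGSQCNNRELLQMLMEDED-"

predicted = translate(TRANSCRIPT)
print(len(TRANSCRIPT), "bp ->", len(predicted.rstrip("-")), "aa")
print("matches record:", predicted == PROTEIN)
```

The same function can be applied to other transcript/protein pairs from the geneset, provided they also use the standard code.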