Monarch geneset OGS2.0

DPOGS206769
TranscriptDPOGS206769-TA996 bp
ProteinDPOGS206769-PA331 aa
Genomic positionDPSCF300001 - 6220759-6223471
RNAseq coverage5x (Rank: top 88%)
Annotation
HeliconiusHMEL0061948e-17686.40% 
BombyxBGIBMGA010842-TA2e-17083.69% 
DrosophilaCG16733-PA2e-8747.26% 
EBI UniRef50UniRef50_D6X2G03e-9050.00%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X2G0_TRICA
NCBI RefSeqXP_002430060.13e-11459.38%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420192165e-11359.38%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|2420192165e-11359.38%conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene OntologyGO:00081462.1e-57sulfotransferase activity
KEGG pathwaydpo:Dpse_GA141144e-86 
 K01016 (SULT1E1, STE)maps-> Steroid hormone biosynthesis
    Sulfur metabolism
InterPro domain[59-326] IPR0008632.1e-57Sulfotransferase domain
Orthology groupMCL10183 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206769-TA
ATGAGGACCAAACCTAATTTGACCTTTTCCAAACTGGACAAGGAAACAGGAGATGTACTGGATAGAATGTTCGAGAAAGAAGATTGCATGGTCGAGATTAATCCTGGCCGCGTTATTCTACCAGCGGATTACATGACGATAGGTCAGGATATATTGGATATGGATGTTTTGGAAAGCGATGTTTGGATGCTTTCCTATCCAAGAACTGGTTCAACATGGGCCCAGGAAATGGTGTGGTTGATTGGACACGACTTGGACTACGAAGGCGCAATGTCTTTACAACAGATTCGTTGTCCGTTAGTGGAATTATCTTGTATCATGGTTGATGGGCACGCCCAATGGCATGACGAATCTGTTGGGGGAACCTCGGTCGACCTGGTGAAGTATCGGGTGCCTCACCCTCGCTACATTCGCAGCCATTTACCCTGGGACCTGTTGCCTGTGGATATACTTAACGCTGATGGCACTGTTAAGCCCAAGGTCATTTATACTTCTCGGAACCCGAAGGACATGGTGGTATCATACTACCACTACTGTTCGCTGGTTCACGGGATGAAGGGAAGCTTTGAGGAGTTCTGCGACCTCTTCATGAGAGATCGAGCGCCGTTTGGACCCGTTTGGAATCATATACTGGGTTTCTGGAACAGACGTGAGGATCCCAACATACTCTTTATAAAGTTTGAAGAAATGAAACGAGACTTGCCAACAGTTGTCAGGAAGACGGCGAAGTTTCTAGATAAAACGTTGAGCGACGAGGAAGTATTCAAATTATGTGATTATCTGTCATTTGCGAACATGAAATCGAACCGCGCTGTTAACTTGGAGGCCATCTTGGAGAAATCGTACGGAAAACACTTTCTGGAGCAAACGTCGCTGAGGTTCATCAGGAAGGGGGAGATTGGAGATTGGAAGAATTTCATGTCCGACGAGCTATCGAGAAGATTCGACGATTGGGCGGAACAGAACCTCAAAGGCACTGAACTGAGTTTTGAATAA

Protein sequence:

>DPOGS206769-PA
MRTKPNLTFSKLDKETGDVLDRMFEKEDCMVEINPGRVILPADYMTIGQDILDMDVLESDVWMLSYPRTGSTWAQEMVWLIGHDLDYEGAMSLQQIRCPLVELSCIMVDGHAQWHDESVGGTSVDLVKYRVPHPRYIRSHLPWDLLPVDILNADGTVKPKVIYTSRNPKDMVVSYYHYCSLVHGMKGSFEEFCDLFMRDRAPFGPVWNHILGFWNRREDPNILFIKFEEMKRDLPTVVRKTAKFLDKTLSDEEVFKLCDYLSFANMKSNRAVNLEAILEKSYGKHFLEQTSLRFIRKGEIGDWKNFMSDELSRRFDDWAEQNLKGTELSFE-