Monarch geneset OGS2.0

DPOGS201554
Transcript         DPOGS201554-TA  984 bp
Protein            DPOGS201554-PA  327 aa
Genomic position   DPSCF300201 - 121323-123919
RNAseq coverage    209x (Rank: top 46%)
Annotation
Heliconius       HMEL010920        5e-167  83.23%
Bombyx           BGIBMGA006029-TA  3e-119  81.36%
Drosophila       Hs2st-PA          5e-115  58.10%
EBI UniRef50     UniRef50_E2C5N2   3e-113  57.06%  Heparan sulfate 2-O-sulfotransferase 1 n=2 Tax=Formicidae RepID=E2C5N2_HARSA
NCBI RefSeq      XP_002057669.1    9e-119  60.84%  GJ17976 [Drosophila virilis]
NCBI nr blastp   gi|195398113      2e-117  60.84%  GJ17976 [Drosophila virilis]
NCBI nr blastx   gi|195398113      6e-117  61.03%  GJ17976 [Drosophila virilis]
Group
Gene Ontology    GO:0008146  1.5e-146  sulfotransferase activity
                 GO:0016021  1.5e-146  integral to membrane
KEGG pathway     dvi:Dvir_GJ17976  3e-118
                 K02513 (HS2ST1)  maps-> Glycosaminoglycan biosynthesis - heparan sulfate
InterPro domain  [6-328]   IPR007734  1.5e-146  Heparan sulphate 2-O-sulfotransferase
                 [49-303]  IPR005331  1.4e-29   Sulfotransferase
Orthology group  MCL13604  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201554-TA
ATGTGTGTTGTTGTTGTTGTCATTCTATATTTCCAGTCGGAAATAACGCGACTGGAGGACGTGTATCGCAAACTGGAGTTCAGAGTGTTGCAGTCGCATTCCGAGGCTCGTCAATACACGACGAGAGACCCGCCGAGAGATGACGACAATCTAGTAGTTATTTATAATAGAGTTCCTAAGACTGGCTCCACGAGTTTCGTCGGCGTAGCTTATGATTTATGTAAGAAAAATCATTTCAAAGTGCTACACATCAATATAACGGCCAACATGCACGTCATGTCGCTCAGTAATCAGTACAGATTTGCTCAGAATGTTACCAAGTGGCAGGAAGTGAAGCCGGCCCTCTACCATGGTCATATGGCATTCCTTAACTTTGAAAGGTTGGGAACAAACGCGAGGCCTCTCTTCATCAATTTAATCAGGAAGCCTCTAGACAGGCTCGTGTCTTATTATTATTTTCTACGACACGGTGATAACTTCAGGCCTCATCTAGTGAGGAAGAAACATGGCGATAAAATGACATTTGACGAGTGCGTGGAGAAGGGTCAGGCGGACTGTGACCCCAGCAACATGTGGCTCCAGGTGCCGTTCTTCTGTGGACATGCTGCCGAGTGCTGGCGTCCCGGTAGTCCGTGGGCGCTGCAGCAGGCGAAGCACAACCTGGTGCATCACTATCTAGTGGTGGGCGTGACGGAGGAGATGCTGGCCTTCATCAGCGTGCTGGAGGCGACCCTGCCGCGCCTGTTCCGCGGCGCCACCGACCACTACCGCAGCAGCAACAGGTCGCACCTCCGGCAGACCAGCGCCAAGATAGAGCCCTCGCAGCGGACGGTCGACAGGATACAGCAGTCGGTCATATGGAAGATGGAGAACGAGCTGTACGAATTCGCGTCCGAACACTTCAAGTTCGTCAAGAAGAAAGTATTGAAGGAGGCCAACAGCGCGCCGCAGGTGTTCTTCTACGAGAAGATACGACCGAAATGA

Protein sequence:

>DPOGS201554-PA
MCVVVVVILYFQSEITRLEDVYRKLEFRVLQSHSEARQYTTRDPPRDDDNLVVIYNRVPKTGSTSFVGVAYDLCKKNHFKVLHINITANMHVMSLSNQYRFAQNVTKWQEVKPALYHGHMAFLNFERLGTNARPLFINLIRKPLDRLVSYYYFLRHGDNFRPHLVRKKHGDKMTFDECVEKGQADCDPSNMWLQVPFFCGHAAECWRPGSPWALQQAKHNLVHHYLVVGVTEEMLAFISVLEATLPRLFRGATDHYRSSNRSHLRQTSAKIEPSQRTVDRIQQSVIWKMENELYEFASEHFKFVKKKVLKEANSAPQVFFYEKIRPK-
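
The transcript and protein lengths listed above are consistent with each other: 327 codons plus one stop codon give 3 x 328 = 984 bp. The following is a minimal Python sketch of that check, together with a translation of the first and last codons of the transcript. It assumes the standard genetic code applies and that the trailing "-" in the protein record marks the stop codon; the helper names `translate` and `expected_cds_length` are hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). Assumed to apply to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length_aa: int, has_stop: bool = True) -> int:
    """Length in bp of a CDS encoding a protein of the given length."""
    return 3 * (protein_length_aa + (1 if has_stop else 0))


if __name__ == "__main__":
    # First 24 and last 18 nucleotides of DPOGS201554-TA, copied from the record.
    print(translate("ATGTGTGTTGTTGTTGTTGTCATT"))
    print(translate("AAGATACGACCGAAATGA"))
    print(expected_cds_length(327))
```

The two fragments are in frame because they are taken from the very start and the very end of a 984 bp sequence, whose length is a multiple of three.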