Monarch geneset OGS2.0

DPOGS210791
Transcript        DPOGS210791-TA  804 bp
Protein           DPOGS210791-PA  267 aa
Genomic position  DPSCF300027 - 1309875-1312003
RNAseq coverage   14x (Rank: top 82%)
Annotation
Heliconius      HMEL002440        1e-134  86.03%
Bombyx          BGIBMGA007104-TA  2e-105  76.98%
Drosophila      Hs3st-B-PA        9e-112  70.57%
EBI UniRef50    UniRef50_Q7PS29   2e-110  70.94%  AGAP000422-PA n=7 Tax=Endopterygota RepID=Q7PS29_ANOGA
NCBI RefSeq     XP_002022384.1    1e-111  69.70%  GL13007 [Drosophila persimilis]
NCBI nr blastp  gi|195163087      2e-110  69.70%  GL13007 [Drosophila persimilis]
NCBI nr blastx  gi|347963786      5e-109  70.94%  AGAP000422-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0008146            1.3e-37  sulfotransferase activity
KEGG pathway     cqu:CpipJ_CPIJ008491  1e-110   K07809 (HS3ST3) maps-> Glycosaminoglycan biosynthesis - heparan sulfate
InterPro domain  [15-252] IPR000863    1.3e-37  Sulfotransferase domain
Orthology group  MCL10513              Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210791-TA
ATGACATTAAGGAAACCGAACCTGGTTCCAACGAAGAGACTGCCGGATGCCTTGATTATAGGTGTTAAGAAATGTGGAACGAGAGCGCTTTTGGAATTTTTAAGATTGCATCCAGACGTAAGGGCTGCCGGATCCGAGGTTCATTTTTTCGACAAATTTTATCATAAGGGATTCGAATGGTACAGGAACAGAATGCCACCAACTCTCGAGGGTCAGATTACTATGGAGAAGACACCTTCTTATTGGGTAACACGATCGGCCCCGAAGCGTGTCTTCGCTATGAATCCAGCCGTCAAACTATTGGCCGTAGTCAGAGATCCTGTCACTAGAGCTATTAGTGACTACACCCAGTCCGCAAGTAAGAGACCGAGCCTGCCTCGTTTTGAGGAGTTGGCGCTGATGGATAGCCCGTGGGGTTCCGTAGTGGACACCTGGCCGCCCGTCCGACTGGGAATATACGCCAGACCTCTGAGACGTTGGCTGAGAAGGTTCCCAAGGTCCAGGATACTTATCATCAGCGGAGAGAGACTCGTTGTGGACCCCGCCGCTGAAATGACTAGGGTTCAGGAATTCCTGAACCTCAAGCCAGTGATAACGGAAAAACATTTCTACTTTAATTCCACCAAAGGGTTCCCATGCCTGCTCAAATCTGAGAGCCGGTCAACTCCGCACTGCCTTGGCAAAACCAAAGGTAGAAATCATCCCTACATAGATCCAGTAGCCTTAGAGAGACTGAGAGAATTCTATAGACCTCATAATGAGAGATTTTATGAACTATCTGGTATAAATTTTGGTTGGCAGTAA

Protein sequence:

>DPOGS210791-PA
MTLRKPNLVPTKRLPDALIIGVKKCGTRALLEFLRLHPDVRAAGSEVHFFDKFYHKGFEWYRNRMPPTLEGQITMEKTPSYWVTRSAPKRVFAMNPAVKLLAVVRDPVTRAISDYTQSASKRPSLPRFEELALMDSPWGSVVDTWPPVRLGIYARPLRRWLRRFPRSRILIISGERLVVDPAAEMTRVQEFLNLKPVITEKHFYFNSTKGFPCLLKSESRSTPHCLGKTKGRNHPYIDPVALERLREFYRPHNERFYELSGINFGWQ-
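
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein entries above can be cross-checked against each other: 804 bp corresponds to 268 codons, i.e. 267 amino acids plus one stop codon. The sketch below assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (assumption: no alternative code applies).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, mirroring the "-" that
    ends the protein sequence in this record (an assumption about the
    record's notation).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 21 bases and last 9 bases of DPOGS210791-TA, copied from above.
print(translate("ATGACATTAAGGAAACCGAAC"))  # MTLRKPN
print(translate("TGGCAGTAA"))              # WQ-

# Length check: 267 residues plus one stop codon, three bases each.
print(3 * (267 + 1))                       # 804
```

Passing the full DPOGS210791-TA sequence to `translate` should, under the same assumptions, give the DPOGS210791-PA sequence including its trailing "-".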