Monarch geneset OGS2.0

DPOGS210294
Transcript: DPOGS210294-TA, 1023 bp
Protein: DPOGS210294-PA, 340 aa
Genomic position: DPSCF300551, 14991-18819
RNAseq coverage: 1842x (Rank: top 7%)
Annotation
Heliconius       HMEL005548         4e-161   81.82%
Bombyx           BGIBMGA004220-TA   2e-160   77.71%
Drosophila       CG6903-PA          2e-47    36.10%
EBI UniRef50     UniRef50_D6WH02    2e-73    43.90%   Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WH02_TRICA
NCBI RefSeq      XP_974454.1        3e-74    43.90%   PREDICTED: similar to heparan-alpha-glucosaminide N-acetyltransferase [Tribolium castaneum]
NCBI nr blastp   gi|91078976        6e-73    43.90%   PREDICTED: similar to heparan-alpha-glucosaminide N-acetyltransferase [Tribolium castaneum]
NCBI nr blastx   gi|91078976        1e-76    43.90%   PREDICTED: similar to heparan-alpha-glucosaminide N-acetyltransferase [Tribolium castaneum]
Group
KEGG pathway     tca:663306         9e-74
                 K10532 (HGSNAT) maps to: Lysosome; Glycosaminoglycan degradation
Orthology group  MCL17342           Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210294-TA
ATGGGGGAGGCCATGGTGTTGTCCCTCAACGCCAGGCTGAGGACCTCGCTGCCGAGGGTCAACGCGCTTGGACAGGTCGCAAGGAGGTCTCTTCTTCTGTCCCTGATCGGCATCTGCCTAGGATCAGTGAACACCAACTGGTCCTACGTTCGGTTCCCGGGCGTGCTACAGCGACTCGCTGCTATGTACTTGATTGTTGGGTCTTTGGAGTGCGCTTTTATGAGGACCAGCCAGAATATCATACCCGGTCGGTCGCTGTTCCGTGACATCGCTGCCGGTTGGCAACAGTGGTTGGCCACCGTGCTGATGGTAGCTATACAGCTGTGTATAACATTGACGGTCGCTGCCCCCGGGTGTCCCGTGGGGTACTCGGGTCCCGGGGGGCTGCATAGGACAGCGACCGGGGACTTCTCCCTCCAGAACTGCACAGGTGGCATAGCTGGTTATATTGACAGGCTCATCCTCGGCCCGAATCATTTGTACCAGCACGGGACCTTCAAGAGCATCTACCGCACCCAGCTACCCCACGACCCTGAGGGTATCCTGGGCATACTGTCCGGGGTTCTGGTCGTCCAAGCCGGGGCCCACGCCGCCAGGATCATGCTCGTGTATAACCACGCCAGAGCCCGCATCATGCGTTGGGTTTTCTGGTCTGTGATGTTCGGCGTGGTGGGCGGTCTGCTCTGTAAGTTTTCGGATGGCGGGTACATCCCCGTCAACAAGAACCTGTGGTCTGTGTCGTATTGCCTGGTGACGTCATCGATGGCCTTCTTCATCCAGGCGATCTTGTACTTCGTGGTTGATCTGAAGAACAAGTGGGGTGGACGACCCCTGTACTATGCGGGTCAGAACGCGCTCTTCCTCTACGTCGGGAGTGAGCTGCTTAAAAAGCACTTCCCCCTCCACTGGCACCTCCCCTCCCCTATCCACGCTCAGCTGCTCACCGCGAACGCAGCCGCGGCTCTCATGTGGCTGGCGGTCGCTGTGGCCCTGCACAGGAAACGCGTCTTTATTACTCTGTAA

Protein sequence:

>DPOGS210294-PA
MGEAMVLSLNARLRTSLPRVNALGQVARRSLLLSLIGICLGSVNTNWSYVRFPGVLQRLAAMYLIVGSLECAFMRTSQNIIPGRSLFRDIAAGWQQWLATVLMVAIQLCITLTVAAPGCPVGYSGPGGLHRTATGDFSLQNCTGGIAGYIDRLILGPNHLYQHGTFKSIYRTQLPHDPEGILGILSGVLVVQAGAHAARIMLVYNHARARIMRWVFWSVMFGVVGGLLCKFSDGGYIPVNKNLWSVSYCLVTSSMAFFIQAILYFVVDLKNKWGGRPLYYAGQNALFLYVGSELLKKHFPLHWHLPSPIHAQLLTANAAAALMWLAVAVALHRKRVFITL-
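
Length consistency check:

The transcript and protein lengths listed in this record (1023 bp and 340 aa) can be cross-checked with simple codon arithmetic. The following is a minimal Python sketch, not part of the original record. The function name `expected_protein_length` is hypothetical, and the sketch assumes the transcript is a complete coding sequence that includes its stop codon, as the terminal TAA of the nucleotide sequence and the trailing "-" of the protein sequence suggest.

```python
def expected_protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumption: the sequence is a complete open reading frame whose
    length is a multiple of three. If includes_stop is True, the final
    codon is a stop codon and contributes no amino acid.
    """
    if cds_length_bp <= 0 or cds_length_bp % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Lengths as reported in this record: 1023 bp transcript, 340 aa protein.
print(expected_protein_length(1023))  # 340
```

With the stop codon counted, 1023 bp corresponds to 341 codons and therefore 340 amino acids, which agrees with the protein length given above.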