Monarch geneset OGS2.0

DPOGS205168
TranscriptDPOGS205168-TA1137 bp
ProteinDPOGS205168-PA378 aa
Genomic positionDPSCF300197 - 195608-201232
RNAseq coverage509x (Rank: top 25%)
Annotation
HeliconiusHMEL0127994e-4873.73% 
BombyxBGIBMGA001270-TA3e-7785.00% 
DrosophilaCG6903-PA3e-3240.22% 
EBI UniRef50UniRef50_C3YL829e-4131.06%Putative uncharacterized protein (Fragment) n=1 Tax=Branchiostoma floridae RepID=C3YL82_BRAFL
NCBI RefSeqXP_002100049.18e-5736.62%GE16376 [Drosophila yakuba]
NCBI nr blastpgi|3504236011e-5835.40%PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Bombus impatiens]
NCBI nr blastxgi|3504236012e-5934.65%PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like [Bombus impatiens]
Group
KEGG pathwaydya:Dyak_GE163762e-56 
 K10532 (HGSNAT)maps-> Lysosome
    Glycosaminoglycan degradation
Orthology groupMCL16628 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205168-TA
ATGGCTGTGCCTTGTGAATGTCAGCTTATGGAATCTCTGCTCCTACTATTAGGAGTTCTGATCGGGGCGTCGATTCTCTATGGTGTGTTGAGGCTCATCATGTCGAGGGTGAGGAGATCGGCCAAGTTGAGGTATGGAGATAAGGAACTTGCCCTACAGCAACGTCTACGAGCTCTGGACACGTTTCGAGGGATAGCTATCGTTTTCATGATATTTGTGAACGATGGTGCGGGAGGGTACTGGTGGTTGGAACACGCCACCTGGAACGGTCTCTCAGCTGGAGACCTGGTGTTCCCCGCCTTCCTCTGGATCATGGGAGTCTGCATCCCATTATCAATAAAGAGCGCCTTCGCTAAGGGTATACCCAGGTGGAAAATCGTCCTACACATTTTTAAGGTAAATCAAATACTATATTTTATCACAATACGAAGAGAAAGAAATAAATATAAAATTAATTCGTATAATGAATTTCGAGTGATTTATCTTAGAAGAGTTTTTAATTCGTGTCCAATATCTCTACATCTTATTTCGCAAGCTCATACAGTTAAGCATTACCTCCGTATAACGACTACACCGAGAAGTAATAGATTTTTTAGCTTCCAAGTCAGTCATTTTGTGCATTCTCCAGATTATATTTCCTATCGGTATGTCGCAGGATGCCTCACCTCCGCGGTTCAAGCCCTGGTCGGCATCCAAGCCGGCGCCACTGTCCTTCTCCAACGTTCGCACAAAGCCCGGGTGTCTCGTTGGCTGGCCTGGGCTCTAGTGCTAGCTCTAGCCGGGGCCCTACTCGCTGGATTCTCGAGGGAACACGGAGTGCTACCCATCAATAAGAACTTGTGGTCCATGTCGTTCGTGCTGGTGACGTCAGCGGTGTCCCTCGCGATACTCAGCATCTGTTACACATTCACGGACGCCTGGCGGCTGTGGGGCGGAGGACCCTTCAGAGCTCCAGGTCTTAACGCCATCGCCCTGTACATAGGCCACTCGATCTGCGCGCACATATTCCCATTCCATTGGAAAATCCCAACCATGAGAACGCACGCCGTCTATCTCGTGGAAGCCGTATGGGGAACCGCGTTGTGGGTCATCATCGCTCACGTCATGGCCAGAAAGAAAGTATTCATCACCCTCTGA

Protein sequence:

>DPOGS205168-PA
MAVPCECQLMESLLLLLGVLIGASILYGVLRLIMSRVRRSAKLRYGDKELALQQRLRALDTFRGIAIVFMIFVNDGAGGYWWLEHATWNGLSAGDLVFPAFLWIMGVCIPLSIKSAFAKGIPRWKIVLHIFKVNQILYFITIRRERNKYKINSYNEFRVIYLRRVFNSCPISLHLISQAHTVKHYLRITTTPRSNRFFSFQVSHFVHSPDYISYRYVAGCLTSAVQALVGIQAGATVLLQRSHKARVSRWLAWALVLALAGALLAGFSREHGVLPINKNLWSMSFVLVTSAVSLAILSICYTFTDAWRLWGGGPFRAPGLNAIALYIGHSICAHIFPFHWKIPTMRTHAVYLVEAVWGTALWVIIAHVMARKKVFITL-