Monarch geneset OGS2.0

DPOGS204377
TranscriptDPOGS204377-TA1086 bp
ProteinDPOGS204377-PA361 aa
Genomic positionDPSCF300002 - 1762992-1776189
RNAseq coverage832x (Rank: top 15%)
Annotation
HeliconiusHMEL0057173e-17480.33% 
BombyxBGIBMGA007703-TA6e-2075.47% 
DrosophilaCG9171-PE1e-13259.44% 
EBI UniRef50UniRef50_Q7JRE11e-13059.44%CG9171, isoform C n=20 Tax=Endopterygota RepID=Q7JRE1_DROME
NCBI RefSeqXP_001845873.13e-13461.77%N-acetyl lactosaminide beta-1,3-N-acetyl glucosaminyl transferase [Culex quinquefasciatus]
NCBI nr blastpgi|1700360396e-13361.77%N-acetyl lactosaminide beta-1,3-N-acetyl glucosaminyl transferase [Culex quinquefasciatus]
NCBI nr blastxgi|1700360394e-13161.77%N-acetyl lactosaminide beta-1,3-N-acetyl glucosaminyl transferase [Culex quinquefasciatus]
Group
KEGG pathwaydme:Dmel_CG154833e-96 
 K00741 (B3GNT1, B3GNT2)maps-> Glycosphingolipid biosynthesis - lacto and neolacto series
    Glycosaminoglycan biosynthesis - keratan sulfate
Orthology groupMCL15077 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204377-TA
ATGTTGCCGCAGACATTGCAACGCGGAGATTATTGGGTTTTGAAGAATTTCGTTCGTGCTGATCACGGTTTCATCATGTGCCATGAATCTATAACCTACACAACACACGCTGGTTTTGAGTTTCTAGATAACGTCCAGCCCCTAGTTGAAAGATGGATGGCCCCAGTGAGTCTCGCGATTCACGCGCCGGGTACCGACATGGCCCCCACGGTGAATAGCATCAGATACCTGAGGGATTGTCTCGGGAACGAGCTCATCAAACAGTTCGTCACCTTCCACGTGTTCTTTTCACATAAAAACGTTCCCACTAGTATACCAAATCCGGAAACGTTTCTCCAGATCCCATACAATTGCTCAGCGAAGCCGCCATATAACGTGAACGCTAGCTCAACGTATATGAAAGCCAAGAATCTTTTGTATCCTGTGAACGTAGCTCGCAACATCGCGAGGGAGGCTGCTGTCACACATTATATATTGCCATCAGATATAGAGCTGTATCCTAGTCCTAATTTAGTACCTAGGTTCCTAAATATGATAGCTAGAAACGCTAAACCGCTAAGCACCTCCACTAAACCTAGAGTCTTTCCGATAAGCATATTTGAAGTCGGTGAAAAGGTTCAAGTGCCATCGACAAAGACAGAGCTTCGGGCTATGTTAGCTAACAAGACGGCTATCCCGTTCCATAAATTCGTCTGCCCCAATTGTCACAACATACCCGAGGGTCAGAAATGGATGAACACACCAGAAACTAATCGTATGGACGTTTTTCATGTTGGCAAGCGACGTGGTAAATTTGTGCATTGGGAGCCTATTTTCATCGGAACCCACCAAGACCCTTATTACGATGAGCGACTCAGCTGGGAGGGGAAGAAGGATAAAATGACACAAGGATATATCCTCTGCGTCAAAGACTATGACTTCATGATATTGAATAACGCATTCCTCATCCACAAACCGGGTATCAAACATTATGTAAAGAATCCCAAAAGAGACGCCATCGCTGGCCGACAGTCGTTGCTTATCAAAAGTATTATCATGCCGCAACTAAAAGCTCTCTTTGGGGCGAGGAGCGGCTGTGCTCTGTGA

Protein sequence:

>DPOGS204377-PA
MLPQTLQRGDYWVLKNFVRADHGFIMCHESITYTTHAGFEFLDNVQPLVERWMAPVSLAIHAPGTDMAPTVNSIRYLRDCLGNELIKQFVTFHVFFSHKNVPTSIPNPETFLQIPYNCSAKPPYNVNASSTYMKAKNLLYPVNVARNIAREAAVTHYILPSDIELYPSPNLVPRFLNMIARNAKPLSTSTKPRVFPISIFEVGEKVQVPSTKTELRAMLANKTAIPFHKFVCPNCHNIPEGQKWMNTPETNRMDVFHVGKRRGKFVHWEPIFIGTHQDPYYDERLSWEGKKDKMTQGYILCVKDYDFMILNNAFLIHKPGIKHYVKNPKRDAIAGRQSLLIKSIIMPQLKALFGARSGCAL-