Monarch geneset OGS2.0

DPOGS204781
TranscriptDPOGS204781-TA1380 bp
ProteinDPOGS204781-PA459 aa
Genomic positionDPSCF300231 + 609301-612678
RNAseq coverage27x (Rank: top 76%)
Annotation
HeliconiusHMEL0176950.084.00% 
BombyxBGIBMGA013719-TA4e-17676.00% 
DrosophilaCG3253-PA1e-7447.70% 
EBI UniRef50UniRef50_B0XF902e-8649.57%N-acetyl lactosaminide beta-1,3-N-acetyl glucosaminyl transferase n=3 Tax=Neoptera RepID=B0XF90_CULQU
NCBI RefSeqXP_002426793.11e-8754.61%N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420121292e-8654.61%N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase, putative [Pediculus humanus corporis]
NCBI nr blastxgi|1571139082e-8450.44%n-acetyllactosaminide beta-1,3-n-acetylglucosaminyltransferase [Aedes aegypti]
Group
KEGG pathwaydme:Dmel_CG32531e-72 
 K00741 (B3GNT1, B3GNT2)maps-> Glycosphingolipid biosynthesis - lacto and neolacto series
    Glycosaminoglycan biosynthesis - keratan sulfate
Orthology groupMCL14210 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204781-TA
ATGCGGCTGTCTGAATTACAAAGACCATTGACCAAAACCAGGAGAGACAGAATTTGGCGTTGGCGTTGTCAATGGAGTATCGTAACACTAGTGGCTGTGACGCTAGTTGTTTACAACGCTGTCGCTAACCTGTGGTTGCTGAGCCCTGTGCCATGTTCGCCGCGCAGCACGCCGCAACCTGAGCTGCCGACCTGCGAGCCCTGCATCGACACAGCTCCTCTCTCGATAGAAGAGGACCCCATCTCTAACCTGGACCTGCGTCTCGGGAGATGGGACGGCTCCAGGTCTTACCGGATGTTCGATTACGCGGCTGTCGGGGATATGTACGCGGACGCTTCCTCCAACCGCAGAGTGTGTCTCGCGACTCAGAGCTCGATCGAACGTTTGCACGAGTTGCTGGGTATAGCAGCTCACTGGACCGGACCCATCTCAGTCGCAGTGTTCGTGGCCGGCGACGAGTTAAGGTTGTTGCGGGCCTTCGAAATGTGGCTGTTCAGATGTCAGCCGGATGTGTACGCGCGGTTGGCCCTTCACGTCGCGATGCCGGCAGAGAGACCGGGCGTGCAGTCGAATGTGCCGAATTGGGCGAGGAACTGCAACGTCGCTCCTTTAGCAACCGAAGAAAGGAAGTCGGAAACAGTGGCGTGGAGAGCGCGCCACCCTTATCCTCAGAATCATTTACGTAACCTAGCCAGAAAAAACTGCCATACTCCCTACGTTTTCCTGGTCGACGTAGACATAGTTCCTTCTAGAGGAATGGCGGAGGCGCTGGACAGTTTTCTGTCGAGTGTTCCAAAGTGTCAAATGTGCGCATACGTGGTGCCAACCTATGAACTAGATAAGAGAGTTGCCAACTTTCCAACGAATAAATCAGAACTGTTAAGACTTTCAAGAAATAAATTAGCGATACCGTTTCATAGGAAGGTGTTCATATATAATCAGTACGCTTCAAACTTTTCCAGGTGGGAGTCTTCAGGCGGGAATGAAAGCCTCTCAACTCATATCAGTCACACGGTGACTAACTTCGAGTTGCTTTACGAGCCGTTCTACGTGGCCCCGGATACCGTGCCCGCTCATGATGAGAGGTTCCTAGGGTACGGCTTCACCAGGAACACTCAGCCTGGCACCTATAACAAGCTGCTCAACGATCGTACCATGCCAAAGATCGCTCGACTCAGCTCATCCCCATCTCTCTCAGACTCTCTGCTAGAGACCGAGAAGAAGCTCACTTCTCACTGCTTCTTTGCTGTTCCTTTTTTCATCTATCAGGAGCGGGGTCTGGACCGGTCGATAGTACAACCTGCCGGTATAATTCTGGAAGTCATTAGAACACTTAACCCGTCCAATCACTACAAGGTGACGACTGCGGAGGGTTACTAA

Protein sequence:

>DPOGS204781-PA
MRLSELQRPLTKTRRDRIWRWRCQWSIVTLVAVTLVVYNAVANLWLLSPVPCSPRSTPQPELPTCEPCIDTAPLSIEEDPISNLDLRLGRWDGSRSYRMFDYAAVGDMYADASSNRRVCLATQSSIERLHELLGIAAHWTGPISVAVFVAGDELRLLRAFEMWLFRCQPDVYARLALHVAMPAERPGVQSNVPNWARNCNVAPLATEERKSETVAWRARHPYPQNHLRNLARKNCHTPYVFLVDVDIVPSRGMAEALDSFLSSVPKCQMCAYVVPTYELDKRVANFPTNKSELLRLSRNKLAIPFHRKVFIYNQYASNFSRWESSGGNESLSTHISHTVTNFELLYEPFYVAPDTVPAHDERFLGYGFTRNTQPGTYNKLLNDRTMPKIARLSSSPSLSDSLLETEKKLTSHCFFAVPFFIYQERGLDRSIVQPAGIILEVIRTLNPSNHYKVTTAEGY-