Monarch geneset OGS2.0

DPOGS202272
Transcript | DPOGS202272-TA | 1116 bp
Protein | DPOGS202272-PA | 371 aa
Genomic position | DPSCF300032 - 446265-449529
RNAseq coverage | 114x (Rank: top 59%)
Annotation
Heliconius | HMEL007640 | 2e-154 | 69.09%
Bombyx | BGIBMGA004930-TA | 7e-134 | 68.14%
Drosophila | CG9996-PB | 3e-84 | 46.54%
EBI UniRef50 | UniRef50_E2A383 | 6e-87 | 47.78% | Glycosyltransferase 8 domain-containing protein 3 n=8 Tax=Neoptera RepID=E2A383_CAMFO
NCBI RefSeq | XP_969798.1 | 3e-98 | 45.53% | PREDICTED: similar to GLT8D3 protein [Tribolium castaneum]
NCBI nr blastp | gi|91090075 | 5e-97 | 45.53% | PREDICTED: similar to GLT8D3 protein [Tribolium castaneum]
NCBI nr blastx | gi|91090075 | 5e-96 | 45.53% | PREDICTED: similar to GLT8D3 protein [Tribolium castaneum]
Group
Gene Ontology | GO:0016757 | 4.9e-15 | transferase activity, transferring glycosyl groups
KEGG pathway |
InterPro domain | [71-278] | IPR002495 | 4.9e-15 | Glycosyl transferase, family 8
Orthology group | MCL10908 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202272-TA
ATGAGACGACTTTCCCTATTTAAACTTTGTACTTTGTTGTTTTTATTGTTTACGTTGTTTTATTTTTATTCCTCAAGTAATATTACAAATGGTAGCAAATTAAACTCTACACAGCATAAGAATGTAAATAACGATCTAAAGTCTAGTGTTGTCAAGAATAAGACAAATATTGATAGAATAGTGATATCTTTTGTCGTTTGTGACTCAAGATTCAATGAGTCCTTAAATGTTATTAAGTCAGTTTTAATTTTCACGAAAACACCAACACATTTCGTTATATTCTCTGATGACAAATTGGAACACAAATTTAATGAGACACTGACCAAATGGAAGGCTATCTTAAGGGATCAATTTGACTTTGAACTACAGAAAATAAAATTTCCCAAAGCGCACGAGGAGGATTGGATGAATCTATTCAGCAAATGTGCTGCCCAGAGATTATTTATACCTAACTTAATAACTCATATAGACTCCATGATGTATGTGGATACAGACACACTTTTCCTGGGTCCAGCCGATCGGCTTTGGGAGGTGTTCTACAAAATGAACAGAACCCAAATATCAGCCATGGCACTTGAAGACGACAACCCCAATATATCCTGGTATCCTCGGTTCGCTAAGCATCCGTTTTATGGAAAATATGGACTCAATTCAGGCGTTATGCTCATGAACCTGACGAGGATGAGGGATTTTGGTTGGGTAGATTATGTCACCCCCATCATGCTGAAATGGAAATTGTATATACCTTGGGGTGATCAGGATATCATCAACATAATTTTCCACTACCACGAGAATGCTGTATACGTGATGACTTGTAATTACAACTACAGGTCGGACCAGTGTGTGTATGGAGATGCCTGTGAGCCGGCGTCCCATGGCGTGCTTGTGGTGCACGGGAGTCGTGGCGTATTCCATAATAACAAACAGCCAGCCTTTCAGGCTGTATACAGAGCTATCAATGAATATGAGATTGGTTCGGATCGAATGACAGTATTGACGAATTTGGATAAATATCTGAATGAGGCACCGCCTTCAAACTGCGGCAACATGAAGGATTCGTTACTGAAGTTGCCCATAGACACCCTCACTAATAAATATGTCAAACCGGACAGATAA

Protein sequence:

>DPOGS202272-PA
MRRLSLFKLCTLLFLLFTLFYFYSSSNITNGSKLNSTQHKNVNNDLKSSVVKNKTNIDRIVISFVVCDSRFNESLNVIKSVLIFTKTPTHFVIFSDDKLEHKFNETLTKWKAILRDQFDFELQKIKFPKAHEEDWMNLFSKCAAQRLFIPNLITHIDSMMYVDTDTLFLGPADRLWEVFYKMNRTQISAMALEDDNPNISWYPRFAKHPFYGKYGLNSGVMLMNLTRMRDFGWVDYVTPIMLKWKLYIPWGDQDIINIIFHYHENAVYVMTCNYNYRSDQCVYGDACEPASHGVLVVHGSRGVFHNNKQPAFQAVYRAINEYEIGSDRMTVLTNLDKYLNEAPPSNCGNMKDSLLKLPIDTLTNKYVKPDR-