Monarch geneset OGS2.0

DPOGS204937
TranscriptDPOGS204937-TA2094 bp
ProteinDPOGS204937-PA697 aa
Genomic positionDPSCF300160 - 222691-228710
RNAseq coverage494x (Rank: top 25%)
Annotation
HeliconiusHMEL0036410.087.85% 
BombyxBGIBMGA005279-TA0.067.21% 
DrosophilaCG30463-PA0.062.97% 
EBI UniRef50UniRef50_B3NP820.058.95%GG22251 n=31 Tax=Neoptera RepID=B3NP82_DROER
NCBI RefSeqXP_002425051.10.060.83%UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420085190.060.83%UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative [Pediculus humanus corporis]
NCBI nr blastxgi|3287130870.060.29%PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like isoform 1 [Acyrthosiphon pisum]
Group
KEGG pathwayphu:Phum_PHUM1702200.0 
 K00710 (GALNT)maps-> O-Glycan biosynthesis
InterPro domain[556-683] IPR0089973.7e-32Ricin B-related lectin
[183-332] IPR0011732.5e-28Glycosyl transferase, family 2
[566-680] IPR0007727e-28Ricin B lectin
Orthology groupMCL16449 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204937-TA
ATGAAATACTACAGCATGTTCTTTCTTAGAAGAAGAACGTGGATATTAAAAGCTGTTCTAATTTTGACTGCTCTCTGGTTTATGATTGCTTTATTGACTACTAATGGTGGTACGCGAAATGCTATCGGAGCACCCAATATTGAATATGAAGACGCAGAAGCTAAACCTCTTAAAATGGCAAAGCCCACGTCAACGAAAAAGAAAAGTGATCCATATAGCAACAACTTGTCTGACGATATTATGCATGATAATATTCCTTTATCCACCACAGCTCGAGTCAATTGGATGGGTCGGATCGCGGGGAACGAAGGGTTAGGAGTAATTGCTGCTCCTGGTTCGGACAACGCACCGGGCGAACTGGGAAAACCGGTCGTTCTGCCGTCTAATATGAGTGAGGATGCAAAACTAGCGGTGTCCGAGGGATGGAAAAAAAACGCCTTCAATCAGTACGCCAGCGACTTAATATCGATTAGACGGACACTACCCGACCCTAGGGACGAGTGGTGTAAGCAACCTGGTCGCTACTTAGAAGATCTTCCGCAAACATCTGTTGTTATATGTTTTCACAATGAGGCGTGGTCAGTGCTTCTCAGAACTGTTCATTCAGTTATAGACAGATCTCCTGCACATTTAATAAAAGAGATTATTCTCGTCGACGACTTTTCTGATATGCCACACTTGATGCAACAACTCGATGATTATATGTCCTCGTTGCCGAAGGTTAGGATAGTGAGAGCAACTCAACGCGAGGGTCTCATTAGAGCGAGGCTGCTAGGTGCTAAGTACGTCACAGCGCCAGTACTAACATATCTCGATAGCCATTGCGAATGTACTGAGGGTTGGCTAGAACCACTTTTAGATAGAATTGCTCGCAACAAAACAAATGTAGTGTGTCCTGTTATCGACGTAATTGACGACAATACACTCGAGTATCACTACCGAGATTCAACTTCAGTAAACGTTGGTGGTTTTGACTGGAATCTACAATTCAATTGGCACCCGGTACCCGCAAGGGAGCGAGCTCGACACAAACATACCGCTGAACCAGTATGGTCTCCAACTATGGCTGGTGGTCTCTTTGCTATAGATAAAGAGTTTTTTGAAAGACTTGGAACTTACGACAGTGGATTTGACATATGGGGTGGAGAGAATTTGGAACTGTCATTCAAAACGTGGATGTGCGGTGGCACTCTCGAAATAGTTCCTTGCTCACATGTCGGTCATATATTTAGAAAACGATCGCCATACAAGTGGAGGACCGGAGTTAATGTTCTCAAGAAAAATTCCGTCCGATTAGCTGAAGTGTGGTTGGACGATTATTCAAAATATTATTATCAGCGGGTTGGCAACGACAAGGGTGACTACGGTGATATTAGTGGTAGGAAGGAATTAAGAGAAAAACTTAAGTGTAAATCATTCGATTGGTACTTAAAGAACATTTACCCAGAACTGTTCATACCGGGAGAATCCGTAGCCCACGGAGAGATTCGAAATATCGGCTTCGAAAGGACATGTCTGGACTCTCCGACGCGGAAGTCCGATCATCATAAGCCAGTAGGACTATACCCGTGTCATCGGCAAGGAGGAAATCAGATTCGCAACGTGCTATATGGACGTTGTATTGTTGGTGATTCTGAAGAGGCGTCGAATGAAGTTGTTATAATGATACACAGGTGTCATGGCGCTGGGGGCAGTCAGATCGCGAATCCCTCTTCGGATATGTGCGTGGACTCGGCTGCTGGACCAGAAGACATGAAGAAGCCAGTCAACCCTTGGCCTTGTCATGGAGAATATGGCAATCAGTACTGGATGTATTCGAAGAATGGCGAGATCCGTCGCGATGAGACTTGCCTCGACTATTCGGGTCACGATGTTGTTTTGTACCCCTGTCATGGGGCCAAGGGTAATCAATTATGGCTGTATGACCCCACTACGAAGCTAATAAAACATGGCTCAAGTGAAAAATGTATGGCGATATCGCGGAAGAAGGACAAGATTGTAATGGAAACGTGCAACGAAAGGGAGAATAGGCAACAGTGGAATATGGAAAACTTTAATGCTGACAGACTCAGTCCCGAACTGACGGCTGAGTAG

Protein sequence:

>DPOGS204937-PA
MKYYSMFFLRRRTWILKAVLILTALWFMIALLTTNGGTRNAIGAPNIEYEDAEAKPLKMAKPTSTKKKSDPYSNNLSDDIMHDNIPLSTTARVNWMGRIAGNEGLGVIAAPGSDNAPGELGKPVVLPSNMSEDAKLAVSEGWKKNAFNQYASDLISIRRTLPDPRDEWCKQPGRYLEDLPQTSVVICFHNEAWSVLLRTVHSVIDRSPAHLIKEIILVDDFSDMPHLMQQLDDYMSSLPKVRIVRATQREGLIRARLLGAKYVTAPVLTYLDSHCECTEGWLEPLLDRIARNKTNVVCPVIDVIDDNTLEYHYRDSTSVNVGGFDWNLQFNWHPVPARERARHKHTAEPVWSPTMAGGLFAIDKEFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLKKNSVRLAEVWLDDYSKYYYQRVGNDKGDYGDISGRKELREKLKCKSFDWYLKNIYPELFIPGESVAHGEIRNIGFERTCLDSPTRKSDHHKPVGLYPCHRQGGNQIRNVLYGRCIVGDSEEASNEVVIMIHRCHGAGGSQIANPSSDMCVDSAAGPEDMKKPVNPWPCHGEYGNQYWMYSKNGEIRRDETCLDYSGHDVVLYPCHGAKGNQLWLYDPTTKLIKHGSSEKCMAISRKKDKIVMETCNERENRQQWNMENFNADRLSPELTAE-