Monarch geneset OGS2.0

DPOGS204942
Transcript: DPOGS204942-TA; 1368 bp
Protein: DPOGS204942-PA; 455 aa
Genomic position: DPSCF300160 + 116876-119908
RNAseq coverage: 2x (Rank: top 91%)
Annotation
Heliconius: HMEL003640; 7e-153; 59.62%
Bombyx: BGIBMGA005280-TA; 6e-140; 59.63%
Drosophila: CG30463-PA; 1e-131; 53.85%
EBI UniRef50: UniRef50_B3NP82; 5e-130; 53.59%; GG22251 n=31 Tax=Neoptera RepID=B3NP82_DROER
NCBI RefSeq: XP_002425051.1; 2e-139; 57.66%; UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242008519; 3e-138; 57.66%; UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|242008519; 2e-136; 57.66%; UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative [Pediculus humanus corporis]
Group
KEGG pathway: phu:Phum_PHUM170220; 4e-139
 K00710 (GALNT) maps-> O-Glycan biosynthesis
InterPro domain: [100-281]; IPR001173; 1.6e-30; Glycosyl transferase, family 2
Orthology groupMCL25122 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204942-TA
ATGGCGCCATTTCAGCTAATGGGTTGGGAATCCGTTGGTCCTAGCAGTCCCCATTTAACTTACAGAGATCGTATTACTGCTATAAAGAATAAGGAGTTTCCACTTGGTAAGTTTGGAAAACCAGTGTTTATGGCTAATTCCATCAAAGGTACTTTGAGATTGATCATAAAAAAAGGCTGGGAAGAAAATGCTTTCAATCAATTTGTTAGTGATTTGATACCTATAGACCGTCCTTTGCTGGATTTGCGGGATAAATGGTGTCTGGAAAGATATTCATCAAAATTATTACCACAAGCGTCGGTGGTAATATGTTTTTTCAATGAGGCTTGGTCTACCCTTCTCAGAACGCTTCATTCAGTTTTAAACAGATCGCCCCCGCATCTATTGAGAGAGGTCCTATTGATTGATGATTTTTCTGACATGGATCATATTAAAGTTCGTTTAGAAAATTATACTCGAAAGTTCCCTAATGTTATTTTAATACGAACCTCACAAAGAGAGGGTTTAATAAGAGCACGAATCGTTGGTGCTAAAAAGGCATCTGCCCCTGTTTTAGTTTTTCTCGACAGCCATTGCGAATGTACTGAAGGTTGGCTGGAACCTTTATTAGAACGCTTGGTTGAAAATCCAAAAATAGTTGCTAGTCCAGTTATTGATCATATCGACCCAAACACATTTGAATATATTTCTCAAAACCCAAAAGACATTTATATTGGAGGATTTAACTGGAATTTAAAATTTATATGGAGATCAATAGAGTATAAAAGAGAAAATTTTCTGTTACCAATCAAAACACCTACTATAGCAGGGGGCTTATTTGCCATAGATAAGGAGTTTTTTTACAGTATAGGATATTATGATGAGGGTTTTGATGTATGGGGAGGGGAGAACTTAGAATTGTCATTTAAGGTGTGGATGTGTGGGGGCTCTTTAGAAATCGTTCCTTGTTCTCATGTGGGACATATTTTTAGGGAAAATTTTCCTTATTACACTTCGGGAGAAACATTTAAACGGAATGCTGCGCGATTGGCTGAGGTTTGGCTTGATGATTATGCAAAAATTTTCTACGAGAGAATCGGTAATGCGGATGTTAGTTTGGGCGACGTTACCGCACAAAAAGAATTAAGAAAGAAGTTAAAATGCAAATCATTTAATTGGTATCTGCGTAATGTTTATCCTGAAAAGAAAATACCAAAAAGTAATGTAGCAAGTGGACAGATTTACAATATTGGGAAAAGAGCTGTTTGTCTTGATGCTTCTGTTACACCTCCACAAGTCACGGGTTTCATTCACATTATGCCCTGTCATGGTCAGGGGGGAAATCAGGTCACTAAATTCATTATATTTTATTATAAGCCAGAAATTTAA

Protein sequence:

>DPOGS204942-PA
MAPFQLMGWESVGPSSPHLTYRDRITAIKNKEFPLGKFGKPVFMANSIKGTLRLIIKKGWEENAFNQFVSDLIPIDRPLLDLRDKWCLERYSSKLLPQASVVICFFNEAWSTLLRTLHSVLNRSPPHLLREVLLIDDFSDMDHIKVRLENYTRKFPNVILIRTSQREGLIRARIVGAKKASAPVLVFLDSHCECTEGWLEPLLERLVENPKIVASPVIDHIDPNTFEYISQNPKDIYIGGFNWNLKFIWRSIEYKRENFLLPIKTPTIAGGLFAIDKEFFYSIGYYDEGFDVWGGENLELSFKVWMCGGSLEIVPCSHVGHIFRENFPYYTSGETFKRNAARLAEVWLDDYAKIFYERIGNADVSLGDVTAQKELRKKLKCKSFNWYLRNVYPEKKIPKSNVASGQIYNIGKRAVCLDASVTPPQVTGFIHIMPCHGQGGNQVTKFIIFYYKPEI-
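
Consistency check (illustrative):

The reported transcript and protein lengths, and the two sequences above, can be cross-checked with a few lines of Python. This is a minimal sketch and not part of the record. It assumes the standard genetic code (NCBI translation table 1), and the `translate` helper is a hypothetical name introduced here. Only the first 30 and last 15 nucleotides of DPOGS204942-TA are used, copied from the sequence above. The sketch writes the stop codon as `*`, whereas the record writes it as `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Lengths as reported in the record.
transcript_bp = 1368
protein_aa = 455

# 455 residues plus one stop codon should account for every codon.
length_consistent = transcript_bp == 3 * (protein_aa + 1)

# First 30 and last 15 nucleotides of DPOGS204942-TA, copied from the record.
head = translate("ATGGCGCCATTTCAGCTAATGGGTTGGGAA")
tail = translate("AAGCCAGAAATTTAA")

print(length_consistent, head, tail)
```

The translated ends can be compared by eye with the start (MAPFQLMGWE) and end (KPEI-) of DPOGS204942-PA. To check the whole record, pass the full nucleotide sequence to `translate` in place of the two fragments.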