Monarch geneset OGS2.0

DPOGS214688
TranscriptDPOGS214688-TA1527 bp
ProteinDPOGS214688-PA508 aa
Genomic positionDPSCF300022 - 1939543-1943788
RNAseq coverage232x (Rank: top 44%)
Annotation
HeliconiusHMEL0177230.079.55% 
BombyxBGIBMGA001064-TA0.076.02% 
Drosophilapgant6-PB1e-14151.77% 
EBI UniRef50UniRef50_D1MLM61e-15856.63%N-acetyl galactosaminyl transferase-like protein n=1 Tax=Mayetiola destructor RepID=D1MLM6_MAYDE
NCBI RefSeqNP_001161257.10.065.28%polypeptide GalNAc transferase 6-like [Tribolium castaneum]
NCBI nr blastpgi|2700061700.065.28%hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
NCBI nr blastxgi|2683701550.065.55%polypeptide GalNAc transferase 6-like [Tribolium castaneum]
Group
KEGG pathwaytca:6637980.0 
 K00710 (GALNT)maps-> O-Glycan biosynthesis
InterPro domain[49-232] IPR0011738.3e-29Glycosyl transferase, family 2
[352-488] IPR0089971.8e-25Ricin B-related lectin
[362-482] IPR0007721e-18Ricin B lectin
Orthology groupMCL10442 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214688-TA
ATGTTTTTCCAGCCCAGAGCTACTGGCAAAGAACAATTCAAAATGAAAATAAATATTTTATACCATCTAGTTATAAGATCTCTACCAGACATCCGTCATCCTGGTTGTCAAAATAGGCTGTACATTGAATCCTTGCCGACCGTTAGTGTAGTAGTCCCATTTCATAATGAGCATTGGAGTACATTGTTAAGAACGGCATACAGTGTCCTTAACAGATCACCAACTTTTCTTATAAAAGAAGTGTTTTTGGTGGATGACGCCAGCACTAAAGACTTTCTTAAGGAACAGCTTGATGATTATGTATCAAAACATATGCCTAAGGTAAAAATAATCCGACTCAAATCCAGAAGTGGTTTGATAGCTGCTCGATTAGCCGGTGCGGAGAAAGCTACAGCTGATGTCCTGGTTTTCCTTGACTCACACACAGAAGCCAATGTCAACTGGCTACCCCCACTCCTAGAGCCCATAGCGTTGAATTACAAGACAGTGGTGTGTCCATTCATTGATGTTGTTGCGTATGATACGTTTGCGTATCGGGCTCAAGATGAGGGGGCTCGTGGCGCGTTTGACTGGGAACTGTTCTACAAGCGACTGCCGGTGTTACCAGCTGATGAGGCGAATATGCCAGAGCCATTTCCGAGTCCAGTAATGGCGGGTGGTCTGTTCGCGATATCACGCGTATTTTTCTGGGAACTTGGCGGATATGATCCCGGTCTTGATATATGGGGTGGGGAGCAATATGAGCTCAGCTTTAAGTTGTGGCAGTGTGGTGGAAAAATGTTGGATGCGCCATGTTCTCGTGTTGGACATATTTACAGGAAATTCGCACCCTTCCCCAATCCCGGCCACGGAGATTTCGTTGGGAAGAATTACAGACGAGTCGCGGAAGTGTGGATGGACGAATACGCTCAATACTTGTATAAAAGGCGTCCACACTATTTGAAAATAGACACCGGCGATATATCCAAGCAGAAGGCTTTGAGGGAGAAACTTCAGTGCAAACCGTTCAAATGGTTCATGACTCAGATAGCTTTTGACCTGACGGCGAAGTATCCGCCGGTCGAACCAAAACCTTTCGCAGAGGGACGTATAAGGCCGGCTACATATCCTCATTTATGCGTGGATGCTCATCATGGCAACCAAATGGACAAGTTACATTTGAAGTCCTGTACAGCATCTACATCTGCCGAACAAAACTTTATGCTGTCATGGCATAAGGACATTAAGTCAAAGACTCGGAATATGTGCTGGGACCTGCCGGATTCTTCTCCAAGGAGTCCTATACTCTTGTACAGTTGTCACCTGGGGGGAGGAAACCAGCTCTGGAGATATCATCCCGAGTCCAGGCGTCTCAAACACGGTACGAACGACAATTGTTTAGATTTTGAAATATCAACGAGATCTGTTTTCATAAAGCAGTGTTCAGACTCAGAAACCCAGGAGTGGATCATAGATAAAGTAGATAACGCCATGTTGGCGACGTGGGATACCATCGCCAAAAGAGTTACTGGTCCCGTTGAGGAGTAA

Protein sequence:

>DPOGS214688-PA
MFFQPRATGKEQFKMKINILYHLVIRSLPDIRHPGCQNRLYIESLPTVSVVVPFHNEHWSTLLRTAYSVLNRSPTFLIKEVFLVDDASTKDFLKEQLDDYVSKHMPKVKIIRLKSRSGLIAARLAGAEKATADVLVFLDSHTEANVNWLPPLLEPIALNYKTVVCPFIDVVAYDTFAYRAQDEGARGAFDWELFYKRLPVLPADEANMPEPFPSPVMAGGLFAISRVFFWELGGYDPGLDIWGGEQYELSFKLWQCGGKMLDAPCSRVGHIYRKFAPFPNPGHGDFVGKNYRRVAEVWMDEYAQYLYKRRPHYLKIDTGDISKQKALREKLQCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGRIRPATYPHLCVDAHHGNQMDKLHLKSCTASTSAEQNFMLSWHKDIKSKTRNMCWDLPDSSPRSPILLYSCHLGGGNQLWRYHPESRRLKHGTNDNCLDFEISTRSVFIKQCSDSETQEWIIDKVDNAMLATWDTIAKRVTGPVEE-