DPGLEAN07902 in OGS1.0

New model in OGS2.0  DPOGS214688
Genomic Position  scaffold15:- 431359-439522
CDS Length  1608
Paired RNAseq reads  523
Single RNAseq reads  1290
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001064 (0.0)
Best Drosophila hit  polypeptide GalNAc transferase 6, isoform B (4e-140)
Best Human hit  polypeptide N-acetylgalactosaminyltransferase-like 6 (3e-139)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC008338 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  polypeptide GalNAc transferase 6-like [Tribolium castaneum] (0.0)
GeneOntology terms:

GO:0003674 molecular_function
GO:0005575 cellular_component
GO:0008150 biological_process
InterPro families:

IPR008997 Ricin B-related lectin
IPR000772 Ricin B lectin
IPR001173 Glycosyl transferase, family 2
Orthology group  MCL10575

Nucleotide sequence:

ATGGGAAAGAAATTTTCTAAAAAGGCATTAAGAGAAAGAGGTATTGGTGAACATGGTTTG
CCAGCACATTTGCCAATCAAAGATTCAGAAATAGAGAAGGATTTGTATGCAGTCAATGGT
TTTAATGGGGCCTTAAGTGACAAAATACCATTAAACAGATCTCTACCAGACATCCGTCAT
CCTGGTTGTCAAAATAGGCTGTACATTGAATCCTTGCCGACCGTTAGTGTAGTAGTCCCA
TTTCATAATGAGCATTGGAGTACATTGTTAAGAACGGCATACAGTGTCCTTAACAGATCA
CCAACTTTTCTTATAAAAGAAGTGTTTTTGGTGGATGACGCCAGCACTAAAGACTTTCTT
AAGGAACAGCTTGATGATTATGTATCAAAACATATGCCTAAGGTAAAAATAATCCGACTC
AAATCCAGAAGTGGTTTGATAGCTGCTCGATTAGCCGGTGCGGAGAAAGCTACAGCTGAT
GTCCTGGTTTTCCTTGACTCACACACAGAAGCCAATGTCAACTGGCTACCCCCACTCCTA
GAGCCCATAGCGTTGAATTACAAGACAGTGGTGTGTCCATTCATTGATGTTGTTGCGTAT
GATACGTTTGCGTATCGGGCTCAAGATGAGGGGGCTCGTGGCGCGTTTGACTGGGAACTG
TTCTACAAGCGACTGCCGGTGTTACCAGCTGATGAGGCGAATATGCCAGAGCCATTTCCG
AGTCCAGTAATGGCGGGTGGTCTGTTCGCGATATCACGCGTATTTTTCTGGGAACTTGGC
GGATATGATCCCGGTCTTGATATATGGGGTGGGGAGCAATATGAGCTCAGCTTTAAGTTG
TGGCAGTGTGGTGGAAAAATGTTGGATGCGCCATGTTCTCGTGTTGGACATATTTACAGG
AAATTCGCACCCTTCCCCAATCCCGGCCACGGAGATTTCGTTGGGAAGAATTACAGACGA
GTCGCGGAAGTGTGGATGGACGAATACGCTCAATACTTGTATAAAAGGCGTCCACACTAT
TTGAAAATAGACACCGGCGATATATCCAAGCAGAAGGCTTTGAGGGAGAAACTTCAGTGC
AAACCGTTCAAATGGTTCATGACTCAGATAGCTTTTGACCTGACGGCGAAGTATCCGCCG
GTCGAACCAAAACCTTTCGCAGAGGGACGTATAAGGCCGGCTACATATCCTCATTTATGC
GTGGATGCTCATCATGGCAACCAAATGGACAAGTTACATTTGAAGTCCTGTACAGCATCT
ACATCTGCCGAACAAAACTTTATGCTGTCATGGCATAAGGACATTAAGTCAAAGACTCGG
AATATGTGCTGGGACCTGCCGGATTCTTCTCCAAGGAGTCCTATACTCTTGTACAGTTGT
CACCTGGGGGGAGGAAACCAGCTCTGGAGATATCATCCCGAGTCCAGGCGTCTCAAACAC
GGTACGAACGACAATTGTTTAGATTTTGAAATATCAACGAGATCTGTTTTCATAAAGCAG
TGTTCAGACTCAGAAACCCAGGAGTGGATCATAGATAAAGTAGATAACGCCATGTTGGCG
ACGTGGGATACCATCGCCAAAAGAGTTACTGGTCCCGTTGAGGAGTAA

Protein sequence:

MGKKFSKKALRERGIGEHGLPAHLPIKDSEIEKDLYAVNGFNGALSDKIPLNRSLPDIRH
PGCQNRLYIESLPTVSVVVPFHNEHWSTLLRTAYSVLNRSPTFLIKEVFLVDDASTKDFL
KEQLDDYVSKHMPKVKIIRLKSRSGLIAARLAGAEKATADVLVFLDSHTEANVNWLPPLL
EPIALNYKTVVCPFIDVVAYDTFAYRAQDEGARGAFDWELFYKRLPVLPADEANMPEPFP
SPVMAGGLFAISRVFFWELGGYDPGLDIWGGEQYELSFKLWQCGGKMLDAPCSRVGHIYR
KFAPFPNPGHGDFVGKNYRRVAEVWMDEYAQYLYKRRPHYLKIDTGDISKQKALREKLQC
KPFKWFMTQIAFDLTAKYPPVEPKPFAEGRIRPATYPHLCVDAHHGNQMDKLHLKSCTAS
TSAEQNFMLSWHKDIKSKTRNMCWDLPDSSPRSPILLYSCHLGGGNQLWRYHPESRRLKH
GTNDNCLDFEISTRSVFIKQCSDSETQEWIIDKVDNAMLATWDTIAKRVTGPVEE
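
Consistency check (illustrative sketch, not part of the original record):

The listed CDS length of 1608 nt corresponds to 536 codons, that is, the 535 residues of the protein sequence plus one stop codon. The short Python sketch below shows one way to confirm that the nucleotide and protein listings agree. It assumes the standard genetic code (NCBI translation table 1); the helper name `translate` and the variable names are hypothetical and chosen only for this example. Only the first and last lines of the nucleotide listing are translated here; pasting the full CDS into the same function should reproduce the whole protein.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing in this record.
first_line = "ATGGGAAAGAAATTTTCTAAAAAGGCATTAAGAGAAAGAGGTATTGGTGAACATGGTTTG"
last_line = "ACGTGGGATACCATCGCCAAAAGAGTTACTGGTCCCGTTGAGGAGTAA"

print(translate(first_line))  # compare with the start of the protein listing
print(translate(last_line))   # compare with the end of the protein listing

# Length bookkeeping: 535 residues plus one stop codon, 3 nt per codon.
print(3 * (535 + 1))
```

Both lines are in frame because every preceding line of the nucleotide listing holds 60 nt, a multiple of three.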