DPGLEAN16143 in OGS1.0

New model in OGS2.0  DPOGS214723
Genomic Position  scaffold8526:- 5015-12945
CDS Length  1764
Paired RNAseq reads  3313
Single RNAseq reads  8407
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005068 (4e-131)
Best Drosophila hit  CD98 heavy chain (4e-38)
Best Human hit  neutral and basic amino acid transport protein rBAT (1e-17)
Best NR hit (blastp)  PREDICTED: similar to CD98hc amino acid transporter protein [Tribolium castaneum] (1e-66)
Best NR hit (blastx)  PREDICTED: similar to CD98hc amino acid transporter protein [Nasonia vitripennis] (1e-61)
Gene Ontology terms
GO:0043169 cation binding
GO:0005975 carbohydrate metabolic process
GO:0003824 catalytic activity
GO:0060356 leucine import
InterPro families
IPR017853 Glycoside hydrolase, superfamily
IPR006047 Glycosyl hydrolase, family 13, catalytic domain
IPR006589 Glycosyl hydrolase, family 13, subfamily, catalytic domain
IPR015902 Alpha amylase
IPR013781 Glycoside hydrolase, subgroup, catalytic core
Orthology group  MCL16359
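
The Genomic Position and CDS Length fields above can be compared with a few lines of Python. This is a minimal sketch, not part of the record: the function name `parse_position` is hypothetical, and it assumes that the character after the colon is the strand and that the coordinates are 1-based and inclusive. Neither assumption is stated in the record itself.

```python
import re


def parse_position(text):
    """Split a position string such as 'scaffold8526:- 5015-12945'.

    Assumption: the symbol after the colon is the strand, and the two
    numbers are 1-based inclusive start and end coordinates.
    """
    match = re.fullmatch(r"(\S+?):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


scaffold, strand, start, end = parse_position("scaffold8526:- 5015-12945")
span = end - start + 1        # genomic span in bp under the inclusive assumption
cds_length = 1764             # value of the CDS Length field
non_coding = span - cds_length

print(scaffold, strand, span, non_coding)  # scaffold8526 - 7931 6167
```

Under these assumptions the model spans 7931 bp of the scaffold, of which 1764 bp are coding; the remaining 6167 bp would correspond to introns and any untranslated sequence included in the span.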

Nucleotide sequence:

ATGGGTTTGATTATGGCTATTCGCTCCTCAGTGGACCGCCGCGCCGCGCCGCTACTGCCG
CGATCGAGTCTGCACAAGTCCAGGGACAAGGTCTCCGCAGACGAGGCCGAAGAGAGACTA
CTACAGAAGGAGGAGGAGGCGAAGATAACGACCAGGGTCGACATGGCGGACGCCAAGTTT
GTGGTCGAGGATCACAGGAACGGGGACGCCAAGATTGAGCTGGACGCGAACAAGAGGTTC
ACGGGCCTGACCAAGGAGGAGCTGATGAAGTACGCGGACGACCCGTTCTGGGTCCGCCTC
CGCTGGTTCATGTTCGTGTTGTTCTGGTCTCTGTGGCTGTGTATGCTGGCCGGGGCCATA
GCCATCATTGTGAGAGCTCCCAGGTGTGTCGCGCCGGAGCCTAAGACCAGGTACGAGACA
GGTCCTCTAGTTGATCTGGACCTCGCTGACTACACCACGGCGGAGTCTCATCTAGACACT
CTCCAGCAATACCAGGTGTCTGGGCTGTTCGCCTCCGCCTGTCAGTCTACCTACGTGGTG
CTCGAAGACAGCTCCTGTCTAGACAAGTTCAAACAGTTTGCTGATAAAGCCAAGAACTAT
GGAATCAAGGTCATAGTAGACCTGACAGCCAACTTCGTGTCCACCAGTCATCCTTGGTTC
CAGCAGAGTGAGAACCGTTCAGAGCAGTTCTCGGAGTACTTCATCTGGGTGAAGAGTGAT
GAACATGATCCCGAACTCAACACCACCATACCCAAACCACCTAATGATTGGGTGTCCACA
GTGAACACTGGTGCATGGTCTTGGAGTGAGAGAAGGAAGGAGTTCTATCTTCACCAGTAT
GGCGAGGGACTCGCTGACCTCAACTTCCACAACCCTAATGTAGTCAAACAGTTCGATGAG
GTCATCAGACTGTGGATGAAGGCCGGAGCCGGTGGCATCAGGTTGCACAACGTCCGTCAG
CTGTTAGTGAGCAGTCCTCCTCTGTCTGAGCTGCCTCACACGGGCGCCGGGAGCACGCCG
GGGGCGGACCACTCGCAGTACCCCTTCTGGAGACACTCTCGGACCTCGGACCAGCCACAG
CTGGATTCGCTGCTGGCTCACTGGTCATATATCGTGGAGCAAGCTTCCTCTGAGCCGACG
GTGTTCACGTTAGCGGAACCTTCCCGGCCGGAGCTGTTCATGCTGCAAAGGAACACGAGT
TGTCTCCGGCCCGCCAGCGGAGCACCCGTCGACCTGGCGCGGCCCGGGGCGGCCAAGCTC
CTCGCTGAGCGACTGTCACGCTGGCCCGCCATACAGTTGACTGATGATAAGCCGGACGAG
GAGACGGCCGTGTTTTCCATGCTGCTGCCGGCCGCACCTGTCATGGTCTTGGAACAACTG
GCTGGGGATGACAATGATACTACCCCCAGCGAGAGTTTGAAGCACGCGATATCACTGCGT
ACCGACGCCAGTGTGCAGCACGGAGCGTTGGTTGTGACTGACGCACCCGTTCACAACTCC
AGCGACATGATGCTGGCCGTCGCCAGATGGAAGGCGGACCACTCCGGCTACGTGTCGGTG
TATAACCCCGGCGCCTCTGGTCTCGTGTCTCTGTCTTCAGTCCGCTCTCTGCCGTCTTCC
CTCGCGGTGCATCACGTGTCGAGGAACACCAAGCTCGCCTCCAATTACACCAGTAACCAG
GCCGTGGAGACGGCGAGCGTGTTCGTCCCGGGCAAGTCGGCGGTGATCTTCTCGTACGTG
CCGAAAGATGGCGCTGAAAACTGA
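
The relationship between the nucleotide sequence above and the protein sequence in this record can be checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record does not state which table was used, and the helper name `translate` is hypothetical. Only the first and last lines of the nucleotide sequence are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()   # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":            # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "ATGGGTTTGATTATGGCTATTCGCTCCTCAGTGGACCGCCGCGCCGCGCCGCTACTGCCG"
last_line = "CCGAAAGATGGCGCTGAAAACTGA"

print(translate(first_line))  # MGLIMAIRSSVDRRAAPLLP
print(translate(last_line))   # PKDGAEN
```

Both fragments are in frame because every full line of the nucleotide sequence holds 60 bases, a multiple of 3. The two results match the start and the end of the protein sequence given in this record.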

Protein sequence:

MGLIMAIRSSVDRRAAPLLPRSSLHKSRDKVSADEAEERLLQKEEEAKITTRVDMADAKF
VVEDHRNGDAKIELDANKRFTGLTKEELMKYADDPFWVRLRWFMFVLFWSLWLCMLAGAI
AIIVRAPRCVAPEPKTRYETGPLVDLDLADYTTAESHLDTLQQYQVSGLFASACQSTYVV
LEDSSCLDKFKQFADKAKNYGIKVIVDLTANFVSTSHPWFQQSENRSEQFSEYFIWVKSD
EHDPELNTTIPKPPNDWVSTVNTGAWSWSERRKEFYLHQYGEGLADLNFHNPNVVKQFDE
VIRLWMKAGAGGIRLHNVRQLLVSSPPLSELPHTGAGSTPGADHSQYPFWRHSRTSDQPQ
LDSLLAHWSYIVEQASSEPTVFTLAEPSRPELFMLQRNTSCLRPASGAPVDLARPGAAKL
LAERLSRWPAIQLTDDKPDEETAVFSMLLPAAPVMVLEQLAGDDNDTTPSESLKHAISLR
TDASVQHGALVVTDAPVHNSSDMMLAVARWKADHSGYVSVYNPGASGLVSLSSVRSLPSS
LAVHHVSRNTKLASNYTSNQAVETASVFVPGKSAVIFSYVPKDGAEN
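
As a consistency check, the protein length can be derived from the CDS Length field. This sketch assumes that the reported CDS length includes the terminal stop codon, which agrees with the nucleotide sequence in this record ending in TGA; the function name `expected_protein_length` is hypothetical.

```python
def expected_protein_length(cds_length, includes_stop=True):
    """Number of residues encoded by a CDS of the given length in bp."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons


# The protein sequence above is wrapped as 9 full lines of 60 residues
# followed by a final line of 47 residues.
observed = 9 * 60 + 47

print(expected_protein_length(1764), observed)  # 587 587
```

The 1764 bp CDS corresponds to 588 codons, that is 587 residues plus the stop codon, which matches the length of the protein sequence as listed.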