New model in OGS2.0 | DPOGS214723  |
---|---|
Genomic Position | scaffold8526:- 5015-12945 |
See gene structure | |
CDS Length | 1764 |
Paired RNAseq reads   | 3313 |
Single RNAseq reads   | 8407 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005068 (4e-131) |
Best Drosophila hit   | CD98 heavy chain (4e-38) |
Best Human hit | neutral and basic amino acid transport protein rBAT (1e-17) |
Best NR hit (blastp)   | PREDICTED: similar to CD98hc amino acid transporter protein [Tribolium castaneum] (1e-66) |
Best NR hit (blastx)   | PREDICTED: similar to CD98hc amino acid transporter protein [Nasonia vitripennis] (1e-61) |
GeneOntology terms    | GO:0043169 cation binding GO:0005975 carbohydrate metabolic process GO:0003824 catalytic activity GO:0060356 leucine import |
InterPro families    | IPR017853 Glycoside hydrolase, superfamily IPR006047 Glycosyl hydrolase, family 13, catalytic domain IPR006589 Glycosyl hydrolase, family 13, subfamily, catalytic domain IPR015902 Alpha amylase IPR013781 Glycoside hydrolase, subgroup, catalytic core |
Orthology group | MCL16359 |
Nucleotide sequence:
ATGGGTTTGATTATGGCTATTCGCTCCTCAGTGGACCGCCGCGCCGCGCCGCTACTGCCG
CGATCGAGTCTGCACAAGTCCAGGGACAAGGTCTCCGCAGACGAGGCCGAAGAGAGACTA
CTACAGAAGGAGGAGGAGGCGAAGATAACGACCAGGGTCGACATGGCGGACGCCAAGTTT
GTGGTCGAGGATCACAGGAACGGGGACGCCAAGATTGAGCTGGACGCGAACAAGAGGTTC
ACGGGCCTGACCAAGGAGGAGCTGATGAAGTACGCGGACGACCCGTTCTGGGTCCGCCTC
CGCTGGTTCATGTTCGTGTTGTTCTGGTCTCTGTGGCTGTGTATGCTGGCCGGGGCCATA
GCCATCATTGTGAGAGCTCCCAGGTGTGTCGCGCCGGAGCCTAAGACCAGGTACGAGACA
GGTCCTCTAGTTGATCTGGACCTCGCTGACTACACCACGGCGGAGTCTCATCTAGACACT
CTCCAGCAATACCAGGTGTCTGGGCTGTTCGCCTCCGCCTGTCAGTCTACCTACGTGGTG
CTCGAAGACAGCTCCTGTCTAGACAAGTTCAAACAGTTTGCTGATAAAGCCAAGAACTAT
GGAATCAAGGTCATAGTAGACCTGACAGCCAACTTCGTGTCCACCAGTCATCCTTGGTTC
CAGCAGAGTGAGAACCGTTCAGAGCAGTTCTCGGAGTACTTCATCTGGGTGAAGAGTGAT
GAACATGATCCCGAACTCAACACCACCATACCCAAACCACCTAATGATTGGGTGTCCACA
GTGAACACTGGTGCATGGTCTTGGAGTGAGAGAAGGAAGGAGTTCTATCTTCACCAGTAT
GGCGAGGGACTCGCTGACCTCAACTTCCACAACCCTAATGTAGTCAAACAGTTCGATGAG
GTCATCAGACTGTGGATGAAGGCCGGAGCCGGTGGCATCAGGTTGCACAACGTCCGTCAG
CTGTTAGTGAGCAGTCCTCCTCTGTCTGAGCTGCCTCACACGGGCGCCGGGAGCACGCCG
GGGGCGGACCACTCGCAGTACCCCTTCTGGAGACACTCTCGGACCTCGGACCAGCCACAG
CTGGATTCGCTGCTGGCTCACTGGTCATATATCGTGGAGCAAGCTTCCTCTGAGCCGACG
GTGTTCACGTTAGCGGAACCTTCCCGGCCGGAGCTGTTCATGCTGCAAAGGAACACGAGT
TGTCTCCGGCCCGCCAGCGGAGCACCCGTCGACCTGGCGCGGCCCGGGGCGGCCAAGCTC
CTCGCTGAGCGACTGTCACGCTGGCCCGCCATACAGTTGACTGATGATAAGCCGGACGAG
GAGACGGCCGTGTTTTCCATGCTGCTGCCGGCCGCACCTGTCATGGTCTTGGAACAACTG
GCTGGGGATGACAATGATACTACCCCCAGCGAGAGTTTGAAGCACGCGATATCACTGCGT
ACCGACGCCAGTGTGCAGCACGGAGCGTTGGTTGTGACTGACGCACCCGTTCACAACTCC
AGCGACATGATGCTGGCCGTCGCCAGATGGAAGGCGGACCACTCCGGCTACGTGTCGGTG
TATAACCCCGGCGCCTCTGGTCTCGTGTCTCTGTCTTCAGTCCGCTCTCTGCCGTCTTCC
CTCGCGGTGCATCACGTGTCGAGGAACACCAAGCTCGCCTCCAATTACACCAGTAACCAG
GCCGTGGAGACGGCGAGCGTGTTCGTCCCGGGCAAGTCGGCGGTGATCTTCTCGTACGTG
CCGAAAGATGGCGCTGAAAACTGA
Protein sequence:
MGLIMAIRSSVDRRAAPLLPRSSLHKSRDKVSADEAEERLLQKEEEAKITTRVDMADAKF
VVEDHRNGDAKIELDANKRFTGLTKEELMKYADDPFWVRLRWFMFVLFWSLWLCMLAGAI
AIIVRAPRCVAPEPKTRYETGPLVDLDLADYTTAESHLDTLQQYQVSGLFASACQSTYVV
LEDSSCLDKFKQFADKAKNYGIKVIVDLTANFVSTSHPWFQQSENRSEQFSEYFIWVKSD
EHDPELNTTIPKPPNDWVSTVNTGAWSWSERRKEFYLHQYGEGLADLNFHNPNVVKQFDE
VIRLWMKAGAGGIRLHNVRQLLVSSPPLSELPHTGAGSTPGADHSQYPFWRHSRTSDQPQ
LDSLLAHWSYIVEQASSEPTVFTLAEPSRPELFMLQRNTSCLRPASGAPVDLARPGAAKL
LAERLSRWPAIQLTDDKPDEETAVFSMLLPAAPVMVLEQLAGDDNDTTPSESLKHAISLR
TDASVQHGALVVTDAPVHNSSDMMLAVARWKADHSGYVSVYNPGASGLVSLSSVRSLPSS
LAVHHVSRNTKLASNYTSNQAVETASVFVPGKSAVIFSYVPKDGAEN