New model in OGS2.0 | DPOGS214383  |
---|---|
Genomic Position | scaffold979:+ 41937-47161 |
See gene structure | |
CDS Length | 1362 |
Paired RNAseq reads   | 660 |
Single RNAseq reads   | 2648 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004010 (1e-172) |
Best Drosophila hit   | CG2774 (1e-103) |
Best Human hit | sorting nexin-2 (4e-88) |
Best NR hit (blastp)   | PREDICTED: similar to sorting nexin isoform 1 [Tribolium castaneum] (2e-142) |
Best NR hit (blastx)   | PREDICTED: similar to sorting nexin isoform 1 [Tribolium castaneum] (7e-131) |
GeneOntology terms    | GO:0006886 intracellular protein transport GO:0035091 phosphoinositide binding GO:0005515 protein binding GO:0007154 cell communication |
InterPro families    | IPR001683 Phox homologous domain IPR015404 Vps5 C-terminal |
Orthology group | MCL14074 |
Nucleotide sequence:
ATGTCGGCGGAGACCGATGCACCGTTCAATAATGTGGAAATAAGTAACGAGAACCGGGAA
GAGGAAGATCTATTCGCCTCGGCGGTACAGGAGGTCAGTCTAGATCCTGAAATTAATGGT
ACCCAAGATGGATTAGAAAAGTCAACAATAGATGACATTTCGGTTTCATCACCCGCTACT
ATTGGCAGTTCTATTATGGAGGAAATAGCAACAGAACGTGCTAACAATATTATAATAACA
ATAACCGAGCCTCAAAAGATTGGTGAAGGGATGAGCTCATATGTAGCCTACCGTGTCATC
ACCAAAACAAACATGCCAATCTTTAGCAAATTAGATTTTGCGGTTCTAAGGCGATTCTCC
GATTTTCTAGGACTTCATGAGAAATTGACCGAGAAATACTTGCGCTCTGGTAGAATTATA
CCTCCAGCACCAGAAAAAAGTATCATGGGAACAACAAAGTTGAAGATGTCATCGACTCCG
TCTACAGAGAGTGCTAATGGCTCACCGTCGGTTCAATCACAGTTTGTGGAACGAAGACGA
GCTGCCCTGGAGAGGTTCCTGAACAGAGTAGCCCAACATCCTGTACTGTGTATTGATCCC
GATTTCAGAGAGTTTTTGGAGTCTGACACTGAACTACCAAAGGCCACGAGTACCTCGGCG
CTTAGTGGAGCTGGTATGCTGCGACTCTTCAATAAAGTTGGAGAAACAGTCAACAAGATC
ACATACAGGATGGACGAGTCCGATCCTTGGTTCGAAGAGCGCGTGGCTCGTATAGAGTCT
CTGGAAAGCGGTCTACGGCGTCTGTGTGGGGCCTGTGAGGCGCTCGCTACTGAGAGACGT
GAACTGGCGGGGCGAGCTCATGAGGCGGCTCGGGCCATCGCCGGATATATATACATATAT
TTTTTTAATATTAAAATAAACTTTGAAATTGAAGAGAATGAACAAGCCAACACAGACTTC
TATGTTCTGACCGAACACATTAAAGATTATCTCGGATTAATTGGTGCTATCAAAGACGTG
TTCCATGAAAGAGTTAAGGTATTCCAACACTGGCAACACTCACAAATGCAGCTAACGAAG
CGGAGGGAAAACAAAGCGAAAGCGGAACTGGCCAACCGTCCGGAGAAAATCGAACAGGCC
GCTAATGAAATTATTGAGTGGGAGTCGAAAGTGGAACGCGGCCAGCAGGAGTTTGATACA
ATGTCGAGGGTCATCAAGAAGGAACTGGAACGCTTTGAAGAGATCCGCCTCGACCAGCTC
AGAGACACGCTGCTGCGGTATCTTGATGAGCATATGAAACACCAGGCACAGGCTATTCGG
TACTGGGACGCTTTCCTTCCTGAGGCCCGCGCCATCAAATGA
Protein sequence:
MSAETDAPFNNVEISNENREEEDLFASAVQEVSLDPEINGTQDGLEKSTIDDISVSSPAT
IGSSIMEEIATERANNIIITITEPQKIGEGMSSYVAYRVITKTNMPIFSKLDFAVLRRFS
DFLGLHEKLTEKYLRSGRIIPPAPEKSIMGTTKLKMSSTPSTESANGSPSVQSQFVERRR
AALERFLNRVAQHPVLCIDPDFREFLESDTELPKATSTSALSGAGMLRLFNKVGETVNKI
TYRMDESDPWFEERVARIESLESGLRRLCGACEALATERRELAGRAHEAARAIAGYIYIY
FFNIKINFEIEENEQANTDFYVLTEHIKDYLGLIGAIKDVFHERVKVFQHWQHSQMQLTK
RRENKAKAELANRPEKIEQAANEIIEWESKVERGQQEFDTMSRVIKKELERFEEIRLDQL
RDTLLRYLDEHMKHQAQAIRYWDAFLPEARAIK