New model in OGS2.0 | DPOGS213307  |
---|---|
Genomic Position | scaffold2149:+ 3558-14107 |
See gene structure | |
CDS Length | 1344 |
Paired RNAseq reads   | 115 |
Single RNAseq reads   | 301 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005611 (7e-101) |
Best Drosophila hit   | CG42402 (2e-61) |
Best Human hit | hypothetical protein LOC59271 precursor (4e-25) |
Best NR hit (blastp)   | AGAP003572-PA [Anopheles gambiae str. PEST] (9e-82) |
Best NR hit (blastx)   | AGAP003572-PA [Anopheles gambiae str. PEST] (2e-81) |
GeneOntology terms    | GO:0016020 membrane GO:0016021 integral to membrane GO:0005529 sugar binding GO:0005886 plasma membrane GO:0005575 cellular_component GO:0003674 molecular_function GO:0008150 biological_process |
InterPro families   | IPR000922 D-galactoside/L-rhamnose binding SUEL lectin domain |
Orthology group | MCL11375 |
Nucleotide sequence:
ATGGTTACTCTTATTTGTCCTCGAGGAACAACAATTAGCATACAAGTAGCCCAGTATGGA
GCATCGACTTCACAAAGCTCTTGTTCATCAGAATTAGCGGAATATCAACCTGTAGCTGTT
GAAGTAGTAGGAGACACATCTTGCACGTGGCCGAGTGCATTGCAGACCGTCGTTGAAGCT
TGTCAGAAGAAGCGACAGTGTAAATTTCATACGAGCCCCAAGGCCTTCGGTGTTGACCCC
TGTCCAGGTTCAAGAAGATTCGTGGAGGTAGCCTACAAATGTCGACCATATGAATTCAGA
AGTAAAGTGGGCTGTGAAAACGACGTGCTTCATTTGAGCTGTAACCCACACTCAAGGGTT
GCGATATATTCGGCACAGTATGGGCGGACGGAGTACGACTCAATACAATGCCCACAGCCG
CGGGGAATGAAAGAGGAAACATGCTTAGAGCCTTATGCTACTGAAACATCGATGAGGGAA
TGCCATGGAAAACGTCGATGTGTTCTTTCAGCAGACAACAAGATGTTTGGAAGGCCATGT
CGAACGGGAAGCAGAACATACCTGAAAGTTGTTTATACTTGTGTGCCCCGAACCGTTTTA
AAAGAGAGGTATGAAAGTGCTCCTGAAGAGGATGAAGTCGCCCACGATGTATCAGATCTG
GAACACGATGATGTCGATGAGTCTAGCGATCGCTGGTGGGGAGAGTCAGTACCCCCAGCA
CCAGCAGTAGCGGCTGTCCCACCTCAACGTCCCACAGCGCATACTAACATTAGCAGAGAC
GGACCCACAACCACTCAGCCAAAACAGCATACATCAAATGATGAACAATTTGATATGATG
TACGTATACGTGATTGCTGCTGCCACAGGAATATGTCTTATGTGTCTTATAATCGGAGTA
ATACGATGTGTGAAGCTCAGAAACAACACAGATCAAGCCAAAGGACCGGATGTCTCCGCC
TCTACTGACATCCCTAACGGCTTTAATGACAGTATATCGGAGGTCGATAATGACATAAAC
ATCACAAGCCTCTCGGGCCCAGTAGACACTGTAGACTCTAGTCTGAAGCAAGACATGCAA
ATTACGAACATGGCCAACATGAGTCCAAAAATCAATCGATACGTTGGCAGGCCAGTGCCT
AATACATATCCCCACGTGAGTACTAATATGTATGGACAGGTTGCGGAATATCCAGTTGAA
ATGCCTCTTCGAACAATGCCTCATGGAACTTTAGGACGTAGTATGGCGGTAAAAACTTTG
CCTAGAATCCAATTGCAAACAGAAACGGACCCTAACACTAGGAGTTTATATCGTTATTCA
AATGCACAATACTATTTTGGGTGA
Protein sequence:
MVTLICPRGTTISIQVAQYGASTSQSSCSSELAEYQPVAVEVVGDTSCTWPSALQTVVEA
CQKKRQCKFHTSPKAFGVDPCPGSRRFVEVAYKCRPYEFRSKVGCENDVLHLSCNPHSRV
AIYSAQYGRTEYDSIQCPQPRGMKEETCLEPYATETSMRECHGKRRCVLSADNKMFGRPC
RTGSRTYLKVVYTCVPRTVLKERYESAPEEDEVAHDVSDLEHDDVDESSDRWWGESVPPA
PAVAAVPPQRPTAHTNISRDGPTTTQPKQHTSNDEQFDMMYVYVIAAATGICLMCLIIGV
IRCVKLRNNTDQAKGPDVSASTDIPNGFNDSISEVDNDINITSLSGPVDTVDSSLKQDMQ
ITNMANMSPKINRYVGRPVPNTYPHVSTNMYGQVAEYPVEMPLRTMPHGTLGRSMAVKTL
PRIQLQTETDPNTRSLYRYSNAQYYFG