New model in OGS2.0 | DPOGS201465  |
---|---|
Genomic Position | scaffold124:+ 106210-108635 |
See gene structure | |
CDS Length | 2235 |
Paired RNAseq reads   | 161 |
Single RNAseq reads   | 520 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002617 (0.0) |
Best Drosophila hit   | CG5819, isoform B (3e-65) |
Best Human hit | leucine-rich repeat-containing G-protein coupled receptor 4 precursor (5e-20) |
Best NR hit (blastp)   | AGAP006643-PA [Anopheles gambiae str. PEST] (2e-104) |
Best NR hit (blastx)   | leucine-rich transmembrane protein [Aedes aegypti] (8e-107) |
GeneOntology terms   | GO:0005515 protein binding |
InterPro families    | IPR001611 Leucine-rich repeat IPR003591 Leucine-rich repeat, typical subtype |
Orthology group | MCL16291 |
Nucleotide sequence:
ATGCTTCTATTACGAACATCGGGAGTGCTTCTGTTGTTGTTGTTGTTGACAAATGCGGAC
TCGGACTTAGATGACAACTACGATATTTGCTTTCTGTGCTCATGTAACGCAGAACAAAAC
AGCGTTGATTGTTCCCATAGAGGGCTGACTATTCTTCCTGATGGGATATCCGTAAAAGTT
TCGAAACTTAACATTTCCAACAACGACATCTCAATTTTTCCAACTAACCTCAGCAGACTA
TATAATTTAGTATCATTGGACATAAGTGGGAATCAAATAAATCAGTTGCCTGAAAATGCT
CTTCAAAATCTTACAGCATTAGAAATTTTGAATTTGTCAAACAATGGCTTTGAAACATGG
TTAAGTTTAAATCCCAACGAAGTTTTAAAACCAGCTGTAAAATTAAAGATTTTAGATTTA
TCCAGCAATAATTTGAAAACACTGGCCAATTTGGCTAATCAAGAACTATTCATTAGCCCA
ACATTGGAAACATTAATTTTGAATGACTGTCAAATAGATGCTGTTTACGGAAGGTCACCT
CTAAGTGGACTTATTAATATAAGAGTATTAAAACTTGATAACAATCCTTTGTTAAGAATA
CAAAACCTTATATCACCGACCTTAAGGAGTCTATACTTGAGTAATTGCCAACTTAGCTAC
ATAAATAATCATGAACTATCTTATTTACCTTCGCTACTGTATTTGAAAATGTCGCACAAT
TATCGTTTAGAGTTATCTGCAACGTCCAACTCGTTATTCTCCGAGTCCTTAAAGTTTTTG
GACATATCATATTGCAATGTGTTCAAACCAAATTTAGCAGGATTTCCAAACCTAAGAAAA
GCTATAATAAATCATAATATGATAAGATATCTAGGAAGCAACGAGTTTATAAATAATACA
AAACTTGAATATTTAGATCTTTCCTACAATAACATAGGGTCTATAAAAGGCGATACTTTC
CGTGGACTTAATATGCTAAGGTTTTTAGATTTATCTTGGAATGAATTGGCTCAAATACCT
GAAAATAGTCTCTTAGAAATGCCATCAGTGACCCAAGTGAAACTAGCAAGGAATTATTTA
AATCGAGTTGGTCATTTAAAATCTACATCAATAACAATCCTCGATATGAGCTCATGTGAA
ATCAGTGTGATTGGAAAAGATTCATTCGAAGGCCTACCGTCGCTCATTGAAATGGACTTA
TCCCATAATCTTATATCCCACATACCTGACAGCATATCATCGAACACGCTGAAGTATTTG
AATTTAAACTTTAACAGAATTAGCTCTCTAAGTAATTTCACATTTTTTATGATGCCTCAT
CTGACCGGCCTCAGTGTGATAGGAAATCGTTTTACTAATATTTGGAGCAGATCGTATTTT
GAATTTAATGTCAATTTGGATAGACTCGATTTGAGCGATAACATGTGGAGATGTGATTGT
CGTGATGACAACATGTATGATTTTTACGAGTTTGTTACTTTAGAACCAAACAAAAAAGAG
GAATCTTATAATCTTATATGTAACAGCCCTATCAATGTTATAGGCCAGACTTGGTTAGAA
GCATGTTATTTTGCTTGGTATCCCGATGCGAAGAAAGGAAACGCTGATAACTTAATATAT
TTTATTATTGTTATGGTAGTTGGTTTATCATTATGTCTTATTATTGTAACTGCTATAAGA
AGATCTCTCAAACATCGCTTGTTAGTTATGCAAACAGAACGAGAAAGACAAGTTGAAGAA
GCCAGGGATAGATTGAGACAATTAAGAATGCGAGCTGAACAAGAAGCGTTATGTAATACT
CCAGATCCAAGGGATTTGATACAACCACCGTCGTATGATGAAGCTCTCTCAATGCCAAAA
CTAAACGTATCCTGTCATTCGTTGAACGAAACCGGAACTGGCAAAAGTAAACGAAGAAGA
GGCAGGAGGAAAACTAAGTCAAGCGGAGATTTACTCGAAGAGACAGAAAGGAATGGAGAT
GTTCGGGTAGTTGGTGACATAGAATTGACTGAAACACCTAATATCGATGATAGACGGCGT
CGGCGGCGTAACAGAAGATACGGCAGTCACGATATAGCAGAACTCGATGATAGTCCTGGT
GCGAGAAGGAGACGAATGTCCGCTGTCGAACCACATGAAAATAGCGTCATGCTGCAAGTT
GAAGCAGAATTAGAGACACCCAATAATCGACGGGTAGGCTTAAACGACAGTCACCCACGA
GAAAGTGACTTTTAA
Protein sequence:
MLLLRTSGVLLLLLLLTNADSDLDDNYDICFLCSCNAEQNSVDCSHRGLTILPDGISVKV
SKLNISNNDISIFPTNLSRLYNLVSLDISGNQINQLPENALQNLTALEILNLSNNGFETW
LSLNPNEVLKPAVKLKILDLSSNNLKTLANLANQELFISPTLETLILNDCQIDAVYGRSP
LSGLINIRVLKLDNNPLLRIQNLISPTLRSLYLSNCQLSYINNHELSYLPSLLYLKMSHN
YRLELSATSNSLFSESLKFLDISYCNVFKPNLAGFPNLRKAIINHNMIRYLGSNEFINNT
KLEYLDLSYNNIGSIKGDTFRGLNMLRFLDLSWNELAQIPENSLLEMPSVTQVKLARNYL
NRVGHLKSTSITILDMSSCEISVIGKDSFEGLPSLIEMDLSHNLISHIPDSISSNTLKYL
NLNFNRISSLSNFTFFMMPHLTGLSVIGNRFTNIWSRSYFEFNVNLDRLDLSDNMWRCDC
RDDNMYDFYEFVTLEPNKKEESYNLICNSPINVIGQTWLEACYFAWYPDAKKGNADNLIY
FIIVMVVGLSLCLIIVTAIRRSLKHRLLVMQTERERQVEEARDRLRQLRMRAEQEALCNT
PDPRDLIQPPSYDEALSMPKLNVSCHSLNETGTGKSKRRRGRRKTKSSGDLLEETERNGD
VRVVGDIELTETPNIDDRRRRRRNRRYGSHDIAELDDSPGARRRRMSAVEPHENSVMLQV
EAELETPNNRRVGLNDSHPRESDF