New model in OGS2.0 | DPOGS206004  |
---|---|
Genomic Position | scaffold735:+ 33888-40817 |
See gene structure | |
CDS Length | 3345 |
Paired RNAseq reads   | 488 |
Single RNAseq reads   | 1767 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012662 (2e-50) |
Best Drosophila hit   | CG3520 (1e-37) |
Best Human hit | hypothetical protein LOC54914 (1e-18) |
Best NR hit (blastp)   | PREDICTED: similar to CG3520-PA [Apis mellifera] (1e-74) |
Best NR hit (blastx)   | PREDICTED: similar to CG3520-PA [Apis mellifera] (9e-58) |
GeneOntology terms    | GO:0016020 membrane GO:0005488 binding GO:0016021 integral to membrane |
InterPro families    | IPR022542 Domain of unknown function DUF3730 IPR016024 Armadillo-type fold |
Orthology group | MCL16194 |
Nucleotide sequence:
ATGGATGAAATAGAATATAAGTTAAACACCAACAACAGCGTCCTTATAGTCAACGCTATA
GATAAGTTGATATTGACAATTAAATCTAAGTTTAAACCTGGTGAACGTCAGAAATTCGTG
TTAGAAAATGAAGAGTTGAAGTTTTTGAGAGAGAAATGTTCTTCAAAAGACAACATGGTG
AGCCTGACCGCTTGCCAAGGGTTACTGGCATTGGTGGAGTTAGGAGTATTGGAGATTGCA
CACACAATGTCGACCGTAGTAACGCTTATCCCGAGCACACACAACTACTCAGCAATAATA
TCAACAATGGCCGGGCTGCTCATATTGGATCTGAAGTCTCGGCTCATTCCCGGCCAGCCA
TACAAATGCCAGTTTTCAATGCGTTCCCCTCAACACCCCTTCATCACCATACTTCAGAAG
AACAAAGATATAGAAGACAATATACGTGATGCGCCGTCCTGTGAGCGATCGTTCAGTGTA
TATCGCTGTGTGAGTGAGGGGCTGCGGTCCCGGCCGGAGCTGGGAGCCGCTCTGTTGGCC
GTCATTGCGCGCACCAGCGGTGAGCTGATGATACACGGTATGGACCCTCGACCTTGTTAT
TCTCTCGTGGAGCGTGTGCTGAGCGAGGGCGGGCGGTCGGCGGTCGCGGGGCTTGTGATC
ATGATGCTGGGCGGAAATCTTGTTCACACATCCGCCCTGTATCTACATGAGTTGTTCCAT
TTATGCTTGAACATAATCACCAAATACGAGTTCTCGACTCTGTGTCTGAACTCATTCGTA
GCTCTGTCACTGCAGTGGCTCCACTTACCGTCGTATTTGACCAACAACGCTCTCAAAGTG
TCATCGAAAATATTGGAAATCCATCAAAACATGAATCGTCCGGACTCGGGGCTGTTCATG
GCGAATATGAAAAACAACGCTGTCTTCCAAGAGATGGTCCACGTGGACCGGAAGCTGTAC
ATACATTACAAACTGCTGGACACCTGGGAACGACTGAGAGACGAGCCTGACAGGCTGAGC
AAGTGGTTCGACGGTCTGATGCGGTGCGATGACCGCCTCAAGGTCGAGCTGATGCCCTTC
ATCGCCGGCCTGGCTCTGGACGGGATGGGAGACGAGCGCCTGGTGCTGGCGGCGCTCCAG
GGCCTCATACAACTCGTGGGCTTCAAGAAGGAAGTGTCCGTCACACTACTGCCCATACTG
TTGTACAAAATAGCCAACGACCCCCGACCCAAGGTCAAGCTGGAGTGCCTGAGAGGTCTA
CCGCTAATGGCTACCACCAAGGCGAGTCCCTTCAGAGAGAACGTCCCAGCGCTGGTGTCG
ATCTTCAACAAGCTGAAGAACAGGAAAGGAGTCCCGACCTCGCAGCTCATCATGATGTAC
ACGTCGGACCGGCTCTGGGTGCGCTGCTTCCCTTACTTACAAGAGTTGTTGTCGGACTCG
TCGCTCCACCCGCACGACCTCAAGTGGGAGGTGGACATCGCCAAGGCGCTGGCCGTGCGG
CGGATCTGTGAGATCCGGCCGTCCAGTCACGGTCTGGAGTTAGTTCCCGTGGTGTCGTTG
CTCCTGAACCGCTGGGAGCGTTCCAGCGCGGGGCCGGTGTCGCTCGCGTTGGAGGCGCTC
CGGCACCTGTGGCAAGGGGCGGCGGTGGCGCCCCCCGGCACGTGGCGCGCGCTCCAGCCG
CGACTGGCCAAGGATAACAGGATACAAGTTCAGATCAGCCTGTGTAATCTGCTGGCGGAG
GCGCCCGGCCTGCGCGTGTCGTCCGCGGAGTACGGCGAGCTGCTGCAGCAGACGGCCGCT
CGTCTTTGGCTCTTCATATCCGACTCGGACCAGCCGGCCGTGATCGGGGCGGCGTGTCGC
GCGCTCGCCGGCTATAAGATAGAAGATTACACGCTCAAGGACATCCCCGAGGTTTATCGG
CGCACGGTGAAGCTGCCTCCGTCGTATTGCAAGACGCCTTCAGACGCCGCGAGGAAGCCC
GAAGACGTCCTGGACTATGTGCCATGTGAAGTCTGGCCGGAGGTGTTTAAATACACGAAC
CAATCGGCTCTAGACTGCGTACAACACCTGGTGTCCAAGTTGATAGGGCGGGAGATTCGA
GGGTACCGCAGCGGGGTCTACCACGAGCGGGAGGGGGGCAAGGAGGGGGCGGGCCTCAGT
GTCAGCAGCGTATTGAGAGGGGTCGTGGAGGGACTCAGGAAACAGATGGTGAGCCCGACG
TACGATTACTCGGACGCCGTCCTCCTGGCGATGTTGGAGACCTTGTCCTCGGAGTTCCCT
AAGCCCCTGCCGCCCTTCGACCTCACATTCCTTCACGAGGGTCTGCACCGCGGGGCTCCG
ATGCGGGCTCGCGTCGTCAAATTGGCCGCCCGTCAAGCCAGTACCGCAGTGTCAGCCAAA
AGACTGATTGAAAACTTCCTATCTGCAATCGACCCTGGGAATTGTGAGGAATCAGACATT
TTATTGTTCTTCGAATATCTTCCTATCTTGTGTCGCACGATGCCCCCGAACCATCTCCGG
CCTCCGCTCGAGAGATGTCTGAGTGACTCTTTCTCGAGGGTCAGGGTCAAAGGTCAGGAG
GAGACGTTCATAAAGCAGTTGAACTACATCAAGGAGTGCCTCGACTGCGACAAGATCCAC
GATGCCAACAGGACCCTGCTGTCACAGCTGGTCGAGAGCTACTTCACTGTTATAGATGAG
GACCACGTGGCGTGGTCCGCGTACCTGGCGGCGTGCTCGTCGCTGGTGGTGAGCTCGGTG
GAGCGCATGTCGTCTCCGAGCTCGTGGTGGGAGGTGTCGGGCGCGCTGCTGAGGAAGGCC
AGCGTGCTCCGCGCGAGGTTGGCCGCCAACAGACTCGCCTGGATCAACGAGATCGTGGAC
ACCGCCGCGGGCCACGTCACTGAGCAGGAGTTCACGTTGCGATGTTTCCTACCCGCGCTA
CAGGCTACGGACGTCGACGCGACTAACACCCGCGAGTGGTTCCTGCAATTGATGGCTCGC
ACTCAGGTCGCCTTCAAAGAGACGGAGGAGGAGTCGGCCAGGTTATACCTGTGCGACGTG
TTCTTCCTGAGCGTGGTAGTGTTCAGTGGTCTATGGACCCTGGAGGCGGACGGCGAGGCG
CTGGTCGCCGACAGGGACGCCAGGCTGGGGCTCGCTCCCGCCGCCCTCGGCCTGCTCGTG
GACAGGGACGGCTGGACGGACTACACCGCACAGTTGTTGGAGTGGTTGTGCCACACGCGC
TCCGTGACCCGCCACGCCGGCGTGTCCCGCTGCTGCAGGCGCTCCGTGCTCGCCCTGAGG
CACACGAGAGCCTTCCACGAGCACGCGGTCTTGATGAAGCTGTGA
Protein sequence:
MDEIEYKLNTNNSVLIVNAIDKLILTIKSKFKPGERQKFVLENEELKFLREKCSSKDNMV
SLTACQGLLALVELGVLEIAHTMSTVVTLIPSTHNYSAIISTMAGLLILDLKSRLIPGQP
YKCQFSMRSPQHPFITILQKNKDIEDNIRDAPSCERSFSVYRCVSEGLRSRPELGAALLA
VIARTSGELMIHGMDPRPCYSLVERVLSEGGRSAVAGLVIMMLGGNLVHTSALYLHELFH
LCLNIITKYEFSTLCLNSFVALSLQWLHLPSYLTNNALKVSSKILEIHQNMNRPDSGLFM
ANMKNNAVFQEMVHVDRKLYIHYKLLDTWERLRDEPDRLSKWFDGLMRCDDRLKVELMPF
IAGLALDGMGDERLVLAALQGLIQLVGFKKEVSVTLLPILLYKIANDPRPKVKLECLRGL
PLMATTKASPFRENVPALVSIFNKLKNRKGVPTSQLIMMYTSDRLWVRCFPYLQELLSDS
SLHPHDLKWEVDIAKALAVRRICEIRPSSHGLELVPVVSLLLNRWERSSAGPVSLALEAL
RHLWQGAAVAPPGTWRALQPRLAKDNRIQVQISLCNLLAEAPGLRVSSAEYGELLQQTAA
RLWLFISDSDQPAVIGAACRALAGYKIEDYTLKDIPEVYRRTVKLPPSYCKTPSDAARKP
EDVLDYVPCEVWPEVFKYTNQSALDCVQHLVSKLIGREIRGYRSGVYHEREGGKEGAGLS
VSSVLRGVVEGLRKQMVSPTYDYSDAVLLAMLETLSSEFPKPLPPFDLTFLHEGLHRGAP
MRARVVKLAARQASTAVSAKRLIENFLSAIDPGNCEESDILLFFEYLPILCRTMPPNHLR
PPLERCLSDSFSRVRVKGQEETFIKQLNYIKECLDCDKIHDANRTLLSQLVESYFTVIDE
DHVAWSAYLAACSSLVVSSVERMSSPSSWWEVSGALLRKASVLRARLAANRLAWINEIVD
TAAGHVTEQEFTLRCFLPALQATDVDATNTREWFLQLMARTQVAFKETEEESARLYLCDV
FFLSVVVFSGLWTLEADGEALVADRDARLGLAPAALGLLVDRDGWTDYTAQLLEWLCHTR
SVTRHAGVSRCCRRSVLALRHTRAFHEHAVLMKL