New model in OGS2.0 | DPOGS212634  |
---|---|
Genomic Position | scaffold3890:- 9564-17171 |
See gene structure | |
CDS Length | 3726 |
Paired RNAseq reads   | 1975 |
Single RNAseq reads   | 4676 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005199 (6e-06) |
Best Drosophila hit   | CG3921 (0.0) |
Best Human hit | ND |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC007861 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC007861 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005044 scavenger receptor activity GO:0016020 membrane GO:0005488 binding |
InterPro families    | IPR001190 Speract/scavenger receptor IPR016187 C-type lectin fold IPR017448 Speract/scavenger receptor-related IPR011050 Pectin lyase fold/virulence factor IPR006626 Parallel beta-helix repeat IPR016186 C-type lectin-like IPR008646 Herpesvirus UL45-like |
Orthology group | MCL13112 |
Nucleotide sequence:
ATGCCTGAAGTTACACTTACAATAAGTCCAGGCGTGGTGTTAGAATTCGCCCCTAATGTT
GGAATTTTAGTATTGGGAACACTTCGAGCCATCGGCCACAGTCAATTACCGATCCTTATG
AAGCCGATGTCGTCCTCGTCTAAAATAGAAACGAATAGAATCGAAAGAGACGTGAGCCAA
TATTATACAAGCCACCATAAAAATAAGAGACAATTGGAAATGTTAACTACCACAGAATCA
ATTAGATTGTGCACCGGACGTAATTGTTCCATGAGTGATGACAATCATAAAACAAACAAC
GAGGGATTCTTAGAATATTATAATAGGACAACATTACAGTGGGTTCCGATGTGTGATAAC
AGATTCACGGAGAGAAACGCCCAAGTGGTTTGCCGAGAACTGGGCTTCGATCCAATAAAC
ATATTCTTCGCTCACGACTATCGAGTAGAGTATCACAGTAATTCCCTTTCACGAATATGG
TCATGGCCGGAACCGTTACAATGTGTGGGAACGGAAGATCGCTACGAAGATTGTCCAATT
CGTCTCAACGGCCAATTGTATGGCCACAGACATGAATGCAAATGGAACTCGCAATTCGTT
TTCATCCATTGCGGCTCCAGAAATTTAAATGAAAACCTAGAATATTGGGGTGGCATAAGA
TTTACCAATCCAGAATTCGAATATTCCCTGTATGAACATAGGATACATGATCATCACACT
CACGAGACATTGAAGAAGGTCGAAAGCGTTTTGCAACACGTTCAAATCATTGGAGCGGGA
ATATTACATAATGAAAAATCACCGGCACTACAAAGCATTGTTAAAAATCCAGTCATTGAA
AACGTTAATATAAGCAAATGTGCACATCATGGGATTAATGTAATATCGCCAACTGATACA
ATCAATATGATGTTTAATTCTATCAGCGACACCTTAGGAGAAGGCATCAGCGCTATATCA
TTGAGTGGTGAAGGTCGAGACTCGGAGGAATCTAGTTTCACGCCTTTGAAAGATTTAAAT
CTACCCTACCACCTGTTTTCTCTTATAGATATATGTGACAGTACGAAGGTGGTCACCGTC
GAAGAGAGGGTGCTCTTGTACTATAAATATGATAATAATCCAGTAAATTGTGTGAAGATT
TTCAAAAGTATGTATCGAGTAAAACCGTTTGGATTCAGACTGCTCCAATTTAACTTATTT
AACCACACCCTGAACTACGGGAAACGAGATTCCGTGACACTTTACGATGGAGACATATAC
AACATAACGGCTCCCCAAATAGCATATCTTGAAAATGGATCACCAGATGAAAAAAAACTG
TTCAAGACGGAAGGTTCCAGCTTGAGTGTTAAGTTGTTTGCTAACGGCGCTTCGTCTGTC
CATGGATTTATTGCAGAAATCGTAACTCTGCCTATTTCGGCTATTGGATTAAGTCGCGAC
GTCCAACATAACATATCGAACAGTGAAATCATTAACAATCGTGATGGGGCTATTACTTAC
CAATCTGTGGGAGAAGTCAATCCTCTGGTGGCTATAACCAGGAATGACATTACCCAAAAT
TGTCTCAAGCTTTACGGAAACTTCACAACGTGTCAGGCCGCCGTGACGGTTGATGTTCAG
AATACGCAGACTTTAGTATTTAGGAATAATCTAATTCGGAACAATGTCGGTGGCCTCCTG
GTGAGAGCTGATTCCCGGGGTTCAGCGACCTCCCTGCGCGGGTGGATACATAATAATTTA
TTTTTGGCTAACCACGACAGACCAGCGCTACAGGTTGAAGGTCGCCAGTCATCCCCGTAC
CAGGAGGTGACGATCTACCGCAACTACTTCACCCGCAACCAGGCCTTGTACCGAGACGTC
ATCGTTCTTCATCAGGTCGTCTCCAACTTCAGCCACAACTACGTCCACCACAACACTGGC
CTCAGGATTCTCGACGTGTCCGGCTTCCACAAAGTCCGCCTGCCCATCTACCAGACCACG
TCACACAACGGATTCTACAAGAACTACGCGCTGGACCGCGAGGGCCGCGCCACCGTGGCG
GCGGGGACCGCTGGCCAGCACTACGTCGACAACGTGTTCTTCAACCCCGACAACGACTAC
GAGATGATCACCGTCAACAGATCCATTTCTCTGGATGTATGGCGTACTCGAGTGGACGCT
AAGCACAACTATTGGAGTTACAACGAGAGTCTGGCGGTGGCGGGGCGGGTGAGGGACCGC
GCTGACGACCCCAGGCTGCTGGAGGTGGACTACAGACCTTACTACATGAACAACGCGTCC
GTCCTGGGCGGAGGGAAGTGTCCTCCGGGTTGGGTCTCGTTGTCCGGGACCTGTTACATG
TACATCGGCGCGCCCATGACATACCACGAGGCGCGGGCCTTCTGTCTGTCCGACAACGCG
TCCATGCCGTACGTGAACGGTAACTACGACGACCTGTACGAGTTCATCCACAGTCAGAAC
CAGTGGGTGCGGTACGGCGACCGAGTGTGGGTCAACCACATCGACTACGTCACGCAGTGC
ACCTCCTTCGCCTACTCGGACATTGAGATCACTGACTGTGGCCAGAAGAATGCGTTCATA
TGCGAAATTGACCCTAAAATCTCAATCGATCCTCTGTCATGGCGCGGGGACGCCCTGGTA
GTGTCTTTCGTGTGTGTATTGGGAGCGGCGGTACTGCTCGTGGCACTCGCCCTCGTAGCT
TGGTACTACAAATCCAAACATCGACACGTTCAGCGCCTCGAGAGACGGAACTCGATCCGA
CAGTCCCTGCACTCAGTGCGGTCTATCGGCAGCATTAACAGAGGCTTCCCTGACACGACC
TACAGGAGGAAAATGGTCCAAATGAGCGCTCGCTCCACGGACACACTGACGAAAGGCTCG
GATTACAAGAAGATGCTCGCCTCGACCACGTCCATGGAGTCCATGGAGAAAAGTCAGTTT
AACTCGTCACTGGAGGACACGCAGAGCTTCGACATATACGAAGCGCACAACCCCAACAAC
GTCATACAGCTCAAACACAGCACCTTCAACAGGAAGCCCGCTTCACCCGAGTACAGTGTG
CCGCAGAACAGGCCCTACAGCCTGGCCTACAAGAACGAAGGGTACAAGGAGAACTCCGCC
AGCGGAGCGCCATCCATGAACACCGTGGCCACTGAGGAGCTGCCGATAATACACCACCCG
GGAGGACTCACTTCACCCGAGGATGACACTCTGTCGCCCACCAGCCCCTCGCAGTACTTC
AACTCGGATACATTACCCTTGACGGGTGACAAGTCCGACGACCCTATATTCATGAAGAGG
GAACTTGAAAGGGAAGGGAAGATCTACGGTCCGTATGGGGCTCGGGACCATGGAGGTCAA
CCGAAATTATCATTTTTGATGGAATTGAGATCCAAGCTGCCAGAACAACCTCAAGCTGGC
GCTGTACCCGCTACGACCTTTGGACATAGACGAGATGAGCCTCAATATTATGATGATAGA
CTGCCCAGCCCGCAACCACCCGGCTATCAAACAGATTATCCAACTAACTATGACTATGAA
CCGAGTTATCCAGAAAGCTATCAAGCCAGTGCCACTTCATCACCAGACCTTCATCCATCG
GATCTATACACCAGATCGAGATCGGAAGCACTTTTAGAAACCAACTTCGATTTCGAAGAC
TCACAAAATTCTCAACTCTCCGAAGCGAATAGAGCACACAGCCAGCCTTTAGAAACTGCC
ATGTAA
Protein sequence:
MPEVTLTISPGVVLEFAPNVGILVLGTLRAIGHSQLPILMKPMSSSSKIETNRIERDVSQ
YYTSHHKNKRQLEMLTTTESIRLCTGRNCSMSDDNHKTNNEGFLEYYNRTTLQWVPMCDN
RFTERNAQVVCRELGFDPINIFFAHDYRVEYHSNSLSRIWSWPEPLQCVGTEDRYEDCPI
RLNGQLYGHRHECKWNSQFVFIHCGSRNLNENLEYWGGIRFTNPEFEYSLYEHRIHDHHT
HETLKKVESVLQHVQIIGAGILHNEKSPALQSIVKNPVIENVNISKCAHHGINVISPTDT
INMMFNSISDTLGEGISAISLSGEGRDSEESSFTPLKDLNLPYHLFSLIDICDSTKVVTV
EERVLLYYKYDNNPVNCVKIFKSMYRVKPFGFRLLQFNLFNHTLNYGKRDSVTLYDGDIY
NITAPQIAYLENGSPDEKKLFKTEGSSLSVKLFANGASSVHGFIAEIVTLPISAIGLSRD
VQHNISNSEIINNRDGAITYQSVGEVNPLVAITRNDITQNCLKLYGNFTTCQAAVTVDVQ
NTQTLVFRNNLIRNNVGGLLVRADSRGSATSLRGWIHNNLFLANHDRPALQVEGRQSSPY
QEVTIYRNYFTRNQALYRDVIVLHQVVSNFSHNYVHHNTGLRILDVSGFHKVRLPIYQTT
SHNGFYKNYALDREGRATVAAGTAGQHYVDNVFFNPDNDYEMITVNRSISLDVWRTRVDA
KHNYWSYNESLAVAGRVRDRADDPRLLEVDYRPYYMNNASVLGGGKCPPGWVSLSGTCYM
YIGAPMTYHEARAFCLSDNASMPYVNGNYDDLYEFIHSQNQWVRYGDRVWVNHIDYVTQC
TSFAYSDIEITDCGQKNAFICEIDPKISIDPLSWRGDALVVSFVCVLGAAVLLVALALVA
WYYKSKHRHVQRLERRNSIRQSLHSVRSIGSINRGFPDTTYRRKMVQMSARSTDTLTKGS
DYKKMLASTTSMESMEKSQFNSSLEDTQSFDIYEAHNPNNVIQLKHSTFNRKPASPEYSV
PQNRPYSLAYKNEGYKENSASGAPSMNTVATEELPIIHHPGGLTSPEDDTLSPTSPSQYF
NSDTLPLTGDKSDDPIFMKRELEREGKIYGPYGARDHGGQPKLSFLMELRSKLPEQPQAG
AVPATTFGHRRDEPQYYDDRLPSPQPPGYQTDYPTNYDYEPSYPESYQASATSSPDLHPS
DLYTRSRSEALLETNFDFEDSQNSQLSEANRAHSQPLETAM