New model in OGS2.0 | DPOGS205913 |
---|---|
Genomic Position | scaffold4652:- 5213-19591 |
See gene structure | |
CDS Length | 1515 |
Paired RNAseq reads | 1172 |
Single RNAseq reads | 3174 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007012 (0.0) |
Best Drosophila hit | CG31145, isoform C (1e-160) |
Best Human hit | dentin matrix protein 4 (1e-111) |
Best NR hit (blastp) | FAM20C [Culex quinquefasciatus] (5e-179) |
Best NR hit (blastx) | PREDICTED: similar to AGAP002913-PA [Tribolium castaneum] (1e-170) |
GeneOntology terms | GO:0005576 extracellular region |
InterPro families | IPR009581 Protein of unknown function DUF1193 |
Orthology group | MCL14200 |
Nucleotide sequence:
ATGAATATGAAGCTCAAGGAGCGGCTCGCGCTCGCTCTTAGCGCATTTCTTGTACTGTTC
ACACTGATGCTGATTGTTGACATACAAATGGACTACGGCATCTCCGGACACAGGGTACCA
TTGCATGGACGTGTCAAAATAGGCGATGACACGGATAAAGGAAGATCGGCGTATATAGAG
TTTAGAAAAAGATTTCTACAGAAAAGCAATGGTAGCAATGGTTCACGAGAGTACGAGCAG
TCGGCTCCGAGTGAGACGAGCGGTGGCGACGCTGAACCTAGGACGCCGGCGACACCTGCC
GATAGGTTCGAGGACCTGCAGAGGATACTGGTCTTCCAGTTGCAGGGGAAGTCTGATGAC
AACCCTGTGGTGGTGCCACCCCATCGAGATCTGAACGTCTTAAGACCCGAGAACCCTACC
ATCGGTGAAATGGAGGACTTGGAACCAAGTGTAAATGCCTCAAATCTGGAGAAATTTCAA
TTGAAAATAGCGCAACACGAGCTCTATGAAGACGGAGAGCCGCTGGTCAGCGCCATTCTT
CGCGACATGACCTTTGAACCCATCCTACATGTTGAACAAAAGGAAGGAGGAACGCAACTG
AAGCTCATCATCGACTATCCAAATGGCGTGCAGGCTTTATTCAAACCGATGAGGTTCGCC
CGGGATGTACAAACTCTACCTAATCACTTCTACTTCTCAGACTATGAGCGGCATAACGCT
GAAATTGCTGCTTTTCATCTTGACAGGATACTCGGTTTCCGTCGAGCGATGCCGGTGGTG
GGTCGAGTTGTGAATATGACCACTGAGATCTATGACGTCACTGAGGGAGACATCTTGAAG
ACGTTCTTCGTATCTCCAGCGAACAACTTCTGTTTCCACGGCAAGTGTTCGTACTATTGC
GACACAGGACACGCGATATGCGGCAATCCGGACATGTTAGAAGGCAGCTTTGCGGCTTTC
CTACCAACCTCGGATCTAGCGGAGCGTAAGGTGTGGAGACATCCCTGGCGAAGATCTTAT
CACAAACGAAGGAAAGCTCAGTGGGAGCTGCAGTCCGACTACTGTGATACGGTCCGCAGT
ACTCCGCCCTACGACTCCGGCCGTCGTCTGTTAGACCTCATAGACATGTCGATATTCGAC
TTCCTCACTGGGAACATGGACAGACATCACTATGAGACATTCAAAATGTTCGGCAACGAG
ACTTTTACGTTGCACCTGGACCAGGGACGAGCTTTCGGTAAGGCGTTCCACGACGAGCTC
AGCATACTCGCGCCACTGCTACAGTGCTGCACCGTTAGACACACTACGCTTGCAGTCCTG
CTTAAATTCCATAACGGCGTGCCATTATCGAAAGTGCTCCGAGATTCTATGAAAGCTGAC
CCCGTGAATCCCGTGCTTTGGGAGCCTCACCTGGCCGCGTTAGACAGGCGTATAGTTACA
GTACTGGACGCGATCAGGAAATGCATAGATAAATTAGAAAATCCTCTACCGAATGAACTG
AACTCTGTCGTGTGA
Protein sequence:
MNMKLKERLALALSAFLVLFTLMLIVDIQMDYGISGHRVPLHGRVKIGDDTDKGRSAYIE
FRKRFLQKSNGSNGSREYEQSAPSETSGGDAEPRTPATPADRFEDLQRILVFQLQGKSDD
NPVVVPPHRDLNVLRPENPTIGEMEDLEPSVNASNLEKFQLKIAQHELYEDGEPLVSAIL
RDMTFEPILHVEQKEGGTQLKLIIDYPNGVQALFKPMRFARDVQTLPNHFYFSDYERHNA
EIAAFHLDRILGFRRAMPVVGRVVNMTTEIYDVTEGDILKTFFVSPANNFCFHGKCSYYC
DTGHAICGNPDMLEGSFAAFLPTSDLAERKVWRHPWRRSYHKRRKAQWELQSDYCDTVRS
TPPYDSGRRLLDLIDMSIFDFLTGNMDRHHYETFKMFGNETFTLHLDQGRAFGKAFHDEL
SILAPLLQCCTVRHTTLAVLLKFHNGVPLSKVLRDSMKADPVNPVLWEPHLAALDRRIVT
VLDAIRKCIDKLENPLPNELNSVV