New model in OGS2.0 | DPOGS213048  |
---|---|
Genomic Position | scaffold941:- 23912-47957 |
CDS Length | 1416 |
Paired RNAseq reads   | 42 |
Single RNAseq reads   | 160 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA008935 (4e-06) |
Best Drosophila hit   | ND |
Best Human hit | fibronectin type-III domain-containing protein C4orf31 precursor (2e-10) |
Best NR hit (blastp)   | PREDICTED: hypothetical protein [Taeniopygia guttata] (4e-15) |
Best NR hit (blastx)   | PREDICTED: hypothetical protein [Taeniopygia guttata] (1e-12) |
GeneOntology terms    | GO:0010811 positive regulation of cell-substrate adhesion; GO:0030198 extracellular matrix organization; GO:0008201 heparin binding; GO:0019800 peptide cross-linking via chondroitin 4-sulfate glycosaminoglycan; GO:0031012 extracellular matrix |
InterPro families   | IPR019326 Protein of unknown function DUF2369 |
Orthology group | MCL16168 |
Nucleotide sequence:
ATGACCACGGACGACATGATATATGGGTGTCACCTACATCGCGTCTTCCAAAGTCCGGAC
ATCGTCACGACAAGGCTAGCAGCGCTGATCATACTTCACGCAGCTAGAGCTGGATTCGCA
GCTCCAAGTGTTCTCGTCACCAGACCAGTCAGCAGGCTTTCCGAAGAATGGCTGCCAATG
GATAACCAGACCTTGTACACATTGGAAGAAGGAGAAAGTCGAAATCCAATAGACCCTCAA
TCTACAACCTATTGCGTGGTCGCATCTCGGAGAAAGAACTACACTTCACTTTGCGCAGCT
CAATACGACCTCCGGAATACAAAAACAGAAAAAGATACATCCTCTGATCAGTCGCTGGAA
AACCATAAACCTAACCGTTCACAAAACAGCGGTTCGGAAGAGAATAAAATTGATATAGAT
AAAGACAGCATAAGTATTTTTGATGGAAATTATAGGACTTTGTATAGAAGAAAAAAGTTT
GGTCGTAGCACAAGAGTATCAAACGAAGACCCACTGGTTGTTTGTATAGGAGACAGAACA
CATCACTTTATTGAAAATTTAGATCCTGGCACAACCTATTTCGTCAGTATTTTCGGCATT
GCAAGAGATAGACAAATTGGGTCCTTATTAGCATCTGGAAGCGTTCGACCCAGGACGTCC
ACTGCTAAAAGACTAAGGGAAAATGTTCCTTACAAGGCTGACATAAAGGGAAAAAATGTT
TATTACTTGAAGACAACGACCAGTAGTACTTCAACAAACGCTGGTCTTTGGATTGCGACG
TCCACTTGCGGAGGTTCAGTTGACATCGAAGTTTATGTAAAAGGAAAGCGATTGTACGTG
GCCAAAAATATAGAAAATCATTCAAAGTTTTTCGTGCCATCACCAATACTTTCATCGACA
CAAGAAACAAGTGACGAGGGTTCTGTGCAGTTCGATTCGAGCTCAGAGGAATCAAAAATA
AGATACATTGTACGAGTTGTGCCTAACAAATGGGCTATTGATGGTGCCGTAACAATAGAG
TTACTGGCCTCAACATCAAGATGGGGCATGGATACGCCAGAACTTACCGACGGCGGAGTG
ATAAGGGAACTCCGGCCGAGGAGGTCTTGTAAATCCGTAGACATTGCCTTTTTACCAGCC
ACCCATAACGCGACTGATGTGATTAGATATTGCATTTCAACAAAAGAAATAGTCGACAAG
GACATAACAATCTGTGCGTTAACAAAAAAATCATCAACAAAAACACAATGCATAAGTCAC
ATGCAACGTCCTCAGTCAAGAGTAATAGTCCAGAAAGTGAGTGGCTTAAAACCGGGAAGA
AAATACGGAATTCAAGTAACTGCCGCCTCCAAAGGGATCTCAGTGCCATACAATGTCCTG
TATGTGGAAACTAATGCAACTTGCAAAGAAGAATAG
Protein sequence:
MTTDDMIYGCHLHRVFQSPDIVTTRLAALIILHAARAGFAAPSVLVTRPVSRLSEEWLPM
DNQTLYTLEEGESRNPIDPQSTTYCVVASRRKNYTSLCAAQYDLRNTKTEKDTSSDQSLE
NHKPNRSQNSGSEENKIDIDKDSISIFDGNYRTLYRRKKFGRSTRVSNEDPLVVCIGDRT
HHFIENLDPGTTYFVSIFGIARDRQIGSLLASGSVRPRTSTAKRLRENVPYKADIKGKNV
YYLKTTTSSTSTNAGLWIATSTCGGSVDIEVYVKGKRLYVAKNIENHSKFFVPSPILSST
QETSDEGSVQFDSSSEESKIRYIVRVVPNKWAIDGAVTIELLASTSRWGMDTPELTDGGV
IRELRPRRSCKSVDIAFLPATHNATDVIRYCISTKEIVDKDITICALTKKSSTKTQCISH
MQRPQSRVIVQKVSGLKPGRKYGIQVTAASKGISVPYNVLYVETNATCKEE
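
The nucleotide and protein sequences above can be checked against each other. The following is a minimal Python sketch, not part of the original record, that translates a CDS in frame 0. It assumes the standard genetic code (NCBI translation table 1) applies to this gene model. The names `translate`, `CODON_TABLE` and `first_line` are illustrative only.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: this nuclear gene model uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 nt of the nucleotide sequence listed in this record.
first_line = "ATGACCACGGACGACATGATATATGGGTGTCACCTACATCGCGTCTTCCAAAGTCCGGAC"
print(translate(first_line))  # MTTDDMIYGCHLHRVFQSPD
```

The output matches the first 20 residues of the protein sequence listed above. The stated CDS length of 1416 nt corresponds to 472 codons, which is consistent with a 471-residue protein followed by the terminal TAG stop codon.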