e.g.
搜索 人民
./search '人民'
Coreseek Fulltext 3.2 [ Sphinx 0.9.9-release (r2117)]
Copyright (c) 2007-2011,
Beijing Choice Software Technologies Inc (http://www.coreseek.com)
using config file '/usr/local/coreseek/etc/csft.conf'...
WARNING: stopwords: failed to get file size for 'G:\data\stopwords.txt'
WARNING: index 'test1': failed to stat G:\data\wordforms.txt: No such file or directory
index 'test1': query '人民 ': returned 2 matches of 2 total in 0.003 sec
displaying matches:
1. document=1, weight=1, group_id=1, date_added=Sat Jul 22 13:02:58 2017
id=1
group_id=1
group_id2=5
date_added=2017-07-22 13:02:58
title=test 广东人民英雄
content=this is my test document number one. also checking search within phrases.
2. document=3, weight=1, group_id=2, date_added=Sat Jul 22 13:02:58 2017
id=3
group_id=2
group_id2=7
date_added=2017-07-22 13:02:58
title=another doc 中华人民共和国
content=this is another group
搜索 人 则无结果
[root@iZuf647cvofre9edkjhztdZ bin]# ./search '人'
Coreseek Fulltext 3.2 [ Sphinx 0.9.9-release (r2117)]
Copyright (c) 2007-2011,
Beijing Choice Software Technologies Inc (http://www.coreseek.com)
using config file '/usr/local/coreseek/etc/csft.conf'...
WARNING: stopwords: failed to get file size for 'G:\data\stopwords.txt'
WARNING: index 'test1': failed to stat G:\data\wordforms.txt: No such file or directory
index 'test1': query '人 ': returned 0 matches of 0 total in 0.002 sec
words:
1. '人': 0 documents, 0 hits
WARNING: stopwords: failed to get file size for 'G:\data\stopwords.txt'
WARNING: index 'test1stemmed': failed to stat G:\data\wordforms.txt: No such file or directory
index 'test1stemmed': query '人 ': returned 0 matches of 0 total in 0.000 sec
words:
1. '人': 0 documents, 0 hits
我知道可能跟分词有关系,本人sphinx小白,加上现在coreseek官网不维护了,看不到文档。希望能大神简单解释一下,并给你学习资料感谢。
Coreseek 都没有人维护了,推荐使用原版Sphinx,原版Sphinx已经能很好的支持Unicode字符集索引,对于中文的支持也很好。