本文共 1923 字,大约阅读时间需要 6 分钟。
coreseek 号称是sphinx中文版,特色是libmmseg 中文分词支持,目前稳定版是3.2.14,对应的sphinx版本是 0.9.9,(sphinx最新版为2.1.7)。
由于官方没有给CentOS 6打包,网上也几乎全是make/make install之类的教程,于是花了几个小时,将coreseek和mmseg分别打包成srpm,方便新手。
默认配置文件改为/etc/sphinx/csft.conf,在CentOS 6 x64 平台上测试打包。
编译后的rpm二进制软件包内容如下
coreseek
1 2 3 4 5 6 7 8 9 10 11 12 13 | # rpm -qpl coreseek-3.2.14-2.el6.x86_64.rpm /etc/logrotate .d /coreseek /etc/rc .d /init .d /searchd /etc/sphinx /etc/sphinx/csft .conf /usr/bin/indexer /usr/bin/indextool /usr/bin/search /usr/bin/searchd /usr/bin/spelldump /var/lib/sphinx /var/log/sphinx /var/run/sphinx |
mmseg
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 | # rpm -qpl mmseg-3.2.14-1.el6.x86_64.rpm /etc/mmseg /etc/mmseg/mmseg .ini /etc/mmseg/uni .lib /etc/mmseg/unigram .txt /usr/bin/mmseg /usr/include/mmseg /usr/include/mmseg/ICorpusReader .h /usr/include/mmseg/SegmentPkg .h /usr/include/mmseg/Segmenter .h /usr/include/mmseg/SegmenterManager .h /usr/include/mmseg/Singleton .h /usr/include/mmseg/StringTokenizer .h /usr/include/mmseg/SynonymsDict .h /usr/include/mmseg/ThesaurusDict .h /usr/include/mmseg/UnigramCorpusReader .h /usr/include/mmseg/UnigramDict .h /usr/include/mmseg/UnigramRecord .h /usr/include/mmseg/Utf8_16 .h /usr/include/mmseg/bsd_getopt .h /usr/include/mmseg/bsd_getopt_win .h /usr/include/mmseg/csr .h /usr/include/mmseg/csr_assert .h /usr/include/mmseg/csr_mmap .h /usr/include/mmseg/csr_pool .h /usr/include/mmseg/csr_typedefs .h /usr/include/mmseg/csr_utils .h /usr/include/mmseg/darts .h /usr/include/mmseg/dictionary .h /usr/include/mmseg/freelist .h /usr/include/mmseg/iniparser .h /usr/include/mmseg/mmthunk .h /usr/include/mmseg/os .h /usr/include/mmseg/scoped_ptr .h /usr/include/mmseg/tolowercase .h /usr/lib64/libmmseg .a /usr/lib64/libmmseg .la |
附件超过2M,请移步百度网盘下载。
http://pan.baidu.com/s/1c0ovYzm