Run a tokenization test in Elasticsearch. Open Kibana (Dev Tools):
GET /scddb/_analyze
{
"text": "蓝瘦⾹菇",
"analyzer": "ik_max_word"  //ik_smart
}
The tokenization result is shown below; it is not ideal:
{
"tokens" : [
{
"token" : "蓝",
"start_offset" : 0,蓝瘦香菇是什么意思
"end_offset" : 1,
"type" : "CN_CHAR",
"position" : 0
},
{
"token" : "瘦",
"start_offset" : 1,
"end_offset" : 2,
"type" : "CN_CHAR",
"position" : 1
},
{
"token" : "⾹菇",
"start_offset" : 2,
"end_offset" : 4,
"type" : "CN_WORD",
"position" : 2
}
]
}
Add a custom IK dictionary by following the referenced guide:
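The referenced guide is not reproduced here. As a rough sketch (paths and file names are assumptions and depend on how the IK plugin was installed), a custom word is usually added by editing the plugin's IKAnalyzer.cfg.xml, typically under plugins/ik/config/ or config/analysis-ik/, and pointing the ext_dict entry at a dictionary file that lists the new words, one per line:

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE properties SYSTEM "http://java.sun.com/dtd/properties.dtd">
<properties>
    <comment>IK Analyzer extension configuration</comment>
    <!-- ext_dict points at a custom dictionary file, relative to this config directory;
         the name custom.dic is an assumption -->
    <entry key="ext_dict">custom.dic</entry>
</properties>

custom.dic then contains one entry per line, e.g.:

蓝瘦香菇

The IK plugin also offers a remote_ext_dict entry for loading a dictionary over HTTP, but with a local ext_dict file Elasticsearch must be restarted before the new words take effect.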
Restart Elasticsearch: service elasticsearch restart
Test again with the same GET /scddb/_analyze request as above:
{
"tokens" : [
{
"token" : "蓝瘦⾹菇",
"start_offset" : 0,
"end_offset" : 4,
"type" : "CN_WORD",
"position" : 0
}
]
}