Installing the elasticsearch-analysis-ik Chinese analyzer, step by step

Date: 2020-10-15 23:16:14

1 Installing the elasticsearch-analysis-ik Chinese analyzer

About IK: IK is a Chinese word-segmentation (analysis) plugin for Elasticsearch that supports custom dictionaries.

1.1 Download the IK analyzer

Download page: https://github.com/medcl/elasticsearch-analysis-ik/releases

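Download the zip package of the release that matches your Elasticsearch version (this walkthrough uses 6.2.2 for both).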
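
As a rough sketch of this step, the snippet below builds the download link for a given version and can optionally fetch the zip. The URL pattern (`/releases/download/v<version>/elasticsearch-analysis-ik-<version>.zip`) is an assumption about how release assets are named on the page above, so check it there if the download fails. The function names are illustrative and not part of the plugin.

```python
import urllib.request

RELEASES = "https://github.com/medcl/elasticsearch-analysis-ik/releases"


def ik_zip_name(version: str) -> str:
    """File name of the plugin archive, e.g. elasticsearch-analysis-ik-6.2.2.zip."""
    return f"elasticsearch-analysis-ik-{version}.zip"


def ik_zip_url(version: str) -> str:
    """Assumed asset URL pattern; verify it on the releases page."""
    return f"{RELEASES}/download/v{version}/{ik_zip_name(version)}"


def download_ik(version: str, dest_dir: str = ".") -> str:
    """Download the archive into dest_dir and return the local path."""
    target = f"{dest_dir}/{ik_zip_name(version)}"
    urllib.request.urlretrieve(ik_zip_url(version), target)
    return target


if __name__ == "__main__":
    # Use the same version as the Elasticsearch installation (6.2.2 here).
    print(ik_zip_url("6.2.2"))
```

Calling `download_ik("6.2.2", "/home/es")` would place the archive next to the Elasticsearch directory used in the next step.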

1.2 Unzip the IK analyzer

[es@bigdata1 plugins]$ cd /home/es/
[es@bigdata1 ~]$ ls
elasticsearch-6.2.2         jdk1.8.0_161        kibana-6.2.2-linux-x86_64.tar.gz
elasticsearch-6.2.2.tar.gz      jdk-8u161-linux-x64.tar.gz logstash-6.2.2.tar.gz
elasticsearch-analysis-ik-6.2.2.zip kibana-6.2.2-linux-x86_64
[es@bigdata1 ~]$ unzip elasticsearch-analysis-ik-6.2.2.zip 
Archive: elasticsearch-analysis-ik-6.2.2.zip
  creating: elasticsearch/
 inflating: elasticsearch/elasticsearch-analysis-ik-6.2.2.jar 
 inflating: elasticsearch/httpclient-4.5.2.jar 
 inflating: elasticsearch/httpcore-4.4.4.jar 
 inflating: elasticsearch/commons-logging-1.2.jar 
 inflating: elasticsearch/commons-codec-1.9.jar 
 inflating: elasticsearch/plugin-descriptor.properties 
  creating: elasticsearch/config/
 inflating: elasticsearch/config/extra_single_word_full.dic 
 inflating: elasticsearch/config/quantifier.dic 
 inflating: elasticsearch/config/IKAnalyzer.cfg.xml 
 inflating: elasticsearch/config/main.dic 
 inflating: elasticsearch/config/extra_single_word_low_freq.dic 
 inflating: elasticsearch/config/extra_stopword.dic 
 inflating: elasticsearch/config/preposition.dic 
 inflating: elasticsearch/config/extra_main.dic 
 inflating: elasticsearch/config/extra_single_word.dic 
 inflating: elasticsearch/config/suffix.dic 
 inflating: elasticsearch/config/surname.dic 
 inflating: elasticsearch/config/stopword.dic 
[es@bigdata1 ~]$ ls
elasticsearch        elasticsearch-analysis-ik-6.2.2.zip kibana-6.2.2-linux-x86_64
elasticsearch-6.2.2     jdk1.8.0_161             kibana-6.2.2-linux-x86_64.tar.gz
elasticsearch-6.2.2.tar.gz jdk-8u161-linux-x64.tar.gz      logstash-6.2.2.tar.gz
[es@bigdata1 ~]$ mkdir -p ./elasticsearch-6.2.2/plugins/ik
[es@bigdata1 ~]$ mv elasticsearch/* ./elasticsearch-6.2.2/plugins/ik/
[es@bigdata1 ~]$ vim ./elasticsearch-6.2.2/config/elasticsearch.yml
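
The unzip-and-move part of the transcript above can also be scripted. The following is a minimal sketch, assuming the archive wraps its files in a top-level `elasticsearch/` folder (as the 6.2.2 listing shows) and that `plugins/ik` is empty or does not exist yet. `install_ik` is a hypothetical helper name.

```python
import shutil
import tempfile
import zipfile
from pathlib import Path


def install_ik(zip_path: str, es_home: str) -> Path:
    """Unpack the IK archive into <es_home>/plugins/ik and return that directory."""
    plugin_dir = Path(es_home) / "plugins" / "ik"
    # The target directory must exist before files are moved into it.
    plugin_dir.mkdir(parents=True, exist_ok=True)

    with tempfile.TemporaryDirectory() as tmp:
        with zipfile.ZipFile(zip_path) as archive:
            archive.extractall(tmp)
        # Assumption: the archive has a top-level "elasticsearch/" folder;
        # fall back to the archive root if it does not.
        root = Path(tmp) / "elasticsearch"
        source = root if root.is_dir() else Path(tmp)
        for item in source.iterdir():
            shutil.move(str(item), str(plugin_dir / item.name))
    return plugin_dir
```

For the layout in this walkthrough, the call would be `install_ik("/home/es/elasticsearch-analysis-ik-6.2.2.zip", "/home/es/elasticsearch-6.2.2")`. Elasticsearch still has to be restarted afterwards for the plugin to load.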

1.3 Restart the Elasticsearch service

ps -ef | grep elasticsearch
Then kill the Elasticsearch processes that the command lists.
Then start the elasticsearch process again as described in section 18.8.

1.4 Run the following request (for example, in the Kibana console)

GET _analyze 
{ 
 "analyzer":"ik_smart", 
 "text":"2018年5月全球编程语言排行榜" 
}
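
Outside Kibana, the same call can be made over plain HTTP. This is only a sketch: the base URL `http://localhost:9200` is an assumption (the Elasticsearch default), and the helper names are illustrative. It uses POST, which the `_analyze` endpoint accepts alongside GET.

```python
import json
import urllib.request


def build_analyze_request(text, analyzer="ik_smart", base_url="http://localhost:9200"):
    """Build (but do not send) the HTTP request equivalent to the console call."""
    body = json.dumps({"analyzer": analyzer, "text": text}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/_analyze",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def analyze(text, analyzer="ik_smart", base_url="http://localhost:9200"):
    """Send the request and return the decoded JSON response."""
    request = build_analyze_request(text, analyzer, base_url)
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

With a running node, `analyze("2018年5月全球编程语言排行榜")` returns the token list as a dictionary. Passing `analyzer="ik_max_word"`, the plugin's other analyzer, gives a finer-grained segmentation.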

The result is:

{
  "tokens": [
    {
      "token": "2018年",
      "start_offset": 0,
      "end_offset": 5,
      "type": "TYPE_CQUAN",
      "position": 0
    },
    {
      "token": "5月",
      "start_offset": 5,
      "end_offset": 7,
      "type": "TYPE_CQUAN",
      "position": 1
    },
    {
      "token": "全球",
      "start_offset": 7,
      "end_offset": 9,
      "type": "CN_WORD",
      "position": 2
    },
    {
      "token": "编程",
      "start_offset": 9,
      "end_offset": 11,
      "type": "CN_WORD",
      "position": 3
    },
    {
      "token": "语言",
      "start_offset": 11,
      "end_offset": 13,
      "type": "CN_WORD",
      "position": 4
    },
    {
      "token": "排行榜",
      "start_offset": 13,
      "end_offset": 16,
      "type": "CN_WORD",
      "position": 5
    }
  ]
}
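
To make the response easier to read, the sketch below pulls out the token strings and checks that each token's offsets point back into the input text. `SAMPLE` is a shortened copy of the response above (first two tokens only), and the function names are illustrative. The offset check assumes the text contains no characters outside the Basic Multilingual Plane, where Python string indices and the reported offsets could differ.

```python
import json

# Shortened copy of the response above (first two tokens only).
SAMPLE = """
{"tokens": [
  {"token": "2018年", "start_offset": 0, "end_offset": 5, "type": "TYPE_CQUAN", "position": 0},
  {"token": "5月", "start_offset": 5, "end_offset": 7, "type": "TYPE_CQUAN", "position": 1}
]}
"""


def token_texts(response):
    """Return the token strings in the order the analyzer emitted them."""
    return [t["token"] for t in response["tokens"]]


def offsets_match(text, response):
    """True if every token equals text[start_offset:end_offset]."""
    return all(
        text[t["start_offset"]:t["end_offset"]] == t["token"]
        for t in response["tokens"]
    )


if __name__ == "__main__":
    parsed = json.loads(SAMPLE)
    print(token_texts(parsed))
    print(offsets_match("2018年5月全球编程语言排行榜", parsed))
```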

(Figure: the result as displayed in Kibana.)

Article URL: http://www.codeinn.net/misctech/11735.html