androidapachebackendcentoscentos8.4commandcomposercrontabcurldesigndevelopdockerelasticfinancingfirewallfrontendgatewaygitgolanghybridiisiptablesjavajenkinskuberneteslaravelluamacmemcachemigrationmongodbmysqlnginxnodejsofficeopenrestyopensslperformancephpproductionredisregexprestfulsecurityshadowsocksshellsnippetsshstartupteamtraintravelunitvmwarevsftpwindowsworkwwwyaf

ElasticSearch

基本知识

Relational DB -> Databases -> Tables -> Rows -> Columns
Elasticsearch -> Indices   -> Types  -> Documents -> Fields

Elasticsearch集群可以包含多个索引(indices)(数据库),每一个索引可以包含多个类型(types)(表),每一个类型包含多个文档(documents)(行),然后每个文档包含多个字段(Fields)(列)

数据层

服务层

使用样例:

1. 先创建名叫 index 的索引

curl -XPUT http://localhost:9200/index

2. 创建名为 fulltext 的类型,并设定其使用 IK 分词器

这一步很重要,必须在往索引中添加数据前完成

curl -XPOST http://localhost:9200/index/fulltext/_mapping -d'
{
    "properties": {
        "content": {
            "type": "text",
            "analyzer": "ik_max_word",
            "search_analyzer": "ik_max_word"
        }
    }
}'

3. 往索引库中添加数据

curl -XPOST http://localhost:9200/index/fulltext/1 -d'{"content":"美国留给伊拉克的是个烂摊子吗"}'
curl -XPOST http://localhost:9200/index/fulltext/2 -d'{"content":"公安部: 各地校车将享最高路权"}'
curl -XPOST http://localhost:9200/index/fulltext/3 -d'{"content":"中韩渔警冲突调查: 韩警平均每天扣1艘中国渔船"}'
curl -XPOST http://localhost:9200/index/fulltext/4 -d'{"content":"中国驻洛杉矶领事馆遭亚裔男子枪击 嫌犯已自首"}'

IK 分词器

  1. ik_max_word 会将文本做最细粒度的拆分,比如会将“中华人民共和国国歌”拆分为“中华人民共和国,中华人民,中华,华人,人民共和国,人民,人,民,共和国,共和,和,国国,国歌”,会穷尽各种可能的组合;

  2. ik_smart 会做最粗粒度的拆分,比如会将“中华人民共和国国歌”拆分为“中华人民共和国,国歌”。

索引 API

创建索引

PUT http://127.0.0.1:9200/geeduo/

创建类型

POST http://127.0.0.1:9200/geeduo/goods/
{}

创建映射

POST http://127.0.0.1:9200/geeduo/goods/_mapping
{
        "properties": {
            "goods_id": {
                "type": "long"
            },
            "goods_name": {
                "type": "text",
                "analyzer": "ik_max_word",
                "search_analyzer": "ik_max_word"
            },
            "goods_tags": {
                "type": "text",
                "analyzer": "ik_max_word",
                "search_analyzer": "ik_max_word"
            },
            "goods_cate": {
                "type": "text",
                "analyzer": "ik_max_word",
                "search_analyzer": "ik_max_word"
            }
        }

}

删除索引

DELETE http://127.0.0.1:9200/geeduo/

查看映射

GET http://127.0.0.1:9200/geeduo/goods/_mapping

搜索 API

搜索测试

POST http://127.0.0.1:9200/geeduo/goods/_search
{
    "query": {
        "query_string": {
            "query": "中华人民共和国"
        }
    }
}

搜索高亮

POST http://127.0.0.1:9200/geeduo/goods/_search
{
    "query": {
        "match": {
            "goods_name": "我的世界玩具"
        }
    },
    "highlight": {
        "pre_tags": "<strong>",
        "post_tags": "</strong>",
        "fields": {
            "goods_name": {}
        }
}

混合搜索

POST http://127.0.0.1:9200/geeduo/goods/_search
{
    "query": {
        "match_phrase": {
            "goods_name": {
                "query": "玩具我的世界",
                "slop":  30
            }
        }
    },
    "highlight": {
        "pre_tags": "<strong>",
        "post_tags": "</strong>",
        "fields": {
            "goods_name": {}
        }
    }
}

分词 API

默认分词

GET http://localhost:9200/index/_analyze?text=中华人民共和国

IK 分词

GET http://localhost:9200/index/_analyze?analyzer=ik_max_word&text=中华人民共和国

优化事项

我们可以在索引时使用 ik_max_word,在搜索时用 ik_smart

SQL 支持

Match 语法

MATCH( field_exp, constant_exp [, options])

Match 简单查询

POST /_xpack/sql?format=txt
{
    "query": "SELECT * FROM twitter WHERE user_name = 'newbie'"
}

Match 带评分查询

POST /_xpack/sql?format=txt
{
    "query": "SELECT *, SCORE() FROM twitter WHERE match( user_name, 'maxbuff' )"
}

POST /_xpack/sql?format=txt
{
    "query": "SELECT item_title, SCORE() AS score FROM item_list2 WHERE match( item_title, '洗面奶' ) ORDER BY score DESC"
}

Match 指定分析器

POST /_xpack/sql?format=txt
{
    "query": "SELECT item_title, SCORE() AS score FROM item_list2 WHERE match( item_title, '洗面奶', 'analyzer=ik_smart' ) ORDER BY score DESC"
}

QUERY 语法

QUERY( constant_exp [, options]) 

QUERY 简单查询

POST /_xpack/sql?format=txt
{
    "query": "SELECT *, SCORE() FROM twitter WHERE QUERY('user_name:maxbuff')"
}

QUERY 复杂条件

POST /_xpack/sql?format=txt
{
    "query": "SELECT author, name, page_count, SCORE() FROM library WHERE QUERY('_exists_:"author" AND page_count:>200 AND (name:/star.*/ OR name:duna~)')"
}

QUERY 带可选项

POST /_xpack/sql?format=txt
{
    "query": "SELECT author, name, SCORE() FROM library WHERE QUERY('dune god', 'default_operator=and;default_field=name')"
}

相关链接