Japanese Kanji cannot search #4561

@exkoi

Describe the bug
Searching for a Japanese kanji character returns the wrong documents.

To Reproduce
Create Index:

curl -X POST 'http://localhost:7700/indexes' \
  -H 'Content-Type: application/json' \
  --data-binary '{
    "uid": "hospitals",
    "primaryKey": "id"
  }'

Configure index settings:

curl -X PATCH 'http://localhost:7700/indexes/hospitals/settings' \
  -H 'Content-Type: application/json' \
  --data-binary '{
    "searchableAttributes": [
      "name",
      "kana"
    ],
    "displayedAttributes": [
      "name",
      "kana"
    ]
  }'

Insert documents:

curl -X POST 'http://localhost:7700/indexes/hospitals/documents' \
  -H 'Content-Type: application/json' \
  --data-binary '[
      { "id": 1, "name": "日健クリニック", "kana": "ニッケンクリニック" },
      { "id": 2, "name": "日見中央病院", "kana": "ヒミチュウオウビョウイン" }
  ]'
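
Search (this step is not in the original report; it is a minimal sketch of the query that the expected behavior below refers to). It assumes the same local instance as the steps above with no API key configured. `MEILI_URL`, `search_payload`, and `search_hospitals` are hypothetical names used only for illustration; the `/indexes/hospitals/search` route and the `q` parameter come from the Meilisearch search API.

```shell
# Assumed: same host and port as the steps above, no master key set.
MEILI_URL='http://localhost:7700'

# Build the JSON body for a search query (hypothetical helper).
search_payload() {
  printf '{ "q": "%s" }' "$1"
}

# POST the query to the hospitals index and print the raw JSON response.
search_hospitals() {
  curl -s -X POST "$MEILI_URL/indexes/hospitals/search" \
    -H 'Content-Type: application/json' \
    --data-binary "$(search_payload "$1")"
}
```

Running `search_hospitals '健'` after the documents have finished indexing should show which documents appear in the `hits` array of the response.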

Expected behavior
When I search for "健", I expect "日健クリニック" to be returned, but documents containing "見" are returned instead.
In Japanese, "健" and "見" can both be pronounced "ken", but they are different characters with different meanings.

Screenshots
[Screenshot: meilisearch]

Meilisearch version:
v1.7.2

Additional context
Docker Image: getmeili/meilisearch:v1.7

Labels
duplicate (This issue or pull request already exists)
