Feature request: support high-speed import by merging multiple `.dump` files #845

adysec · 2025-06-23T09:06:28Z

adysec
Jun 23, 2025

I need to import a very large number of documents, so I spun up several self-hosted Meilisearch instances (each one ingests different indexes) to improve write throughput.
Now I’m trying to merge all those indexes into a single self-hosted Meilisearch instance.

Importing via the HTTP API is far too slow; there’s no performance gain compared with writing the data directly.
After reading the docs and running experiments, I found that only the /dumps endpoint exports fast enough—but Meilisearch doesn’t let me import multiple .dump files at once.

What I tried

1.Unpack the .dump files (they’re just tar.gz archives), e.g.
dumps/aaa.dump & dumps/bbb.dump
To
dumps/aaa/index/aaa & dumps/bbb/index/bbb

2.Merge the contents of their index/ directories, e.g.
dumps/aaa/index/aaa and dumps/aaa/index/bbb

3.Inside dumps/aaa, run

tar -czf ../test.dump .

to create a merged test.dump.

4.Import with

./meilisearch --import-dump test.dump

Result: both aaa and bbb indexes appear in the target instance, but the searchableAttributes and displayedAttributes settings of bbb are lost and must be re-set manually. After resetting them I haven’t noticed any problems, but I don’t know whether hidden integrity issues remain.
Why this matters

This “unpack-merge-re-pack” workflow is extremely fast—orders of magnitude faster than the HTTP API—yet clearly unintended and a bit error-prone

Feature request

Make this high-speed dump-merge workflow an officially supported method so users can efficiently
- merge data from multiple instances, or
- import very large datasets.
Ideally, add a second mode to meilisearch-importer:
- the current HTTP API mode, and
- a new dump-file import mode that can merge multiple .dump files, validate the format, and preserve index settings.

adysec · 2025-06-25T04:08:31Z

adysec
Jun 25, 2025
Author

I built a small Rust tool meilisearch-dumper that does exactly this - generates dump files from JSON while keeping all the index settings intact.

It's basically a safer version of your manual merge workflow. Instead of unpacking and repacking dumps manually, you just feed it your JSON files and it creates a proper dump with all settings preserved.

Usage is pretty simple:

./meilisearch-dumper --index aaa --files aaa.json --index bbb --files bbb.json

This resolves the default 100MB HTTP request size limitation in Meilisearch (although it can be modified, it still incurs performance overhead during the ingestion process) and improves data import efficiency. However, it comes with the limitation of requiring a brand-new Meilisearch instance.

0 replies

macraig · 2025-07-03T15:10:44Z

macraig
Jul 3, 2025
Maintainer

Thanks for sharing your use case @adysec. Unfortunately, it's a bit niche and not a current priority for us. We'll keep the issue open so others can upvote and share their own use cases too.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Meilisearch

Feature request: support high-speed import by merging multiple `.dump` files #845

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Meilisearch

Feature request: support high-speed import by merging multiple .dump files #845

Uh oh!

adysec Jun 23, 2025

What I tried

Feature request

Replies: 2 comments

Uh oh!

adysec Jun 25, 2025 Author

Uh oh!

macraig Jul 3, 2025 Maintainer

Feature request: support high-speed import by merging multiple `.dump` files #845

adysec
Jun 23, 2025

adysec
Jun 25, 2025
Author

macraig
Jul 3, 2025
Maintainer