Manticore Search: Fast, efficient, drop-in replacement for Elasticsearch

https://github.com/manticoresoftware/manticoresearch

87•klaussilveira•8h ago

Comments

sandstrom•7h ago

For anyone who's interested, two other popular contenders for replacing Elasticsearch are Typesense (https://typesense.org/) and Meilisearch (https://www.meilisearch.com/).

(both are also trying to replace Algolia, because both have cloud offerings)

robertlagrant•7h ago

Don't forget OpenSearch[0].

[0] https://opensearch.org

smarx007•6h ago

Isn't that just an old fork of Elasticsearch?

mdaniel•6h ago

It is a fork, but not old; they have ongoing commits: https://github.com/opensearch-project/OpenSearch/commits/mai...

Plus, given that AWS is currently hosting Open Search, they are not incentivized to sit on their laurels when it comes to modern features or stability

Keyframe•5h ago

Went from ES and sharding hell to less of a sharding hell with OS on AWS. I've been looking for a replacement ever since first friday evening sharding party with infrastructure team.

sontek•5h ago

I don't believe meilisearch or typesense are API compatible with Elasticsearch. I think the best part of this new tool is its a drop-in replacement.

Edit: Nevermind, in another part of this thread the maintainer said:

    We do get compared to Elasticsearch a lot. While we support some of its APIs, Manticore isn't a drop-in replacement

Which conflicts with the README: "Drop-in replacement for E in the ELK stack"

snikolaev•5h ago

It's not a drop-in replacement in general, but it can be seen as a drop-in replacement for Elasticsearch in the ELK stack because: - You can send data to Manticore using Logstash (L) - You can visualize the data using Kibana (K)

Sorry for the confusion :)

merb•2h ago

That is the most bullshit thing I’ve read in a while. Send data to manticore via logstash does not make you an elasticsearch replacement. And a lot of elasticsearch use cases are not using kibana.

(Logstash can basically ingest and output to everything…)

entropyie•5h ago

Honourable mention to ZincSearch, if you are looking for a lightweight single binary (golang) alternative: https://github.com/zincsearch/zincsearch

I have no affiliation.

mdaniel•7h ago

One should not use "drop-in" when they have their own query language and seemingly input shape for the /search endpoint (which is also different from Elastic, of course) https://manual.manticoresearch.com/Searching/Full_text_match...

It sounds like they're really targeting the logging search store part of ELK, which can be a perfectly fine objective, but no need to mislead audiences since they will find out and then you've made an enemy

tbarbugli•7h ago

I agree, only reason I read the project readme was to see the drop-in explainer.

Very misleading title

andygeorge•5h ago

cc klaussilveira

klaussilveira•5h ago

Well, it says right there in the GitHub About section: "Drop-in replacement for E in the ELK stack".

snikolaev•7h ago

The Manticore Search github repository calls it a "drop-in replacement for E in the ELK stack," not just a replacement for Elasticsearch. On https://manticoresearch.com/, it's described as an "Elasticsearch alternative," so the confusion is probably just here on HN :)

snikolaev•6h ago

Hi everyone — I'm one of the maintainers of Manticore Search. Huge thanks to @klaussilveira for submitting this — we really appreciate the interest and the thoughtful discussion here.

A few points that came up in the thread and are worth clarifying:

- We do get compared to Elasticsearch a lot. While we support some of its APIs, Manticore isn't a drop-in replacement. We've focused on performance, simplicity, and keeping things open-source without vendor lock-in. Our own SQL-based query language and REST endpoints are part of that philosophy. - @mdaniel was right to question the "drop-in" wording — that's not our goal. - As @sandstrom pointed out, tools like Typesense and Meilisearch are part of this evolving search space. We see Manticore fitting in where users want powerful full-text and vector search capabilities with lower resource overhead and SQL capabilities (we support JSON too though)

We'd love to hear from you: - What are your main use cases for search or log indexing? - Which Elasticsearch features (if any) are absolutely essential for you? - Are there performance comparisons or scaling challenges you'd like to see addressed?

Happy to answer any questions or dive deeper.

aaroninsf•6h ago

Since you asked! I don't see mention of Lucene on the repo landing page,

could you ELI5 the query language and TD-IDF?

(Being lazy, I am happy to look into this myself lol.)

snikolaev•6h ago

We made a blogpost "TF-IDF in a nutshell" - https://manticoresearch.com/blog/tf-idf-in-a-nutshell/

Manticore Search's query language is more expressive than Lucene's. While Lucene supports basic boolean logic, field search, phrase queries, wildcards, and proximity, Manticore adds many powerful operators. These include quorum matching (/N), strict word order (<<), NEAR/NOTNEAR proximity, MAYBE (soft OR), exact form (=word), position anchors (^word, word$), full regular expressions, and more. Manticore uses SQL-style syntax with a MATCH() clause for full-text search, making it easier to combine text search with filters and ranking.

Semaphor•6h ago

> What are your main use cases for search

Camera and camera lens names. I tried mellisearch (1-2 years ago), and while I loved the simplicity (I barely understand what I threw together with many, many lines of C# code for elastic search; this is partially on ES, but clearly on me as well), it was not good at getting results.

Names like "Lumix DC-S1IIE", "DSC-RX100 VIIA", or "FE 50-150 mm F2 GM (SEL50150GM)" do not quite work with default tokenizers and analyzers. Of course that is for product names, for full text queries still need to use normal language rules… except for product names showing up in the text, so now I need multiple indexes for every field, and searching different sub-fields sometimes with different query analyzers.

It was a lot of trial and error getting ES to both find what was searched for, but also be typo tolerant. It’s very easy getting far too many results, and bad scoring for fuzzy queries.

So a bit of a special case, but something the customization capabilities of ES support pretty well.

Luckily, our dataset is rather small, maybe 100k documents, so scalability is not a problem.

snikolaev•5h ago

Thanks for sharing! What sets Manticore apart from Meilisearch and Elasticsearch is that it lets you configure tokenization at a low level by:

- choosing which characters should be treated as token characters, and using the rest as token separators

- defining "blend chars" — for example, the hyphen (-) could make sense as both a separator and a non-separator in your case

- or optionally adding it to the ignore_chars list

- there's also regexp_filter to process tokens when indexing and searching

That said, setting things like this up perfectly is always tricky with any search engine, because the words and punctuation in real data often don't follow regular patterns. It's especially difficult when you want to find "abc def" by "ab cd ef" which may be a common situation in your case.

sontek•5h ago

The main tag line says "Drop-in replacement for E in the ELK stack" -- but here you say you aren't a drop-in replacement.

Does this mean you've at least implemented every API that Kibana requires?

snikolaev•5h ago

Not every, but Kibana can be used with Manticore with some limitations - https://github.com/manticoresoftware/kibana-demo

Keyframe•5h ago

So, "Drop-in replacement for E in the ELK stack, but not really, maybe, limitations apply"? Why even use that copy then?

esafak•3h ago

To entice people searching for ElasticSearch alternatives.

victorbjorklund•1h ago

To be honest plenty do that. Companies claim their object storage is S3 compatiable even if they havent implemented every single functionality that S3 has.

nine_k•1h ago

Let's call it an almost equivalent replacement, like "e" for "E".

sandstrom•4h ago

"What are your main use cases for search or log indexing?"

To me, storing and searching logs is quite different from most other search use-cases, and it's not obvious that they should be handled by the same piece of software.

For example, tokenization, stemming, language support many other things and are basically useless in log search. Also, log search is often storing a lot of data, and rarely retrieving it (different usage pattern from many search use-cases which tend to be less write-heavy and more about reads).

I know ElasticSearch has had success in both, but if I were Manticore/Typesense/Meilisearch I'd probably just skip logs altogether.

Loki, QuickWit and other such tools are likely better suited for logs.

- https://github.com/quickwit-oss/quickwit - https://github.com/grafana/loki

lokl•3h ago

I've been using Sphinx for 20 years for full-text search with a custom stemmer. I considered switching to Manticore, but didn't see a huge need to do so, because Sphinx still works well for me and requires zero maintenance. Any big new features that might entice me to switch? (I only have a few GB of indexes, covering a few million documents.)

another_twist•6h ago

Curious about the architecture here. Where does the 20x speedup come from ?

Recently had a look at Tantivy as well, although compared to raw lucene, their perf is actually inferior. Wonder if there are specific benchmarks here which measure performace and if they compared tail latencies as opposed to averages.

snikolaev•6h ago

The speedup comes from a number of architectural and low-level performance optimizations in Manticore Search.

Manticore has a modern multithreading architecture with efficient query parallelization that fully utilizes all CPU cores. It supports real-time indexing - documents are searchable immediately after insertion, with no need to wait for flushes or refreshes.

It uses row-wise storage optimized for small to large datasets, and for even larger datasets that don’t fit into memory, there's support for columnar storage through the Manticore Columnar Library.

Secondary indexes are built automatically using the PGM-index (Piecewise Geometric Model index), which enables efficient filtering and sorting by mapping keys to their memory locations. The cost-based query optimizer uses statistics about the data to choose the most efficient execution plan for each query.

Manticore is SQL-first: SQL is its native syntax, and it speaks the MySQL protocol, so it works out of the box with MySQL clients.

It's written in C++, starts quickly, uses minimal RAM, and avoids garbage collection — which helps keep latencies low and stable even under load.

As for benchmarks, there's a growing collection of them at https://db-benchmarks.com, where Manticore is compared to Elasticsearch, MySQL, PostgreSQL, Meilisearch, Typesense, and others. The results are open and reproducible.

9dev•5h ago

If I had to guess, I would say it’s the 20x smaller feature set compared to Elasticsearch.

We built a custom search engine on top of Elasticsearch. Our query builder regularly constructs optimised queries that would be impossible to implement in any of the touted alternatives or replacements, which almost always focus on simple full text search, because that’s everything the developers ever used ES for. There’s a mindboggingly huge number of additional features that you need for serious search engines though, and any contender will have to support at least a subset of these to deserve that title in the first place.

I’m keeping an eye on the space, but so far, I’m less than impressed with everything I’ve seen.

another_twist•5h ago

What are the missing features though ? Autoshard, something related to ranking ? Also curious, why not go with algolia which as I understand kinda built for product facing search use cases ?

snikolaev•5h ago

Autosharding, authentication, dynamic mapping.

snikolaev•5h ago

It's somewhat smaller, but I believe not 20 times smaller. Among the major features, probably only authentication and auto-sharding are missing. Both are already in progress. On the other hand, the main feature missing in Elasticsearch is proper SQL support, which many Manticore users really appreciate.

9dev•2h ago

What’s the story on nested documents, complex Boolean queries, custom script scoring, pipelined aggregations, vectors and so on?

wavemode•3h ago

> Manticore Search was forked from Sphinx 2.3.2 in 2017.

What was the reason for the fork, and in what ways does Manticore Search differ from Sphinx today?

aleksi•2h ago

See https://manticoresearch.com/comparison/vs-sphinx/. Sphinx is no longer open-source.

cess11•1h ago

I like Manticore. It's easy to setup, lean on resources and quite fast. I use it when I want to quickly pour a lot of semi-structured text into a database for exploratory browsing and prototype web applications.

The auto-bolding of query terms in responses is quite convenient and has allowed me to skip annoying little regexes many times. Maybe other engines have it too and I never noticed?

CARA – High precision robot dog using rope

The Promised LAN

Major rule about cooking meat turns out to be wrong

Neil Armstrong's customs form for moon rocks (2016)

Parsing Protobuf like never before

A diverse cast of rocky worlds around a small star revealed by astronomers

Building better AI tools

What to expect from Debian/Trixie

Show HN: TheProtector – Linux Bash script for the paranoid admin on a budget

FastVLM: Efficient Vision Encoding for Vision Language Models

Interactive Programming in C (2014)

Checklists are hard, but still a good thing

How to increase your surface area for luck

Cops say criminals use a Google Pixel with GrapheneOS – I say that's freedom

I'm Unsatisfied with Easing Functions

Optery (YC W22) Is Hiring in Engineering, Legal, Sales, Marketing (U.S., Latam)

Show HN: The missing link of a bookstore's tech stack

You can now disable all AI features in Zed

The Big OOPs: Anatomy of a Thirty-Five Year Mistake

Kimi-K2 Tech Report [pdf]

AccuWeather to discontinue free access to Core Weather API

Vector Tiles are deployed on OpenStreetMap.org

US AI Action Plan

AI groups spend to replace low-cost 'data labellers' with high-paid experts

Why Elixir? Common misconceptions

How YouTube won the battle for TV viewers

Manticore Search: Fast, efficient, drop-in replacement for Elasticsearch

SIMD Perlin Noise: Beating the Compiler with SSE (2014)

AI overviews cause massive drop in search clicks

Reverse engineering GitHub Actions cache to make it fast

CARA – High precision robot dog using rope

The Promised LAN

Major rule about cooking meat turns out to be wrong

Neil Armstrong's customs form for moon rocks (2016)

Parsing Protobuf like never before

A diverse cast of rocky worlds around a small star revealed by astronomers

Building better AI tools

What to expect from Debian/Trixie

Show HN: TheProtector – Linux Bash script for the paranoid admin on a budget

FastVLM: Efficient Vision Encoding for Vision Language Models

Interactive Programming in C (2014)

Checklists are hard, but still a good thing

How to increase your surface area for luck

Cops say criminals use a Google Pixel with GrapheneOS – I say that's freedom

I'm Unsatisfied with Easing Functions

Optery (YC W22) Is Hiring in Engineering, Legal, Sales, Marketing (U.S., Latam)

Show HN: The missing link of a bookstore's tech stack

You can now disable all AI features in Zed

The Big OOPs: Anatomy of a Thirty-Five Year Mistake

Kimi-K2 Tech Report [pdf]

AccuWeather to discontinue free access to Core Weather API

Vector Tiles are deployed on OpenStreetMap.org

US AI Action Plan

AI groups spend to replace low-cost 'data labellers' with high-paid experts

Why Elixir? Common misconceptions

How YouTube won the battle for TV viewers

Manticore Search: Fast, efficient, drop-in replacement for Elasticsearch

SIMD Perlin Noise: Beating the Compiler with SSE (2014)

AI overviews cause massive drop in search clicks

Reverse engineering GitHub Actions cache to make it fast

Manticore Search: Fast, efficient, drop-in replacement for Elasticsearch

Comments