Datadog acquires Quickwit

2025-01-0917:49296115quickwit.io

The unexpected journey of building a multi-petabyte scale search engine

We are thrilled to announce that Quickwit is joining Datadog! We will be heads down building a new product with Datadog, so to ensure our open-source community can continue on, we will soon release a new version of Quickwit under the Apache License 2.0. Stay tuned!

But first, I want to share our story—a journey that spans four years, three continents, and countless plates of gyoza.

It all began with a decade-long friendship between three engineers—Paul, Adrien, and me—who first met in Paris in 2010. Back then, we'd sit in our favorite gyoza restaurant—the gyoza bar, it still exists. We would typically go there after attending an event called Start in Paris, where rookie founders came to pitch their startup ideas. Think Dragon’s Den with a mustache. There, we would dream about building something revolutionary together. But dreams remained dreams until 2020 when Paul called us to talk about his pet project, tantivy. The pet had sharp teeth - it was already a popular alternative to Lucene.

We knew we had to build something together within minutes in that call. The problem emerged naturally: Elasticsearch wasn't scaling effectively and had become too costly and complex to manage. Our mission crystallized: creating a search engine that would be at least 10x more cost-efficient, scale to multiple petabytes while letting operators sleep peacefully at night, and be significantly easier to manage—the dream of thousands of engineers operating Elasticsearch clusters.

With Adrien in San Francisco, Paul in Tokyo, and me in Paris, we created a "follow-the-sun" development cycle. Our choice of Rust proved transformative—our first demo on the Common Crawl dataset hit the front page of HN with virtually no bugs (except for that one pesky Python error: "'NoneType' object has no attribute"). We launched our first version on July 13, 2021. Choosing open source wasn't just strategic—it was natural for us as engineers. Our front page feature on HN brought our first interesting conversations, but we quickly realized our little experiment needed significant work. Each chat with potential users revealed new missing features we needed to implement. We were struggling to find the shortest path to market, making countless decisions along the way. Even in hindsight, it's hard to tell which choices were good or bad—each day brought its mix of successes and failures, definitely more of the latter than the former.

Everything changed when we met the engineering teams at Mezmo and Binance. Thanks to Open-Source. They understood our vision early and trusted us completely. Our partnerships transformed Quickwit from an ambitious idea into a battle-tested product operating at a mind-boggling scale:

  • Binance built a 100PB log service with Quickwit, indexing at 1.6PB/day.
  • Mezmo recently put in production Quickwit to serve thousands of customers and petabytes of logs, drastically reducing infrastructure cost and complexity while delivering the same user experience.

These partnerships helped us build one of the most cost-efficient, multi-petabyte-scale search engines from scratch. Along the way, we also contributed to several libraries in the Rust ecosystem:

  • tantivy: The foundation of Quickwit, a full-text search engine library.
  • chitchat: Cluster membership protocol with failure detection inspired by Cassandra and DynamoDB.
  • Bitpacking: SIMD algorithms for integer compression via bitpacking.
  • Whichlang: A blazingly fast and lightweight language detection library for Rust.
  • Mrecordlog: An efficient write-ahead-log (WAL) designed for multitenancy
  • And more!

This summer, the wind started to turn. We witnessed stronger open-source traction, our revenue increased dramatically, and VCs became more insistent. It was time for us to open a new chapter for the company and raise a series A round. However, building across Tokyo, Paris, and New York (Yes, Adrien moved from SF to NYC) stretched us thin, and scaling up would only intensify our challenges. We eventually decided to look for a new home for Quickwit. As we began evaluating acquisition opportunities, Datadog emerged as the clear standout, their proven ability to deliver exceptional user experiences, combined with Quickwit’s petabyte-scale search engine, presented a strong opportunity to create powerful solutions deployed in customer environments.

Initially skeptical about joining a large corporation, our reservations dissolved through interactions with the Datadog team. As Yanbing Li, Datadog's new CPO, candidly observed, "When you start discussing with those guys, you are on a slippery slope". Conversations with Olivier, Alexis, Michael, David, Yanbing, Laurent, Jeromy and others unveiled a refreshingly authentic culture—one characterized by intelligence, humility, and genuine integrity.

Today, we are excited to join Datadog and build something extraordinary together. We will be focused on building a new product with Datadog, and to ensure our open-source community can continue, we will soon release a major update of both Quickwit with a relicense to Apache License 2.0 and tantivy. This new Quickwit version will include several notable features our community has been asking for: distributed ingest, cardinality aggregations, performance and memory improvements, and more. Stay tuned for our upcoming releases!

We are extraordinarily thankful to everyone who invested their time and energy in this journey: employees, investors, contributors, friends, and allies. Special thanks to Pascal, Trinity, Harrison, Loic, Idriss, Shikhar, Vladimir, jY, Raphael, Borat, Evance, Michael, Eugene, Roch, Philippe, Michael, Matt, Sam, and many others who helped shape the project and spread the word.

To our amazing customers—Mezmo, Formal, Radiant Security, MatterLabs, Fly.io, and others—thank you for your trust and support.

To our community, thank you for being part of this adventure.

And if you are ever in Paris, let us know! We'd love to share some gyoza—conveniently close to the Datadog office.

With gratitude and excitement for our next chapter,

François

The Quickwit founders love gyozas.

Read the original article

Comments

  • By francoismassot 2025-01-1021:483 reply

    Co-founder of Quickwit here. Seeing our acquisition by Datadog on the HN front page feels like a truly full-circle moment.

    HN has been interwoven with Quickwit's journey from the very beginning. Looking back, it's striking to see how our progress is literally chronicled in our HN front-page posts:

    - Searching the web for under $1000/month [0]

    - A Rust optimization story [1]

    - Decentralized cluster membership in Rust [2]

    - Filtering a vector with SIMD instructions (AVX-2 and AVX-512) [3]

    - Efficient indexing with Quickwit Rust actor framework [4]

    - A compressed indexable bitset [5]

    - Show HN: Quickwit – OSS Alternative to Elasticsearch, Splunk, Datadog [6]

    - Quickwit 0.8: Indexing and Search at Petabyte Scale [7]

    - Tantivy – full-text search engine library inspired by Apache Lucene [8]

    - Binance built a 100PB log service with Quickwit [9]

    - Datadog acquires Quickwit [10]

    Each of these front-page appearances was a milestone for us. We put our hearts into writing those engineering articles, hoping to contribute something valuable to our community.

    I'm convinced HN played a key role in Quickwit's success by providing visibility, positive feedback, critical comments, and leads that contacted us directly after a front-page post. This community's authenticity and passion for technology are unparalleled. And we're incredibly grateful for this.

    Thank you all :)

    [0] https://news.ycombinator.com/item?id=27074481

    [1] https://news.ycombinator.com/item?id=28955461

    [2] https://news.ycombinator.com/item?id=31190586

    [3] https://news.ycombinator.com/item?id=32674040

    [4] https://news.ycombinator.com/item?id=35785421

    [5] https://news.ycombinator.com/item?id=36519467

    [6] https://news.ycombinator.com/item?id=38902042

    [7] https://news.ycombinator.com/item?id=39756367

    [8] https://news.ycombinator.com/item?id=40492834

    [9] https://news.ycombinator.com/item?id=40935701

    [10] https://news.ycombinator.com/item?id=42648043

    • By nextaccountic 2025-01-1022:062 reply

      I think you forgot to add the links

      Anyway tantivy is great! I love pg_search https://www.paradedb.com/blog/introducing_search (which appears to be built by another company, but on top of tantivy, which is a great feature of open source)

      Now, I am worried about development being stalled after this acquisition. How does further developing tantivy in the open helps Datadog's bottom line?

      • By dangoodmanUT 2025-01-111:041 reply

        I love quickwit, unfortunately datadog has a history of murdering open source (e.g. vector.io halting development and never fixing gross bugs)

        • By the_duke 2025-01-113:493 reply

          Yeah, a Vector dev that is now at Datadog told me that Vector is essentially deprecated.

          • By bfung 2025-01-117:19

            (Disclaimer: datadog employee)

            I joined Datadog after the Vector acquisition and now currently am the manager for the Community Open Source Engineering team that works on Vector open source.

            It’s def. not deprecated, but it did take awhile to sort out. It’s not easy figuring out business vs giving away software for free.

            Anyways, there’s quite a few issues and GitHub discussions everyday, in addition to Discord chats.

          • By epinephrinios 2025-01-122:26

            On the contrary, It's quite active lately - https://vector.dev/releases/

          • By quicksilver03 2025-01-1114:491 reply

            I'm using Vector for my own infrastructure and at work, at the time it seemed the best option to ship logs to various destinations. Are there any alternatives?

            • By pranay01 2025-01-1214:26

              If you want to check out OpenTelemetry, Otel-collector does the same job - though it's tightly coupled to opentelemetry

      • By philippemnoel 2025-01-1113:04

        pg_search dev here -- Thanks for mentioning us.

        Re: Tantivy. I'm hopeful the community Paul and the Quickwit team have built on top of Tantivy will continue to flourish. I'm sure Datadog will build product(s) with Quickwit, which is built on Tantivy and will contribute to it. Many other companies like ours (ParadeDB) and other databases also integrate it. I can't speak for others, but we'll contribute whenever possible. We're currently working on supporting nested documents in Tantivy, for example, and hoping to upstream this work.

        While it's reasonable to be concerned, I'd say this is a win for Quickwit, Tantivy and, of course, the well-deserving team behind them.

    • By adeptima 2025-01-1023:42

      Congratulations! The fact you and your team managed to built Tantivy is a huge contribution to open source.

      As someone who never managed to built a fond relationship with Apache Lucene based products (Solf, Elastic). I was extremely happy to see Tantivy in open source.

      BM25 scoring, proper asian language support, speed, memory foot prints, etc - amazing job! Thank you so much!

      https://github.com/quickwit-oss/tantivy

      IMHO Datadog made a smart move!

      If Tantivy itself just stays permanently under Apache2 licence and find a sustainable path to co exist with the rest of open source community - it's all good guys. You are more than deserve a commercial success.

    • By blinded 2025-01-110:31

      Congrats!!

  • By aseipp 2025-01-1020:301 reply

    Well, it looks like Quickwit was going to add an Enterprise license as of earlier this year (PR #5529), which I had been keeping eyes on, but this announcement says they're instead going to relicense as Apache 2.0 so the "community can continue on":

    > We will be focused on building a new product with Datadog, and to ensure our open-source community can continue, we will soon release a major update of both Quickwit with a relicense to Apache License 2.0 and tantivy.

    So, it looks like we'll get a more liberally licensed Quickwit, but reading between the lines suggests development of it is might otherwise be winding down? It has been pretty nice and stable in my experience, so I can't really complain much. But I was really looking forward to what else it could bring.

    Congrats to the team, in any case!

    • By mindcrash 2025-01-1021:453 reply

      "So, it looks like we'll get a more liberally licensed Quickwit, but reading between the lines suggests development of it is might otherwise be winding down?"

      They will stop fulltime day-to-day effort in it themselves, probably because they have been relocated to writing a similar service but closed and integrated in DD, but it seems they want to opensource the current product with a OSI compliant license in the hopes that the community picks up the tab.

      I think that's a nice trade. Could have been much worse.

      By the way, also note that DD is not a total stranger in the OSS space. They actually opensourced their observability pipeline tooling for general use as Vector, which is a rock solid product. - https://vector.dev/

      • By Dylan1312 2025-01-1021:591 reply

        Vector was already OSS when they acquired the company that created it, timber.

        https://www.datadoghq.com/blog/datadog-acquires-timber-techn...

        • By mindcrash 2025-01-111:26

          And yet they did dedicate some resources on it, until now. Which basically is my point :)

      • By aseipp 2025-01-1117:57

        Yes, I've been using Vector since very early on, long before Datadog acquired it, and Datadog have continued ongoing maintenance and feature additions at a slow-but-steady pace, which I think is good. Like Quickwit, Vector is very stable and already quite complete. So I'm not too unhappy.

        But Vector is something that complements Datadog's offering very well, so I think that makes sense for them to be good stewards of it. Quickwit is something that somewhat actively competes against them, which is a big difference. I suspect that unlike Vector, Quickwit is probably going to stop seeing any development in pretty short order, unless the devs now can consciously go out of their way to dedicate extra hours to it.

        To be clear, I think that the relicense is great, and I think it's very possible that Quickwit will be picked up/forked by someone and maintenance will continue, because it's very good, and I'd really love to see someone do metrics for it as well. So, I'm not all gloomy or anything like that.

      • By everfrustrated 2025-01-1021:56

        They bought Vector - it was always opensource

  • By Hixon10 2025-01-106:412 reply

    It's a bit sad that many modern databases were recently acquired. They had the potential to bring a lot of innovations.

    1. https://www.warpstream.com/

    2. https://www.orioledb.com/

    3. https://quickwit.io/

    • By oliverrice 2025-01-1017:581 reply

      (disclaimer: supabase employee)

      OrioleDB continues to be a fully open source and liberally licensed. We're working with the OrioleDB team to provide an initial distribution channel so they can focus on the storage engine vs hosting + providing lots of user feedback/bug reports. Our shared goal is to advance OrioleDB until it becomes the go-to storage engine for Postgres, both on Supabase and everywhere else.

      Happy to hear any concerns you have

      • By chrisweekly 2025-01-1020:561 reply

        Please forgive and help remedy my ignorance: it's a coherent goal to want OrioleDB to be the go-to storage engine for Postgres, on Supabase?

        • By oliverrice 2025-01-1022:232 reply

          I don't want to hijack Datadogs+Quickwit's post comment section with unrelated promotional-looking info. Quick summary below but if you have any other questions pls tag olirice in a Supabase GH discussion.

          The OrioleDB storage engine for postgres is a drop-in replacement for the default heap method. Its takes advantage of modern hardware (e.g. SSDs) and cloud infrastructure. The most basic benefit is that throughput at scale is > 5x higher than heap [1], but it also is architected for a bunch of other cool stuff [2]. copy-on-write unblocks branching. row-level-WAL enables an S3 backend and scale-to-zero compute. The combination of those two makes it a suitable target for multi-master.

          So yes, given that it could greatly improve performance on the platform, it is a goal to release in Supabase's primary image once everything is buttoned up. Note that an OrioleDB release doesn't take away any of your existing options. Its implemented as an extension so users would be able to optionally create all heap tables, all orioledb tables, or a mix of both.

          [1] https://www.orioledb.com/blog/orioledb-beta7-benchmarks

          [2] https://www.orioledb.com/docs

          • By satvikpendem 2025-01-1022:291 reply

            Makes sense, perhaps the previous commenter thought OrioleDB was itself a database rather than an implementation detail alternative to current databases. That's what I thought before I went to their site.

          • By chrisweekly 2025-01-1119:38

            Thanks! TIL

    • By philippemnoel 2025-01-1113:091 reply

      Acquisitions don't necessarily mean the end of innovation. Sometimes, it allows them to take innovations they've worked hard on for years and expand the reach to a significantly larger audience :)

      I have met the founders of all 3 of these companies and can assure you they all care tremendously about bringing their work to the world.

      ParadeDB is independent and without plans to sell anytime soon, though :)

      • By hobs 2025-01-1113:31

        Not hating on Quickwit, but almost never does an acquisition in the modern era mean continual innovation, most companies are now suborned to a greater purpose, and its almost never going to drive them to build the best thing they already have ended their lifecycle - nobody is going to buy them from DD and their quality/dev process will dominate, and that is of a decent size corporation.

        It also looks like most of DD's observability acquisitions are either integrated directly (seemingly with a full rewrite) or look a lot like aquihires for senior folks, so I wouldn't hold my breath here.

HackerNews