frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Project Hail Mary – Stellar Navigation Chart

https://valhovey.github.io/gaia-mary/
767•speleo•13h ago•170 comments

Blog ran on Ubuntu 16.04 for 10 years. I migrated it to FreeBSD

https://crocidb.com/post/this-blog-ran-on-ubuntu-16-04-for-10-years-i-migrated-it-to-freebsd/
225•speckx•11h ago•121 comments

The surprising story behind the first British person in space

https://www.bbc.com/culture/article/20260518-helen-sharman-the-story-behind-the-first-british-per...
9•xoxxala•1d ago•0 comments

Samsung chip workers will get an average $340k bonus as AI profits soar

https://qz.com/samsung-chip-workers-bonus-ai-profits-052126
195•carabiner•4h ago•132 comments

Was my $48K GPU server worth it?

https://rosmine.ai/2026/05/13/was-my-48k-gpu-worth-it/
370•apwheele•3d ago•264 comments

Using Kagi Search with Low Vision

https://veroniiiica.com/using-kagi-search-with-low-vision/
174•speckx•10h ago•48 comments

Uv is fantastic, but its package management UX is a mess

https://www.loopwerk.io/articles/2026/uv-ux-mess/
148•nchagnet•8h ago•86 comments

Slumber a TUI HTTP Client

https://slumber.lucaspickering.me
7•jicea•1h ago•0 comments

Indexing a year of video locally on a 2021 MacBook with Gemma4-31B (50GB swap)

https://blog.simbastack.com/indexed-a-year-of-video-locally/
343•asenna•15h ago•101 comments

Show HN: Freenet, a peer-to-peer platform for decentralized apps

https://freenet.org/
255•sanity•15h ago•138 comments

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

https://arxiv.org/abs/2605.19269
6•matt_d•1h ago•0 comments

Mycorrhizal Fungi, Nature's Key to Plant Survival and Success

https://pacifichorticulture.org/articles/mycorrhizal-fungi-natures-key-to-plant-survival-and-succ...
71•mooreds•1d ago•11 comments

Tristan Davey's Punch Card Archive

https://punchcards.tristandavey.com/
19•ohjeez•2d ago•3 comments

Show HN: KVBoost – chunk-level KV cache reuse for HuggingFace, 5–48x faster TTFT

https://pythongiant.github.io/KVBoost/
6•pythongiant•1h ago•0 comments

Python 3.15: features that didn't make the headlines

https://blog.changs.co.uk/python-315-features-that-didnt-make-the-headlines.html
355•rbanffy•18h ago•171 comments

Cleve Moler (Matlab, MathWorks) passed away on May 20, 2026

https://www.mathworks.com/company/aboutus/founders/clevemoler.html
17•mychele•3h ago•1 comments

Lost Images from the 1945 Trinity Nuclear Test Restored

https://spectrum.ieee.org/trinity-nuclear-test
323•pseudolus•18h ago•98 comments

Flipper One – we need your help

https://blog.flipper.net/flipper-one-we-need-your-help/
1110•sandebert•18h ago•433 comments

Launch HN: Runtime (YC P26) – Sandboxed coding agents for everyone on a team

https://www.runtm.com/
80•gustrigos•13h ago•22 comments

Spotify will start reserving concert tickets for fans

https://www.hollywoodreporter.com/music/music-industry-news/spotify-will-start-reserving-concert-...
130•elffjs•13h ago•259 comments

Waymo pauses Atlanta service as its robotaxis keep driving into floods

https://techcrunch.com/2026/05/21/waymo-pauses-atlanta-service-as-its-robotaxis-keep-driving-into...
288•mattas•13h ago•363 comments

Deciphering the Hashihara Castle Town Map

https://www.obayashi.co.jp/en/kikan_obayashi/detail/kikan_64_project.html
36•1970-01-01•2d ago•0 comments

Multi-Stream LLMs: new paper on parallelizing/separating prompts, thinking, I/O

https://arxiv.org/abs/2605.12460
86•atomicthumbs•10h ago•9 comments

Seattle Shield, an intelligence-sharing network operated by the Seattle police

https://prismreports.org/2026/05/20/seattle-shield-private-companies-surveillance/
445•root-parent•11h ago•176 comments

Google's Antigravity bait and switch

https://www.0xsid.com/blog/antigravity-bait-n-switch
621•ssiddharth•16h ago•284 comments

Throwing AI-generated walls of text into conversations

https://noslopgrenade.com/
556•napolux•20h ago•328 comments

News outlets are limiting the Internet Archive’s access to their journalism

https://www.niemanlab.org/2026/05/more-than-340-local-news-outlets-are-limiting-the-internet-arch...
253•jaredwiener•12h ago•88 comments

We're testing new ad formats in Search and expanding our Direct Offers pilot

https://blog.google/products/ads-commerce/google-marketing-live-search-ads/
586•sofumel•20h ago•525 comments

Mounting git commits as folders with NFS (2023)

https://jvns.ca/blog/2023/12/04/mounting-git-commits-as-folders-with-nfs/
100•pvtmert•2d ago•46 comments

ParadeDB (YC S23) Is Hiring Distributed Systems/Platform Engineers

1•philippemnoel•12h ago
Open in hackernews

Beyond Elk: Lightweight and Scalable Cloud-Native Log Monitoring

https://greptime.com/blogs/2025-04-24-elasticsearch-greptimedb-comparison-performance
25•xzhuang1984•1y ago

Comments

firesteelrain•1y ago
Any reason to use this like in Azure over their cloud native options such as with AKS that has fluentd built into the ama-pod? It already sends logs to Azure Monitor/LogA. Azure Managed Grafana can take in Kusto queries. AMA can monitor VMs. Further you can use DCE/DCRs for custom logs. Azure provides Azure native ElasticSearch too. It seems to own this market.

You can predictably control costs and predict costs with these models.

killme2008•1y ago
Agree. Leveraging capabilities provided by cloud vendors is always a good idea. However, as the scale grows, cost inevitably becomes an issue. Third-party solutions often offer cost advantages because they support multi-cloud deployments and are optimized for specific scenarios.
chreniuc•1y ago
How does it compare to openobserve?
atombender•1y ago
How does Greptime handle dynamic schemas where you don't know most of the shape of the data upfront?

Where I work, we have maybe a hundred different sources of structured logs: Our own applications, Kubernetes, databases, CI/CD software, lots of system processes. There's no common schema other than the basics (timestamp, message, source, Kubernetes metadata). Apps produce all sorts JSON fields, and we have thousands and thousands of fields across all these apps.

It'd be okay to define a small core subset, but we'd need a sensible "catch all" rule for the rest. All fields need to be searchable, but it's of course OK if performance is a little worse for non-core fields, as long as you can go into the schema and explicitly add it in order to speed things up.

Also, how does Greptime scale with that many fields? Does it do fine with thousands of columns?

I imagine it would be a good idea to have one table per source. Is it easy/performant to search multiple tables (union ordered by time) in a single query?

killme2008•1y ago
Thanks for your question. GreptimeDB, like MongoDB, is schemaless. When ingesting data via OTEL or its gRPC SDKs, it automatically creates tables by inferring the schema and dynamically adds new columns as needed.

Secondly, I prefer wide tables to consolidate all sources for easy management and scalability. With GreptimeDB's columnar storage based on Parquet, unused columns don't incur storage costs.

atombender•1y ago
Thanks, that seems promising. So much of the documentation is schema-oriented, I didn't see that it supported dynamic schemas.

I find it interesting that Greptime is completely time-oriented. I don't think you can create tables without a time PK? The last time I needed log storage, I ended up picking ClickHouse, because it has no such restrictions on primary keys. We use non-time-based tables all the time, as well as dictionaries. So it seems Greptime is a lot less flexible?

killme2008•1y ago
Yes, GreptimeDB requires a time index column for optimized storage and querying. It's not a constraint of a primary key, but just an independent table constraint.

Could you elaborate on why you find this inconvenient? I assumed logs, for example, would naturally include a timestamp.

atombender•1y ago
It's less convenient because it makes the database less general-purpose. The moment you need to go beyond time-based data, you have to reach for other tools.

ClickHouse is such a wonderful database precisely it's so incredibly flexible. While most data I interact with is time-based, I also store lots of non-time-based data there to complement the time-based tables. The rich feature set of table engines, materialized views, and dictionaries means you have a lot of different tools to pick from to design your solution. For example, to optimize ETL lookup, I use a lot of dictionaries, which are not time-based.

As an example, let's say I'm ingesting logs into Greptime and some log lines have a customer_id. I would like the final table, or least a view, to be cross-referenced with the customer so that it can include the customer's name. I suppose one would have to continually ingest customer data into a Greptime table with today's date, and then join on today's date?

killme2008•1y ago
Fair point. Joining time-series data with business data is often necessary. While GreptimeDB currently supports external tables for Parquet and CSV files, we plan to expand this support to include datasources like MySQL and PG in the future.
client4•1y ago
For logs I'd be more likely to choose https://www.gravwell.io as it's log agnostic and I've seen it crush 40Tb/s a day, whereas it looks like greptime is purpose-tuned for metrics and telemetry data.
dijit•1y ago
is gravwell open source?

(it seems greptime is.)

reconnecting•1y ago
I'm always skeptical toward software companies with an outdated year in the footer.
killme2008•1y ago
Thanks for pointing it out! The footer has been updated.
reconnecting•1y ago
Thank you for your prompt attention to this matter. Until next year, then.
killme2008•1y ago
We'll find a way to fix it forever :D
emmanueloga_•1y ago
a "no brown M&Ms" razor!
reconnecting•1y ago
From a website perspective, finding the current year can be challenging, but there's always a way to hack around it. For example, by parsing another website to get the year.
ByteBard1979•1y ago
What scenario would I use best?
qmarchi•1y ago
Am I the only one that got, "This article smells like it was written by an AI told to 'compare these two products'"?

Something around the sentence structure just is offputting.

killme2008•1y ago
The author is not a native speaker; I promised it's not an AI article but with some minor reviews from AI :)
up2isomorphism•1y ago
This space is so crowded, I think any new startup is very unlikely to survive, unless it solves its own business case first.
killme2008•1y ago
Yes, so many startups are trying to solve the log issue in the current stack.

In my personal observation, the vast majority of startups are still focused on the product layer and use ClickHouse directly for storage. However, ClickHouse’s tightly coupled storage and compute architecture makes it difficult to scale, and this becomes a real problem as workloads grow. GreptimeDB, on the other hand, is more focused on being an all-in-one observability database. Our log UI, however, still has quite a gap compared to products like Kibana.

This space is very crowded. I think it’s unlikely that any new startup will succeed here unless it can first solve its own business use case exceptionally well.

Would love to hear your thoughts.

atombender•1y ago
Reading the web site, I just noticed the open-source version does not have "Log query endpoints".

Does that mean you have to use SQL (or the visual SQL builder) to query logs, and you don't get access to a log query language the way Kibana gives you KQL and Lucene syntax?

If so, I think it's a little disingenuous to write an article comparing the ELK stack, which is open source and comes with a perfectly usable query UI, to Greptime's equivalent, which is not.

killme2008•1y ago
In fact, we have an open-source query language, but it's still in experimental, so we don't present it on the website. The description of the enterprise feature is not precise. Sorry for the inconvenience.

GreptimeDB also open-sources the log view UI if you read the article.

I agree with you that ETL is so powerful, and GreptimeDB is so young, we still have lots of work to do. Thank you.

atombender•1y ago
Thanks, sounds interesting. It's actually not at all clear from the article that the UI, as presented, is open source. I'm looking for an ELK replacement (in an enterprise setting), so it sounds like Greptime is something I might be able to use.
killme2008•1y ago
Thanks for your feedback. We fixed the descriptions of log query endpoints. Hope it's more clear. Glad you're considering giving it a try and looking forward to your feedback.