frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

OpenTelemetry protocol with Apache Arrow

https://opentelemetry.io/blog/2025/otel-arrow-phase-2/
55•tanelpoder•6h ago

Comments

andygrove•5h ago
I've just started exploring adding OpenTelemetry support to the Comet subproject of DataFusion. I'm excited to see the integration with Apache Arrow (Rust) and potentially DataFusion in the future.
SomaticPirate•5h ago
Wow, anyone able to provide a ELI5? OTel sounds amazing but this is flying over my head
theLiminator•4h ago
Not sure, but seems like it will be producing apache arrow data and carrying it across the data stack end to end from OTEL. This would be great for creating data without a bunch of duplication/redundant processing steps and exporting it in a form that's ready to query.
piterrro•3h ago
Unless I dont understand that fully (which could be the case).

This idea could fly if downstream readers will be able to read it. Json is great because anything can read it, process, transform and serialize without having to know the intrisics of the protocol.

Whats the point of using binary, columnar format for data in transit?

arccy•3h ago
better compression https://opentelemetry.io/blog/2023/otel-arrow/

You don't do high performance without knowing the data schema.

odie5533•3h ago
Is Arrow better than Parquet or Protobuf?
theLiminator•3h ago
Arrow is an in-memory columnar format, kinda orthogonal to parquet (which is an at-rest format). Protobuf is a better comparison, but it's more message oriented and not suited for analytics.
arccy•2h ago
the blog post comparison is against OTLP which is protobuf
phillipcarter•2h ago
Warning: this is an oversimplification.

Performance optimization and being able to "plug in" to the data ecosystem that Apache Arrow exists in.

OpenTelemetry is pretty great for a lot of uses, but the protocol over the wire is too chunky for some applications where. From last year's post on the topic[0]:

> In a side-by-side comparison between OpenTelemetry Protocol (“OTLP”) and OpenTelemetry Protocol with Apache Arrow for similarly configured traces pipelines, we observe 30% improvement in compression. Although this study specifically focused on traces data, we have observed results for logs and metrics signals in production settings too, where OTel-Arrow users can expect 50% to 70% improvement relative to OTLP for similar pipeline configurations.

For your average set of apps and services running in a k8s cluster somewhere in the cloud, this is just a nice-to-have, but size on wire is a problem for a lot of systems out there today, and they are precluded from adopting OpenTelemetry until that's solved.

[0]: https://opentelemetry.io/blog/2024/otel-arrow-production/

KAdot•3h ago
> We are interested in making OTAP pipelines safely embeddable, through strict controls on memory and through support for thread-per-core runtimes.

I'm curious about the thread-per-core runtimes, are there even any mature thread-per-core runtimes in Rust around?

jauntywundrkind•3h ago
glommio is pretty well respected. https://www.datadoghq.com/blog/engineering/introducing-glomm... https://github.com/DataDog/glommio

ByteDance also has their very fast monio. https://github.com/bytedance/monoio

Both integrate io-uring support for very fast io.

julian-datable•3h ago
Integrations with OTLP are critical to driving adoption and probably one of the biggest pain points we've encountered when adopting it ourselves (and encouraging others to the same).

Adopting OTLP without third-party support is pretty time consuming, especially is your tech stack is large and/or varied.

Re runtimes: curious about this too. Feels like the right direction if you’re optimizing a telemetry pipeline.

akdor1154•3h ago
Damn that's some scope creep if I ever saw it: 'try sending Arrow frames end to end' => 'rewrite the otel pipeline in rust'. Seems like the goals of the contributors don't exactly align with the goals of the project.

Kind of a bummer - one thing i was hoping to come out of this was better Arrow ecosystem support for golang.

Type-constrained code generation with language models

https://arxiv.org/abs/2504.09246
65•tough•2h ago•28 comments

Your fingers wrinkle the same way every time you're in the water too long

https://www.binghamton.edu/news/story/5547/do-your-fingers-wrinkle-the-same-way-every-time-youre-in-the-water-too-long-new-research-says-yes
22•gnabgib•1h ago•2 comments

Flattening Rust's Learning Curve

https://corrode.dev/blog/flattening-rusts-learning-curve/
47•birdculture•2h ago•22 comments

Branch Privilege Injection: Exploiting branch predictor race conditions

https://comsec.ethz.ch/research/microarch/branch-privilege-injection/
319•alberto-m•8h ago•127 comments

Starcloud

https://www.ycombinator.com/companies/starcloud
126•wiley1454•4h ago•245 comments

Map of Palaeohispanic Coins and Inscriptions

http://hesperia.ucm.es/consulta_hesperia/mapas.php
8•brendanashworth•35m ago•0 comments

Build real-time knowledge graph for documents with LLM

https://cocoindex.io/blogs/knowledge-graph-for-docs/
64•badmonster•4h ago•11 comments

Failed Soviet Venus lander Kosmos 482 crashes to Earth after 53 years in orbit

https://www.space.com/space-exploration/launches-spacecraft/failed-soviet-venus-lander-kosmos-482-crashes-to-earth-after-53-years-in-orbit
98•taubek•3d ago•62 comments

Google is building its own DeX: First look at Android's Desktop Mode

https://www.androidauthority.com/android-desktop-mode-leak-3550321/
194•logic_node•10h ago•160 comments

Show HN: HelixDB – Open-source vector-graph database for AI applications (Rust)

https://github.com/HelixDB/helix-db/
114•GeorgeCurtis•7h ago•50 comments

PDF to Text, a challenging problem

https://www.marginalia.nu/log/a_119_pdf/
222•ingve•9h ago•124 comments

Multiple security issues in GNU Screen

https://www.openwall.com/lists/oss-security/2025/05/12/1
331•st_goliath•13h ago•202 comments

Launch HN: Miyagi (YC W25) turns YouTube videos into online, interactive courses

157•bestwillcui•11h ago•87 comments

A tool to verify estimates, II: a flexible proof assistant

https://terrytao.wordpress.com/2025/05/09/a-tool-to-verify-estimates-ii-a-flexible-proof-assistant/
12•jjgreen•3d ago•0 comments

When graphic design saves lives

https://news.harvard.edu/gazette/story/2025/05/when-graphic-design-saves-lives/
8•gnabgib•3d ago•0 comments

Garbage collection of object storage at scale

https://www.warpstream.com/blog/taking-out-the-trash-garbage-collection-of-object-storage-at-massive-scale
35•ko_pivot•3d ago•6 comments

It Awaits Your Experiments

https://www.rifters.com/crawl/?p=11511
125•pavel_lishin•9h ago•34 comments

How (memory) safe is Zig? (2021)

https://www.scattered-thoughts.net/writing/how-safe-is-zig/
18•vortex_ape•2h ago•16 comments

Cardiac: A CARDboard Illustrative Aid to Computation [pdf]

https://www.cs.drexel.edu/~bls96/museum/CARDIAC_manual.pdf
15•throwaway71271•2h ago•5 comments

Coffee for people who don't like coffee

https://ostwilkens.se/blog/coffee
20•ostwilkens•3d ago•56 comments

Less meat is nearly always better than sustainable meat

https://ourworldindata.org/less-meat-or-sustainable-meat
8•sohkamyung•52m ago•1 comments

Y Combinator says Google is a monopolist, no comment about its OpenAI ties

https://techcrunch.com/2025/05/13/y-combinator-says-google-is-a-monopolist-that-has-stunted-the-startup-ecosystem/
108•mastazi•2h ago•27 comments

OpenTelemetry protocol with Apache Arrow

https://opentelemetry.io/blog/2025/otel-arrow-phase-2/
55•tanelpoder•6h ago•13 comments

The world could run on older hardware if software optimization was a priority

https://twitter.com/ID_AA_Carmack/status/1922100771392520710
562•turrini•14h ago•541 comments

I learned Snobol and then wrote a toy Forth

https://ratfactor.com/snobol/
115•ingve•2d ago•30 comments

Turritopsis dohrnii: Immortal jellyfish

https://www.nhm.ac.uk/discover/immortal-jellyfish-secret-to-cheating-death.html
30•vinnyglennon•4d ago•6 comments

Using obscure graph theory to solve programming languages problems

https://reasonablypolymorphic.com/blog/solving-lcsa/
27•matt_d•4h ago•3 comments

Membrane: Media Framework for Elixir

https://membrane.stream/
112•lawik•3d ago•35 comments

Insurers launch cover for losses caused by AI chatbot errors

https://www.ft.com/content/1d35759f-f2a9-46c4-904b-4a78ccc027df
107•jmacd•2d ago•41 comments

In a high-stress work environment, prioritize relationships

https://wqtz.bearblog.dev/high-stress-job-relationships/
285•wqtz•11h ago•178 comments