OpenTelemetry protocol with Apache Arrow

https://opentelemetry.io/blog/2025/otel-arrow-phase-2/
108•tanelpoder•1mo ago

Comments

andygrove•1mo ago
I've just started exploring adding OpenTelemetry support to the Comet subproject of DataFusion. I'm excited to see the integration with Apache Arrow (Rust) and potentially DataFusion in the future.
SomaticPirate•1mo ago
Wow, anyone able to provide an ELI5? OTel sounds amazing but this is flying over my head
theLiminator•1mo ago
Not sure, but seems like it will be producing apache arrow data and carrying it across the data stack end to end from OTEL. This would be great for creating data without a bunch of duplication/redundant processing steps and exporting it in a form that's ready to query.
piterrro•1mo ago
Unless I don't understand it fully (which could be the case).

This idea could fly if downstream readers are able to read it. JSON is great because anything can read, process, transform and serialize it without having to know the intrinsics of the protocol.

What's the point of using a binary, columnar format for data in transit?

arccy•1mo ago
better compression https://opentelemetry.io/blog/2023/otel-arrow/

You don't do high performance without knowing the data schema.
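
As a rough illustration of the compression point (a toy sketch, not the OTel-Arrow encoding; the records, field names and the use of JSON plus zlib are all assumptions made for the example), the same data can be serialized row by row and column by column and then compressed:

```python
import json
import zlib

# Hypothetical telemetry records; field names and values are made up.
SERVICES = ["api", "auth", "db"]
rows = [
    {"service": SERVICES[i % 3], "status": 200, "duration_ms": i % 50}
    for i in range(1000)
]

# Row-oriented: one self-describing JSON object per record.
row_bytes = "\n".join(json.dumps(r) for r in rows).encode()

# Column-oriented: one list per field, so similar values sit next to each other
# and the field names are written once instead of once per record.
columns = {key: [r[key] for r in rows] for key in rows[0]}
col_bytes = json.dumps(columns).encode()

row_size = len(zlib.compress(row_bytes))
col_size = len(zlib.compress(col_bytes))
print(f"row-oriented: {row_size} bytes compressed")
print(f"columnar:     {col_size} bytes compressed")
```

On this synthetic data the columnar form comes out smaller, but the exact ratio depends heavily on the data and the codec, so it says nothing about the figures in the blog post.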

odie5533•1mo ago
Is Arrow better than Parquet or Protobuf?
theLiminator•1mo ago
Arrow is an in-memory columnar format, kinda orthogonal to parquet (which is an at-rest format). Protobuf is a better comparison, but it's more message oriented and not suited for analytics.
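
To make the row versus columnar distinction concrete, here is a minimal pure-Python sketch. It does not use the Arrow library, and the records and field names are hypothetical:

```python
from array import array

# Row-oriented (message style): each record's fields are stored together.
rows = [
    ("api", 200, 12),
    ("auth", 500, 48),
    ("api", 200, 7),
]

# Column-oriented: each field is stored contiguously with a single type.
services = [r[0] for r in rows]
statuses = array("i", (r[1] for r in rows))
durations = array("i", (r[2] for r in rows))

# An analytic query only touches the columns it needs.
mean_duration = sum(durations) / len(durations)
error_count = sum(1 for s in statuses if s >= 500)
print(mean_duration, error_count)
```

Computing the mean duration reads one contiguous array instead of walking every record, which is the property that makes columnar layouts a good fit for analytics.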
arccy•1mo ago
the blog post comparison is against OTLP which is protobuf
datadrivenangel•1mo ago
Not having to write to disk is great, and zero-copy in memory access is instant...
phillipcarter•1mo ago
Warning: this is an oversimplification.

Performance optimization and being able to "plug in" to the data ecosystem that Apache Arrow exists in.

OpenTelemetry is pretty great for a lot of uses, but the protocol over the wire is too chunky for some applications. From last year's post on the topic[0]:

> In a side-by-side comparison between OpenTelemetry Protocol (“OTLP”) and OpenTelemetry Protocol with Apache Arrow for similarly configured traces pipelines, we observe 30% improvement in compression. Although this study specifically focused on traces data, we have observed results for logs and metrics signals in production settings too, where OTel-Arrow users can expect 50% to 70% improvement relative to OTLP for similar pipeline configurations.

For your average set of apps and services running in a k8s cluster somewhere in the cloud, this is just a nice-to-have, but size on wire is a problem for a lot of systems out there today, and they are precluded from adopting OpenTelemetry until that's solved.

[0]: https://opentelemetry.io/blog/2024/otel-arrow-production/

potamic•1mo ago
This diagram really depicts things nicely

https://opentelemetry.io/blog/2023/otel-arrow/row-vs-columna...

ahoka•1mo ago
A bit hand wavy.
KAdot•1mo ago
> We are interested in making OTAP pipelines safely embeddable, through strict controls on memory and through support for thread-per-core runtimes.

I'm curious about the thread-per-core runtimes: are there even any mature thread-per-core runtimes in Rust around?

jauntywundrkind•1mo ago
glommio is pretty well respected. https://www.datadoghq.com/blog/engineering/introducing-glomm... https://github.com/DataDog/glommio

ByteDance also has their very fast monoio. https://github.com/bytedance/monoio

Both integrate io_uring support for very fast I/O.

julian-datable•1mo ago
Integrations with OTLP are critical to driving adoption and probably one of the biggest pain points we've encountered when adopting it ourselves (and encouraging others to do the same).

Adopting OTLP without third-party support is pretty time-consuming, especially if your tech stack is large and/or varied.

Re runtimes: curious about this too. Feels like the right direction if you’re optimizing a telemetry pipeline.

akdor1154•1mo ago
Damn that's some scope creep if I ever saw it: 'try sending Arrow frames end to end' => 'rewrite the otel pipeline in rust'. Seems like the goals of the contributors don't exactly align with the goals of the project.

Kind of a bummer - one thing I was hoping to come out of this was better Arrow ecosystem support for golang.

gitroom•1mo ago
Man, I've dipped my toes into this too, and yeah, the way everyone wants different things always shakes things up fast. Kinda love seeing where it all ends up tbh.
mike_heffner•1mo ago
Thanks for sharing this — it’s a really promising direction. The advantages of Arrow for OTLP, especially when used end-to-end, are compelling given the protocol overhead of OTLP.

We’ve been thinking along similar lines with the use of Rust, particularly for OpenTelemetry collection in environments where high performance and low resource overhead are critical, such as edge and serverless. With that in mind, we’ve open-sourced a lightweight OpenTelemetry collector written in Rust to address these use cases. We’ve also developed a native Lambda extension around it, and have seen encouraging interest from folks aiming to improve cold start times.

The project is still fairly early, but we’re optimistic that Rust can open up new opportunities for efficient observability pipelines. Vendors like Datadog are also moving in this direction with their Lambda extension and appear to be adopting Rust more broadly for data-plane components.

If this resonates, feel free to take a look here: https://github.com/streamfold/rotel. We’d love to hear your thoughts on how this could be useful.
