OpenTelemetry protocol with Apache Arrow

https://opentelemetry.io/blog/2025/otel-arrow-phase-2/
108•tanelpoder•5mo ago

Comments

andygrove•5mo ago
I've just started exploring adding OpenTelemetry support to the Comet subproject of DataFusion. I'm excited to see the integration with Apache Arrow (Rust) and potentially DataFusion in the future.
SomaticPirate•5mo ago
Wow, is anyone able to provide an ELI5? OTel sounds amazing but this is flying over my head
theLiminator•5mo ago
Not sure, but seems like it will be producing apache arrow data and carrying it across the data stack end to end from OTEL. This would be great for creating data without a bunch of duplication/redundant processing steps and exporting it in a form that's ready to query.
piterrro•5mo ago
Unless I don't understand that fully (which could be the case).

This idea could fly if downstream readers are able to read it. JSON is great because anything can read it, process, transform and serialize it without having to know the intrinsics of the protocol.

What's the point of using a binary, columnar format for data in transit?

arccy•5mo ago
better compression https://opentelemetry.io/blog/2023/otel-arrow/

You don't do high performance without knowing the data schema.

odie5533•5mo ago
Is Arrow better than Parquet or Protobuf?
theLiminator•5mo ago
Arrow is an in-memory columnar format, kinda orthogonal to parquet (which is an at-rest format). Protobuf is a better comparison, but it's more message oriented and not suited for analytics.
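To make the row-versus-columnar distinction concrete, here is a minimal sketch using only the Python standard library. It does not use Arrow itself: the span records and the `to_columns` helper are hypothetical, and `array` stands in for the contiguous column buffers that a columnar format provides.

```python
from array import array

# Hypothetical span records, used only for illustration.
rows = [
    {"service": "checkout", "duration_ms": 12},
    {"service": "checkout", "duration_ms": 48},
    {"service": "search", "duration_ms": 7},
]


def to_columns(records):
    """Pivot a list of row dicts into one sequence per field."""
    return {
        "service": [r["service"] for r in records],
        # array("q") keeps the integers in a single contiguous buffer.
        "duration_ms": array("q", (r["duration_ms"] for r in records)),
    }


columns = to_columns(rows)

# Row layout: every record is visited just to read one field.
total_from_rows = sum(r["duration_ms"] for r in rows)

# Columnar layout: the aggregate scans one contiguous buffer.
total_from_columns = sum(columns["duration_ms"])
```

Both totals are the same; the difference is that the columnar version never touches the `service` values, which is the access pattern analytics queries tend to favor.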
arccy•5mo ago
the blog post comparison is against OTLP which is protobuf
datadrivenangel•5mo ago
Not having to write to disk is great, and zero-copy in memory access is instant...
phillipcarter•5mo ago
Warning: this is an oversimplification.

Performance optimization and being able to "plug in" to the data ecosystem that Apache Arrow exists in.

OpenTelemetry is pretty great for a lot of uses, but the protocol over the wire is too chunky for some applications. From last year's post on the topic[0]:

> In a side-by-side comparison between OpenTelemetry Protocol (“OTLP”) and OpenTelemetry Protocol with Apache Arrow for similarly configured traces pipelines, we observe 30% improvement in compression. Although this study specifically focused on traces data, we have observed results for logs and metrics signals in production settings too, where OTel-Arrow users can expect 50% to 70% improvement relative to OTLP for similar pipeline configurations.

For your average set of apps and services running in a k8s cluster somewhere in the cloud, this is just a nice-to-have, but size on wire is a problem for a lot of systems out there today, and they are precluded from adopting OpenTelemetry until that's solved.

[0]: https://opentelemetry.io/blog/2024/otel-arrow-production/
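A toy way to see why grouping values by column can help a general-purpose compressor, assuming made-up records and JSON plus zlib as stand-ins. This is neither OTLP (which is protobuf) nor the OTel-Arrow encoding, so the sizes it prints say nothing about the percentages quoted above; the field names and helper functions are hypothetical.

```python
import json
import zlib


def make_records(n):
    """Build n synthetic span-like records (hypothetical fields)."""
    return [
        {
            "service": "checkout",
            "status": 200,
            "duration_ms": i % 50,
            "ts": 1700000000 + i,
        }
        for i in range(n)
    ]


def row_bytes(records):
    """Serialize record by record, repeating every key in every record."""
    return json.dumps(records).encode("utf-8")


def column_bytes(records):
    """Serialize field by field, so similar values sit next to each other."""
    columns = {key: [r[key] for r in records] for key in records[0]}
    return json.dumps(columns).encode("utf-8")


records = make_records(1000)
row_size = len(zlib.compress(row_bytes(records)))
column_size = len(zlib.compress(column_bytes(records)))
print(f"row-oriented: {row_size} bytes, columnar: {column_size} bytes")
```

On this synthetic data the columnar payload compresses smaller, because runs of identical or near-identical values are adjacent. Real telemetry is far less regular, so treat this only as an illustration of the mechanism.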

potamic•5mo ago
This diagram really depicts things nicely

https://opentelemetry.io/blog/2023/otel-arrow/row-vs-columna...

ahoka•5mo ago
A bit hand wavy.
KAdot•5mo ago
> We are interested in making OTAP pipelines safely embeddable, through strict controls on memory and through support for thread-per-core runtimes.

I'm curious about the thread-per-core runtimes, are there even any mature thread-per-core runtimes in Rust around?

jauntywundrkind•5mo ago
glommio is pretty well respected. https://www.datadoghq.com/blog/engineering/introducing-glomm... https://github.com/DataDog/glommio

ByteDance also has their very fast monoio. https://github.com/bytedance/monoio

Both integrate io-uring support for very fast io.

julian-datable•5mo ago
Integrations with OTLP are critical to driving adoption and probably one of the biggest pain points we've encountered when adopting it ourselves (and encouraging others to do the same).

Adopting OTLP without third-party support is pretty time consuming, especially if your tech stack is large and/or varied.

Re runtimes: curious about this too. Feels like the right direction if you’re optimizing a telemetry pipeline.

akdor1154•5mo ago
Damn that's some scope creep if I ever saw it: 'try sending Arrow frames end to end' => 'rewrite the otel pipeline in rust'. Seems like the goals of the contributors don't exactly align with the goals of the project.

Kind of a bummer - one thing I was hoping would come out of this was better Arrow ecosystem support for golang.

gitroom•5mo ago
Man, I've dipped my toes into this too, and yeah, the way everyone wants different things always shakes things up fast. Kinda love seeing where it all ends up tbh.
mike_heffner•5mo ago
Thanks for sharing this — it’s a really promising direction. The advantages of Arrow for OTLP, especially when used end-to-end, are compelling given the protocol overhead of OTLP.

We’ve been thinking along similar lines with the use of Rust, particularly for OpenTelemetry collection in environments where high performance and low resource overhead are critical, such as edge and serverless. With that in mind, we’ve open-sourced a lightweight OpenTelemetry collector written in Rust to address these use cases. We’ve also developed a native Lambda extension around it, and have seen encouraging interest from folks aiming to improve cold start times.

The project is still fairly early, but we’re optimistic that Rust can open up new opportunities for efficient observability pipelines. Vendors like Datadog are also moving in this direction with their Lambda extension and appear to be adopting Rust more broadly for data-plane components.

If this resonates, feel free to take a look here: https://github.com/streamfold/rotel. We’d love to hear your thoughts on how this could be useful.
