I built this because I was tired of the compute markup that products like AWS EMR and Databricks charge for the convenience of running Apache Spark on their platforms. One can argue that Databricks is a superior product with a lot of additional value in its offering, but in my personal experience I don't see that with Apache Spark on AWS EMR at all.
My motivation was to let you create your own Apache Spark cluster without needing any understanding of the underlying data infrastructure engineering, and to get you quickly to the point of writing Spark pipelines, whether as Python applications or Jupyter notebooks, all with no markup on compute, because I don't think that markup is justified.
It took me almost a year to build alongside a day job. I used AI for the frontend design and video narration, but the infrastructure engineering behind it comes from quite a bit of industry experience. The backend that orchestrates the cluster is built with the following:
- Django and DRF for API
- Temporal for async workers
- Pulumi, run via Temporal workers, to provision the cluster (see the sketch after this list)
- Karpenter for node auto-scaling based on Spark executor workloads and requests
- LibreChat for Spark History Server access and MCP-based debugging of Spark pipeline runs
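
To give a flavor of how the Pulumi-via-Temporal piece fits together, here is a minimal sketch, not my actual code: a Temporal activity that drives Pulumi's Automation API to stand up a stack. The names (`provision_spark_cluster`, `spark_cluster_program`) and the placeholder S3 bucket are hypothetical; a real cluster program would declare EKS, node groups, Karpenter, and so on.

```python
# Minimal sketch: a Temporal activity that runs a Pulumi program.
# Assumes temporalio, pulumi, and pulumi-aws are installed and AWS
# credentials are configured. All names here are illustrative.
import pulumi
from pulumi import automation as auto
from temporalio import activity


def spark_cluster_program() -> None:
    """Pulumi program: declares the cloud resources for one cluster."""
    import pulumi_aws as aws

    # Placeholder resource; a real program would declare the EKS
    # cluster, node groups, Karpenter, Spark operator, etc.
    bucket = aws.s3.Bucket("spark-event-logs")
    pulumi.export("event_log_bucket", bucket.id)


@activity.defn
def provision_spark_cluster(cluster_name: str) -> str:
    """Temporal activity: runs `pulumi up` for the given cluster stack.

    Defined as a sync activity because stack.up() blocks; the worker
    runs it in its activity executor. Real code would also heartbeat.
    """
    stack = auto.create_or_select_stack(
        stack_name=cluster_name,
        project_name="spark-clusters",
        program=spark_cluster_program,
    )
    result = stack.up(on_output=activity.logger.info)
    return result.outputs["event_log_bucket"].value
```

The appeal of this shape is that Temporal gives you retries, timeouts, and durable state for what is otherwise a long-running, failure-prone provisioning step, while Pulumi's Automation API lets the infrastructure code live inside an ordinary Python function instead of a separate CLI invocation.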
There are currently no caps on CPU, so you can try this out today in your own personal AWS account for free.
I'm also looking for feedback here on HN.