I built Crawlee Cloud, an open-source, self-hosted platform that lets you run Crawlee and Apify Actors on your own infrastructure.
The problem: The Apify ecosystem (Crawlee, SDK, Actors) is fantastic for web scraping, but it's tied to their cloud. If you want to keep your data on-prem, run on your own servers, or save on costs at scale, you're stuck.
The solution: Crawlee Cloud implements Apify's REST API so your existing Actors work without code changes. Just point APIFY_API_BASE_URL to your own server.
What's included:
SDK compatible: Datasets, Key-Value Stores, Request Queues all work
Docker-based: Each Actor runs in an isolated container
Dashboard: Monitor runs, explore datasets, manage Actors
CLI: Push, run, and manage Actors from your terminal
Stack: Node.js, Fastify, PostgreSQL, Redis, S3/MinIO, Next.jsGitHub: https://github.com/crawlee-cloud/crawlee-cloud
Happy to answer questions!