frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: FLE v0.3 – Claude Code Plays Factorio

https://jackhopkins.github.io/factorio-learning-environment/versions/0.3.0.html
19•noddybear•1h ago
We're excited to release v0.3.0 of the Factorio Learning Environment (FLE), an open-source environment for evaluating AI agents on long-horizon planning, spatial reasoning, and automation tasks.

== What is FLE? ==

FLE uses the game Factorio to test whether AI can handle complex, open-ended engineering challenges. Agents write Python code to build automated factories, progressing from simple resource extraction (~30 units/min) to sophisticated production chains (millions of units/sec).

== What's new in 0.3.0 ==

- Headless scaling: No longer needs the game client, enabling massive parallelization!

- OpenAI Gym compatibility: Standard interface for RL research

- Claude Code integration: We're livestreaming Claude playing Factorio [on Twitch](http://twitch.tv/playsfactorio)

- Better tooling and SDK: 1-line CLI commands to run evaluations (with W&B logging)

== Key findings ==

We evaluated frontier models (Claude Opus 4.1, GPT-5, Gemini 2.5 Pro, Grok 4) on 24 production automation tasks of increasing complexity.

Even the best models struggle:

- Most models still rely on semi-manual strategies rather than true automation

- Agents rarely define helper functions or abstractions, limiting their ability to scale

- Error recovery remains difficult – agents often get stuck in repetitive failure loops

The performance gap between models on FLE correlates more closely with real-world task benchmarks (like GDPVal) than with traditional coding/reasoning evals.

== Why this matters ==

Unlike benchmarks based on exams that saturate quickly, Factorio's exponential complexity scaling means there's effectively no performance ceiling. The skills needed - system debugging, constraint satisfaction, logistics optimization - transfer directly to real challenges.

== Try it yourself ==

>>> uv add factorio-learning-environment

>>> uv add "factorio-learning-environment[eval]"

>>> fle cluster start

>>> fle eval --config configs/gym_run_config.json

We're looking for researchers, engineers, and modders interested in pushing the boundaries of agent capabilities. Join our Discord if you want to contribute. We look forward to meeting you and seeing what you can build!

-- FLE Team

ROS – Robot Operating System

https://www.ros.org/
1•welovebunnies•49s ago•0 comments

The phenomenon of eternal network applications

https://medium.com/@master.oleg255/the-phenomenon-of-eternal-network-applications-6a6f400966b6
1•master255•1m ago•0 comments

Baudot Code

https://www.dcode.fr/baudot-code
1•andsoitis•2m ago•0 comments

Testing time (and other asynchronicities) in Golang

https://go.dev/blog/testing-time
1•fanf2•4m ago•0 comments

Interstellar Object 3I/Atlas Passed Mars Last Night

https://earthsky.org/space/new-interstellar-object-candidate-heading-toward-the-sun-a11pl3z/
2•jandrewrogers•5m ago•0 comments

Fast SSIMULACRA2 Implementation in Zig

https://github.com/gianni-rosato/fssimu2
1•computerbuster•10m ago•0 comments

Sweden's National Bank Introduces Mandate for Offline Card Payments

https://www.riksbank.se/en-gb/press-and-published/notices-and-press-releases/press-releases/2025/...
2•sebiw•10m ago•0 comments

A FOSS project to create an Artificial Mind

2•apiemotion•11m ago•0 comments

Show HN: Docc – AI-generated code walkthroughs with narration

https://github.com/RuliLG/docc
1•sandandcode•11m ago•0 comments

Pretraining Under Infinite Compute

https://arxiv.org/abs/2509.14786
1•jedharris•13m ago•1 comments

QuakeEd 2.0 Level Editor (NeXTSTEP)

https://archive.org/details/QuakeEd
2•klaussilveira•16m ago•0 comments

Seattle residents report encampments in record numbers

https://www.seattletimes.com/seattle-news/homeless/seattle-residents-report-encampments-in-record...
4•petethomas•16m ago•0 comments

AWS API MCP Server v1.0.0 release

https://aws.amazon.com/about-aws/whats-new/2025/10/aws-api-mcp-server-v1-0-0-release/
1•rmason•16m ago•0 comments

Associated terminal, qwebengine in C++ Qt% same app

https://github.com/zebulon75018/termweb
1•zebulon75018•18m ago•1 comments

Huawei's AI accelerator roadmap, claims it to makes the mightiest clusters

https://www.theregister.com/2025/09/18/huawei_ascend_roadmap/
1•PaulHoule•18m ago•0 comments

State of Global Water Resources 2024

https://wmo.int/publication-series/state-of-global-water-resources-2024
1•measurablefunc•19m ago•0 comments

Unity Platform Protection: Developer Remediation Guide

https://unity.com/security/sept-2025-01/remediation
1•personjerry•21m ago•0 comments

Reddit's Former CEO Wants You to Buy a Subscription for Trees

https://www.bloomberg.com/news/articles/2025-10-03/reddit-s-former-ceo-wants-you-to-buy-a-subscri...
2•edran•21m ago•0 comments

Statue of Trump and Epstein Holding Hands Returns to National Mall

https://www.nytimes.com/2025/10/03/us/donald-trump-jeffrey-epstein-statue-washington.html
4•me-vs-cat•23m ago•0 comments

Show HN: Beacon (open source) – Built after AWS billed me 700% more for RDS

https://beaconinfra.dev
1•matebajusz•24m ago•0 comments

Hamas says it agrees to free all hostages, enter Gaza deal talks

https://www.jpost.com/israel-news/article-869351
7•SilverElfin•25m ago•0 comments

I made an open-source version of Imagine with Claude

https://github.com/joshbickett/generative-computer
2•bickett•30m ago•1 comments

The Invisible Curriculum of Research

http://muratbuffalo.blogspot.com/2025/10/the-invisible-curriculum-of-research.html
4•eatonphil•32m ago•0 comments

Clock Made of Clocks

https://codepen.io/EntropyReversed/pen/QwybYEJ?editors=0100
3•liquid99•32m ago•0 comments

Claremont McKenna College – Robert Day Sciences Center

https://big.dk/projects/claremont-mckenna-college-14115
1•gnabgib•33m ago•0 comments

One Year of PostgreSQL Hacking Workshops

http://rhaas.blogspot.com/2025/07/one-year-of-hacking-workshops.html
1•eatonphil•34m ago•0 comments

AI is reshaping childhood in China

https://restofworld.org/2025/ai-china-childhood/
5•devonnull•34m ago•1 comments

Captcha Challenge: write code that allows a robot to solve this puzzle

https://captcha.garambrogne.net/
1•kadrek•35m ago•1 comments

Bean CMS: A micro-CMS built with redbean

https://github.com/kevinfiol/beancms
1•indigodaddy•35m ago•0 comments

Toyoake, Japan rule limits digital device use to 2 hours/day outside work/school

https://www.nytimes.com/2025/10/01/world/asia/japan-smartphones-ban-toyoake.html
5•bookofjoe•36m ago•2 comments