frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Open-Source Refact.ai Agent is #1 on SWE-bench Lite With a 59.7% Score

https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-source-refact-ai/
3•kate_at_refact•1y ago

Comments

kate_at_refact•1y ago
Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score. Our approach: fully autonomous Agent, no manual intervention needed.

How we did this:

• Prompt strategy: https://github.com/smallcloudai/refact/blob/swe-boosted-prom... • Claude 3.7 Sonnet as orchestrator • deep_analysis() tool (powered by o4-mini) for reasoning • Tool suite for repository exploration, code modification, and testing. Used dynamically based on task needs • One correct solution through iteration!

Autonomy = our core strength.

Refact.ai Agent completes the entire dev workflow independently: plans, executes, tests, self-corrects, and delivers a production-ready result. For each task, it made one multi-step run to generate a single correct solution, creating custom strategies rather than following rigid scripts.

You can read tech details on our SWE-bench approach: https://refact.ai/blog/2025/sota-on-swe-bench-lite-open-sour...

Your questions are welcome! Also, welcome to try Refact.ai Agent in VS Code and Jet Brains: https://linktr.ee/refactai

Texas Tech cautions broadcasting research restrictions to prospective students

https://www.texastribune.org/2026/05/06/texas-tech-university-graduate-research-limit-warnings/
1•hn_acker•2m ago•0 comments

If December Was Too Late to Fix Unconstitutional Gerrymandering Why Is May Okay?

https://www.techdirt.com/2026/05/05/someone-ask-alito-if-december-was-too-late-to-fix-unconstitut...
1•hn_acker•5m ago•1 comments

The people preserving the scientific practice of bird banding

https://thenarwhal.ca/bird-banding-ontario/
2•bookofjoe•10m ago•0 comments

Android Bench – Model Evals

https://developer.android.com/bench
1•vthallam•12m ago•0 comments

Spent $130K+ AI token "cloned" Screen Studio: AGI for software feel so close

https://realmikechong.substack.com/p/spent-130k-cloned-screen-studio-and
2•imWildCat•13m ago•0 comments

Apple Could Be Working on 'Spatial iPhone' with Holographic Display

https://www.macrumors.com/2026/05/07/apple-working-on-spatial-iphone/
3•mgh2•14m ago•0 comments

The unlikely story of an email time machine

https://www.scientificamerican.com/article/forbes-email-time-capsule-communicating-future/
2•baud147258•15m ago•0 comments

Ramp in talks to hit $40B+ valuation, 6 months after reaching $32B

https://techcrunch.com/2026/05/07/ramp-in-talks-to-hit-40b-valuation-6-months-after-reaching-32b/
1•SilverElfin•16m ago•0 comments

Retrotechnology Media

http://www.typewritten.org/Media/
1•davikr•17m ago•0 comments

AI Agents are sending flowers to people

https://twitter.com/postalform/status/2052517791899570318
1•zavtra•18m ago•1 comments

The effect of personalized values screens on portfolio returns

https://stevenmackey.substack.com/p/how-much-return-are-you-giving-up-478
1•chibg10•29m ago•0 comments

Designing Analog Chips [pdf]

http://www.designinganalogchips.com/_count/designinganalogchips.pdf
3•whatisabcdefgh•32m ago•0 comments

Find out why Elon gave over his keys to Anthropic He is right can't win this

https://deepseekresearch.com/models.html
1•oroboroslabs•37m ago•1 comments

A new experiment deepens the mystery over gravitational constant, Big G

https://www.cnn.com/2026/05/07/science/gravitational-constant-measure-gravity-big-g
1•rramadass•41m ago•1 comments

Gambling ads on social media reach more than twice as many men as women: study

https://www.cam.ac.uk/research/news/gambling-ads-on-social-media-reach-more-than-twice-as-many-me...
2•hhs•42m ago•0 comments

Where the Curves Cross

https://whattotelltherobot.com/p/where-the-curves-cross
2•stefie10•43m ago•0 comments

AI Bots Auditioning for Wall Street Are Mostly Losing

https://www.fa-mag.com/news/ai-bots-auditioning-for-wall-street-trading-are-mostly-losing-86902.html
1•izyda•47m ago•1 comments

Researchers discover advanced language processing in the unconscious human brain

https://www.bcm.edu/news/researchers-discover-advanced-language-processing-in-the-unconscious-hum...
4•hhs•53m ago•0 comments

Show HN: Blamo A vibecoded app for vibecoding vibe games

https://www.blamo.ai/
1•semateos•53m ago•1 comments

Show HN: Notion-to-site – sync any Notion database to local Markdown/MDX/JSON

https://github.com/rashidazarang/notion-to-site
2•rashidae•55m ago•0 comments

ICE Plans to Develop Own Smart Glasses

https://www.404media.co/ice-plans-to-develop-own-smart-glasses-to-supplement-its-facial-recogniti...
7•cdrnsf•57m ago•1 comments

Maybe you shouldn't install new software for a bit

https://xeiaso.net/blog/2026/abstain-from-install/
28•psxuaw•57m ago•4 comments

The Mounting Toll of Multi-Year Funding on American Biomedical Research [pdf]

https://actfornih.org/wp-content/uploads/2026/05/A4N_Updated-MYF-One-Pager_May-2026_FINAL.pdf
2•petethomas•58m ago•0 comments

[dupe] Cloudflare is laying off 1,100 employees

https://www.businessinsider.com/cloudflare-announces-1100-layoffs-amid-ai-focus-shift-2026-5
13•cdrnsf•1h ago•3 comments

Show HN: Kill-The-Backlog, self-hosted background agents

https://github.com/jvaill/Kill-The-Backlog
3•jvaill•1h ago•0 comments

As Russia Expands Internet Blackouts, Kremlin Tells Citizens to Use the Radio

https://united24media.com/latest-news/as-russia-expands-internet-blackouts-kremlin-tells-citizens...
4•hkmaxpro•1h ago•0 comments

Nonprofit hospitals spend billions on consultants with no clear effect

https://www.uchicagomedicine.org/forefront/research-and-discoveries-articles/nonprofit-hospitals-...
4•hhs•1h ago•0 comments

Mirror Neuron

https://en.wikipedia.org/wiki/Mirror_neuron
2•kristianpaul•1h ago•0 comments

Agentic Artificial Intelligence in Finance

https://arxiv.org/abs/2604.21672
2•nhatcher•1h ago•0 comments

CSP2XSS: Vulnerability in Next.js App Router

https://aisafe.io/blog/csp2xss-nextjs-vulnerability
2•adragos_•1h ago•0 comments