
My GLM-5.1 coding agent scored 94.3% on LiveCodeBench Lite (348/369)

3•univence•1h ago
I’ve been building Univence, a custom autonomous coding agent platform powered by GLM-5.1.

We are building this to be a true Replit/Vercel competitor, but with zero vendor lock-in. You can build and develop entirely on our platform alongside our SOTA agent, but you own the code and can deploy it seamlessly to any 3rd-party host like DigitalOcean, Netlify, AWS, or your own VPS.

To prove the core agent's capability, we just ran it against the LiveCodeBench Lite dataset (Python split). Here is the breakdown over a blind 369-problem run:

    Total: 348/369 passed (94.3%)

    Easy: 138/141 passed (97.9%)

    Medium: 152/156 passed (97.4%)

    Hard: 58/72 passed (80.6%)
(Note: We reached 80.6% on Hard by constraining the agent to strictly prefer asymptotically efficient solutions, such as O(n log n), over brute-force O(n^2) ones. That avoids the Time Limit Exceeded errors that usually trip up standard wrappers.)
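
To make the trade-off in the note concrete, here is a minimal sketch in Python. It is not Univence's code and not a LiveCodeBench problem: the task (count pairs whose sum is at most a limit) and both function names are hypothetical, chosen only to contrast a brute-force O(n^2) solution with an O(n log n) one that returns the same answer.

```python
from typing import List


def count_pairs_brute(nums: List[int], limit: int) -> int:
    """O(n^2): check every pair (i, j) with i < j."""
    n = len(nums)
    return sum(
        1
        for i in range(n)
        for j in range(i + 1, n)
        if nums[i] + nums[j] <= limit
    )


def count_pairs_fast(nums: List[int], limit: int) -> int:
    """O(n log n): sort once, then move two pointers toward each other."""
    ordered = sorted(nums)
    lo, hi = 0, len(ordered) - 1
    count = 0
    while lo < hi:
        if ordered[lo] + ordered[hi] <= limit:
            # ordered[lo] forms a valid pair with every element at
            # positions lo+1 .. hi, because those are all <= ordered[hi].
            count += hi - lo
            lo += 1
        else:
            # The largest remaining element is too big for any partner.
            hi -= 1
    return count
```

Both functions agree on every input, but on inputs with hundreds of thousands of elements only the sorted two-pointer version is likely to finish inside a typical judge's time limit.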

But we aren't just building this for the tech. My co-founder is a Palestinian refugee currently living in the Gaza Strip, and we are launching this to drive immediate humanitarian impact. 100% of the platform's profits from 11 months of each year will be donated directly to support Palestinian refugees.

The agent is this good already, but I have a roadmap of architectural ideas to make it even better. Right now, I'm looking for fast angel funding, compute sponsorships, or strategic partners to help us scale this ASAP.

    Try it out: https://univence.com

    Raw JSONL trajectory logs: https://github.com/UnivenceAI/Univence-benchmarks/tree/main/Z%20AI/GLM-5.1

    Follow our progress & proof of donations: https://x.com/UnivenceAI
I would love any feedback on the platform or the agent architecture. If you are an investor or want to support the mission, my DMs are open on X, or you can reach us at univenceai@gmail.com.
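
For readers who want to re-derive the pass rates from the raw logs, the following is a rough sketch of a tally script. The schema is an assumption: the keys "difficulty" and "passed" are hypothetical and may not match the published JSONL files, so adjust them after inspecting a real record. The helper names `read_jsonl` and `tally` are also illustrative.

```python
import json
from collections import Counter
from typing import Dict, Iterable, Iterator, Tuple


def read_jsonl(path: str) -> Iterator[dict]:
    """Yield one JSON object per non-empty line of a JSONL file."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            if line.strip():
                yield json.loads(line)


def tally(records: Iterable[dict]) -> Dict[str, Tuple[int, int, float]]:
    """Return {difficulty: (passed, total, percent)} plus a "total" row.

    Assumes each record has the hypothetical keys "difficulty" (str)
    and "passed" (bool); these are not confirmed by the source.
    """
    passed: Counter = Counter()
    total: Counter = Counter()
    for record in records:
        level = record["difficulty"]
        total[level] += 1
        if record["passed"]:
            passed[level] += 1

    # Per-difficulty rows, rounded to one decimal place.
    result = {
        level: (passed[level], count, round(100 * passed[level] / count, 1))
        for level, count in total.items()
    }
    # Overall row, skipped when there are no records at all.
    all_total = sum(total.values())
    if all_total:
        all_passed = sum(passed.values())
        result["total"] = (
            all_passed,
            all_total,
            round(100 * all_passed / all_total, 1),
        )
    return result
```

Under those assumptions, `tally(read_jsonl("run.jsonl"))` on a file whose per-difficulty counts match the breakdown above (138/141, 152/156, 58/72) yields 97.9, 97.4, and 80.6 for the three tiers and 94.3 overall; `run.jsonl` is a placeholder file name.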

SonicMoE: A HW-Efficient and SW-Extensible Blueprint for Fine-Grained MoEs

https://dao-lab.ai/blog/2026/sonicmoe-blackwell/
1•matt_d•1m ago•0 comments

Tesla admits HW3 owners need upgrades for true 'Full Self-Driving'

https://techcrunch.com/2026/04/22/elon-musk-admits-millions-of-tesla-owners-need-upgrades-for-tru...
1•mfiguiere•1m ago•0 comments

The price of software is going to zero

https://blog.sledgeworx.dev/software-going-to-zero/
1•Sevii•2m ago•0 comments

SAW-INT4: System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving

https://arxiv.org/abs/2604.19157
1•matt_d•2m ago•0 comments

A Data-Driven Machine Learning Framework for Optimising Programmable Terahertz

https://www.researchgate.net/publication/404050094_A_Data-Driven_Machine_Learning_Framework_for_O...
1•f0r3st•3m ago•0 comments

Show HN: Forge-Core released on GitHub, Parse JSON in your data warehouse

https://github.com/foxtrotcommunications/foxtrotcommunications-forge-core
1•brady_bastian•7m ago•0 comments

DIRT: Database-Integrated Random Testing

https://arxiv.org/abs/2604.16373
1•matt_d•8m ago•0 comments

Lute: A Standalone Runtime for Luau

https://lute.luau.org/
2•vrn-sn•10m ago•1 comments

Instagram is testing premium features

https://www.cbc.ca/news/business/instagram-plus-rollout-9.7172486
1•fbelzile•11m ago•0 comments

Design.md: The New Open Contract Between Designers and AI

https://kyanfeat.substack.com/p/designmd-the-new-open-contract-between
1•kyanfeat•12m ago•0 comments

Willow

https://github.com/Ghosthx-Code/willow
1•Ghosthx-Code•14m ago•0 comments

Americans Turning to AI to Supplement Healthcare Visits

https://news.gallup.com/poll/707789/americans-turning-supplement-healthcare-visits.aspx
1•hn_acker•14m ago•0 comments

Qwen3.5-Omni Technical Report

https://arxiv.org/abs/2604.15804
1•gmays•14m ago•0 comments

Monitoring Data Quality in Probability-Based Internet Panels

https://news.gallup.com/opinion/methodology/708383/monitoring-data-quality-probability-based-inte...
1•hn_acker•14m ago•0 comments

Four Stable Kernels for Wednesday

https://lwn.net/Articles/1068981/
1•kazu11max17•17m ago•0 comments

When Your Digital Life Vanishes

https://www.newyorker.com/magazine/2026/04/27/when-your-digital-life-vanishes
3•benbreen•20m ago•0 comments

The IOC's decision to protect the female category is a victory for fairness

https://www.theguardian.com/commentisfree/2026/apr/21/ioc-decision-female-category-olympics-trans...
1•vlebb•23m ago•0 comments

In two years nobody will care if actors are AI or not–director Mathieu Kassovitz

https://www.theguardian.com/film/2026/apr/22/actors-ai-la-haine-director-mathieu-kassovitz
2•bookofjoe•24m ago•1 comments

gRPC benchmark results 2026-04-23

https://github.com/LesnyRumcajs/grpc_bench/discussions/559
1•materialferret•25m ago•0 comments

GitHub Driven RSS Feeds: Paul Graham, Anthropic, and More

https://github.com/Olshansk/rss-feeds
2•Olshansky•26m ago•1 comments

Bring your own Agent to MS Teams

https://microsoft.github.io/teams-sdk/blog/bring-your-agent-to-teams/
4•umangsehgal93•26m ago•0 comments

You want your Moon landings in HD? So does NASA

https://arstechnica.com/space/2026/04/you-want-your-moon-landings-in-hdtv-so-does-nasa-heres-how-...
2•jnord•27m ago•0 comments

Exercise and Weekly Sirolimus (Rapamycin) in Older Adults (Trial)

https://onlinelibrary.wiley.com/doi/10.1002/jcsm.70274
1•evo_9•29m ago•0 comments

HEPA air purifiers may boost brain power in adults over 40

https://medicalxpress.com/news/2026-04-hepa-air-purifiers-boost-brain.html
2•OutOfHere•30m ago•0 comments

Discover the Robot Athlete That Competes with Professional Table Tennis Players

https://ai.sony/blog/inside-project-ace-discover-the-robot-athlete-that-competes-with-professiona...
2•dbcooper•32m ago•0 comments

Andreessen, Thrive Poised for Windfall from SpaceX's Cursor Bid

https://www.bloomberg.com/news/articles/2026-04-22/andreessen-thrive-poised-for-windfall-from-spa...
1•petethomas•33m ago•0 comments

A new programming model for durable execution

https://vercel.com/blog/a-new-programming-model-for-durable-execution
2•gmays•33m ago•0 comments

Show HN: Archon-memory-core – agent memory that resolves contradictions

https://divergencerouter.com/amc/
1•Divergence42•35m ago•0 comments

Scaling Test-Time Compute for Agentic Coding

https://arxiv.org/abs/2604.16529
1•matt_d•36m ago•0 comments

Opus 4.7 is having a rough day. double check its work

https://imgur.com/a/eg5zL1u
1•prallo•37m ago•0 comments