
Ask HN: LLM is useless without explicit prompt

4•revskill•7mo ago
After months of playing with LLMs, here are my observations:

- LLM is basically useless without explicit intent in your prompt.

- LLMs fail to correct themselves. Once one generates bullshit, it gets stuck in an infinite loop of generating more bullshit.

The question is: without an explicit prompt, could an LLM at least apply best practices and produce maintainable code without me instructing it?

Comments

ben_w•7mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving into whatever we're self-motivated to do.

What does differ is the time scale of the feedback loop with management:

Human meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).
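For anyone who wants to check those two figures, here is a rough sketch of the arithmetic. The comment does not state a doubling time; the 7-month value below is an assumption chosen because it reproduces the numbers quoted, as are the 8-hour workday and the 10-working-day sprint. The helper name `months_until` is made up for illustration.

```python
import math

# Assumption (not stated in the comment): the task time horizon doubles
# roughly every 7 months.
DOUBLING_MONTHS = 7

def months_until(current_minutes, target_minutes, doubling_months=DOUBLING_MONTHS):
    """Months until the time horizon grows from current to target,
    assuming steady exponential growth with the given doubling time."""
    doublings = math.log2(target_minutes / current_minutes)
    return doublings * doubling_months

WORKDAY_MINUTES = 8 * 60               # assumed 8-hour working day
SPRINT_MINUTES = 10 * WORKDAY_MINUTES  # assumed 2 weeks x 5 working days

# 50% horizon: 1 hour today -> one working day
print(months_until(60, WORKDAY_MINUTES))      # 3 doublings at 7 months each

# 80% horizon: 10 minutes today -> one 2-week sprint
print(months_until(10, SPRINT_MINUTES) / 12)  # in years
```

Under these assumptions the first figure comes out at 21 months and the second at a little over 5 years, matching the estimates above. A different doubling time or a different definition of a sprint shifts both numbers proportionally.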

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as architecture and maintainability are longer-horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•7mo ago
They're not as high as you think.

LLMs fail at the most basic things related to maintainable code. Their code is basically a hacky mess without any structure at all.

My expectation is that, at least, some kind of maintainable code is generated from what it has learnt.

ben_w•7mo ago
Given your expectation:

> My expectation is that, at least, some kind of maintainable code is generated from what it has learnt.

And your observation:

> LLMs fail at the most basic things related to maintainable code. Their code is basically a hacky mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.