frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Andrej Karpathy – It will take a decade to work through the issues with agents

https://www.dwarkesh.com/p/andrej-karpathy
647•ctoth•12h ago•638 comments

New Work by Gary Larson

https://www.thefarside.com/new-stuff
193•jkestner•8h ago•36 comments

The Unix Executable as a Smalltalk Method [pdf]

https://programmingmadecomplicated.wordpress.com/wp-content/uploads/2025/10/onward25-jakubovic.pdf
61•pcfwik•5h ago•6 comments

StageConnect: Behringer protocol is open source

https://github.com/OpenMixerProject/StageConnect
3•jdboyd•29m ago•1 comments

The pivot

https://www.antipope.org/charlie/blog-static/2025/10/the-pivot-1.html
261•AndrewDucker•10h ago•118 comments

PlayStation 3 Architecture (2021)

https://www.copetti.org/writings/consoles/playstation-3
121•adamwk•4d ago•23 comments

Exploring PostgreSQL 18's new UUIDv7 support

https://aiven.io/blog/exploring-postgresql-18-new-uuidv7-support
202•s4i•2d ago•148 comments

Live Stream from the Namib Desert

https://bookofjoe2.blogspot.com/2025/10/live-stream-from-namib-desert.html
447•surprisetalk•17h ago•84 comments

Claude Skills are awesome, maybe a bigger deal than MCP

https://simonwillison.net/2025/Oct/16/claude-skills/
488•weinzierl•12h ago•270 comments

WebMCP

https://github.com/jasonjmcghee/WebMCP
66•sanj•8h ago•18 comments

Show HN: ServiceRadar – open-source Network Observability Platform

https://github.com/carverauto/serviceradar
19•carverauto•4h ago•1 comments

If the Gumshoe Fits: The Thomas Pynchon Experience

https://www.bookforum.com/print/3202/if-the-gumshoe-fits-62416
18•prismatic•1w ago•0 comments

Tahoe's Elephant

https://eclecticlight.co/2025/10/12/last-week-on-my-mac-tahoes-elephant/
28•GavinAnderegg•5d ago•15 comments

Claude Code vs. Codex: I built a sentiment dashboard from Reddit comments

https://www.aiengineering.report/p/claude-code-vs-codex-sentiment-analysis-reddit
85•waprin•1d ago•37 comments

EVs are depreciating faster than gas-powered cars

https://restofworld.org/2025/ev-depreciation-blusmart-collapse/
312•belter•19h ago•726 comments

Spray Cooling – Recreating Supercomputer Cooling on a Desktop CPU [video]

https://www.youtube.com/watch?v=yEBSuk20gvc
6•zdw•5d ago•3 comments

The Majority AI View

https://www.anildash.com//2025/10/17/the-majority-ai-view/
16•Bogdanp•1h ago•4 comments

The Rapper 50 Cent, Adjusted for Inflation

https://50centadjustedforinflation.com/
580•gaws•13h ago•156 comments

4Chan Lawyer publishes Ofcom correspondence

https://alecmuffett.com/article/117792
366•alecmuffett•22h ago•480 comments

Asking AI to build scrapers should be easy right?

https://www.skyvern.com/blog/asking-ai-to-build-scrapers-should-be-easy-right/
94•suchintan•11h ago•42 comments

The Wi-Fi Revolution (2003)

https://www.wired.com/2003/05/wifirevolution/
71•Cieplak•5d ago•48 comments

When if is just a function

https://ryelang.org/blog/posts/if-as-function-blogpost-working-on-it_ver1/
37•soheilpro•3d ago•43 comments

Researchers Discover the Optimal Way to Optimize

https://www.quantamagazine.org/researchers-discover-the-optimal-way-to-optimize-20251013/
36•jnord•4d ago•7 comments

MIT physicists improve the precision of atomic clocks

https://news.mit.edu/2025/mit-physicists-improve-atomic-clocks-precision-1008
78•pykello•6d ago•33 comments

Amazon’s Ring to partner with Flock

https://techcrunch.com/2025/10/16/amazons-ring-to-partner-with-flock-a-network-of-ai-cameras-used...
494•gman83•21h ago•434 comments

Intercellular communication in the brain through a dendritic nanotubular network

https://www.science.org/doi/10.1126/science.adr7403
268•marshfram•14h ago•214 comments

NeXT Computer Offices

https://archive.org/details/NeXTComputerOffices
83•walterbell•6h ago•14 comments

Cyberpsychology's Influence on Modern Computing

https://cacm.acm.org/research/cyberpsychologys-influence-on-modern-computing/
5•pseudolus•5d ago•0 comments

Smithsonian Open Access

https://www.si.edu/openaccess
59•bookofjoe•3d ago•9 comments

How I bypassed Amazon's Kindle web DRM

https://blog.pixelmelt.dev/kindle-web-drm/
1584•pixelmelt•1d ago•482 comments
Open in hackernews

Asking AI to build scrapers should be easy right?

https://www.skyvern.com/blog/asking-ai-to-build-scrapers-should-be-easy-right/
94•suchintan•11h ago

Comments

showerst•10h ago
A point orthogonal to this; consider whether you need browser automation at all.

If a website isn't using Cloudflare or a JS-only design, it's generally better to skip playwright. All the major AIs understand beautifulsoup pretty well, and they're likely to write you a faster, less brittle scraper.

pavel_lishin•10h ago
If.
Etheryte•9h ago
The vast majority of the modern internet falls into one of those two buckets though, no?
showerst•9h ago
I mostly scrape government data so the sites are a little 'behind' on that trend, but no. Even JS heavy sites are almost always pulling from a JSON or graphql source under the hood.

At scale, dropping the heavier dependencies and network traffic of a browser is meaningful.

suchintan•9h ago
Yeah, reverse engineering APIs is another fantastic approach. They aren't enough if you are dealing with wizards (eg typeform), but they can work really well
suchintan•9h ago
IF you can use crawlers, definitely do.

They aren't enough for anything that's login-protected, or requires interacting with wizards (eg JS, downloading files, etc)

philipbjorge•10h ago
We had a similar realization here at Thoughtful and pivoted towards code generation approaches as well.

I know the authors of Skyvern are around here sometimes -- How do you think about code generation with vision based approaches to agentic browser use like OpenAI's Operator, Claude Computer Use and Magnitude?

From my POV, I think the vision based approaches are superior, but they are less amenable to codegen IMO.

suchintan•9h ago
I think they're complementary, and that's the direction we're headed.

We can ask the vision based models to output why they are doing what they are doing, and fallback to code-based approaches for subsequent runs

suchintan•6h ago
Unrelated, but thoughtful gave us some very very helpful feedback early in our journey. We are big fans!
ahstilde•10h ago
this matches our personal experience, too
franze•10h ago
In AI First workshops. By now I tell them for the last exercise "no scrappers". the learning is to separate reasoning (AI) from data (that you have to bring.) and ai coded scrappers seem a logical, but always fail. scrapping is a scaling issue, not reasoning challenge. also the most interesting websites are not keen for new scrappers.
pyuser583•9h ago
Over the past few days I've spent a lot of time dealing with terribly designed UIs. Some legitimate and desired use cases are impossible because poor logic excludes them.

Is AI capable of saying, "This website sucks, and doesn't work - file a complaint with the webmaster?"

I once had similar problems with the CIA's World Factbook. I shudder to think what an I would do there.

suchintan•8h ago
It's funny, one time we had a customer that wanted to use us to test their website for bugs..

Skyvern kept suggesting improvements unrelated to the issue they were testing for

pyuser583•7h ago
So how do clients process this sort of feedback? As a dev, “negative user feedback” gives me scares that “failed behavior testing” does not.

The AI isn’t mad, and won’t refuse to renew. Unless it’s being run by the client of course.

Are clients using your platform to assess vendors?

suchintan•5h ago
No, we don't have a lot of usage in that direction. People mainly use us to log into websites and either fill out forms or download files!
nithril•9h ago
The same day, a post on reddit was about: "We built 3B and 8B models that rival GPT-5 at HTML extraction while costing 40-80x less - fully open source" [1].

Not fully equivalent to what is doing Skyvern, but still an interesting approach.

[1] https://www.reddit.com/r/LocalLLaMA/comments/1o8m0ti/we_buil...

suchintan•8h ago
This is really cool. We might integrate this into Skyvern actually - we've been looking for a faster HTML extraction engine

Thanks for sharing!

guluarte•9h ago
the hardest part of scrapping is bypassing Cloudflare/captchas/fingerprinting etc
suchintan•8h ago
Definitely. What are your thoughts on the CloudFlare agent identity
fragmede•5h ago
The hardest part is not telling anyone how you're bypassing it!
ThatPlayer•2h ago
I can talk about this bypass because they've fixed it: a site I was scraping rolled their own custom captcha that was just multiple choice. But they didn't have a nonce, so I would just attempt all the choices, and one of them would let me in.
herpdyderp•9h ago
I'd be all over Skyvern if only they had enterprise compliance agreements available.
suchintan•8h ago
We do have them! We are HIPAA compliant, have soc-2 type 2 and offer self hosted deployments
whinvik•9h ago
I feel like this is how normal work is. When I have to figure out how to use a new app/api etc, I go through an initial period where I am just clicking around, shouting in the ether etc until I get the hang of it.

And then the third or fourth time its automatic. Its weird but sometimes I feel like the best way to make agents work is to metathink about how I myself work.

suchintan•8h ago
I have a 2yo and it's been surreal watching her learn the world. It deeply resembles how LLMs learn and think. Crazy
Retric•8h ago
Odd, I've been stuck by how different LLMs and kids learn the world.

You don’t get that whole uncanny valley disconnect do you?

goatlover•8h ago
How so? Your kid has a body that interacts with the physical world. An LLM is trained on terabytes of text, then modified by human feedback and rules to be a useful chatbot for all sorts of tasks. I don't see the similarity.
crazygringo•6h ago
If you watch how agents attempt a task, fail, try to figure out what went wrong, try again, repeat a couple more times, then finally succeed -- you don't see the similarity?
dingnuts•6h ago
no I see something resembling gradient descent which is fine but it's hardly a child
balder1991•4h ago
No, because an agent doesn’t learn, it’s just continuing a story. A kid will learn from the experience and at the end will be a different person.
deadbabe•4h ago
A person is not their body.

The person is the data that they have ingested and trained on through the senses that are exposed by their body. Body is just an interface to reality.

_pdp_•8h ago
This is exactly the direction I am seeing agent go. They should be able to write their own tools and we are soon launching something about that.

That being said...

LLMS are amazing for some coding tasks and fail miserably at others. My hypothesis is that there is some sort of practical limit to how many concepts an LLM can hold into account no matter the context window given the current model architectures.

For a long time I wanted to find some sort of litmus test to measure this and I think I found one that is an easy to understand programming problem, can be done in a single file, yet complex enough. I have not found a single LLM to be able to build a solution without careful guidance.

I wrote more about this here if you are interested: https://chatbotkit.com/reflections/where-ai-coding-agents-go...

meowface•6h ago
With the upcoming release of Gemini 3.0 Pro, we might see a breakthrough for that particular issue. (Those are the rumors, at least.) I'm sure not fully solved, but possibly greatly improved.
Groxx•3h ago
also training data quality. they are horrifyingly bad at concurrent code in general in my experience, and looking at most concurrent code in existence.... yeah I can see why.
pennaMan•8h ago
Yes, it is easy. LLMs have reduced my maintenance work on scraping tasks I manage (lots of specialized high-traffic adfield sites) by 99%

What used to be a constant almost daily chore with them breaking all the time at random intervals is now a self-healing system that rarely ever fails.

suchintan•6h ago
That's the dream
ACCount37•6h ago
One of the uses for AI I'm excited about - maintaining systems, keeping up with the moving targets.
silver_sun•4h ago
Interesting. Could you elaborate? Is there a specific reason that it doesn't do 100% of the work already?
TheTaytay•1h ago
Could you elaborate on your setup please?
claysmithr•7h ago
I misread this as 'sky scrapers'
pcblues•7h ago
You gain experience getting interactions with other agencies optimised by dealing with them yourself. If the AI you rely on fails, you are dead in the water. And I'm speaking as a fairly resilient 50 year old with plenty of hands-on experience, but concerned for the next generation. I know generational concern has existed since the invention of writing, and the world hasn't fallen apart, so what do I know? :)
hamasho•3h ago
Off topic, but because the article mentioned improper usage of DOM, I put down the UK government's design system/accessibility. It's well documented, and I hope all governments have the same standard. I guess they paid a huge amount of money to consultants and vendors.

[1] https://design-system.service.gov.uk/components/radios/