frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How do you search the web programmatically these days?

2•coreyp_1•9h ago
For the first time in a long time, I need to query a search engine programmatically, and found that most of them block the use of curl, etc.

So, my question is simple: how do you solve the problem? I've tried searxng with mediocre success, but it seems a bit heavy to have to be running a complete separate service for this one thing that I only need every once in a while. I haven't tried using a service that requires an API key, simply because I'm not sure which direction to go or who to go with.

Just thought I would ask here first.

Comments

pwg•9h ago
> and found that most of them block the use of curl

Try again, but have curl provide a user agent string from one of the real browsers. You'll likely find that the request goes through.

dserban•4h ago
https://pypi.org/project/ddgs/

(Assuming you prefer Python.)

raw_anon_1111•18m ago
Can’t speak for search engines specifically. But I recently had to do a project which required me to crawl the customer’s large site and index it into a vector search for RAG for a call center.

My first attempt was to use crawl it just by doing GET requests (ie same thing as using curl). That got me nowhere. I had to use headless Chrome and Playwright.

Do any modern websites work with just curl even if they don’t block it - ie without being able to run JS?

Gmail label bridge on Claude Cowork just broke

4•mangoe•5h ago•2 comments

Do I Stop Learning Coding? DSA?

4•s_u_d_o•4h ago•9 comments

Ask HN: Building a solo business is impossible?

35•fnoef•20h ago•57 comments

Stop using naive RAG – adding relationships to AI context

3•eduardobenck•4h ago•0 comments

Ask HN: Who is using OpenClaw?

333•misterchocolat•2d ago•374 comments

Tell HN: Security Incident at Porter (YC S20)

5•leetrout•6h ago•0 comments

Tell HN: Fiverr left customer files public and searchable

819•morpheuskafka•3d ago•230 comments

Ask HN: How do you search the web programmatically these days?

2•coreyp_1•9h ago•3 comments

Tell HN: 48 absurd web projects – one every month

75•absurdwebsite•1d ago•25 comments

Ask HN: Teaching life skills through games, am I crazy?

2•shivaniShimpi_•11h ago•2 comments

Ask HN: How do you maintain flow when vibe coding?

29•fny•1d ago•29 comments

Ask HN: Getting depressed day by day, how to cope?

15•throwaw12•16h ago•14 comments

Ask HN: How did you get your first users with zero audience?

14•arikusi•1d ago•8 comments

Aliens.gov Resolves – To a WordPress "Site Not Found" Error

11•ascarola•1d ago•5 comments

Ask HN: How do you find motivation to do stuff?

24•RockstarSprain•2d ago•22 comments

Ask HN: How are you using LLMs in production?

8•Anon84•1d ago•10 comments

Advice for tracking down a listening device?

8•comrade1234•1d ago•5 comments

Opus 4.7 is horrible at writing

16•limalabs•1d ago•19 comments

Ask HN: Who is your favourite Entrepreneur/Visionary?

13•wasimsk•1d ago•31 comments

Durable Object alarm loop: $34k in 8 days, zero users, no platform warning

27•thewillmoss•2d ago•2 comments

Ask HN: How are you actively keeping your thinking sharp while using LLMs daily?

12•smonk108•1d ago•10 comments

Tell HN: Anthropic no longer allows you to fix to specific model version

25•baobabKoodaa•2d ago•2 comments

Ask HN: Is Claude Getting Worse?

9•sahli•2d ago•19 comments

Ask HN: How to highlight talent from untraditional backgrounds?

6•etherus•1d ago•4 comments

Ask HN: As an Australian, is it possible to get a remote US role?

4•apatheticonion•2d ago•8 comments

GitHub gave webhook secrets away in webhook call

12•time4tea•3d ago•1 comments

Ask HN: SeedLegals Partnerships in London, worth it?

2•pain_perdu•1d ago•1 comments

Ask HN: LeetCode, anyone still doing it?

19•kwar13•3d ago•14 comments

Tell HN: GitHub might have been leaking your webhook secrets. Check your emails.

43•ssiddharth•3d ago•12 comments

Any engineers here with experience of clinical data standards?

2•kalturnbull•2d ago•0 comments