frontpage.
Qwen3-Coder-Next

https://qwen.ai/blog?id=qwen3-coder-next
351•danielhanchen•3h ago•204 comments

Deno Sandbox

https://deno.com/blog/introducing-deno-sandbox
109•johnspurlock•1h ago•33 comments

Agent Skills

https://agentskills.io/home
262•mooreds•5h ago•161 comments

AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines

https://github.com/alibaba/AliSQL
23•baotiao•48m ago•3 comments

Prek: A better, faster, drop-in pre-commit replacement, engineered in Rust

https://github.com/j178/prek
91•fortuitous-frog•2h ago•48 comments

Xcode 26.3 unlocks the power of agentic coding

https://www.apple.com/newsroom/2026/02/xcode-26-point-3-unlocks-the-power-of-agentic-coding/
107•davidbarker•1h ago•70 comments

221 Cannon Road Is Not for Sale

https://fredbenenson.com/blog/2026/02/03/221-cannon-is-not-for-sale/
63•mecredis•2h ago•38 comments

France dumps Zoom and Teams as Europe seeks digital autonomy from the US

https://apnews.com/article/europe-digital-sovereignty-big-tech-9f5388b68a0648514cebc8d92f682060
303•AareyBaba•2h ago•168 comments

What's up with all those equals signs anyway?

https://lars.ingebrigtsen.no/2026/02/02/whats-up-with-all-those-equals-signs-anyway/
495•todsacerdoti•9h ago•151 comments

Kilobyte is precisely 1000 bytes

https://waspdev.com/articles/2026-01-11/kilobyte-is-1000-bytes
35•surprisetalk•2h ago•107 comments

Launch HN: Modelence (YC S25) – App Builder with TypeScript / MongoDB Framework

29•eduardpi•3h ago•17 comments

Bunny Database

https://bunny.net/blog/meet-bunny-database-the-sql-service-that-just-works/
166•dabinat•7h ago•80 comments

Heritability of intrinsic human life span is about 50%

https://www.science.org/doi/10.1126/science.adz1187
101•XzetaU8•2d ago•59 comments

Puget Systems Most Reliable Hardware of 2025

https://www.pugetsystems.com/labs/articles/puget-systems-most-reliable-hardware-of-2025/
15•zdw•3d ago•1 comments

Defining Safe Hardware Design [pdf]

https://people.csail.mit.edu/rachit/files/pubs/safe-hdls.pdf
20•rachitnigam•2h ago•2 comments

Show HN: Octosphere, a tool to decentralise scientific publishing

https://octosphere.social/
20•crimsoneer•2h ago•10 comments

Show HN: C discrete event SIM w stackful coroutines runs 45x faster than SimPy

https://github.com/ambonvik/cimba
21•ambonvik•3h ago•7 comments

The Everdeck: A Universal Card System (2019)

https://thewrongtools.wordpress.com/2019/10/10/the-everdeck/
63•surprisetalk•6d ago•16 comments

Tadpole – A modular and extensible DSL built for web scraping

https://tadpolehq.com/
15•zachperkitny•2h ago•5 comments

Migrate Wizard – IMAP Based Email Migration Tool

https://migratewizard.com/#features
10•techstuff123•2h ago•7 comments

Emerge Career (YC S22) is hiring a product designer

https://www.ycombinator.com/companies/emerge-career/jobs/omqT34S-founding-product-designer
1•gabesaruhashi•7h ago

Y Combinator will let founders receive funds in stablecoins

https://fortune.com/2026/02/03/famed-startup-incubator-y-combinator-to-let-founders-receive-funds...
21•shscs911•1h ago•10 comments

Young adults report lower life satisfaction in Sweden

https://internationaljournalofwellbeing.org/index.php/ijow/article/view/6001/1299
11•late•2h ago•4 comments

Floppinux – An Embedded Linux on a Single Floppy, 2025 Edition

https://krzysztofjankowski.com/floppinux/floppinux-2025.html
221•GalaxySnail•14h ago•153 comments

Show HN: I built "AI Wattpad" to eval LLMs on fiction

https://narrator.sh/llm-leaderboard
8•jauws•2h ago•6 comments

Show HN: Sandboxing untrusted code using WebAssembly

https://github.com/mavdol/capsule
46•mavdol04•5h ago•17 comments

Show HN: PII-Shield – Log Sanitization Sidecar with JSON Integrity (Go, Entropy)

https://github.com/aragossa/pii-shield
7•aragoss•2h ago•3 comments

The next steps for Airbus' big bet on open rotor engines

https://aerospaceamerica.aiaa.org/the-next-steps-for-airbus-big-bet-on-open-rotor-engines/
30•CGMthrowaway•3h ago•28 comments

Show HN: Safe-now.live – Ultra-light emergency info site (<10KB)

https://safe-now.live
141•tinuviel•10h ago•63 comments

Banning lead in gas worked. The proof is in our hair

https://attheu.utah.edu/health-medicine/banning-lead-in-gas-worked-the-proof-is-in-our-hair/
285•geox•17h ago•213 comments

Tadpole – A modular and extensible DSL built for web scraping

https://tadpolehq.com/
15•zachperkitny•2h ago

Comments

zachperkitny•2h ago
Hello!

I wanted to share my recent project: Tadpole. It is a custom DSL built on top of KDL specifically for web scraping and browser automation. I wanted there to be a standardized way of writing scrapers and reusing existing scraper logic. This was my solution.

Why?

    Abstraction: Simulating realistic human behavior (bezier curves, easing) through high-level composed actions.
    Zero Config: Import and share scraper modules directly via Git, bypassing npm/registry overhead.
    Reusability: Actions and evaluators can be composed through slots to create more complex workflows.
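
To make the "bezier curves, easing" point concrete, here is a minimal TypeScript sketch of how a human-looking pointer path can be generated. It is not Tadpole's implementation: the names `mousePath`, `cubicBezier`, `easeInOutCubic` and the `bend` parameter are hypothetical, chosen only for illustration, and `steps` is assumed to be at least 1.

```typescript
// Hypothetical sketch: none of these names come from Tadpole itself.
type Point = { x: number; y: number };

// Ease-in-out cubic: slow start, fast middle, slow stop. Maps [0, 1] onto [0, 1].
function easeInOutCubic(t: number): number {
  return t < 0.5 ? 4 * t * t * t : 1 - Math.pow(-2 * t + 2, 3) / 2;
}

// Evaluate a cubic Bezier curve with control points p0..p3 at parameter t.
function cubicBezier(p0: Point, p1: Point, p2: Point, p3: Point, t: number): Point {
  const u = 1 - t;
  const a = u * u * u;
  const b = 3 * u * u * t;
  const c = 3 * u * t * t;
  const d = t * t * t;
  return {
    x: a * p0.x + b * p1.x + c * p2.x + d * p3.x,
    y: a * p0.y + b * p1.y + c * p2.y + d * p3.y,
  };
}

// Build a curved pointer path from `from` to `to` using `steps` segments (steps >= 1).
// Both inner control points are pushed `bend` pixels sideways, perpendicular to the
// direction of travel, so the pointer arcs instead of moving in a straight line.
function mousePath(from: Point, to: Point, steps: number, bend: number): Point[] {
  const dx = to.x - from.x;
  const dy = to.y - from.y;
  const len = Math.sqrt(dx * dx + dy * dy) || 1;
  // Unit normal to the direction of travel.
  const nx = -dy / len;
  const ny = dx / len;
  const c1: Point = { x: from.x + dx / 3 + nx * bend, y: from.y + dy / 3 + ny * bend };
  const c2: Point = { x: from.x + (2 * dx) / 3 + nx * bend, y: from.y + (2 * dy) / 3 + ny * bend };
  const path: Point[] = [];
  for (let i = 0; i <= steps; i++) {
    // Easing spaces the samples unevenly: dense near the ends, sparse in the middle.
    path.push(cubicBezier(from, c1, c2, to, easeInOutCubic(i / steps)));
  }
  return path;
}
```

Feeding the returned points to a browser one at a time, at a fixed interval, gives a movement that accelerates and decelerates along an arc. A real implementation would likely add jitter and vary `bend` per move.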

Example

This is a fully runnable example; @tadpole/cli is published on npm:

tadpole run redfin.kdl --input '{"text": "Seattle, WA"}' --auto --output output.json

  import "modules/redfin/mod.kdl" repo="github.com/tadpolehq/community"

  main {
    new_page {
      redfin.search text="=text"
      wait_until
      redfin.extract_from_card extract_to="addresses" {
        address {
          redfin.extract_address_from_card
        }
      }
    }
  }

Roadmap

Planned for 0.2.0

    Control Flow: Add maybe (effectively try/catch) and loop (while {}, do {})
    DOMPick: Used to select elements by index
    DOMFilter: Used to filter elements using evaluators
    More Evaluators: Type casting, regex, exists
    Root Slots: Support for top level dynamic placeholders
    Error Reporting: More robust error reporting
    Logging: More consistent logging from actions and add log action to global registry
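
As a rough illustration of the planned control flow, the TypeScript sketch below shows what "effectively try/catch" and a guarded loop could look like as combinators over async actions. The names `Action`, `MaybeResult`, `maybe`, and `loopWhile`, and the `maxIterations` cap, are assumptions made for this example and are not taken from Tadpole.

```typescript
// Hypothetical types: an Action is any async step that may throw.
type Action<T> = () => Promise<T>;

type MaybeResult<T> = { ok: true; value: T } | { ok: false; error: unknown };

// Run an action; on failure, capture the error instead of aborting the workflow.
async function maybe<T>(action: Action<T>): Promise<MaybeResult<T>> {
  try {
    return { ok: true, value: await action() };
  } catch (error) {
    return { ok: false, error };
  }
}

// Repeat `body` while `condition` holds. The hard cap keeps a scraper from
// spinning forever when a page never reaches the expected state.
// Returns the number of iterations that ran.
async function loopWhile(
  condition: () => Promise<boolean>,
  body: Action<void>,
  maxIterations = 100,
): Promise<number> {
  let iterations = 0;
  while (iterations < maxIterations && (await condition())) {
    await body();
    iterations++;
  }
  return iterations;
}
```

A typical use would be wrapping an optional step, such as dismissing a cookie banner, in `maybe` so that a missing element does not fail the whole run.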

0.3.0

    Piping: Allowing different files to chain input/output.
    Outputs: Complex output sinks to databases, s3, kafka, etc.
    DAGs: Use directed acyclic graphs to create complex crawling scenarios and parallel compute.
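
One common way to get parallelism out of a DAG is to group tasks into "waves", where every task in a wave depends only on tasks from earlier waves. The TypeScript sketch below shows that idea under stated assumptions: `parallelWaves` and the task names in the usage note are hypothetical, and nothing here reflects how Tadpole will actually implement the feature.

```typescript
// Hypothetical sketch: group DAG tasks into waves that can run in parallel.
// `deps` maps each task name to the names of the tasks it depends on.
function parallelWaves(deps: Record<string, string[]>): string[][] {
  const remaining = new Map<string, Set<string>>();
  Object.keys(deps).forEach((task) => {
    remaining.set(task, new Set(deps[task]));
  });

  const waves: string[][] = [];
  while (remaining.size > 0) {
    // A task is ready once all of its dependencies have been scheduled.
    const ready: string[] = [];
    remaining.forEach((needs, task) => {
      if (needs.size === 0) ready.push(task);
    });
    if (ready.length === 0) {
      throw new Error("cycle or unknown dependency detected; not a DAG");
    }
    ready.sort(); // deterministic order within a wave
    ready.forEach((task) => remaining.delete(task));
    // Mark the scheduled tasks as satisfied for everything still waiting.
    remaining.forEach((needs) => ready.forEach((task) => needs.delete(task)));
    waves.push(ready);
  }
  return waves;
}
```

For a crawl where `listings` and `photos` both depend on `search`, and `export` depends on both, the function returns three waves: `search` alone, then `listings` and `photos` together, then `export`.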

GitHub repository: https://github.com/tadpolehq/tadpole

I've also created a community repository for sharing scraper logic: https://github.com/tadpolehq/community

Feedback would be greatly appreciated!

bobajeff•1h ago
I like the idea of a DSL for scraping, but my scrapers do more than extract text. I also download files (and monitor download progress) and intercept images (and check for partial or failed image loads). So it seems my use case isn't really covered by this.

zachperkitny•42m ago
Thanks for the idea, actually! It's difficult to cover every use case in the 0.1.0 release. I'll take this into account. Downloading files and images could likely be abstracted into an HTTP source, and the data sources could be merged in some way.

bobajeff•1h ago
I had to look up what KDL is and what `Functional Source License, Version 1.1, ALv2 Future License` is.

So KDL is like another JSON or YAML. FSL-1.1-ALv2 is an almost, but not really, open source license that becomes available under a real open source license after 2 years. It's meant to prevent freeloading by companies or something. Sounds fine to me, actually.

zachperkitny•34m ago
Effectively, it's not meant to restrict people from using it, even in a commercial setting, just to protect my personal interests in what I want to do with it commercially.

KDL is more than just JSON or YAML. It's node-based. Its output in libraries is effectively an AST, and its use cases are open-ended.
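
To illustrate the "node-based" point, here is a simplified TypeScript model of a KDL node (a name, ordered arguments, key=value properties, and children), applied by hand to part of the example from the post. The `KdlNode` interface and `walk` helper are assumptions for this sketch; actual KDL libraries expose richer structures, and the tree below is written manually rather than produced by a parser.

```typescript
// Simplified model of a KDL node. Real parsers also track type
// annotations and source positions, which are omitted here.
type KdlValue = string | number | boolean | null;

interface KdlNode {
  name: string;
  args: KdlValue[];
  props: Record<string, KdlValue>;
  children: KdlNode[];
}

// Hand-built tree for this fragment of the example:
//   redfin.extract_from_card extract_to="addresses" {
//     address {
//       redfin.extract_address_from_card
//     }
//   }
const extractNode: KdlNode = {
  name: "redfin.extract_from_card",
  args: [],
  props: { extract_to: "addresses" },
  children: [
    {
      name: "address",
      args: [],
      props: {},
      children: [
        { name: "redfin.extract_address_from_card", args: [], props: {}, children: [] },
      ],
    },
  ],
};

// Depth-first walk that lists node names, indented by nesting depth.
function walk(node: KdlNode, depth: number = 0, out: string[] = []): string[] {
  out.push("  ".repeat(depth) + node.name);
  node.children.forEach((child) => walk(child, depth + 1, out));
  return out;
}
```

Because every node carries its own children, a DSL can treat the parsed document as a syntax tree and interpret each node name as an action, which is harder to do cleanly with plain JSON objects or YAML mappings.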