frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: AnyCrawl v0.0.1-alpha.5 – custom user-agent and richer scraping API

https://github.com/any4ai/AnyCrawl
2•ntbperst•13h ago
## [0.0.1-alpha.5] - 2025-06-14

### Added

- Integrated AWS S3 storage support with new `S3` class and environment variables for seamless file uploads and retrievals. - Introduced `FileController` for serving files from S3 or local storage with robust path validation and error handling. - Added multiple content transformers (Screenshot, `HTMLTransformer`) improving HTML/Markdown extraction and screenshot generation. - Extended scraping capabilities with new options: output `formats`, `timeout`, tag filtering, `wait_for`, retry strategy, viewport configuration, and custom user-agent support. - Added Safe Search parameter to `SearchSchema` for filtered search results. - Refactored engine architecture with a factory pattern and new core modules for configuration validation, data extraction, and job management. - Implemented graceful shutdown handling for the API server and improved logging for uncaught exceptions / unhandled rejections. - Added Jest configuration for API and library packages with ESM support and updated test scripts. - Updated CI workflows to publish Docker images on version tags. - Expanded README with detailed environment variable descriptions and API usage examples.

### Changed

- Refined error handling in `ScrapeController` and `JobManager`; failure responses now include structured error objects and HTTP status codes. - Enhanced `BaseEngine` with explicit HTTP error checks and resilience improvements. - Updated OpenAPI documentation to reflect new scraping parameters and error formats. - Migrated key-value store name to environment configuration for greater flexibility. - Enhanced per-request credit tracking in `ScrapeController` and enhanced logging middleware to include credit usage.

### Fixed

- Improved job failure messages to include detailed error data, ensuring clearer debugging information. - Minor documentation corrections and clarifications.

Get your compliance automated now

https://horuscheck.io/
1•sandboxmumu•6m ago•1 comments

Show HN: Made a 3 SEC log streaming setup (paste command –> streaming starts)

https://www.logsy.info/
1•devparagiri•13m ago•0 comments

Disaster Party – A "Universal" AI API SDK

https://github.com/segin/disasterparty
1•segin•13m ago•0 comments

The Art of Lisp and Writing

https://www.dreamsongs.com/ArtOfLisp.html
2•Bogdanp•25m ago•0 comments

A Parting Message to My Students

https://kstan.gitlab.io/blog/partingmessage/
1•tankangsoon•25m ago•0 comments

Dead Hand automatic nuclear weapons control system

https://en.wikipedia.org/wiki/Dead_Hand
2•aroman•31m ago•1 comments

Trade with China Is Becoming a One-Way Street

https://www.wsj.com/economy/trade/china-us-export-market-222ebc3a
4•Ozarkian•35m ago•3 comments

Show HN: Mdc – just another Markdown viewer with ToC and CLI support

https://github.com/zoetin45/mdc
1•zoetin45•37m ago•0 comments

Government awards contract to French company to develop sonar system

https://www.rte.ie/news/ireland/2025/0615/1518526-sonar-system-defence/
1•austinallegro•37m ago•0 comments

The Apple "Reasoning Collapse" Paper Is Even Dumber Than You Think

https://mikecaulfield.substack.com/p/the-apple-reasoning-collapse-paper
1•gsky•38m ago•0 comments

Spatializing 6k years of global urbanization from 3700 BC to AD 2000

https://www.nature.com/articles/sdata201634
1•talonx•39m ago•0 comments

Coinbase, famously a "no politics" company in 2020, sponsors a military parade

https://old.reddit.com/r/Military/comments/1lblspo/thanks_to_our_sponsor_coinbase/
3•tomlockwood•45m ago•1 comments

Introduction to Competitive Programming in Haskell

https://byorgey.github.io/blog/posts/2025/06/10/comprog-hs-intro.html
1•matt_d•48m ago•0 comments

Sweden gets help pulling its sovereign AI socks up

https://www.computerweekly.com/news/366625706/Sweden-gets-help-pulling-its-sovereign-AI-socks-up
1•jamesblonde•50m ago•1 comments

How you breathe is like a fingerprint that can identify you

https://www.nature.com/articles/d41586-025-01835-0
1•XzetaU8•51m ago•1 comments

Root Cause of the June 12, 2025 Google Cloud Outage

https://twitter.com/0xTib3rius/status/1933702904734429560
1•thunderbong•54m ago•0 comments

Disturbing Rumor – PBS NewsHour (Brooks / Capehart)

5•mobileturdfctry•1h ago•0 comments

I build an anonymous stranger chat with no log in

https://randomize.chat/
2•henrymuddleton•1h ago•0 comments

Novo Nordisk's Canadian Mistake

https://www.science.org/content/blog-post/novo-nordisk-s-canadian-mistake
1•taubek•1h ago•0 comments

Show HN: Shields.rs – a Rust badge engine 10x faster than Node.js

https://github.com/Jannchie/shields.rs
1•jannchie•1h ago•0 comments

Software Engineering Talent Is Gold Right Now

https://gametorch.app/blog/software-engineering-talent
2•gametorch•1h ago•0 comments

Centralization or Decentralization? Evolution of State-Ownership in China (2022)

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4283197
1•walterbell•1h ago•0 comments

The Algebra of an Infinite Grid of Resistors

https://www.mathpages.com/home/kmath669/kmath669.htm
2•gone35•1h ago•0 comments

Ordinary users can also generate professional and creative print ads

https://www.piclabs.org
1•rooty_ship•1h ago•0 comments

Arkane Linux: Opinionated, immutable, atomic Arch-based distribution

https://arkanelinux.org/
2•theycallhermax•1h ago•0 comments

Remove Bug Bounty Program

https://github.com/CycloneDX/cyclonedx-rust-cargo/commit/93b19cb4ac96d1b8f51647df2b89ec4359becae1
3•Tomte•1h ago•0 comments

Adding .md URLs for Raw Markdown Content in Next.js

https://www.bengubler.com/posts/2025-06-14-raw-markdown-urls-nextjs
1•nebrelbug•2h ago•0 comments

Scaling Laws – Can Someone Tell Elon?

https://waymo.com/blog/2025/06/scaling-laws-in-autonomous-driving
3•bobby_mcbrown•2h ago•0 comments

Smooth Page Transitions in Next.js with next-view-transitions

https://www.bengubler.com/posts/2025-06-14-smooth-page-transitions-next-view-transitions
2•nebrelbug•2h ago•0 comments

The Trolley Problem: the UX of shopping carts (2023)

https://usamawaheed.substack.com/p/the-real-trolley-problem-the-ux-of
3•Mr_Minderbinder•2h ago•0 comments