Show HN: Python lib and Java CLI tool to read/write Screaming Frog config files

https://github.com/Amaculus/sf-config-tool
1•am-seo•1h ago
Hey all, first time posting. I posted about this on LinkedIn, and a friend said I should post here too.

I run a lot of headless Screaming Frog crawls on servers.

The main bottleneck is that while the SF CLI can consume configuration files (.seospiderconfig), it cannot produce them. If you want to run a crawl with complex settings (like custom extractions or specific excludes), you are forced to open the desktop GUI, configure it manually, save the file, and upload it.
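For readers unfamiliar with the consuming side, here is a minimal sketch of how a headless crawl might be launched with a prepared config file. The binary name `screamingfrogseospider` and the `--crawl`, `--headless`, `--config`, `--save-crawl`, and `--output-folder` flags reflect my understanding of the Screaming Frog command-line documentation and should be checked against `--help` for your installed version. The helper `build_crawl_command` and all paths are hypothetical.

```python
import shlex
from pathlib import Path


def build_crawl_command(url, config_path, output_dir,
                        binary="screamingfrogseospider"):
    """Build the argument list for a headless crawl that consumes a config file.

    The binary name and flags are assumptions based on the documented CLI;
    verify them against your installed version before relying on this.
    """
    return [
        binary,
        "--crawl", url,
        "--headless",
        "--config", str(Path(config_path)),
        "--save-crawl",
        "--output-folder", str(Path(output_dir)),
    ]


if __name__ == "__main__":
    cmd = build_crawl_command(
        "https://example.com",
        "configs/example.seospiderconfig",  # hypothetical path
        "crawls/example",                   # hypothetical path
    )
    # Print the command rather than running it, so the sketch has no side effects.
    print(shlex.join(cmd))
```

Passing the list to `subprocess.run(cmd, check=True)` would start the crawl. The config file itself still has to come from somewhere, which is the gap described above.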

You can't just script the file generation because the configs are serialized Java objects (binary blobs), not JSON or XML.

I decided to reverse engineer it. A hex dump confirmed the format was standard Java serialization. Instead of writing a fragile parser, I realized I could use the application's own JARs to handle the heavy lifting.
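The hex-dump check can be reproduced in a few lines. Java's object serialization stream starts with the two-byte magic number `0xACED` followed by the two-byte stream version `0x0005`, so reading the first four bytes of a file is enough to recognize the format. This is a sketch of that check only, not code from the project, and the function names are hypothetical.

```python
import struct

# Header constants defined by the Java Object Serialization Stream Protocol.
STREAM_MAGIC = 0xACED
STREAM_VERSION = 0x0005


def looks_like_java_serialization(data: bytes) -> bool:
    """Return True if the bytes begin with the Java serialization stream header."""
    if len(data) < 4:
        return False
    # Both header fields are big-endian unsigned 16-bit integers.
    magic, version = struct.unpack(">HH", data[:4])
    return magic == STREAM_MAGIC and version == STREAM_VERSION


def file_is_java_serialized(path) -> bool:
    """Check only the first four bytes of a file on disk."""
    with open(path, "rb") as handle:
        return looks_like_java_serialization(handle.read(4))
```

A match only identifies the container format. Reading the objects inside still requires the classes that wrote them, which is why reusing the application's own JARs is attractive.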

I built two tools to solve this:

Python Library: Uses JPype to bridge Python to the local SF JARs. You can instantiate config objects, modify them (e.g., config.set_user_agent(...)), and serialize them back to disk. Great for Airflow/Python pipelines.

Java Utility: A standalone CLI tool to do the same thing if you prefer a native Java environment or don't want the Python overhead.
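To illustrate the general idea behind the Python bridge, here is a rough sketch that uses raw JPype to deserialize a config file with the application's classes on the classpath. It is not the library's API: `classpath_for` and `load_config` are hypothetical names, the JAR directory is whatever your local install uses, and whether the resulting object is safe to modify and write back depends on the application's classes.

```python
from pathlib import Path


def classpath_for(jar_dir):
    """Return a wildcard classpath entry covering every JAR in jar_dir."""
    return str(Path(jar_dir) / "*")


def load_config(config_path, jar_dir):
    """Deserialize a .seospiderconfig file using the JVM's own ObjectInputStream.

    Assumes JPype is installed and jar_dir holds the application's JARs,
    so the serialized classes can be resolved. Imports are local so this
    module can be imported on machines without JPype.
    """
    import jpype
    import jpype.imports  # enables "from java... import ..." statements

    if not jpype.isJVMStarted():
        jpype.startJVM(classpath=[classpath_for(jar_dir)])

    from java.io import FileInputStream, ObjectInputStream

    stream = ObjectInputStream(FileInputStream(str(config_path)))
    try:
        # Returns a Java object proxy; its type comes from the application's JARs.
        return stream.readObject()
    finally:
        stream.close()
```

Writing a config back would mirror this with `java.io.ObjectOutputStream` and `writeObject`. A JVM can only be started once per Python process with JPype, which is one reason the bridge needs care in long-running pipelines.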

What this enables:

True Headless Automation: Generate valid configs on the fly right before a crawl runs.

Diffing: Compare two binary config files to debug "config drift" (e.g., seeing exactly why a crawl limit changed).
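The diffing idea can be sketched independently of the binary format. The example below assumes each config has already been loaded and flattened into a plain dictionary of setting names to values. That flattening step, the setting names, and the `diff_settings` helper are all hypothetical and not taken from the project.

```python
_MISSING = "<missing>"


def diff_settings(old, new):
    """Return {setting: (old_value, new_value)} for every setting that differs.

    Settings present on only one side are reported with a "<missing>" marker.
    """
    changes = {}
    for key in sorted(old.keys() | new.keys()):
        before = old.get(key, _MISSING)
        after = new.get(key, _MISSING)
        if before != after:
            changes[key] = (before, after)
    return changes


if __name__ == "__main__":
    # Hypothetical flattened settings from two config files.
    last_week = {"crawl_limit": 5000, "user_agent": "example-bot"}
    today = {"crawl_limit": 500, "user_agent": "example-bot"}
    for name, (before, after) in diff_settings(last_week, today).items():
        print(f"{name}: {before} -> {after}")
```

Sorting the keys keeps the output stable between runs, which makes the result easy to log or compare in CI.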

Feedback welcome, especially on the JPype implementation, as that was the trickiest part to stabilize!

The Possessed Machines: Dostoevsky's Demons and the Coming AGI Catastrophe

https://possessedmachines.com/
1•kvee•2m ago•0 comments

Why I Switched from Alacritty to Kitty

https://linkarzu.com/posts/macos/alacritty-to-kitty/
1•behnamoh•4m ago•0 comments

Show HN: I built a CLI to search screenshots by what's in them

https://github.com/memvid/screenshot-memory
1•saleban1031•6m ago•0 comments

An Idea for Solving Superintelligence Alignment

https://science-dao.org/ai-needs/
1•porton•7m ago•0 comments

Nobel Institute affirms that Venezuela's Machado can't give Peace Prize to Trump

https://www.latimes.com/world-nation/story/2026-01-11/nobel-institute-affirms-venezuelas-machado-...
2•vedantnair•9m ago•0 comments

Rolling with the economic tides

https://www.nybooks.com/articles/2026/02/12/rolling-with-the-economic-tides-empty-vessel-kumekawa/
1•hhs•9m ago•0 comments

Over 600 Minnesota businesses close in protest of ICE operations

https://www.perplexity.ai/discover/you/trump-immigration-enforcement-l3O3OX48SMuW4LngN6jltQ
2•vedantnair•9m ago•0 comments

How I estimate work as a staff software engineer

https://www.seangoedecke.com/how-i-estimate-work/
2•lalitmaganti•11m ago•0 comments

Sony Hands Its Bravia TV Reins over to TCL

https://www.ign.com/articles/sony-hands-its-bravia-tv-reins-over-to-tcl
1•LopRabbit•12m ago•0 comments

California becomes first state to join WHO disease network

https://thehill.com/homenews/state-watch/5703447-who-gavin-newsom-california/
2•geox•16m ago•0 comments

Crackdown on Shadow Fleet Escalates with Seizure of Ship Carrying Russian Oil

https://www.wsj.com/world/europe/crackdown-on-shadow-fleet-escalates-with-seizure-of-ship-carryin...
2•JumpCrisscross•17m ago•0 comments

Rx Inspector: Look Up Where Your Generic Prescription Drugs Were Made

https://projects.propublica.org/rx-inspector/
2•coloneltcb•18m ago•0 comments

Google vs. SerpApi: Threatening Access to Public Data

https://serpapi.com/blog/google-v-serpapi-threatening-access-to-public-data/
1•nateskiles•19m ago•0 comments

Claude Code Upgrading Todos into Tasks

https://twitter.com/trq212/status/2014480496013803643
1•Dlouie•20m ago•1 comment

Quantum physicists create largest ever 'superposition'

https://www.nature.com/articles/d41586-026-00177-9
1•Luc•21m ago•0 comments

Google vs. SerpApi: Threatening Access to Public Data

https://serpapi.com/blog/google-v-serpapi-threatening-access-to-public-data/
2•Autumrose34•24m ago•1 comment

Can AI Pass Freshman CS? [video]

https://www.youtube.com/watch?v=56HJQm5nb0U
2•moonleay•25m ago•0 comments

A social network only for AI models – no humans allowed

https://aifeed.social/timeline
2•zonadigital•28m ago•0 comments

Memoria: A Technical Overview of Venice's Memory System

https://venice.ai/blog/venice-memoria-technical-overview
1•madars•29m ago•0 comments

OpenAI to Take a Percentage from Customer AI-Assisted R&D Outcomes

https://news.aibase.com/news/24859
1•jpster•29m ago•0 comments

Ask HN: How do you investigate server-side bugs that affect subset of users?

1•jatin-dot-py•29m ago•0 comments

Psiphon, join the fight for internet freedom #Iran

https://www.psiphon.ca
1•us321•30m ago•0 comments

I found the perfect yearly calendar (for me)

https://blog.notmyhostna.me/posts/i-found-the-perfect-yearly-calendar-for-me
2•dewey•32m ago•0 comments

Ronald Shusett Discusses 'Alien' Creating the Fear Factor (2003)

https://scriptmag.com/features/writers-on-writing-ronald-shusett-discusses-alien-creating-the-fea...
1•exvi•33m ago•0 comments

Cinefantastique Interview with Bolaji Badejo (1979)

https://alienseries.wordpress.com/2012/10/18/interview-with-bolaji-badejo-1979/
1•exvi•35m ago•1 comment

Ask HN: How to Follow the Bosses' Wants

1•drekipus•35m ago•0 comments

The eom Expression: Beautiful Chaos

https://www.amazon.com/eom-Expression-Beautiful-Satirical-Adventure-ebook/dp/B0CBW42YZ6
1•dpforesi•37m ago•1 comment

The Mpemba Effect

https://en.wikipedia.org/wiki/Mpemba_effect
1•dr_dshiv•38m ago•0 comments

How to Stop Morning Doomscrolling: Why the First 15 Minutes Shape Your Focus

https://www.focusfit.app/blog/morning-phone-scrolling-focus
1•shenli3514•43m ago•0 comments

Show HN: On-brand, context-related <img> generation for blogs

https://blogimagegen.com/en
1•aleksam•43m ago•0 comments