frontpage.

Show HN: ETL System to Extract Product Data from Websites and Upload to Shopify

https://github.com/GustavoFortti/products-crawler
2•gustavofortti•12h ago
About a year ago, I built an ETL system to extract and process product data from e-commerce websites, mainly stores selling supplements. It maps each site's product structure, automates data collection, cleans and classifies the information, and uploads it directly to Shopify through API integration.

The system uses Python and Selenium for extraction, with transformation steps focused on cleaning, enrichment, and product classification. It was designed to be modular and easy to extend to new websites.
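To make the modular design concrete, here is a minimal sketch of how the transformation steps (cleaning, enrichment, classification) could be chained behind a per-site extractor registry. It is not taken from the repo: `Product`, `register`, `EXTRACTORS`, `RULES`, the `example-store` site, and the sample rows are all hypothetical, and the extractor returns hard-coded data where the real system would drive Selenium. The Shopify upload step is left out.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Product:
    """Hypothetical normalized record passed between transform steps."""
    title: str
    price: Optional[float] = None
    weight_g: Optional[int] = None
    category: str = "uncategorized"


# Registry of per-site extractors: adding a site means adding one function.
EXTRACTORS: Dict[str, Callable[[], List[dict]]] = {}


def register(site: str):
    """Decorator that files an extractor under a site name."""
    def wrap(fn):
        EXTRACTORS[site] = fn
        return fn
    return wrap


@register("example-store")
def extract_example_store() -> List[dict]:
    # Assumption: stand-in for a Selenium scrape; returns raw, messy rows.
    return [
        {"title": "  Whey   Protein 900g ", "price": "$ 29.99"},
        {"title": "Creatine Monohydrate 1kg", "price": "$18.50"},
        {"title": "Shaker Bottle", "price": "n/a"},
    ]


def clean(raw: dict) -> Product:
    """Collapse whitespace in the title and parse the first number as the price."""
    title = " ".join(raw["title"].split())
    match = re.search(r"\d+(?:\.\d+)?", raw.get("price", ""))
    return Product(title=title, price=float(match.group()) if match else None)


def enrich(product: Product) -> Product:
    """Derive the package weight in grams from the title, when present."""
    match = re.search(r"(\d+(?:\.\d+)?)\s*(kg|g)\b", product.title, re.IGNORECASE)
    if match:
        value, unit = float(match.group(1)), match.group(2).lower()
        product.weight_g = int(value * 1000) if unit == "kg" else int(value)
    return product


# Hypothetical keyword rules; a real classifier would likely be richer.
RULES = {"protein": ("whey", "protein"), "creatine": ("creatine",)}


def classify(product: Product) -> Product:
    """Assign the first category whose keywords appear in the title."""
    lowered = product.title.lower()
    for category, keywords in RULES.items():
        if any(keyword in lowered for keyword in keywords):
            product.category = category
            break
    return product


def run_pipeline(site: str) -> List[Product]:
    """Extract raw rows for a site, then clean, enrich, and classify each."""
    return [classify(enrich(clean(raw))) for raw in EXTRACTORS[site]()]


if __name__ == "__main__":
    for item in run_pipeline("example-store"):
        print(item)
```

Keeping each step as a plain function from record to record means a new website only needs a new registered extractor, while the transform chain stays unchanged.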

I paused development, but I'm considering reviving it. Feedback and suggestions are welcome.

Repo: https://github.com/GustavoFortti/products-crawler

Show HN: Semcache – I built a semantic cache in Rust

https://github.com/sensoris/semcache
3•jacobhm98•7m ago•0 comments

Meta's Llama 3.1 can recall 42 percent of the first Harry Potter book

https://www.understandingai.org/p/metas-llama-31-can-recall-42-percent
2•aspenmayer•9m ago•1 comment

Investigation into 4chan and its compliance with the Online Safety Act

https://www.ofcom.org.uk/online-safety/illegal-and-harmful-content/investigation-into-4chan-and-its-compliance-with-duties-to-protect-its-users-from-illegal-content
2•intunderflow•9m ago•0 comments

Enterprise AI adoption stalls as inferencing costs confound cloud customers

https://www.theregister.com/2025/06/13/cloud_costs_ai_inferencing/
2•tempodox•15m ago•0 comments

LLMs in Public Health – Part 2

https://joshuaharrissite.substack.com/p/llms-in-public-health-part-2
1•jah242•20m ago•0 comments

The Story of Stuxnet

https://spectrum.ieee.org/the-real-story-of-stuxnet
1•rbanffy•27m ago•0 comments

Show HN: Tool shows why 1.3B people can't use your website

https://accessibility-lens.lovable.app/
2•sobinsamuel•30m ago•0 comments

Datalog in Rust

https://github.com/frankmcsherry/blog/blob/master/posts/2025-06-03.md
9•brson•32m ago•0 comments

The rider and elephant architecture (2024)

https://d-gate.io/blog/rider-and-elephant-architecture
1•dyl000•33m ago•0 comments

UK govt. rollout of Humphrey AI tool raises fears about reliance on big tech

https://www.theguardian.com/technology/2025/jun/15/government-roll-out-humphrey-ai-tool-reliance-big-tech
1•chrisjj•35m ago•1 comment

BO6 Bot lobbies are going crazy

https://github.com/lllyasviel/FramePack/discussions/627
1•bo6mano•40m ago•0 comments

YouTube tests 30-second non-skippable ads in standard campaigns

https://searchengineland.com/youtube-tests-30-second-non-skippable-ads-standard-campaigns-456884
2•raffael_de•42m ago•0 comments

The Latest X.org Server Activity Are a Lot of Code Reverts

https://www.phoronix.com/news/X.Org-Server-Lots-Of-Reverts
4•mikece•43m ago•0 comments

The launch of ChatGPT polluted the world forever

https://www.theregister.com/2025/06/15/ai_model_collapse_pollution/
21•rntn•44m ago•11 comments

The Line of Death

https://textslashplain.com/2017/01/14/the-line-of-death/
2•tentacleuno•48m ago•0 comments

Frenchirix and the Research on How to Learn Languages

https://www.borzov.ca/posts/frenchirix/
1•ingve•50m ago•0 comments

Backup Evernote Tool

https://backup-evernote.chatgpt2notion.com/
1•chatgpt2notion•55m ago•0 comments

An origin trial for a new HTML <permission> element

https://developer.chrome.com/blog/permission-element-origin-trial
1•tentacleuno•56m ago•0 comments

What makes you think you would have been better?

https://medium.com/luminasticity/what-makes-you-think-you-would-have-been-better-600789629752
1•bryanrasmussen•57m ago•0 comments

Show HN: SudoResume – ATS friendly resume builder

https://sudoresume.com
1•prasoon21•1h ago•0 comments

I built an AI tool that writes your back end in 60 seconds (no code needed)

https://pipo360.xyz
2•the_plug•1h ago•1 comment

Show HN: Segment unstructured text over time with auto-discovered dimensions

https://www.correl8.ai/playground
2•romz•1h ago•0 comments

Machine Learning, AI, and Bots O'Reilly 2025 Books Bundle

https://www.humblebundle.com/books/machine-learning-ai-and-bots-oreilly-2025-books
2•laisrast•1h ago•0 comments

The English have become wine producers as well as wine consumers

https://www.economist.com/britain/2025/06/12/the-english-have-become-wine-producers-as-well-as-wine-consumers
1•zeristor•1h ago•1 comment

We Live in a Golden Age of Interoperability

https://borretti.me/article/we-live-in-a-golden-age-of-interoperability
1•aragilar•1h ago•0 comments

"Make in India" Relies on "Made in China"

https://www.hinrichfoundation.com/research/wp/trade-and-geopolitics/make-in-india-relies-on-made-in-china/
2•Ozarkian•1h ago•0 comments

GenAI Image Showdown

https://genai-showdown.specr.net/
1•johnisgood•1h ago•0 comments

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

https://gair-nlp.github.io/cognition-engineering/
2•ddl•1h ago•0 comments

I built an agent framework in 3 Markdown files

https://github.com/EvolvingAgentsLabs/framework-core
2•matiasmolinas•1h ago•1 comment

Your idea probably sucks

https://kyrylo.org/software/2025/06/15/your-idea-probably-sucks.html
2•kyrylo•1h ago•0 comments