news newest ask show jobs

Open Source @Github

fp.

Open in hackernews

ML research datasets from ArXiv and Semantic Scholar (JSONL, quality-scored)

https://huggingface.co/fineset-io

1•dangerlego5•1h ago

Comments

dangerlego5•1h ago

I kept rebuilding the same arXiv scraper at the start of every ML project. After the third time I wrote a dedup pipeline, I automated the whole thing.

The interesting part is that the pipeline is shared; if two people subscribe to the same topic, they share one crawl and one deduplicated record pool. Happy to talk through the pgvector dedup approach if anyone's curious.

Celebrating seven years of the Fairphone 3

https://www.fairphone.com/stories/celebrating-seven-years-of-the-fairphone-3

1•ravenical•1m ago•0 comments

Finplan.me – offline expense tracker for Android no accounts or cloud

https://play.google.com/store/apps/details?id=com.finplan.me.finplan.me&hl=en_US

1•ivarlev•3m ago•0 comments

SpaceX to buy Cursor AI coding agent operator Anysphere for $60B

https://www.reuters.com/legal/transactional/spacex-buy-anysphere-60-billion-2026-06-16/

4•itsmarcelg•4m ago•0 comments

Do you tend to follow the rules that suppliers made?

1•carnoxen•4m ago•0 comments

Starlink ends free dish perks ahead of new Standard and Mini kits launch

https://www.notebookcheck.net/Starlink-ends-free-dish-perks-ahead-of-new-Standard-and-Mini-kits-l...

1•ashitlerferad•7m ago•0 comments

Revisiting the Inmos Transputer – Also in the News

https://www.rs-online.com/designspark/revisiting-the-inmos-transputer

1•rbanffy•8m ago•0 comments

Europe Wants Digital Sovereignty. 2,165 Polish Organisations Show the Gap

https://ciphercue.com/blog/polish-cdn-email-traffic-american-companies-2026

3•adulion•10m ago•2 comments

Inmos and the Transputer – Part 1: Parallel Ventures

https://thechipletter.substack.com/p/inmos-and-the-transputer-part-1-parallel

1•rbanffy•13m ago•0 comments

Cyber-Censorship: Web Censorship Cases Rebound in 2025

https://www.statista.com/chart/35957/number-of-online-censorship-cases-worldwide-social-media-blo...

1•bilekas•13m ago•0 comments

Telescope Ranchers

https://kottke.org/26/06/telescope-ranchers

1•bookofjoe•14m ago•0 comments

The Productive Sovereign

https://greggbarbers.substack.com/p/the-productive-sovereign

1•taivare•15m ago•1 comments

x86 Hypervisors and Emulators: Architecture, Features, and Performance

https://deepresearch.ninja/2026/06/x86-Hypervisors-and-Emulators-Architecture-Features-and-Perfor...

1•scrapemaster•16m ago•0 comments

Show HN: Mittr – Webhook delivery on one Postgres (inbound and outbound)

https://mittr.io/

1•stevewanjohi•16m ago•0 comments

Reeks, Wrecks and Robots

https://www.washingtonpost.com/archive/politics/1982/07/21/reeks-wrecks-and-robots/c3b63ac8-a823-...

1•lebek•17m ago•0 comments

Agentic AI PRs sit in the review queue 5.3x longer than unassisted ones

https://blog.codacy.com/ai-breaking-code-review-how-engineering-teams-survive-pr-bottleneck

1•claudiacsf•18m ago•1 comments

The Unofficial and Home Assistant MCP Server

https://github.com/homeassistant-ai/ha-mcp

1•mooreds•18m ago•0 comments

Show HN: Mcpwn – treating MCP servers as the attack surface they are

https://github.com/D0rs4n/mcpwn

2•thedorsan•20m ago•0 comments

Rh Disease and its Cure (2023)

https://www.thebloodproject.com/hostile-blood-the-forgotten-history-of-rh-disease-and-its-miracul...

1•mooreds•22m ago•0 comments

Have You Noticed JalapeñOS Seem Milder These Days? (2025)

https://www.foodandwine.com/why-jalapenos-have-become-less-spicy-11740201

1•mooreds•23m ago•0 comments

Alibaba unveils AI models for robots, amid shift from chatbots to agents

https://www.reuters.com/world/asia-pacific/alibaba-unveils-ai-models-robots-amid-shift-chatbots-a...

2•giuliomagnifico•25m ago•0 comments

Show HN: Brainfuck but with Turtle Graphics

https://czterycztery.pl/programy/turtlefreeze/

2•fourgreen•25m ago•0 comments

Show HN: A prompt generator to help AI agents implement my new email API

https://emailsdone.dev/#prompt-generator

1•mikeapple•27m ago•0 comments

A Claude Code plugin that makes coding agents measurably cheaper over time

https://github.com/vukkt/token-warden

1•vukkt•29m ago•0 comments

Show HN: Build a timeline of your experiences and generate tailored job pitches

https://www.pitchlikethis.com

1•pranavm27•29m ago•1 comments

Attention Backpropagation: Step by step derivation

https://liyuan24.github.io/writings/attention_backprop.html

1•deadf00d•32m ago•0 comments

Ratchet: An AI Delivery Loop That Can Only Move Forward

https://praveenvijayan.substack.com/p/ratchet-an-ai-delivery-loop-that

1•praveenvijayan•32m ago•0 comments

Skills Are Where Your Judgment Goes

https://geoffstearns.com/blog/skills-are-where-your-judgment-goes/

1•tensafefrogs•32m ago•0 comments

IPO boom mints thousands of new millionaires and Silicon Valley angst

https://www.washingtonpost.com/technology/2026/06/13/ipo-boom-mints-thousands-new-millionaires-si...

2•bookofjoe•35m ago•1 comments

Building a Deep Research Agent That Survives

https://steel.dev/blog/durable-researcher

1•nkko•36m ago•0 comments

Haskell for Elm developers: giving names to stuff (Part 8 – IO)

https://flaviocorpa.com/haskell-for-elm-developers-giving-names-to-stuff-part-8-io.html

2•cekrem•39m ago•0 comments