frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

The Janitor on Mars

https://www.newyorker.com/magazine/1998/10/26/the-janitor-on-mars
1•evo_9•1m ago•0 comments

Bringing Polars to .NET

https://github.com/ErrorLSC/Polars.NET
2•CurtHagenlocher•3m ago•0 comments

Adventures in Guix Packaging

https://nemin.hu/guix-packaging.html
1•todsacerdoti•4m ago•0 comments

Show HN: We had 20 Claude terminals open, so we built Orcha

1•buildingwdavid•5m ago•0 comments

Your Best Thinking Is Wasted on the Wrong Decisions

https://www.iankduncan.com/engineering/2026-02-07-your-best-thinking-is-wasted-on-the-wrong-decis...
1•iand675•5m ago•0 comments

Warcraftcn/UI – UI component library inspired by classic Warcraft III aesthetics

https://www.warcraftcn.com/
1•vyrotek•6m ago•0 comments

Trump Vodka Becomes Available for Pre-Orders

https://www.forbes.com/sites/kirkogunrinde/2025/12/01/trump-vodka-becomes-available-for-pre-order...
1•stopbulying•7m ago•0 comments

Velocity of Money

https://en.wikipedia.org/wiki/Velocity_of_money
1•gurjeet•10m ago•0 comments

Stop building automations. Start running your business

https://www.fluxtopus.com/automate-your-business
1•valboa•14m ago•1 comments

You can't QA your way to the frontier

https://www.scorecard.io/blog/you-cant-qa-your-way-to-the-frontier
1•gk1•15m ago•0 comments

Show HN: PalettePoint – AI color palette generator from text or images

https://palettepoint.com
1•latentio•16m ago•0 comments

Robust and Interactable World Models in Computer Vision [video]

https://www.youtube.com/watch?v=9B4kkaGOozA
2•Anon84•19m ago•0 comments

Nestlé couldn't crack Japan's coffee market.Then they hired a child psychologist

https://twitter.com/BigBrainMkting/status/2019792335509541220
1•rmason•21m ago•0 comments

Notes for February 2-7

https://taoofmac.com/space/notes/2026/02/07/2000
2•rcarmo•22m ago•0 comments

Study confirms experience beats youthful enthusiasm

https://www.theregister.com/2026/02/07/boomers_vs_zoomers_workplace/
2•Willingham•29m ago•0 comments

The Big Hunger by Walter J Miller, Jr. (1952)

https://lauriepenny.substack.com/p/the-big-hunger
2•shervinafshar•30m ago•0 comments

The Genus Amanita

https://www.mushroomexpert.com/amanita.html
1•rolph•35m ago•0 comments

We have broken SHA-1 in practice

https://shattered.io/
9•mooreds•36m ago•2 comments

Ask HN: Was my first management job bad, or is this what management is like?

1•Buttons840•37m ago•0 comments

Ask HN: How to Reduce Time Spent Crimping?

2•pinkmuffinere•38m ago•0 comments

KV Cache Transform Coding for Compact Storage in LLM Inference

https://arxiv.org/abs/2511.01815
1•walterbell•43m ago•0 comments

A quantitative, multimodal wearable bioelectronic device for stress assessment

https://www.nature.com/articles/s41467-025-67747-9
1•PaulHoule•45m ago•0 comments

Why Big Tech Is Throwing Cash into India in Quest for AI Supremacy

https://www.wsj.com/world/india/why-big-tech-is-throwing-cash-into-india-in-quest-for-ai-supremac...
2•saikatsg•45m ago•0 comments

How to shoot yourself in the foot – 2026 edition

https://github.com/aweussom/HowToShootYourselfInTheFoot
2•aweussom•45m ago•0 comments

Eight More Months of Agents

https://crawshaw.io/blog/eight-more-months-of-agents
4•archb•47m ago•0 comments

From Human Thought to Machine Coordination

https://www.psychologytoday.com/us/blog/the-digital-self/202602/from-human-thought-to-machine-coo...
1•walterbell•48m ago•0 comments

The new X API pricing must be a joke

https://developer.x.com/
1•danver0•49m ago•0 comments

Show HN: RMA Dashboard fast SAST results for monorepos (SARIF and triage)

https://rma-dashboard.bukhari-kibuka7.workers.dev/
1•bumahkib7•49m ago•0 comments

Show HN: Source code graphRAG for Java/Kotlin development based on jQAssistant

https://github.com/2015xli/jqassistant-graph-rag
1•artigent•54m ago•0 comments

Python Only Has One Real Competitor

https://mccue.dev/pages/2-6-26-python-competitor
4•dragandj•55m ago•0 comments
Open in hackernews

I built an AI pipeline to analyze every SEC 8-K filing in real-time

https://secwhisperer.com
11•borxtrk•4mo ago

Comments

borxtrk•4mo ago
I got tired of missing material corporate events buried in SEC filings, so I built SEC Whisperer - a system that monitors, downloads, and summarizes 8-K filings using Gemini 2.5 Flash.

  Technical Stack:
  - Python pipeline polling SEC EDGAR API every 2 hours
  - Cloud Run jobs for serverless processing (avoiding cold starts with batch processing)
  - 98% noise reduction on HTML filings before LLM analysis
  - Firebase for real-time publishing to Next.js frontend
  - Gemini with structured JSON output + post-processing to prevent hallucination

  The interesting technical challenges:
  1. SEC filings are massive (40KB+ exhibits). Had to build a sectionizer that
     identifies item boundaries and caps exhibit text at 5KB (770x speedup)
  2. LLMs hallucinate quarters and M&A tags. Solution: deterministic post-processing
     that strips anything not in source text
  3. Filing amendments create tricky supersedes/superseded_by relationships in Firestore

  Live site: https://secwhisperer.com
  Code: Not open source yet, but happy to discuss architecture

  Example output: The site caught Nvidia's $5B Intel deal within minutes of the 
  8-K filing and had AI analysis published before most financial news sites.

  Would love feedback from the HN community - especially on the LLM hallucination 
  prevention patterns. What other techniques are you all using?
golden-face•4mo ago
Can you share any details or samples of the code/prompts especially with regards to "Gemini with structured JSON output" and "LLMs hallucinate quarters and M&A tags. Solution: deterministic post-processing"?

I recently started using Gemini to perform perform classification tasks and I have been struggling with 2 things:

1) Documentation on the input/prompt schema when you want to require structured output 2) How to enforce outputs like "this key-value output must come from the supplied list of key-values"

It is really fascinating to work with this tool if only because it works well on 90% of tasks and then decides to go full stream of consciousness "hello good day, I know this is a horse race on TV but I cannot find the horse race in the list of car manufacturers you supplied" with a random schema for the output.