frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

I tried to buy Dataroma. Now I'm building the research engine I wish existed

2•giorgio_n•13h ago
Hi HN! I'm George, founder of ValueSense (valuesense.io). For the past year, I’ve been working on a platform that helps investors research public companies the way analysts at hedge funds do, but without the spreadsheets, PDFs and 12 browser tabs.

The backstory I used to rely on Dataroma to track superinvestor portfolios (the Buffetts, Ackmans, Klarmans of the world). I liked the simplicity, but as I used it more, I hit major friction: - There’s no conviction scoring, trends or clustering of buys/sells - No institutional or insider context - No real ability to explore investor relations data - UI hasn't changed in over a decade

I actually tried to acquire Dataroma at one point, but the deal didn’t go anywhere. So I started building the tool I wanted.

What I’m building The core idea: a research engine that connects smart money activity with investor relations data and makes it usable.

Here’s what’s already working: 1) Smart Money Signals A pipeline that ingests, cleans and structures data from: - 13F filings — fund-level holdings, position sizes, entry timing - Insider trades — Form 4s parsed for clusters, trends, and volume - Institutional flows — sourced from ownership filings (13G, 13D, NPORT, etc.)

We generate: - Conviction scores — based on % of portfolio, position history, and co-investing behavior - Cluster flags — when multiple insiders or funds pile into a stock at once - Time-series of ownership shifts — visualized by entity and role (e.g. activist, insider, fund)

This is all stored in a PostgreSQL database with event-based indexing and rendered live with a charting engine that uses caching for fast reloads across tickers.

2) IR Intelligence

The other side of public company research is buried in PDFs: earnings decks, segment data, KPIs, commentary etc.

I built a parser that pulls these into a structured format:

- Revenue by segment and geography - Operational KPIs (e.g. Uber trips, Netflix users, Nvidia DC revenue) - Historical earnings slides and management guidance - How Company Makes Money breakdowns

It runs on a data pipeline built in Python + Airflow, pulling from SEC EDGAR, earnings call transcripts and investor websites. All numbers are standardized quarterly and TTM, cleaned, and visualized inside the platform.

My Technical stack Backend: FastAPI, PostgreSQL, Redis ETL: Python, Airflow, BeautifulSoup, custom EDGAR parser Data storage: Postgres for structured financial data; S3 for raw filings & charts Frontend: React + Tailwind, Highcharts for data visualization Infra: GCP + Cloud Run + Supabase auth AI: LLMs used for DCF templating, narrative parsing, and user-defined screeners

What I’m working on next:

Letting users ask questions like: “Which stocks have both rising insider buys and top-line revenue growth?” “What did Ackman add last quarter that others didn’t?” This is a combo of natural language → SQL generation and curated filters. DCF and valuation models that users can tweak, save, and share AI research agents trained on historical investor letters, filings, and segment data

I’m not launching publicly yet — just shipping core modules and talking to early users. But I wanted to share here because:

- A lot of folks on HN manage personal portfolios and feel the same frustration - Many financial tools today are either too surface-level (Yahoo Finance) or too expensive (Bloomberg)

If anyone’s built similar data pipelines, financial tools, or research systems — I’d love to trade notes

Also, if you’re building a fintech product or have thoughts on data infrastructure, LLMs for research, or public markets — I’d love to hear how you’d make this better.

Try it (early version): https://valuesense.io Email me: george@valuesense.io

Happy to answer anything!

Ancient DNA: surprising genetic links between early Egyptians and Mesopotamians

https://www.cnn.com/2025/07/02/science/ancient-egyptian-genome-sequenced
1•karlperera•1m ago•1 comments

Show HN: Dfembed is a Rust-powered Python lib turning DataFrames into vector db

https://github.com/a-agmon/dfembeder
1•alonagmon•4m ago•0 comments

How are you handling Git branching for database migrations?

https://www.harness.io/blog/how-git-strategy-can-break-your-database-pipeline
1•sonichigo•5m ago•1 comments

AI's 16:1 capex ratio: bubble or moat?

https://fluxus.io/article/why-the-ai-bubble-isnt-a-bubble
1•dreamfactored•6m ago•0 comments

[deleted by user]

https://harness.io/blog/gitops-branching-for-database-devops
1•sonichigo•10m ago•1 comments

Analysis Shows Competitive LCOE Target for Small Modular Reactors

https://www.nucnet.org/news/analysis-shows-competitive-lcoe-target-for-small-modular-reactors-7-3-2025
1•mpweiher•14m ago•0 comments

Show HN: I built a free backlink exchange marketplace

https://launchigniter.com/link-exchange
1•maulikdhameliya•15m ago•0 comments

Bitchat Mesh

https://apps.apple.com/us/app/bitchat-mesh/id6748219622
2•doener•17m ago•0 comments

Apisix Integration with AI/ML API

https://apisix.apache.org/blog/2025/07/29/announcing-integration-of-apisix-and-ai-ml-api/
1•Yilialinn•18m ago•0 comments

Automatic A2A Service Discovery in Kubernetes with Inference Gateway

https://github.com/inference-gateway/inference-gateway/tree/main/examples/kubernetes/a2a
1•edenr•18m ago•1 comments

The Online Safety Act for forum and blog owners

https://successfulsoftware.net/2025/07/29/the-online-safety-act-for-forum-owners/
1•hermitcrab•19m ago•1 comments

Most Watched Software Engineering Talks Of 2025 (so far)

https://www.techtalksweekly.io/p/50-most-watched-software-engineering
3•hal918•19m ago•0 comments

Parity of Zero

https://en.wikipedia.org/wiki/Parity_of_zero
1•derdi•25m ago•2 comments

Hypercube 3d ultimate tic tac toe

https://dhkts1.github.io/ultimate-nd-tictactoe-3d/
1•dhkts1•28m ago•0 comments

Tell HN: NISAR Satellite to Launch Today

1•_448•28m ago•0 comments

New battery manufacturer with European software: GAZ Energy

https://www.ess-news.com/2025/07/28/new-battery-manufacturer-with-european-software-gaz-energy-builds-factory-in-czech-republic/
1•doener•28m ago•0 comments

Nostr Auth Provider · clerk · Discussion #6435

https://github.com/orgs/clerk/discussions/6435
2•kehiy•34m ago•0 comments

Show HN: Deno is amazing. I built a toy TUI text editor to make sure of that

https://github.com/eu-ge-ne/toy
1•eu-ge-ne•34m ago•0 comments

Happy 20th Birthday MDN

https://web.dev/blog/mdn-birthday
2•feross•36m ago•0 comments

Do LLMs Identify Fonts?

https://maxhalford.github.io/blog/llm-font-identification/
4•Lemaxoxo•37m ago•1 comments

The Torch of Terrorism (1994)

https://time.com/archive/6726261/the-torch-of-terrorism/
2•thomassmith65•38m ago•0 comments

Decoding the Chinese Computer

https://www.sixthtone.com/news/1017405
3•sohkamyung•38m ago•0 comments

YouTube to be included in Australia's teen social media ban

https://www.bbc.com/news/articles/cpv0zkxx0njo
3•nojs•39m ago•0 comments

The chaos and confusion of itch.io and Steam's abrupt adult game ban

https://www.theverge.com/games/715299/itchio-games-delisting-payment-processor-paypal
2•isaacfrond•42m ago•0 comments

Intra-procedural lifetime and borrowing analysis in Clang

https://discourse.llvm.org/t/rfc-intra-procedural-lifetime-analysis-in-clang/86291
2•fanf2•42m ago•0 comments

Dead Internet Theory becomes more real – Now anyone can start botting easily

https://twitter.com/ArtusVranken/status/1950476396033175721
2•reeeeee•43m ago•1 comments

Seriously, Why Do Some AI Chatbot Subscriptions Cost More Than $200?

https://www.wired.com/story/seriously-why-do-some-ai-chatbot-subscriptions-cost-more-than-200/
11•isaacfrond•47m ago•3 comments

Show HN: I built a local AI assistant as a browser extension (zero cloud)

https://github.com/NativeMindBrowser/NativeMindExtension
3•kaylakay•47m ago•0 comments

Sleep all comes down to the mitochondria

https://www.science.org/content/blog-post/it-all-comes-down-mitochondria
3•A_D_E_P_T•50m ago•1 comments

Nvidia CEO Jensen Huang Sells $27.6M in Stock over Five Days

https://techgraph.co/stock-market/nvidia-ceo-jensen-huang-sells-27-6-million-in-stock-over-five-days/
3•visitednews•57m ago•1 comments