frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Python notebook of Princeton GraphMERT Paper – a better knowledge graph

https://github.com/creativeautomaton/graphMERT-python
2•7jewve5rws•2h ago

Comments

7jewve5rws•2h ago
For those interested in a new knowledge graph model, I implemented the Princeton work on GraphMERT into a python notebook for experimentation.

abstract: The notebook uses a Romeo and Juliet corpus text that is embedded with a sentencetransformers model then trained with to build the GraphMERT model which is used to build the knowledge graph in a GraphRAG inference setup.

For a detailed review of the GraphMERT paper watch this great european youtuber: discover ai - https://www.youtube.com/watch?v=xh6R2WR49yM&t=1s

About GraphMERT:

GraphMERT: Efficient and Scalable Distillation of Reliable Knowledge Graphs from Unstructured Data Margarita Belova, Jiaxin Xiao, Shikhar Tuli, Niraj K. Jha

    Researchers have pursued neurosymbolic artificial intelligence (AI) applications for nearly three decades because symbolic components provide abstraction while neural components provide generalization. Thus, a marriage of the two components can lead to rapid advancements in AI. Yet, the field has not realized this promise since most neurosymbolic AI frameworks fail to scale. In addition, the implicit representations and approximate reasoning of neural approaches limit interpretability and trust. Knowledge graphs (KGs), a gold-standard representation of explicit semantic knowledge, can address the symbolic side. However, automatically deriving reliable KGs from text corpora has remained an open problem. We address these challenges by introducing GraphMERT, a tiny graphical encoder-only model that distills high-quality KGs from unstructured text corpora and its own internal representations. GraphMERT and its equivalent KG form a modular neurosymbolic stack: neural learning of abstractions; symbolic KGs for verifiable reasoning. GraphMERT + KG is the first efficient and scalable neurosymbolic model to achieve state-of-the-art benchmark accuracy along with superior symbolic representations relative to baselines.
    Concretely, we target reliable domain-specific KGs that are both (1) factual (with provenance) and (2) valid (ontology-consistent relations with domain-appropriate semantics). When a large language model (LLM), e.g., Qwen3-32B, generates domain-specific KGs, it falls short on reliability due to prompt sensitivity, shallow domain expertise, and hallucinated relations. On text obtained from PubMed papers on diabetes, our 80M-parameter GraphMERT yields a KG with a 69.8% FActScore; a 32B-parameter baseline LLM yields a KG that achieves only 40.2% FActScore. The GraphMERT KG also attains a higher ValidityScore of 68.8%, versus 43.0% for the LLM baseline.

Nuclear fusion, the 'holy grail' of power

https://fortune.com/2025/10/02/nuclear-fusion-online-commercial-ai-power/
1•measurablefunc•3m ago•0 comments

Will the explainer post go extinct?

https://dynomight.net/explainers/
1•Curiositry•3m ago•0 comments

Where are we on XChat security?

https://mjg59.dreamwidth.org/73625.html
1•bariumbitmap•4m ago•0 comments

iOS 26.1 beta transparency toggle changes liquid glass

https://www.macrumors.com/2025/10/20/ios-26-1-transparency-option-liquid-glass/
1•danielsht•5m ago•0 comments

The GUI S-curve is peaking

https://twitter.com/theOpusLABS/status/1978872762161590549
1•opuslabs•8m ago•0 comments

The Cost of Cloud, a Trillion Dollar Paradox (2021)

https://a16z.com/the-cost-of-cloud-a-trillion-dollar-paradox/
1•gregsadetsky•8m ago•1 comments

Kohler's Dekoda Toilet Camera

https://www.kohlerhealth.com/dekoda/
1•zdw•9m ago•1 comments

China Went from Clean Energy Copycat to Global Innovator

https://www.nytimes.com/interactive/2025/08/14/climate/china-clean-energy-patents.html
1•alphabetatango•10m ago•0 comments

NORAD's Cheyenne Mountain Combat Center, C.1966

https://flashbak.com/norad-cheyenne-mountain-combat-center-478804/
2•zdw•13m ago•0 comments

Start an AI PhD Now

https://jasonppy.github.io/story/best-time-AI-phd/
1•hedgehog0•20m ago•0 comments

US NSA alleged to have launched a cyber attack on Chinese timekeeping agency

https://www.csoonline.com/article/4075846/us-nsa-alleged-to-have-launched-a-cyber-attack-on-a-chi...
2•mmooss•22m ago•1 comments

We Built WebSocket Servers for Vercel Functions

https://www.rivet.dev/blog/2025-10-20-how-we-built-websocket-servers-for-vercel-functions/
1•Bogdanp•22m ago•0 comments

BlackRock Says Insurers Expect to Keep Ramping Up Private Bets

https://www.bloomberg.com/news/articles/2025-10-21/blackrock-says-insurers-expect-to-keep-ramping...
1•zerosizedweasle•24m ago•1 comments

It was a weather balloon, not space debris, that struck a United Airlines plane

https://arstechnica.com/space/2025/10/the-mystery-object-that-struck-a-plane-in-flight-it-was-pro...
2•hughes•28m ago•0 comments

It was DNS

https://www.redshirtjeff.com/shop/p/it-was-dns-shirt
4•corvad•30m ago•0 comments

The breach that broke the internet: The untold story of Log4Shell

https://github.blog/open-source/inside-the-breach-that-broke-the-internet-the-untold-story-of-log...
1•quentinp•34m ago•0 comments

I Could Have Lived Without AI

https://www.mindprison.cc/p/i-could-have-lived-without-ai
4•13years•49m ago•1 comments

U.S. Banks Are Hunting for Collateral to Back $20B Argentina Bailout

https://www.wsj.com/finance/argentina-bailout-banks-collateral-721bc2b5
4•JumpCrisscross•51m ago•1 comments

Free Seedream 4.0 – No Login Required

https://www.seedream4free.com
1•cnych•55m ago•0 comments

Sam Altman got Silicon Valley's giants to tether their fates to his company

https://www.wsj.com/tech/ai/sam-altman-open-ai-nvidia-deals-d10a6525
5•zerosizedweasle•57m ago•4 comments

IKEA Phone Bed

https://qz.com/ikea-miniature-bed-for-smartphone-phone-sleep-collection
2•praving5•59m ago•0 comments

Ask HN: What books are you reading now?

5•hellohihello135•1h ago•8 comments

Ask HN: What software dev tasks have you found LLMs to be good at versus bad at?

3•ronbenton•1h ago•0 comments

Proposed DNS RFC 8767: Serving Stale Data to Improve DNS Resiliency (2020)

https://datatracker.ietf.org/doc/html/rfc8767
1•antimatter15•1h ago•0 comments

Show HN: WatchDoggo – simple open-source service status monitor

https://github.com/zyra-engineering-ltda/watch-doggo/tree/v0.0.1
1•mcloide1942•1h ago•0 comments

Why 'Functor' Doesn't Matter (2019)

https://www.parsonsmatt.org/2019/08/30/why_functor_doesnt_matter.html
1•signa11•1h ago•1 comments

Sonoluminescence

https://en.wikipedia.org/wiki/Sonoluminescence
4•pmalynin•1h ago•0 comments

Fundraiser with Safe Using Stripe Atlas

https://docs.stripe.com/atlas/fundraise-with-safes
3•tzury•1h ago•1 comments

NobelBiz – Erlang/OTP and Elassandra/Cassandra|Full-Time|Remote|80K-100K USD

1•DarthAppleCider•1h ago•0 comments

The Great Crown Caper – Two crowns, one crime, one unsolved mystery

https://fightingfor.nd.edu/stories/the-great-crown-caper/
1•b_mc2•1h ago•0 comments