Show HN: TheorIA – An Open Curated Physics Dataset (Equations, Explanations, JSON)

https://theoria-dataset.github.io/theoria-dataset/
7•ManuelSH•17h ago
We’re building TheorIA, an open, high-quality dataset of theoretical physics results (equations, derivations, definitions, and explanations), all in structured, machine- and human-readable JSON.

Why? Physics is rich with beautiful, formal results, but most of them are trapped in PDFs, LaTeX, or lecture notes. That makes it hard to:

- train symbolic/physics-aware ML models,

- build derivation-checking tools,

- or even just teach physics interactively.

TheorIA fills that gap. Each entry includes:

- A result name (e.g., Lorentz transformations)

- Clean equations (AsciiMath)

- A straightforward step-by-step derivation with reasoning

- Symbol definitions and assumptions

- Programmatic validation using SymPy

- References, arXiv-style domain tags, and contributor metadata

Everything is in open, self-contained JSON files. No scraping, no PDFs, just clear structured data for physics learners, teachers, and ML devs.
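As a rough illustration of the entry structure described above, here is a minimal Python sketch that loads one entry and checks that the expected parts are present. The field names (`result_name`, `equations`, `derivation`, and so on), the equation strings, and the domain tag are assumptions made for this example; they are not taken from the actual TheorIA schema, and only the categories of information come from the description above.

```python
import json

# Hypothetical field names and types; the real TheorIA schema may differ.
REQUIRED_FIELDS = {
    "result_name": str,
    "equations": list,
    "derivation": list,
    "definitions": list,
    "assumptions": list,
    "references": list,
    "domain": str,
}

# Illustrative entry. "references" is left empty rather than inventing a citation.
ENTRY_JSON = """
{
  "result_name": "Lorentz transformations",
  "equations": ["x' = gamma (x - v t)", "t' = gamma (t - v x / c^2)"],
  "derivation": [
    {"step": 1,
     "equation": "gamma = 1 / sqrt(1 - v^2 / c^2)",
     "reasoning": "Definition of the Lorentz factor."}
  ],
  "definitions": [{"symbol": "c", "meaning": "Speed of light in vacuum"}],
  "assumptions": ["Inertial frames in standard configuration"],
  "references": [],
  "domain": "physics.class-ph"
}
"""


def missing_or_mistyped(entry: dict) -> list[str]:
    """Return the names of required fields that are absent or of the wrong type."""
    return [
        name
        for name, kind in REQUIRED_FIELDS.items()
        if not isinstance(entry.get(name), kind)
    ]


entry = json.loads(ENTRY_JSON)
problems = missing_or_mistyped(entry)
print(problems)
```

A check like this only confirms the shape of an entry. Verifying that each derivation step actually follows from the previous one is the job of the SymPy validation mentioned in the list, which this sketch does not attempt.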

Contributors Wanted: We’re tiny right now and trying to grow. If you’re into physics or symbolic ML:

- Add an entry (any result you love)

- Review others' derivations

- Build tools on top of the dataset

GitHub: https://github.com/theoria-dataset/theoria-dataset/

Licensed under CC BY 4.0. We welcome educators, students, ML people, or just anyone who thinks physics deserves better data.

Comments

somethingsome•11h ago
There are only 3 entries, am I correct?
ManuelSH•8h ago
Yes, we are at a very early stage. We are looking for other physics experts to help grow it.
somethingsome•5h ago
I like the idea of having a dataset for physics, but those entries are very basic. Most of physics happens with very complicated maths, and it will be difficult to make an entry for a lot of it.

For example, imagine the entry for the standard equation: should the whole derivation and symbolic implementation be done as a single entry? It will be difficult to separate it into logical entries that reference each other, and many physical ideas are fundamentally different, leading to divergences.

I have the impression that it would be easier to just parse reference books and format each paragraph or section as an entry, and maybe build a graph (treating the reference book as authoritative on the subject).
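One way to picture entries that reference each other is as a dependency graph. Below is a minimal Python sketch, assuming each entry lists the entries it builds on. The entry names and the `depends_on` mapping are made up for illustration and do not describe TheorIA's actual data or any particular reference book.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical entries, each mapped to the set of entries it depends on.
depends_on = {
    "special_relativity_postulates": set(),
    "lorentz_transformations": {"special_relativity_postulates"},
    "time_dilation": {"lorentz_transformations"},
    "length_contraction": {"lorentz_transformations"},
}


def derivation_order(graph: dict[str, set[str]]) -> list[str]:
    """Order entries so that each one comes after everything it depends on."""
    return list(TopologicalSorter(graph).static_order())


def is_acyclic(graph: dict[str, set[str]]) -> bool:
    """Return False if entries reference each other in a circle."""
    try:
        derivation_order(graph)
        return True
    except CycleError:
        return False


order = derivation_order(depends_on)
print(order)
```

A topological order gives a reading sequence in which prerequisites come first, and the cycle check flags circular references between entries. It does not address the harder problem raised above of deciding where one entry should end and the next begin.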
