JSON-LD Explained for Personal Websites

https://hawksley.dev/blog/json-ld-explained-for-personal-websites/

79•ethanhawksley•2h ago

Comments

mananaysiempre•1h ago

A bit disappointing that (IIUC) for the common parsers you have to say everything twice, in HTML and in the accompanying JSON-LD form even though RDFa exists for the exact purpose of letting you point at the values already present in your markup. (Admittedly RDFa is perhaps too flexible for its own good when you just want to mark up some stuff, but if you’re writing a full parser anyway dealing with a bit of excessive cleverness in the format should not be too bad.)

panzi•1h ago

And then there is https://schema.org/ It's the item* attributes, e.g.: https://developer.mozilla.org/en-US/docs/Web/HTML/Reference/... Also Dublin Core in <meta> tags. Why do they keep adding conflicting meta data formats to HTML!?!

9dev•1h ago

https://xkcd.com/927/

captn3m0•1h ago

There is also microformats.

klodolph•53m ago

I think if you are using Dublin Core, it’s because you’re a library. Maybe I am off the mark, but that is the sense I get from this—not all these standards should be used for all pages on the web.

I think you should just think about what metadata you actually care about, and the main metadata I care about (choose your own list) is authorship, publish date, last update, subject keywords, thumbnail (OpenGraph 1200x630), and summary.

There’s a long list of additional metadata that I could put in my webpages because there are standardized ways to do it, but, why bother?

jauco•41m ago

To be fair, schema.org and Dublin Core say “when a property is named ‘title’ it means …” and you can expect to find the following properties…

Json-ld says: if you want to know whether the “title” property means the schema.org or the dublin core variant then you can find out which it is by <json-ld algorithm>

So you’d always use json-ld _with_ schema.org or something.

alwillis•39m ago

They don't conflict; they were designed to work together. You can have schema.org (in JSON-LD, RDFa, or microdata) on the same page as Dublin Core, etc.

For example, there's no explicit property in schema's Person type [1] for a nickname. But the FOAF standard has one [2].

Just add FOAF to the JSON-LD context:

    {
      "@context": {
        "@vocab": "https://schema.org/",
        "foaf": "http://xmlns.com/foaf/0.1/",
        "pronouns": "https://schema.org/pronouns"
      },

You can now use the FOAF nickname property:

      "@type": "Person",
      "givenName": "Timothy",
      "familyName": "Berners-Lee",
      "foaf:nick": "TBL"
    }

You can do the same thing with Dublin Core, DBPedia, etc.

[1]: https://schema.org/Person

[2]: https://xmlns.com/foaf/spec/#term_nick
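As a rough illustration of how a context like the one above turns short property names into full IRIs, here is a minimal Python sketch. It is a deliberately simplified stand-in for the real JSON-LD expansion algorithm (which also handles nested contexts, `@type` values, value objects, and more); `expand_term` is a hypothetical helper name, not part of any library.

```python
def expand_term(term, context):
    """Resolve a property name to a full IRI using a flat @context.

    Simplified on purpose: real JSON-LD expansion covers many more cases.
    """
    # Keywords such as "@type" and absolute IRIs are left untouched.
    if term.startswith("@") or "://" in term:
        return term
    # An exact term definition in the context wins.
    if term in context:
        return context[term]
    # A compact IRI such as "foaf:nick" is resolved through its prefix.
    prefix, sep, suffix = term.partition(":")
    if sep and prefix in context:
        return context[prefix] + suffix
    # Everything else falls back to the default vocabulary, if one is set.
    vocab = context.get("@vocab")
    return vocab + term if vocab else term


context = {
    "@vocab": "https://schema.org/",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "pronouns": "https://schema.org/pronouns",
}

person = {
    "@type": "Person",
    "givenName": "Timothy",
    "familyName": "Berners-Lee",
    "foaf:nick": "TBL",
}

# Only the keys are expanded here; values are carried over unchanged.
expanded = {expand_term(key, context): value for key, value in person.items()}
```

With this context, `givenName` resolves against the schema.org vocabulary while `foaf:nick` resolves against the FOAF namespace, which is why the two vocabularies can share one document.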

klodolph•1h ago

IMO this is going overboard. Any time you are duplicating data from HTML into JSON-LD, consider just omitting that data from JSON-LD, unless the data isn’t consistently present in HTML (because it is a bitch to be consistent about this stuff).

I tried using RDFa and liked the property that it was theoretically less redundant, but switched to JSON-LD because JSON-LD is just easier to get working. And this is speaking as somebody who uses a hand-rolled static site generator. The issue here is that whether information is present in the raw HTML is something contextual, and if something isn’t present in the HTML then you need to put it somewhere else or it’s not mechanically parseable from the page. Like, to a human reader, a post on “Alice’s Blog” is assumed to be authored by Alice, so I may omit the “by Alice” text from the document, and then I would want to put that metadata in the page some other way.

Putting the metadata in JSON-LD lets me just be dumb about it. The metadata is always in JSON-LD, and the HTML may or may not contain an explicit representation of that same metadata. Easy.

But the JSON-LD does not need to contain the URL of the page (which is <link rel=canonical>) or the title (which is in <title>), for example.
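A minimal sketch of that approach in Python, assuming a static site generator that already holds per-post metadata in a dict. The function name `render_json_ld` and the metadata keys (`author`, `published`, `updated`, `keywords`) are made up for illustration; the schema.org type and property names are real. The URL and title are left out on purpose, since the page already carries them.

```python
import json


def render_json_ld(meta):
    """Build a JSON-LD <script> element from a post's metadata dict."""
    data = {
        "@context": "https://schema.org",
        "@type": "BlogPosting",
        "author": {"@type": "Person", "name": meta["author"]},
        "datePublished": meta["published"],
        "dateModified": meta["updated"],
        "keywords": meta["keywords"],
    }
    payload = json.dumps(data, indent=2)
    # Escape "</" so no string value can close the script element early.
    payload = payload.replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{payload}\n</script>'


# Hypothetical metadata for one post on "Alice's Blog".
post = {
    "author": "Alice",
    "published": "2024-03-01",
    "updated": "2024-03-05",
    "keywords": ["json-ld", "metadata"],
}

snippet = render_json_ld(post)
```

The author is always emitted here, whether or not the visible HTML says "by Alice", which is the "be dumb about it" property described above.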

alwillis•33m ago

> I tried using RDFa and liked the property that it was theoretically less redundant, but switched to JSON-LD because JSON-LD is just easier to get working.

For me, it depends on the project. For personal projects, I tend to use RDFa; otherwise, JSON-LD.

lenkite•1h ago

We have semantic HTML, but for some weird reason we need to yet again re-express the semantic meaning of our website in bespoke weird JSON in a script tag that the browser won't process.

klodolph•1h ago

I have used JSON-LD in my own websites and found that it fills a separate need from semantic HTML. Your semantic HTML will specify things that the browser processes, like the title and headings. The JSON-LD data is metadata, like date created, date updated, tags, authorship. These things can be expressed in the HTML using microdata, but I stopped using microdata because JSON-LD was easier.

The JSON-LD I populate from the same data that I use to generate my site, and I use the JSON-LD metadata to generate things like index pages (list of blog posts from 2024, all posts related to topic X, etc). The main consumers of JSON-LD are search engines.
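One possible reading of the index-page idea, sketched in Python with only the standard library: pull the JSON-LD back out of each generated page and group posts by publication year. This is an assumption about how such a pipeline could work, not a description of the setup above; `JsonLdExtractor`, `index_by_year`, and the sample pages are all hypothetical.

```python
import json
from collections import defaultdict
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collect the parsed contents of every application/ld+json script."""

    def __init__(self):
        super().__init__()
        self._in_json_ld = False
        self._chunks = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_json_ld = True
            self._chunks = []

    def handle_data(self, data):
        if self._in_json_ld:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_ld:
            self.items.append(json.loads("".join(self._chunks)))
            self._in_json_ld = False


def index_by_year(pages):
    """Map publication year -> list of page URLs, based on datePublished."""
    index = defaultdict(list)
    for url, html in pages.items():
        parser = JsonLdExtractor()
        parser.feed(html)
        parser.close()
        for item in parser.items:
            year = item.get("datePublished", "")[:4]
            if year:
                index[year].append(url)
    return dict(index)


def page(date):
    # Tiny stand-in for a generated page; real pages carry far more markup.
    data = json.dumps({"@type": "BlogPosting", "datePublished": date})
    return f'<html><head><script type="application/ld+json">{data}</script></head></html>'


pages = {
    "/posts/a/": page("2024-03-01"),
    "/posts/b/": page("2023-11-20"),
    "/posts/c/": page("2024-07-15"),
}

index = index_by_year(pages)
```

A generator that already has the source metadata in memory could skip the parsing step and build the same index directly; the round trip through HTML mainly shows that the embedded metadata is mechanically recoverable.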

If you are interested in getting offended, then think about how we are also putting OpenGraph metadata in our web pages. Two different metadata formats for the same page.

tommica•45m ago

Structured data exists to pass the metadata. The issue with it is that it might impact the way your HTML needs to be structured, which can be messy.

rglullis•1h ago

What I see as the ideal would be a world where servers and browsers could do content negotiation, with browsers first attempting to request only the JSON-LD from the website and rendering it in their own internal format.
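On the client side, that kind of negotiation could look roughly like the Python sketch below, which only builds the request and does not send it. `application/ld+json` is the registered media type for JSON-LD; the URL, the fallback weight, and the helper name `json_ld_request` are illustrative assumptions, and a server would have to actually honor the `Accept` header for any of this to work.

```python
import urllib.request


def json_ld_request(url):
    """Build a request that prefers JSON-LD and falls back to HTML."""
    return urllib.request.Request(
        url,
        headers={"Accept": "application/ld+json, text/html;q=0.8"},
    )


# Hypothetical URL; nothing is sent over the network here.
req = json_ld_request("https://example.com/blog/post/")
```

Passing `req` to `urllib.request.urlopen` would send it; a client could then check the response's `Content-Type` to see whether it got JSON-LD or plain HTML back.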

_heimdall•30m ago

tommica•43m ago

Super useful article, wish that had existed in my seo days.

I had misunderstood the type field, because to me I was often just linking to a webpage, even if it is for a saas, the marketing page is still a webpage.

klodolph•39m ago

I would encourage people who have a pragmatic bent to read about JSON-LD in the Google documentation for web sites:

https://developers.google.com/search/docs/appearance/structu...

You’ll also notice that a lot of the information is relevant to only a small subset of sites. Rotten Tomatoes can publish the critic rating for movies using JSON-LD, but that’s not relevant for me (even if I write a review for a movie).

JSON-LD is nice because it’s easy and it is actually used by search engines. Yes, it can duplicate information in the web page itself, but I think the dream of perfectly annotating information so it only appears exactly once in your document is, well, a dream of spherical cows and massless ropes. It takes human effort to make a webpage and I am ok with a little duplication in the final product. My <h1> duplicates information in <title> anyway.

JdeBP•28m ago

> It can aid web crawlers in understanding the semantic structure of your site, qualifying you for richer link previews, and even potentially improving your search ranking.

This is fighting the last war, to stretch a metaphor.

As far as I and my WWW site are concerned, Google has nowadays switched to giving people lengthy LLM-generated versions of my stuff, with errors, above pointing people to my actual stuff. 'Breadcrumbs' and getting a pretty display name instead of the domain name, don't address the fact that Google de-prioritizes all of that, pretty tweaks or no, nowadays.

This is a lot of effort for stuff that people visiting my actual site directly will never see, and which people using Google will not find above the fold of its own massively LLM-ized version of stuff.

gomoboo•14m ago

Do these attributes actually help with search engine visibility or do they just make it easier for search engines to keep users from leaving the search page? Honest question here.
