Built this originally as a small competitive game; it then turned into a useful prompt-engineering practice loop.
Core mechanic: the user sees a target image, writes a prompt, the model generates an image, and we score how closely the result matches the target.
Scoring uses multiple signals so one metric doesn’t dominate:
1. Semantic alignment (CLIP)
- user_prompt -> target_image (is the prompt conceptually aligned with target?)
- user_image -> target_image (is the generated result semantically aligned with target?)
2. Prompt faithfulness (CLIP)
- user_prompt -> user_image (did generation actually follow the submitted prompt?)
3. Color similarity
- HSV histogram overlap (user_image vs target_image) for palette/tone distribution
4. Structure similarity
- HOG-lite gradient/orientation comparison (user_image vs target_image) for layout/edge composition
The final score is a weighted blend of these signals (the content/semantic signals weighted highest), normalized to player-facing points; a rough sketch follows.
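For concreteness, here's a minimal sketch of how the signals above could be computed and blended. Assumptions (not from the post): Hugging Face transformers for CLIP, OpenCV for the HSV histograms, scikit-image's standard HOG as a stand-in for "HOG-lite", and placeholder weights rather than the game's calibrated values.

```python
# Sketch of the multi-signal scorer. Assumptions (not from the post): Hugging Face
# transformers for CLIP, OpenCV for HSV histograms, scikit-image's standard HOG as a
# stand-in for "HOG-lite", and placeholder weights.
import cv2
import numpy as np
import torch
from PIL import Image
from skimage.feature import hog
from transformers import CLIPModel, CLIPProcessor

MODEL_ID = "openai/clip-vit-base-patch32"  # assumed checkpoint
model = CLIPModel.from_pretrained(MODEL_ID).eval()
processor = CLIPProcessor.from_pretrained(MODEL_ID)


def clip_text_image(prompt: str, image: Image.Image) -> float:
    """Cosine similarity between a prompt and an image in CLIP embedding space."""
    with torch.no_grad():
        t = model.get_text_features(**processor(text=[prompt], return_tensors="pt", padding=True))
        i = model.get_image_features(**processor(images=image, return_tensors="pt"))
    return torch.nn.functional.cosine_similarity(t, i).item()


def clip_image_image(a: Image.Image, b: Image.Image) -> float:
    """Cosine similarity between two images in CLIP embedding space."""
    with torch.no_grad():
        fa = model.get_image_features(**processor(images=a, return_tensors="pt"))
        fb = model.get_image_features(**processor(images=b, return_tensors="pt"))
    return torch.nn.functional.cosine_similarity(fa, fb).item()


def hsv_hist_similarity(a_bgr: np.ndarray, b_bgr: np.ndarray) -> float:
    """Hue/saturation histogram intersection, normalized to [0, 1] (palette overlap)."""
    def hist(img):
        hsv = cv2.cvtColor(img, cv2.COLOR_BGR2HSV)
        h = cv2.calcHist([hsv], [0, 1], None, [50, 60], [0, 180, 0, 256])
        return cv2.normalize(h, h, 0, 1, cv2.NORM_MINMAX)
    ha, hb = hist(a_bgr), hist(b_bgr)
    return cv2.compareHist(ha, hb, cv2.HISTCMP_INTERSECT) / max(float(ha.sum()), 1e-6)


def hog_similarity(a_bgr: np.ndarray, b_bgr: np.ndarray, size=(256, 256)) -> float:
    """Cosine similarity of coarse HOG descriptors (edge/layout composition)."""
    def descriptor(img):
        gray = cv2.cvtColor(cv2.resize(img, size), cv2.COLOR_BGR2GRAY)
        return hog(gray, orientations=9, pixels_per_cell=(32, 32),
                   cells_per_block=(2, 2), feature_vector=True)
    da, db = descriptor(a_bgr), descriptor(b_bgr)
    return float(np.dot(da, db) / (np.linalg.norm(da) * np.linalg.norm(db) + 1e-8))


def final_score(prompt, user_img, target_img, user_bgr, target_bgr) -> float:
    """Weighted blend of the five raw similarities, scaled to 0-100 points.
    Weights are placeholders; the content/semantic signals carry the most mass."""
    signals = {
        "prompt_vs_target": clip_text_image(prompt, target_img),
        "image_vs_target": clip_image_image(user_img, target_img),
        "prompt_vs_image": clip_text_image(prompt, user_img),
        "color": hsv_hist_similarity(user_bgr, target_bgr),
        "structure": hog_similarity(user_bgr, target_bgr),
    }
    weights = {"prompt_vs_target": 0.25, "image_vs_target": 0.35,
               "prompt_vs_image": 0.15, "color": 0.10, "structure": 0.15}
    blended = sum(weights[k] * signals[k] for k in signals)
    return round(100 * max(0.0, min(1.0, blended)), 1)
```

One calibration note: raw CLIP cosines tend to cluster in a fairly narrow band, so in practice each signal would likely need rescaling (min/max or z-score against a reference set) before blending, which is exactly the weighting/calibration question below.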
Why this approach:
- CLIP-only can overrate semantically related but visually off outputs
- color-only ignores structure/meaning
- structure-only misses semantics/style
- combining prompt-image and image-image signals reduced obvious false positives in ranking
Stack:
- Spring Boot backend
- separate CLIP scoring container (rough interface sketched after this list)
- external image generation service
- Next.js frontend
- PostgreSQL
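And a minimal sketch of how the scoring container could expose that blend over HTTP, assuming FastAPI with a base64 JSON payload; the endpoint name, payload shape, and the `scorer` module are invented for illustration, since the post doesn't say how the container is actually called by the backend.

```python
# Hypothetical HTTP surface for the scoring container (FastAPI assumed; the post
# does not specify the framework or the contract with the Spring Boot backend).
import base64
import io

import numpy as np
from fastapi import FastAPI
from PIL import Image
from pydantic import BaseModel

from scorer import final_score  # the blend sketched above, assumed to live in scorer.py

app = FastAPI()


class ScoreRequest(BaseModel):
    prompt: str
    user_image_b64: str    # generated image, base64-encoded
    target_image_b64: str  # round's target image, base64-encoded


class ScoreResponse(BaseModel):
    points: float


def decode(b64: str) -> Image.Image:
    return Image.open(io.BytesIO(base64.b64decode(b64))).convert("RGB")


@app.post("/score", response_model=ScoreResponse)
def score(req: ScoreRequest) -> ScoreResponse:
    user_img = decode(req.user_image_b64)
    target_img = decode(req.target_image_b64)
    # OpenCV/HOG signals want BGR arrays; PIL gives RGB, so reverse the channel axis.
    user_bgr = np.array(user_img)[:, :, ::-1].copy()
    target_bgr = np.array(target_img)[:, :, ::-1].copy()
    return ScoreResponse(points=final_score(req.prompt, user_img, target_img,
                                            user_bgr, target_bgr))
```

Keeping scoring behind one stateless endpoint like this would also make it easy to replay stored rounds against new weights when recalibrating.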
Would love technical feedback on:
- metric weighting/calibration
- known failure modes I should benchmark
- alternatives to HOG-lite for fast structural scoring