[Sorry, I messed up that title! The perils of not reading articles closely.]
If you show something photorealistic and AI-generated, what is shown is simply an illusion.
A cartoon style might work, though, because the user immediately understands that what is shown is not a photograph.
It's extremely hard to block out a scene with just words, e.g. "rotate hand 45 degrees, stand perpendicular to the column, shadows from a light source 60 degrees above the horizon, large box in front of chest, approximately 2 feet wide," etc.
Image-to-image, ControlNets, previz-to-final, etc. are the way to go, and I'm convinced this will be the core interface for image and video creation. Text prompts get you a coarse-grained first approximation, which you then visually adjust to your exact needs with UI/UX-first models.
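For the curious, here's a minimal sketch of that loop, assuming Hugging Face diffusers with a Canny-edge ControlNet; the model IDs, thresholds, and file names are illustrative stand-ins, not anything from a specific project:

```python
import cv2
import numpy as np
import torch
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline
from diffusers.utils import load_image

# Start from a rough layout (a previz render, a sketch, a blocked-out photo)
# instead of trying to describe the composition in words.
layout = load_image("previz_frame.png")  # hypothetical input file

# Extract Canny edges so the model is conditioned on the structure of the
# layout rather than its colors or textures.
edges = cv2.Canny(np.array(layout), 100, 200)
edges = Image.fromarray(np.stack([edges] * 3, axis=-1))

controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/sd-controlnet-canny", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")

# The prompt supplies the coarse look; the edge map pins down pose,
# camera angle, and object placement exactly.
frame = pipe(
    "a man holding a large box in front of his chest, hard side light",
    image=edges,
    num_inference_steps=30,
).images[0]
frame.save("frame_v1.png")
```

The key property is that the structure lives in the conditioning image, not the prompt: you can iterate on the words endlessly and the blocking stays fixed.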
I built an intentional "crafting" engine so people can mold images like clay, shaping everything deliberately:
https://github.com/storytold/artcraft
These are really early days, though. I expect more tools and models that let you manipulate everything first-class, in 2D/3D, as if everything in an image were mutable.
As a film director, I find this really exciting.
properbrew•1h ago
> Take a freeform query (like ‘sfo->jfk’) and turn it into a ‘place’
> Build a database of ‘places’ -> pictures
> Build a software system that can take a ‘place’, look it up in a database and spit out the right picture – even if that ‘place’ isn’t in the database
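One way that pipeline could look in code, as a minimal sketch (the Place type, the take-the-destination rule, and the nearest-neighbor fallback are my assumptions, not the article's actual implementation):

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Place:
    code: str  # canonical identifier, e.g. an IATA airport code

def parse_query(query: str) -> Place:
    """Turn a freeform query like 'sfo->jfk' into a canonical Place.
    For a route, take the destination: that's what the traveller
    wants a picture of."""
    parts = [p.strip().upper() for p in query.split("->")]
    return Place(code=parts[-1])

# Toy 'place' -> picture database.
PICTURES = {
    Place("JFK"): "new_york_skyline.jpg",
    Place("SFO"): "golden_gate.jpg",
}

# Hypothetical fallback map for places with no picture of their own.
NEAREST = {"LGA": "JFK", "OAK": "SFO"}

def picture_for(place: Place) -> str:
    """Look the place up; fall back to a nearby place, then to a
    generic default, so every query gets some reasonable picture."""
    if place in PICTURES:
        return PICTURES[place]
    neighbor = NEAREST.get(place.code)
    if neighbor:
        return PICTURES[Place(neighbor)]
    return "generic_travel.jpg"

print(picture_for(parse_query("sfo->jfk")))  # new_york_skyline.jpg
print(picture_for(parse_query("sfo->lga")))  # falls back to JFK's picture
```

The last quoted step is the hard part: "even if that 'place' isn't in the database" means the lookup needs a fallback chain (nearest neighbor, parent region, generic default) rather than an exact-match dictionary.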