Edit: Even without giving it context, at best, just get a single picture and two paragraphs. Maybe they are slowly rolling the feature out. It doesn't seem to get it.
Maybe it's different in different countries. I'm in Italy.
That's... uh... a pretty bold description for a tool where you are in fact outsourcing the "imagination" part to the machine.
People hear my product "writes stories" and always ask why the site doesn't have any features to share a full story: because it wouldn't make sense.
It'd be like listening to a stream of every song a person has ever played for themselves. Maybe they didn't write the songs, but they chose them based on the moment. Sometimes they start a song and skip half way because they already got the emotion, sometimes they repeat the saddest part 10 times.
They weren't trying to build a playlist for others to consume, it was for them, and only they could have come up with it.
Obviously Gemini doesn't know that "music truck" is another name for "ice cream truck", but more concerningly, the illustrations it made for the trucks were this kind of eldritch amalgamation of Cars-movie style cars and people driving cars. The story was just OK, I don't think it would have kept my toddler's attention for the whole ten pages. Plus, the mailman is barely involved.
Imagine the meetings where they decide to add personal illustrated storybooks before fixing chat histories
But yeah, it’s Google after all
But isn’t because of the promo culture? My n=1 of BigTech experience is that promos are based on “impact” and it’s a lot harder to show impact maintaining and improving existing features.
That might explain why there are over a dozen different ways to run a Docker container at AWS.
https://www.lastweekinaws.com/blog/the-17-ways-to-run-contai...
My theory is this misalignment of incentives is probably at the heart of most of our quality rot in software. Product managers are incentivized to create new features that boost the daily active users, while generally blind to the death by a thousand cuts caused by all the quality issues.
It seems like large institutions almost inevitably accumulate misaligned incentives along with an inertia that makes them almost impossible to correct.
I don't see this as a misalignment, it is a choice by the company, to the extent companies as a whole make choices. The incentives are the manifestations of those choices.
It created a 10 page story that stuck to the topic and was overall coherent. The main character changed color and style on every page, so no consistency there. The overall page layouts and animation style were reasonably consistent.
The metaphor it used was the character climbing a mountain and encountering other characters that represented each mode. Each supporting character was reasonably unique, although note motif was present on 3 or 4. The mountain also changed significantly and the character was frequently back at the bottom. However, in the end, he does reach the summit.
I can't say I am overly impressed but it does mostly do what they claim.
Is there something lost, when it's not the adult telling the child a bedtime improv story? (IME, kids love this.)
Is something else gained by the generated storybook?
The opportunity for low-effort, low-talent grifters to make a buck on Amazon?
BUT, I don't always have that energy, and I already spend hours a day reading stories to my kids, so I am okay with them spending some fraction of time hearing stories from robots/screens/etc. (Lately, it's "Hey Google, tell a story" if mommy is too busy to read)
I hope we never stop paying amazing children's book illustrators though! I have so many books where I marvel at each page and the ingenuity of the illustrative style.
But what’s wrong with image 10/10?
If Gemini added a reflection step to its book drawing routine, I think the model could easily notice the errors, and generate images to correct them - the errors do not seem unsurmountable.
Given that, I'm assuming Amazon is or will soon be filled with decently illustrated somewhat amusing stories.
Feels like this could have been an opportunity by the LLM to take the story in a less pedestrian direction with a loose parable around a noxious equivalent to Maslow's hammer.
And yes, I was confused too as to how farting would clear away fog.
This feels like the crux of my issues with AI. We're passing the human side of life to machines to do for us; music, art, storytelling with our kids. So often I hear that these are the things that people want to spend their time on, but AI has come to "free" us from "not having energy" to do those things, so we can instead continue to spend our energy toiling away, safe in the knowledge that our children will still get a bedtime story (albeit from a machine and not from a loving parent).
I get it, I know parents have no energy on top of everything else they're doing, this just feels so much like when I walk into a restaurant to a family with their kids, but the kids are all on tablets with headphones.
I was 100% against screens when first having a kid, but now I'm content with kids getting a spectrum of entertainment styles, and for parents to get a break every so often.
Yeah, kids love creating stuff
Kids use their imagination because they're encouraged to do so. It's somewhat of a challenge to find the cusp between what is plain and what is incomprehensible (think of the ZPD but for creativity).
I live abroad, so I don't have unlimited access to books in my native language and all the websites were crappy sites with dozens of ads on it, made it unusable.
I was fed up with searching, so I went to ChatGPT, told it to generate me a story in my native tongue about a boy named $MySonsName and his partner $FavoriteAnimalOfTheDay, who is doing $WhateverMySonDidThatDay. It was a good story, used phrases commonly used in children's books in my language, and all.
I think the aspect of being with my son, hugging him while reading something before going to sleep is much more important than who came up with the story. And as parents, after a day of full time work and constantly helping at the household, sleep deprivation, my stories would be two sentences before I run out of ideas.
If you don't mind me asking, what languages do you speak? Which do you use when interacting with LLMs?
Just a heads up: as I tried to print several stories to PDF, most times one of the generated images did not appear on the PDF. It’s surely a bug of some sort, because regenerating stories eventually makes it go away. Hope these kinds of issues will be fixed soon.
Using a Claymation art style, create a storybook about why zero point nine recurring is equal to one.Finally it gives generated text and images some sort of coherence that makes everything immediately "usable".
It is easier to develop something from a lot of text and images than having to assemble everything from zero.
Hope that it's editable too?
They refer (sort of) to a desktop app on the page, but I've never seen a Gemini desktop app. It seems they're just saying the web app on desktop...
I don't understand why Google doesn't have a true desktop app nor keyboard shortcuts. These things are so easy... (Especially the keyboard shortcuts.)
Their execution on everything but the model just seems terrible.
So many startups in that space that now get killed. Oscar Stories is going to have a hard time
Like, why is one of the richest man on the planet getting all giggly presenting us "a fairy sloth in a magical forest"? https://imgur.com/1naGLfp
Just spend more time with your kids, they want connection!
Isn't the point of this you have a customized book in like 5 minutes, and you can spend time sharing it with your child? Presumably you aren't just throwing the book at them and telling them to read it. If you spent hours drawing a book, would that mean you can spend more time with your kids?
Right now I’m using ChatGPT to create my own lessons and having it to draw pictures depicting sentences in Spanish and putting a caption in Spanish underneath.
It’s keeping me from having to go from Spanish -> English -> mental image directly to Spanish -> mental image
Not quite. Gemini isn't available in Hong Kong. Unfortunately instead of telling Pixel users that, they updated their phones to use Gemini instead of the functional assistant, and then whenever the assistant is accessed, it just spins forever with a "just a moment" prompt.
It's not even clear why it's disabled, since it works just fine if you pay them for workspace subscription.
xnx•6mo ago