In my experiments with LLMs for writing code, I find that the code is objectively garbage if my prompt is garbage. If I don't know what I want, if I don't have any ideas, and I don't have a structure or plan, that's the sort of code I get out.
I'd love to hear any counterpoints from folks who have used LLMs lately to get academic or creative writing done, as I haven't tried using any models lately for anything beyond helping me punch through boilerplate/scaffolding on personal programming projects.
https://old.reddit.com/r/singularity/comments/1andqk8/gemini...
As a side note, I find the way you interact with an LLM when doing creative writing is generally more important than the model. I have been having great results with LLMs for creative writing since ChatGPT 3.5, in part because I approach the model with the nucleus of a chapter and a concise summary of relevant details, then have it ask me a long list of questions to flesh out details, and when the questions stop being relevant I have it create a narrative outline or rough draft which I can finish.
I pointed this out a few weeks ago with respect to why LLMs, in their current state, will never make great campaign creators for Dungeons and Dragons.
We as humans don't need to be "constrained" - ask any competent writer to sit quietly and come up with a novel story plot and they can just do it.
https://news.ycombinator.com/item?id=43677863
That being said - they can still make AMAZING sounding boards.
And if you still need some proof, crank the temperature up to 1.0 and pose the following prompt to ANY LLM:
Come up with a self-contained single room of a dungeon that involves an
unusual puzzle for use with a DND campaign. Be specific in terms of the
puzzle, the solution, layout of the dungeon room, etc. It should be totally
different from anything that already exists. Be imaginative.
I guarantee 99% of the responses will be a very formulaic physics-based puzzle like "The Resonant Hourglass" or "The Mirror of Acoustic Symmetry". The output is pretty nonsensical: https://pastebin.com/raw/hetAvjSG
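If you want to reproduce the test, here is a minimal sketch assuming the OpenAI Python client and an illustrative model name (any API that exposes a temperature parameter works the same way):

    # Minimal sketch, assuming the OpenAI Python client (openai>=1.x) and an
    # illustrative model name -- substitute whatever LLM/API you actually use.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    prompt = (
        "Come up with a self-contained single room of a dungeon that involves an "
        "unusual puzzle for use with a DND campaign. Be specific in terms of the "
        "puzzle, the solution, layout of the dungeon room, etc. It should be totally "
        "different from anything that already exists. Be imaginative."
    )

    response = client.chat.completions.create(
        model="gpt-4o",   # assumption: any chat-capable model
        temperature=1.0,  # the "crank the temperature up" part of the test
        messages=[{"role": "user", "content": prompt}],
    )

    print(response.choices[0].message.content)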
Haha, I was suspicious, so I tried this, and I indeed got an hourglass themed puzzle! Though it wasn't physics-based - characters were supposed to share memories to evoke emotions, and different emotions would ring different bells, and then you were supposed to evoke a certain type of story. Honestly, I don't know what the hourglass had to do with it.
I commented in another thread. We're using image and video diffusion models for creative:
https://www.youtube.com/watch?v=H4NFXGMuwpY
Still not a fan of LLMs.
I personally tend not to use AI this way. When it comes to writing, that's actually the exact inverse of how I most often use AI, which is to throw a ton of information at it in a large prompt, and/or use a preexisting chat with substantial relevant context, possibly have it perform some relevant searches and/or calculations, and then iterate on that over successive prompts before landing on a version that's close enough to what I want for me to touch up by hand. Of course the end result is clearly shaped by my original thoughts, with the writing being a mix of my own words and a reasonable approximation of what I might have written by hand anyway given more time allocated to the task, and not clearly identifiable as AI-assisted. When working with AI this way, asking to "read the prompt" instead of my final output is obviously a little ridiculous; you might as well also ask to read my browser history, some sort of transcript of my mental stream of consciousness, and whatever notes I might have scribbled down at any point.
Fairly or unfairly, people (including you) will inexorably come to see anything done with AI as ONLY done with AI, and automatically assume that anyone could have done it.
In such a world, someone could write the next Harry Potter and it would be lost in a sea of one million mediocre works that are roughly similar. Hidden in plain sight forever. There would be no point in reading it, because it is probably the same slop I could get by writing a one-paragraph prompt. It would be too expensive to discover otherwise.
I'm expanding on the author's point that the hard part is the input, not the output. Sure someone else could produce the same output as an LLM given the same input and sufficient time, but they don't have the same input. The author is saying "well then just show me the input"; my counterpoint is that the input can often be vastly longer and less organized or cohesive than the output, and thus less useful to share.
It sounds to me that you don't make the effort to absorb the information. You cherry-pick stuff that pops in your head or that you find online, throw that into an LLM and let it convince you that it created something sound.
To me it confirms what the article says: it's not worth reading what you produce this way. I am not interested in that eloquent text that your LLM produced (and that you modify just enough to feel good saying it's your work); it won't bring me anything I couldn't get by quickly thinking about it or quickly making a web search. I don't need to talk to you, you are not interesting.
But if you spend the time to actually absorb that information, realise that you need to read even more, actually make your own opinion and get to a point where we could have an actual discussion about that topic, then I'm interested. An LLM will not get you there, and getting there is not done in 2 minutes. That's precisely why it is interesting.
Synthesizing large amounts of information into smaller more focused outputs is something LLMs happen to excel at. Doing the exact same work more slowly by hand just to prove a point to someone on HN isn't a productive way to deliver business value.
You prove my point again: it's not "just to prove a point". It's about internalising the information, improving your ability to synthesise and be critical.
Sure, if your only objective is to "deliver business value", maybe you make more money by being uninteresting with an LLM. My point is that if you get good at doing all that without an LLM, then you become a more interesting person. You will be able to have an actual discussion with a real human and be interesting.
For the structure, they are barely useful. Writing is about having an understanding so clear that the meaning survives being reduced to words, so that others may grasp it. The LLM won't help much with that, as you say yourself.
Really? The example used was for a school test. Is there really much original thought in the answer? Do you really want to read the student's original thought?
I think the answer is no in this case. The point of the test is to assess whether the student has learned the topic or not. It isn’t meant to share actual creative thoughts.
Of course, using AI to write the answer is contrary to the actual purpose, too, but that isn't because you want to hear the student's creativity; it's because the answer fails to serve its purpose as a demonstration of knowledge.
Why else would you become a teacher, if you didn't care about what your students think?
Perhaps the problem is that they are "graded", but this is to motivate the student, and runs against the age-old problem of gamification.
Arguably, that's not what teachers mainly do (to an ever-increasing extent).
Most knowledge is easily available. A teacher is teaching students to think in productive ways, communicate their thoughts and understand what others are trying to tell them. For this task, it's essential that the teacher has some idea what the students are thinking, especially when it's something original.
As long as LLM output is what it is, there is little threat of it actually being competitive on assignments. If students are attentive enough to paraphrase it into their own voice, I'd call it a win; if they just submit the crap that some data-labeling outsourcer has RLHF'd into an LLM, I'd just mark it zero.
I would have thought that giving 0s to correct solutions would lead to successful complaints/appeals.
The world will be consumed by AI.
Once upon a time only the brightest (and/or richest) went to college. So a college degree became a proxy for being clever.
Now since college graduates get the good jobs, the way to give everyone a good job is to give everyone a degree.
And since most people are only interested in the job, not the learning that underpins the degree, well, you get a bunch of students that care only for the pass mark and the certificate at the end.
When people are only there to play the game, then you can't expect them to learn.
However, while 90% will miss the opportunity right there in front of them, 10% will grab it and suck the marrow. If you are in college I recommend you take advantage of the chance to interact with the knowledge on offer. College may be offered to all, but only a lucky few see the gold on offer, and really learn.
That's the thing about the game. It's not just about the final score. There's so much more on offer.
This is because that is what companies care about. It's not a proxy for cleverness or intelligence - it's a box to check.
Then they fail to actually learn anything, apply for jobs, and try to cheat the interviewers using the same AI that helped them graduate. I fear that LLMs have already fostered the first batch of developers who cannot function without them. I don't even mind that you use an LLM for parts of your job, but you need to be able to function without it. Not all data is allowed to go into an AI prompt, some problems aren't solvable with LLMs, and you're not building your own skills if you rely on generated code/configuration for the simpler issues.
Playing the contrarian here, but I'm from a batch of developers that can't function without a compiler, and without an IDE and static analysis I'm at 10% of what I can normally do.
Sure, there's a huge jump from a line editor like `ed` to a screen editor like `vi` or `emacs`, but from there on it was diminishing returns really (a good debugger was usually the next biggest benefit) — I've also had the "pleasure" of having to use `echo`, `cat` and `sed` to edit complex code in a restricted, embedded environment, and while it made iterations slower, not that much slower than if I had a full IDE at my disposal.
In general, if I am in a good mood (and thus not annoyed at having to do so many things "manually"), I am probably only 20% slower than with my fully configured IDE at coding things up, which translates to less than 5% of slow down on actually delivering the thing I am working on.
A compiler translates _what you have already implemented_ into another computer runnable language. There is an actual grammar that defines the rules. It does not generate new business logic or assumptions. You have already done the work and taken all the decisions that needed critical thought, it's just being translated _instruction by instruction_. (btw you should check how compilers work, it's fun)
Using an LLM is more akin to copying from Stackoverflow than using a compiler/transpiler.
In the same way, I see org charts that put developers above AI managers, which are above AI developers. This is just smoke. You can't have LLMs generating thousands of lines of code independently. Unless you want a dumpster fire very quickly...
That is, the job of a professional programmer includes having produced code that they understand the behavior of. Otherwise you’ve failed to do your due diligence.
If people are using LLMs to generate code, and then actually doing the work of understanding how that code works… that’s fine! Who cares!
If people are just vibe coding and pushing the results to customers without understanding it—they are wildly unethical and irresponsible. (People have been doing this for decades, they didn’t have the AI to optimize the situation, but they managed to do it by copy-pasting from stack overflow).
I have met maybe two people who truly understood the behaviour of their code, and both employed formal methods. Everyone else, including myself, is at some varying level of confusion.
(Yes, these are people with developer jobs, often at "serious" companies.)
Maybe you mean people who are bad at interviews? Or people whose job isn't actually programming? Or maybe "lots" means "at least one"? Or maybe they can strictly speaking do fizzbuzz, but are "in any case bad programmers"? If your claim is true, what do these people do all day (or, let's say, did before LLMs were a thing...)?
I’ve met some really terrible programmers, and some programmers who freeze during interviews.
I don't. I think the world is falling into two camps with these tools and models.
> I now circle back to my main point: I have never seen any form of creative generative model output (be that image, text, audio, or video) which I would rather see than the original prompt. The resulting output has less substance than the prompt and lacks any human vision in its creation. The whole point of making creative work is to share one's own experience
Strong disagree with Clayton's conclusion.
We just made this with AI, and I'm pretty sure you don't want to see the raw inputs unless you're a creator:
https://www.youtube.com/watch?v=H4NFXGMuwpY
I think the world will be segregated into two types of AI user:
- Those that use the AI as a complete end-to-end tool
- Those that leverage the AI as tool for their own creativity and workflows, that use it to enhance the work they already do
The latter is absolutely a great use case for AI.
"Tall man, armor that is robotic and mechanical in appearance, NFL logo on chest, blue legs".,
And so on, embedded in node wiring diagrams to fiddly configs and specialized models for bespoke purposes, "camera" movements, etc.
Seeing this non-compelling prompt would tell me right off the bat that I wouldn't be interested in the video either.
I am not a creator but I am interested in generative AI capabilities and their limits, and I even suffered through the entire video which tries to be funny, but really isn't (and it'd be easier to skim through as a script than the full video).
So even in this case, I would be more interested in the prompt than in this video.
Because those who recruit based on the degree aren't worth more than those who get a degree by using LLMs.
Maybe it will force a big change in the way students are graded. Maybe, after they have handed in their essay, the teacher should just have a discussion about it, to see how much they actually absorbed from the topic.
Or not, and LLMs will just make everything worse. That's more likely IMO.
Yes I know the subject area for which I write assessments and know if what is generated is factually correct. If I’m not sure, I ask for web references using the web search tool.
https://chatgpt.com/share/6817c46d-0728-8010-a83d-609fe547c1...
> I didn’t realize how much that could throw things off until I saw an example where the object started moving in a strange way when it hit that point.
It would feel off, because why change the person? And even if it's intended, I'd say it's not formal to do in an assignment.
Maybe the problem is that the professor doesn't want to read the student work anyway, since it's all stuff he already knows. If they managed to use their prompts to generate interesting things, he'd stop wanting to see the prompts.
No, this is just the de-facto "house style" of ChatGPT / GPT models, in much the same way that that particular Thomas Kinkade-like style is the de-facto "house style" of Stable Diffusion models.
You can very easily tell an LLM in your prompt to respond using a different style. (Or you can set it up to do so by telling it that it "is" or "is roleplaying" a specific type-of-person — e.g. an OP-ED writer for the New York Times, a textbook author, etc.)
People just don't ever bother to do this.
https://chatgpt.com/share/6817c9f4-ed48-8010-bc3e-58299140c8...
In the real world I would at least remove the em dashes. They're a dead giveaway for LLM-generated text.
You can't just say "don't sound like an LLM." The LLM does not in fact know that it is "speaking like an LLM"; it just thinks that it's speaking the way the "average person" speaks, according to everything it's ever been shown. If you told it "just speak like a human being"... that's what it already thought it was doing!
You have to tell the LLM a specific way to speak. Like directing an image generator to use a specific visual style.
You can say "ape the style of [some person who has a lot of public writing in the base model's web training corpus — Paul Graham, maybe?]". But that coverage will be spotty, and it's also questionably ethical (just like style-aping in image generation.)
But an LLM will do even better if you tell it to speak in some "common mode" of speech: e.g. "an email from HR", "a shitpost rant on Reddit", or "an article in a pop-science magazine."
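Concretely, that just means pinning the persona and style down in a system message before the actual request. A minimal sketch with the OpenAI Python client (the model name and the particular persona are illustrative assumptions, not a recommendation):

    # Minimal sketch: the style lives in the system message, not the user prompt.
    # Model and persona are illustrative; adjust to whatever you actually use.
    from openai import OpenAI

    client = OpenAI()

    style = (
        "You are a staff writer at a pop-science magazine. Use short, concrete "
        "sentences, first-person asides, no bullet points, and no em dashes."
    )

    response = client.chat.completions.create(
        model="gpt-4o",  # assumption: any instruction-following chat model
        messages=[
            {"role": "system", "content": style},  # who the model "is"
            {"role": "user", "content": "Explain why the sky is blue."},
        ],
    )

    print(response.choices[0].message.content)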
1. Take-home projects where we programmed solutions to big problems.
2. Tests where we had to write programs on paper during the exam.
I think the take home projects are likely a lot harder to grade without AI being used. I'd be disappointed if schools have stopped doing the programming live during tests though. Being able to write a program in a time constrained environment is similar to interviewing, and requires knowledge of the language and being able to code algorithms. It also forces you to think through the program and detect if there will be bugs, without being able to actually run the program (great practice for debugging).
Those classes are what taught me how to study and really internalize the material. Helped me so much later in college too. I really can't imagine how kids these days are doing it.
I guess you could require a special encrypted keyboard in your plan.
Forcing people to do these things supposedly results in a better, more competitive society. But does it really? Would you rather have someone on your team who did math because it let them solve problems efficiently, or did math because it’s the trick to get the right answer?
Writing is in a similar boat as math now. We’ll have to decide whether we want to force future generations to write against their will.
I was forced to study history against my will. The tests were awful trivia. I hated history for nearly a decade before rediscovering that I love it.
History doesn't have much economic value. Math does. Writing does. But is forcing students to do these things the best way to extract that value? Or is it just the tradition we inherited and replicate because our parents did?
I remember another parent ranting about their 3rd-grade kid's "stupid homework", since it had kids learning different ways of summing numbers. I took a look at the homework and replied, "wow, the basics of set theory are in here!" We then had a productive discussion of how that arithmetic exercise led to higher math and ways of framing problems.
Similarly, writing produces a different form of thought than oral communication does.
History is a bit different, but a goal of history and literature is (or at least should be) to socialize students and give them a common frame of reference in society.
Finally there is the "you don't know when you'll need it" defense. I have a friend who spent most of the last 20 years as a roofer, but his body is starting to hurt. He's pivoting to CAD drafting and brushing up on some of those math skills he hated learning in school. And now he's arguing with his son about why it's important.
Those are the fundamental defenses: that we are seeking not skills but ways of viewing the world, plus you don't know what you'll need. There are obviously limits and tradeoffs to be made, but to some degree, yes, we should be forcing students (who are generally children, or at least inexperienced in a domain) to do things they don't like now for benefits later.
One counter argument to yours is that when you do need the skills, you can learn them later. It’s arguably easier than it has been at any point in human history. In that context, why front load people with something they hate doing, just because their parents think it’s a good idea? Let them wait and learn it when they need it.
Maybe professors are too stringent with their evaluation, or maybe they are not good at teaching people what a passable writing style is, or maybe students simply don't want to accept that if they don't excel at writing, a D or a C is perfectly fine. Perhaps teachers that look for good writing should have separate tests which evaluate students in both scenarios: with and without LLM help.
The same holds true for math: not everybody needs to know how to deduce a proof for every theorem, but in technical sciences, showing that ability and capability will demonstrate how much they are able to think and operate with precision on abstract concepts, very much like in programming. Even if coursework is a bit repetitive, practice does turn shallow knowledge into operational knowledge.
There are greater difficulties people will have to face in their daily lives than being "forced" to learn how to read, write and do arithmetic. Maybe learning the lesson of overcoming smaller, difficult tasks will allow them to adapt to greater difficulties in the future.
To quote Seneca:
A gem cannot be polished without friction, nor a man perfected without trials.
The "wanting to like things" is a highly undervalued skill/trait. It comes down to building a habit through repetition - not necessarily having fun or getting results, but training your mind like a muscle to think putting in effort isn't that bad an activity.
For those growing up I think this is not something that is taught - usually it is already there as a childlike sense of wonder that gets pruned by controlling interests. If education forcing you to do math removes any enthusiasm you had for math, that's largely determined by circumstance. You'd need someone else to tell you the actual joys of X to offset that (and I'd guess most parents/teachers don't practice math for fun), or just spontaneously figuring out how interesting X is totally on one's own which is even rarer.
I didn't have either so I'm a mathophobe, but I'm alright with that since I have other interests to focus on.
<https://goodreads.com/book/show/585474.Writing_to_Learn>
I agree with the broader point of the article in principle. We should be writing to edify ourselves and take education seriously because of how deep interaction with the subject matter will transform us.
But in reality, the mindset the author cites is more common. Most accounting majors probably don't have a deep passion for GAAP, but they believe accounting degrees get good jobs.
And when your degree is utilitarian like that, it just becomes a problem of minimizing time spent to obtain the reward.
EDIT: Not a jab at the author per se, more that it's a third or fourth time I see this particular argument in the last few weeks, and I don't recall seeing it even once before.
To actually teach this, you do something like the following:
"Here's a little dummy robot arm made out of Tinkertoys. There are three angular joints, a rotating base, a shoulder, and an elbow. Each one has a protractor so you can see the angle.
1. Figure out where the end of the arm will be based on those three angles. Those are Euler angles in action. This isn't too hard.
2. Figure out what the angles should be to touch a specific point on the table. For this robot geometry, there's a simple solution, for which look up "two link kinematics". You don't have to derive it, just be able to work out how to get the arm where you want it. Is the solution unambiguous? (Hint: there may be more than one solution, but not a large number.)
3. Extra credit. Add another link to the robot, a wrist. Now figure out what the angles should be to touch a specific point on the table. Three joints are a lot harder than two joints. There are infinitely many solutions. Look up "N-link kinematics". Come up with a simple solution that works, but don't try too hard to make it optimal. That's for the optimal controls course.
This will give some real understanding of the problems of doing this.
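Not a solution to the full assignment, just a minimal Python sketch of steps 1 and 2 for the planar shoulder/elbow pair (ignoring the rotating base; unit link lengths are an arbitrary choice), including the ambiguity the hint points at:

    # Minimal sketch of planar two-link kinematics (shoulder + elbow only).
    # Angles are in radians; link lengths l1, l2 default to illustrative values.
    import math

    def forward(theta1, theta2, l1=1.0, l2=1.0):
        """Step 1: where the arm tip ends up for given shoulder/elbow angles."""
        x = l1 * math.cos(theta1) + l2 * math.cos(theta1 + theta2)
        y = l1 * math.sin(theta1) + l2 * math.sin(theta1 + theta2)
        return x, y

    def inverse(x, y, l1=1.0, l2=1.0):
        """Step 2: joint angles reaching (x, y); two solutions (elbow up/down), or none."""
        c2 = (x * x + y * y - l1 * l1 - l2 * l2) / (2 * l1 * l2)
        if abs(c2) > 1:
            return []  # target lies outside the annulus the arm can reach
        solutions = []
        for theta2 in (math.acos(c2), -math.acos(c2)):
            theta1 = math.atan2(y, x) - math.atan2(l2 * math.sin(theta2),
                                                   l1 + l2 * math.cos(theta2))
            solutions.append((theta1, theta2))
        return solutions

    # Round-trip check: both inverse solutions should put the tip back on the target.
    for angles in inverse(1.2, 0.7):
        print(angles, forward(*angles))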
(I know jack all about robotics but that sounds like a pretty common assignment, the kind an LLM would regurgitate someone else's homework.)
The answer might be bogus, but the AI will sound confident all the way through.
No wonder sales and upper management love AI
The goal is to make something legible, but the reality is we are producing slop. I'm back to writing before my brain becomes lazy.
I've grown to respect typos and slightly misconstructed sentences. It's an interesting dynamic that now what appeared lazy to 2021 eyes actually indicates effort and what appeared polished and effortful in 2021 now indicates laziness.
An example is how the admins of my local compute cluster communicate about downtimes and upgrades etc and they are clearly using AI and it's so damn annoying, it feels like biting into cotton candy fluff. Just send the bullet points! I don't need emojis, I don't need the fake politeness. It's no longer polite to be polite. It doesn't signal any effort.
if you can one-shot an answer to some problem, the problem is not interesting.
the result is necessary, but not sufficient. how did you get there? how did you iterate? what were the twists and turns? what was the pacing? what was the vibe?
no matter if with encyclopedia, google, or ai, the medium is the message. the medium is you interacting with the tools at your disposal.
record that as a video with obs, and submit it along with the result.
for high stakes environments, add facecam and other information sources.
reviewers are scrubbing through video in an editor. evaluating the journey, not the destination.
And reviewing video would be a nightmare.
more is better.
you can scrub video with your finger on an iphone. serious review is always high effort, video changes nothing.
Video in itself is not more information by definition. Just look at those automatically generated videos when you try finding a review on an unusual product.
books are great.
hundreds of hours of video of the author writing that book, is strictly more information.
Let's be real... Multi-modal LLMs are scrubbing through the journey :P
not every review is important.
This is especially the case when you are about to complain about style, since that can easily be adjusted by simply telling the model what you want.
But I think there is a final point that the author is also wrong about, but that is far more interesting: why we write. Personally I write for 3 reasons: to remember, to share and to structure my thoughts.
If an LLM is better than me at writing (and it is), then there is no reason for me to write to communicate - it is not only slower, it is counterproductive.
If the AI is better at wrangling my ideas into some coherent thread, then there is no reason for me to do it. This one I am least convinced about.
AI is already much better than me at strictly remembering, but computers have been since forever; the issue is mostly convenient input/output. AI makes this easier thanks to speech-to-text input.
[0]: See eg. https://www.oneusefulthing.org/p/centaurs-and-cyborgs-on-the....
This is especially true for students.
I think this will be no more of a contest than playing chess has been: humans don't stand a chance, but it also doesn't matter, because being better or worse than the AI is beside the point.
The most obvious ChatGPT cheating, like that mentioned in this article, is pretty easy to detect.
However, a decent cheater will quickly discover ways to coax their LLM into producing text that is very difficult to detect.
I think if I was in the teaching profession I'd just leave, to be honest. The joy of reviewing student work will inevitably be ruined by this: there is 0 way of telling if the work is real or not, at which point why bother?
Do you have any examples of this? I've never been able to get direct LLM output that didn't feel distinctly LLM-ish.
A study on whether LLMs can influence people on r/changemymind
Teachers will lament the rise of AI-generated answers, but they will only ever complain about the blatantly obvious responses that are 100% copy-pasted. This is only an emerging phenomenon, and the next wave of prompters will learn from the mistakes of the past. From now on, unless you can proctor a room full of students writing their answers with nothing but pencil and paper, there will be no way to know for certain how much was AI and how much was original/rewritten.
But I know it's easier said than done: if you get a student to realise that the time they spend at school is a unique opportunity for them to learn and grow, then your job is almost done already.
They don’t get an exemption if the parents don’t care.
Talk to the student, maybe?
I have been an interviewer in some startups. I was not asking leetcode questions or anything like that. My method was this: I would pretend that the interviewee is a new colleague and that I am having coffee with them for the first time. I am generally interested in my colleagues: who are they, what do they like, where do they come from? And then more specifically, what do they know that relates to my work? I want to know if that colleague is interested in a topic that I know better, so that I could help them. And I want to know if that colleague is an expert in a topic where they could help me.
I just have a natural discussion. If the candidate says "I love compilers", I find this interesting and ask questions about compilers. If the person is bullshitting me, they won't manage to maintain an interesting discussion about compilers for 15 minutes, will they?
It was a startup, and the "standard" process became some kind of cargo-culting of whatever they thought the interviews at TooBigTech were like: leetcode, system design and whatnot. Multiple times, I could tell in advance that even if this person was really good at passing the test, they wouldn't be a good fit for the position (both for the company and for them). But our stupid interviews got them hired anyway, and guess what? It wasn't a good match.
We underestimate how much we can learn by just having a discussion with a person and actually being interested in whatever they have to say. As opposed to asking them to answer standard questions.
There were always a bunch of realistic options for not actually doing your submitted work, and AI merely makes it easier, more detectable and more scalable.
I think it moves the needle from 40 to 75, which is not great, but you'd already be holding your nose at student work half of the time before AI, so teaching had to be about more than that (and TBH it was, when I was in school teachers gave no fuck about submitted work if they didn't validate it by some additional face to face or test time)
The kids these days got everything...
As always, I reject wholeheartedly what this skeptical article has to say about LLMs and programming. It takes the (common) perspective of "vibe coders", people who literally don't care what code says as long as something that runs comes out the other side. But smart, professional programmers use LLMs in different ways; in particular, they review and demand alterations to the output, the same way you would doing code review on a team.
The implication there is that this is acceptable to pass a robotics class, and potentially this gives them more information about students' comprehension to further improve their instruction and teaching ("...that they have some kind of internal understanding to share").
On that second point, I have yet to see someone demonstrate a "smart, professional programmer use LLMs" in a way where it produces high quality output in their area of expertise, while improving their efficiency and thus saving time for them (compared to them just using a good, old IDE)!
So, observing a couple of my colleagues (I am an engineering manager, but have switched back and forth between management and IC roles for the last ~20 years), I've seen them either produce crap, or spend so much time tuning the prompts that it would have been faster to do it without an LLM. They mostly used GitHub Copilot or ChatGPT (the most recent versions as of a few months ago).
I am also keeping out a keen eye for any examples of this (on HN in particular), but it usually turns out things like https://news.ycombinator.com/item?id=43573755
Again, I am not saying it's not being done, but I have struggled to find someone who would demonstrate it happen in a convincing enough fashion — I am really trying to imagine how I would best incorporate this into my daily non-work programming activities, so I'd love to see a few examples of someone using it effectively.
Part of my performance review is indirectly using bloat to seem sophisticated and thorough.
Over-fitting proxy measures is one of the scourges of modernity.
The only silver lining is if it becomes so widespread and easy that it loses the value of seeming sophisticated and thorough.
Maybe we should let/encourage this to happen. Maybe letting bloated zombie-like organisations bloat themselves to death would thin the herd somewhat, to make space for organisations that are less “broken”.
At the same time, I strive really hard to influence the environment I am in so it does not value content bloat as a unit of productivity, so hopefully there are at least some places where people can have their sanity back!
Documentation is an interesting use case. There are various kinds of documentation (reference, tutorial, architecture, etc.) and LLMs might be useful for things like
- repetitive formatting and summarization of APIs for reference
- tutorials which repeat the same information verbosely in an additive, logical sequence (though probably a human would be better)
- sample code (though human-written would probably be better)
The tasks that I expect might work well involve repetitive reformatting, repetitive expansion, and reduction.
I think they also might be useful for systems analysis, boiling down a large code base into various kinds of summaries and diagrams to describe data flow, computational structure, signaling, etc.
Still, there is probably no substitute for a Caroline Rose[1] type tech writer who carefully thinks about each API call and uses that understanding to identify design flaws.
Any documentation they write at best restates what is immediately obvious from the surrounding code (useless: I need it to explain why), or is some hallucination trying to pretend it's a React app.
To their credit they've slowly gotten better now that a lot of documentation already exists, but that was me doing the work for them. What I needed them to do was understand the project from existing code, then write documentation for me.
Though I guess once we're at the point AI is that good, we don't need to write any documentation anymore, since every dev can just generate it for themselves with their favorite AI and in the way they prefer to consume it.
* They'll pretend they do by re-stating what is written in the README though, then proceed to produce nonsense.
Without that effort it's a useless sycophant and is functionally extremely lazy (i.e. it takes shortcuts all the time).
Don't suppose you've tried that particular model, after getting it to be thorough?
I think that's the answer:
LLMs are primarily useful for data and text translation and reduction, not for expansion.
An exception is repetitive or boilerplate text or code where a verbose format is required to express a small amount of information.
If you aren't aware: (high-parameter-count) LLMs can be used pretty reliably to teach yourself things.
LLM base models "know things" to about the same degree that the Internet itself "knows" those things. For well-understood topics — i.e. subjects where the Internet contains all sorts of open-source textbooks and treatments of the subject — LLMs really do "know their shit": they won't hallucinate, they will correct you when you're misunderstanding the subject, they will calibrate to your own degree of expertise on the subject, they will make valid analogies between domains, etc.
Because of this, you can use an LLM as an infinitely-patient tutor, to learn-through-conversation any (again, well-understood) topic you want — and especially, to shore up any holes in your understanding.
(I wouldn't recommend relying solely on the LLM — but I've found "ChatGPT in one tab, Wikipedia open in another, switching back and forth" to be a very useful learning mode.)
See this much-longer rambling https://news.ycombinator.com/item?id=43797121 for details on why exactly this can be better (sometimes) than just reading one of those open-source textbooks.
This can go beyond just specific documentation but also include things like "common knowledge" which is what the other poster meant when they talked about "teaching you things".
Done, now AI is just lossy pretty-printing.
I’ve idly thought about making a set of prompts to turn ChatGPT into RudeGPT, but fighting content filters isn’t too fun.
I wish to communicate four points of information to you. I’ll ask ChatGPT to fluff those up into multiple paragraphs of text for me to email.
You will receive that email, recognize its length and immediately copy and paste it into ChatGPT, asking it to summarize the points provided.
Somewhere off in the distance a lake evaporates.
It’s like math homework, you always had to show your working not just give the answer. AI gives us an answer without the journey of arriving at one, which removes the purpose of doing it in the first place.
AI usage is a lot higher in my work experience among people who no longer code and are now in business/management roles or engineers who are very new and didn't study engineering. My manager and skip level both use it for all sorts of things that seem pointless and the bootcamp/nontraditional engineers use it heavily. Our college hires we have who went through a CS program don't use it because they are better and faster than it for most tasks. I haven't found it to be useful without an enormous prompt at which point I'd rather just implement the feature myself.
As it turns out, a well written ticket makes a pretty good input into an LLM. However, it has the added benefit of having my original thought process well documented, so sometimes I go through the process of writing a ticket / subtask, even if I ended up giving it to an AI tool in the end.
> Since this is a long thread and we're including a wider audience, I thought I'd add Copilot's summary...
Someone called them out for it, several others defended it. It was brought up in one team's retro and the opinions were divided and very contentious, ranging from, "the summary helped make sure everyone had the same understanding and the person who did it was being conscientious" to "the summary was a pointless distraction and including it was an embarrassing admission of incompetence."
Some people wanted to adopt a practice of not posting summaries in the future but we couldn't agree and had to table it.
If I were to include AI generated stuff into my communication I'd also make it clear as people might guess it anyway.
1. “When copying another person’s words, one doesn’t communicate their own original thoughts, but at least they are communicating a human’s thoughts. A language model, by construction, has no original thoughts of its own; publishing its output is a pointless exercise.”
LLMs, having been trained on the corpus of the web, I would argue communicate other humans' thoughts particularly well. Only in exercising an avoidance of plagiarism are the thoughts of other humans evolved into something closer to "original thought" for the would-be plagiarizer. But yes, at least a straight copy/paste retains the same rhetoric as the original human.
2. I've seen a few advertisements recently leverage "the prompt" as a means of visual appeal.
i.e. a new fast-food delivery service starting their ad with some upbeat music and a visual of somebody typing into an LLM interface, "Where's the best sushi around me?" And then cue the advertisement for the product they offer.
I actually don't think that it is good at that. I have heard of language teachers trying to use it to teach the language (it's a language model, it should be good at that, right?) and realising that it isn't.
Of course I understand the point of your message, which is that you feel your teachers were not helpful and I have empathy for that.
Your benchmark for "long flowing beautiful content" is apple.com? It's competing with Hemingway?
Can you share a link to what you mean?
The school should be drilling into students, at orientation, what some school-wide hard rules are regarding AI.
One of the hard rules is probably that you have to write your own text and code, never copy&paste. (And on occasions when copy&paste is appropriate, like in a quote, or to reuse an off-the-shelf function, it's always cited/credited clearly and unambiguously.)
And no instructors should be contradicting those hard rules.
(That one instructor who tells the class on the first day, "I don't care if you copy&paste from AI for your assignments, as if it's your own work; that just means you went through the learning exercise of interacting with AI, which is what I care about"... is confusing the students, for all their other classes.)
Much of society is telling students that everything is BS, and that their job is to churn BS to get what they want. Popular early "AI" usage practices so far look to be accelerating that. Schools should be dropping a brick wall in front of that. Well, a padded wall, for the students who can still be saved.
Yes, totally. Unfortunately, it takes time and maturity to understand how this is completely wrong, but I feel like most students go through that belief.
Not sure how relevant it is, but it makes me think of two movies with Robin Williams: Dead Poets Society and Good Will Hunting. In the former, Robin's character manages to get students interested in stuff instead of "just passing the exams". In the latter, I will just quote this part:
> Personally, I don’t give a shit about all that, because you know what? I can’t learn anything from you I can’t read in some fuckin’ book. Unless you wanna talk about you, who you are. And I’m fascinated. I’m in.
I don't give a shit about whether a student can learn the book by heart or not. I want the student to be able to think on their own; I want to be able to have an interesting discussion with them. LLMs fundamentally cannot solve that.
The issue, IMO, is that some people throw in a one-shot, short prompt, and get a generic, boring output. "Garbage in, generic out."
Here's how I actually use LLMs:
- To dump my thoughts and get help organizing them.
- To get feedback on phrasing and transitions (I'm not a native speaker).
- To improve tone, style (while trying to keep it personal!), or just to simplify messy sentences.
- To identify issues, missing information, etc. in my text.
It’s usually an iterative process, and the combined prompt length ends up longer than the final result. And I incorporate the feedback manually.
So sure, if someone types "write a blog post about X" and hits go, the prompt is more interesting than the output. But when there are five rounds of edits and context, would you really rather read all the prompts and drafts instead of the final version?
(if you do: https://chatgpt.com/share/6817dd19-4604-800b-95ee-f2dd05add4...)
I think you missed the point of the article. They did not mean it literally: it's a way to say that they are interested in what you have to say.
And that is the point that is extremely difficult to make students understand. When a teacher asks a student to write about a historical event, it's not just some kind of ceremony on the way to a degree. The end goal is to make the student improve in a number of skills: gathering information, making sense of it, absorbing it, being critical about what they read, eventually building an opinion about it.
When you say "I use an LLM to dump my thoughts and get help organising them", what you say is that you are not interested in improving your ability to actually absorb information. To me, it says that you are not interested in becoming interesting. I would think that it is a maturity issue: some day you will understand.
And that's what the article says: I am interested in hearing what you have to say about a topic that you care about. I am not interested into anything you can do to pretend that you care or know about it. If you can't organise your thoughts yourself, I don't believe that you have reached a point where you are interesting. Not that you will never get there; it just takes practice. But if you don't practice (and use LLMs instead), my concern is that you will never become interesting. This time is wasted, I don't want to read what your LLM generated from that stuff you didn't care to absorb in the first place.
- To "translate to language XYZ", which is sometimes not straightforward and needs iterating, like "Translate to language <LANGUAGE> as used by <PERSON ROLE> living in <CITY>" and so on.
And the author is right; I use it as a second-language user, so the LLM produces better text than I could myself. However, I am not going to share the prompt, as it is useless (foreign language) and too messy (bits of draft text) for the reader. I would compare it to passing a book draft through an editor and translator.
People say “I saved so much time on perf this year with the aid of ChatGPT,” but ChatGPT doesn’t know anything about your working relationship with your coworker… everything interesting is contained in the prompt. If you’re brain dumping bullet points into an LLM prompt, just make those bullets your feedback and be done with it? Then it’ll be clear what the kernel of feedback is and what’s useless fluff.
I like reading and writing stories. Last month, I compared the ability of various LLMs to rewrite Saki's "The Open Window" from a given prompt.[1] The prompt follows the 13-odd attempts. I am pretty sure in this case that you'd rather read the story than the prompt.
I find the disdain that some people have for LLMs and diffusion models to be rather bizarre. They are tools that are democratizing some trades.
Very few people (basically, those who can afford it) write to "communicate original thoughts." They write because they want to get paid. People who can afford to concentrate on the "art" of writing/painting are pretty rare. Most people are doing these things as a profession, with deadlines to meet. Unless you are GRRM, you cannot spend decades on a single book waiting for inspiration to strike. You need to work on it. Also, authors writing crap/gold at a per-page rate is hardly something new.
LLMs are probably the most interesting thing I have encountered since the computer itself. These puritans should get off their high horse (or come down from their ivory tower) and join the plebes.
[1] Variations on a Theme of Saki (https://gist.github.com/s-i-e-v-e/b4d696bfb08488aeb893cce3a4...)
There's so much bad writing of valuable information out there. The major sins being: burying the lede, no or poor sectioning, and general verbosity.
In some cases, like in EULAs and patents that's intentional.
I have to admit I was a bit surprised at how bad LLMs are at the "continue this essay" task. When I read it in the blog post I suspected it might have been a problem with the prompt, or with using one of the smaller variants of Gemini. So I tried it with Gemini 2.5 Pro and iterated quite a bit, providing generic feedback without offering solutions. I could not get the model to form a coherent, well-reasoned argument. Maybe I need to recalibrate my expectations of what LLMs are capable of, but I also suspect that current models have heavy guardrails, use a low temperature, and have been specifically tuned to solve problems and avoid hallucinations as much as possible.
Every day I'm made more aware of how terrible people are at identifying AI-generated output, but also of how obsessed they are with GenAI-vestigating things they don't like or wouldn't buy because they're bad.
There's too much information in the world for it to matter; I think that's the underlying reason.
As an example, most enterprise communication nears the levels of noise in its content.
So, why not let a machine generate this noise, instead?