I imagine I'd have similar frustrations if I couldn't copy-paste the text easily though!
What about just asking them what file that is?
This includes Safari, where not only do images (inline or otherwise) have selectable text, but the built in translator leverages that text and uses it to translate images, too! This is super useful for translating Japanese webpages in particular, which tend to have tons of text baked into images.
- Preserves the full 80 character width without line-wrapping, which destroys readability
- Guarantees monospace, so tabular data doesn't get all misaligned
- Preserves a good coding font, so it doesn't come out as some hairline-width Courier on the other end
- Preserves syntax highlighting, very helpful
Obviously if somebody needs a whole file or whole log, then send the whole thing as an attachment. But very often I'll still include a screenshot of the relevant part. With line numbers, it's not difficult to jump to the right part of the attached file.
Screenshots are incredibly useful for keeping code and terminal output looking like code and terminal output, and not getting completely mangled in an e-mail or chat message being read on a mobile device or in a narrow column.
Let me introduce you to Putty users who never change the default font...
(OP’s blog purports to be pertinent to freelance software development).
I'm biased, but I can't help but feel like chances are, if the screenshot is text, the content of the text is important, not the visual aspects.
99% of the time I get a screenshot these days, it's people sending me screenshots of text logs or code, and almost always cropped in a way that eliminates any context anyway. Give me plain text or give me death.
Is widely supported to add code. E.g. in Slack, Confluence...
Sure, if you want someone to reproduce the text of course you'd send them actual text. But to show a problem, a picture is, as they say, worth 1000 words.
Readability is on the eyes of the final user, they are free to use whatever narrow column width they prefer.
> - Guarantees monospace, so tabular data doesn't get all misaligned
When was the last time a computer shipped without a monospace font? This points at the rare occasion where there's a problem with the setup, but you could also argue that maybe there's a system with a broken image decompressor.
> Screenshots are incredibly useful for keeping code and terminal output looking like code and terminal output, and not getting completely mangled in an e-mail or chat message being read on a mobile device or in a narrow column.
Are you complaining about GMail's rendering maybe? Its awful[^0], but that's more of a GMail problem that could be solved if they wanted.
[^0]: Column width unbounded even on 4k monitors. Weird and inconsistent font sizes across different fonts (monospace is smaller). Reads poorly on phones too.
For reading purposes, the question of screen width is best left to the reader. They will have the window set to their preferred width, possibly limited by screen size. If the text has to wrap, so be it. It's better that than having to try to squint at your 3713x211 screen grab on an iPhone (portrait orientation). Also bear in mind that even the most basic of font and colour choices (large/small font, dark/light mode) can cause accessibility issues for some readers.
For copying and pasting purposes, images suck. Yes, macOS can do it, sort of, and I expect Windows 11 can do it too, probably to about the same extent. But it's not as easy as having the text right there in copyable form.
For searching purposes, ditto - only worse, because at least when you copy and paste and it comes out wrong, you'll notice. When you search: you just won't find the thing. You'll never know.
Nothing against screenshots unless they are lacking context
But I can't be the only one appalled at the suggestion to use an LLM to parse the text. The sheer, prodigious waste of computing power, just to round-trip text to an image and back to text, when what's really missing is a computer user interface that makes it as simple to send text or other snippets as it is to send screenshots.
That being said, I've had to twist some arms in a previous job for new employees attaching screenshots of a log viewer instead of the whole logs. The big problem was training: Once I made it very clear to the entire team that unedited logs were critical to solving problems, management made sure that all newcomers knew how to attach unedited logs.
Another thing I wish was more common is metadata in screenshots, especially on phones. Eg if I take a screenshot of a picture in Instagram, I wish a URL of the picture was embedded (eg instagram.com/p/ABCD1234/). If I take a screenshot in the browser, include the URL that's being viewed (+ path to the DOM element in the viewport). If I take a screenshot in a maps app, include the bounding coordinates. If I take a screenshot in a PDF viewer, include a SHA1 hash of the document being viewed + offset in the document so that if I send the screenshot to someone else with the same document, it can seamlessly link to it. Etc etc.
There are probably privacy concerns to solve here, but no idea is new in computer science and I'm pretty sure some grad student somewhere has already explored the topic in depth (it just never made it to mainstream computing platforms).
It feels like screenshots have become the de facto common denominator in our mobile computing era, since platforms have abstracted files away from us. Lots of people who have only ever used phones as their main computing devices are confused when it comes to files, but everyone seems to understand screenshots.
Also, necessary shout out to Screenshot Conf! https://screenshot.arquipelago.org
Google/Apple have taken notice. Both have recently redone their full-screen post-screenshot UI to include AI insights / automatic product searches / direct chat with Gemini/LLM / etc.
Its true everyone uses screenshots to save things they are interested in or want to look up / search more of / save for reason and this UI is the perfect place to insert themselves.
There are companies like Evernote/Zight/CloudApp that at one point tried some things like this, but they never really caught - I think because it's pretty easy to add annotations yourself or some note of your own - and a screenshot not "trying to do everything" is part of what makes them useful & ubiquitous.
SVG maybe?
It is extra work to do both but I like to be through even when asking for help. Even if the other side doesn't need it -- because I myself might not remember all the nuances when I refer to that conversation later.
Also screenshot preserves (before any fixes) the exact way things looked when I confronted a certain situation. The visual of the screenshot serves as a much stronger reminder of that situation and my thinking ...way better than mere copy pasted text.
Your coding agent is not very smart if it can't deal with something as simple as OCR'ing an image and processing all the references in it, or letting you just select text from an image and searching or copying to the clipboard.
If I copy code from PyCharm or VS Code and paste it into fucking Microsoft Word, even spawn-of-Satan-MS-Word-for-Mac respects most of my formatting. Plenty of web text editors are also able to do that.
But Slack, "The King of Useless Features Nobody Asked For", can't bother themselves to implement such a useful feature for their primary market.
????? Just OCR a line and paste it into the IDE’s search field???? Or, if for some baffling reason you don’t have the ability to OCR, just pick out a function declaration in the screenshot and search for that? We’re so doomed as a profession.
... Is this really common?
Only among people who don't code. A non-coder doesn't know the difference between a block of code, and a picture of a block of code.
A woman visits the studio of a famous artist. She says, "That woman is all distorted!" The artist replies, "Madam, that is not a woman, that is a picture of a woman."
Maybe vibe coders. :) People who have never been in the opposite predicament.
I honestly thought this was going to be solved in the 2010s with the rise of comic-like memes, but we just kept sharing images with ever increasing compression artifacts as things were shared around and used to create new memes.
I am hyper-sensitive to emailing terminal screenshots in MS Outlook, as they cannot be searched.
Many up-to-date messaging environments allow you to copy & paste text directly from your coding environment with indentation and syntax coloring intact. This is something the sender can establish before hitting "Send".
Also, the sender has the option to attaching the source file, which if entered into a coding environment will recreate the syntax colors.
The reason I personally hate it is I am often working from my phone. And it’s much easier to read text rendered properly than pinch zooming text in an image. What’s worse is slack will downgrade images for mobile and you can’t even pinch zoom in fully.
I see two possible reasons for this -- the sender has no technical experience, or they're focused on making things more difficult for the recipient.
But when trying to decide between these two, I'm reminded of the saying, "Never attribute to malice that which can be adequately explained by stupidity."
This actually happened. A client wrote me, saying, "First, don't treat me like an idiot -- I have years of computer experience."
"Okay, I promise," I replied. "What's the problem?"
"Your program doesn't work."
"Can you be more specific?"
"I followed your instructions to the letter, but I see an error message."
"Okay, what is the error message?"
"It says, 'User [Enter your name here] is not found'."
Screenshots are fine. Just don't ONLY send screenshots.
1. I have ‘rubber duck debugged’ my own question.
2. I checked that this question hasn’t been asked before.
3. I have noted in my message what I’ve tried.
4. I have avoided the ‘XY problem’ by clearly detailing the core problem, X.
5. I have provided specifics of my issue, not vague references or descriptions.
6. I have provided URL links to relevant content, and where possible the URL links are immutable.
7. I have not included screenshots of text in my message.
8. I have not used obscure acronyms or abbreviations.
9. I have formatted my message well, particularly paying attention to code formatting and headings.
10. I have not just said “hi” and waited for a reply.
Like other posters, I don't think Apple OCR is sufficient to make up for screenshotting. The biggest problem is search.
1. https://thundergolfer.com/communication/slack/2021/02/24/how...
Both Spotlight and Photos will find text in screenshots.
However, a screenshot acts like a print-out / pdf, and very handy for sharing in other platforms e.g social media, mobile devices.
Like many others I like the use of ai for OCR in imagery. Won't be long before ai tool can copy the style + content from an image, or video.
That says all you need to know. The reason they send those screenshots is they believe the full context is more helpful. Code formatting, indentation etc.
Personally I agree with that sentiment. There is a lot of context in the full visual of the original text in situ.
Not nice for the recipient maybe, but hella efficient for everyone else, and there are many more people in the latter camp than the former.
https://learn.microsoft.com/en-us/windows/powertoys/
https://learn.microsoft.com/en-us/windows/powertoys/text-ext...
tcdent•2h ago
Linux desktop users will get there one day.
walt_grata•2h ago
tcdent•2h ago
matt_kantor•1h ago
recursive•2h ago
forgotpwd16•2h ago
anonzzzies•1h ago