Show HN: AnydocAI – Every file exists as all file types

https://anydocai.com/
22•grandslammer•5mo ago
I work with AI tools often and find myself constantly swapping between file types to navigate different sites and use cases. So I built anydoc: I imagined a universal document-format drive layer that would essentially let me have a file that is many different file types at once. CSV for organizing, Markdown for feeding into LLMs, HTML for fast sharing... so here it is, and I am working on making it better all of the time. I just shipped an update that makes the most relevant format types available faster on document upload. I'm @boshjerns on X.

Comments

gus_massa•5mo ago
I think calling it a "drive" will confuse a lot of people. I expected a device driver, and non-technical people have no idea of drives; they just have documents and photos.

I tried with a text file

  X Y
  1 2
  3 4
and for some reason the converted version has 1 2 3 4 in the same row.
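
For comparison, a deterministic conversion of that sample would keep one output row per input line. This is a minimal Python sketch, assuming the input is whitespace-delimited; the function name `text_table_to_csv` is made up for illustration and says nothing about how AnydocAI converts files internally.

```python
import csv
import io


def text_table_to_csv(text: str) -> str:
    """Convert whitespace-delimited text to CSV, one output row per input line."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for line in text.splitlines():
        fields = line.split()
        if fields:  # skip blank lines instead of emitting empty rows
            writer.writerow(fields)
    return out.getvalue()


sample = "  X Y\n  1 2\n  3 4\n"
print(text_table_to_csv(sample), end="")
```

Each input line becomes its own CSV row, so `1 2` and `3 4` stay on separate rows.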
gus_massa•5mo ago
Too late to edit: Google Drive is called "drive", so I guess it's more usual than I noticed.
alentred•5mo ago
Hm, I don't know, I am OK with "drive". Google Drive, Microsoft OneDrive, iCloud Drive.
grandslammer•5mo ago
It really does just make sense too. We have hard drives, so why wouldn’t we have digital drives?
unsnap_biceps•5mo ago
Have you considered writing this as a FUSE system rather than a web service?
nkrisc•5mo ago
So it turns one file into many? Or is it actually one file that is simultaneously a valid HTML document and PNG?
IAmBroom•5mo ago
According to what I read, the latter.
c0wb0yc0d3r•5mo ago
And you’d be misled. The video shows the original file is converted to different formats, depending on the user’s selection. The video shows jpeg to html (using AI to perform OCR?).

Pandoc but extra AI steps.

grandslammer•5mo ago
That argument really skips over what most people actually need. Nobody outside of a tech bubble wants to learn half a dozen Pandoc flags, stitch together shell commands and temp files, or write Lua filters just to reshape a document. With our drive layer you literally rename a file or type “make this header bold and export as PDF” and the work just happens, no scripts required.

This isn’t about replacing power-user workflows, it’s about giving anyone on your team the ability to reshape data and documents without ever opening a terminal. You get flexibility with the simple UX of renaming a file. Calling it “Pandoc plus AI” misses the fact that 90 percent of users neither know nor care about Pandoc’s internals. They just want “I have a file, make it look like this, or formatted with these sections to share with X person who works in X field...” and that’s exactly what our natural-language, filesystem-driven approach delivers.

grandslammer•5mo ago
It’s basically an access layer that gives you quick access to all the different conversions of the files in one place, but it also allows you to redesign them with natural language so that you can configure them for your needs on the fly
RileyJames•5mo ago
It’s an interesting idea.

I’ve definitely felt the pain of file formats in some unexpected ways recently.

Like airdropping a photo from iPhone only to discover a .HEIC file, which nothing will accept.

I’ve previously used “whatever turns up first on Google”, but I now won’t for anything of significance (privacy).

I’ve recently discovered Automator (on Mac) and the quick actions menu. Which can achieve a lot of image and pdf related conversions, but takes some setup (not a mass market solution)

I like the idea of this product. But I think the challenge will be:

- reaching the user at the moment they have this problem

- making your solution frictionless to solve their immediate problem, while also bootstrapping to solve it next time around (without them forgetting it exists)

If you can nail that experience for a single use case, I think this will be a winner.

DontchaKnowit•5mo ago
I think the real problem is getting it to actually work....
grandslammer•5mo ago
i think i hit credit limits because so many people were using the app all of a sudden and i'm just like using my own funds for api costs and had a cap on my openai account
grandslammer•5mo ago
Hey, I’m just catching up here and I really appreciate the feedback. I’m going to work to integrate all of it into the application and repost about it again. I really appreciate you.
grandslammer•5mo ago
Let me actually work on this HEIC issue right now. I think that I know a fix for this.
globular-toast•5mo ago
I like the idea if it was deterministic. So if there are standard ways to convert to/from document types, like Pandoc, being able to write to any one of them and have it update the rest would be interesting.

I hate it if it's built with "AI". Can't imagine a use case for this apart from just shit you don't care about. Why would I be hoarding data I don't care about?

grandslammer•5mo ago
It’s not about hoarding data; rather, it’s about the malleability of the data itself. I’m constantly working with data but need to change how it is displayed, whether that’s the file type or the way the data is laid out within a specific file, such as a CSV. An application like this lets me quickly reformat files with natural language: maybe turn them into HTML so I can share a certain form of the document, or take that information and convert it back into CSV when I need to load it into a management system or something like that.
SPBS•5mo ago
the page is really laggy on Edge, kills any interest in wanting to explore more (strangely, it's much snappier on Chrome)
grandslammer•5mo ago
I will work to fix this ASAP; I just caught up here.
ramoz•5mo ago
AI as a use case doesn’t make sense to me.

You’re using AI to create a transpilation of whatever modality. It’s a wasted step if the purpose is to feed back into AI.

cyanydeez•5mo ago
keep in mind, almost all the uses of the current AI are to generate some unstable product that whimsically can change given a butterfly's wings in Japan.
grandslammer•5mo ago
I literally find myself using this tool every day because I need to use natural language to reformat files and the data in them, like CSVs or Markdown. So maybe this isn’t useful for you, but it definitely is useful to have the LLM interpret your natural language and redesign the file so that it presents the information the way you want.
ramoz•5mo ago
You're talking about a commodity interaction. At this point your tool offers nothing different from a chatbot other than your confusing semantics and abstraction.

What I'm saying: if the point is to "convert this csv to markdown so i can feed the markdown to an LLM to ask questions about it" etc... it is a completely unnecessary step.

Your service is nothing more than:

1. augmented metadata for files; btw if that requires a whole new drive-oriented solution then you're doing too much.

2. llm api wrapper for a commoditized capability (custom format/or transpilation)

grandslammer•5mo ago
The friction isn’t “can I call an LLM,” it’s every time I want to do anything with this file I have to:

open it in a tool that understands the format,

export / paste the part I care about,

phrase an LLM prompt,

paste the result back,

do this all again if i want the data formatted differently for different use cases.

adding the ability to format your data and view/download that natively, fast is like giving python scripting capabilities to normal users. You're thinking like a dev not like a business owner who may want to take a picture of a timesheet and have that immediately become a CSV then have it reformatted for a management system they use, all on the fly through natural language... there's so many ways that normal people navigate files and formats and I want to give these people some superpowers that they won't seek out themselves.

the gpt-wrapper argument is so played out. just like you’d say “my app is a GPT-wrapper” (it wraps the OpenAI API in a file-centric UX), you could say “Google Drive is a distributed-storage-wrapper” or “a cloud-storage-and-sync wrapper.” It’s the polished frontend and glue that makes the raw backend useful to end users.

voidUpdate•5mo ago
So I can have my exe convert into a shapefile and an mp3?
voidUpdate•5mo ago
Well I tried to convert an exe into a pptx and it outputted a file that looked like an attempt at html, saying that the conversion wasn't feasible due to its nature and size
lloeki•5mo ago
You have it the wrong way around? Usually you are handed a pptx by customers and your job is to turn that into an exe.
voidUpdate•5mo ago
I mean the website says it can convert anything to anything ("every file exists as all file types all of the time") so it should be able to do exe to pptx
thebeardisred•5mo ago
You missed the joke
jeroenhd•5mo ago
Based on important research like https://www.youtube.com/watch?v=uNjxe8ShM-8, it should definitely be possible to generate a .pptx that will run Windows inside of an emulator inside PowerPoint slides. That HTML file is lying to you!
grandslammer•5mo ago
i need to work on the pptx conversions and some of the other file types specifically- now i'm on this
grandslammer•5mo ago
I’m actually working on some really interesting conversions right now
dgan•5mo ago
I've read the title 5 times and can't make sense of it. Is this even valid English?
Akronymus•5mo ago
>Imagine a drive[,] where every file exists[,] as all file types[,] all of the time

Basically treating one file type as if it were any arbitrary other file type

quesera•5mo ago
Punctuated like that, I can't help reading it in the movie trailer guy[0] voice.

[0] https://en.wikipedia.org/wiki/Don_LaFontaine .. wow, dead for 17 years!

grandslammer•5mo ago
Hahahahaha thank you??
grandslammer•5mo ago
I may need to work on the short pitch
sigmaisaletter•5mo ago
It's a fancy file format conversion utility.

Am I missing something?

dsr_•5mo ago
Yes: it's a fancy file format conversion service that adds errors so your QA people have more work.
emadda•5mo ago
Could have called it quantumdoc
grandslammer•5mo ago
It’s not too late
jy14898•5mo ago
Now make it an HTTP API where content negotiation always succeeds
ramses0•5mo ago
Back in the day there were a bunch of `x2y` programs[1], like html2pdf, xls2csv, rst2odt, jpg2png, png2jpg, etc...

You could imagine something like `any2zip`, or `any2tgz` or `iso2mp4` or something.

It seems like there could/should be some sort of virtual filesystem where you could say "cat inventory.xls.csv", or "wine.exe excel.exe inventory.csv.xls" (please bear with me on these examples). Effectively "$BLOB.format.format", where "." becomes a sort of "convert to this $TYPE".

Imagine being able to say:

    `echo "# Hello\n\n * World" > README.md ; cat README.md.html`
    (effectively invoking `md2html`)
    
    `printer README.md.html.pdf`
    (eg: `cat README.md | md2html | html2pdf | printer`)
...if you requested `README.md.pdf`, maybe it could intuit the intermediate `md2html2pdf` (HTML) portion?

I really wish local linux filesystems (for end-users) would at least match Apple's capabilities. eg: `$RECENT`, spotlight, auto-OCR. We've really regressed since the era of `locate`, but I'd _LOVE_ some sort of modern equivalent.

Imagine: `inotify`, `auditd`, just anything that can avoid full-disk scans during "normal end user" daily operation... wired up to `llm-summary $FILE >> sqlite.db ; llava-describe $IMAGE >> sqlite.db ; etc...`

For bonus points, catch anything missed with some sort of full daily/weekly backup operation. We're on the cusp of a much more intimate "partnership" with the compute boxes underneath our desks, but so much is getting sucked into the void of "the network is the computer".

[1]: compgen -c | grep 2 | grep -v '2$' | grep -v '\.2' | grep -v '2\.'

[2]: https://en.wikipedia.org/wiki/Locate_(Unix)
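
The `$BLOB.format.format` idea above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the converter registry, its placeholder bodies, and the names `parse_virtual_name`, `find_route`, and `read_virtual` are all hypothetical, and a real version would sit behind a virtual filesystem layer and call actual tools such as `md2html`. The breadth-first search is one possible way to "intuit" a missing intermediate step like `md -> html -> pdf`.

```python
from collections import deque

# Hypothetical converter registry: (source, target) -> function on text.
# The bodies are placeholders, not real md2html/html2pdf implementations.
CONVERTERS = {
    ("md", "html"): lambda text: "<html>" + text + "</html>",
    ("html", "pdf"): lambda text: "%PDF-placeholder " + text,
    ("xls", "csv"): lambda text: text.replace("\t", ","),
}
KNOWN = {fmt for pair in CONVERTERS for fmt in pair}


def parse_virtual_name(name):
    """Split 'README.md.html.pdf' into ('README.md', ['md', 'html', 'pdf'])."""
    parts = name.split(".")
    i = len(parts)
    # Walk back over the trailing run of known format extensions.
    while i > 1 and parts[i - 1] in KNOWN:
        i -= 1
    formats = parts[i:]
    if not formats:
        return name, []  # no known format: treat as a plain file
    real_name = ".".join(parts[:i] + formats[:1])
    return real_name, formats


def find_route(src, dst):
    """Breadth-first search for the shortest chain of registered converters."""
    queue = deque([[src]])
    seen = {src}
    while queue:
        path = queue.popleft()
        if path[-1] == dst:
            return path
        for a, b in CONVERTERS:
            if a == path[-1] and b not in seen:
                seen.add(b)
                queue.append(path + [b])
    return None


def read_virtual(name, files):
    """Resolve a virtual name against `files` (real name -> text)."""
    real_name, formats = parse_virtual_name(name)
    text = files[real_name]
    for src, dst in zip(formats, formats[1:]):
        route = find_route(src, dst)
        if route is None:
            raise ValueError(f"no conversion from {src} to {dst}")
        for a, b in zip(route, route[1:]):
            text = CONVERTERS[(a, b)](text)
    return text


files = {"README.md": "# Hello"}
print(read_virtual("README.md.html", files))
print(read_virtual("README.md.pdf", files))  # routed through html
```

Treating the trailing run of known extensions as the conversion chain is a design choice; it would misread a real file whose name happens to end in two known extensions.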

RetroTechie•5mo ago
> Back in the day there were a bunch of `x2y` programs[1], like html2pdf, xls2csv, rst2odt, jpg2png, png2jpg, etc...

They're still around. A problem is loss of information on each conversion. For example, wav->mp3 loses info. Converting back (mp3->wav) won't get you the exact .wav you started with. Similar thing with file types supporting different resolution graphics, vector vs. bitmap, metadata being stripped, features in format A not supported in format B, etc.

Another problem is the explosion of M:N file format combinations. A possible fix would be a universal (?) in-between format, functioning as a container for [portions of a file] + whatever metadata was extracted from original. That way you can at least do conversions along the lines of video container formats, where container type is changed but video inside does not get decompressed/re-encoded. Or similar operations like extracting/shuffling pages in a pdf document.

All in all this is not an easy problem & therefore unlikely to be solved anytime soon.
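
To put rough numbers on the combination problem: with N formats, direct conversion needs a converter for every ordered pair, while an in-between format needs one importer and one exporter per format. A small Python sketch of that arithmetic follows; the function names are made up for illustration, and the count says nothing about the information-loss problem described above.

```python
def pairwise_converters(n: int) -> int:
    """Directed converters needed if every format converts straight to every other."""
    return n * (n - 1)


def hub_converters(n: int) -> int:
    """Converters needed with one in-between format: an importer and an exporter per format."""
    return 2 * n


for n in (3, 10, 50):
    print(n, pairwise_converters(n), hub_converters(n))
```

With 10 formats that is 90 direct converters versus 20 through a hub, and the gap grows quadratically.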

grandslammer•5mo ago
really appreciate you adding to the discourse here - I'm not sure if you got a chance to test out the site but I refilled my credits after the surge of attention and would love if you checked it out! also @boshjerns on X if you want to reach out to chat
TomMasz•5mo ago
I got "No video with supported format and MIME type found." in the How It Works section.
troyvit•5mo ago
I think you can get past that if you download the video, then upload it back up to anydoc and ask it to translate it to Markdown.

edit: /s

grandslammer•5mo ago
hahahaahhaha nice one
grandslammer•5mo ago
Video files are not supported right now, which is probably the issue. I’m working on this: I’m going to have to parse the video into frames and then feed the frames into a model, and I just need to work this out a little bit more.
Y_Y•5mo ago
I thought this was something like a FUSE driver that would on-the-fly generate any file you tried to read, with some consistency. Like if you open stories/zombie-party.txt it will have some generative network make it, and cache it. If you later ask for stories/zombie-party.odt it can just do a conversion.

I vibe-coded a demo of such a thing, with the idea of making game assets like textures/outdoor/wall.jpg etc. You can do it easily enough, but you need to be patient, and not particularly discerning.
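
The generate-then-cache behaviour described above can be sketched without FUSE at all. This is a minimal Python sketch under stated assumptions, not the demo mentioned in the comment: `LazyStore`, `generate`, and `convert` are hypothetical names, and the generator and converter are passed in as plain functions so that a real generative model or conversion tool could be substituted.

```python
class LazyStore:
    """Generate a document on first read, then serve other formats by conversion."""

    def __init__(self, generate, convert):
        self.generate = generate  # hypothetical: (stem) -> text
        self.convert = convert    # hypothetical: (text, src_ext, dst_ext) -> text
        self.cache = {}           # stem -> {ext: text}

    def read(self, path):
        stem, dot, ext = path.rpartition(".")
        if not dot:
            raise ValueError(f"path needs an extension: {path!r}")
        versions = self.cache.setdefault(stem, {})
        if ext in versions:
            return versions[ext]
        if versions:
            # Some format already exists: convert it so the content stays consistent.
            src_ext, text = next(iter(versions.items()))
            versions[ext] = self.convert(text, src_ext, ext)
        else:
            versions[ext] = self.generate(stem)
        return versions[ext]


calls = []


def fake_generate(stem):
    calls.append(stem)  # record how often the expensive step runs
    return f"story about {stem}"


def fake_convert(text, src_ext, dst_ext):
    return f"[{dst_ext}] {text}"


store = LazyStore(fake_generate, fake_convert)
print(store.read("stories/zombie-party.txt"))
print(store.read("stories/zombie-party.odt"))
print(len(calls))
```

The second read reuses the cached text and only converts it, so the expensive generation step runs once per stem.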

raphman•5mo ago
FWIW, I wrote a small paper on this general topic a few years ago, collecting earlier work and own ideas.

"Files as Directories: Some Thoughts on Accessing Structured Data within Files"

https://dl.acm.org/doi/pdf/10.1145/3191697.3214323

RetroTechie•5mo ago
Says "get access" with a "locked" icon.

Is this paper freely available somewhere?

kurtoid•5mo ago
Here: https://www.shift-society.org/salon/papers/2018/revised/file...

Link found here: https://scholar.google.com/scholar?cluster=14832107127874645...

raphman•5mo ago
Thanks! Sorry - I didn't realize that the paper in the ACM DL is not open-access.
grandslammer•5mo ago
i think we have some similar thoughts - i am working on a file format that accomplishes something like this
quesera•5mo ago
I played with this idea for media servers.

I want iTunes and Audiobookshelf and beets and Jellyfin, etc to all work on the same filesystem and media archive.

There are challenges.

lawlessone•5mo ago
Wouldn't this make every file a lot bigger?
grandslammer•5mo ago
It’s not really saving everything into one file; rather, it provides a layer that gives access to all the file types easily and fast.
jeroenhd•5mo ago
You might not want to use https://anydocai.com/result/<incremental number> for URLs like that. Anyone can enumerate the ~300 files from the home page and look at what others have uploaded.

That said, the website doesn't seem to work anymore. It just errors out.

pavel_lishin•5mo ago
I wonder if they ran out of credits.
grandslammer•5mo ago
Yeah, this is exactly what happened. I did not expect this or catch up until just now, but I just fixed it.
grandslammer•5mo ago
I didn’t expect this to go semi-viral on here, so I just refilled the credits. It actually ran out of OpenAI credits.
woleium•5mo ago
gemini is cheaper, probably
grandslammer•5mo ago
Also, as far as the enumeration goes: users are only authorized to access the files that they’ve created in our system, but I should definitely obscure the file count.
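
One common way to address both points is to replace incremental result numbers with unguessable random tokens and to check ownership on every fetch. This is a minimal Python sketch, assuming an in-memory store; `store_result`, `fetch_result`, and the record layout are hypothetical and do not describe AnydocAI's actual backend.

```python
import secrets

# Hypothetical in-memory store: token -> {"owner": user_id, "data": ...}
results = {}


def store_result(owner, data):
    """Save a result under a random URL-safe token instead of a counter."""
    token = secrets.token_urlsafe(16)  # 16 random bytes, not guessable or countable
    results[token] = {"owner": owner, "data": data}
    return token


def fetch_result(token, user):
    """Return the data only to its owner."""
    record = results.get(token)
    if record is None or record["owner"] != user:
        return None  # same answer for "missing" and "not yours"
    return record["data"]


token = store_result("alice", "converted.csv contents")
print(fetch_result(token, "alice"))
print(fetch_result(token, "bob"))
```

Returning the same response for a missing token and a token owned by someone else avoids confirming which IDs exist.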
grandslammer•5mo ago
I’m like a mid-level developer though, so if I messed up the authorization and you worked around it in some way, it would be sick if you let me know. @boshjerns on X
_wire_•5mo ago
Imagine no file types

♪ It's easy if you try No hell below us Above us only sky Imagine all the people Visualize whirled peas Ah ah ah oooo!

You may say I'm a dreamer...

grandslammer•5mo ago
real one