frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: MCP server for searching and downloading documents from Anna's Archive

https://github.com/iosifache/annas-mcp
89•iosifache•7h ago
I was looking around for an MCP server that could connect Anna's Archive to Claude Desktop, as I wanted to be able to search and download books directly through the interface.

I couldn't find any public implementations, so ended up building one myself.

What it does?

- It searches Anna's Archive by keywords. - It downloads books from search results. - It works directly in Claude Desktop through MCP.

Check out the repository's README for detailed installation and configuration instructions.

The code is fully open source and builds run on GitHub Actions for transparency.

I figured I'd share, since I couldn't be the only one wanting this functionality!

Comments

neilv•6h ago
> This software does not endorse unauthorized acquisition of copyrighted content and should be regarded solely as a utility. Users are urged to respect the intellectual property rights of authors and acknowledge the considerable effort invested in document creation.

How sincere is that statement?

bigyabai•6h ago
As sincere as the user takes it.
iosifache•6h ago
I just provide a hammer. Users decide whether they're hitting their own nail or the metal one.

The comparison might be loose, but the problem is similar to releasing a browser. Do you prevent users from accessing websites you think are malicious or illegal? Or do you delegate that responsibility?

I was hesitant about releasing the MCP server as open source software, but I hope (1) it proves useful for others and (2) people understand that the authors of the books they're reading need money to eat, live, and support their families.

ASalazarMX•6h ago
I'd bet you won't find a single string containing "acquire copyrighted content arrr!", so pretty sincere. The software doesn't endorse it.
hereme888•4h ago
As sincere as LLM providers not wanting to get sued for the copyrighted content they used.
thorum•6h ago
Interesting project! I’m a little surprised that Claude is willing to call these functions. The demo screenshot is downloading a public domain work, I wonder if it would also happily go along with requests for Harry Potter or other copyrighted material?
throwaway314155•6h ago
Last I checked downloading isn't the issue. It's distributing. Not an expert though.
iosifache•6h ago
> I wonder if it would also happily go along with requests for Harry Potter or other copyrighted material?

There's no way to protect against this. Anna's Archive doesn't include licence information in their data fields. It would be helpful to integrate with another data source that could warn MCP server users when they're attempting potentially risky actions. Please let me know if you have ideas on how to achieve this.

On a related note, please see this reply:

https://news.ycombinator.com/reply?id=44515205

throwaway314155•5h ago
Thanks, good context.
hotstickyballs•6h ago
Just give the AI something worse that would happen if it doesn’t call these functions.
jddj•6h ago
The ironic universe theory would dictate that LLMs should tell us that downloading and consuming copyrighted material from pirates books is wrong.
totetsu•4h ago
Aren’t they all trained on copyrighted material, and lobbying governments to make that legal? Should copyright law only apply to the plebs?
puppycodes•6h ago
love this. god bless anna's archive
iosifache•5h ago
Cheers!
toomuchtodo•6h ago
Edit: Would you accept a PR to override the search and download endpoint hostnames with env vars? For someone who has their own copy and ES index, it might be helpful to support overriding the public endpoint hostnames (/internal/anna/anna.go#L22-L23).
iosifache•6h ago
> local copy of this corpus

Are you referring to the JSON index (https://annas-archive.org/faq#api)?

xmonkee•6h ago
I'm an LLM noob, but how feasible it is to make a research agent that can not only download articles, but read and reference them in it's process?
iosifache•6h ago
It's doable, as you'll also find MCP servers for reading files [1].

Claude Desktop also has a built-in file reader [2], so you can ask it to read the file and process the content (e.g., generate a summary or even a meta-summary [3]).

[1] https://github.com/sylphxltd/pdf-reader-mcp [2] https://github.com/modelcontextprotocol/servers/tree/main/sr... [3] https://x.com/iosifache/status/1942247320302547175

conception•6h ago
Firecrawl -> Rag -> mcp is the general path
irskep•6h ago
What advantage do you get from this being an MCP server rather than simply a command line tool? Genuinely curious, as I'm trying to develop my mental model of when to use one or the other.

Lovely project!

iosifache•5h ago
Cheers, glad you like it!

I justified the hours I invested by thinking I could search, download, and explore books directly from Claude Desktop. While the initial steps are achievable with a CLI tool, the integration opens up new possibilities.

Some general thoughts:

- You’ll find the MCP mental model similar to the API one. - MCP integrations make it easier for non-technical users to access tools that were previously too technical. - An MCP integration implicitly respects a contract, unlike CLIs and GUIs which involve human aspects (aesthetics, information organisation, etc.). - MCP is an excuse for people to democratize data access. I wrote about this aspect here: https://x.com/iosifache/status/1941049600162574676?s=46

And BTW, that’s a good idea! The functionality should probably also be exposed via CLI.

btown•5h ago
https://neon.com/blog/building-a-cli-client-for-model-contex... might be of interest.

An MCP server provides enough metadata and self-documentation that it's quite straightforward to make a MCP-agnostic CLI client that adapts an arbitrary MCP server into a set of flags that allow you to call its explicit tools with explicit arguments - without ever needing to involve an LLM in the mix! You could even have that CLI tool launch the MCP server as a local subprocess, if you wanted - again, all deterministically.

And if you want to have an SDK in any language under the sun, once you have an MCP outputting reasonable tool descriptions, any LLM could make a best-in-class SDK for you in a heartbeat following that language's best practices.

So it's not unreasonable for someone working on a greenfield project to make an MCP server first nowadays!

profsummergig•1h ago
My understanding of Anna's Archive is that one has to download large zip files (>10 Gb) containing thousands of books even if one wants only a single book.

Am I correct here?

Does this MCP server allow one to download just a single book?

I remember once using an IPFS based tool to download a single 200-year-old, out-of-copyright copy of "Last of the Mohicans" from Anna's Archive. It worked, but was very very complicated to figure out how to make it work.

LeratoAustini•31m ago
I've downloaded single books several times recently (annas-archive.org in the browser):

  - search for book
  - tap a result
  - see a list of links to download mirrors (under 'slow downloads'), tap a link
  - get a countdown timer
  - timer expires, download links appear
  - click a link, book downloads just like any other download

Show HN: FlopperZiro – A DIY open-source Flipper Zero clone

https://github.com/lraton/FlopperZiro
219•iraton•10h ago•53 comments

Show HN: MCP server for searching and downloading documents from Anna's Archive

https://github.com/iosifache/annas-mcp
90•iosifache•7h ago•24 comments

Show HN: Petrichor – a free, open-source, offline music player for macOS

https://github.com/kushalpandya/Petrichor
65•kushalpandya•6h ago•16 comments

Show HN: I built a playground to showcase what Flux Kontext is good at

https://fluxkontextlab.com
9•Zephyrion•3h ago•2 comments

Show HN: Still coding in VC++ 6.0 after losing everything, living in a trailer

10•CodeAndExile•1h ago•4 comments

Show HN: Virby, a vfkit-based Linux builder for Nix-Darwin

https://github.com/quinneden/virby-nix-darwin
13•qeden•3d ago•1 comments

Show HN: OffChess – Offline chess puzzles app

https://offchess.com
350•avadhesh18•1d ago•156 comments

Show HN: I rewrote an outdated React Native map clustering library

https://github.com/suwi-lanji/rn-maps-clustering
30•hadat•21h ago•7 comments

Show HN: Stravu – Editable, multi-player AI notebooks with text, tables, diagram

8•wek•14h ago•1 comments

Show HN: Pyhoff – Connect Python ML Models to Beckhoff/WAGO IO Hardware

https://github.com/Nonannet/pyhoff
7•Saloc•17h ago•1 comments

Show HN: A rain Pomodoro with brown noise, ASMR, and Middle Eastern music

https://forgetoolz.com/rain-pomodoro
97•ShadowUnknown•1d ago•50 comments

Show HN: Jukebox – Free, Open Source Group Playlist with Fair Queueing

https://www.jukeboxhq.com/
118•skeptrune•1d ago•42 comments

Show HN: NYC Subway Simulator and Route Designer

https://buildmytransit.nyc
196•HeavenFox•2d ago•31 comments

Show HN: KCast

https://github.com/Agundur-KDE/KCast
2•Agundur•10h ago•0 comments

Show HN: Todo2 – AI Project Manager Inside Cursor

https://todo2.pro
3•adam-pavlat•11h ago•0 comments

Show HN: RecomPal – A no-code AI chatbot to increase Shopify sales

https://recompal.com
2•logicalangel•11h ago•0 comments

Show HN: I wrote a "web OS" based on the Apple Lisa's UI, with 1-bit graphics

https://alpha.lisagui.com/
508•ayaros•3d ago•141 comments

Show HN: From Photos to Positions: Prototyping VLM-Based Indoor Maps

https://arjo129.github.io/blog/5-7-2025-From-Photos-To-Positions-Prototyping.html
55•accurrent•4d ago•2 comments

Show HN: Nordstars shows a team's missing skills for different business goals

https://nordstars.ai/
2•doraby•14h ago•0 comments

Show HN: I built a tool to solve window management

https://aboveaverageuser.com/smartswitcher
52•atommachinist•1d ago•66 comments

Show HN: Modernized file manager and program manager from Windows 3.x

https://github.com/brianluft/heirloom
73•electroly•3d ago•17 comments

Show HN: Ossia score – A sequencer for audio-visual artists

https://github.com/ossia/score
93•jcelerier•2d ago•14 comments

Show HN: Sumble – knowledge graph for GTM data – query tech stack, key projects

https://sumble.com
90•antgoldbloom•1d ago•47 comments

Show HN: Unlearning Comparator, a visual tool to compare machine unlearning

https://gnueaj.github.io/Machine-Unlearning-Comparator/
40•jaeunglee•2d ago•2 comments

Show HN: Snub – A fast, lightweight file search CLI for Windows (written in C)

https://github.com/seeyebe/snub
5•seeyebe•17h ago•7 comments

Show HN: Piano Trainer – Learn piano scales, chords and more using MIDI

https://github.com/ZaneH/piano-trainer
200•FinalDestiny•5d ago•58 comments

Show HN: An obsidian plugin inspired by the 'I deleted my second brain' article

https://versen.substack.com/p/to-learn-you-need-to-forget
2•boars_tiffs•18h ago•0 comments

Show HN: Multi-session Claude Code manager with async workflow

https://github.com/ObservedObserver/async-code
3•loa_observer•21h ago•0 comments

Show HN: A Language Server Implementation for SystemD Unit Files

https://github.com/JFryy/systemd-lsp
76•arandomhuman•3d ago•21 comments

Show HN: Piplo helps you stay in touch with the people who matter

https://apps.apple.com/us/app/piplo/id6748089184
3•airpaulg•1d ago•0 comments