
I got tired of spending hours in PowerPoint and TikZ drawing methodology diagrams for my papers. So I built PaperBanana — you paste your Method section text, and it generates a publication-ready figure in about 2-3 minutes.

How it works under the hood:

1. A Retriever agent searches a curated database of real academic diagrams to find structurally similar references.
2. A Planner agent reads your text and generates a detailed visual description (layout, components, connections, groupings).
3. A Stylist agent polishes the visual aesthetics without changing content.
4. Then it enters an iterative loop: a Visualizer generates the image, and a Critic evaluates it and suggests revisions. This repeats 1-5 times (you choose).
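For readers who think in code, the four steps can be sketched roughly as below. This is a minimal illustration, not PaperBanana's actual implementation: the function name `generate_figure`, the `Critique` type, the callable signatures, and the early exit once the critic approves are all assumptions made for the example. Each agent is passed in as a plain callable so the control flow stays visible.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Critique:
    """Hypothetical critic verdict: approve, or return revision notes."""
    approved: bool
    feedback: str = ""


def generate_figure(
    method_text: str,
    retrieve: Callable[[str], list[str]],
    plan: Callable[[str, list[str]], str],
    style: Callable[[str], str],
    visualize: Callable[[str], bytes],
    critique: Callable[[bytes, str], Critique],
    max_rounds: int = 3,
) -> bytes:
    """Run retrieve -> plan -> style, then a bounded visualize/critique loop."""
    if not 1 <= max_rounds <= 5:
        raise ValueError("max_rounds must be between 1 and 5")

    # Steps 1-3: find reference diagrams, plan the layout, polish the style.
    references = retrieve(method_text)
    description = style(plan(method_text, references))

    # Step 4: render, critique, and fold the feedback into the next round.
    image = b""
    for _ in range(max_rounds):
        image = visualize(description)
        verdict = critique(image, description)
        if verdict.approved:  # assumption: stop early when the critic is satisfied
            break
        description = f"{description}\nRevision notes: {verdict.feedback}"
    return image
```

In a sketch like this, `max_rounds` plays the role of the user-chosen 1-5 iterations, and the critic's feedback only reaches the visualizer through the updated description.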

The key insight is that academic diagrams follow conventions: Transformer architectures, GAN pipelines, and RLHF frameworks all have recognizable visual patterns. Retrieving relevant references first gets the output much closer to what you'd actually put in a paper than generic AI image generation does.
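To make the retrieval idea concrete, here is a toy sketch that ranks reference diagrams by word overlap with the method text. The scoring method (Jaccard similarity over lowercase tokens), the function names, and the reference descriptions are all assumptions for illustration; the post does not say how the real Retriever measures structural similarity.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a: set[str], b: set[str]) -> float:
    """Overlap between two token sets, from 0.0 (disjoint) to 1.0 (identical)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def top_references(method_text: str, references: dict[str, str], k: int = 2) -> list[str]:
    """Return the names of the k reference descriptions closest to the method text."""
    query = tokenize(method_text)
    ranked = sorted(
        references,
        key=lambda name: jaccard(query, tokenize(references[name])),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical reference entries, keyed by diagram type.
REFERENCES = {
    "transformer": "encoder decoder self attention layers",
    "gan": "generator discriminator adversarial training",
    "rlhf": "reward model policy ppo human feedback",
}

best = top_references("We train a generator against a discriminator", REFERENCES, k=1)
```

A real system would more likely use embeddings than raw word overlap, but the shape is the same: score every reference against the input, keep the top few, and hand them to the planner.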

Built with: Next.js + FastAPI + Celery, using Gemini 2.5 Flash for planning/critique and Nanobanana Pro/Seedream for image generation.

Try it here: https://paperbanana.online

Some examples it handles well: Transformer architectures, GAN training pipelines, RLHF frameworks, multi-agent systems, encoder-decoder architectures.

Known limitations:
- Works best for CS/AI methodology diagrams; not optimized for biology, chemistry, or general scientific illustration
- Text rendering in generated images isn't perfect yet; sometimes labels get slightly garbled
- The curated reference database is still small (13 examples); expanding it is ongoing work

Would love feedback from anyone who writes papers regularly. What types of diagrams do you struggle with most?

Show HN: PaperBanana – Paste methodology text, get publication-ready diagrams
