Hi HN,
We are the OpenDCAI group from Peking University. We built Paper2Any, an open-source tool designed to automate the "Paper to Slides" workflow based on our DataFlow-Agent framework.
The Problem: Writing papers is hard, but creating professional architecture diagrams and slides (PPTs) is often even more tedious. Most AI tools only generate static images (PNGs), which are nearly impossible to tweak for final publication.
The Solution: Paper2Any takes a PDF, text, or sketch as input, understands the research logic, and generates fully editable PPTX (PowerPoint) files and SVGs. We prioritize flexibility and fidelity: you can specify page ranges, switch visual styles, and preserve original assets.
How it works:
1. Multimodal Reading: Extracts text and visual elements from the paper. You can now specify page ranges (e.g., Method section only) to focus the context and reduce token usage.
2. Content Understanding: Identifies core contributions and structural logic.
3. PPT Generation: Instead of generating one flat image, it generates independent elements (blocks, arrows, text) with selectable visual styles and organizes them into a slide layout.
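To make steps 1 and 3 more concrete, here is a minimal Python sketch of the two ideas: restricting input to a page range, and emitting a diagram as independent, individually editable elements rather than one flat image. This is an illustration only, not Paper2Any's actual code. The names `parse_page_range`, `Block`, and `render_svg` are hypothetical, the page-range syntax ("3-5,8") is an assumption, and it uses only the Python standard library to write SVG (the real tool also produces PPTX).

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


def parse_page_range(spec: str, total_pages: int) -> list[int]:
    """Expand a spec like "3-5,8" into sorted 1-based page numbers.

    Hypothetical syntax; pages outside 1..total_pages are dropped.
    """
    pages: set[int] = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start, end = (int(x) for x in part.split("-", 1))
        else:
            start = end = int(part)
        if start > end:
            raise ValueError(f"invalid range: {part!r}")
        pages.update(range(max(start, 1), min(end, total_pages) + 1))
    return sorted(pages)


@dataclass
class Block:
    """One diagram box; each block becomes its own SVG group."""
    id: str
    label: str
    x: int
    y: int
    w: int = 160
    h: int = 60


def render_svg(blocks: list[Block], arrows: list[tuple[str, str]],
               fill: str = "#e8f0fe") -> str:
    """Render blocks and connecting lines as separate SVG elements.

    `fill` stands in for a selectable visual style (an assumption).
    """
    svg = ET.Element("svg", xmlns="http://www.w3.org/2000/svg",
                     width="800", height="200")
    by_id = {b.id: b for b in blocks}
    for b in blocks:
        # A group per block keeps the box and its label movable together.
        g = ET.SubElement(svg, "g", id=b.id)
        ET.SubElement(g, "rect", x=str(b.x), y=str(b.y),
                      width=str(b.w), height=str(b.h),
                      fill=fill, stroke="#333")
        text = ET.SubElement(g, "text", x=str(b.x + b.w // 2),
                             y=str(b.y + b.h // 2),
                             **{"text-anchor": "middle"})
        text.text = b.label
    for src, dst in arrows:
        a, b = by_id[src], by_id[dst]
        # Connect the right edge of the source to the left edge of the target.
        ET.SubElement(svg, "line", id=f"{src}-to-{dst}",
                      x1=str(a.x + a.w), y1=str(a.y + a.h // 2),
                      x2=str(b.x), y2=str(b.y + b.h // 2),
                      stroke="#333")
    return ET.tostring(svg, encoding="unicode")


pages = parse_page_range("3-5,8", total_pages=12)
diagram = render_svg(
    [Block("encoder", "Encoder", 40, 70), Block("decoder", "Decoder", 320, 70)],
    [("encoder", "decoder")],
)
```

Because every box, label, and connector is a separate element with its own id, the output can be opened in a vector editor and adjusted piece by piece, which is the property a flat PNG lacks.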
Links:
- Demo: http://dcai-paper2any.cpolar.top/
- Code (DataFlow-Agent): https://github.com/OpenDCAI/DataFlow-Agent
We'd love to hear your feedback on the generation quality and the agent workflow!