TexGuardian – Claude Code, but for LaTeX academic papers

https://github.com/arcAman07/TexGuardian

2•amananytime07•1h ago

Comments

amananytime07•1h ago

I built TexGuardian after spending yet another deadline night fighting LaTeX formatting instead of focusing on research. Every conference submission, the same ritual: figure overflows, citation format issues, TODO markers left in text, hallucinated references from ChatGPT, forgotten anonymization. Hours wasted on mechanical formatting when you should be sleeping or refining ideas.

TexGuardian is a CLI that treats paper preparation like code review. It reads your entire .tex and .bib files, understands LaTeX structure and venue requirements, then generates reviewable unified diff patches for every issue it finds. No blind rewrites, no mysterious changes — every edit is shown as a diff you approve before it touches your files.
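
To make the "reviewable diff" idea concrete, here is a minimal sketch using Python's standard `difflib`. It is not TexGuardian's actual code: `make_patch` and the sample `paper.tex` line are made up for illustration.

```python
import difflib


def make_patch(original: str, proposed: str, path: str) -> str:
    """Return a unified diff between two versions of a file's text.

    Hypothetical helper for illustration; not part of TexGuardian's API.
    """
    diff = difflib.unified_diff(
        original.splitlines(keepends=True),
        proposed.splitlines(keepends=True),
        fromfile=f"a/{path}",
        tofile=f"b/{path}",
    )
    return "".join(diff)


# Assumed example: a figure wider than the column, and a proposed fix.
before = "\\includegraphics[width=1.4\\columnwidth]{fig1.pdf}\n"
after = "\\includegraphics[width=\\columnwidth]{fig1.pdf}\n"

patch = make_patch(before, after, "paper.tex")
print(patch)  # shown to the user for approval; nothing is written to disk here
```

The point of the design is that the patch is plain text: it can be read, rejected, or applied later, and the original file is untouched until the user says yes.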

The `/review full` command runs a 7-step autonomous pipeline:

1. Compile with latexmk (proper error reporting)
2. Verify (figures, citations, TODOs, page limits, custom regex rules)
3. Fix issues with LLM-generated patches
4. Validate citations against CrossRef and Semantic Scholar APIs (catches hallucinated or outdated references)
5. Analyze figures (width overflows, placement, captions)
6. Analyze tables (booktabs compliance, column overflow)
7. Visual polish: renders PDF to images, sends to vision model, catches overlapping figures, bad spacing, margin violations that text-only analysis can't see
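
For step 4, a rough sketch of what concurrent citation checking could look like with `asyncio`. The `lookup` function is a stub standing in for real CrossRef and Semantic Scholar requests, and `KNOWN_TITLES`, `validate`, and the sample titles are all assumptions for illustration, not TexGuardian internals.

```python
import asyncio

# Stub data standing in for what the remote APIs would know about.
KNOWN_TITLES = {"attention is all you need"}


async def lookup(source: str, title: str) -> bool:
    """Hypothetical lookup; a real version would query the named API."""
    await asyncio.sleep(0)  # placeholder for network latency
    return title.lower() in KNOWN_TITLES


async def validate(titles: list[str]) -> dict[str, bool]:
    """Check every title against both sources concurrently."""

    async def check(title: str) -> tuple[str, bool]:
        hits = await asyncio.gather(
            lookup("crossref", title),
            lookup("semantic_scholar", title),
        )
        # Treat a citation as found if either source knows it.
        return title, any(hits)

    results = await asyncio.gather(*(check(t) for t in titles))
    return dict(results)


report = asyncio.run(
    validate(["Attention Is All You Need", "A Paper That Does Not Exist"])
)
print(report)
```

Titles that come back `False` are the candidates for a hallucinated or mistyped reference and would be flagged for review rather than silently removed.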

Key design decisions:

- Checkpoint system with instant rollback: every modification creates a restore point
- Unified diff patches only, never direct file writes, which makes LLM edits auditable and reversible
- Async citation validation with concurrent API calls to CrossRef and Semantic Scholar
- Vision model convergence loop for PDF polish: iterates render → analyze → patch until quality stabilizes
- Natural language + slash commands: mix "/anonymize" with "fix the figure on line 303"
- Pluggable LLM backends (AWS Bedrock, OpenRouter): the default is Claude Opus 4.5, but any model is supported
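
The checkpoint idea can be sketched in a few lines. This is an assumed, simplified design (copy the file before each modification, copy it back on rollback); `CheckpointStore` and its methods are hypothetical names, not the tool's real classes.

```python
import shutil
import tempfile
from pathlib import Path


class CheckpointStore:
    """Keeps numbered copies of a file so any edit can be undone."""

    def __init__(self, root: Path) -> None:
        self.root = root
        self.root.mkdir(parents=True, exist_ok=True)
        self.count = 0

    def save(self, file: Path) -> int:
        """Snapshot the file and return the checkpoint id."""
        self.count += 1
        shutil.copy2(file, self.root / f"{self.count:04d}_{file.name}")
        return self.count

    def rollback(self, file: Path, checkpoint_id: int) -> None:
        """Restore the file to the state captured by checkpoint_id."""
        shutil.copy2(self.root / f"{checkpoint_id:04d}_{file.name}", file)


with tempfile.TemporaryDirectory() as tmp:
    paper = Path(tmp) / "paper.tex"
    paper.write_text("\\title{Draft}\n")

    store = CheckpointStore(Path(tmp) / "checkpoints")
    cp = store.save(paper)                 # restore point before the edit
    paper.write_text("\\title{Broken}\n")  # simulated bad patch
    edited = paper.read_text()

    store.rollback(paper, cp)
    restored = paper.read_text()

print(restored)
```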

Works with 14 conference templates out of the box, including NeurIPS, ICML, ICLR, AAAI, CVPR, ECCV, ACL, EMNLP, NAACL, COLING, CHI, and KDD. Custom venue rules can be added as regex patterns in paper_spec.md.
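
As a rough illustration of regex-based venue rules, the sketch below scans LaTeX source line by line. The rule names, the patterns, and the `check` function are assumptions for the example; the actual paper_spec.md format is not shown here.

```python
import re

# Hypothetical rules; a real spec would load these from paper_spec.md.
RULES = {
    "todo_marker": re.compile(r"\b(?:TODO|FIXME)\b"),
    "acknowledgments": re.compile(r"\\section\*?\{Acknowledge?ments\}"),
}


def check(tex: str, rules: dict[str, re.Pattern]) -> list[tuple[int, str]]:
    """Return (line number, rule name) for every line that trips a rule."""
    findings = []
    for lineno, line in enumerate(tex.splitlines(), start=1):
        for name, pattern in rules.items():
            if pattern.search(line):
                findings.append((lineno, name))
    return findings


sample = (
    "\\section{Method}\n"
    "TODO: cite baseline\n"
    "\\section*{Acknowledgments}\n"
)
findings = check(sample, RULES)
print(findings)
```

Reporting line numbers keeps each finding easy to turn into a targeted patch, for example removing an acknowledgments section from an anonymized submission.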

What started as a personal tool became something I thought the research community might find useful. If you've ever debugged \columnwidth calculations at 2 AM or validated 50 citations manually, this is for you.

pip install texguardian

Happy to answer questions about the architecture, LLM integration patterns, or take feature requests.
