frontpage.

Show HN: Optimize and launch a travel-planning AI application in minutes

https://www.gensee.ai/

4•yiyingzhang•6mo ago

We're the creators of Gensee, a platform we built to help developers quickly productionize their AI agents and workflows.

To show how Gensee works, we created a new end-to-end demo https://www.youtube.com/watch?v=AXIX9LgN4mU where we build and launch a travel planner AI application: https://demo.gensee.ai/travel-planner. The web app uses two agents: one to generate a travel plan based on user requirements built using CamelAI's multi-agent society, and another to answer follow-up questions with LLM and web search using no framework (pure Python). We've also open-sourced the travel planner app itself: https://github.com/GenseeAI/Trip-planner-demo.

Here's the process we show:

- DEPLOY: We start with the agent's source code in the GitHub repo and deploy it to Gensee directly using the repo url.

- TEST & ANALYZE: To evaluate the agent, Gensee automatically generates test cases customized to the agent. We can then inspect the full execution trace for each test run (including LLM and tool call inputs/outputs) and manually swap models/tools.

- METRICS: Next, we can instruct Gensee to automatically generate metrics (e.g., "does the generated plan include all requested cities?"). These metrics use LLM-as-a-Judge internally. There are also two objective metrics: dollar cost and execution latency.

- OPTIMIZE: We then select our desired metrics and run Gensee’s automated optimization process, which experiments with different models and tools to find the setup that maximizes quality, minimizes cost, or minimizes latency.

- LAUNCH & AUTOSCALE: Once we're happy with the optimized agent, Gensee provides a production-ready API endpoint that we can integrate directly into our web application. We can also download the Gensee-optimized source code and do more offline tuning. Once launched, the agent will be autoscaled on Gensee as requests arrive. Gensee is the only entity to pay, as Gensee internally covers all model and tool call costs.

We are trying to build the "AgentOps" tooling that we hope can be useful to all agent developers and beyond.

We would be grateful for the community's honest feedback!

You can try it here: https://platform.gensee.ai. We're providing $10 in FREE credits every month. Thanks for checking it out!

EVs Are a Failed Experiment

MemAlign: Building Better LLM Judges from Human Feedback with Scalable Memory

CCC (Claude's C Compiler) on Compiler Explorer

Homeland Security Spying on Reddit Users

Actors with Tokio (2021)

Can graph neural networks for biology realistically run on edge devices?

Deeper into the shareing of one air conditioner for 2 rooms

Weatherman introduces fruit-based authentication system to combat deep fakes

Why Embedded Models Must Hallucinate: A Boundary Theory (RCC)

A Curated List of ML System Design Case Studies

Pony Alpha: New free 200K context model for coding, reasoning and roleplay

Show HN: Tunbot – Discord bot for temporary Cloudflare tunnels behind CGNAT

Open Problems in Mechanistic Interpretability

Bye Bye Humanity: The Potential AMOC Collapse

Dexter: Claude-Code-Style Agent for Financial Statements and Valuation

Digital Iris [video]

Essential CDN: The CDN that lets you do more than JavaScript

They Hijacked Our Tech [video]

Vouch

HRL Labs in Malibu laying off 1/3 of their workforce

Show HN: High-performance bidirectional list for React, React Native, and Vue

Show HN: I built a Mac screen recorder Recap.Studio

Ask HN: Codex 5.3 broke toolcalls? Opus 4.6 ignores instructions?

Vectors and HNSW for Dummies

Sanskrit AI beats CleanRL SOTA by 125%

'Washington Post' CEO resigns after going AWOL during job cuts

Claude Opus 4.6 Fast Mode: 2.5× faster, ~6× more expensive

TSMC to produce 3-nanometer chips in Japan

Quantization-Aware Distillation

List of Musical Genres