frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

EVs Are a Failed Experiment

https://spectator.org/evs-are-a-failed-experiment/
1•ArtemZ•2m ago•0 comments

MemAlign: Building Better LLM Judges from Human Feedback with Scalable Memory

https://www.databricks.com/blog/memalign-building-better-llm-judges-human-feedback-scalable-memory
1•superchink•3m ago•0 comments

CCC (Claude's C Compiler) on Compiler Explorer

https://godbolt.org/z/asjc13sa6
1•LiamPowell•4m ago•0 comments

Homeland Security Spying on Reddit Users

https://www.kenklippenstein.com/p/homeland-security-spies-on-reddit
2•duxup•7m ago•0 comments

Actors with Tokio (2021)

https://ryhl.io/blog/actors-with-tokio/
1•vinhnx•8m ago•0 comments

Can graph neural networks for biology realistically run on edge devices?

https://doi.org/10.21203/rs.3.rs-8645211/v1
1•swapinvidya•20m ago•1 comments

Deeper into the shareing of one air conditioner for 2 rooms

1•ozzysnaps•22m ago•0 comments

Weatherman introduces fruit-based authentication system to combat deep fakes

https://www.youtube.com/watch?v=5HVbZwJ9gPE
2•savrajsingh•23m ago•0 comments

Why Embedded Models Must Hallucinate: A Boundary Theory (RCC)

http://www.effacermonexistence.com/rcc-hn-1-1
1•formerOpenAI•25m ago•2 comments

A Curated List of ML System Design Case Studies

https://github.com/Engineer1999/A-Curated-List-of-ML-System-Design-Case-Studies
3•tejonutella•29m ago•0 comments

Pony Alpha: New free 200K context model for coding, reasoning and roleplay

https://ponyalpha.pro
1•qzcanoe•33m ago•1 comments

Show HN: Tunbot – Discord bot for temporary Cloudflare tunnels behind CGNAT

https://github.com/Goofygiraffe06/tunbot
1•g1raffe•36m ago•0 comments

Open Problems in Mechanistic Interpretability

https://arxiv.org/abs/2501.16496
2•vinhnx•42m ago•0 comments

Bye Bye Humanity: The Potential AMOC Collapse

https://thatjoescott.com/2026/02/03/bye-bye-humanity-the-potential-amoc-collapse/
2•rolph•46m ago•0 comments

Dexter: Claude-Code-Style Agent for Financial Statements and Valuation

https://github.com/virattt/dexter
1•Lwrless•48m ago•0 comments

Digital Iris [video]

https://www.youtube.com/watch?v=Kg_2MAgS_pE
1•vermilingua•53m ago•0 comments

Essential CDN: The CDN that lets you do more than JavaScript

https://essentialcdn.fluidity.workers.dev/
1•telui•53m ago•1 comments

They Hijacked Our Tech [video]

https://www.youtube.com/watch?v=-nJM5HvnT5k
1•cedel2k1•57m ago•0 comments

Vouch

https://twitter.com/mitchellh/status/2020252149117313349
34•chwtutha•57m ago•6 comments

HRL Labs in Malibu laying off 1/3 of their workforce

https://www.dailynews.com/2026/02/06/hrl-labs-cuts-376-jobs-in-malibu-after-losing-government-work/
4•osnium123•58m ago•1 comments

Show HN: High-performance bidirectional list for React, React Native, and Vue

https://suhaotian.github.io/broad-infinite-list/
2•jeremy_su•59m ago•0 comments

Show HN: I built a Mac screen recorder Recap.Studio

https://recap.studio/
1•fx31xo•1h ago•1 comments

Ask HN: Codex 5.3 broke toolcalls? Opus 4.6 ignores instructions?

1•kachapopopow•1h ago•0 comments

Vectors and HNSW for Dummies

https://anvitra.ai/blog/vectors-and-hnsw/
1•melvinodsa•1h ago•0 comments

Sanskrit AI beats CleanRL SOTA by 125%

https://huggingface.co/ParamTatva/sanskrit-ppo-hopper-v5/blob/main/docs/blog.md
1•prabhatkr•1h ago•1 comments

'Washington Post' CEO resigns after going AWOL during job cuts

https://www.npr.org/2026/02/07/nx-s1-5705413/washington-post-ceo-resigns-will-lewis
4•thread_id•1h ago•1 comments

Claude Opus 4.6 Fast Mode: 2.5× faster, ~6× more expensive

https://twitter.com/claudeai/status/2020207322124132504
1•geeknews•1h ago•0 comments

TSMC to produce 3-nanometer chips in Japan

https://www3.nhk.or.jp/nhkworld/en/news/20260205_B4/
3•cwwc•1h ago•0 comments

Quantization-Aware Distillation

http://ternarysearch.blogspot.com/2026/02/quantization-aware-distillation.html
2•paladin314159•1h ago•0 comments

List of Musical Genres

https://en.wikipedia.org/wiki/List_of_music_genres_and_styles
1•omosubi•1h ago•0 comments
Open in hackernews

Show HN: Optimize and launch a travel-planning AI application in minutes

https://www.gensee.ai/
4•yiyingzhang•6mo ago
We're the creators of Gensee, a platform we built to help developers quickly productionize their AI agents and workflows.

To show how Gensee works, we created a new end-to-end demo https://www.youtube.com/watch?v=AXIX9LgN4mU where we build and launch a travel planner AI application: https://demo.gensee.ai/travel-planner. The web app uses two agents: one to generate a travel plan based on user requirements built using CamelAI's multi-agent society, and another to answer follow-up questions with LLM and web search using no framework (pure Python). We've also open-sourced the travel planner app itself: https://github.com/GenseeAI/Trip-planner-demo.

Here's the process we show:

- DEPLOY: We start with the agent's source code in the GitHub repo and deploy it to Gensee directly using the repo url.

- TEST & ANALYZE: To evaluate the agent, Gensee automatically generates test cases customized to the agent. We can then inspect the full execution trace for each test run (including LLM and tool call inputs/outputs) and manually swap models/tools.

- METRICS: Next, we can instruct Gensee to automatically generate metrics (e.g., "does the generated plan include all requested cities?"). These metrics use LLM-as-a-Judge internally. There are also two objective metrics: dollar cost and execution latency.

- OPTIMIZE: We then select our desired metrics and run Gensee’s automated optimization process, which experiments with different models and tools to find the setup that maximizes quality, minimizes cost, or minimizes latency.

- LAUNCH & AUTOSCALE: Once we're happy with the optimized agent, Gensee provides a production-ready API endpoint that we can integrate directly into our web application. We can also download the Gensee-optimized source code and do more offline tuning. Once launched, the agent will be autoscaled on Gensee as requests arrive. Gensee is the only entity to pay, as Gensee internally covers all model and tool call costs.

We are trying to build the "AgentOps" tooling that we hope can be useful to all agent developers and beyond.

We would be grateful for the community's honest feedback!

You can try it here: https://platform.gensee.ai. We're providing $10 in FREE credits every month. Thanks for checking it out!