"A timeout occurred. Error code 524. Visit cloudflare.com for more information. api.novita.ai: Host Error. What happened? The origin web server timed out responding to this request."
In other words: the model is new and smart, it thinks for a long time, the context is huge, the server doesn't respond for ages, and our beloved proxy Cloudflare (which has decided to replace the entire Internet) happily kills the connection. (Yes, the request body contains "stream": true.)
Tell me (this is a rhetorical question): who exactly decided that in the OpenAI protocol streaming is the only way to keep a long request alive? Which FAANG genius was that? In my opinion, long-running requests to a server should work like this:
1. The client sends a request and immediately receives a task identifier.
2. The client polls the server for the task status and reads the response (buffered on the server) in chunks.
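The submit-then-poll pattern can be sketched as an in-process toy. Everything here is hypothetical: in a real service `submit_task` and `poll_task` would be HTTP endpoints (say, `POST /v1/tasks` returning a task id, and `GET /v1/tasks/{id}?cursor=N` returning buffered chunks), and the worker thread stands in for slow model inference.

```python
import threading
import time
import uuid

# In-memory task store; a real server would use a database or a cache.
_tasks: dict[str, dict] = {}

def submit_task(prompt: str) -> str:
    """Kick off long-running work in the background, return a task id at once."""
    task_id = uuid.uuid4().hex
    _tasks[task_id] = {"chunks": [], "done": False}

    def worker() -> None:
        for word in prompt.split():        # stand-in for slow model inference
            time.sleep(0.01)
            _tasks[task_id]["chunks"].append(word)
        _tasks[task_id]["done"] = True     # set only after the last chunk

    threading.Thread(target=worker, daemon=True).start()
    return task_id

def poll_task(task_id: str, cursor: int) -> tuple[list[str], int, bool]:
    """Return chunks buffered since `cursor`, the new cursor, and a done flag."""
    t = _tasks[task_id]
    done = t["done"]                 # read the flag first so the tail isn't lost
    chunks = t["chunks"][cursor:]
    return chunks, cursor + len(chunks), done

# Client side: many short, independent requests instead of one fragile
# long-lived connection. No proxy timeout can kill a 50 ms poll.
tid = submit_task("the quick brown fox")
cursor, out, done = 0, [], False
while not done:
    chunks, cursor, done = poll_task(tid, cursor)
    out.extend(chunks)
    time.sleep(0.02)
print(" ".join(out))
```

Note that each poll is a fresh, short request, so it sails through any idle-timeout proxy; the only state that must survive between polls is the task id and the cursor.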
Is that hard to implement? Does it require ten interview rounds? Why is it that my boring enterprise API, when working with, shall we say, leisurely third-party services, worked exactly the way described above? And why didn't these hyper-smart AI people (the world experts in high-speed matrix multiplication) do the same?
Tell me, what do you think: how exactly does Novita expect people to use long-thinking models if their proxy has a 60-second timeout?
After all, the SSH protocol has special empty keep-alive packets to prevent timeout disconnects. TCP has keep-alive packets.
Windows 2000 already supported SIO_KEEPALIVE_VALS for sockets. That was twenty-five years ago. Supported, as you can see, by Windows (the system that all real hackers despise), and modern data scientists despise it too (they have MacBooks, blue hair, and were born in 1999).
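For the record, the knob in question is still there, and Linux exposes the same three parameters as socket options. A minimal sketch (the idle/interval values are arbitrary examples):

```python
import socket
import sys

def enable_keepalive(sock: socket.socket, idle_s: int = 30, interval_s: int = 10) -> None:
    """Ask the kernel to send empty keep-alive probes on an idle TCP connection."""
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE, 1)
    if sys.platform == "win32":
        # The same knob Windows 2000 shipped as SIO_KEEPALIVE_VALS:
        # (on/off, idle time in ms, probe interval in ms).
        sock.ioctl(socket.SIO_KEEPALIVE_VALS, (1, idle_s * 1000, interval_s * 1000))
    else:
        # Linux spells the same three parameters as separate TCP options.
        sock.setsockopt(socket.IPPROTO_TCP, socket.TCP_KEEPIDLE, idle_s)
        sock.setsockopt(socket.IPPROTO_TCP, socket.TCP_KEEPINTVL, interval_s)
        sock.setsockopt(socket.IPPROTO_TCP, socket.TCP_KEEPCNT, 3)

sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
enable_keepalive(sock)
print(sock.getsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE))
```

(On macOS the idle option is named `TCP_KEEPALIVE` instead of `TCP_KEEPIDLE`; the sketch ignores that for brevity.) The point stands: keeping an idle connection alive has been a solved, one-syscall problem for a quarter of a century.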
So why haven't the geniuses of the AI industry thought of keep-alive in SSE? The SSE spec even has a built-in mechanism for it: any line starting with a colon is a comment that clients must ignore. Their API could just send empty comment events so Cloudflare wouldn't kill the connection.
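Here is roughly what that would look like on the server side. This is a self-contained sketch, not any vendor's actual code: `slow_model` simulates a model that "thinks" silently before emitting its first token, and the wrapper injects SSE comment frames whenever the model has been quiet for too long.

```python
import queue
import threading
import time
from typing import Iterator

# An SSE comment frame; per the spec, clients must silently ignore it,
# but any idle-timeout proxy in the middle sees it as live traffic.
HEARTBEAT = ": keep-alive\n\n"

def sse_with_heartbeat(tokens: Iterator[str], heartbeat_s: float = 0.05) -> Iterator[str]:
    """Yield SSE data frames, inserting comment frames while the model is silent."""
    q: queue.Queue = queue.Queue()
    DONE = object()

    def pump() -> None:               # drain the (blocking) token source
        for tok in tokens:
            q.put(tok)
        q.put(DONE)

    threading.Thread(target=pump, daemon=True).start()
    while True:
        try:
            item = q.get(timeout=heartbeat_s)
        except queue.Empty:
            yield HEARTBEAT           # nothing from the model yet: send a no-op
            continue
        if item is DONE:
            return
        yield f"data: {item}\n\n"

def slow_model() -> Iterator[str]:    # stand-in for a model that thinks first
    time.sleep(0.2)
    yield "hello"

frames = list(sse_with_heartbeat(slow_model()))
print(frames)
```

With a 0.2-second silent "thinking" phase and a 0.05-second heartbeat interval, the output contains several `: keep-alive` frames followed by `data: hello`. That is the entire fix: a handful of bytes of no-op traffic, and the 524 never happens.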