← Blog/Engineering

My telegram bot not replying turned out to be a full disk, not a bad model

Priya SharmaJune 9, 2026 · 8 min read

Agent Debugging

Bot silence was disk full, not model failure

Triage Board

Telegram webhook

events in

Model endpoint

looked guilty

SQLite writes

disk full

OpenClaw state

messy after upgrade

sqlite3.OperationalError: database or disk is full

agent stalled after upgrade state replay

Disk Usage

SQLite blocked

100%

If your telegram bot not replying suddenly starts showing vague agent errors, check storage and local state before you blame GPT-5 or Claude. In one OpenClaw v2026.6.1 failure, the visible symptom was "assistant turn failed before producing content," but the real cause was `ENOSPC: no space left on device, write`.

If your telegram bot not replying suddenly starts showing vague agent errors, check storage and local state before you blame GPT-5 or Claude. In one OpenClaw v2026.6.1 failure, the visible symptom was "assistant turn failed before producing content," but the real cause was ENOSPC: no space left on device, write.

I love how fast we all jump to the glamorous explanation.

Agent stopped answering in Telegram? Must be GPT-5. Provider got flaky. Prompt degraded. Context window bug. Maybe Claude is having a weird day. Maybe your OpenAI API alternative is routing badly. Maybe Mercury is in retrograde.

And then you open the logs and the machine is just... out of disk.

That was the part that got me while researching OpenClaw issues this week. I came across a thread on r/openclaw where someone upgraded to OpenClaw 2026.6.1 and suddenly got no replies in Telegram or the TUI. On the surface, it looked exactly like the kind of issue people blame on the model layer.

The TUI showed repeated "[assistant turn failed before producing content]" messages. The session status line showed openai/gpt-5.5, a 272k context window, and a local endpoint at ws://127.0.0.1:18789. If you only looked at the visible behavior, you'd absolutely start fiddling with prompts or swapping providers.

But the real error was much dumber:

openclaw tui - ws://127.0.0.1:18789 - agent main - session main

run error: ENOSPC: no space left on device, write

That is not a model failure. That is your agent face-planting into the storage layer before it can say a word.

And honestly? This is exactly why so many teams lose hours debugging the wrong thing.

The model gets blamed because the symptoms look guilty

This is the trap.

When a Telegram agent stops replying, you don't see "disk usage at 100%." You see silence. Or a half-rendered TUI. Or a generic execution failure. In OpenClaw, that can look like the assistant never got its turn out.

So people do what smart builders always do first: inspect prompts, rotate providers, lower temperature, retry with GPT-5, retry with Claude Opus 4.6, maybe test Qwen or Llama just to isolate the vendor.

Reasonable move. Wrong first move.

Because agents that run all day accumulate junk in slow motion. Sessions grow. Logs grow. Memory grows. Plugin state grows. SQLite files grow. Telegram history grows. TUI artifacts grow. If you're running an always-on bot on a VPS, a Raspberry Pi, or some tiny cloud instance you forgot about three months ago, storage becomes a time bomb.

One commenter in that same r/openclaw discussion said it plainly: "Same problem here. My open claw backed up and ran out of space." The original poster replied that their sessions had basically eaten through storage.

That's the story right there. Not "the model got worse." Not "Telegram broke." Your agent just kept living until the floor gave out.

But disk wasn't even the whole story.

What if freeing space still doesn't fix it?

This is where things get more annoying.

In that same OpenClaw failure, storage exhaustion was the smoking gun, but there were also upgrade warnings that pointed to a second class of problem: state and migration damage.

The thread included warnings about legacy migration behavior and conflicting plugin metadata in SQLite, including this gem:

Left plugin install index in place because shared SQLite state has conflicting plugin install metadata for: codex

That line matters.

It suggests the upgrade didn't just expose a low-disk condition. It may also have left OpenClaw in a half-migrated state where plugin metadata, shared SQLite state, or install indexes no longer lined up cleanly. That's exactly the kind of issue that makes Telegram and TUI failures look random.

You clean up disk. Restart. Still broken.

Now you think, "Aha, so it really was GPT-5.5."

Nope. Could still be local state.

This is the operational version of a horror movie where you think you killed the monster in the first act.

Did the update break the agent, or did the update expose old mess?

While reading more, I found another r/openclaw thread about v2026.6.1 that made the pattern even clearer.

One user explained:

"In recent versions, OC moved from bundled providers to plugins, so the configuration is slightly different and you need to install the plugins. OpenClaw doctor fixes it."

That is such an important sentence because it moves the conversation out of prompt theater and back into operations.

Three boring failure modes that look like one dramatic one

Failure source	What it usually looks like
Storage exhaustion (ENOSPC)	Assistant fails before producing content; shows up in local runtime or TUI logs; fix is freeing disk and checking session/log growth
Plugin/provider migration after update	Telegram or TUI breaks right after a version bump; shows up in doctor warnings, migration warnings, or missing provider plugins; fix is running OpenClaw doctor and reinstalling/configuring plugins
Model/context misconfiguration	Errors like "context too large" or execution failures; shows up in provider settings or model config; fix is setting the correct context size and verifying config after update

That's the practical lesson.

Not "it's always disk." Not "upgrades always break plugins." And definitely not "just rewrite the prompt."

It's this: check the machine, the state, and the migration path before you start blaming the model.

The update thread had even more clues

The v2026.6.1 discussion wasn't just one weird plugin issue.

People also reported:

"channel drop outs for specific agents"
compaction running 6 times in a row on an active session
reverting back to 5.20 because the newer build felt unstable
provider config confusion caused by the move from bundled providers to plugins

None of that sounds like "GPT-5 forgot how to answer Telegram." It sounds like a local runtime that changed under load.

And yes, if you're using an OpenAI API alternative behind OpenClaw, this matters even more, because the temptation is to blame the upstream model router first. Sometimes that's fair. A lot of the time, the fire started on your box.

So what should you check first when Telegram goes silent?

Here's the order I wish more teams used.

Check disk space immediately
- Look for full volumes, swollen logs, huge session stores, or runaway SQLite files.
- If the error includes ENOSPC, stop there and fix storage before touching prompts.
Run OpenClaw doctor
- Especially after upgrading to 2026.6.1 or later.
- The provider architecture changed, and missing plugins can make healthy models look dead.
Inspect migration and plugin warnings
- Search for SQLite conflicts, legacy migration messages, and plugin install metadata issues.
- If you see warnings around codex or provider plugins, assume state integrity is questionable.
Verify provider and model config
- Make sure your installed provider plugins actually match your config.
- Confirm context-size registration after updates.
Only then test prompts and model selection
- If the machine is healthy and state is clean, now it makes sense to compare GPT-5, Claude Opus 4.6, Grok, Qwen, or Llama variants.

That order sounds boring because it is boring. That's why people skip it.

But what if it really is the model?

Sometimes it is.

The same OpenClaw update thread included reports of "context too large" errors after updating. A commenter said it came from a bug in how context size was read or registered for the model, and that setting a smaller or correct context size in config could work around it.

So yes, model configuration can absolutely be the actual problem.

But notice the difference: that's still not a prompt-quality issue. It's not "Claude got worse" or "GPT-5 became unreliable." It's a config mismatch between what OpenClaw thinks the model can handle and what the provider setup actually supports.

That distinction matters.

Too many agent teams treat every failure as a model intelligence problem. In practice, a huge share of failures are plumbing: disk, state, migrations, sockets, plugin installs, bad context registration, stale credentials.

The smarter your stack gets, the more humiliating the outages become.

The unsexy rule that saves the most time

If your Telegram bot vanishes right after an upgrade, assume operations first, models second.

Especially if you're running long-lived agents in OpenClaw, n8n, Make, Zapier, or a custom TUI loop. Persistent memory is useful. Persistent sessions are useful. Shared SQLite state is useful. Until they quietly become the reason nothing replies.

The weirdly comforting thing about the OpenClaw threads is that they show the truth in public. One build identifier, 2e08f0f. One local websocket endpoint, ws://127.0.0.1:18789. One failing session on openai/gpt-5.5. One brutally ordinary runtime error: no space left on device.

That's not glamorous. It's better than glamorous.

It's fixable.

And the next time your telegram bot not replying sends you hunting through prompts, temperature settings, and provider dashboards, do yourself a favor: check the disk first. Then check the plugin state. Then check the migration warnings.

Because if your agent can't write to disk, GPT-5 never even gets a chance to be wrong.

Frequently Asked Questions

Why is my Telegram bot not replying even though the model looks fine?

The model may not be the problem at all. In OpenClaw, a bot can stop replying because of local runtime issues like full disk storage, broken plugin state, or failed migrations, which block the agent before it can return a response.

What does ENOSPC mean in OpenClaw?

ENOSPC means "no space left on device." If OpenClaw throws `ENOSPC: no space left on device, write`, the agent is failing at the storage layer, so freeing disk space should come before prompt debugging or provider switching.

Can an OpenClaw update break Telegram or TUI agents without changing prompts?

Yes. In OpenClaw v2026.6.1, users reported issues caused by the move from bundled providers to plugins, missing plugin installs, migration warnings, and channel dropouts, all of which can break agent behavior without any prompt changes.

What should I check first after upgrading OpenClaw and my agent stops working?

Start with disk usage, then run OpenClaw doctor, then inspect migration and plugin warnings. After that, verify provider configuration and model context-size settings before assuming GPT-5, Claude, or another model is at fault.

Will freeing disk space fully fix a broken OpenClaw agent?

Not always. If the outage also involved migration conflicts or corrupted plugin metadata in shared SQLite state, you may need to repair plugin installs, rerun doctor, or clean up state after restoring available storage.