Paste any LLM streaming log → replay it at 1× or 5×, see first-token latency, inter-token deltas, throughput.
Paste streaming log
Replay output
0 / 0 chunksFirst-token latency
—
Time from request to first chunk
As-is, no warranty. These apps are free under their listed license and run entirely in your browser. Use at your own risk — don't blame me if your PC catches fire, your dog runs away, or the math turns out wrong. Verify anything that actually matters. None of this is professional financial, medical, legal, or engineering advice.
Paste a streaming log from any LLM API and this tool:
| Format | Example |
|---|---|
| OpenAI SSE | data: {"choices":[{"delta":{"content":"hi"}}]} |
| Anthropic SSE | data: {"type":"content_block_delta","delta":{"text":"hi"}} |
| Bedrock event-stream | {"bytes":"<base64>"} (auto-decoded best-effort) |
| Bracketed timestamp | [12.456] {"delta":{"text":"hi"}} |
| Generic ndjson | {"text":"hi","ts":420} |
The parser also handles _ts / ts / timestamp / t fields when present. If no timestamp is in the log, it falls back to 50ms-per-chunk spacing so you can still see relative ordering.
For informational purposes only. Not financial, medical, or legal advice. You are solely responsible for how you use these tools.