Public benchmark set

Prompt compression with deliberate passthrough, measured end to end.

This page uses a curated launch benchmark slice from the broader Tokelang Lite evaluation corpus: strong compaction examples across incident response, legal, sales, analytics, research, and operations, plus two deliberate passthrough cases where exact wording should stay intact.

Curated prompts
10
Public launch set selected from the larger internal benchmark corpus.
Compiled savings
44.44%
Weighted token savings on the prompts that compiled to compact Tokelang output.
Average latency
11.63 ms
Client-measured end-to-end HTTPS latency from the production droplet.
Mode split
8 / 2
Strong compiled examples plus two exactness-sensitive passthrough examples.
Method

POST /v1/compile with prompt only. Token counts use tiktoken cl100k_base, and latency uses Client-measured end-to-end HTTPS latency to the live public deployment from the production droplet.

Why passthrough exists

Passthrough is not a miss. It is the compiler choosing fidelity over compression for code, exact math, row-heavy inputs, and contract-sensitive prompts.

Sample method

Each prompt was measured with 1 warmup + 5 measured requests. The table below shows average latency and p95 latency per prompt.

Representative examples

What the benchmark set actually contains

These examples are selected to show both sides of the product: compaction on stronger workflow-style prompts, and passthrough when exact wording should not be compressed.

pb-06tokelangsalesextraction-plus-strategy

Sales Follow-Up Strategy

34 to 15 tokens, 55.88% savings, 11.86 ms average latency.

Original prompt
Turn the sales call notes into follow-up actions.

Extract buyer pain points, timeline, budget signals, procurement blockers, and return a short account strategy memo for the account executive.
Compiled output
process
analyze buyer pain points timeline budget signals procurement blockers account strategy memo executive brief
pb-08tokelanglegaldecision-tree

Contract Exception Memo

48 to 26 tokens, 45.83% savings, 11.69 ms average latency.

Original prompt
Contract review workflow.

Step 1: Read the payment terms.
Step 2: If the side letter changes payment timing, surface the exception.
Step 3: Otherwise compare warranty and indemnity language.
Step 4: Return a short legal memo for procurement.
Compiled output
process
default simple
analyze payment terms
analyze side letter changes payment timing surface exception
else compare warranty indemnity language
output
return legal memo for procurement brief
pb-17passthroughtranslationcontract-sensitive

Exact Translation

25 to 25 tokens, 0.00% savings, 10.00 ms average latency.

Original prompt
Translate this customer message into German.

Keep the tone polite, preserve the order number exactly, and return only the translated text.
Returned output
Translate this customer message into German.

Keep the tone polite, preserve the order number exactly, and return only the translated text.
pb-21passthroughmathexactness-sensitive

Exact Symbolic Math

37 to 37 tokens, 0.00% savings, 10.28 ms average latency.

Original prompt
Solve this exactly.

Let f(x) = x^4 - 6x^2 + 8x - 3.
Find all critical points, classify them, and keep the symbolic derivation explicit.
Returned output
Solve this exactly.

Let f(x) = x^4 - 6x^2 + 8x - 3.
Find all critical points, classify them, and keep the symbolic derivation explicit.
Full table

Curated launch benchmark prompts

This public table keeps the cleaner launch-facing examples and two deliberate passthrough cases. Expected mode reflects the benchmark design intent. Actual mode reflects what the current benchmarked build returned during measurement.

PromptExpectedActualOriginalOutputSavingsAvg latencyp95 latency
Sales Follow-Up Strategy
pb-06 · sales
tokelangtokelang341555.88%11.86 ms12.76 ms
Security Alert Triage
pb-14 · security
tokelangtokelang401855.00%11.36 ms12.10 ms
User Research Themes
pb-15 · user-research
tokelangtokelang341652.94%10.81 ms11.59 ms
Contract Exception Memo
pb-08 · legal
tokelangtokelang482645.83%11.69 ms12.05 ms
Grant Proposal Review
pb-16 · grants
tokelangtokelang271640.74%18.60 ms39.94 ms
Exec Meeting Summary
pb-04 · operations
tokelangtokelang291837.93%11.20 ms11.90 ms
Incident Response Narrative
pb-02 · incident-response
tokelangtokelang523336.54%10.24 ms10.79 ms
Lesson Plan
pb-10 · education
tokelangtokelang332330.30%10.26 ms10.48 ms
Exact Translation
pb-17 · translation
passthroughpassthrough25250.00%10.00 ms10.50 ms
Exact Symbolic Math
pb-21 · math
passthroughpassthrough37370.00%10.28 ms11.46 ms