Johann Schopplich | 9a6125424c | docs: update benchmarks for v3 list item syntax | 2025-11-24 16:35:44 +01:00
Johann Schopplich | 11a089bb86 | docs: update token count | 2025-11-24 09:13:22 +01:00
Johann Schopplich | 796b333e75 | docs: fix benchmark dataset spacing (closes #196) | 2025-11-19 22:06:23 +01:00
Johann Schopplich | 0ac629a085 | docs(website): highlight benchmarks | 2025-11-18 10:14:07 +01:00
Johann Schopplich | 4b4f7c05f9 | docs: add dedicated docs website | 2025-11-18 07:23:10 +01:00
Johann Schopplich | 67169f6f9f | docs: switch benchmark order | 2025-11-09 11:38:14 +01:00
Johann Schopplich | b4655b01af | chore(benchmarks): fix CSV question count in accuracy reports | 2025-11-07 21:31:15 +01:00
Johann Schopplich | acca69c64a | chore(benchmarks): replace LLM-as-judge, new structural validation | 2025-11-07 21:28:21 +01:00
Johann Schopplich | c6ba6446f5 | chore(benchmarks): finalize structure-awareness run | 2025-11-07 10:33:46 +01:00
Johann Schopplich | 54433de930 | chore: split token efficiency benchmark into mixed/flat tracks | 2025-11-06 22:17:18 +01:00
Johann Schopplich | af17efe128 | docs: add accuracy per 1k tokens report (closes #72) | 2025-11-05 08:21:57 +01:00
Johann Schopplich | fb43bdf527 | docs: adjust padding for benchmark comparison | 2025-10-30 15:19:16 +01:00
Johann Schopplich | 2c4f3c4362 | test: add benchmarks for compact vs. pretty JSON | 2025-10-30 15:02:51 +01:00
Johann Schopplich | 38ea864763 | docs: clarify TOON's advantages and optimal data structure | 2025-10-29 19:04:04 +01:00
Johann Schopplich | 45604b06e8 | feat: decode method (#10) | 2025-10-29 07:42:15 +01:00
Johann Schopplich | e757746351 | docs(accuracy): highlight toon in perf table | 2025-10-28 23:08:47 +01:00
Johann Schopplich | ecf578a7dc | text(accuracy): add Grok-4-fast, remove default temperature | 2025-10-28 22:54:00 +01:00
Johann Schopplich | 67c0df8cb0 | docs: overhaul retrieval accuracy benchmark | 2025-10-28 20:22:43 +01:00