Commit Graph

17 Commits

Author SHA1 Message Date
Johann Schopplich
bc711ccecf test(benchmark): overhaul generation 2025-11-06 14:45:44 +01:00
Johann Schopplich
af17efe128 docs: add accuracy per 1k tokens report (closes #72) 2025-11-05 08:21:57 +01:00
Johann Schopplich
3472081b40 docs: clarify CSV vs TOON use cases 2025-11-04 18:12:19 +01:00
Johann Schopplich
5f09a14c61 chore: fix type issues 2025-11-01 17:15:37 +01:00
Johann Schopplich
fb43bdf527 docs: adjust padding for benchmark comparison 2025-10-30 15:19:16 +01:00
Johann Schopplich
2c4f3c4362 test: add benchmarks for compact vs. pretty JSON 2025-10-30 15:02:51 +01:00
Johann Schopplich
7db91398fe docs(benchmark): add YAML format support 2025-10-29 06:42:40 +01:00
Johann Schopplich
67c0df8cb0 docs: overhaul retrieval accuracy benchmark 2025-10-28 20:22:43 +01:00
Johann Schopplich
352e936370 docs: update notes & limitations guide 2025-10-28 07:44:35 +01:00
Johann Schopplich
8b9924ff05 refactor: token efficiency benchmark code 2025-10-28 07:42:49 +01:00
Johann Schopplich
4ec7e84f5f refactor: shared utils for benchmark scripts 2025-10-27 17:37:27 +01:00
Johann Schopplich
7b76acde31 docs: add benchmarks for gemini-2.5-flash 2025-10-27 16:02:51 +01:00
Johann Schopplich
77696ce932 docs: benchmarks for XML format 2025-10-27 14:50:26 +01:00
Johann Schopplich
b9f54ba585 docs: update benchmark reports' readability 2025-10-27 14:18:37 +01:00
Johann Schopplich
05b3d43023 test: refactor accuracy benchmark generation 2025-10-27 14:07:20 +01:00
Johann Schopplich
1a5e6199ac test: update retrieval accuracy benchmarks 2025-10-27 13:45:48 +01:00
Johann Schopplich
3c840259fe test: add LLM retrieval accuracy tests 2025-10-27 11:48:33 +01:00