Commit Graph

12 Commits

Author             SHA1        Message                                                      Date
Johann Schopplich  2c4f3c4362  test: add benchmarks for compact vs. pretty JSON             2025-10-30 15:02:51 +01:00
Johann Schopplich  ecf578a7dc  text(accuracy): add Grok-4-fast, remove default temperature  2025-10-28 22:54:00 +01:00
Johann Schopplich  67c0df8cb0  docs: overhaul retrieval accuracy benchmark                  2025-10-28 20:22:43 +01:00
Johann Schopplich  52dc9c4b3f  docs: clarify retrieval accuracy metrics                     2025-10-28 08:39:43 +01:00
Johann Schopplich  352e936370  docs: update notes & limitations guide                       2025-10-28 07:44:35 +01:00
Johann Schopplich  b839d35ad0  docs: how the benchmarks work section                        2025-10-27 20:35:43 +01:00
Johann Schopplich  7b76acde31  docs: add benchmarks for gemini-2.5-flash                    2025-10-27 16:02:51 +01:00
Johann Schopplich  b9f54ba585  docs: update benchmark reports' readability                  2025-10-27 14:18:37 +01:00
Johann Schopplich  05b3d43023  test: refactor accuracy benchmark generation                 2025-10-27 14:07:20 +01:00
Johann Schopplich  1a5e6199ac  test: update retrieval accuracy benchmarks                   2025-10-27 13:45:48 +01:00
Johann Schopplich  b2c58d2b97  chore: fix linting issues                                    2025-10-27 11:49:40 +01:00
Johann Schopplich  3c840259fe  test: add LLM retrieval accuracy tests                       2025-10-27 11:48:33 +01:00