Johann Schopplich | cdd4a20c67 | refactor: benchmarks code style | 2025-10-28 08:02:57 +01:00
Johann Schopplich | 352e936370 | docs: update notes & limitations guide | 2025-10-28 07:44:35 +01:00
Johann Schopplich | 8b9924ff05 | refactor: token efficiency benchmark code | 2025-10-28 07:42:49 +01:00
Johann Schopplich | b839d35ad0 | docs: how the benchmarks work section | 2025-10-27 20:35:43 +01:00
Johann Schopplich | 4ec7e84f5f | refactor: shared utils for benchmark scripts | 2025-10-27 17:37:27 +01:00
Johann Schopplich | 7b76acde31 | docs: add benchmarks for gemini-2.5-flash | 2025-10-27 16:02:51 +01:00
Johann Schopplich | 77696ce932 | docs: benchmarks for XML format | 2025-10-27 14:50:26 +01:00
Johann Schopplich | b9f54ba585 | docs: update benchmark reports' readability | 2025-10-27 14:18:37 +01:00
Johann Schopplich | 05b3d43023 | test: refactor accuracy benchmark generation | 2025-10-27 14:07:20 +01:00
Johann Schopplich | 1a5e6199ac | test: update retrieval accuracy benchmarks | 2025-10-27 13:45:48 +01:00
Johann Schopplich | b2c58d2b97 | chore: fix linting issues | 2025-10-27 11:49:40 +01:00
Johann Schopplich | 3c840259fe | test: add LLM retrieval accuracy tests | 2025-10-27 11:48:33 +01:00