Update MLX model patterns and reduce max_tokens in eval script #205

codelion · 2025-06-30T06:44:34Z

Added '-mlx-' to the list of MLX model patterns in should_use_mlx for broader matching. Reduced max_tokens from 32768 to 8192 in get_llm_response within eval_math500_benchmark.py to limit token usage.

Update version number in __init__.py and setup.py to 0.1.18 for new release.

codelion added 2 commits June 30, 2025 14:43

Update MLX model patterns and reduce max_tokens in eval script

61d0b82

Added '-mlx-' to the list of MLX model patterns in should_use_mlx for broader matching. Reduced max_tokens from 32768 to 8192 in get_llm_response within eval_math500_benchmark.py to limit token usage.

Bump version to 0.1.18

0a6bc20

Update version number in __init__.py and setup.py to 0.1.18 for new release.

codelion merged commit 50f5f7a into main Jun 30, 2025
1 check passed

codelion deleted the fix-mlx-model-id branch June 30, 2025 06:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update MLX model patterns and reduce max_tokens in eval script #205

Update MLX model patterns and reduce max_tokens in eval script #205

Uh oh!

codelion commented Jun 30, 2025

Uh oh!

Uh oh!

Uh oh!

Update MLX model patterns and reduce max_tokens in eval script #205

Update MLX model patterns and reduce max_tokens in eval script #205

Uh oh!

Conversation

codelion commented Jun 30, 2025

Uh oh!

Uh oh!

Uh oh!