Packages

Model evaluation harness for standardized benchmarking with semantic similarity, exact match, and custom metrics.

Current section

1 Dependant

Jump to

Packages depending on eval_ex

1 package
  • Comprehensive benchmarking framework for AI research. Measures latency, throughput, cost, and reliability with percentile analysis and Nx numerical computations.

    Updated 4 months ago

    97
    recent downloads
1 package of 1 total

Checksum

Dependency Config

mix.exs

rebar.config

Gleam

erlang.mk

Package Details

Downloads Last 30 days, all versions
0 5 10 15 20

this version

81

yesterday

0

last 7 days

2

all time

396

Last Updated

Dec 29, 2025

License

MIT

Build Tools

mix

Publisher

nshkrdotcom nshkrdotcom

Links