Add a dataset or a baseline
The benchmark harness is open. The fastest way to make the comparison fairer is to add a dataset where it falls short, or a baseline it should be measured against.
What is open
The harness — the datasets registry, the per-algorithm runners, the metric definitions, and the methodology — is public and reproducible. The Ufinq engine itself is closed; it is run as one baseline among several, on the same fixed configuration and the same held-out splits as every other algorithm.
Good contributions
- A dataset with a known ground-truth law, for the symbolic-recovery metric.
- A real-world dataset that exposes a failure mode the current corpus misses.
- A new baseline runner that meets the one-declared-configuration policy.
- A correction to a metric definition or a dataset's provenance.
Contributions follow the methodology: one declared configuration per algorithm, uniform across datasets and seeds, with failures reported rather than hidden.
