Publicly submitted performance data for various models and devices by users of Layla. All benchmarks are run on-device under real-world conditions. All data is submitted willingly by users and not collected automatically.
| Model Name | Device Model | Decode (t/s) | Prefill (t/s) | RAM Usage | Hardware | Date |
|---|