Edit model card

microsoft/Phi-3.5-mini-instruct, UQFF quantization

Run with mistral.rs. Documentation: UQFF docs.

  1. Flexible ๐ŸŒ€: Multiple quantization formats in one file format with one framework to run them all.
  2. Reliable ๐Ÿ”’: Compatibility ensured with embedded and checked semantic versioning information from day 1.
  3. Easy ๐Ÿค—: Download UQFF models easily and quickly from Hugging Face, or use a local file.
  4. Customizable ๐Ÿ› ๏ธ: Make and publish your own UQFF files in minutes.

Files

Name Quantization type(s) Example
phi3.5-mini-instruct-hqq4.uqff HQQ4 ./mistralrs-server -i plain -m microsoft/Phi-3.5-mini-instruct --from-uqff EricB/Phi-3.5-mini-instruct-UQFF/phi3.5-mini-instruct-hqq4.uqff
phi3.5-mini-instruct-hqq8.uqff HQQ8 ./mistralrs-server -i plain -m microsoft/Phi-3.5-mini-instruct --from-uqff EricB/Phi-3.5-mini-instruct-UQFF/phi3.5-mini-instruct-hqq8.uqff
phi3.5-mini-instruct-q4k.uqff Q4K ./mistralrs-server -i plain -m microsoft/Phi-3.5-mini-instruct --from-uqff EricB/Phi-3.5-mini-instruct-UQFF/phi3.5-mini-instruct-q4k.uqff
phi3.5-mini-instruct-q5k.uqff Q5K ./mistralrs-server -i plain -m microsoft/Phi-3.5-mini-instruct --from-uqff EricB/Phi-3.5-mini-instruct-UQFF/phi3.5-mini-instruct-q5k.uqff
phi3.5-mini-instruct-q6k.uqff Q6K ./mistralrs-server -i plain -m microsoft/Phi-3.5-mini-instruct --from-uqff EricB/Phi-3.5-mini-instruct-UQFF/phi3.5-mini-instruct-q6k.uqff
phi3.5-mini-instruct-q8_0.uqff Q8_0 ./mistralrs-server -i plain -m microsoft/Phi-3.5-mini-instruct --from-uqff EricB/Phi-3.5-mini-instruct-UQFF/phi3.5-mini-instruct-q8_0.uqff
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .

Model tree for EricB/Phi-3.5-mini-instruct-UQFF

Quantized
this model