In this video I will show you how to install ONNX Runtime with GPU support and run inference with a generative model. We will use Phi-3-mini-4k quantized to int4.
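As a preview of the inference step, here is a minimal sketch. It assumes the GPU build of the ONNX Runtime generate() API package is installed (`pip install onnxruntime-genai-cuda`, alongside `onnxruntime-gpu`) and that an int4 ONNX export of Phi-3-mini-4k (for example from the `microsoft/Phi-3-mini-4k-instruct-onnx` repository) has already been downloaded locally; the exact `onnxruntime_genai` calls and the folder name are assumptions and may differ between library versions:

```python
def generate(model_dir: str, prompt: str, max_length: int = 256) -> str:
    """Greedy generation with the onnxruntime-genai generate() API (sketch).

    model_dir is assumed to point at a local int4 ONNX export of
    Phi-3-mini-4k; adjust the path to wherever you downloaded the model.
    """
    # Deferred import so this sketch can be read/loaded without the package.
    import onnxruntime_genai as og

    model = og.Model(model_dir)
    tokenizer = og.Tokenizer(model)

    # Phi-3 chat template (assumed; check the model card for the exact format).
    full_prompt = f"<|user|>\n{prompt} <|end|>\n<|assistant|>"

    params = og.GeneratorParams(model)
    params.set_search_options(max_length=max_length)

    generator = og.Generator(model, params)
    generator.append_tokens(tokenizer.encode(full_prompt))

    # Token-by-token loop; stops at end-of-sequence or max_length.
    while not generator.is_done():
        generator.generate_next_token()

    return tokenizer.decode(generator.get_sequence(0))
```

A call might look like `generate("./cuda-int4-rtn-block-32", "What is ONNX Runtime?")`, where the directory name is just an example of how the int4 CUDA export is laid out on disk.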
After that, we will convert the original Phi-3-mini-128k into an int4 quantized version with ONNX Runtime.
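The conversion step can be done with the model builder that ships with the onnxruntime-genai package. A sketch of the command, assuming the package is installed and you have access to the Hugging Face weights; the flags and output paths shown here are examples and may vary between versions:

```shell
# Download microsoft/Phi-3-mini-128k-instruct and export it as an
# int4-quantized ONNX model targeting the CUDA execution provider.
# -p selects the precision, -e the execution provider, -c a cache directory.
python -m onnxruntime_genai.models.builder \
    -m microsoft/Phi-3-mini-128k-instruct \
    -o ./phi3-mini-128k-int4 \
    -p int4 \
    -e cuda \
    -c ./hf_cache
```

The resulting output directory can then be passed to the same generate() API used for inference above.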