Smoothquant

Quick Overview: Large language models (LLMs) show excellent performance but are compute- and memory-intensive. Quantization can reduce ... In this video, we look into SmoothQ Algorithm and Paper: Paper: Pseudocode Open Source ... SmoothQuant - Accurate and Efficient Post-Training Quantization for Large Language Models

Smoothquant - Detailed Overview & Context

Large language models (LLMs) show excellent performance but are compute- and memory-intensive. Quantization can reduce ... In this video, we look into SmoothQ Algorithm and Paper: Paper: Pseudocode Open Source ... SmoothQuant - Accurate and Efficient Post-Training Quantization for Large Language Models Seminar date : 2024.07.05 # Seminar contents Paper Review Seminar # Paper Title Xiao, Guangxuan, et al. " Pseudo-lab (‪-lab‬ ) EfficientLLM study Presenter: 김승우 Date: 2025/09/30 Paper: 00:00 Introduction to LLM Quantization 02:15 What is Quantization? 04:45 Post-Training Quantization (PTQ) vs. QAT 07:30 GPTQ ...

Are 1-bit LLMs the future of efficient AI? Or just a catchy Microsoft metaphor? In this video, we break down BitNet, the so-called ...

Photo Gallery

SmoothQuant: Efficient & Accurate Quantization for Massive Language Models

SmoothQuant

SmoothQuant: Migrate Activation Difficulty to Weights

SmoothQuant - Accurate and Efficient Post-Training Quantization for Large Language Models

SmoothQuant : Accurate and Efficient Post Training Quantization for Large Langu

Final Presentation CS104 SmoothQuant (15 Min)

SmoothQuant : run LLM on CPU

[IDSL Paper Review] SmoothQuant

05.09.2023 SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

CS104 SmoothQuant Final Presentation

[Paper Review] SmoothQuant

LLM Quantization Explained: GPTQ, AWQ, QLoRA, GGUF and More

View Main Result

SmoothQuant: Efficient & Accurate Quantization for Massive Language Models

SmoothQuant: Efficient & Accurate Quantization for Massive Language Models

Links : Subscribe: https://www.youtube.com/@Arxflix Twitter: https://x.com/arxflix LMNT: https://lmnt.com/

SmoothQuant

SmoothQuant

Large language models (LLMs) show excellent performance but are compute- and memory-intensive. Quantization can reduce ...

SmoothQuant: Migrate Activation Difficulty to Weights

SmoothQuant: Migrate Activation Difficulty to Weights

In this video, we look into SmoothQ Algorithm and Paper: Paper: https://arxiv.org/abs/2211.10438 Pseudocode Open Source ...

SmoothQuant - Accurate and Efficient Post-Training Quantization for Large Language Models

SmoothQuant - Accurate and Efficient Post-Training Quantization for Large Language Models

SmoothQuant - Accurate and Efficient Post-Training Quantization for Large Language Models

SmoothQuant : Accurate and Efficient Post Training Quantization for Large Langu

SmoothQuant : Accurate and Efficient Post Training Quantization for Large Langu

SmoothQuant

Final Presentation CS104 SmoothQuant (15 Min)

Final Presentation CS104 SmoothQuant (15 Min)

By Marie Zhussupova.

SmoothQuant : run LLM on CPU

SmoothQuant : run LLM on CPU

SmoothQuant : run LLM on CPU

[IDSL Paper Review] SmoothQuant

[IDSL Paper Review] SmoothQuant

Seminar date : 2024.07.05 # Seminar contents Paper Review Seminar # Paper Title Xiao, Guangxuan, et al. "

05.09.2023 SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

05.09.2023 SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

https://arxiv.org/abs/2211.10438.

CS104 SmoothQuant Final Presentation

CS104 SmoothQuant Final Presentation

By Marie Zhussupova.

[Paper Review] SmoothQuant

[Paper Review] SmoothQuant

Pseudo-lab (‪@pseudo-lab‬ ) EfficientLLM study Presenter: 김승우 Date: 2025/09/30 Paper:

LLM Quantization Explained: GPTQ, AWQ, QLoRA, GGUF and More

LLM Quantization Explained: GPTQ, AWQ, QLoRA, GGUF and More

00:00 Introduction to LLM Quantization 02:15 What is Quantization? 04:45 Post-Training Quantization (PTQ) vs. QAT 07:30 GPTQ ...

Large Language Models Post Training Quantization(smoothQuant, RPTQ)

Large Language Models Post Training Quantization(smoothQuant, RPTQ)

SmoothQuant

The myth of 1-bit LLMs | Quantization-Aware Training

The myth of 1-bit LLMs | Quantization-Aware Training

Are 1-bit LLMs the future of efficient AI? Or just a catchy Microsoft metaphor? In this video, we break down BitNet, the so-called ...