NestQuant: Post-Training Integer-Nesting Quantization for On-Device DNN [IEEE TMC 2025]
-
Updated
Jul 6, 2025 - Python
NestQuant: Post-Training Integer-Nesting Quantization for On-Device DNN [IEEE TMC 2025]
Implementation of "Low-Cost and Effective Fault-Tolerance Enhancement Techniques for Emerging Memories-Based Deep Neural Networks." 2021 58th ACM/IEEE Design Automation Conference (DAC).
RAG with Binary Quantization for enhanced performance
Batched QLoRA fine-tuning of FLAN-T5-Large for three-way stance classification, with systematic evaluation of clustering, embedding probes, and full model inference
Add a description, image, and links to the quantizations topic page so that developers can more easily learn about it.
To associate your repository with the quantizations topic, visit your repo's landing page and select "manage topics."