Quantization-Aware Training (QAT): A Comprehensive Deep Dive
31 March 2026
AI Accelerator, Quantization, QAT, Model Compression, STE, LSQ, PACT, Binary Networks, QLoRA, Mixed Precision, TensorRT, Edge AI, Inference Optimization