<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>AI Accelerator on wiredwisdom</title><link>https://wiredwisdom.netlify.app/categories/ai-accelerator/</link><description>Recent content in AI Accelerator on wiredwisdom</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>© 2026</copyright><lastBuildDate>Tue, 31 Mar 2026 08:00:00 +0000</lastBuildDate><atom:link href="https://wiredwisdom.netlify.app/categories/ai-accelerator/index.xml" rel="self" type="application/rss+xml"/><item><title>Pruning for Large Language Models — From SparseGPT to KV-Cache Pruning</title><link>https://wiredwisdom.netlify.app/posts/pruning-for-llms/</link><pubDate>Tue, 31 Mar 2026 08:00:00 +0000</pubDate><guid>https://wiredwisdom.netlify.app/posts/pruning-for-llms/</guid><description/></item><item><title>Advanced Pruning Methods for Deep Neural Networks</title><link>https://wiredwisdom.netlify.app/posts/pruning-advanced-methods/</link><pubDate>Tue, 31 Mar 2026 07:00:00 +0000</pubDate><guid>https://wiredwisdom.netlify.app/posts/pruning-advanced-methods/</guid><description/></item><item><title>Structured vs Unstructured Pruning: A Complete Guide with Math, Diagrams, and Real-World Analysis</title><link>https://wiredwisdom.netlify.app/posts/pruning-structured-vs-unstructured/</link><pubDate>Tue, 31 Mar 2026 06:00:00 +0000</pubDate><guid>https://wiredwisdom.netlify.app/posts/pruning-structured-vs-unstructured/</guid><description/></item><item><title>Pruning Fundamentals: A Complete Guide to Neural Network Weight Pruning</title><link>https://wiredwisdom.netlify.app/posts/pruning-fundamentals/</link><pubDate>Tue, 31 Mar 2026 05:00:00 +0000</pubDate><guid>https://wiredwisdom.netlify.app/posts/pruning-fundamentals/</guid><description/></item><item><title>Extreme and Mixed-Precision Quantization: From FP8 to Binary Neural Networks</title><link>https://wiredwisdom.netlify.app/posts/quantization-extreme-mixed-precision/</link><pubDate>Tue, 31 Mar 2026 04:00:00 +0000</pubDate><guid>https://wiredwisdom.netlify.app/posts/quantization-extreme-mixed-precision/</guid><description/></item><item><title>Quantization-Aware Training (QAT): A Comprehensive Deep Dive</title><link>https://wiredwisdom.netlify.app/posts/quantization-qat/</link><pubDate>Tue, 31 Mar 2026 03:00:00 +0000</pubDate><guid>https://wiredwisdom.netlify.app/posts/quantization-qat/</guid><description/></item><item><title>Post-Training Quantization (PTQ): A Comprehensive Deep Dive</title><link>https://wiredwisdom.netlify.app/posts/quantization-ptq/</link><pubDate>Tue, 31 Mar 2026 02:00:00 +0000</pubDate><guid>https://wiredwisdom.netlify.app/posts/quantization-ptq/</guid><description/></item><item><title>Quantization Fundamentals for Deep Learning</title><link>https://wiredwisdom.netlify.app/posts/quantization-fundamentals/</link><pubDate>Tue, 31 Mar 2026 01:00:00 +0000</pubDate><guid>https://wiredwisdom.netlify.app/posts/quantization-fundamentals/</guid><description/></item><item><title>AI Model Optimization Techniques</title><link>https://wiredwisdom.netlify.app/posts/model-optimization/</link><pubDate>Sat, 06 Jan 2024 00:00:00 +0000</pubDate><guid>https://wiredwisdom.netlify.app/posts/model-optimization/</guid><description/></item></channel></rss>