LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 ... Speculative Decoding, Cost Optimization

Independently Published
LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 ... Speculative Decoding, Cost Optimization

1/1

Afbeelding van LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 ... Speculative Decoding, Cost Optimization

Prijzen vanaf

9,27

Uitgelicht

	9,27	Naar shop
	9,27	Naar shop

VERGELIJK ALLE AANBIEDERS (2)

Beschrijving

Amazon Pagina's: 82, Paperback, Independently published

Lees meer

Vergelijk aanbieders (2)

Shop

Prijs

Verzendkosten

Totale prijs

9,27

Gratis

9,27

Naar shop

Gratis

9,27

Gratis

9,27

Naar shop

Gratis

Beschrijving (1)

Pagina's: 82, Paperback, Independently published

Lees meer

Productspecificaties

Merk	Independently Published
EAN	9798180985187

Prijzen voor het laatst bijgewerkt op: 14-06-2026, 22:44

Independently Published

AI Inference Optimization Engineering: Quantization, Speculative Decoding, and Hardware-Specific LLM Deployment

Vergelijk 2 shops 2 shops

Independently Published

THE LLM ECONOMIST: HIGH THROUGHPUT SERVING and GPU EFFICIENCY: A Systemic Blueprint...

Vergelijk 2 shops 2 shops

Independently Published

THE LLM ECONOMIST: HIGH THROUGHPUT SERVING and GPU EFFICIENCY: A Systemic Blueprint...

Vergelijk 2 shops 2 shops

Independently Published

Local LLM Inference Optimization: A Comprehensive Guide to Quantization, Hardware Acceleration, and...

Vergelijk 2 shops 2 shops

Uitgelichte Keuze

9,27

Naar shop