Independently Published

DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Name: DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications
Brand: Independently Published
SKU: 9ff801375eac530933018ac6fd0f92c2


Bol

Prices from

30,76

Featured

	30,76
	32,15
	32,15

Description

Amazon: 288 pages, Paperback, Independently published

Compare sellers (3)

Shop	Price	Shipping	Total price
	30,76	Free	30,76
	32,15	Free	32,15
	32,15	Free	32,15


Product specifications

Brand	Independently Published
EAN	9798274507356
Size

Price history

* Price history does not include data from Amazon or Amazon Marketplace.

Prices last updated on: 24-07-2026, 01:33

Independently Published

High-Performance Inference Serving: Batching, Quantization, and Low-Latency Model Deployment.

37,44

Compare 2 shops

Independently Published

High-Performance Inference Serving: Batching, Quantization, and Low-Latency Model Deployment.

46,81

Compare 2 shops

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

15,29

Compare 3 shops

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

26,42

Compare 2 shops
