LLM Inference in C++: Building High-Throughput Engines with PagedAttention and CUDA Kernels

Independently Published

Prices from

31,53

Description

Pages: 287, Hardcover, Independently published (Amazon)

Compare sellers (2)

Shop	Price	Shipping	Total price
	31,53	Free	31,53
	31,53	Free	31,53

Product specifications

Brand	Independently Published
EAN	9798259265028

High-Performance LLM Inference with TensorRT-LLM

6,05

Independently Published

Rust Programming for AI and CUDA: Master High Performance Machine Learning with...

25,03

Compare 2 shops

Independently Published

AI Performance Engineering: From GPU Kernels to LLM Inference

33,43

Compare 2 shops

Independently Published

GPU Computing with C++ and cuda for Generative AI: A Comprehensive Guide...

28,52

Compare 2 shops
