
Red Hat AI Inference Server Technical Overview (AI010)


1 Day

CZ

0 €

Course Dates

Date: Free Course
Availability: Guaranteed
Price: 0 EUR
Location: Online


Course Description

Gain essential insights into AI deployment with this Red Hat AI Inference Server technical overview. Learn how to address the complexities and costs of running AI models in production. Discover how Red Hat's solution, powered by vLLM, optimizes performance and delivers significant cost savings across cloud, on-premises, virtualized, and edge environments. Dive into advanced techniques like quantization and speculative decoding to enhance your AI inference capabilities. This on-demand video content demonstrates seamless model deployment and management within OpenShift AI, showcasing how you can achieve unparalleled efficiency and flexibility for your AI workloads.

Course summary

  • What is Inference?
  • Challenges with Inference
  • Red Hat AI Inference Server Solution
  • Red Hat AI Portfolio Integration
  • Flexibility of Deployment
  • LLM Compression Tool (Quantization)
  • Performance Optimization Techniques (KV Cache, Speculative Decoding, Tensor Parallel Inference)
  • Case Studies
  • Model Deployment and Management
  • Storage Connections for Models
  • Metrics and Monitoring
  • Hugging Face Integration


Audience for this course

  • AI/ML Engineers and Practitioners
  • DevOps Engineers
  • Cloud Architects and Engineers
  • Technical Decision-Makers

 

Recommended training

  • There are no prerequisites for this Technical Overview

 

Technology considerations

  • N/A

