Running llama-cpp-python in Ubuntu 24.04 with AMD Radeon GPU #
The llama-cpp-python package provides Python bindings for llama.cpp that gives low-level access to C API and high-level APIs (OpenAI-like API) in Python scripts.
It also implements OpenAI-compatible API web server [1].
The installation instructions are included in their README.md.
Before installing llama.cpp, we need to make sure the relevant GPU drivers are installed (for AMD GPU see [2], for NVidia see [3]).