Experimental GPU support
Adds support for Metal/CUDA and OpenCL acceleration for LLama-based models.
Adds CI for the different acceleration backends to create prebuild binaries
Adds support for Metal/CUDA and OpenCL acceleration for LLama-based models.
Adds CI for the different acceleration backends to create prebuild binaries