Running open large language models in production with Ollama and serverless GPUs Similarity score = 0.59 More