https://github.com/huggingface/huggingface-llama-recipes/blob/main/llama_tgi_api_inference/tgi_api_inference_recipe.ipynb