llama.cpp: The Ultimate Guide to Efficient LLM Inference and Applications