Retrieval-Augmented Generation (RAG) on the Cheap: Part 2First steps; running the inference in the cloud.1d ago1d ago
Retrieval-Augmented Generation (RAG) on the Cheap: Part 1Implementing and exploring the costs of a Retrieval-Augmented Generation (RAG) solution with a small (pilot) project.2d ago2d ago
Retrieval-Augmented Generation (RAG) with vLLM by Example: Part 3Using vLLM for the language model.Oct 27Oct 27
Retrieval-Augmented Generation (RAG) with vLLM by Example: Part 2Using vLLM for the embedding model.Oct 27Oct 27
Retrieval-Augmented Generation (RAG) with vLLM by Example: Part 1Starting the exploration; introducing key concepts through example.Oct 25A response icon1Oct 25A response icon1
vLLM by Example: Part 1Kicking the tires of vLLM; an open source framework for serving Large Language Models (LLM).Oct 12Oct 12
Ollama by Example: Part 2A seemingly random selection of topics that leads to an interesting use case.Oct 6Oct 6