相关资源
Embedding Engine
Qdrant is a vector similarity search engine and vector database. It provides a production-ready service with a convenient API to store, search, and manage points—vectors with an additional payload Qdrant is tailored to extended filtering support.
- SIZE: ~30M
LLMs
标准化 OpenAI API 接口实现
ialacol 是一个 OpenAI API 的轻量级直接替代品。ialacol is inspired by other similar projects like LocalAI, privateGPT, local.ai, llama-cpp-python, closedai, and mlc-llm, with a specific focus on Kubernetes deployment.
- Compatibility with OpenAI APIs, compatible with langchain.
- Lightweight, easy deployment on Kubernetes clusters with a 1-click Helm installation.
- Streaming first! For better UX.
- Optional CUDA acceleration.
- Compatible with Github Copilot VSCode Extension
Embedding Model
See Receipts below for instructions of deployments.
- LLaMa 2 variants
- OpenLLaMA variants
- StarCoder variants
- WizardCoder
- StarChat variants
- MPT-7B
- MPT-30B
- Falcon
And all LLMs supported by ctransformers.