相关资源

Embedding Engine

Qdrant is a vector similarity search engine and vector database. It provides a production-ready service with a convenient API to store, search, and manage points—vectors with an additional payload Qdrant is tailored to extended filtering support.

  • SIZE: ~30M

LLMs

标准化 OpenAI API 接口实现

ialacol 是一个 OpenAI API 的轻量级直接替代品。ialacol is inspired by other similar projects like LocalAI, privateGPT, local.ai, llama-cpp-python, closedai, and mlc-llm, with a specific focus on Kubernetes deployment.

Features

  • Compatibility with OpenAI APIs, compatible with langchain.
  • Lightweight, easy deployment on Kubernetes clusters with a 1-click Helm installation.
  • Streaming first! For better UX.
  • Optional CUDA acceleration.
  • Compatible with Github Copilot VSCode Extension

Embedding Model

See Receipts below for instructions of deployments.

And all LLMs supported by ctransformers.