[ English | 简体中文 ]
This project provides private large language model (LLM) services. It aims to offer quick access to both general models (GPT-3.5, GPT-4) and private models (Qwen1.5, ChatGLM3, LLaMA2, Baichuan2, etc.) through a unified API. Built on the LangChain framework, it provides multi-turn dialogue (Chat) and retrieval-augmented generation (RAG) services. The project is named after the character Aris from Blue Archive, shown in the figure below.
- [2024-07-13] We open-sourced Aris-AI-Model-Server, which integrates LLM, Embedding, and Reranker deployment services and exposes an OpenAI-compatible API, making it easy to deploy private models.
- [2024-06-23] We released the Aris-14B-Chat series models, fine-tuned (SFT and DPO) from Qwen1.5-14B-Chat on our private dataset. Please comply with the Qwen open-source license when using them.
- [2024-06-15] Neo4j is now used as the database for storing knowledge bases.
- Transformers
- PEFT
- PyTorch
- DeepSpeed
- llama.cpp
- llama-cpp-python
- LangChain
- FastAPI
- SQLAlchemy
- JWT
- MySQL
- Redis
- Neo4j
- Streamlit
- Docker
- User registration, login, and permission management
- Dialogue management and history management
- Model (LLM, Embedding) management and preset (System) prompt management
- Vector database management and insertion, supporting:
  - Files: PDF, Markdown, HTML, Jupyter, TXT, and code files (Python, C++, Java, etc.)
  - Links: arXiv, Git, unauthenticated URLs (with recursive crawling and automated tool crawling)
- Chat: supports multi-turn dialogue
- Retrieval QA: supports question answering with retrieval-augmented generation (RAG)
- Interface for uploading knowledge bases
- Dialogue interface
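The Retrieval QA feature above follows the usual RAG pattern: embed the query, retrieve the most similar knowledge-base chunks, and stuff them into the prompt. A minimal sketch of that retrieval step, with toy vectors standing in for a real Embedding model (this is a conceptual illustration, not the project's actual implementation):

```python
import math

def cosine(a, b):
    # Cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def retrieve(query_vec, docs, k=2):
    # docs: list of (text, embedding) pairs; return the k most similar texts.
    ranked = sorted(docs, key=lambda d: cosine(query_vec, d[1]), reverse=True)
    return [text for text, _ in ranked[:k]]

# Toy embeddings stand in for a real Embedding model.
docs = [
    ("Aris supports multi-turn chat", [1.0, 0.1, 0.0]),
    ("Neo4j stores the knowledge base", [0.0, 1.0, 0.2]),
    ("Docker compose starts the stack", [0.1, 0.0, 1.0]),
]
context = retrieve([0.9, 0.2, 0.1], docs, k=1)
prompt = f"Answer using this context: {context[0]}\nQuestion: ..."
```

In the real service the embeddings come from the configured Embedding model and the vectors live in the knowledge-base store rather than in a Python list.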
.
├── assets
├── confs
│ ├── deployment
│ └── local
├── docker
│ ├── deployment
│ └── local
├── envs
│ ├── deployment
│ └── local
├── kubernetes
├── logs
├── pages
└── src
├── api
│ ├── auth
│ ├── model
│ └── router
│ └── v1
│ ├── model
│ └── oauth2
├── config
├── langchain_aris
├── logger
├── middleware
│ ├── jwt
│ ├── logger
│ ├── mysql
│ │ └── models
│ └── redis
└── webui
git clone https://github.com/hcd233/Aris-AI
cd Aris-AI
You can skip this step, but make sure your Python environment is version 3.11.
conda create -n aris python=3.11.0
conda activate aris
pip install poetry
poetry install
See the template file
docker-compose -f docker/local/docker-compose.yml up -d
Note that you need to load local/api.env as environment variables in your IDE before running:
python aris_api.py
Note that you need to load local/webui.env as environment variables in your IDE before running:
streamlit run aris_webui.py
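If you prefer running from a plain terminal instead of an IDE, the env file can be exported before launching. A sketch (the file path and variable below are illustrative stand-ins, not the project's real template):

```shell
# Stand-in env file; in the repo you would source local/api.env instead.
cat > /tmp/aris_demo.env <<'EOF'
API_PORT=8000
EOF
set -a                # auto-export every variable defined while sourcing
. /tmp/aris_demo.env
set +a
echo "API_PORT=$API_PORT"   # then run: python aris_api.py
```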
- SwaggerUI: http://localhost:${API_PORT}/docs
- WebUI: http://localhost:8501
See the template file
docker volume create mysql-data
docker volume create redis-data
docker volume create neo4j-data
docker-compose -f docker/deployment/docker-compose.yml up -d --no-build
- For login, only simple username/password verification is implemented, and the WebUI does not provide a registration page. Please call the API directly to register, and set the administrator flag (is_admin=1) in the database to gain access to private models.
- After logging in, you need to carry a JWT token to manage API keys, which are used to call the private model services.
- To call a general large model service (currently only the OpenAI series models, or proxies with OpenAI-compatible interfaces), you can access it directly through the API. Store the base URL, key, max_tokens, and similar settings in the database; the System prompt is customizable.
- To call a private model service, deploy the model behind an OpenAI-compatible API (for example with Aris-AI-Model-Server) and configure it accordingly.
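The JWT flow mentioned above can be sketched with a minimal HS256 sign/verify using only the standard library. This is a conceptual illustration of how such tokens work, not the project's actual middleware; the secret and claim names are made up for the example:

```python
import base64
import hashlib
import hmac
import json

SECRET = b"demo-secret"  # illustrative; a real service loads this from config

def b64url(data: bytes) -> str:
    # base64url without padding, as used in JWTs.
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()

def sign(payload: dict) -> str:
    # Token is header.payload.signature, each part base64url-encoded.
    header = b64url(json.dumps({"alg": "HS256", "typ": "JWT"}).encode())
    body = b64url(json.dumps(payload).encode())
    msg = f"{header}.{body}".encode()
    sig = b64url(hmac.new(SECRET, msg, hashlib.sha256).digest())
    return f"{header}.{body}.{sig}"

def verify(token: str) -> bool:
    # Recompute the signature and compare in constant time.
    header, body, sig = token.split(".")
    msg = f"{header}.{body}".encode()
    expected = b64url(hmac.new(SECRET, msg, hashlib.sha256).digest())
    return hmac.compare_digest(sig, expected)

token = sign({"sub": "alice", "is_admin": 1})
```

In practice a library such as PyJWT also handles expiry (`exp`) and other registered claims; the sketch only shows the signing scheme.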
- Support more model providers (AzureOpenAI, Gemini, HuggingFaceEndpoint, llama.cpp)
- More RAG strategies (RAG fusion, reranking, multi-path recall, etc.)
- Support multi-modal Chat & RAG
- Maintain a key pool per model for load balancing
- Support Agents and tool calls
- Release fine-tuned private models
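For the key-pool roadmap item, the simplest scheme is round-robin rotation over the keys registered for one model. A minimal sketch of that idea (this is not implemented in the project yet; names and keys are illustrative):

```python
from itertools import cycle

class KeyPool:
    """Round-robin pool of API keys for a single model (illustrative sketch)."""

    def __init__(self, keys):
        if not keys:
            raise ValueError("key pool must not be empty")
        self._cycle = cycle(keys)

    def next_key(self):
        # Each call hands out the next key, wrapping around at the end.
        return next(self._cycle)

pool = KeyPool(["sk-aaa", "sk-bbb", "sk-ccc"])
picks = [pool.next_key() for _ in range(4)]
```

A production version would also track per-key rate limits and temporarily evict keys that return errors, but round-robin is the usual starting point.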
Due to a busy work schedule, progress on this project may be slow and updates will be occasional. PRs and issues are welcome.