ModelEngine Data Engineering and Model Engineering Integration Guide
This document provides a detailed guide on how to integrate ModelEngine's Data Engineering (DataMate) and Model Engineering (ModelLite) in the Nexent platform.
1. ModelEngine Overview
ModelEngine provides an end-to-end AI toolchain for data processing, knowledge generation, model fine-tuning and deployment, as well as RAG (Retrieval Augmented Generation) application development. It aims to shorten the cycle from data to model, and from data to AI application deployment. ModelEngine offers low-code orchestration, flexible execution scheduling, high-performance data bus and other technologies. Combined with built-in data processing operators, RAG framework and extensive ecosystem capabilities, it provides data development engineers, model development engineers, and application development engineers with an efficient, easy-to-use, open, flexible, out-of-the-box, and lightweight full-process AI development experience.
2. Integrating Model Engineering (ModelLite)
2.1 Model Engineering Overview
ModelLite is a toolchain for model fine-tuning and model inference, hosting and providing access to various AI models. After integrating ModelLite model services in Nexent, you can:
- Sync all models deployed on the ModelEngine platform
- Use Large Language Models (LLM) for conversation generation
- Use Embedding models for knowledge base processing
- Use Vision Language Models (VLM) for image processing
2.2 Configuration Steps
Step 1: Obtain ModelEngine Credentials
- Visit your ModelEngine platform
- Create an API Key (for authentication)
- Record the ModelEngine host address (format:
https://<host>:<port>)
⚠️ Note: Make sure you have deployed the required models on ModelEngine, otherwise you won't see any models after syncing.
Step 2: Configure ModelEngine Models in Nexent
- Log in to Nexent platform
- Go to Model Management page
- Click Sync ModelEngine Configuration button in Model Settings (when deploying Nexent, need to change the value of MODEL_ENGINE_ENABLED variable to True in the .env file)
- Fill in the following information in the popup:
- Host Address: ModelEngine service URL (e.g.,
https://<host>:<port>) - Model Type: Select the model type to integrate
- API Key: ModelEngine API Key
- Host Address: ModelEngine service URL (e.g.,
- After configuration, click Get Models button. The system will automatically fetch all available models deployed on ModelEngine. Enable the models as needed.
- Successfully synced models will appear in the model list, marked with "ModelEngine" as the source.
3. Integrating Data Engineering (DataMate)
3.1 What is Datamate
DataMate is an enterprise-level data processing platform for model fine-tuning and RAG retrieval. It supports core functions such as data collection, data management, operator marketplace, data cleaning, data synthesis, data annotation, data evaluation, and knowledge generation. By integrating Datamate, you can:
- Reuse existing Datamate knowledge base resources
- Retrieve Datamate documents in Nexent agents
3.2 Configuration Steps
Step 1: Install and Start Datamate Service
First, you need to deploy the Datamate service. For detailed installation instructions, refer to Datamate Official Documentation.
After startup, record the Datamate service address (e.g., https://<host>:<port>).
Step 2: Configure Datamate in Nexent
- Log in to Nexent platform
- Go to Knowledge Base page
- Click DataMate Configuration button
- Fill in the Datamate server address:
- Datamate URL: Datamate service address (e.g.,
https://<host>:<port>)
- Datamate URL: Datamate service address (e.g.,
- After configuration, click Sync button. The system will automatically fetch all knowledge bases from Datamate
- After successful sync, knowledge bases will appear in the knowledge base list, marked with source as "DataMate"
Step 3: Create or Edit Knowledge Base Retrieval Agent
- Go to Agent Development page
- Create a new agent or edit an existing one
Step 4: Add Tools
In the agent configuration page:
Find the Tool Configuration section
Click Local Tools > Search button
Select
datamate_searchtool from the tool list: for retrieving Datamate knowledge basesConfigure
datamate_searchtool parameters:a) Fill in the Datamate server address (usually auto-filled from your previous configuration)
b) Click Select Knowledge Base button
c) Select Datamate knowledge bases to retrieve from the knowledge base list (multiple selection supported)
d) Click Confirm to save configuration
4. Comprehensive Usage Example
Scenario: Creating a Knowledge Base Retrieval Agent
Configure ModelEngine Models
- Go to Model Management page
- Click ModelEngine Configuration, fill in API Key and host address
- After syncing models, select a Large Language Model as the agent's runtime model
Integrate Datamate Knowledge Base
- Go to Knowledge Base page
- Click DataMate Configuration, fill in Datamate server address
- Click Sync DataMate Knowledge Bases to get available knowledge base list
Create Agent
- Go to Agent Management, create a new agent
- Add
datamate_searchtool in tool configuration - Select synced Datamate knowledge bases
- Write system prompt, for example: "You are a professional product assistant. You can answer user questions based on documents from the Datamate knowledge base."
Test Usage
- Interact with the agent on the chat page
- Ask product-related questions, the agent will automatically retrieve relevant content from Datamate knowledge base and respond
5. Related Resources
6. Technical Support
If you encounter issues during usage, feel free to ask questions on GitHub Discussions
