User Guide

DataMate feature usage guides

This guide introduces how to use each feature module of DataMate.

DataMate provides comprehensive data processing solutions for large models, covering data collection, management, cleaning, annotation, synthesis, evaluation, and the full process.

Feature Modules

Typical Use Cases

Model Fine-tuning Scenario

1. Data Collection → 2. Data Management → 3. Data Cleaning → 4. Data Annotation
↓
5. Data Evaluation → 6. Export Training Data

RAG Application Scenario

1. Upload Documents → 2. Vectorization Index → 3. Knowledge Base Management
↓
4. Agent Chat (Knowledge Base Q&A)

Data Augmentation Scenario

1. Prepare Raw Data → 2. Create Instruction Template → 3. Data Synthesis
↓
4. Quality Evaluation → 5. Export Augmented Data

Data Collection

Collect data from multiple data sources with DataMate

Data Management

Manage datasets and files with DataMate

Data Cleaning

Clean and preprocess data with DataMate

Data Annotation

Perform data annotation with DataMate

Data Synthesis

Use large models for data augmentation and synthesis

Data Evaluation

Evaluate data quality with DataMate

Knowledge Base Management

Build and manage RAG knowledge bases with DataMate

Operator Market

Manage and use DataMate operators

Pipeline Orchestration

Visual workflow orchestration with DataMate

Agent Chat

Use DataMate Agent for intelligent conversation


Last modified February 9, 2026: :memo: add english docs (3868c82)