Data Collection
Collect data from multiple data sources with DataMate
This guide introduces how to use each feature module of DataMate.
DataMate provides comprehensive data processing solutions for large models, covering data collection, management, cleaning, annotation, synthesis, evaluation, and the full process.
1. Data Collection → 2. Data Management → 3. Data Cleaning → 4. Data Annotation
↓
5. Data Evaluation → 6. Export Training Data
1. Upload Documents → 2. Vectorization Index → 3. Knowledge Base Management
↓
4. Agent Chat (Knowledge Base Q&A)
1. Prepare Raw Data → 2. Create Instruction Template → 3. Data Synthesis
↓
4. Quality Evaluation → 5. Export Augmented Data
Collect data from multiple data sources with DataMate
Manage datasets and files with DataMate
Clean and preprocess data with DataMate
Perform data annotation with DataMate
Use large models for data augmentation and synthesis
Evaluate data quality with DataMate
Build and manage RAG knowledge bases with DataMate
Manage and use DataMate operators
Visual workflow orchestration with DataMate
Use DataMate Agent for intelligent conversation
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.