CorpusNÓS: A massive Galician corpus for training LLM Collection CorpusNÓS is the largest collection of Galician-language data for training LLMs. • 1 item • Updated 3 days ago
Domain Specific Corpora Collection Collection of corpora drawn from specific domains, mainly in the Galician language. • 4 items • Updated 3 days ago
Text Datasets for Fine-tuning and Instruction tuning Collection Collection of Galician datasets for fine-tuning, instruction tuning, or general training. • 12 items • Updated 2 days ago