Apr 27, 2023 · This dataset includes two splits (train and test). We created them by dividing a randomly permuted version of the corpus into (95%, 5%) ...
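The split described above (shuffle the corpus, then cut it 95% / 5%) can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the procedure the dataset authors actually ran: the function name `split_corpus`, the fixed `seed`, and the toy `docs` list are all hypothetical.

```python
import random


def split_corpus(documents, test_fraction=0.05, seed=0):
    """Shuffle a copy of `documents` and cut it into (train, test) parts.

    Hypothetical helper: the seed and the rounding rule are assumptions,
    not details taken from the dataset card.
    """
    rng = random.Random(seed)       # local RNG so the split is reproducible
    shuffled = list(documents)      # copy, so the caller's list is untouched
    rng.shuffle(shuffled)           # random permutation of the corpus
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Toy stand-in for the corpus: 100 placeholder documents.
docs = [f"doc-{i}" for i in range(100)]
train, test = split_corpus(docs)
print(len(train), len(test))
```

Every document lands in exactly one of the two parts, and reusing the same seed reproduces the same split.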
Datasets is a library for easily accessing and sharing datasets for Audio, Computer Vision, and Natural Language Processing (NLP) tasks. Load a dataset in a ...
This corpus can be used for training all language models trained by Masked Language Modeling (MLM) or any other self-supervised objective. language-modeling ...
Aug 22, 2022 · This file is stored with Git LFS. It is too big to display, but you can still download it. Git LFS Details.
Jan 9, 2023 · I have just tried to make my first dataset on the Huggingface website. ... resolve any data file that matches ['train ... https://huggingface.co/ ...
The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools - huggingface/datasets.
We're on a journey to advance and democratize artificial intelligence through open source and open science.
Aug 23, 2022 · We propose naab, the biggest cleaned and ready-to-use open-source textual corpus in Farsi. It contains about 130GB of data, 250 million ...