Update README.md

This commit is contained in:
Bleys 2023-07-01 05:13:46 +00:00 committed by huggingface-web
parent bf7d7cc428
commit c2edde545c

@ -48,7 +48,7 @@ It has been instrumental in generating high-performing model checkpoints and ser
Dataset Summary Dataset Summary
The Open Orca dataset is a collection of unaugmented and augmented FLAN data. The Open Orca dataset is a collection of unaugmented and augmented FLAN data.
Currently ~1M GPT-4 completions, and ~3.5M GPT-3.5 completions. Currently ~1M GPT-4 completions, and ~3.0M GPT-3.5 completions.
It is tabularized in alignment with the distributions presented in the ORCA paper and currently represents a partial completion of the full intended dataset, with ongoing generation to expand its scope. It is tabularized in alignment with the distributions presented in the ORCA paper and currently represents a partial completion of the full intended dataset, with ongoing generation to expand its scope.
The data is primarily used for training and evaluation in the field of natural language processing. The data is primarily used for training and evaluation in the field of natural language processing.