Update README.md
parent bf7d7cc428
commit c2edde545c
@@ -48,7 +48,7 @@ It has been instrumental in generating high-performing model checkpoints and ser
 Dataset Summary
 
 The Open Orca dataset is a collection of unaugmented and augmented FLAN data.
-Currently ~1M GPT-4 completions, and ~3.5M GPT-3.5 completions.
+Currently ~1M GPT-4 completions, and ~3.0M GPT-3.5 completions.
 It is tabularized in alignment with the distributions presented in the ORCA paper and currently represents a partial completion of the full intended dataset, with ongoing generation to expand its scope.
 The data is primarily used for training and evaluation in the field of natural language processing.
 