From 378ac6e247153df7c65f8c92eabd532be8cddc7e Mon Sep 17 00:00:00 2001 From: Bleys Date: Thu, 29 Jun 2023 19:15:08 +0000 Subject: [PATCH] Update README.md --- README.md | 46 +++++++++++++++++++++------------------------- 1 file changed, 21 insertions(+), 25 deletions(-) diff --git a/README.md b/README.md index 267b129..c7ad5ff 100644 --- a/README.md +++ b/README.md @@ -37,44 +37,40 @@ Dataset Use Use Cases Usage Caveats Getting Started - -the Orca paper has been replicated to as fine of a degree of precision as several obsessive nerds sweating for weeks could pull off(a very high degree) -We will be releasing Orca's as the models continue to be trained -And the dataset after we wipe off all the sweat and tears. -Right now, we're testing our fifth iteration of orca on a subset of the final data, and are just about to jump into the final stages! -Thanks to - the + + +The Orca paper has been replicated to as fine of a degree of precision as several obsessive nerds sweating for weeks could pull off (a very high degree) +We will be releasing Orcas as the models continue to be trained and the dataset after we wipe off all the sweat and tears. +Right now, we're testing our fifth iteration of orca on a subset of the final data, and are just about to jump into the final stages! + +Thanks to the Team: - - - winglian - - -@erhartford - - Nanobit +winglian +erhartford +Nanobit Pankajmathur http://AlignmentLab.ai: - Entropi - AtlasUnified - NeverendingToast - Autometa +Autometa +Entropi +AtlasUnified +NeverendingToast -And of course, as always TheBloke, for being the backbone of the whole community. +Also of course, as always, TheBloke, for being the backbone of the whole community. Be sure to check out Axolotl on github, developed by Nano and Winglian, the platform that developed and trained manticore, minotaur, and many others! -OrcaMini on huggingface! - -Samantha, WizardVicuna, and more! +Other team projects on huggingface: +OrcaMini +Samantha +WizardVicuna, and more! and maybe even one of our projects at: http://Alignmentlab.ai https://discord.gg/n9hXaBPWxx -we are looking for sponsors or collaborators to help us build these models to the scale they deserve, my 3090 wont quite cut i this time, i think. -not for Falcon 40b, it wont ! +We are looking for sponsors or collaborators to help us build these models to the scale they deserve; stacks of 3090s wont quite cut it this time, we think. +Not for Falcon 40b, it won't! Dataset Summary The Open Orca dataset is a collection of unaugmented and augmented FLAN data. It is tabularized in alignment with the distributions presented in the ORCA paper and currently represents a partial completion of the full intended dataset, with ongoing generation to expand its scope. The data is primarily used for training and evaluation in the field of natural language processing.