From c63a64ac7c73266463de2b97fae31b80064ac38c Mon Sep 17 00:00:00 2001
From: Falcon LLM TII UAE
Date: Tue, 30 May 2023 06:56:44 +0000
Subject: [PATCH] corrected a typo (#5)

- corrected a typo (77fc53d7cef8458c3881219f7a641b3cb9b22d22)

Co-authored-by: Ilyas Moutawwakil
---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 15574c2..49f7934 100644
--- a/README.md
+++ b/README.md
@@ -131,7 +131,7 @@ Falcon-40B was trained on 1,000B tokens of [RefinedWeb](https://huggingface.co/d
 | **Data source** | **Fraction** | **Tokens** | **Sources** |
 |--------------------|--------------|------------|-----------------------------------|
 | [RefinedWeb-English](https://huggingface.co/datasets/tiiuae/falcon-refinedweb) | 75% | 750B | massive web crawl |
-| RefinedWeb-Europe | 7% | 70B | European massive zeb crawl |
+| RefinedWeb-Europe | 7% | 70B | European massive web crawl |
 | Books | 6% | 60B | |
 | Conversations | 5% | 50B | Reddit, StackOverflow, HackerNews |
 | Code | 5% | 50B | |