From 37d790373da332a4c8be24bd29ec1550e7f04c3f Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Andreas=20K=C3=B6pf?= Date: Sat, 15 Apr 2023 16:06:25 +0000 Subject: [PATCH] Update README.md --- README.md | 11 ++++------- 1 file changed, 4 insertions(+), 7 deletions(-) diff --git a/README.md b/README.md index b0a2acc..5fd6572 100644 --- a/README.md +++ b/README.md @@ -134,17 +134,14 @@ Please refer to our paper for further details. ### Dataset Structure -This dataset contains demonstrations of human-assistant conversations which were collected -on the [open-assistant.io](https://www.open-assistant.io/) website until April, 12 2023. - -Conversations are exported as conversation trees with messages as nodes. -The root node of a conversation tree is called the initial prompt. Each message can have -multiple replies. Nodes with more than one reply can have a `rank` field indicating the -aggregated user preference (the most preferred message has rank 0). +This dataset contains message trees which each have an inital prompt message as root which can have +multiple child messages as replies which itself again can have multiple replies. All messages have a role property which can either be "assistant" or "prompter". The roles in conversation threads from prompt to leaf node are stricly alternating between "prompter" and "assistant". +This version of the dataset contains data collected on the [open-assistant.io](https://www.open-assistant.io/) website until April, 12 2023. + ### JSON Example: Message For readability the following JSON examples are shown formatted with indentation on multiple lines.