diff --git a/README.md b/README.md index b0a2acc..5fd6572 100644 --- a/README.md +++ b/README.md @@ -134,17 +134,14 @@ Please refer to our paper for further details. ### Dataset Structure -This dataset contains demonstrations of human-assistant conversations which were collected -on the [open-assistant.io](https://www.open-assistant.io/) website until April, 12 2023. - -Conversations are exported as conversation trees with messages as nodes. -The root node of a conversation tree is called the initial prompt. Each message can have -multiple replies. Nodes with more than one reply can have a `rank` field indicating the -aggregated user preference (the most preferred message has rank 0). +This dataset contains message trees which each have an inital prompt message as root which can have +multiple child messages as replies which itself again can have multiple replies. All messages have a role property which can either be "assistant" or "prompter". The roles in conversation threads from prompt to leaf node are stricly alternating between "prompter" and "assistant". +This version of the dataset contains data collected on the [open-assistant.io](https://www.open-assistant.io/) website until April, 12 2023. + ### JSON Example: Message For readability the following JSON examples are shown formatted with indentation on multiple lines.