Jan 04 2022 01:09 AM - edited Jan 04 2022 01:26 AM
Hey guys,
I want to add an AML dataset to Bonsai and train a brain with it, as described in the last sentences here: https://docs.microsoft.com/en-us/bonsai/guides/add-dataset.
But I cannot find the "+ Add dataset" button that is supposed to be "next to the Datasets list in the Bonsai UI", so I have no idea how to add a dataset to Bonsai.
I assume I have misunderstood something. Is there a button somewhere to add a dataset to Bonsai? If not, how is it done?
Here is the description:
Thanks for enlightening me :)
Torsten
Jan 04 2022 04:58 AM - edited Jan 04 2022 04:59 AM
I finally got the first part working: after enabling the "Beta features", the Datasets menu appears, and it works more or less as described. Choosing the right AML datastore is important: if you pick the wrong one, you can still find and select your dataset, but after clicking "create" nothing relevant happens.
Jan 04 2022 09:26 PM
Hi @Torsten_Katthoefer, sorry we were not able to get back to you in time. I'm glad to see you were able to figure this out by yourself.
I created an internal bug for the case you reported: an incorrect combination of fields during Dataset creation does not produce a proper error message.
About your initial issue: how did you figure out that Beta has to be enabled? (I want to see what to improve in the documentation, and how.)
Thanks for trying this new feature, and feel free to reach out to me directly about anything related to training directly with Datasets.
Jan 04 2022 10:34 PM
Hey @edilmo,
Regarding your question about how I figured out that the Beta features need to be enabled: I got lucky by clicking around randomly...
Cheers, Torsten
Jan 05 2022 07:38 AM
Thanks for your quick response, @Torsten_Katthoefer.
I will create a bug for that as well. We will fix the doc.
I really appreciate your feedback. And again, if you face any issues using this new training feature, feel free to tag me directly.
I want to take the opportunity to clarify a misleading comment in the documentation. As you may have noticed, the feature allows you to train brains (policies) directly from data, either without simulators or with both.
The feature is in beta, which is why you may run into issues like the ones you saw (in the system and in the doc). Where the doc talks about Hybrid mode, there is a comment that could be misinterpreted. Hybrid mode, which lets you use both types of sources (data and simulators), actually uses the data as the first source for training and the simulator as the source for assessment. In other words, if you are using both a simulator and data, any type of assessment, automatic or custom, is performed with the simulator. If you are in pure Offline training mode (no simulator source), both types of assessment are performed with an algorithm.
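To make the distinction concrete, here is a minimal Python sketch of the rule described above. It is only an illustration, not Bonsai code: the function name `dataset_training_plan` and the returned labels are hypothetical, and it only covers the two dataset-based modes mentioned in this post.

```python
def dataset_training_plan(has_simulator: bool) -> dict:
    """Summarize where training and assessment come from when a dataset is used.

    Hypothetical helper that mirrors the rule described in this thread;
    it does not call any Bonsai API.
    """
    if has_simulator:
        # Hybrid mode: data is the first training source,
        # and every assessment (automatic or custom) runs on the simulator.
        return {
            "mode": "hybrid",
            "training_source": "dataset",
            "assessment_source": "simulator",
        }
    # Pure Offline mode: no simulator, so assessments are done by an algorithm.
    return {
        "mode": "offline",
        "training_source": "dataset",
        "assessment_source": "algorithm",
    }


for simulator_attached in (True, False):
    print(dataset_training_plan(simulator_attached))
```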
Training with datasets lets you use expert data, which is an effective way to accelerate training, reduce the number of samples required, and, more importantly, achieve better results. A typical problem is knowing whether the task (the reward function or goal statement in Inkling) is actually aligned with the expert data in the dataset. To address that, you can use the "Pre-Training chart", which shows the performance in the dataset as a distribution of rewards or goal satisfaction rates. If you are using expert data and the "Pre-Training chart" shows bad performance, then you have an issue either in the Inkling or in the dataset.
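If you want a rough sanity check of the same idea outside the UI, the sketch below shows how a reward distribution over a dataset could be computed. This is an assumption-laden illustration and not how Bonsai builds the chart: `reward_histogram`, `example_reward`, the `temperature` field, and the sample records are all made up for the example.

```python
import math
from collections import Counter


def reward_histogram(records, reward_fn, bin_width=0.5):
    """Count how many records fall into each reward bin.

    Keys are integer bin indices; bin i covers
    [i * bin_width, (i + 1) * bin_width).
    """
    counts = Counter(math.floor(reward_fn(r) / bin_width) for r in records)
    return dict(sorted(counts.items()))


def example_reward(record):
    # Hypothetical task: keep "temperature" close to a setpoint of 20.
    return -abs(record["temperature"] - 20.0)


# Made-up "expert" records, purely for illustration.
expert_records = [{"temperature": t} for t in (19.8, 20.1, 20.4, 21.2, 23.0)]

histogram = reward_histogram(expert_records, example_reward)
for bin_index, count in histogram.items():
    low = bin_index * 0.5
    print(f"[{low:+.1f}, {low + 0.5:+.1f}): {'#' * count}")
```

If most of the expert records land in the low-reward bins, that points to the same mismatch described above: either the reward definition or the dataset needs a closer look.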
We plan to add more documentation about the impact of the type and amount of data in the future. In the meantime, I hope these comments help you and others avoid typical problems.
Jan 05 2022 08:46 AM
Hey Torsten, I have a question. Does the create button get disabled when you click it, and do you see a loading indicator in the create button box? The request to create a dataset is sent when you click create, and it may take a few moments to complete.
To help with debugging, could you check the browser console and network tab to see whether any errors are occurring?
Jan 05 2022 10:05 AM
Jan 05 2022 11:08 AM
@Torsten_Katthoefer I'll forward this error to our service team.
I have another question, as I may have misread the thread: were you able to get it to work, and does this situation only occur when you select an incorrect combination of datastore/container/version?
Jan 05 2022 11:29 AM
Jan 05 2022 11:44 AM
Jan 05 2022 01:01 PM
Jan 05 2022 04:42 PM