pablocastro thanks for the answer. This "want to specialize on a very specific domain" is exactly what we are looking for. As mentioned, if the enterprise data goes 10 years back, even with the best quality retriever we won't be able to feed any significant percentage of that data into the model.
You are right about the security constraints, but in our case we don't have them, the stored knowledge is available to everyone. Currently people can search for this knowledge using "classical" methods, it would be amazing to be able to offer them the possibility to search using natural language and chatgpt.
Also, real-time or near real-time answers aren't an issue in our case. It's good enough if we can somehow train the model once a day or even once every few days. However at the moment it's not possible to train gpt35 or gpt4 at all. Is this something you are looking at to add as a possibility in the near future?