Mar 31, 2023
Recent approaches consist of optimizing how information is processed after training on huge datasets. Thanks to the size of the datasets and to deep learning techniques, especially the Transformer architecture, it's as if the models "understand" most of the data, within the limits of the statistical distribution they were exposed to. But they have no way to deal with a completely novel situation, and we don't know of any approach that solves that.