Patent 11915471 was granted and assigned to Salesforce on February, 2024 by the United States Patent and Trademark Office.
Methods, computer readable media, and devices for exceeding the limits of visual-linguistic multi-task learning are disclosed. One method may include identifying a multi-modal multi-task classification dataset including a plurality of data examples, creating a transformer machine learning model to predict a plurality of categorical attributes of a product, and training the transformer machine learning model based on the multi-modal multi-task classification dataset using an alpha decay schedule and dynamically allocating task-specific parameters for at least one of the plurality of task-specific classification heads based on task complexity.