#
Merging the categories
With category merging, there is no need to create a new dataset when you want to perform a new operation on the same data. Training can use images from different datasets when you merge their categories to create new ones.
#
How does merging work?
Let’s say you have 4 different datasets that contain images of cats and dogs. They were uploaded at different times and were used for different tasks. Now you want to use them for a common training task, and you will use the category merge function to do so.
DOG
and CAT
.
This can be achieved through the model parameterization process. In the step where you need to define the training set, you can access all the datasets you have ever uploaded along with their categories. Now it is only up to you to decide which categories from which datasets to select for the training set.
The categories can be freely merged. In this example, you will include the sets of cats
, perros
, gatos
and one category dog
from the cats_dogs
set into the merged training set.
You have selected a total of 4 categories from 4 sets - two categories representing cats, and two representing dogs. Based on the previously selected categories, you created super-categories, so-called buckets, represented by a rectangle framed by a dashed line. The names of the categories have now become the names of the buckets.
The tiles within the buckets show the original dataset name, along with the category names. Each category is automatically assigned to an appropriate bucket.
Bucket names can be edited (with the same restrictions as for category names). You can also add new buckets and delete unnecessary ones. Category tiles can be dragged and dropped between buckets. If there are at least two categories in one bucket, they will be merged. No bucket can be empty, because in the training context you cannot use a category that is not represented by any dataset.
Back to the example. We want your classification model to recognize two categories: CAT
and DOG
. To do this, merge the categories and edit their names by renaming the merging bucket.
Now remove the empty buckets and (optionally) edit the bucket name.
From the 4 data sets with assigned categories, you have now created a training set with two categories.