# Merging the categories

With category merging, there is no need to create a new dataset when you want to perform a new operation on the same data. Training can use images from different datasets when you merge their categories to create new ones.

# How does merging work?

Let’s say you have 4 different datasets that contain images of cats and dogs. They were uploaded at different times and were used for different tasks. Now you want to use them for a common training task, and you will use the category merge function to do so.

Dashboard view
Dashboard view
The datasets have categories with 4 different names. You want the model to classify the images into only two categories - DOG and CAT.

This can be achieved through the model parameterization process. In the step where you need to define the training set, you can access all the datasets you have ever uploaded along with their categories. Now it is only up to you to decide which categories from which datasets to select for the training set.

New model
New model

The categories can be freely merged. In this example, you will include the sets of cats, perros, gatos and one category dog from the cats_dogs set into the merged training set.

Category selection
Category selection

You have selected a total of 4 categories from 4 sets - two categories representing cats, and two representing dogs. Based on the previously selected categories, you created super-categories, so-called buckets, represented by a rectangle framed by a dashed line. The names of the categories have now become the names of the buckets.

The tiles within the buckets show the original dataset name, along with the category names. Each category is automatically assigned to an appropriate bucket.

Bucket names can be edited (with the same restrictions as for category names). You can also add new buckets and delete unnecessary ones. Category tiles can be dragged and dropped between buckets. If there are at least two categories in one bucket, they will be merged. No bucket can be empty, because in the training context you cannot use a category that is not represented by any dataset.

Back to the example. We want your classification model to recognize two categories: CAT and DOG. To do this, merge the categories and edit their names by renaming the merging bucket.

Merged categories
Merged categories

Now remove the empty buckets and (optionally) edit the bucket name.

From the 4 data sets with assigned categories, you have now created a training set with two categories.

Deleting empty categories
Deleting empty categories