# Glossary

A glossary has been created to make this documentation easier to use. Here you will find definitions of key terms you may encounter while using ONESTEP AI.

# Accuracy

A measure of the recognition performance of a neural network model. It usually represents the percentage of correct recognitions over all, calculated on a given set of data different from the training set.
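As an illustration (not OSAI's internal code), accuracy can be computed as the fraction of correct recognitions over a held-out set:

```python
def accuracy(predictions, ground_truth):
    """Fraction of correct recognitions over all samples."""
    correct = sum(p == t for p, t in zip(predictions, ground_truth))
    return correct / len(ground_truth)

# Evaluated on data different from the training set:
print(accuracy(["cat", "dog", "cat", "dog"], ["cat", "dog", "dog", "dog"]))  # 0.75
```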

# Adversarial learning

An operation that alters all detected objects so that they look different from the neural network's point of view. In effect, the neural network launches an adversarial attack on itself.


An application installed on an edge device to run a trained and properly optimized neural network model. The model can be used on an image from a camera connected to the device and streaming real-time video, or on a photo/video file.

# Angle

The rotation angle of the photo, set in the Rotation section of the Dataset Manipulator. The photo is rotated by consecutive multiples of the set angle, until it reaches 360 degrees, or until the limit of photos generated from one for the Rotation operation is reached.

See also: Rotation

# Annotation

The image annotation stores information about the labels describing the depicted objects and any additional information, depending on the selected standard.

# Batch/Batch size

The number of images that the CPU or GPU is processing simultaneously at a given point during the training process.

# Brightness

In the Dataset Manipulator:

Brightness parametrization for images freshly generated by the Brightness and Contrast operation.

See also: Manipulator

# Brightness var (Brightness variance)

The range of brightness changes for newly generated images, on a scale from 0 to 255.

See also: Brightness; Brightness and Contrast

# Brightness and Contrast

In the Dataset Manipulator:

An operation used to change the brightness and/or contrast of an image. Based on a set of input images (or, if this is not the first operation, already generated data), it produces the number of new images specified in the parameter. The formula used here is:

o(x, y) = α ⋅ i(x, y) + β

where:

α - the parameter responsible for the contrast change, in the range [0.1, 3.0],

i(x, y) - the value of the input pixel with coordinates x and y,

β - the parameter responsible for the brightness change, in the range [0, 255].

Each created image will have a brightness value that is the sum of the original value and a random value from the [-β, β] range. The parameter α, responsible for the contrast, will also be generated randomly, taking into account the lower and upper bounds specified in the parameter (these cannot exceed the range given above).

See also: Manipulator
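A minimal Python sketch of this operation (an illustration, not the Manipulator's actual implementation; it assumes 8-bit grayscale pixel rows and clips results to [0, 255]):

```python
import random

def brightness_contrast(pixels, alpha_min, alpha_max, beta):
    """Apply o(x, y) = alpha * i(x, y) + beta_offset with random parameters.

    alpha is drawn from [alpha_min, alpha_max] (within [0.1, 3.0]);
    beta_offset is drawn from [-beta, beta]; results are clipped to [0, 255].
    """
    alpha = random.uniform(alpha_min, alpha_max)
    beta_offset = random.uniform(-beta, beta)
    return [[min(255, max(0, round(alpha * p + beta_offset))) for p in row]
            for row in pixels]

image = [[0, 128, 255], [64, 64, 64]]
# Identity parameters (alpha fixed at 1.0, beta range 0) leave the image unchanged:
print(brightness_contrast(image, 1.0, 1.0, 0))
```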

# Bucket

Category container. All categories added to it will be merged into a single category matching the name of the bucket, but only in the context of a particular training. The originally defined categories will remain intact.

See also: Category

# Burn in

Initial iterations of training; a 'warm-up' during which the learning rate changes dynamically.

The first n batches (iterations) increase the learning rate according to the formula: current\_learning\_rate = learning\_rate ⋅ \left({\frac{iterations}{n}}\right)^{power}, where power = 4 by default. This parameter cannot be greater than the maximum number of batches.
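The warm-up formula above can be sketched as follows (illustrative only; the function and parameter names are assumptions):

```python
def warmup_learning_rate(iteration, n, base_lr, power=4):
    """During the first n batches, scale the learning rate up smoothly:
    current_lr = base_lr * (iteration / n) ** power; afterwards use base_lr."""
    if iteration >= n:
        return base_lr
    return base_lr * (iteration / n) ** power

print(warmup_learning_rate(500, 1000, 0.001))   # 0.001 * 0.5**4 = 6.25e-05
print(warmup_learning_rate(1000, 1000, 0.001))  # warm-up finished: 0.001
```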

# Category

Used interchangeably with:

  1. classification category (i.e., a category for training a classification model, in which each image belongs to only one category);

  2. detection label category (i.e., a category for training a detection model, in which an image has 0 or more object categories assigned to it, along with their position coordinates).

It is possible to combine categories. This results in the creation of a super-category that contains the parameters of the sub-categories selected by the user. A super-category in OSAI is called a bucket.

See also: Bucket

# Classification

By classification, we mean giving the entire image a single category or class that defines the object depicted in the image.

# Cloning

A process of creating exact copies of existing datasets.

See also: Dataset cloning

# Confusion matrix

A matrix representing the number of correct and incorrect model recognitions on a given test dataset. It compares the categories that should be recognized (rows) with the categories recognized by the model (columns). Correct recognitions populate the cells on the diagonal of the matrix; the remaining cells contain incorrect recognitions.
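A minimal sketch of how such a matrix is built (illustrative Python, not OSAI code):

```python
def confusion_matrix(actual, predicted, categories):
    """Rows: category that should be recognized; columns: category the model returned."""
    index = {c: k for k, c in enumerate(categories)}
    matrix = [[0] * len(categories) for _ in categories]
    for a, p in zip(actual, predicted):
        matrix[index[a]][index[p]] += 1
    return matrix

cats = ["cat", "dog"]
m = confusion_matrix(["cat", "cat", "dog", "dog"], ["cat", "dog", "dog", "dog"], cats)
print(m)  # [[1, 1], [0, 2]] -- correct recognitions sit on the diagonal
```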

# Contrast

Range of contrast change for newly generated images shown on a scale from 0.1 to 3.0.

# Conversion

The process of transforming a model from its initial format to the target format. Some information may be lost in the process, resulting in a slight loss of precision. However, conversion usually improves the overall model performance.

OSAI automatically selects the appropriate format, providing a solution optimized for the target hardware.

# Crop threshold

The percentage limit for clipping annotated objects (in images generated by a rotation operation).

A rotation operation may cause part of an annotated object to fall outside the image, i.e. to be clipped. The crop threshold determines how much of the object must remain visible in the resulting image for the annotation to make sense. Annotations of objects whose visibility falls below the indicated percentage will be removed.

See also: Rotation
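The visibility check can be sketched as follows (an illustration; the Manipulator's actual area computation may differ):

```python
def keep_annotation(visible_area, full_area, crop_threshold):
    """Keep an annotation only if at least crop_threshold percent of the
    object's area is still visible after the rotation operation."""
    visibility = 100.0 * visible_area / full_area
    return visibility >= crop_threshold

print(keep_annotation(visible_area=80, full_area=100, crop_threshold=50))  # True
print(keep_annotation(visible_area=30, full_area=100, crop_threshold=50))  # False
```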

# Dashboard

The main view of the user interface. Displays information about all recent actions in the application, such as recently used datasets, recently trained models or recently received notifications.

# Data augmentation

Operations used to increase the amount of data by adding slightly modified copies of already existing data or newly created synthetic data from existing data. It acts as a regularizer and helps reduce overfitting when training a machine learning model - Source.

# Dataset

A collection of data representing classes of objects to be recognized by a trained neural network model. It consists of images and (possibly) annotations added to them.

See also: Annotation

# Dataset manipulator

Go to: Manipulator

# Decay

A weaker update of the weights for typical features; it counteracts imbalance in the dataset (an optimizer parameter) - Source.

# Detection

Recognition, i.e. the information returned by the neural network model that an object of a given category has been detected in a given area of the image.

See also: Category

# Device

Edge device; an endpoint device on which we can run trained and/or converted neural network models. Running the model on the device is called deployment.

# DNN (Deep Neural Network)

A Deep Neural Network (DNN) is an artificial neural network with multiple layers between the input and output layers. There are different types of neural networks, but they always consist of the same components: neurons, synapses, weights, biases and functions. These components function in a similar way to the human brain and can be trained like any other ML algorithm - Source.

# Early Stop

Training can stop early if the loss value does not decrease for the specified number of epochs.
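A minimal sketch of this stopping criterion (illustrative; OSAI's exact improvement test may differ):

```python
def should_stop(loss_history, patience):
    """Stop when the loss has not improved for `patience` consecutive epochs."""
    if len(loss_history) <= patience:
        return False
    best_before = min(loss_history[:-patience])
    return min(loss_history[-patience:]) >= best_before

print(should_stop([0.9, 0.7, 0.6, 0.61, 0.62, 0.63], patience=3))  # True: no improvement in last 3 epochs
print(should_stop([0.9, 0.7, 0.6, 0.61, 0.55, 0.63], patience=3))  # False: epoch 5 improved the loss
```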

# Edge device

Go to: Device

# Epoch

One iteration of learning a neural network model, performed on the entire dataset (i.e., going through all images once).

See also: Iteration

# Exposure

In the Data Augmentation process:

Random change of exposure (brightness) during training.

See also: Data Augmentation

# Flip

In the Data Augmentation process:

A vertical or horizontal image reflection.

See also: Data Augmentation

# Framework

A set of off-the-shelf tools with pre-built models and a development environment, available to developers to facilitate and standardize their work on artificial intelligence solutions.

# Gaussian noise

In the Data Augmentation process:

Random addition of Gaussian noise to processed images.

See also: Data Augmentation

# Hardware

Key components (e.g., GPU, CPU) of the devices on which neural network models run.

# Hue

In the Data Augmentation process:

A random change of hue (color) during training - Source.

See also: Data Augmentation

# IP address

A numeric identifier, assigned to: a network interface, a group of interfaces (broadcast, multicast), or an entire computer network in the IP protocol. It identifies network elements within and outside the local network (the so-called public address).

# Iteration

The act of repeating the same operation in a loop a predetermined number of times or until a specified condition is met.

In the context of neural networks, this refers to the repetitive activity of learning and then verifying the effects of that learning to progressively improve accuracy and reduce loss.

# Jitter

In the Data Augmentation process:

The operation of randomly cropping and resizing images, changing the aspect ratio from x(1 - 2 \cdot jitter) to x(1 + 2 \cdot jitter) (this data augmentation parameter is taken only from the last layer) - Source.

# Label

A marker assigned to an object, containing information about its location and name (category/class).

# Learning rate

A hyperparameter that controls how much we adjust the weights of our network with respect to the loss gradient. The lower the value, the slower we move along the downward slope - Source.
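A single gradient-descent update illustrating the role of the learning rate (a sketch, not any particular framework's API):

```python
def gradient_step(weights, gradients, learning_rate):
    """One update step: move each weight against its loss gradient,
    scaled by the learning rate."""
    return [w - learning_rate * g for w, g in zip(weights, gradients)]

# A small learning rate takes a small step down the loss slope:
print(gradient_step([0.5, -0.2], [1.0, -2.0], learning_rate=0.1))  # [0.4, 0.0]
```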

# Loss

Go to: Loss Function

# Loss function

A loss function is a function that computes the distance between the current output of the algorithm and the expected output. It is a method of evaluating how your algorithm models the data. It can be divided into two groups. One for classification (discrete values, 0,1,2…) and the other for regression (continuous values) - Source.
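Two common examples, one per group, written as minimal illustrative implementations:

```python
import math

def mse_loss(predicted, expected):
    """Regression loss: mean squared distance between outputs."""
    return sum((p - e) ** 2 for p, e in zip(predicted, expected)) / len(expected)

def cross_entropy_loss(probabilities, true_index):
    """Classification loss: negative log-probability of the true category."""
    return -math.log(probabilities[true_index])

print(mse_loss([2.0, 3.0], [1.0, 3.0]))                   # 0.5
print(round(cross_entropy_loss([0.1, 0.8, 0.1], 1), 4))   # 0.2231
```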

# Manipulation

The process of generating new images from input images using user-selected operations (rotation, resizing, brightness, and contrast).

See also: Manipulator

# Manipulator

A service integrated with the OSAI portal. Manipulator generates new images from input images using parallelized operations on a high-performance GPU.

See also: Data Augmentation

# Marketplace

In a marketplace, users can sell their own datasets or purchase them from other users.

See also: Publication and Dataset publication

# Metric

A function used to evaluate the model performance. Metric functions are similar to loss functions, except that the results of the metric evaluation are not used when training the model - Source.

# Metric function

Go to: Metric

# Model

A file-based representation of a trained deep neural network, including information about its layers and weights.

# Model loss

Go to: Loss

# Model Template

A template (preset) that stores the values of the neural network's learning parameters. The user can use the template to configure the training.

# Momentum

Accumulation of movement; how much the history affects the further change of the weights (optimizer) - Source.

# Mosaic

Go to: Mosaic bound

# Mosaic bound

Limits the size of objects when Mosaic is checked (does not allow bounding boxes to leave the borders of their images when Mosaic data augmentation is used) - Source.

# Output format

One of the Manipulator's settings is the output format of the photos - JPG or PNG. If you choose JPG, keep in mind that the newly generated photos will lose the alpha channel responsible for transparency.

See also: Manipulation

# Overfitting

Overfitting is a common issue in machine learning, in which a model becomes overly specialized in recognizing patterns from training data, to the point that it performs poorly on new, unseen data. This can occur when the model over-adjusts to specific instances in the training set, potentially causing reduced accuracy when encountering objects in real-world scenarios or positions not well-represented during training. Preventing overfitting is crucial for ensuring the model's robust performance in diverse and dynamic environments.

# Precision

A measure of the model's accuracy based on how well each detected object matches the category and position of the corresponding real object.

# Prediction

A single neural network result, that is, a single classification category for an entire image or a single detected object. One image can have one classification prediction and 0 or more detection predictions.

# Pretrained model

A neural network prepared within the framework, with a predefined layer-weight architecture. It can be customized by parameterizing, modifying layers, and then training on specified datasets.

# Publication

A process of making datasets available for purchase.

See also: Dataset publication

# Rotation

In the Dataset Manipulator:

An operation used to change the position angle of an image. Based on a set of input images (or, if not the first operation, already generated data), it iteratively creates a series of new images. In each iteration, the input image is rotated by a multiple of the angle specified in the parameter. The loop ends when the rotation reaches 360 degrees.

For example: for an angle of 30 degrees, 12 new images rotated by angles of 30°, 60°, 90°, 120°, 150°, 180°, 210°, 240°, 270°, 300°, 330° and 360° will be generated.

See also: Manipulator
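The angle sequence from the example can be reproduced with a short sketch (assuming the step divides 360 evenly):

```python
def rotation_angles(step):
    """All rotation angles generated for a given step, up to and including 360 degrees."""
    return list(range(step, 361, step))

angles = rotation_angles(30)
print(len(angles), angles[:3], angles[-1])  # 12 [30, 60, 90] 360
```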

# Resize

In the Dataset Manipulator:

An operation used to change the dimensions of an image. The modification is performed on all input images (or, if this is not the first operation, on all that have already been generated). The size of the output image can be generated in resolutions from 8x8 to 4096x3112 px. It is also possible to change the aspect ratio.

See also: Manipulator

# S3 storage

A server that can store large amounts of data and make it available to clients on demand.

# Saturation

In the Data Augmentation process:

An operation that randomly changes the saturation of images during training. Source

See also: Data augmentation; Darknet

# Sharing

A practice of making a collection of data available to others for access, analysis and/or use.

See also: Dataset sharing

# Subdivision

The process of dividing a set of images of the size specified by the batch parameter (e.g., 64) into smaller subsets. This is done to optimize the use of system resources without affecting the results obtained during learning.

See also: Batch
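The relationship between batch and subdivisions can be sketched as follows (a sketch of the batch/subdivisions convention, not OSAI's implementation):

```python
def mini_batch_size(batch, subdivisions):
    """Images processed at once on the GPU: the batch split into `subdivisions`
    parts. Weights are still updated only once per full batch."""
    if batch % subdivisions != 0:
        raise ValueError("batch must be divisible by subdivisions")
    return batch // subdivisions

print(mini_batch_size(64, 16))  # 4 images per forward pass
```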

# Template

Go to: Model template

# Training

The process of training a neural network that, in successive iterations/epochs and using specific parameters, aims to gradually improve the accuracy/precision of the network and reduce the value of the loss function (see Loss).

The trained model is then saved to S3 Storage.

See also: Model; S3 Storage; Accuracy

# Validation

A process embedded in model training that calculates the accuracy of a model on a validation subset.

# Validation accuracy

Accuracy calculated on a validation subset.

# Worker

A worker is a machine that performs tasks in the background while the application is running and processing other requests. It can perform a variety of tasks, such as:

  • preparing and manipulating a dataset,
  • creating, training and converting an AI model,
  • renting and releasing devices,
  • making predictions.

As tasks are added to the queue, the worker retrieves them from there and performs them in the background, allowing simultaneous processing of AI tasks without disrupting the user experience. For example, training models can be time-consuming, and using a worker allows you to perform other tasks while the model is being trained in the background.

There are several workers in our application. Their number in the system is scalable and can change over time.