#
Glossary
A glossary has been created to make this documentation easier to use. Here you will find definitions of key terms you may encounter while using ONESTEP AI.
#
Accuracy
A measure of the recognition performance of a neural network model. It usually represents the percentage of correct recognitions among all recognitions, calculated on a set of data different from the training set.
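For illustration, a minimal sketch of how such a percentage can be computed (the function and variable names are hypothetical, not part of ONESTEP AI):

```python
# Minimal sketch: accuracy as the share of correct recognitions on a test set.
def accuracy(predicted_labels, true_labels):
    correct = sum(p == t for p, t in zip(predicted_labels, true_labels))
    return correct / len(true_labels)

print(accuracy(["cat", "dog", "cat"], ["cat", "dog", "dog"]))  # 0.66... -> about 67%
```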
#
Adversarial learning
A data augmentation technique that modifies all detected objects so that they look different from the neural network's point of view. In effect, the neural network launches an adversarial attack on itself.
#
AI'GORITHM
AI'GORITHM is an application for creating algorithms that allows the practical use of ONESTEP AI-trained models in a wide range of scenarios. The aim is to allow users to independently develop their own end applications, which they can then deploy on their own or rented devices, creating a unique AI environment.
#
AI'SPECTOR
AI'SPECTOR is a system designed for experienced users, serving as an alternative to AI'VIEWER. It operates continuously, running algorithms indefinitely until manually stopped, providing uninterrupted processing and monitoring. Additionally, it can operate according to a schedule set by the user, who can configure different time intervals for each day of the week. This system allows for the execution of algorithms created in AI'GORITHM, enabling users to parameterize these algorithms by selecting specific inputs, such as connected cameras or uploaded videos.
#
AI'VIEWER
An application installed on an edge device to run a trained and properly optimized neural network model. The model can be used on an image from a camera connected to the device and streaming real-time video, or on a photo/video file.
#
Angle
The rotation angle of the photo, set in the Rotation section of the Dataset Manipulator. The photo is rotated by consecutive multiples of the set angle, until it reaches 360 degrees, or until the limit of photos generated from a single photo by the Rotation operation is reached.
#
Annotation
The image annotation stores information about the labels describing the depicted objects and any additional information, depending on the selected standard.
#
Batch/Batch size
The number of images that the CPU or GPU is processing simultaneously at a given point during the training process.
#
Brightness
In the Dataset Manipulator:
Brightness parametrization for images newly generated by the Brightness and Contrast operation.
#
Brightness var (Brightness variance)
The range of brightness changes for newly generated images, on a scale from 0 to 255.
#
Brightness and Contrast
In the Dataset Manipulator:
An operation used to change the brightness and/or contrast of an image. Based on a set of input images (or, if this is not the first operation, already generated data), it produces the number of new images specified in the parameter. The formula used here is:
α ⋅ i(x,y) + β
Where:
- α - the parameter responsible for the contrast change, in the range [0.1, 3.0]
- i(x,y) - the value of the input pixel of the image at coordinates x and y
- β - the parameter responsible for the brightness change, in the range [0, 255]
Each created image will have a brightness value that is the sum of the original value and a random value from the [-β, β] range. The parameter α, responsible for the contrast, will also be generated randomly, taking into account the lower and upper values specified in the parameter (they cannot exceed the range specified above).
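For illustration, a minimal sketch of how a single new image could be derived with this formula (using NumPy; the function name and default values are hypothetical, not the Manipulator's internals):

```python
import numpy as np

def brightness_contrast(image, alpha_min=0.5, alpha_max=1.5, beta=40):
    """Apply alpha * i(x, y) + beta with randomly drawn alpha and beta.

    alpha is drawn from [alpha_min, alpha_max] (both must stay within [0.1, 3.0]),
    the brightness shift is drawn from [-beta, beta] (beta itself lies in [0, 255]).
    """
    alpha = np.random.uniform(alpha_min, alpha_max)
    shift = np.random.uniform(-beta, beta)
    out = alpha * image.astype(np.float32) + shift
    return np.clip(out, 0, 255).astype(np.uint8)  # keep valid pixel values
```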
#
Bucket
Category container. All categories added to it will be merged into a single category matching the name of the bucket, but only in the context of a particular training. The originally defined categories will remain intact.
#
Burn in
Initial iterations of training; a 'warm-up' during which the learning rate changes dynamically.
During the first n batches (iterations), the learning rate increases according to the formula: current\_learning\_rate = learning\_rate \cdot \left(\frac{iterations}{n}\right)^{power}, where power = 4 by default. This parameter cannot be greater than the maximum number of batches.
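For illustration, a minimal sketch of this warm-up schedule (names are illustrative only):

```python
def burn_in_learning_rate(iteration, learning_rate, n, power=4):
    """Learning rate used during the first n batches (burn in)."""
    if iteration >= n:
        return learning_rate  # after burn in, the regular schedule applies
    return learning_rate * (iteration / n) ** power

# Example: learning_rate=0.001, n=1000 -> at iteration 500 the rate is
# 0.001 * 0.5**4 = 0.0000625.
```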
#
Category
Used interchangeably with:
- classification category (i.e., a category for training a classification model in which each image belongs to only one category)
or
- detection label category (i.e., a category for training a detection model in which an image has 0 or more object categories assigned to it along with their position coordinates).
It is possible to combine categories. This results in the creation of a super-category that contains the parameters of the sub-categories selected by the user. A super-category in OSAI is called a bucket.
#
Classification
By classification, we mean giving the entire image a single category or class that defines the object depicted in the image.
#
Cloning
A process of creating exact copies of existing datasets.
See also: Dataset cloning
#
Confusion matrix
A matrix representing the number of correct and incorrect model recognitions on a given test dataset. It compares the categories that should be recognized (rows) with the categories recognized by the model (columns). Correct recognitions populate the cells on the diagonal of the matrix; the remaining cells contain incorrect recognitions.
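For illustration, a minimal sketch of how such a matrix can be filled in (names are hypothetical):

```python
def confusion_matrix(expected, recognized, categories):
    """Rows: expected categories; columns: categories recognized by the model."""
    index = {c: i for i, c in enumerate(categories)}
    matrix = [[0] * len(categories) for _ in categories]
    for e, r in zip(expected, recognized):
        matrix[index[e]][index[r]] += 1
    return matrix  # correct recognitions land on the diagonal

print(confusion_matrix(["cat", "dog", "dog"], ["cat", "dog", "cat"], ["cat", "dog"]))
# [[1, 0], [1, 1]] -> one "dog" was incorrectly recognized as "cat"
```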
#
Contrast
The range of contrast change for newly generated images, on a scale from 0.1 to 3.0.
#
Conversion
The process of transforming a model from its initial format to the target format. Some information may be lost in the process, resulting in a slight loss of precision. However, conversion usually improves the overall model performance.
OSAI automatically selects the appropriate format, providing a solution optimized for the target hardware.
#
Crop threshold
The percentage limit for clipping annotated objects (in images generated by a rotation operation).
A rotation operation may cause some part of an annotated object to be outside the image, i.e. to be clipped. The cropping threshold determines how much of the object must be visible in the resulting image for the annotation to make sense. Annotations of objects with less visibility than the indicated percentage will be removed.
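For illustration, a minimal sketch of the rule, assuming the object's original area and its area still visible after rotation are already known (the helper name and threshold are hypothetical):

```python
def keep_annotation(visible_area, original_area, crop_threshold=0.6):
    """Keep the annotation only if at least crop_threshold (e.g. 60%) of the
    object remains visible in the rotated image."""
    return (visible_area / original_area) >= crop_threshold

print(keep_annotation(visible_area=450, original_area=1000))  # False -> annotation removed
```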
#
Dashboard
The main view of the user interface. Displays information about all recent actions in the application, such as recently used datasets, recently trained models or recently received notifications.
#
Data augmentation
Operations used to increase the amount of data by adding slightly modified copies of already existing data or newly created synthetic data from existing data. It acts as a regularizer and helps reduce overfitting when training a machine learning model - Source.
#
Dataset
A collection of data representing classes of objects to be recognized by a trained neural network model. It consists of images and (possibly) annotations added to them.
#
Dataset manipulator
Go to: Manipulator
#
Decay
A weaker update of the weights for typical features; it eliminates imbalance in the dataset (optimizer) - Source.
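For illustration, a minimal sketch of weight decay applied inside a single gradient step (notation is illustrative, not OSAI internals):

```python
def sgd_step_with_decay(weight, gradient, learning_rate, decay=0.0005):
    # Each update also shrinks the weight slightly (the decay term).
    return weight - learning_rate * (gradient + decay * weight)
```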
#
Detection
Recognition, i.e. the information returned by the neural network model that an object of a given category has been detected in a given area of the image.
#
Device
Edge device; an endpoint device on which we can run trained and/or converted neural network models. Running the model on the device is called deployment.
#
DNN (Deep Neural Network)
A Deep Neural Network (DNN) is an artificial neural network with multiple layers between the input and output layers. There are different types of neural networks, but they always consist of the same components: neurons, synapses, weights, biases and functions. These components function in a similar way to the human brain and can be trained like any other ML algorithm - Source.
#
Early Stop
A setting that allows the model to stop training early if the loss value does not decrease during the specified number of epochs.
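For illustration, a minimal sketch of such a stopping rule (the class and parameter names are hypothetical):

```python
class EarlyStop:
    """Stop training when the loss has not improved for `patience` epochs."""

    def __init__(self, patience=10):
        self.patience = patience
        self.best_loss = float("inf")
        self.epochs_without_improvement = 0

    def should_stop(self, current_loss):
        if current_loss < self.best_loss:
            self.best_loss = current_loss
            self.epochs_without_improvement = 0
        else:
            self.epochs_without_improvement += 1
        return self.epochs_without_improvement >= self.patience
```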
#
Edge device
Go to: Device
#
Epoch
One iteration of learning a neural network model, performed on the entire dataset (i.e., going through all images once).
#
Exposure
In the Data Augmentation process:
Random change of exposure (brightness) during training.
#
Flip
In the Data Augmentation process:
A vertical or horizontal image reflection.
#
Framework
A set of off-the-shelf tools with pre-built models and a development environment, available to developers to facilitate and standardize their work on artificial intelligence solutions.
#
Gaussian noise
In the Data Augmentation process:
Random addition of Gaussian noise to processed images.
#
Hardware
Key components (e.g., GPU, CPU) of the devices on which neural network models run.
#
Hue
In the Data Augmentation process:
A random change of hue (color) during training - Source.
#
IP address
A numeric identifier assigned in the IP protocol to a network interface, a group of interfaces (broadcast, multicast), or an entire computer network. It identifies network elements within and outside the local network (the so-called public address).
#
Iteration
The act of repeating the same operation in a loop a predetermined number of times or until a specified condition is met.
In the context of neural networks, this refers to the repetitive activity of learning and then verifying the effects of that learning to progressively improve accuracy and reduce loss.
#
Jitter
In the Data Augmentation process:
The operation of randomly cropping and resizing images, with the aspect ratio changing in the range from x(1 - 2 \cdot jitter) to x(1 + 2 \cdot jitter) (this data augmentation parameter is used only from the last layer) - Source.
#
Label
A marker assigned to an object, containing information about its location and name (category/class).
#
Learning rate
A hyperparameter that controls how much we adjust the weights of our network with respect to the loss gradient. The lower the value, the slower we move along the downward slope - Source.
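For illustration, a minimal sketch of a single gradient-descent update scaled by the learning rate (names are illustrative):

```python
def gradient_step(weight, loss_gradient, learning_rate=0.001):
    # Move the weight against the loss gradient; the learning rate scales the step.
    return weight - learning_rate * loss_gradient
```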
#
Loss
Go to: Loss function
#
Loss function
A loss function is a function that computes the distance between the current output of the algorithm and the expected output. It is a method of evaluating how well your algorithm models the data. Loss functions can be divided into two groups: one for classification (discrete values, 0, 1, 2, …) and the other for regression (continuous values) - Source.
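For illustration, a minimal sketch of one loss function from each group (mean squared error for regression, cross-entropy for classification):

```python
import math

def mse(expected, predicted):
    """Mean squared error - a typical regression loss."""
    return sum((e - p) ** 2 for e, p in zip(expected, predicted)) / len(expected)

def cross_entropy(true_class_probability):
    """Cross-entropy for a single sample - a typical classification loss."""
    return -math.log(true_class_probability)

print(mse([1.0, 2.0], [1.5, 1.5]))  # 0.25
print(cross_entropy(0.9))           # ~0.105 (small loss for a confident, correct prediction)
```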
#
Manipulation
The process of generating new images from input images using user-selected operations (rotation, resizing, brightness, and contrast).
#
Manipulator
A service integrated with the OSAI portal. Manipulator generates new images from input images using parallelized operations on a high-performance GPU.
#
Marketplace
In the Marketplace, users can sell their own datasets or purchase datasets from other users.
#
Metric
A function used to evaluate the model performance. Metric functions are similar to loss functions, except that the results of the metric evaluation are not used when training the model - Source.
#
Metric function
Go to: Metric
#
Model
A file-based representation of a trained deep neural network, including information about its layers and weights.
#
Model loss
Go to: Loss function
#
Model Template
A template (preset) that stores the values of the neural network's learning parameters. The user can use the template to configure the training.
#
Momentum
Accumulation of movement; how much the history affects the further change of the weights (optimizer) - Source.
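For illustration, a minimal sketch of a momentum update, where part of the previous change is carried over into the new one (notation is illustrative, not OSAI internals):

```python
def momentum_step(weight, velocity, gradient, learning_rate=0.001, momentum=0.9):
    # The history of updates (velocity) keeps influencing the new change of the weight.
    velocity = momentum * velocity - learning_rate * gradient
    return weight + velocity, velocity
```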
#
Mosaic
Go to: Mosaic bound
#
Mosaic bound
Limits the size of objects when mosaic is checked (does not allow bounding boxes to leave the borders of their images when Mosaic-data-augmentation is used) - Source.
#
Output format
One of the Manipulator's settings is the output format of the photos - JPG or PNG. If you choose JPG, keep in mind that the newly generated photos will lose the alpha channel responsible for transparency.
#
Overfitting
Overfitting is a common issue in machine learning, in which a model becomes overly specialized in recognizing patterns from training data, to the point that it performs poorly on new, unseen data. This can occur when the model over-adjusts to specific instances in the training set, potentially causing reduced accuracy when encountering objects in real-world scenarios or positions not well-represented during training. Preventing overfitting is crucial for ensuring the model's robust performance in diverse and dynamic environments.
#
Precision
A measure of model quality based on how well the detected object matches the category or position of the real object.
#
Prediction
A single neural network result, that is, a single classification category for an entire image or a single detected object. One image can have one classification prediction and 0 or more detection predictions.
#
Pretrained model
A neural network prepared within the framework, with a predefined layer-weight architecture. It can be customized by parameterizing, modifying layers, and then training on specified datasets.
#
Publication
A process of making datasets available for purchase.
See also: Dataset publication
#
Rotation
In the Dataset Manipulator:
An operation used to change the position angle of an image. Based on a set of input images (or, if not the first operation, already generated data), it iteratively creates a series of new images. In each iteration, the input image is rotated by a multiple of the angle specified in the parameter. The loop ends when the rotation reaches 360 degrees.
For example: for an angle of 30 degrees, 12 new images rotated by angles of 30°, 60°, 90°, 120°, 150°, 180°, 210°, 240°, 270°, 300°, 330° and 360° will be generated.
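For illustration, a minimal sketch of this loop using Pillow (the library choice and function name are assumptions, not the Manipulator's implementation):

```python
from PIL import Image

def rotate_series(path, angle=30):
    """Generate images rotated by consecutive multiples of `angle`, up to 360 degrees."""
    image = Image.open(path)
    rotated = []
    current = angle
    while current <= 360:
        rotated.append(image.rotate(current, expand=True))
        current += angle
    return rotated  # for angle=30 this yields 12 images (30, 60, ..., 360 degrees)
```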
#
Resize
In the Dataset Manipulator:
An operation used to change the dimensions of an image. The modification is performed on all input images (or, if this is not the first operation, on all that have already been generated). The size of the output image can be generated in resolutions from 8x8 to 4096x3112 px. It is also possible to change the aspect ratio.
#
S3 storage
A server that can store large amounts of data and make it available to clients on demand.
#
Saturation
In the Data Augmentation process:
An operation that randomly changes the saturation of images during training - Source.
#
Sharing
A practice of making a collection of data available to others for access, analysis and/or use.
See also: Dataset sharing
#
Subdivision
The process of dividing a set of images of the size specified by the batch parameter (64) into smaller subsets. This is done to optimize the use of system resources without affecting the results obtained during learning.
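For illustration, a minimal sketch of splitting one batch into mini-batches (the subdivision value of 8 is only an example; names are illustrative):

```python
def mini_batches(images, batch=64, subdivisions=8):
    """Split one batch of images into `subdivisions` smaller subsets;
    the weights are still updated once per full batch."""
    step = batch // subdivisions
    return [images[i:i + step] for i in range(0, batch, step)]

print(len(mini_batches(list(range(64)))))  # 8 mini-batches of 8 images each
```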
#
Template
Go to: Model Template
#
Training
The process of training a neural network that, in successive iterations/epochs and using specific parameters, aims to gradually improve the accuracy/precision of the network and reduce the value of the loss function (see Loss).
The trained model is then saved to S3 Storage.
#
Validation
A process embedded in model training that calculates the accuracy of a model on a validation subset.
#
Validation accuracy
Accuracy calculated on a validation subset.
#
Worker
A worker is a machine that performs tasks in the background while the application is running and processing other requests. It can perform a variety of tasks, such as:
- preparing and manipulating a dataset,
- creating, training and converting an AI model,
- renting and releasing devices,
- making predictions.
As tasks are added to the queue, the worker retrieves them from there and performs them in the background, allowing simultaneous processing of AI tasks without disrupting the user experience. For example, training models can be time-consuming, and using a worker allows you to perform other tasks while the model is being trained in the background.
There are several workers in our application. Their number in the system is scalable and can change over time.