Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.


Page Properties
hiddentrue
idDONE

Product Version2021 Autumn
Report Note
AssigneeAntje

Resources & Remarks

Modification History

NameDateProduct VersionAction
Antje21 JUL 20212021 Autumncreated
Goran18 OCT 20212021 Autumnupdated
Antje25 NOV 20222022 Winterremove beta label, update content



Excerpt

Responsible for preparing data for training, training of machine learning models, evaluation of trained models, and preparing for the deployment to the production.

...

Section
bordertrue


Column

Table of Contents

Table of Contents
exclude(Table of Contents|Read on|Basic Use Case Flows|Preprocessing Metadata using Webhooks|Concept of System Hooks|Another interesting Tutorial|System Hooks|Graphical Overview \/ Use Cases \(Flows\))

Characteristics

Note
titleBeta Version

The Machine Learning (ML) Training Pipeline is a component of the Artificial Intelligence Platform. This platform is not included in yuuvis® Momentum installations and is available as a beta version only on request.


Function

The ML Training Pipeline is part of the Artificial Intelligence Platform responsible for data ingestion, data validation, transformation, machine learning training, and model evaluation. The pipeline is based on MLflow – an open-source platform for managing ML lifecycles.ML Training Pipeline is used via the command line application Kairos CLI.

Data Export

The source of data for machine learning is a document management system, e.g., yuuvis® Momentum. The data exported from yuuvis® Momentum are stored on local storage devices, S3 or Azure Blob Storage, in the format suitable for data ingestionshall be exported in a predefined format and shall be made available to the provided training pipelines

Machine Learning Pipelines

The machine learning pipelines are components developed and shipped by OPTIMAL SYSTEMS GmbH. They contain all necessary procedures and algorithms to train machine learning models for different purposes (e.g., document classification and metadata extraction).

Machine learning pipelines are shipped as Docker containers that run inside the ML Training Pipeline.

At the moment,

...

pipelines can be used for document classification (for instance it can determine whether a document is an invoice, a contract, a sick-leave or something else) and for metadata extraction (for instance, extract the issuing date, total amount and invoice number from an invoice). 

Document Classification

In the context of the AI platform, classification means the determination of suitable typification classes fitting for an object based on its full-text rendition. For one object, one prediction is provided that contains mappings of classes and their corresponding relevance probability as well as a reference on the object in yuuvis® Momentum via objectId.

...

ML Pipeline can analyze the PDF rendition of binary content files assigned to objects in yuuvis® Momentum in order to extract specific metadata. Based on the trained models, predictions for values of specific object properties can be determined. The object properties have to be listed in the Duplicate - Inference Schema where conditions for the values and settings for the prediction responses are also specified.

Machine Learning Training

The training of machine learning models can be run using Kairos CLI App to define what data to use and which ML Pipeline to run in order to get a model for the desired purpose, for example, invoice metadata extraction.

Model Evaluation

After the machine learning training is done, the model is evaluated. By examining training results, the user decides whether the model is suitable for use or needs further tuning of hyperparameters, bigger longer training, larger data set, etc. 

Model Registry

Models that are suitable for further use are stored in the Model Registry component. From the Model Registry component, models can be built dockerized and deployed to the Model Serving infrastructureserving infrastructure (typically, to the same Kubernetes cluster where yuuvis Momentum is running).

Info
iconfalse

Read on


Section


Column
width25%

Inference Schema

Insert excerpt
Inference Schema
Inference Schema
nopaneltrue
 Keep reading


Column
width25%
Kairos CLI App

KAIROS-API Service

Insert excerpt
Kairos CLI AppKairos CLI AppKAIROS-API Service
KAIROS-API Service
nopaneltrue
 Keep reading



Column
width25%

PREDICT-API Service

Insert excerpt
PREDICT-API Service
PREDICT-API Service
nopaneltrue
 Keep reading



...