The enterprise platform to build, deliver, and govern AI
Watch the 15 minute on-demand demo to get an overview of the Domino Enterprise AI Platform.
Models are the central output of data science, and they have tremendous power to transform companies, industries, and society. At the center of every machine learning or artificial intelligence application is the ML/AI model that is built with data, algorithms and code. Even though models look like software and involve data, models have different input materials, different development processes, and different behaviors. The process of creating models is called modeling.
A model is a special type of algorithm. In software, an algorithm is a hard-coded set of instructions to calculate a deterministic answer. Models are algorithms whose instructions are induced from a set of data and are then used to make predictions, recommendations, or prescribe an action based on a probabilistic assessment. The model uses algorithms to identify patterns in the data that form a relationship with an output. Models can predict things before they happen more accurately than humans, such as catastrophic weather events or who is at risk of imminent death in a hospital.
Models dramatically lower the cost of prediction, similar to how semiconductors dramatically lowered the cost of arithmetic. This change makes models the new currency of competitive advantage, strategy, and growth. Models can build on each other. One model’s output acts as the input to another, more complex model and then creates a living, connected, trainable army of decision makers. And for better or worse, models can do so autonomously, with a level of speed and sophistication that humans can’t hope to match.
For organizations that are model-driven, modeling isn’t merely the process of creating models. It’s a framework of processes, tools and protocols that empower the data science team at every step of the data science lifecycle (DSLC).
Model governance is at the heart of modeling, which involves tracking model artifacts from the moment the first model version is constructed to the final model deployed in a production environment. Every time the model is altered or new data is used, the new versions are recorded. The health of models is continuously monitored to watch for anomalies that can creep up due changes in the input data, changes in the marketplace, or any other changes where the input data no longer parallels the data used when building the model. A model that isn’t monitored could begin producing inaccurate answers that lead to poor business performance and continue to do so without the business realizing it.
Modeling tools are generally code based, although some commercial solutions exist to create simple models without code, and libraries and frameworks exist in multiple languages to help data scientists accelerate their work. These tools contain libraries of algorithms that can be leveraged to create models quickly and efficiently. Many modeling tools are open source and rely on Python, although other languages are commonly used such as R, C++, Java, Perl and many more. Some popular tool libraries and frameworks are:
There are thousands of tools available and most models require a multiple tools that are best suited for the type of data and business problem.
In business environments, the inception of a new model is almost always rooted in a problem in need of a solution, like making better-informed decisions, automating procedures, or finding patterns within vast amounts of data.
Once a solution to that problem is identified, it is translated into a business goal, such as predicting inventory shortfalls, or determining credit limits for banking customers. This can then be translated into a technical problem to be solved using ML/AI models.
Depending on the type of business problem, and the available data, an approach is determined that is best suited for the problem. There are various types of machine learning approaches, including:
Once data is acquired, it is prepared as needed for the specific approach and may include removing unnecessary or duplicate data from the data set. The data scientist will then conduct experiments with different algorithms and compare the performance on a different set of data. As an example, an image recognition model would be trained on one set of images and then tested on a fresh set of images to ensure it will perform as required. Once the performance meets the needs of the business problem, it will be ready for deployment.
Even after a model appears to be stable and is working within expected parameters, it still needs to be monitored. This can be done automatically using the Domino Model Monitor platform by specifying model accuracy metrics and then having the platform notify you if the model performs outside of those metrics. For example, if the model accuracy starts to decline, it will be necessary to investigate the cause and either retrain the model using an updated representation of the data, or even build a new model altogether.
Model risk, whereby a model is used and fails to meet the desired outcome, can be exceptionally dangerous if safeguards are not implemented to manage it. This comes down to model risk management. If you’re familiar with finance, you’ll find that model risk management in data science is similar to risk management for financial models. There are five key aspects to model risk management, of which model governance is one.
Regardless of the model performance, proper model risk management is needed to ensure that the model adheres to the desired and regulatory requirements for the organization.
As organizations grow, the complexity required to manage, develop, deploy and monitor models across teams and tools can bring progress to a standstill. Enterprise MLOps platforms are purpose built to help scale data science across different teams and tools with common governance and model risk management frameworks, while supporting standardized development, deployment, and monitoring processes for all types of models. Model-driven organizations, fueled by Enterprise MLOps, are able to cash in on the new currency of competitive advantage, strategy, and growth - AI/ML models.

David Weedmark is a published author who has worked as a project manager, software developer and as a network security consultant.
Watch the 15 minute on-demand demo to get an overview of the Domino Enterprise AI Platform.
Watch the 15 minute on-demand demo to get an overview of the Domino Enterprise AI Platform.