Dojo

An ecosystem for model and data registration.

Dojo is a suite of software tools that allows domain experts to register their models using an intuitive web based terminal emulator. Additionally, Dojo provides a mechanism for analysts and domain experts to register and transform datasets for use in downstream modeling workflows.

Contents

  1. Model Registration
  2. Data Registration
  3. CauseMos Compliant Format
  4. Dojo API
  5. Model Registration “Cheatsheet”

Model Registration Overview

The model registration workflow is designed to provide domain modelers with a friendly and familiar environment from which to install, setup, and execute their model. Throughout this process, the modeler is queried for key information about model metadata and how to parameterize the model. Additionally, the modeler is able to annotate model inputs to enhance model explainability. The modeler also annotates model outputs so that they can be automatically transformed into a ready-to-use and well understood format each time the model is executed. The final step of the model registration workflow is the publication of a working “black box” model Docker container to Dockerhub. These containers can run the model with a set of explicit directives; however the Dojo modeling engine can be ignorant as to what occurs within the container itself.

Data Registration Overview

The data registration workflow is designed to allow analysts, modelers and other users to bring your own data (“BYOD”). These datasets are transformed into indicators which can be used by top-down modeling engines. Typical examples of data that might be registered are indicators from the World Bank, FAO, or ACLED, however users can bring anything they think will be useful for modeling so long as it is either a CSV, Excel file, GeoTIFF or NetCDF.

The goal of this process is to capture metadata about the dataset’s provenance as well as each of its features and to transform it into a ready-to-use and well understood format. You can learn more about the data registration process here.

CauseMos Compliant Format

This section is intended to provide an overview of the CauseMos compliant format and to elaborate on various edge cases associated with data transformations. It is important to note that this format is a target: the goal of the Dojo data workflow is to enable an analyst or modeler to flexibly annotate their model output or indicator data and convert it into this target format.

Dojo API

This section provides an overview of how to interact with the Dojo API for model discovery, model execution, fetching model runs, and debugging models.