oumi.core.launcher#
Launcher module for the Oumi (Open Universal Machine Intelligence) library.
This module provides base classes for cloud and cluster management in the Oumi framework.
These classes serve as foundations for implementing cloud-specific and cluster-specific launchers for running machine learning jobs.
- class oumi.core.launcher.BaseCloud[source]#
Bases:
ABC
Base class for resource pool capable of creating clusters.
- abstract get_cluster(name: str) BaseCluster | None [source]#
Gets the cluster with the specified name, or None if not found.
- abstract list_clusters() list[BaseCluster] [source]#
Lists the active clusters on this cloud.
- class oumi.core.launcher.BaseCluster[source]#
Bases:
ABC
Base class for a compute cluster (job queue).
- class oumi.core.launcher.JobStatus(name: str, id: str, status: str, cluster: str, metadata: str, done: bool)[source]#
Bases:
object
Dataclass to hold the status of a job.
- cluster: str#
The cluster to which the job belongs.
- done: bool#
A flag indicating whether the job is done. True only if the job is in a terminal state (e.g. completed, failed, or canceled).
- id: str#
The unique identifier of the job on the cluster
- metadata: str#
Miscellaneous metadata about the job.
- name: str#
The display name of the job.
- status: str#
The status of the job.