MLflow adapter for CrateDB
Project description
MLflow adapter for CrateDB
» Documentation | Changelog | PyPI | Issues | Source code | License | CrateDB | Community Forum
About
MLflow is an open source AI engineering platform for managing the whole ML lifecycle for agents, LLMs, and ML models, including experimentation, reproducibility, and deployment.
CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. CrateDB is based on Lucene and Elasticsearch, but compatible with PostgreSQL.
Details
MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.
The MLflow adapter for CrateDB is an adapter for MLflow to use CrateDB as a storage database for its various subsystems.
Features
The MLflow Experiment Tracking subsystem is an API and UI for logging and recording parameters, code versions, metrics, and output files when running your machine learning code, and for later visualizing the results by querying experiments across code, data, and config.
The MLflow Model Registry is a centralized model store, set of APIs and a UI designed to collaboratively manage the full lifecycle of a machine learning model, including lineage, versioning, aliasing, metadata tagging, and annotation support.
The MLflow Dataset Tracking module is a comprehensive solution for dataset management throughout the ML model development workflow. It enables you to track, version, and manage datasets used in training, validation, and evaluation, providing complete lineage from raw data to model predictions.
What's inside
The source code of the mlflow-cratedb package, which implements
the MLflow adapter for CrateDB. It works with both CrateDB and
CrateDB Cloud.
The source code is effectively a few monkey patches that amalgamate MLflow with the necessary changes to support CrateDB. The patches are curated until the adapter can eventually be upstreamed into MLflow mainline as another storage database type.
Usage
General information about what you can do with MLflow, and how, can be found on the Machine Learning with CrateDB and examples about MLflow and CrateDB.
Install
For installation per PyPI package, OCI image, and usage information, please refer to the handbook document, and the documentation about container usage.
The repository also includes a ready-to-run Compose project that provisions MLflow, CrateDB, and RustFS, see MLflow and CrateDB with Docker Compose.
To set up a development environment on your workstation, please refer to the development sandbox documentation. When you see the software tests succeed, you should be ready to start hacking.
Project Information
Resources
Contributing
The MLflow adapter for CrateDB is an open-source project, and is managed on GitHub. Feel free to use the adapter as provided or else modify / extend it as appropriate for your own applications.
Any kind of contribution, feedback, or patch, is much welcome. Create an issue or submit a patch if you think we should include a new feature, or to report or fix a bug.
Acknowledgements
Kudos to the authors of all the many software components this library is inheriting from and building upon, with special thanks to Siddharth Murching, Corey Zumar, Harutaka Kawamura, Ben Wilson, and all other contributors for conceiving and maintaining MLflow.
Andreas Nigg for contributing the tracking_merlion.py and tracking_pycaret.py ML experiment programs, using Merlion and PyCaret.
License
The project is licensed under the terms of the Apache License 2.0, like MLflow and CrateDB, see LICENSE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mlflow_cratedb-3.12.0.tar.gz.
File metadata
- Download URL: mlflow_cratedb-3.12.0.tar.gz
- Upload date:
- Size: 33.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
76fed987d2348e14c30520db1daa3dbac622b0e873eeab0e33541eae7d77644c
|
|
| MD5 |
8063ad13b8d238e4d302ad243d21b14a
|
|
| BLAKE2b-256 |
0bd88fcf501dacd2da212833149821cceb25497b5722d91a62632c2ac37891f1
|
File details
Details for the file mlflow_cratedb-3.12.0-py3-none-any.whl.
File metadata
- Download URL: mlflow_cratedb-3.12.0-py3-none-any.whl
- Upload date:
- Size: 24.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
026d5598ad6bc3f1d62d49e72d9ba6e8b9ebe619b5315232793ad07e931b9bbe
|
|
| MD5 |
86fca609ed9bd623860f3fb68e1439d4
|
|
| BLAKE2b-256 |
5ef039fd72eca1cd3852f47a8b31c9d9c3548b94e74bae02395987a242810b6c
|