Automatically generate python API package from a SPARQL endpoint using its VoID descriptive metadata

These details have not been verified by PyPI

Project links

Project description

✨ SPARQL API code generator 🐍

A CLI tool to automatically generate a python package from a SPARQL endpoint VoID description.

It will generate a folder with all requirements for publishing a modern python package containing the classes to automatically work with the data in the endpoint.

Features:

Each class in the endpoint will be defined as a python class, with fields for each predicates available on a class.
It will use the classes and predicates labels from their ontology when possible to generate the python classes and their fields
Type annotations are used for better autocompletion
Fields of a class are retrieved when the field is called (lazy 🦥)

🪄 Usage

Requirements: Python >=3.9

Install the package with pip or pipx:
```
pipx install sparql-api-codegen
```

Generate the code for a SPARQL endpoint which contains a SPARQL Service Description:

sparql-api-codegen <sparql-endpoint-url> <folder-for-generated-python-pkg> -i <iri-of-class-to-ignore>

Once the folders have been generated you can get into the folder, check and improve the instructions to run in the README.md, improve the metadata in the pyproject.toml

Optionally you can ignore some classes. For some endpoints this will be required if the label generated for 2 classes are identical, e.g. for Bgee:

sparql-api-codegen "https://www.bgee.org/sparql/" "bgee-api" \
	-i http://purl.obolibrary.org/obo/CARO_0000000 \
	-i http://purl.obolibrary.org/obo/SO_0000704 \
	-i http://purl.obolibrary.org/obo/NCIT_C14250

Example python API for Bgee:

from bgee_api import AnatomicalEntity, Gene, GeneExpressionExperimentCondition


if __name__ == "__main__":
    all_anats = AnatomicalEntity.get()
    print(len(all_anats), all_anats[0])

    anat = AnatomicalEntity("http://purl.obolibrary.org/obo/AEO_0000013")
    print(anat)
    print(anat.label)
    print(anat.expresses)

    gene= Gene("http://omabrowser.org/ontology/oma#GENE_ENSMUSG00000053483")
    print(gene.label)

    cond = GeneExpressionExperimentCondition("http://bgee.org/#EXPRESSION_CONDITION_101909")
    print(cond.has_a_developmental_stage)
    print(cond.has_anatomical_entity)

For UniProt:

sparql-api-codegen "https://sparql.uniprot.org/sparql/" "uniprot-api" \
	-i http://biohackathon.org/resource/faldo#Region

🧑‍💻 Development setup

The final section of the README is for if you want to run the package in development, and get involved by making a code contribution.

📥️ Clone

Clone the repository:

git clone https://github.com/TRIPLE-CHIST-ERA/sparql-api-codegen
cd sparql-api-codegen

🐣 Install dependencies

Install Hatch, a modern build system, as well as project and virtual env management tool recommended by the Python Packaging Authority. This will automatically handle virtual environments and make sure all dependencies are installed when you run a script in the project:

pipx install hatch

Or you could install in your favorite virtual env:

pip install -e ".[test]"

🛠️ Develop

Test with the Bgee endpoint:

hatch run sparql-api-codegen "https://www.bgee.org/sparql/" "bgee-api" \
    -i http://purl.obolibrary.org/obo/CARO_0000000 \
    -i http://purl.obolibrary.org/obo/SO_0000704 \
    -i http://purl.obolibrary.org/obo/NCIT_C14250

☑️ Run tests

Make sure the existing tests still work by running the test suite and linting checks. Note that any pull requests to the fairworkflows repository on github will automatically trigger running of the test suite;

hatch run test

To display all logs when debugging:

hatch run test -s

♻️ Reset the environment

In case you are facing issues with dependencies not updating properly you can easily reset the virtual environment with:

hatch env prune

Manually trigger installing the dependencies in a local virtual environment:

hatch -v env create

🏷️ New release process

The deployment of new releases is done automatically by a GitHub Action workflow when a new release is created on GitHub. To release a new version:

Make sure the PYPI_TOKEN secret has been defined in the GitHub repository (in Settings > Secrets > Actions). You can get an API token from PyPI at pypi.org/manage/account.
Increment the version number in the pyproject.toml file in the root folder of the repository.
```
hatch version fix
```
Create a new release on GitHub, which will automatically trigger the publish workflow, and publish the new release to PyPI.

You can also build and publish from your computer:

hatch build
hatch publish

TODO

Bulk load with preloaded fields

all_anats_preloaded: list[AnatomicalEntity] = bulk_load(AnatomicalEntity, ["label", "expresses"])
# Or
all_anats_preloaded: list[AnatomicalEntity] = AnatomicalEntity.get(["label", "expresses"])

Allow also to pass a list of IRI (optional, if not we get all?)

Returns pandas matrix with filters?

pandas_matrix = BiologicalEntity.get_matrix(
    filter_has_a_developmental_stage="http://some_dev_stage",
    filter_has_anatomical_entity="some anatomical entity",
)

Also enable to filter on labels instead of IRI?

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.0.1

Dec 18, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sparql_api_codegen-0.0.1.tar.gz (11.3 kB view details)

Uploaded Dec 18, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sparql_api_codegen-0.0.1-py3-none-any.whl (12.9 kB view details)

Uploaded Dec 18, 2024 Python 3

File details

Details for the file sparql_api_codegen-0.0.1.tar.gz.

File metadata

Download URL: sparql_api_codegen-0.0.1.tar.gz
Upload date: Dec 18, 2024
Size: 11.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.27.0

File hashes

Hashes for sparql_api_codegen-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`d31e941d15ed67aa0fad023ad57e5675396ad5d51021c7773ba88e8e92dfdd7f`
MD5	`bf040e24549fc4c792cf4e6725668f4e`
BLAKE2b-256	`299713fc8ede010dbf0cae020c1bf791036c5eb3912572e519c5f2ed1a7e61b4`

See more details on using hashes here.

File details

Details for the file sparql_api_codegen-0.0.1-py3-none-any.whl.

File metadata

Download URL: sparql_api_codegen-0.0.1-py3-none-any.whl
Upload date: Dec 18, 2024
Size: 12.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.27.0

File hashes

Hashes for sparql_api_codegen-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2ca87b7901728dac7aa6e7c22b9c3273edfe74583942b4416620efe2e36ce7fc`
MD5	`291a62433f6ec5926b8c58c4c190c151`
BLAKE2b-256	`c977c30bcd0e7cd1aa790a47df87a5e0b11afb94a88dfcc52f56146ca648ed63`

See more details on using hashes here.

sparql-api-codegen 0.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

✨ SPARQL API code generator 🐍

🪄 Usage

🧑‍💻 Development setup

📥️ Clone

🐣 Install dependencies

🛠️ Develop

☑️ Run tests

♻️ Reset the environment

🏷️ New release process

TODO

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes