Assembler components for the Sayou Data Platform
Project description
sayou-assembler
Overview
The Knowledge Builder for Sayou Fabric.
sayou-assembler acts as the structural engineer of the pipeline. It accepts flat lists of SayouNode objects produced by the Wrapper and constructs meaningful relationships (Edges), embeddings, or specific query payloads.
While the Wrapper focuses on what the data is (Schema), the Assembler focuses on how the data connects (Topology).
1. Architecture & Role
The Assembler takes standardized Nodes and applies a Building Strategy to generate a connected graph structure or vector payloads ready for loading.
graph LR
Nodes[Flat SayouNodes] --> Pipeline[Assembler Pipeline]
subgraph Builders
Graph[General Graph]
Code[Python AST]
Time[Timeline]
Vector[Embeddings]
Cypher[Cypher Query]
end
Pipeline -->|Config Routing| Builders
Builders --> Payload[Connected Payload]
1.1. Core Features
- Topology Construction: Automatically links nodes (e.g., Parent-Child, Function-Class) based on metadata.
- Domain Specificity: Dedicated builders for Python code analysis and Video timelines.
- Bi-directional Linking: Automatically generates reverse edges (e.g.,
contains<->belongsTo) to ensure graph traversability.
2. Available Strategies
sayou-assembler provides specialized builders for different data domains.
| Strategy Key | Builder Class | Description |
|---|---|---|
graph |
GraphBuilder |
[Default] Generic builder. Links nodes based on parent_id and generic relationships. |
code |
CodeGraphBuilder |
[Code] Parses Python AST metadata to link Imports, Classes, and Methods. |
timeline |
TimelineBuilder |
[Media] Sequences video segments or logs chronologically (NEXT, PREV). |
vector |
VectorBuilder |
[Search] Converts node content into vector embeddings. |
cypher |
CypherBuilder |
[DB] Generates Neo4j MERGE statements for idempotent inserts. |
3. Installation
pip install sayou-assembler
4. Usage
The AssemblerPipeline is the entry point. It transforms SayouOutput (from Wrapper) into a dictionary containing nodes and edges.
Case A: General Graph Construction
The default strategy for general documents (PDF, Markdown).
from sayou.assembler import AssemblerPipeline
wrapper_output = {
"nodes": [
{"node_id": "doc:1", "node_class": "sayou:Document", "attributes": {...}},
{"node_id": "doc:1:header", "attributes": {"meta:parent_id": "doc:1"}, ...}
]
}
graph_data = AssemblerPipeline.process(data=wrapper_output)
print(f"Nodes: {len(graph_data['nodes'])}, Edges: {len(graph_data['edges'])}")
Case B: Python Code Graph
Links functions to classes and resolves import dependencies.
from sayou.assembler import AssemblerPipeline
wrapper_output = {
"nodes": [
{"node_id": "doc:1", "node_class": "sayou:Document", "attributes": {...}},
{"node_id": "doc:1:header", "attributes": {"meta:parent_id": "doc:1"}, ...}
]
}
code_graph = AssemblerPipeline.process(data=wrapper_output)
print(f"Dependency Edges: {len(code_graph['edges'])}")
Case C: Multimedia Timeline
Sequences video segments by time.
from sayou.assembler import AssemblerPipeline
wrapper_output = {
"nodes": [
{"node_id": "doc:1", "node_class": "sayou:Document", "attributes": {...}},
{"node_id": "doc:1:header", "attributes": {"meta:parent_id": "doc:1"}, ...}
]
}
timeline_graph = AssemblerPipeline.process(data=wrapper_output)
print(f"Timeline Sequence Created.")
5. Configuration Keys
You can customize the building process via the config dictionary.
graph:create_reverse_edges(bool),default_edge_label.python:resolve_imports(bool),include_external_libs(bool).timeline:interval_threshold(float).vector:embedding_model(e.g., openai, huggingface),batch_size.cypher:merge_mode(bool),batch_size.
6. License
Apache 2.0 License © 2026 Sayouzone
7. Plugin List
| Plugin | Example | Description |
|---|---|---|
Code Graph Assembler |
▶ | |
Timeline Assembler |
▶ |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sayou_assembler-0.5.0.tar.gz.
File metadata
- Download URL: sayou_assembler-0.5.0.tar.gz
- Upload date:
- Size: 42.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ae0a29f0be3eb29a97fc3a1cb761b09d245e0d0dbddcdceb0a30568736474cf6
|
|
| MD5 |
26ed3968eaab290df1d5f00701723096
|
|
| BLAKE2b-256 |
ed6b6019a89641038831a6f82f7df3118208ba0176dd31d8fbe9a39d8b589ac8
|
File details
Details for the file sayou_assembler-0.5.0-py3-none-any.whl.
File metadata
- Download URL: sayou_assembler-0.5.0-py3-none-any.whl
- Upload date:
- Size: 22.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
652b386a88685c508d158fc6949b3faf2b912cc5b584151f3d48109379c718da
|
|
| MD5 |
84879cf22e510f91689de79053e95c84
|
|
| BLAKE2b-256 |
893a94b121895d3aa1e670b8f5e533b4de2e2cd69e228692a3f1025985fc29c5
|