word2vec for itemsets
Project description
itembed — Item embeddings
This is yet another variation of the well-known word2vec method, proposed by Mikolov et al., applied to unordered sequences, which are commonly referred as itemsets.
The contribution of itembed is twofold:
- Modifying the base algorithm to handle unordered sequences, which has an impact on the definition of context windows;
- Using the two embedding sets introduced in word2vec for supervised learning.
A similar philosophy is described by Wu et al. in StarSpace and by Barkan and Koenigstein in item2vec.
itembed uses Numba to achieve high performances.
Getting started
Install from PyPI:
pip install itembed
Or install from source, to ensure latest version:
pip install git+https://github.com/sdsc-innovation/itembed.git
Please refer to the documentation for detailed explanations and examples.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file itembed-0.5.1.tar.gz.
File metadata
- Download URL: itembed-0.5.1.tar.gz
- Upload date:
- Size: 9.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/4.0.2 CPython/3.11.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8ced138c608f9125b1f1b5d0f74483f5092c156ef2d3eda3efd5d48479231597
|
|
| MD5 |
7b964e9b6d412e791b6672c535dad5bd
|
|
| BLAKE2b-256 |
f32f9eff62087b8ce9306fdab99f4ac4be8814162e2f1f1e60a4c3a376ebdd50
|
File details
Details for the file itembed-0.5.1-py3-none-any.whl.
File metadata
- Download URL: itembed-0.5.1-py3-none-any.whl
- Upload date:
- Size: 9.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/4.0.2 CPython/3.11.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
990b3b769620925890e79c9eb1bc85f7c3f3eb11b1e67c2e9190824bd3ea6dbb
|
|
| MD5 |
be0216cf6b9921a95f5a2daa9ca8d4a9
|
|
| BLAKE2b-256 |
f23540e2b3fd9a74019ab5bc3e7f26e1dfa1c8beb34be16df13f163c84b0cc0e
|