Skip to main content

word2vec for itemsets

Project description

itembed — Item embeddings

This is yet another variation of the well-known word2vec method, proposed by Mikolov et al., applied to unordered sequences, which are commonly referred as itemsets. The contribution of itembed is twofold:

  1. Modifying the base algorithm to handle unordered sequences, which has an impact on the definition of context windows;
  2. Using the two embedding sets introduced in word2vec for supervised learning.

A similar philosophy is described by Wu et al. in StarSpace and by Barkan and Koenigstein in item2vec. itembed uses Numba to achieve high performances.

Getting started

Install from PyPI:

pip install itembed

Or install from source, to ensure latest version:

pip install git+https://github.com/sdsc-innovation/itembed.git

Please refer to the documentation for detailed explanations and examples.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

itembed-0.5.1.tar.gz (9.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

itembed-0.5.1-py3-none-any.whl (9.2 kB view details)

Uploaded Python 3

File details

Details for the file itembed-0.5.1.tar.gz.

File metadata

  • Download URL: itembed-0.5.1.tar.gz
  • Upload date:
  • Size: 9.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for itembed-0.5.1.tar.gz
Algorithm Hash digest
SHA256 8ced138c608f9125b1f1b5d0f74483f5092c156ef2d3eda3efd5d48479231597
MD5 7b964e9b6d412e791b6672c535dad5bd
BLAKE2b-256 f32f9eff62087b8ce9306fdab99f4ac4be8814162e2f1f1e60a4c3a376ebdd50

See more details on using hashes here.

File details

Details for the file itembed-0.5.1-py3-none-any.whl.

File metadata

  • Download URL: itembed-0.5.1-py3-none-any.whl
  • Upload date:
  • Size: 9.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for itembed-0.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 990b3b769620925890e79c9eb1bc85f7c3f3eb11b1e67c2e9190824bd3ea6dbb
MD5 be0216cf6b9921a95f5a2daa9ca8d4a9
BLAKE2b-256 f23540e2b3fd9a74019ab5bc3e7f26e1dfa1c8beb34be16df13f163c84b0cc0e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page