Skip to main content

A package for managing data operations using PySpark

Project description

My DataManager

dre_datamanager is a Python package for managing data operations using PySpark. It provides functionalities for extracting unique years from data, caching, joining, and more.

Features

  • Extract unique years from specified date columns
  • Load and cache DataFrames
  • Optimize joins with broadcasting
  • Repartition DataFrames for performance
  • Context manager support for resource cleanup

Installation

pip install dre_datamanager

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dre_datamanager-0.1.0.tar.gz (4.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dre_datamanager-0.1.0-py3-none-any.whl (7.8 kB view details)

Uploaded Python 3

File details

Details for the file dre_datamanager-0.1.0.tar.gz.

File metadata

  • Download URL: dre_datamanager-0.1.0.tar.gz
  • Upload date:
  • Size: 4.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.9

File hashes

Hashes for dre_datamanager-0.1.0.tar.gz
Algorithm Hash digest
SHA256 0f219d17b62d90d4f813bea0d61a796e3070bba5277d53958b302a658e10a627
MD5 9d8c1b0d965de7d7beb969886423c524
BLAKE2b-256 3882c3d3db449b529b384567eff083c5b48c2b2f8260a0c6556fa8865d499609

See more details on using hashes here.

File details

Details for the file dre_datamanager-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for dre_datamanager-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 15093c54565c00f901c22c2d5e80e87d7d909d629ca96f24f9deff3a0b53729d
MD5 5e3cd41b6fe82f51d54081fb26952c99
BLAKE2b-256 b1829a64ccfda42d8aad1713c1943627a52b983abe65e9c545d68f6f4a4c8008

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page