Skip to main content

A lightweight data cleaning library for Python

Project description

Phantom Cleaner 🧹

phantom_cleaner is a simple and efficient Python library to clean your datasets with minimal effort.

Features

  • Handle missing values (mean, median, mode, drop)
  • Remove duplicates
  • Remove outliers using IQR
  • Label encode categorical variables
  • Clean text data (lowercase, remove punctuation and digits)

Installation

pip install phantom_cleaner



#USAGE

# import pandas as pd
# from phantom_cleaner import PhantomCleaner

# df = pd.read_csv('yourfile.csv')
# cleaned_df = (PhantomCleaner(df)
#               .fill_missing('mean')
#               .remove_outliers_iqr()
#               .remove_duplicates()
#               .label_encode()
#               .get_data())

# print(cleaned_df.head())

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phantom_cleaner-0.1.0.tar.gz (2.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

phantom_cleaner-0.1.0-py3-none-any.whl (2.8 kB view details)

Uploaded Python 3

File details

Details for the file phantom_cleaner-0.1.0.tar.gz.

File metadata

  • Download URL: phantom_cleaner-0.1.0.tar.gz
  • Upload date:
  • Size: 2.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for phantom_cleaner-0.1.0.tar.gz
Algorithm Hash digest
SHA256 c4b744efe3c6302d3edb9e114bafbaae41ba93cf63ad5e3c88797942688e0775
MD5 84a1c3513fb8589897176967e210caaf
BLAKE2b-256 37a1134bc17c2d055fea40e24c2e497dffd81f02dba6d3931d7cc607b79ea07f

See more details on using hashes here.

File details

Details for the file phantom_cleaner-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for phantom_cleaner-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 66c002b098a33559b6fe4c58cd3b452de31f7e5b2f36344cc57c77c24fa38792
MD5 5d62bd9e6098a0982914321a1ae117c8
BLAKE2b-256 6f6b6d28f79506469d5a2998eb046921a6ade3b8c61ec2a1f01d0783967d6182

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page