A lightweight data cleaning library for Python
Project description
Phantom Cleaner 🧹
phantom_cleaner is a simple and efficient Python library to clean your datasets with minimal effort.
Features
- Handle missing values (mean, median, mode, drop)
- Remove duplicates
- Remove outliers using IQR
- Label encode categorical variables
- Clean text data (lowercase, remove punctuation and digits)
Installation
pip install phantom_cleaner
#USAGE
# import pandas as pd
# from phantom_cleaner import PhantomCleaner
# df = pd.read_csv('yourfile.csv')
# cleaned_df = (PhantomCleaner(df)
# .fill_missing('mean')
# .remove_outliers_iqr()
# .remove_duplicates()
# .label_encode()
# .get_data())
# print(cleaned_df.head())
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
phantom_cleaner-0.1.0.tar.gz
(2.4 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file phantom_cleaner-0.1.0.tar.gz.
File metadata
- Download URL: phantom_cleaner-0.1.0.tar.gz
- Upload date:
- Size: 2.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c4b744efe3c6302d3edb9e114bafbaae41ba93cf63ad5e3c88797942688e0775
|
|
| MD5 |
84a1c3513fb8589897176967e210caaf
|
|
| BLAKE2b-256 |
37a1134bc17c2d055fea40e24c2e497dffd81f02dba6d3931d7cc607b79ea07f
|
File details
Details for the file phantom_cleaner-0.1.0-py3-none-any.whl.
File metadata
- Download URL: phantom_cleaner-0.1.0-py3-none-any.whl
- Upload date:
- Size: 2.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
66c002b098a33559b6fe4c58cd3b452de31f7e5b2f36344cc57c77c24fa38792
|
|
| MD5 |
5d62bd9e6098a0982914321a1ae117c8
|
|
| BLAKE2b-256 |
6f6b6d28f79506469d5a2998eb046921a6ade3b8c61ec2a1f01d0783967d6182
|