Cognia – Automated Exploratory Data Analysis (EDA) with HTML reports
Project description
Cognia
Automated Exploratory Data Analysis
Cognia is a Python library that automatically performs Exploratory Data Analysis (EDA) and generates a structured, insight-rich HTML report.
Instead of writing repetitive and error-prone EDA scripts, Cognia thinks like a data analyst and delivers clear insights, visualizations, and warnings instantly.
✨ Why Cognia?
Before building:
- Machine Learning models
- Statistical analyses
- Dashboards or business insights
You must understand your data.
However, traditional EDA is often:
❌ Time-consuming
❌ Repetitive
❌ Hard to standardize
❌ Difficult to scale
👉 Cognia automates this entire process.
📁 Project Structure:
COGNIA/
│
├── cognia/ # Core Cognia package
│ ├── __init__.py # Package initializer
│ ├── alert.py # Data quality alerts & warnings
│ ├── corr.py # Correlation analysis utilities
│ ├── interpret.py # Distribution & insight interpretation
│ ├── missing.py # Missing value analysis
│ ├── outliers.py # Outlier detection logic
│ ├── profiling.py # Dataset profiling helpers
│ ├── quick_eda.py # Fast high-level EDA summary
│ ├── report.py # HTML report generation engine
│ └── stats.py # Statistical computations
│
├── demo/ # Demo & example files
│ ├── cognia_eda_report.html # Sample generated EDA report
│ ├── input_file.py # Example usage script
│ └── labtoprice.csv # Sample dataset
│
├── pyproject.toml # Build & dependency configuration
├── README.md # Project documentation
🔍 What Cognia Analyzes:
Cognia generates a complete EDA report covering:
📊 Dataset Overview:
- Total rows & columns
- Data types
- Duplicate records
- Numeric vs categorical features
❓ Missing Value Analysis:
- Column-wise missing counts
- Missing percentages
- Data completeness indicators
📈 Statistical Summary:
- Mean, median, standard deviation
- Min / Max values
- Distribution characteristics
📉 Distribution & Shape Analysis:
- Histograms for numeric features
- Skewness detection
- Interpretable insights
🚨 Outlier Detection:
- Outlier counts per column
- Severity-based alerts
- Early modeling risk detection
🧩 Categorical Feature Analysis:
- Top categories
- Frequency bar charts
- Color-coded visualizations
🔗 Correlation Analysis (Smart & Scalable):
- Top correlated feature pairs (for large datasets)
- Optional full correlation heatmap
- Human-readable layout (no clutter)
⚠️ Alerts & Warnings:
- High missing values
- Duplicate data risks
- Extreme skewness & outliers
- Potential modeling issues
🧪 How to Use Cognia?
from cognia import eda_report
eda_report(df)
✔️ That’s it.
✔️ An HTML EDA report is generated instantly.
✔️ No configuration required.
📦 Installation:
Clone the repository and install locally:
pip install -e .
🛠 Built With:
🐍 Python 3.8+
📦 pandas
🔢 numpy
📊 matplotlib
📂 HTML
Lightweight • Fast • Beginner-friendly • Extensible
🏁 Philosophy:
If you can load a DataFrame, you should be able to understand it.
Cognia makes that possible.
If you find Cognia useful, don’t forget to ⭐ star the repository and share it with fellow data enthusiasts.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cognia-0.1.0.tar.gz.
File metadata
- Download URL: cognia-0.1.0.tar.gz
- Upload date:
- Size: 12.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
edc1a2958943a0a847eac53c30506a30ce223edd5f16ab3c85e7556eae536be0
|
|
| MD5 |
071d9e4ae8cec4bc54771b786dd1de9e
|
|
| BLAKE2b-256 |
290af6b79248a9b5caa491e90aaf69e909d00b5a5dfce8f3fcd08ce5a3fb911e
|
File details
Details for the file cognia-0.1.0-py3-none-any.whl.
File metadata
- Download URL: cognia-0.1.0-py3-none-any.whl
- Upload date:
- Size: 13.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8483920080d710c71ba6bac27d9c19680c75af57b4f546ea3e6cc839f172c3aa
|
|
| MD5 |
1c47547c650e48c0a02a9b22ba211fe2
|
|
| BLAKE2b-256 |
c1f46a50b0c1fcc8b1b3740af4e717a4dd64f7b2e228273542be2237f7bc7ab7
|