Skip to main content

Cognia – Automated Exploratory Data Analysis (EDA) with HTML reports

Project description

Cognia

Automated Exploratory Data Analysis

Cognia is a Python library that automatically performs Exploratory Data Analysis (EDA) and generates a structured, insight-rich HTML report.

Instead of writing repetitive and error-prone EDA scripts, Cognia thinks like a data analyst and delivers clear insights, visualizations, and warnings instantly.

✨ Why Cognia?

Before building:

  • Machine Learning models
  • Statistical analyses
  • Dashboards or business insights

You must understand your data.

However, traditional EDA is often:

❌ Time-consuming

❌ Repetitive

❌ Hard to standardize

❌ Difficult to scale

👉 Cognia automates this entire process.

📁 Project Structure:

COGNIA/
│
├── cognia/                     # Core Cognia package
│   ├── __init__.py             # Package initializer
│   ├── alert.py                # Data quality alerts & warnings
│   ├── corr.py                 # Correlation analysis utilities
│   ├── interpret.py            # Distribution & insight interpretation
│   ├── missing.py              # Missing value analysis
│   ├── outliers.py             # Outlier detection logic
│   ├── profiling.py            # Dataset profiling helpers
│   ├── quick_eda.py             # Fast high-level EDA summary
│   ├── report.py               # HTML report generation engine
│   └── stats.py                # Statistical computations
│
├── demo/                       # Demo & example files
│   ├── cognia_eda_report.html  # Sample generated EDA report
│   ├── input_file.py           # Example usage script
│   └── labtoprice.csv          # Sample dataset
│
├── pyproject.toml              # Build & dependency configuration
├── README.md                   # Project documentation

🔍 What Cognia Analyzes:

Cognia generates a complete EDA report covering:

📊 Dataset Overview:

  • Total rows & columns
  • Data types
  • Duplicate records
  • Numeric vs categorical features

❓ Missing Value Analysis:

  • Column-wise missing counts
  • Missing percentages
  • Data completeness indicators

📈 Statistical Summary:

  • Mean, median, standard deviation
  • Min / Max values
  • Distribution characteristics

📉 Distribution & Shape Analysis:

  • Histograms for numeric features
  • Skewness detection
  • Interpretable insights

🚨 Outlier Detection:

  • Outlier counts per column
  • Severity-based alerts
  • Early modeling risk detection

🧩 Categorical Feature Analysis:

  • Top categories
  • Frequency bar charts
  • Color-coded visualizations

🔗 Correlation Analysis (Smart & Scalable):

  • Top correlated feature pairs (for large datasets)
  • Optional full correlation heatmap
  • Human-readable layout (no clutter)

⚠️ Alerts & Warnings:

  • High missing values
  • Duplicate data risks
  • Extreme skewness & outliers
  • Potential modeling issues

🧪 How to Use Cognia?

from cognia import eda_report

eda_report(df)

✔️ That’s it.

✔️ An HTML EDA report is generated instantly.

✔️ No configuration required.

📦 Installation:

Clone the repository and install locally:

pip install -e .

🛠 Built With:

🐍 Python 3.8+

📦 pandas

🔢 numpy

📊 matplotlib

📂 HTML

Lightweight • Fast • Beginner-friendly • Extensible

🏁 Philosophy:

If you can load a DataFrame, you should be able to understand it.

Cognia makes that possible.

If you find Cognia useful, don’t forget to ⭐ star the repository and share it with fellow data enthusiasts.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cognia-0.1.0.tar.gz (12.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cognia-0.1.0-py3-none-any.whl (13.3 kB view details)

Uploaded Python 3

File details

Details for the file cognia-0.1.0.tar.gz.

File metadata

  • Download URL: cognia-0.1.0.tar.gz
  • Upload date:
  • Size: 12.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for cognia-0.1.0.tar.gz
Algorithm Hash digest
SHA256 edc1a2958943a0a847eac53c30506a30ce223edd5f16ab3c85e7556eae536be0
MD5 071d9e4ae8cec4bc54771b786dd1de9e
BLAKE2b-256 290af6b79248a9b5caa491e90aaf69e909d00b5a5dfce8f3fcd08ce5a3fb911e

See more details on using hashes here.

File details

Details for the file cognia-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: cognia-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 13.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for cognia-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8483920080d710c71ba6bac27d9c19680c75af57b4f546ea3e6cc839f172c3aa
MD5 1c47547c650e48c0a02a9b22ba211fe2
BLAKE2b-256 c1f46a50b0c1fcc8b1b3740af4e717a4dd64f7b2e228273542be2237f7bc7ab7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page