Skip to content

Ramil-cyber/Ramil-Pandas-Descriptive-Statistics-Script

 
 

Repository files navigation

Descriptive Statistics Script with Pandas

This project contains a Python script that uses Pandas to compute descriptive statistics for a dataset and generate data visualizations. The script reads data from a CSV or Excel file and provides key summary statistics such as the mean, median, and standard deviation.

Features

  • Reads a dataset from a CSV or Excel file.
  • Generates summary statistics:
    • Mean
    • Median
    • Standard Deviation
  • Produces at least one data visualization (e.g., histogram, boxplot, etc.).
  • Continuous Integration/Continuous Deployment (CI/CD) pipeline integrated with a build status badge.

Visualizations

  • Histogram:
    Histogram

  • Line Graph:
    Line Graph

Summary Statistics Report

You can view the detailed summary statistics report generated in PDF format here:

Requirements

  • Python 3.x
  • Pandas
  • Matplotlib or Seaborn (for visualization)
  • pytest (for testing)

Setup Instructions

  1. Clone the repository:

    git clone https://github.com/nogibjj/Ramil-Pandas-Descriptive-Statistics-Script
    cd Ramil-Pandas-Descriptive-Statistics-Script
  2. Install required dependencies: You can install all required packages by running:

    make setup
  3. Run the script: To run the descriptive statistics script, use the following command:

    python main.py

Usage

  • Place your dataset in the project directory.
  • Modify the script (main.py) to specify the path to your CSV or Excel file.

The script will automatically compute and display the descriptive statistics, and generate the data visualization.

Running Tests

The project includes tests to ensure the correctness of the script. To run the tests, use:

make test

Linting

To ensure code quality, Pylint is used. You can check for linting issues by running:

make lint

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 78.4%
  • Jupyter Notebook 21.2%
  • Other 0.4%