ddkale2022/EDA-Kchd-project

Project Name - Exploratory Data Analysis (EDA) of the King County housing dataset

Contributor: Dipali Kale

This project is an example exploratory data analysis, completed as part of the NeueFische Data Science Bootcamp.

Assignment: Through EDA and statistical analysis, provide insights regarding the overall data, and provide at least 3 recommendations to the client.

Name of client: Larry Sanders

Characteristics: Buyer, waterfront, limited budget, nice and isolated but central neighborhood without kids (he has kids of his own, but doesn't want them to play with other kids)

Research questions and hypothesis generation

For this project, the following research question was addressed and the following hypotheses were tested. Each hypothesis is listed with the indicator used to examine it.

Hypothesis 1: The closer a house is to the city center, the higher the price.
Indicator: geolocation

Hypothesis 2: If a house is located near the waterfront and is in good condition, then the price is higher.
Indicator: waterfront (yes/no)
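
As a minimal sketch of how the two indicators could be turned into numbers, the snippet below computes the great-circle (haversine) distance from each house to a city center, and the median price per waterfront flag. The column names lat, long, price and waterfront, the choice of downtown Seattle as the city center, and the sample rows are all assumptions made for illustration; they are not taken from this repo's notebooks.

```python
import math
from statistics import median

# Assumption: approximate coordinates of downtown Seattle, used as the "city center".
CITY_CENTER = (47.6062, -122.3321)


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two latitude/longitude points."""
    radius_km = 6371.0  # mean Earth radius
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))


def median_price_by_waterfront(rows):
    """Return {waterfront_flag: median price} for rows with 'waterfront' and 'price' keys."""
    groups = {}
    for row in rows:
        groups.setdefault(row["waterfront"], []).append(row["price"])
    return {flag: median(prices) for flag, prices in groups.items()}


# Made-up illustrative rows, NOT taken from the dataset.
sample = [
    {"lat": 47.61, "long": -122.33, "price": 900_000, "waterfront": 1},
    {"lat": 47.45, "long": -122.10, "price": 400_000, "waterfront": 0},
    {"lat": 47.30, "long": -122.00, "price": 300_000, "waterfront": 0},
]

# Hypothesis 1 indicator: distance of each house from the assumed city center.
for row in sample:
    row["dist_km"] = haversine_km(row["lat"], row["long"], *CITY_CENTER)

# Hypothesis 2 indicator: median price split by the waterfront flag.
print(median_price_by_waterfront(sample))
```

With the real data, the dist_km values could be compared against price (for example with a scatter plot or a correlation coefficient) to examine hypothesis 1.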

This repo contains

Requirements

  • pyenv
  • python==3.11.3

Setup

The general workflow is to set the local Python version to 3.11.3, create a virtual environment, and install the required packages via pip.

This repo contains a requirements.txt file listing all the packages and dependencies you will need. Before you create the virtual environment, make sure PostgreSQL is installed.

  • Check the PostgreSQL version by running the following command:
    psql --version
    If it is not installed yet, begin at Step_1. Otherwise, proceed to Step_2.

Before you can start with Plotly in JupyterLab, you have to install Node.js (if you haven't done so already).

  • Check the Node.js version by running the following command:
    node -v
    If it is not installed yet, begin at Step_2. Otherwise, proceed to Step_3.
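
The two checks above amount to a small decision rule, which can be sketched in Python as follows. The function name first_setup_step and its is_installed parameter are hypothetical helpers introduced here for illustration; the lookup relies on shutil.which, which only reports whether a command is found on PATH, not which version it is.

```python
import shutil


def first_setup_step(is_installed=lambda tool: shutil.which(tool) is not None):
    """Return the setup step to start from, following the checks described above.

    `is_installed` is a hypothetical hook (tool name -> bool) so the rule can be
    tried out without touching the real PATH.
    """
    if not is_installed("psql"):
        return "Step_1"  # PostgreSQL client not found on PATH
    if not is_installed("node"):
        return "Step_2"  # Node.js not found on PATH
    return "Step_3"      # both present: go straight to the virtual environment


print("Start at:", first_setup_step())
```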

On macOS, type the following commands:

  • Step_1: Update Homebrew and install PostgreSQL with the following commands:

    brew update
    brew install postgresql@14

    Restart your terminal and then check the PostgreSQL version by running the following command:

    psql --version

    If psql --version doesn't display the version, add PostgreSQL to your macOS PATH by following these steps:

    • Find and copy the PostgreSQL bin directory on macOS.

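      The default path is typically /Library/PostgreSQL/<version>/bin, where <version> is your PostgreSQL version. For a Homebrew install, running brew --prefix postgresql@14 prints the install prefix, and the bin directory sits beneath it.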

    • Edit ~/.zshrc (or the configuration file of the shell you use) with a text editor such as Nano, Vim, or VSCode.

      nano ~/.zshrc
    • Add the following line to the .zshrc file. Make sure to replace <version> with your PostgreSQL version.

      export PATH="/Library/PostgreSQL/<version>/bin:$PATH"
    • Save and exit the text editor. In nano, you can do this by pressing Ctrl + O, then Enter, and then Ctrl + X to exit.

    • Restart your terminal, or reload the configuration in the current one, then check the version again:

      source ~/.zshrc
      psql --version
  • Step_2: Update Homebrew and install Node.js with the following commands:

    brew update
    brew install node
  • Step_3: Create the virtual environment and install the required packages with the following commands:

    pyenv local 3.11.3
    python -m venv .venv
    source .venv/bin/activate
    pip install --upgrade pip
    pip install -r requirements.txt
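
After Step_3, a quick sanity check can confirm that the interpreter in use is the one from the virtual environment and matches the version listed under Requirements. The following is a minimal sketch, assuming it is run with the python command of the activated environment; the helper names in_virtualenv and version_matches are hypothetical.

```python
import sys

EXPECTED_VERSION = (3, 11, 3)  # taken from the Requirements section


def in_virtualenv():
    """True when running inside a venv: sys.prefix then differs from sys.base_prefix."""
    return sys.prefix != sys.base_prefix


def version_matches(actual=None, expected=EXPECTED_VERSION):
    """Compare a (major, minor, micro) tuple against the expected Python version."""
    if actual is None:
        actual = sys.version_info[:3]
    return tuple(actual) == tuple(expected)


print("Virtual environment active:", in_virtualenv())
print("Python version is 3.11.3:", version_matches())
```

If the first line prints False, the environment was probably not activated in the current terminal session.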

On Windows, type the following commands:

  • Step_1: Update Chocolatey and install PostgreSQL with the following commands:

    choco upgrade chocolatey
    choco install postgresql14

    Restart your terminal and then check the PostgreSQL version by running the following command:

    psql --version

    If psql --version doesn't display the version, add PostgreSQL to your Windows PATH by following these steps:

    • Find and copy the PostgreSQL bin directory on Windows.

      The default path is typically C:\Program Files\PostgreSQL\<version>\bin, where <version> is your PostgreSQL version.

    • Open Command Prompt as Administrator:

      • Search for "Command Prompt" in your Start menu.
      • Right-click on "Command Prompt" and select "Run as administrator."
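    • Add PostgreSQL to PATH:

      • Replace 14 with your PostgreSQL version if it's different. The %PATH% form below is Command Prompt syntax; in PowerShell the current value is written as $env:PATH instead.
      setx PATH "%PATH%;C:\Program Files\PostgreSQL\14\bin"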
    • Close the Administrator Command Prompt window.

    • Open a new terminal and run the following command:

      psql --version
  • Step_2: Update Chocolatey and install Node.js with the following commands:

    choco upgrade chocolatey
    choco install nodejs
  • Step_3: Create the virtual environment and install the required packages with the following commands.

    For the PowerShell CLI:

    python -m venv .venv
    .venv\Scripts\Activate.ps1
    pip install --upgrade pip
    pip install -r requirements.txt

    For the Git Bash CLI:

    python -m venv .venv
    source .venv/Scripts/activate
    pip install --upgrade pip
    pip install -r requirements.txt
    

Note: If you encounter an error when trying to run pip install --upgrade pip, try using the following command:

python.exe -m pip install --upgrade pip
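
Once the packages are installed on either platform, it can be useful to check that everything listed in requirements.txt is actually available in the active environment. The sketch below is one way to do that with the standard library only. The helper names are hypothetical, the parsing is deliberately simplified (it reads only the package name at the start of each line and skips comments and option lines), and the example text is made up rather than copied from this repo's requirements.txt.

```python
import re
from importlib import metadata


def parse_requirement_names(text):
    """Extract package names from requirements-style text (simplified parsing)."""
    names = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments
        if not line or line.startswith("-"):  # skip blanks and options like "-r other.txt"
            continue
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.append(match.group(0))
    return names


def missing_packages(names):
    """Return the names for which no installed distribution is found."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


# Hypothetical contents, for illustration only.
example = "pandas==2.0.1\n# plotting\nplotly>=5\n"
print("Not installed:", missing_packages(parse_requirement_names(example)))
```

To check the real file, pass the result of reading requirements.txt (for example with pathlib.Path("requirements.txt").read_text()) to parse_requirement_names.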
