This repository contains a machine learning project that predicts whether a breast tumor is malignant or benign from a dataset of medical features. The goal is a predictive model that supports early diagnosis of breast cancer and helps medical professionals make informed decisions.
Breast cancer is one of the most common cancers in women worldwide. Early detection and accurate diagnosis are crucial for effective treatment and improved survival rates. This project utilizes machine learning algorithms to classify breast cancer cases as either malignant or benign based on medical diagnostic features.
The dataset used in this app is sourced from Kaggle and includes 30 features extracted from breast cancer tumor samples. It contains information on both malignant and benign cases.
- Number of Samples: 569
- Number of Features: 30 (e.g., radius, texture, smoothness, compactness)
- Target Variable: Diagnosis (Malignant or Benign)
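As a quick sanity check on the figures above, the sketch below counts samples, feature columns, and class labels in a CSV. It assumes a layout with an `id` column, a `diagnosis` column holding `M` or `B`, and one column per feature. These column names and the helper `summarize_dataset` are assumptions for illustration, not taken from the repository, and the small inline sample only stands in for `data.csv`.

```python
import csv
import io
from collections import Counter


def summarize_dataset(csv_file, target="diagnosis", id_column="id"):
    """Return (n_samples, n_features, class_counts) for a CSV file object.

    The column names are assumptions; adjust them to match the actual file.
    """
    reader = csv.DictReader(csv_file)
    # Ignore the ID, the target, and any unnamed or empty header columns.
    feature_names = [
        name for name in (reader.fieldnames or [])
        if name and name not in (target, id_column)
    ]
    class_counts = Counter(row[target] for row in reader)
    n_samples = sum(class_counts.values())
    return n_samples, len(feature_names), class_counts


# Illustrative three-row sample with two features, standing in for data.csv.
sample = io.StringIO(
    "id,diagnosis,radius_mean,texture_mean\n"
    "1,M,17.99,10.38\n"
    "2,B,13.54,14.36\n"
    "3,B,12.45,15.70\n"
)
n_samples, n_features, class_counts = summarize_dataset(sample)
print(n_samples, n_features, dict(class_counts))
```

On the real file, one would pass `open("data.csv", newline="")` instead of the sample and, if the layout matches these assumptions, expect 569 samples and 30 features.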
- Clone the repository and change into its directory:

      git clone https://github.com/Khanz9664/Breast-Cancer-Prediction.git
      cd Breast-Cancer-Prediction

- Ensure the `breast_cancer_model.pkl` and `data.csv` files are in the project directory.
- Open a terminal in the project directory.
- Start the Streamlit app with this command: `streamlit run app.py`
- Open your web browser and go to `http://localhost:8501` to interact with the app.
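Before launching, it may help to confirm that the files named in the steps above are actually present. The following is a minimal sketch of such a check; the helper `missing_files` is hypothetical and not part of the repository.

```python
from pathlib import Path

# File names come from the setup steps; app.py is the Streamlit entry point.
REQUIRED_FILES = ("app.py", "breast_cancer_model.pkl", "data.csv")


def missing_files(project_dir=".", required=REQUIRED_FILES):
    """Return the required file names that are absent from project_dir."""
    root = Path(project_dir)
    return [name for name in required if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_files()
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        print("All required files found. Run: streamlit run app.py")
```

Running this from the project directory lists whatever is missing, so a failed start can be traced to an absent model or data file rather than to Streamlit itself.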
- Adjust the sliders in the sidebar to input tumor feature values.
- Click on the Predict button to get the prediction results.
- Explore:
  - Feature Importance: Understand which features contributed the most to the prediction.
  - Feature Correlation Heatmap: Gain insights into relationships between features.
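The prediction step can be sketched roughly as follows: load the pickled model, pass the 30 slider values as a single row, and map the output to a label. This is an illustration under assumptions, not the code in `app.py`. It assumes the pickle holds a scikit-learn style object with a `predict` method and that the classes are encoded as 0/1 or `B`/`M`. The names `load_model`, `predict_diagnosis`, and `ThresholdStub` are hypothetical.

```python
import pickle

# Assumed label encodings; the actual encoding used by the model is not stated.
LABELS = {0: "Benign", 1: "Malignant", "B": "Benign", "M": "Malignant"}


def load_model(path="breast_cancer_model.pkl"):
    """Unpickle the trained model. Only unpickle files from a trusted source."""
    with open(path, "rb") as f:
        return pickle.load(f)


def predict_diagnosis(model, feature_values, n_features=30):
    """Run one prediction on a sequence of feature values, e.g. from the sliders."""
    if len(feature_values) != n_features:
        raise ValueError(
            f"expected {n_features} values, got {len(feature_values)}"
        )
    # scikit-learn style estimators expect 2D input: one row per sample.
    raw = model.predict([list(feature_values)])[0]
    return LABELS.get(raw, str(raw))


class ThresholdStub:
    """Stand-in for the real model so the sketch runs without the pickle file."""

    def predict(self, rows):
        # Toy rule, not a medical criterion: flag a row if its first value exceeds 15.
        return [1 if row[0] > 15 else 0 for row in rows]


values = [17.99] + [0.0] * 29  # hypothetical slider values
print(predict_diagnosis(ThresholdStub(), values))
```

With the real files in place, `predict_diagnosis(load_model(), values)` would replace the stub call, provided the model accepts the features in the same order the sliders supply them.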