This repo contains a Colab-style notebook NN_project.ipynb that builds an Intrusion Detection System (IDS) using classic KDD Cup 1999 network traffic data. It compares a simple baseline (Logistic Regression) with a 1D Convolutional Neural Network (CNN) implemented in TensorFlow/Keras.
## Tech Stack
- Python (notebook)
- Data/ML: pandas, numpy, scikit-learn
- Deep learning: tensorflow/keras
- Visualization: matplotlib, seaborn
## Dataset
- Source: KDD Cup 1999 (the 10% subset is used in the notebook).
- File expected by the notebook: `kddcup.data_10_percent.gz`. You will be prompted in Colab to upload this file.
- Targets: the raw `target` labels are mapped into 5 attack categories via the notebook's `attacks_types` mapping: `normal`, `dos` (denial of service), `probe`, `r2l` (remote to local), `u2r` (user to root).
## Preprocessing
- Load the gzipped CSV with KDD features and a `target` column.
- Map raw attack strings to the 5-category `Attack Type` label.
- Drop the raw `target` column; use `Attack Type` as `Y`.
- Scale features with `MinMaxScaler`.
- Train/test split with `train_test_split(test_size=0.33, random_state=42)`.

Note: the CNN is trained on an input shape of `(30, 1)`, meaning the notebook prepares 30 numeric features and reshapes them into length-30 sequences for the 1D convolutions.
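The preprocessing steps above can be sketched as follows. This is a minimal stand-in, not the notebook's code: it uses a tiny synthetic frame instead of `kddcup.data_10_percent.gz`, and the `attacks_types` dict here is a truncated example of the notebook's full mapping.

```python
# Minimal sketch of the preprocessing pipeline, run on synthetic rows in place
# of the real KDD file. Column names and the mapping below are illustrative.
import pandas as pd
from sklearn.preprocessing import MinMaxScaler
from sklearn.model_selection import train_test_split

# Truncated stand-in for the notebook's attacks_types mapping.
attacks_types = {"normal.": "normal", "smurf.": "dos", "ipsweep.": "probe"}

df = pd.DataFrame({
    "duration":  [0, 1, 2, 0, 5, 3],
    "src_bytes": [181, 239, 235, 219, 217, 212],
    "target": ["normal.", "smurf.", "ipsweep.", "normal.", "smurf.", "normal."],
})

df["Attack Type"] = df["target"].map(attacks_types)   # map raw labels to categories
df = df.drop(columns=["target"])                      # drop the raw target column

X = MinMaxScaler().fit_transform(df.drop(columns=["Attack Type"]))  # scale to [0, 1]
Y = df["Attack Type"].values
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.33, random_state=42)
print(X_train.shape, X_test.shape)
```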
## Baseline: Logistic Regression
- `sklearn.linear_model.LogisticRegression(max_iter=500, random_state=42)`
- Evaluated with accuracy, a classification report, and a confusion matrix.
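The baseline fit/evaluate step looks roughly like this. Random arrays stand in for the preprocessed KDD split; the variable names mirror the notebook's, but the data here is synthetic.

```python
# Baseline sketch: fit LogisticRegression and report the same three metrics
# the notebook uses. The data below is random, not the real KDD split.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

rng = np.random.default_rng(0)
X_train, X_test = rng.random((200, 30)), rng.random((100, 30))
Y_train, Y_test = rng.integers(0, 5, 200), rng.integers(0, 5, 100)

clf = LogisticRegression(max_iter=500, random_state=42)
clf.fit(X_train, Y_train)
pred = clf.predict(X_test)

print("accuracy:", accuracy_score(Y_test, pred))
print(classification_report(Y_test, pred, zero_division=0))
print(confusion_matrix(Y_test, pred))
```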
## CNN Model

The model defined in the notebook (simplified): three parallel `Conv1D` branches read the same input; two flattened branches are merged with a dense head before the final softmax classifier.

```python
from tensorflow.keras.layers import (Input, Conv1D, MaxPooling1D, Flatten,
                                     Dropout, Dense, Concatenate)
from tensorflow.keras.models import Model

inputs = Input(shape=(30, 1))

# Branch 1: conv -> pool -> flatten -> dropout
y = Conv1D(62, 3, padding="same", activation="relu")(inputs)
y = MaxPooling1D(pool_size=2)(y)
y1 = Dropout(0.5)(Flatten()(y))

# Branch 2: same shape as branch 1
y = Conv1D(62, 3, padding="same", activation="relu")(inputs)
y = MaxPooling1D(pool_size=2)(y)
y2 = Dropout(0.5)(Flatten()(y))

# Branch 3: wider conv, followed by a dense head
y = Conv1D(124, 3, padding="same", activation="relu")(inputs)
y = MaxPooling1D(pool_size=2)(y)
y = Flatten()(y)
y = Dropout(0.5)(y)
y = Dense(256, activation="relu")(y)
y = Dropout(0.5)(y)
y = Dense(5, activation="softmax")(y)

# Merge the dense head with the two flattened branches, then classify
y = Concatenate()([y, y1, y2])
outputs = Dense(5, activation="softmax")(y)

cnn_model = Model(inputs=inputs, outputs=outputs)
```

- Loss: `sparse_categorical_crossentropy`
- Optimizer: `adam`
- Metric: `accuracy`
- Training: `epochs=10`, `batch_size=32`
- Evaluation mirrors the baseline (accuracy, classification report, confusion matrix).
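The compile/train step with those settings can be sketched as below. To keep the sketch fast, it uses a tiny random batch, a reduced single-branch model, and one epoch; the notebook trains the full model on the real split with `epochs=10`.

```python
# Compile/fit sketch with the settings listed above (sparse categorical
# cross-entropy, adam, accuracy). Data and model here are reduced stand-ins.
import numpy as np
from tensorflow.keras.layers import Input, Conv1D, MaxPooling1D, Flatten, Dense
from tensorflow.keras.models import Model

X_train = np.random.rand(64, 30, 1).astype("float32")   # 30 features -> (30, 1)
Y_train = np.random.randint(0, 5, 64)                   # integer labels, so
                                                        # sparse_* loss applies
inputs = Input(shape=(30, 1))
y = Conv1D(62, 3, padding="same", activation="relu")(inputs)
y = MaxPooling1D(2)(y)
y = Flatten()(y)
outputs = Dense(5, activation="softmax")(y)
model = Model(inputs, outputs)                          # stand-in for cnn_model

model.compile(loss="sparse_categorical_crossentropy",
              optimizer="adam", metrics=["accuracy"])
history = model.fit(X_train, Y_train, epochs=1, batch_size=32, verbose=0)
print(history.history["accuracy"])
```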
## Running in Colab
- Upload `NN_project.ipynb` to Colab.
- Run cells in order. When prompted, upload `kddcup.data_10_percent.gz`.
- The notebook will:
  - preprocess the data,
  - train Logistic Regression and the CNN,
  - print reports and show confusion matrices.
## Running Locally
- Create and activate a virtual environment.
- Install dependencies: `pip install tensorflow scikit-learn pandas numpy matplotlib seaborn`
- Launch Jupyter and open the notebook: `jupyter lab` (or `jupyter notebook`).
- Run cells; place `kddcup.data_10_percent.gz` next to the notebook or adjust the path.
## Repository Structure

```
.
├── NN_project.ipynb   # Main notebook
└── README.md          # You are here
```
## Tips
- If you change the feature set, update the CNN input shape `(30, 1)` accordingly.
- For GPU training in Colab, enable Runtime → Change runtime type → GPU.
## Results
- Classification model: Logistic Regression
- Classification model: CNN
- Feature correlation matrix
- Logistic Regression confusion matrix
- CNN confusion matrix
## Credits
- KDD Cup 1999 Dataset (Intrusion Detection)
- TensorFlow/Keras and scikit-learn documentation