Understanding the Dataset
🗂️ What Data Are We Using?
For this tutorial, we will use a simple cybersecurity dataset containing:
- login attempts
- time of access
- success/failure
- IP address
- location
- number of retries
- flagged status (normal / suspicious)
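To make the fields concrete, here is a minimal sketch of how one record could be represented in Python. The class name `LoginRecord`, the field names `access_time` and `location`, and all of the types are assumptions made for illustration; the other field names follow the column headers of the sample table in the next part of this section.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class LoginRecord:
    """One row of the dataset (hypothetical layout; names and types are assumed)."""
    login_attempts: int
    ip_address: str
    status: str        # "success" or "failed"
    retries: int
    suspicious: int    # 1 = suspicious, 0 = normal
    # The two fields below are listed above but not shown in the sample table,
    # so they are optional here.
    access_time: Optional[datetime] = None
    location: Optional[str] = None


# Values taken from the first row of the sample table.
record = LoginRecord(
    login_attempts=1,
    ip_address="192.168.1.22",
    status="success",
    retries=0,
    suspicious=0,
)
print(record)
```

Keeping `access_time` and `location` optional is only a convenience for this sketch; a real dataset would normally fill them in.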
📘 Sample Data (Human-Readable Version)
| login_attempts | ip_address | status | retries | suspicious |
|---|---|---|---|---|
| 1 | 192.168.1.22 | success | 0 | 0 |
| 12 | 185.123.77.19 | failed | 5 | 1 |
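A high number of retries is a sign of suspicious activity: the second row has 5 retries on a failed login and is flagged (suspicious = 1), while the first row has 0 retries and is not (suspicious = 0).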
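The retries rule can be sketched in a few lines of Python using only the standard library. The two rows are copied from the sample table; the cutoff `RETRY_THRESHOLD = 3` and the helper names `load_rows` and `looks_suspicious` are assumptions chosen for illustration, not values defined by the dataset.

```python
import csv
import io

# The two rows from the sample table, written as CSV text.
SAMPLE_CSV = """login_attempts,ip_address,status,retries,suspicious
1,192.168.1.22,success,0,0
12,185.123.77.19,failed,5,1
"""

RETRY_THRESHOLD = 3  # assumed cutoff; tune it for real data


def load_rows(text):
    """Parse CSV text into dicts, converting the numeric columns to int."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        for column in ("login_attempts", "retries", "suspicious"):
            row[column] = int(row[column])
        rows.append(row)
    return rows


def looks_suspicious(row, threshold=RETRY_THRESHOLD):
    """Return 1 if the retry count reaches the threshold, else 0."""
    return int(row["retries"] >= threshold)


rows = load_rows(SAMPLE_CSV)
predictions = [looks_suspicious(row) for row in rows]
labels = [row["suspicious"] for row in rows]

for row, predicted in zip(rows, predictions):
    print(row["ip_address"], "retries:", row["retries"], "flagged:", predicted)
```

On these two rows the rule agrees with the `suspicious` column, but a single fixed threshold is only a starting point and says nothing about how well it would hold up on a larger dataset.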
