Skip to content
/ log-zoo Public

A community-driven collection of raw, multi-vendor log samples (Syslog, CEF, JSON) for parser development and security testing.

Notifications You must be signed in to change notification settings

sva-s1/log-zoo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🪵🐒 Log Zoo

A community-driven repository of raw, unparsed log samples from various vendors and applications.

CC0

About This Project

The goal of log-zoo is to provide a centralized collection of real-world log examples for development, testing, and security research. These samples are useful for building and validating parsers, testing normalization rules, and understanding data structures from different log sources.

Important

The logs in this repository are raw events sourced directly from vendors (e.g., cloud applications, network appliances, self-hosted software). They have not been parsed or processed by SentinelOne's Singularity DataLake (SDL) or any other SIEM.


Directory Structure

Log samples are organized by their format or transport protocol at the top level. The file names should be descriptive of the vendor and event type.

.
├── cef
│   ├── README.md
│   └── xypro_nonstop_object_access.log
├── CONTRIBUTING.md
├── csv
│   └── PLACEHOLDER
├── fortinet
│   └── event.type
│       ├── security-rating
│       │   └── fortigate.log
│       ├── system
│       │   └── fortigate.log
│       └── traffic
│           └── fortigate.log
├── json
│   └── PLACEHOLDER
├── README.md
└── syslog (custom)
    ├── rfc3164
    │   └── PLACEHOLDER
    └── rfc5424
        └── PLACEHOLDER
  • cef/: ArcSight Common Event Format (CEF) logs.
  • csv/: Comma-separated value logs.
  • json/: Logs in JSON format, either as single objects or one object per line.
  • leef/: IBM Log Event Extended Format (LEEF) logs.
  • syslog/: Contains logs transported via syslog protocol, separated by their RFC standard.
    • rfc3164/: The legacy BSD-style syslog protocol.
    • rfc5424/: The newer, standardized syslog protocol.

Note

While formats like CEF and LEEF are often transported via syslog, they are distinct log formats. They have their own top-level directories to accommodate samples that may be sourced from files (.log, .txt) instead of a direct syslog stream.


How to Use

Clone the repository to get local access to the samples. You can use these files as test data for scripts, parser development, or any other analysis.

git clone https://github.com/sva-s1/log-zoo.git

Additional Resources 🛠️

For pre-built dashboards, monitors, parsers, and other configurations that can be used with your log data, please visit the official Scalyr samples repository. While this log-zoo repo provides the raw data, the scalyr/samples repo provides assets to help you process and visualize it.


License

This repository is dedicated to the public domain under the CC0 1.0 Universal license. You can copy, modify, and distribute the work, even for commercial purposes, without asking permission.

About

A community-driven collection of raw, multi-vendor log samples (Syslog, CEF, JSON) for parser development and security testing.

Topics

Resources

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published