onenote-exporter

A command-line tool that exports Microsoft OneNote .one files to Markdown or HTML.

OneNote stores notebooks in a proprietary binary format (MS-ONESTORE). This tool parses those files and converts pages — including text formatting, images, tables, and attachments — into clean, version-control-friendly Markdown or self-contained HTML.

Features

Converts .one files to Markdown or HTML (or both at once)
Preserves formatting: bold, italic, underline, strikethrough, links, super/subscript
HTML output is self-contained with embedded CSS — no external dependencies
Extracts images and embedded file attachments alongside output files
Renders tables in Markdown table syntax or HTML <table> elements
Automatically deduplicates section versions (e.g. Notes.one vs Notes (On 2-25-26).one)
Preserves notebook/section/page hierarchy in the output directory
Optional flat output mode (no notebook subdirectories)

Requirements

Python 3.11+
macOS, Linux, or Windows

Installation

git clone https://github.com/strick-j/onenote-exporter.git
cd onenote-exporter

# Create and activate a virtual environment (recommended)
python3 -m venv .venv
source .venv/bin/activate          # macOS/Linux
# .venv\Scripts\activate.bat       # Windows (CMD)
# .venv\Scripts\Activate.ps1       # Windows (PowerShell)

# Install the package
pip install -e .

Usage

onenote-export -i <input_dir> -o <output_dir> [options]

Arguments

Flag	Long Form	Required	Description
`-i`	`--input`	Yes	Directory containing `.one` files (searched recursively)
`-o`	`--output`	Yes	Directory where output files will be written
`-f`	`--format`	No	Output format: `markdown` (default), `html`, or `both`
`-v`	`--verbose`	No	Show progress details (INFO level logging)
	`--debug`	No	Show detailed diagnostic output (DEBUG level logging)
	`--flat`	No	Write all sections to the output root without notebook subdirectories

Examples

Basic export (Markdown):

onenote-export -i ~/OneDrive/OneNote_Notebooks -o ~/Documents/exported_notes

Export as HTML:

onenote-export -i ~/OneDrive/OneNote_Notebooks -o ~/Documents/exported_notes --format html

Export both Markdown and HTML:

onenote-export -i ~/OneDrive/OneNote_Notebooks -o ~/Documents/exported_notes -f both

Verbose output with flat structure:

onenote-export -i ~/OneDrive/OneNote_Notebooks -o ~/Documents/exported_notes --verbose --flat

Debug mode for troubleshooting:

onenote-export -i ~/OneDrive/OneNote_Notebooks -o ~/Documents/exported_notes --debug

Input Structure

The tool expects .one files organized into directories, where each directory represents a notebook:

input_dir/
├── Work Notebook/
│   ├── Meeting Notes.one
│   ├── Projects.one
│   └── Projects (On 2-25-26).one   # older version, auto-skipped
└── Personal/
    ├── Recipes.one
    └── Travel.one

Files with date suffixes like (On 2-25-26) are recognized as older versions of the same section. Only the most recent version is exported.

Output Structure

Default (nested):

output_dir/
├── Work Notebook/
│   ├── Meeting Notes/
│   │   ├── Standup 2024-01-15.md    # (or .html with --format html)
│   │   ├── images/
│   │   │   └── image_001.png
│   │   └── attachments/
│   │       └── report.pdf
│   └── Projects/
│       └── Project Alpha.md
└── Personal/
    ├── Recipes/
    │   └── Pasta Carbonara.md
    └── Travel/
        └── Japan Trip.md

With --flat:

output_dir/
├── Meeting Notes/
│   └── Standup 2024-01-15.md
├── Projects/
│   └── Project Alpha.md
├── Recipes/
│   └── Pasta Carbonara.md
└── Travel/
    └── Japan Trip.md

Supported Content

Content Type	Markdown	HTML
Bold	`text`	`<strong>`
Italic	`text`	`<em>`
Underline	`text` (italic fallback)	`<u>`
Strikethrough	`~~text~~`	`<del>`
Superscript	`<sup>`	`<sup>`
Subscript	`<sub>`	`<sub>`
Hyperlinks	`[text](url)`	`<a href="...">`
Headings	`# H1` – `###### H6`	`<h1>` – `<h6>`
Images	`![alt](./images/...)`	`<img src="./images/...">`
Tables	Markdown table syntax	`<table>`
Embedded files	`[name](./attachments/...)`	`<a href="./attachments/...">`

Image formats detected automatically: PNG, JPEG, GIF, BMP, WEBP.

Note: Underline is rendered as italic in Markdown since Markdown has no native underline syntax. HTML output uses the <u> tag.

Exit Codes

Code	Meaning
0	Export completed successfully
1	Fatal error (input directory missing or no `.one` files found)
2	All sections failed to process

Errors are printed to stderr. Use -v or --debug for additional detail.

Testing

The project includes a test suite covering utilities, data models, converter rendering, content extraction, and CLI logic.

Setup

# Install with dev dependencies (pytest + pytest-cov)
pip install -e ".[dev]"

Running Tests

# Run all tests
pytest

# Run tests with verbose output
pytest -v

# Run tests with coverage report
pytest --cov=onenote_export

# Run a specific test file
pytest tests/test_utils.py

# Run a specific test class or method
pytest tests/test_markdown_converter.py::TestMarkdownConverterRenderPage::test_page_with_bold_text

Test Structure

tests/
├── Example-NoteBook-1/           # Sample .one files for testing
│   ├── Example-Section-1 (On 2-25-26 - 2).one
│   ├── Example-Section-1 (On 2-25-26 - 3).one
│   ├── Example-Section-2 (On 2-25-26 - 3).one
│   └── Example-Section-2 (On 2-25-26).one
├── test_base_converter.py        # Base converter file I/O and filename utilities
├── test_cli.py                   # CLI argument parsing, --format flag, deduplication
├── test_content_extractor.py     # Text decoding, image format detection, type parsing
├── test_html_converter.py        # HTML rendering, escaping, file writing
├── test_markdown_converter.py    # Markdown rendering, file/image/attachment writing
├── test_models.py                # Data model construction and immutability
└── test_utils.py                 # File discovery and name extraction

What's Tested

Module	Tests	Coverage
`converter/base.py`	File I/O, directory creation, filename sanitization, deduplication	All shared converter logic
`converter/markdown.py`	Rendering for every content type	Bold, italic, strikethrough, links, super/subscript, tables, images, attachments
`converter/html.py`	HTML rendering, XSS escaping, document structure	All content types, self-contained HTML, CSS embedding
`model/`	All dataclass defaults, immutability (`TextRun` is frozen)	All content types: `RichText`, `ImageElement`, `TableElement`, `EmbeddedFile`
`parser/content_extractor.py`	Text decoding (hex, UTF-16, ASCII), garbled text detection	All image formats (PNG, JPEG, GIF, BMP, WEBP), edge cases
`cli.py`	Section deduplication, `--format` flag, logging levels	All format choices, error handling
`utils.py`	File discovery, section name parsing, notebook name extraction	File glob, `.onetoc2` exclusion, date suffix stripping

How It Works

Discover — Recursively finds all .one files in the input directory (skips .onetoc2 files)
Deduplicate — Groups files by section name and keeps only the latest version
Parse — Reads each .one file using the pyOneNote library to extract raw objects from the MS-ONESTORE binary format
Extract — Converts raw objects into a structured model (notebooks, sections, pages, content elements)
Convert — Renders each page as Markdown and/or HTML, writing images and attachments to disk

Limitations

Requires binary .one files — does not work with OneNote for Web exports or .onetoc2 table-of-contents files
Some complex OneNote formatting may not have a direct Markdown equivalent
OneNote revision history may produce duplicate content in edge cases
Tested on macOS, Linux, and Windows

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github/workflows		.github/workflows
onenote_export		onenote_export
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

onenote-exporter

Features

Requirements

Installation

Usage

Arguments

Examples

Input Structure

Output Structure

Supported Content

Exit Codes

Testing

Setup

Running Tests

Test Structure

What's Tested

How It Works

Limitations

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

onenote-exporter

Features

Requirements

Installation

Usage

Arguments

Examples

Input Structure

Output Structure

Supported Content

Exit Codes

Testing

Setup

Running Tests

Test Structure

What's Tested

How It Works

Limitations

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages