Skip to content

XiangpengHao/parquet-viewer

Repository files navigation

Parquet Viewer

Online at: https://parquet-viewer.xiangpeng.systems

OpenVSX or VS Code extension

Parquet viewer is made possible with the funding from InfluxData.

Features

  • Query parquet data with SQL ✅
  • Query parquet data with natural language through LLM ✅
  • View Parquet metadata ✅
  • View Parquet files from anywhere -- local file, S3, or any URLs ✅
  • Everything runs in the browser, no server, no external dependencies, just a web page ✅
  • Read only you query -- won't download the entire parquet file ✅

Demo

screenshot

Tips

Local cli

nix run .#cli -- file.parquet
2026-01-09T15:20:13.357327Z  INFO parquet_viewer_cli: Serving Posts.parquet on http://0.0.0.0:53703

Serving: file.parquet

Viewer URLs:
  http://0.0.0.0:53703/?url=http%3A%2F%2F0.0.0.0%3A53703%2Ffile%2Ffile.parquet
  http://localhost:53703/?url=http%3A%2F%2Flocalhost%3A53703%2Ffile%2Ffile.parquet
  http://xiangpeng-madison:53703/?url=http%3A%2F%2Fxiangpeng-madison%3A53703%2Ffile%2Ffile.parquet

Press Ctrl+C to stop the server.

Development

It compiles Parquet, Arrow, Datafusion, OpenDAL to WebAssembly and uses it to explore Parquet files, more details.

After you have installed nix, the dev environment is setup with:

direnv allow

Which takes care of the cross-compiling of Rust, C++, C to WebAssembly, and everything else you need to run the project.

Run locally

dx serve --profile debug-strip

dx bundle --release

Run tests

wasm-pack test --headless --firefox

Build web static files

nix build .#web

Output will be in result/ directory.

Build VS Code extension

nix build .#vscode-extension

Output will be in result/ directory.

Build Docker image

nix build .#docker
docker load < result
docker run -p 8080:80 parquet-viewer:0.1.31

Citation

If you use Parquet Viewer in your research or wish to refer to it in your publications, please use the following BibTeX entry:

@software{parquet_viewer,
  author = {Xiangpeng Hao and Jigao Luo and LLM and {ParquetViewer developers}},
  title = {Parquet Viewer: Interactive Parquet File Explorer},
  year = {2025},
  url = {https://github.com/XiangpengHao/parquet-viewer},
  note = {Online at: https://parquet-viewer.xiangpeng.systems}
}

License

Licensed under either of

at your option.

Be aware that most of the code is generated by AI, resistance is futile.

About

View parquet files online

Topics

Resources

License

Apache-2.0, MIT licenses found

Licenses found

Apache-2.0
LICENSE-APACHE
MIT
LICENSE-MIT

Stars

Watchers

Forks

Contributors