A curated collection of papers, repositories, and resources on Prior-data Fitted Networks (PFNs).
As the name says, Prior-data Fitted Networks (PFNs) are a class of neural networks trained on synthetic datasets sampled from a prior distribution to directly approximate the posterior predictive distribution (PPD). They enable Bayesian prediction through in-context learning and have been applied across tabular data, time series, Bayesian optimization, symbolic regression, and beyond.
A landmark success of the PFN framework is the Tabular Foundation Model (TabPFN) published in Nature: Accurate predictions on small data with a tabular foundation model. It showed that a TabPFN trained on over 100 million synthetic tabular tasks can outperform traditional ML models (like XGBoost, CatBoost, LightGBM) on a wide range of small real datasets.
- Key idea: Train a transformer on 100 million synthetic tabular tasks, enabling zero-training predictions on new datasets.
- Architecture: An adapted transformer encoder designed for two-dimensional tabular data, supporting categorical + numeric features, missing values, and heterogeneous distributions.
- Impact: TabPFN demonstrates that PFNs can act as foundation models for tabular data, achieving state-of-the-art accuracy on small-data benchmarks in seconds, outperforming conventional AutoML systems.
This work established PFNs as a new family of foundation models for structured data, analogous to LLMs for text.
| Venue | Title | Code |
|---|---|---|
| ICLR 2023 | TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second | Code |
| Nature | Accurate predictions on small data with a tabular foundation model | Code |
Foundations and theoretical insights into PFNs, amortized inference, and Bayesian learning.
| Repository | Description |
|---|---|
automl/PFNs |
Canonical PFN implementation; synthetic task generation, Bayesian inference via transformers. |
PriorLabs/TabPFN |
Official Tabular PFN implementation (classification + regression) |
PriorLabs/tabpfn-extensions |
Extensions: interpretability, more classes, imputation, and analysis tools |
PriorLabs/awesome-tabpfn |
Community-curated list of TabPFN applications and papers |
PriorLabs/tabpfn-time-series |
Time-series adaptation of TabPFN |
david-rundel/tabpfn_iml |
Interpretability module for TabPFN (SHAP, feature attribution) |
yandex-research/G2T-FM |
Graph-to-Table Foundation Model: extend TabPFN to graph data |
yandex-research/graphpfn |
GraphPFN: PFN with graph priors and message-passing transformer |
abacusai/ForecastPFN |
PFN for zero-shot time-series forecasting |
soda-inria/tabicl |
a more scalable tabular foundation model |
limix-ldm/LimiX |
LimiX: a tabular foundation model generalizing TabPFN |
autogluon/tabrepo |
Living benchmark for tabular ML; includes foundation-model baselines |
autogluon/autogluon |
End-to-end AutoML framework supporting tabular, time-series, and multimodal data; often used as benchmark in PFN/TabPFN work |
- Predicting Early Outcomes of Prostatic Artery Embolization Using n-Butyl Cyanoacrylate Liquid Embolic Agent: A Machine Learning Study
- Virtual Screening of Natural Anti-Senescent Compounds Based on Sq-TabPFN
- From Rows to Yields: How Foundation Models for Tabular Data Simplify Crop Yield Prediction
- Uncertainty-Aware Tabular Prediction: Evaluating VBLL-Enhanced TabPFN in Safety-Critical Medical Data
- Early Fault Classification in Rotating Machinery With Limited Data Using TabPFN
- Improved Ethereum Fraud Detection Mechanism with Explainable Tabular Transformer Model
- Fast and Accurate Zero-Training Classification for Tabular Engineering Data
- A Fast and Reliable Transformer-Based TabPFN Model for Liver Disease Diagnosis
- Machine learning and radiomics for ventricular tachyarrhythmia prediction in hypertrophic cardiomyopathy: insights from an MRI-based analysis
- Explainable Classification for Non-Small Cell Lung Cancer Based on Positron Emission Tomography Features and Clinical Data
- Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring
- Fault Diagnosis of Slewing Bearing Using Audible Sound Signal Based on Time Generative Adversarial Network–TabPFN Method
- A machine learning-based approach for individualized prediction of short-term outcomes after anterior cervical corpectomy
- Foundation Models for Cybersecurity: A Comprehensive Multi-Modal Evaluation of TabPFN and TabICL for Tabular Intrusion Detection
- TACO: TabPFN Augmented Causal Outcomes for Early Detection of Long COVID
- Tabular Data with Class Imbalance: Predicting Electric Vehicle Crash Severity with Pretrained Transformers (TabPFN) and Mamba-Based Models
- Tabular prior-data fitted network for urban air temperature inference and high temperature risk assessment
- Kriging prior Regression: A Case for Kriging-Based Spatial Features with TabPFN in Soil Mapping
- Uncertainty-Aware Tabular Prediction: Evaluating VBLL-Enhanced TabPFN in Safety-Critical Medical Data
- Tabular foundation model for GEOAI benchmark problems BM/AirportSoilProperties/2/2025
Contributions are welcome!
- Add papers, repos, or tutorials.
- Fix broken links.
- Suggest new categories.
This list aggregates publicly available PFN-related resources. Each work retains its own license and citation requirements. If you use these resources, please cite the corresponding paper and repository.
