Google Summer of Code 2025: "Adjuster this! TabPFN as a replacement for the adjuster" project discussion thread #27
Replies: 36 comments 49 replies
-
|
Hey @Sukh-P, this project is pretty good and I am interested in working and contributing to this project, I will be very grateful if you can share something which is must to go before familiarizing with codebase. Thanks in advance. |
Beta Was this translation helpful? Give feedback.
-
|
Hi Sukh! |
Beta Was this translation helpful? Give feedback.
-
|
Hi @Sukh-P, I’m interested in contributing to this project, specifically in integrating TabPFN to improve the adjuster model. My plan includes:
I have experience in ML, and data processing (pandas, NumPy, Scikit-Learn) and would love to collaborate on refining this approach. Looking forward to feedback and next steps! One more question which discord server, or similar communities should i join? Best, |
Beta Was this translation helpful? Give feedback.
-
|
Hey @Sukh-P ,
I took a look at the repos, and the adjuster logic is pretty interesting. Looking forward to learning more and seeing where this goes! |
Beta Was this translation helpful? Give feedback.
-
|
Hey @Sukh-P, I'm intrigued by your TabPFN solar forecasting project. Some questions I'm curious about:
For implementation, you might consider framing this as multiple specialized TabPFN models for different atmospheric stability conditions, then ensemble their outputs. The project README suggests this kind of domain-specific approach often outperforms a one-size-fits-all model. Would it be possible to share a sample distribution of your historical errors to better understand the patterns TabPFN would need to learn? I'm particularly impressed by how this project tackles a critical problem in renewable energy integration - better solar forecasting directly impacts grid stability and energy market operations. TabPFN's approach seems uniquely suited to capture the complex, non-linear patterns in solar forecasting errors. |
Beta Was this translation helpful? Give feedback.
-
|
Hey @Sukh-P, This project sounds really exciting! I’ve worked with ML and data processing in Python, particularly with pandas, scikit-learn, and some PyTorch, and I’d love to explore how TabPFN could improve solar forecasting adjustments. TabPFN’s ability to outperform traditional methods while being dramatically faster makes it a compelling candidate for dynamically adjusting forecasts based on historical errors. I’m really interested in contributing to this project—what would be the best way to get involved? Let me know how I can help! |
Beta Was this translation helpful? Give feedback.
-
|
Hi @Sukh-P , I am familiar with Machine Learning, Deep Learning and Data Preprocesssing, I have worked with Pytorch and Pandas as well. I have actively contributed to Open Source projects, completed Hacktoberfest and won a Open Source Contribution Contest. I am currently researching about TabPFN and how it can be incorporated in this project. I'd like to know more about the GSoC preparation and selection process. Are they any tasks that need to be completed or contributions that need to be made? Is there anything else which I have missed and should work on? Thanks, |
Beta Was this translation helpful? Give feedback.
-
|
Dear @Sukh-P , My name is Markus and I’m a Computer Science student at Radboud University in Netherlands. I’m new to contributing to open source software, but this project sounds like an exciting challenge. Since I’m new to GSoC, I’d appreciate any insights into the organizational side of the program. For example, how are project milestones, communication, and progress tracking typically managed? Looking forward to collaborating! Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Dear @Sukh-P , I am Arya Datla, a freshman majoring in CS and Math at Cornell University. I'm deeply passionate about solving climate change problems using computing, and while exploring projects in that area, I saw your project strongly aligned with my interest. Apart from my previous experience working with Python, pandas, and time-sequence models, which I have proposed for smart energy grids. I am working on an AI Chatbot for the Cornell energy team working on improving Cornell's Grid Efficiency as part of the Engineering for a Sustainable World (ESW) Project team. Specific to your project and the conversations I have seen, I was fascinated to explore TabPFN and utilize it to improve the performance of the error adjuster. Some of my initial thoughts and questions are mainly about the current adjuster's reliance on linear patterns via averaging, but TabPFN could help capture more complex relationships. Moreover, richer datasets, such as seasonal information, could be used to train the model beyond just relying on the MAE. Of course, as discussed heavily, the zero-shot capabilities of TabPFN would make it a great way to capture errors and improve the model. Some of the initial questions I pondered were I will continue to read through more on the topic and share more thoughts, but I also look forward to your guidance at your leisure, Best Regards, |
Beta Was this translation helpful? Give feedback.
-
|
Dear @Sukh-P I hope you are doing well. My name is Han, and I am a graduate student at University College London (UCL), studying Energy Systems and Data Analysis. I am very interested in this Google Summer of Code 2025 project. Some of my relevant experience includes: Machine Learning and Deep Learning: I have a background in machine learning, deep learning that I have worked extensively with time series forecasting, energy data analysis and prediction, and geo-spatial data processing and visualization and data visualization. I work with Keras, TensorFlow frequently in my model and am familiar with them. I have read the guidance and some tutorial videos from the reference link on the GSOC Idea website, but I am still a little confused about which step I should finish before applying. like do I need to write a real model of my idea or just try to list our idea? I wonder what contributions I can make beforehand to strengthen my application and how to file the application form of GSoC. I appreciate your time and look forward to your reply. Best regards |
Beta Was this translation helpful? Give feedback.
-
|
Subject: Interest in Contributing to the TabPFN Integration Project Hi @Sukh-P, I’m Dakshbir Singh, a Computer Science student at NSUT with a strong background in Machine Learning, Deep Learning, and timeseries forecasting. I’m highly interested in contributing to the TabPFN Integration Project under OpenClimateFix for GSoC 2025. To get started and better understand the codebase, I would love to take on some intermediate-level, high-priority issues that need to be solved. It would help me gain familiarity with the existing framework and contribute meaningfully. Looking forward to your guidance on how I can begin contributing. Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi @Sukh-P, I am a first year CS students in BITS Pilani, and I am learning ML and python and this projects aligns with the the things I know and want to further work on. I get that the TabPFN will be a better model than the existing one as it would have been already trained on lots of data and I am really enthusiastic to work on this for you in GSoC. I just wanted to know what we can do now to increase our chance of getting selected in GSoC or anything you guys are looking in particular. Best Regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hello @Sukh-P ! I’m Vishal S, a second-year Computer Science student at SSN College with experience in machine learning, data processing, and time series forecasting. I’m proficient in NumPy, pandas, scikit-learn, and transformers, and I’ve worked with ARIMA models for time series analysis. I’m excited about the opportunity to contribute to this project as part of GSoC 2025 and would love to understand how I can get started. To familiarize myself with the project, I have a few questions. First, where can I learn more about the existing codebase and its structure? What’s the best way for new contributors to demonstrate their open-source experience and get involved? Would you recommend starting with a specific first issue or small contribution to gain hands-on experience with the project? Regarding TabPFN integration and optimization, the current adjuster relies on historical error aggregation, interpolation, and smoothing. Beyond replacing these steps, are there other aspects where TabPFN could enhance accuracy or efficiency? From your perspective, what are some key challenges in transitioning from the rules-based adjuster to a TabPFN model? Are there any constraints related to interpretability or explainability that should be considered when integrating TabPFN? On the topic of performance, experimentation, and evaluation, are there specific latency or computational constraints that the new model must adhere to? How do you manage dataset versioning and experiment tracking to ensure measurable improvements in forecasting performance? What are the key evaluation metrics or baselines used to compare TabPFN’s effectiveness against the existing adjuster? I’m eager to contribute to this project as part of GSoC 2025 and appreciate any guidance on how to get started. Looking forward to your insights! P.S Sorry if im asking stuff too late, i couldnt fixate on any orgs and i just discovered this a day ago :) Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hello Sukh-P and Team, Thank you for the detailed project description. I’m really excited about the opportunity to contribute to the OCF Quartz Solar forecasting model. I have a strong background in machine learning, time series forecasting, and data analysis. My experience with tools like Pandas, NumPy, and Scikit-Learn aligns well with the project requirements. I’ve already gone through the adjuster.py code and explored the TabPFN repository to understand its capabilities. I’m particularly interested in analyzing how TabPFN can enhance the solar forecast adjustment compared to the existing rule-based method. I’d be happy to discuss any further questions or clarifications you might have regarding my understanding of the project. Looking forward to contributing and making a meaningful impact on the project. Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi @Sukh-P ! Thanks for all the helpful info so far. Is there a small example dataset or something simple I can start with to test TabPFN before trying it on the full system? |
Beta Was this translation helpful? Give feedback.
-
|
Hi @Sukh-P, I am excited to contribute to this project. After examining the code while considering potential implementation hurdles. I seek your professional guidance on several matters:
I want to gain further knowledge about these components to help achieve successful project outcomes. I look forward to collaborating! |
Beta Was this translation helpful? Give feedback.
-
|
Hello Mr. @Sukh-P! I'm Sayura Lokhande, a freshman at Rose-Hulman Institute of Technology. I wanted to express interest in this project and also ask (if this is the right place to) of what other ways we can contribute to Open Climate Fix besides GSoC. I feel that OPC's mission speaks to me and the organization aligns with my future goals, with the way it uses computational models to solve climate problems. So I hope to continue interacting with it beyond this program. Have a nice day! |
Beta Was this translation helpful? Give feedback.
-
|
Hi @Sukh-P, I'm interested in working on this project! I have experience in ML, Python, and data processing, and I’ve worked on projects involving predictive modeling. I previously built an LSTM model for time series forecasting where I predicted the NVIDIA stock price, which gave me hands-on experience with sequence-based prediction techniques. I'm also a Computer Science and Statistics major at UNC, which allows me to analyze and model complex datasets effectively. I'm excited to explore whether TabPFN can improve solar forecasting adjustments compared to rules-based methods. I'm looking forward to getting involved! |
Beta Was this translation helpful? Give feedback.
-
|
Hello @Sukh-P , I'm Prajakta, an aspiring GSoC contributor with experience in machine learning, Python, and data-driven modeling. I’ve worked with tabular data, predictive modeling, and model evaluation, which makes me excited about this project. I had a few questions to better understand its scope:
2.Would optimizing TabPFN’s inference time be a priority, and what constraints should we consider?
Looking forward to your insights. Thanks for your time! Best, |
Beta Was this translation helpful? Give feedback.
-
|
Respected sir/mam, |
Beta Was this translation helpful? Give feedback.
-
|
Hello @Sukh-P ! Super stoked about the "Adjuster this! TabPFN" project for GSoC 2025! Really digging the idea of using TabPFN to give your solar forecasting a boost. I've been playing around with time series forecasting and even got some experience using TabPFN on tabular data, so I'm keen to see how we can make it work here. Just a few quick questions to get the ball rolling:
Looking forward to hearing from you and hopefully getting started! |
Beta Was this translation helpful? Give feedback.
-
|
Hello @Sukh-P, I am a second-year CS PhD student and am very interested in AI and machine learning, especially for contributing to important issues such as climate change. I have taken Andrew Ng’s Machine Learning and Deep Learning online course specializations, and am currently taking ML in graduate school, where I am actively working on a project focused on predicting household energy usage based on historical consumption patterns. Given my familiarity with Python, and tools like pandas, sci-kit learn, and PyTorch, I am very interested in contributing to this project. I’ve been digging into the current rules-based adjuster for the OCF Quartz solar forecasting model and I really like the idea of improving it with a learned model like TabPFN. I understand that the current method averages errors over the last week by horizon and time of day, which is solid for steady patterns but might underperform during fast-changing conditions like sudden cloud cover. I think it will be very interesting to see if TabPFN can better handle those rapid changes in a zero-shot setting. If it’s not too late for me to ask, I’d love to hear what you think about the following questions I have:
Thanks and cheers, |
Beta Was this translation helpful? Give feedback.
-
|
Hi Sukh, I'm very interested in working on the "Adjuster this! TabPFN as a replacement for the adjuster" project for GSoC 2025. I'm at the start of a long journey to transition into data science. I'm comfortable with Python, Pandas, and learning new tools quickly. Thank you, |
Beta Was this translation helpful? Give feedback.
-
|
Hi @Sukh-P, I’m Gianluca Ferro, and I graduated with a master’s degree in Electronic Engineering. I’m now a research fellow at ciparlabs, working on artificial intelligence for smart grids and renewable energy communities. My master’s thesis focused on time series forecasting, where I developed a framework of models (including gradient boosting, statistical methods, and neural networks) to predict energy consumption and production from renewable sources. I find your TabPFN-based adjuster project extremely interesting because it aligns perfectly with my interest in applying advanced ML solutions to improve solar forecasting. I’m eager to experiment with integrating TabPFN into an existing pipeline and train models using your data, aiming to enhance the overall forecast accuracy. A couple of questions I had in mind: Since TabPFN is a tabular-focused model, do you have any specific strategy in place for capturing the temporal structure of forecasting errors, or would we primarily rely on feature engineering to introduce time-related variables? Are there any real-time or near real-time constraints for the adjuster model’s inference process, and if so, how might we optimize TabPFN’s performance for such scenarios? I look forward to your feedback and the possibility of contributing to this project. Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi Sukh, I’ve just submitted my GSoC 2025 proposal for the project titled “Improving Solar Forecast Accuracy Using TabPFN for Dynamic Adjustment.” The project explores using TabPFN to replace the current rules-based adjuster in the Quartz solar forecasting pipeline. Here’s a brief summary: Looking forward to the opportunity to contribute. |
Beta Was this translation helpful? Give feedback.
-
|
Hey Sukh P Sir! I’m also actively contributing to the project right now, exploring different approaches to improve forecast adjustments using machine learning. It's been an exciting learning experience so far! Would love to hear your thoughts or any advice you might have! |
Beta Was this translation helpful? Give feedback.
-
Google Summer of Code 2025 applications are now closed.We are currently reviewing all applications. Contributors will be announced 8 May 2025. Thank you! |
Beta Was this translation helpful? Give feedback.
-
|
Hi @Sukh-P I am probably late to the party. But hi. I am very interested in this project and I'm considering contributing to it in GSOC'26. So, can you give me the 101s about the project and where I should start |
Beta Was this translation helpful? Give feedback.
-
|
Hey @Sukh-P , this project is really nice and I am keen towards working for this project, I will be thankful if you can share something which is must knowing before understanding with the code. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
This space is for you to ask any questions you have about this project. We're here to provide clarifications and help you understand the project's goals, scope, and requirements. Feel free to ask about anything that interests you!
Please note that this discussion thread is for questions and clarifications, not for formal applications, formal applications are done via Google Summer of Code https://summerofcode.withgoogle.com/.
Project Description
For the OCF Quartz Solar forecasting model, we have a simple “adjuster” model. It currently looks at the pattern of errors in the last week, and adjust the new forecast. We would like to experiment with a new foundational timeseries forecasting model, TabPFN, with the planned outcome to dynamically adjust our PV forecasting model based on historical errors and improve the forecasting skill. Can compare to current rules based averages method used.
Expected Outcome
An answer to whether a pretrained ML model such as TabPFN can provide better adjustment of solar forecasts than rules based methods. If yes, then the plan would be to incorporate this into OCF's production system.
Other Key Information
Expected Size: 90hrs (small)
Skills: ML knowledge + familiarity with ML tools, timeseries forecasting, working with tabular data + some data processing skills required e.g. pandas
Difficulty level: Easy
Related Reading: https://github.com/openclimatefix/india-forecast-app/blob/main/india_forecast_app/adjuster.py, https://github.com/PriorLabs/TabPFN
Potential mentors: @Sukh-P
Beta Was this translation helpful? Give feedback.
All reactions