Discussion: Advancing Brain Decoding and Cognitive Analysis: Leveraging Diffusion Models for Spatiotemporal Pattern Recognition in fMRI Data #60
Replies: 27 comments 1 reply
-
|
Introduction: I am Franchis N Saikia, currently working as a Data Analyst II in Walmart International Tech after graduating from Indian Institute of Technology, Guwahati (IITG) ‘23. I am a highly motivated individual seeking every opportunity to work on state-of-the-art projects. Previously, I have worked on the application of deep learning models to medical imaging data (MLP-UNet & an ongoing submission to MICCAI'25) and see this opportunity as an outlet for substantial research directed to understanding the functionality of the brain. Below I am sharing a few resources and summaries related to certain aspects of the project. Functional MRI (fMRI): A model neuroimaging method, mapping functional areas of the brain activated during a cognitive, motor, or other tasks. Mapping is performed either on the basis of a change in blood flow to a given area (perfusion) or on the basis of a change in blood oxygenation (the so-called BOLD effect). BOLD fMRI (named after the use of the BOLD effect) is the most common way today and has almost become synonymous with the more general name fMRI. DATASET: [OpenNeuro: Dataset bank] BOLD5000 dataset: Slow event related public fMRI dataset with 5000 images and stimuli mitigating the overlap of stimuli while showing images by utilising images from SUN, COCO and ImageNet respectively covering real-world indoor, outdoor scenes and objects in complex, real-world scenes. ~168 GB dataset Natural Scenes dataset: Large-scale fMRI dataset of 8 healthy adult subjects while viewed thousands of color natural scenes over 30-40 scan sessions. The corresponding research paper discussion a latent diffusion model can be found here. Generic object decoding: The dataset consists of preprocessed fMRI image cued against a total of 1200 ImageNet images. Two tests were performed: Image Presentation and Imagery Experiment and their fMRI patterns were noted. Preprocessing can be done according to the article Horikawa & Kamitani (2017) Generic decoding of seen and imagined objects using hierarchical visual features. Nat Commun and preprocessed data can be found here. Human Connectome Project (HCP): Private data, need access, details METHODOLOGY:
More methods: Semantic Brain Decoding, Sparse Masked Modeling |
Beta Was this translation helpful? Give feedback.
-
|
Hi, I am Prantik Deb an MS by Research student in CSE at IIIT Hyderabad working in the Cognitive Science Lab. My research spans medical imaging, large language models, and visual language models. As part of my coursework in Cognitive Science and AI course, I have been exploring brain encoding and decoding work—closely aligning with this project. I am familiar with datasets like BOLD5000 and NSD and am eager to contribute to research at the intersection of AI and neuroscience. E-mail: prantik.d@research.iiit.ac.in |
Beta Was this translation helpful? Give feedback.
-
|
Hi everyone, I'm Shiva (e0727167@u.nus.edu), a graduate of the National University of Singapore with a degree in Electrical Engineering, where I specialized in signal processing and machine learning. Currently, I'm publishing a paper on novel compression methods for EEG signals, which has given me hands-on experience with handling and analyzing complex neural data. I am interested to work on the "Advancing Brain Decoding and Cognitive Analysis" project. I'm excited about the potential to contribute to advancing brain decoding and cognitive analysis, and I look forward to discussing how I can help bring fresh perspectives to this initiative. I am sharing a few resources too, Foundational fMRI Connectivity and Parcellation Studies:
Data Preprocessing and Harmonization:
Temporal Dynamics and Connectivity Fluctuations:
Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi everyone, I'm Shubham Vishwakarma, a recent ECE graduate from DJSCE, currently working as a Data Scientist at a private company that provides data-driven solutions to clients. I have a strong interest in the intersection of AI and Neurology and am eager to contribute to the project "Advancing Brain Decoding and Cognitive Analysis: Leveraging Diffusion Models for Spatiotemporal Pattern Recognition in fMRI Data" Previously, I worked as a research intern at IIT Patna, where I focused on finding the optimal rank for fine-tuning LLMs in a federated setting and implemented a research paper using PyTorch. I have published a research paper related to Stable Diffusion and am currently working on another paper focused on improving noise variance using RL in federated setting. I am one of the co-authors of this paper, along with my university professor. |
Beta Was this translation helpful? Give feedback.
-
|
Hi everyone, I’m Karandeep Nanda, currently diving deep into the world of data science as a student in the MSc program at the University of Colorado Boulder. With a background in psychology and biological sciences, I've always been fascinated by the intersection of the brain, data, and technology. My previous work has involved applying machine learning techniques to healthcare and computational biology, and now I'm eager to explore the exciting potential of diffusion models in brain decoding. I’m particularly drawn to how these models can reveal spatiotemporal patterns in fMRI data and offer insights into the brain’s inner workings. I’m thrilled to be part of this conversation and can't wait to collaborate with all of you as we explore this cutting-edge field together. Here’s my GitHub and LinkedIn. Best, |
Beta Was this translation helpful? Give feedback.
-
|
Hey everyone, Recently came across this interesting paper on speech processing in the human brain. |
Beta Was this translation helpful? Give feedback.
-
Hello Everyone,I'm Niranjan Kumar Kishore Kumar, a Biomedical Engineering graduate currently pursuing my Master’s in Artificial Intelligence at Yeshiva University, New York. I have a strong passion for NeuroAI, computational neuroscience, and AI-driven biomedical research, making this project particularly exciting for me. My previous work includes:
Project Interests:For this project, I’m excited to apply diffusion models to fMRI data for spatiotemporal pattern recognition, leveraging U-Net, transformer-based architectures, and denoising probabilistic approaches. Although I have experience in PyTorch, signal processing, and multimodal AI, I am always eager to learn and refine my skills further. I am excited to collaborate, gain deeper insights into fMRI data modeling, and work closely with mentors and the community. Looking forward to this amazing learning experience! Best, |
Beta Was this translation helpful? Give feedback.
-
|
Hello everyone, Look forward to collaborate to this amazing project |
Beta Was this translation helpful? Give feedback.
-
|
Hi Dr. Mahmoudi and Dr. Kara, My name is Sadaf (Sarah) Draper, and I’m a graduate student in Computer Science with a background in data engineering, machine learning, and cloud computing. I recently completed a Master's thesis on optimizing solar energy efficiency using GRU, LSTM, and CNN models, and I’m deeply interested in applying deep learning to real-world scientific domains—especially in neuroscience and cognitive analysis. I came across your GSoC 2025 project, “Advancing Brain Decoding and Cognitive Analysis using Diffusion Models,” and I’m very excited by the opportunity to contribute. I’ve worked extensively with PyTorch, and I’m comfortable designing and training deep learning models, particularly for time-series data. I'm also intrigued by the use of diffusion models for spatiotemporal pattern recognition in high-dimensional data like fMRI scans. I’d love to get involved, learn more about your vision for this project, and begin discussing ideas for how I could contribute. I’ll begin exploring the forum and any relevant datasets or literature, and I’d be happy to start with smaller tasks or drafts if available. Looking forward to your guidance! Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hey everyone! I am Niyati Bisht, a 3rd year B.Tech student in Electronics and Telecommunication at Veermata Jijabai Technological Institute(VJTI), Mumbai. I am deeply passionate about Medical Image Processing in Machine Learning, particularly in the intersection of deep learning and neuro-imaging. I had the privilege of working as a Research Intern at the Medical Deep Learning and Artificial Intelligence Lab (MeDAL), IIT Bombay, where I focused on MRI-based segmentation using 2D U-Net. My work involved advanced feature extraction techniques, analyzing outputs from intermediate layers to enhance segmentation quality, improving both spatial resolution and feature representation. Beyond MRI segmentation, I have gained hands-on experience in Diffusion Models, Gaussian Noise, Vision Transformers (ViT), Swin Transformer, RNNs, and CNNs. I am proficient in Python, PyTorch and deep learning libraries relevant to medical imaging. Currently, I am exploring "Advancing Brain Decoding and Cognitive Analysis: Leveraging Diffusion Models for Spatiotemporal Pattern Recognition in fMRI Data." This project excites me because it aligns with my goal of pushing the boundaries of spatiotemporal analysis in neuro-imaging. My prior experience with MRI and diffusion-based models provides a strong foundation to contribute meaningfully to this project. Through this opportunity, I aim to refine my expertise in diffusion models for fMRI, explore innovative conditioning techniques for brain decoding, and work on real-world neuro-imaging applications that could aid cognitive analysis. I look forward to contributing my skills while learning from experts in the field. Warm Regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hey everyone! I am Saket, a final-year B.Tech student in Electronics and Telecommunication Engineering at Sardar Patel Institute of Technology (SPIT), Mumbai. My passion lies in Machine Learning for Medical Imaging, with a strong focus on deep learning, signal processing, and neuroimaging applications. I previously worked as a Research Intern at IIT Bombay, where I developed a deep learning-based contactless palmprint recognition system, improving biometric authentication accuracy. Additionally, I have worked on medical image analysis, including Lung Cancer Detection and Age-Related Macular Degeneration classification, using CNNs, ensemble learning, and GANs for dataset balancing. My experience extends to Diffusion Models, U-Net, Transformers, and probabilistic modeling, and I am proficient in Python, PyTorch, and TensorFlow. I am particularly excited about "Advancing Brain Decoding and Cognitive Analysis: Leveraging Diffusion Models for Spatiotemporal Pattern Recognition in fMRI Data", as it aligns with my goal of leveraging generative models for spatiotemporal neuroimaging analysis. Through this project, I aim to explore diffusion-based approaches for fMRI, refine conditioning techniques for cognitive state classification, and contribute to advancing brain decoding research. Looking forward to collaborating and learning from the community! Best Regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hello everyone, I'm a current postdoctoral researcher at Imperial College London, where I continue my work in numerical simulations for diffusion MRI in biological tissues following my PhD. My academic background spans Aerospace Engineering and Computational Biophysics, which has provided me with a strong foundation in mathematics, numerical methods, and the underlying physics of MRI. Currently, I am involved in projects that leverage transformers and deep generative models. In this project, I aim to explore diffusion-based models in brain fMRI, refine innovative conditioning techniques for cognitive state classification, and ultimately contribute to advancing brain decoding research. I want to deepen my expertise in diffusion models while enhancing our understanding of cognitive processes. I look forward to collaborating with and learning from experts in the community as we push the boundaries of neuroimaging research. |
Beta Was this translation helpful? Give feedback.
-
|
Hello all, My name is Prachi Kedar, and I'm nearing the completion of my Master's in AI at Politecnico di Milano. My journey includes three years of practical experience in AI/ML technologies. Currently, interning for Spanish National Research Council at Department of Functional and Systems Neurobiology , focusing on analyzing neural activity from calcium imaging recordings. Utilizing machine learning techniques, I'm working to correlate this activity with observed behavior. My core interest is in computer vision and image processing, and working on this project will provide me a valuable opportunity to delve deeper into the fascinating intersection of AI and biomedical neuroscience. I would like to propose below topic based on the given idea: Project Title: Brain Neural Activity Decoder by using Time-Varying Dependency Structures from fMRI using Graph Neural Network Have a summarize following steps which can be performed achieve this: 1. Data Preprocessing: Clean fMRI data, parcellate the brain into regions, and segment the time series into dynamic windows. Dataset : 1. Human Connectome Project (HCP): Good fit for high-resolution fMRI and rich behavioral data. I have also attached the link of some research methods which has been already implemented : 1. Spatio-Temporal Graph Convolution for Resting-State fMRI Analysis https://arxiv.org/pdf/2003.10613v3 Looking forward to collaborate on this challenging yet interesting project idea ! Best Regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hey everyone! I have a few queries regarding the project: 1. GPU Support: Will additional GPU resources be provided for higher computational requirements? This would significantly aid in progressing with the project. 2. Dataset Availability: @zeydabadi Looking forward to your response. Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi everyone, I’m excited to start this discussion thread by sharing some initial research on my GSoC project. Pattern recognition in functional MRI (fMRI) data is a fascinating challenge that could provide deeper insights into how our brain transitions between cognitive states during various tasks. Why This Matters My Approach I’d love to hear thoughts from the community! Looking forward to discussing and refining this approach with all of you! Best, |
Beta Was this translation helpful? Give feedback.
-
|
Has anybody tried downloading the dataset? It's too huge. I tried downloading BOLD5000 but it's unzipped file is nearly 512gb. The zipped file is 125gb. Do let me know if anybody has an alternative dataset with lesser size |
Beta Was this translation helpful? Give feedback.
-
|
Hello Everyone! My name is Clyde Villacrusis (github: clyde0513; email: clyde0513@g.ucla.edu) and I am a 3rd year UCLA Computer Science student, passionate in deep learning, AI, and leveraging skills with Python! I am currently interning at UCLA Health and have been working with AI such as ChatGPT models and Handwritten OCR APIs to convert unorganized blood pressure data into a cleaned, format that doctors can easily read. Thus, I am very excited to work on this new project on advancing brain decoding and cognitive analysis, as it is interesting to learn how spatiotemporal pattern recognition works in our brain and how to better analyze it! I have been reading a couple of research papers related to this project topic and here is what I have found to get started on this project! NeuralFlix: Reconstructing Vivid Videos from Human Brain Activity: -They do temporal interpolation and spatial masking for contrastive learning of fMRI representations and a diffusion model enhanced with dependent prior noise for generating videos. -Their methods consist of fMRI feature learning and video decoding two-phase framework for reconstructing videos from fMRI-recorded brain activities.
Their codebase is accessible right here:
These research papers are closely aligned with the challenging, but interesting project idea. I believe that with all the community has been proposing thus far, we can manage to collaborate on this project and complete it in an efficient, timely manner! Additionally, I also took a data science and deep learning in computer vision course, so I am familiar with data processing, augmenting data, training data, and fine-tuning a model. Here is my website to learn more about me and my experiences! https://clyde.at/. Lastly, I plan on collaborating on this project full-time in the summer! I hope to look forward on collaborating with you guys and feel free to contact me and/or reply to my message! |
Beta Was this translation helpful? Give feedback.
-
|
Hi everyone! I’m Riya Rahim, a student at IIT Madras with a deep interest in machine learning, data science, and open-source development. I’m particularly excited about the "Advancing Brain Decoding and Cognitive Analysis" project at Emory BMI for GSoC 2025 and eager to contribute. My experience includes working with Python, Flask, SQL, and deep learning, with a strong focus on data analysis and building ML-driven applications. Currently, I’m exploring fMRI data analysis using Nilearn and MNE-Python, along with diffusion models, to better understand spatiotemporal pattern recognition in brain imaging. I’m looking forward to engaging with the community, making meaningful contributions, and learning from experienced mentors. Any guidance on getting started, beginner-friendly issues, or relevant resources would be greatly appreciated. Excited to collaborate and be a part of this journey! Best regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hi @pradeeban @zeydabadi @monjoybme @anbhimi @abdelrahman725 , I have submitted an initial draft of the proposal for this project. Kindly give your valuable feedback before the deadline so that I am able to refine my proposal. Regards, |
Beta Was this translation helpful? Give feedback.
-
|
Hello mentors, @pradeeban @zeydabadi I wanted to know while submitting proposal what project size should I need to select ? Also , I have submitted my proposal it would be great if you could provide your feedback for the same . Thanks, |
Beta Was this translation helpful? Give feedback.
-
|
Hello, **Project Title:Advancing Brain Decoding and Cognitive Analysis: Leveraging Diffusion Models for Spatiotemporal Pattern Recognition in fMRI Data** Thank you. |
Beta Was this translation helpful? Give feedback.
-
Build a reproducible and lightweight fMRI preprocessing pipeline using Nilearn & NiBabel. Convert voxel-level BOLD sequences into structured time-series and brain graphs. Implement a compact U-Net + ConvLSTM-based DDPM model with limited timesteps and mixed-precision training. Integrate fairness-aware conditioning (e.g., age, gender, task) into generative modeling. Visualize counterfactual reconstructions with SHAP to improve interpretability. Add GNN module to model dynamic brain region interactions. Evaluate decoding and generalization using ABIDE and BOLD5000 subsets. Provide comprehensive documentation, training notebook, and annotated codebase.
8.1) Community Bonding Period (May 20 – June 16) Review recent work on fMRI, DDPMs, and graph-based neural decoding. Identify datasets (e.g., ABIDE, OpenNeuro, BOLD5000-mini) and secure access. Set up Colab/Google Cloud + lightweight PyTorch-based training environment. Finalize architecture design and experimental roadmap with mentors. 8.2) Development Phase Week 1 (June 17 – 23): Preprocess fMRI data: align, denoise, normalize. Convert to voxel-time tensor and brain graph representations. Week 2 (June 24 – 30): Build U-Net + ConvLSTM encoder-decoder. Implement DDPM training loop with 100–200 timesteps. Week 3 (July 1 – 7): Add conditioning layers (age/gender/task embeddings). Evaluate fairness-aware reconstructions. Week 4 (July 8 – 14): Integrate GNN to model dynamic brain region graphs. Combine graph features with voxel encoder. Week 5 (July 15 – 21): Apply SHAP to evaluate importance of conditioning variables. Generate counterfactual reconstructions (e.g., “older subject” scenario). Week 6 (July 22 – 28): Finalize model tuning, run ablations on fairness and graph inputs. Visualize spatiotemporal embeddings and prediction heatmaps. Week 7 (July 29 – August 4): Run full evaluation on unseen ABIDE/BOLD5000 subsets. Apply runtime optimizations for model deployment. Week 8 (August 5 – 11): Build visualization dashboard for user interaction with model outputs. Integrate results into reproducible scripts and versioned artifacts. Week 9 (August 12 – 18): Document training and evaluation setup. Record model walkthrough and inference demos. Week 10 (August 19 – 25): Final testing and cleanup. Push full repo and publish usage guide. Submit final report and project video. 8.3) Project Completion, Testing, and Documentation (August 26 – September 1) Final review and polish. Mentor feedback incorporation. Official GSoC submission.
Type: Large-size project (35 hours/week) Timezone: IST (UTC+5:30) Preferred Hours: 10:00 AM – 2:00 PM IST (weekdays), flexible based on mentor feedback
Programming: Python Neuroimaging: Nilearn, NiBabel, BIDS format, basic FSL ML/DL: CNNs, LSTMs, DDPMs, SHAP, Transformers, Graph Neural Networks |
Beta Was this translation helpful? Give feedback.
-
|
Hi! I’m Archanaa, a final-year B.Tech student specializing in Artificial Intelligence and Machine Learning. I’ve spent the last few years exploring the world of machine learning, deep learning, and computer vision through hands-on projects and research. Some of my most exciting work includes building sentiment analysis systems using BERT,Segmentation models, detecting deepfakes with GANs and Vision Transformers, and developing real-time disease detection tools for agriculture. I’ve also worked on optimizing AI models for deployment using OpenVINO and have published research papers in reputed conferences and journals. My technical toolbox includes Python, TensorFlow, PyTorch, Scikit-learn, and OpenCV, and I’m confident working with tools like GitHub, Google Colab, and Jupyter Notebooks. I’m particularly passionate about applying AI to solve real-world problems, especially those that impact people and the environment. My experience comes not only from coursework but also from internships, collaborative research, and self-driven learning. What excites me most about GSoC is the opportunity to work on meaningful open-source projects, learn from experienced mentors, and contribute to a larger community. I believe my curiosity, dedication, and ability to quickly pick up new technologies will help me be a valuable contributor this summer. I see this as more than just a learning opportunity — it’s a chance to be part of something bigger and make a lasting impact. 📧 Email: 17archanaas@gmail.com |
Beta Was this translation helpful? Give feedback.
-
|
Hi, I’m Priyanka Joshi, a B.Tech CSE (AI & ML) student at Sir Padampat Singhania University. I’m passionate about working at the intersection of AI, neuroscience, and healthcare. I’ve built an AI-powered medical assistant model integrating RAG and LLMs, and I’m deeply interested in brain encoding/decoding using datasets like BOLD5000 and NSD. I love exploring how large language models and visual-language systems can be applied in real-world medical and cognitive science domains. GSoC 2025 feels like the perfect platform to contribute meaningfully to open-source research, learn from the community, and grow as a researcher. E-mail: priyankajoshi2300@gmail.com |
Beta Was this translation helpful? Give feedback.
-
|
Hello everybody. I am Swaytha, a Computer Science Sophomore studying at Nanyang Technological University. I have had hands-on experience in deep learning projects in the past. Recently, I completed an internship where I utilized supervised and unsupervised learning techniques to detect and analyze specific operational conditions of home appliances. I am also currently interning at a lab where we are investigating vision language models for medical diagnosis. I have been exploring generative architectures and I am interested in this project as I am keen on applying diffusion models for reconstructing visual stimuli from brain signals. Email: vswaytha4@gmail.com Project Goals:
Project Timeline: Planned GSoC Hours: Skill Set:
I am currently learning about diffusion models and graph neural networks. |
Beta Was this translation helpful? Give feedback.
-
|
Hi everyone, |
Beta Was this translation helpful? Give feedback.
-
|
Hi, I’m P. Y. Rajkamal Tutu, an M.Tech Artificial Intelligence student at NIT Silchar, also pursuing a B.S. in Data Science and Applications from IIT Madras in parallel. I’m passionate about brain-computer interfaces, cognitive modeling, and the intersection of AI, neuroscience, and generative modeling. I’ve previously worked on explainable AI for Indic-language spam classification, and few other kaggle projects. I enjoy working with LLMs, visual-language models, and models that combine structure and reasoning in biological contexts. I believe GSoC 2025 is an ideal opportunity for me to contribute to high-impact open-source research, collaborate with domain experts, and grow technically and intellectually through mentorship and community engagement. E-mail: tutuponnekanty@gmail.com Thank you! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi everyone,
Starting this discussion thread by sharing some basic research into this new project. Pattern recognition in functional MRIs is a very exciting task at hand. It could help us understand a lot regarding how our brain changes state throughout a sequence of action/s which I am very curious about. It could potentially help us model “how we think”.
Beta Was this translation helpful? Give feedback.
All reactions