|
| 1 | +--- |
| 2 | +type: "project" # DON'T TOUCH THIS ! :) |
| 3 | +date: "2025-06-18" # Date you first upload your project. |
| 4 | +# Title of your project (we like creative title) |
| 5 | +title: "Images10K Compendium" |
| 6 | + |
| 7 | +# List the names of the collaborators within the [ ]. If alone, simple put your name within [] |
| 8 | +names: [Sara Barbu, Lune Bellec] |
| 9 | + |
| 10 | +# Your project GitHub repository URL |
| 11 | +github_repo: https://github.com/brainhack-school2025/Barbu-Images10K |
| 12 | + |
| 13 | +# If you are working on a project that has website, indicate the full url including "https://" below or leave it empty. |
| 14 | +website: https://sarabarbu.github.io/Images10k-compendium/ |
| 15 | + |
| 16 | +# List +- 4 keywords that best describe your project within []. Note that the project summary also involves a number of key words. Those are listed on top of the [github repository](https://github.com/brainhack-school2020/project_template), click `manage topics`. |
| 17 | +# Please only lowercase letters |
| 18 | +tags: [images, visualization, markdown, jupyter] |
| 19 | + |
| 20 | +# Summarize your project in < ~75 words. This description will appear at the top of your page and on the list page with other projects.. |
| 21 | + |
| 22 | +summary: "A web-based tool for exploring visual datasets across real-world categories using carousels, tables, and interactive views." |
| 23 | + |
| 24 | +# If you want to add a cover image (listpage and image in the right), add it to your directory and indicate the name |
| 25 | +# below with the extension. |
| 26 | +image: "Images10k_logo.png" |
| 27 | +--- |
| 28 | +<!-- This is an html comment and this won't appear in the rendered page. You are now editing the "content" area, the core of your description. Everything that you can do in markdown is allowed below. We added a couple of comments to guide your through documenting your progress. --> |
| 29 | + |
| 30 | +## Project definition |
| 31 | + |
| 32 | +### Background |
| 33 | + |
| 34 | +Many studies in cognitive neuroscience still use really simplified or artificial images, like objects on plain backgrounds.That can limit ecological validity and create a disconnect between what you see in the lab and real world perception. In contrast to that, Images10K provides naturalistic scenes, showing objects in their natural environments, which makes it easier to study how people understand visual scenes in real-world contexts. This approach fits with recent work that emphasizes the importance of using more realistic images in research (Hosu, Lin, Szirányi, & Saupe, 2019). |
| 35 | + |
| 36 | +This project is based on the Images10K dataset — containing over 8,000 naturalistic images, annotated by human participants on the Zooniverse platform. The overall goal is to provide a well-organized and richly annotated image set (8,382 images, 15 semantic categories) that can be reused for training visual recognition models in AI and neuroscience |
| 37 | + |
| 38 | +### Tools |
| 39 | + |
| 40 | +- **GitHub** for version control and to organize the project in a clear, collaborative, and shareable format. |
| 41 | +- **DataLad** to retrieve and manage the dataset in a reproducible way. |
| 42 | +- **Python scripts** to filter images by category, convert metadata formats, and prepare image paths for display. |
| 43 | +- **Jupyter Notebooks** to explore the metadata, test visualizations, and generate previews of the dataset. |
| 44 | +- **Dash Bootstrap Components** to build interactive UI elements like carousels and dropdown menus for browsing images. |
| 45 | +- **MyST Markdown** to structure the content of the interactive website and document the project cleanly within Jupyter Book. |
| 46 | +- **Ubuntu terminal** to navigate the file system, run scripts, and better understand how to work with files and folders at the command line. |
| 47 | + |
| 48 | +## Data |
| 49 | + |
| 50 | +The dataset includes: |
| 51 | +- Over 8,000 naturalistic images, stored in semantically organized folders based on object categories |
| 52 | +- Annotated labels for each image, including category, subcategory, and file path |
| 53 | +- High-level semantic classifications (e.g., living vs. non-living, natural vs. artificial) |
| 54 | +- Rich image-level metadata such as: |
| 55 | + - Number of participant annotations (via Zooniverse) |
| 56 | + - Inter-participant agreement scores |
| 57 | + - License information and usage permissions |
| 58 | + - Source URLs and author attributions |
| 59 | + |
| 60 | +### Deliverables |
| 61 | + |
| 62 | +- A structured GitHub repository with scripts, metadata, and documentation |
| 63 | +- Jupyter Notebooks for metadata preview and exploratory analysis |
| 64 | +- An interactive web interface built with Jupyter Book and MyST Markdown |
| 65 | +- Image carousels and scrollable metadata tables for category-based exploration |
| 66 | +- Downloadable metadata files hosted via Google Drive |
| 67 | + |
| 68 | +## Results |
| 69 | + |
| 70 | +### Tools I learned during this project |
| 71 | + |
| 72 | +Tools are listed above in the Tools section |
| 73 | + |
| 74 | +### Results |
| 75 | + |
| 76 | +#### Deliverable: |
| 77 | + |
| 78 | +##### Link to view google slide presentation |
| 79 | +[presentation](https://docs.google.com/presentation/d/1INdPO4mDrgXu64EogxEHda7Kbf1mZ-EG5l1t3ICp8UQ/edit?usp=sharing) |
| 80 | + |
| 81 | +##### Link to the Image10k-Compendium Website |
| 82 | +You can view the full project and explore the carousels here: |
| 83 | +[**Image10k Compendium Website**](https://sarabarbu.github.io/Images10k-compendium/) |
| 84 | + |
| 85 | + **Note** Only a limited set of images is included on the website due to GitHub storage and bandwidth constraints. |
| 86 | + |
| 87 | + **Note** The interactive carousels (Dash apps) **require Python to run**, so they won’t launch directly in the website view. |
| 88 | + |
| 89 | +##### To view the carousels interactively on the Website: |
| 90 | + |
| 91 | +1. **Clone this repository**: |
| 92 | + [Images10k-compendium](https://github.com/SaraBarbu/Images10k-compendium) |
| 93 | + |
| 94 | +2. **Install dependencies**: |
| 95 | + `pip install -r binder/requirements.txt` |
| 96 | + |
| 97 | +3. **Run both notebooks** in the `content/` folder: |
| 98 | + - `animated_being.ipynb` |
| 99 | + - `Objects.ipynb` |
| 100 | + |
| 101 | + |
| 102 | + |
| 103 | +## Conclusion and acknowledgement |
| 104 | + |
| 105 | +This project helped me learn how to organize data and build interactive visual tools using Dash and Jupyter. I also got more comfortable working with GitHub and sharing work online. |
| 106 | + |
| 107 | +Thanks to Lune, Marie, Cléo, and the whole BrainHack School team for all the help and support along the way! |
0 commit comments