Ability to filter on modality and dataset size #47
shamikbose
started this conversation in
Ideas
Replies: 1 comment 7 replies
-
That's a great idea. We thought of having a difficulty flag for datasets which could also be partially about dataset size. @cakiki @albertvillanova @clancyoftheoverflow WDYT -- the difficulty rating is probably a little bit harder to determine objectively. We maybe want to include the size info (where the data is in a repository/easily found) so people have a sense of which datasets are easier to work with. We could then encourage people to use the issue itself for more detailed comments about potential challenges i.e. complex data/APIs to work with. |
Beta Was this translation helpful? Give feedback.
7 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
@davanstrien @cakiki
As someone working on this in their spare time, I think it might be helpful to add labels to the datasets indicating their size and modalities. With limited compute, some people might be able to tackle smaller text datasets whereas others with large compute and time might be able to handle large multimodal datasets. I'd be happy to add the labels to the existing datasets if this is something others would like to see as well
In terms of labels, I was thinking of something along these lines
Beta Was this translation helpful? Give feedback.
All reactions