Skip to content

Commit 25b1f93

Browse files
committed
store files
2 parents 5a04389 + 3bb5b73 commit 25b1f93

File tree

10 files changed

+165
-26
lines changed

10 files changed

+165
-26
lines changed

_pages/sprints.md

Lines changed: 27 additions & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -9,39 +9,42 @@ author_profile: false
99

1010

1111
- 2022
12-
- [WiMLDS Paris](https://scikit-learn.fondation-inria.fr/wimlds-paris-sprint-and-contribution-workshop/) (March 2022)
12+
- Salta, Argentina: [SciPy Latin America](https://pythoncientifico.ar) (Sep 2022)
13+
- Basel, Switzerland: [EuroSciPy 2022](https://www.euroscipy.org/2022/index.html) (Sep 2022)
14+
- Dublin, Ireland: [EuroPython 2022](https://ep2022.europython.eu/mentored-sprint) (Jul 2022)
15+
- Paris, France: [WiMLDS Paris](https://scikit-learn.fondation-inria.fr/wimlds-paris-sprint-and-contribution-workshop/) (Mar 2022)
1316

1417
- 2021
15-
- [Fujitsu](https://www.fujitsu.com/global/about/research/article/202111-devsprint2021a.html) (Dec 2021)
16-
- Data Umbrella [Africa & Middle East Sprint](https://blog.dataumbrella.org/data-umbrella-afme2-2021-scikit-learn-sprint-report) (Oct 2021)
17-
- Data Umbrella [Latin America Sprint](https://blog.dataumbrella.org/data-umbrella-latam-2021-scikit-learn-sprint-report) (Jun 2021)
18-
- Data Umbrella [Africa & Middle East Sprint](https://blog.dataumbrella.org/data-umbrella-afme1-2021-scikit-learn-sprint-report) (Feb 2021)
19-
- [Fujitsu Research](https://www.fujitsu.com/global/about/research/article/202104-devsprint.html) (May 2021)
18+
- Japan: [Fujitsu Research](https://www.fujitsu.com/global/about/research/article/202111-devsprint2021a.html) (Dec 2021)
19+
- Online: [Data Umbrella Africa & Middle East Sprint](https://blog.dataumbrella.org/data-umbrella-afme2-2021-scikit-learn-sprint-report) (Oct 2021)
20+
- Online: [Data Umbrella Latin America Sprint](https://blog.dataumbrella.org/data-umbrella-latam-2021-scikit-learn-sprint-report) (Jun 2021)
21+
- Online: [Data Umbrella Africa & Middle East Sprint](https://blog.dataumbrella.org/data-umbrella-afme1-2021-scikit-learn-sprint-report) (Feb 2021)
22+
- Japan: [Fujitsu Research](https://www.fujitsu.com/global/about/research/article/202104-devsprint.html) (May 2021)
2023

2124
- 2020
22-
- [EuroPython 2020 Sprints](https://wiki.python.org/moin/EuroPython2020/Sprints) (Jul 2020)
23-
- [SciPy 2020](https://www.scipy2020.scipy.org/sprints-schedule) (Jul 2020)
24-
- Data Umbrella + NYC PyLadies [First ONLINE sprint](https://tinyurl.com/sklearn-online) (Jun 2020)
25+
- Online: [EuroPython 2020 Sprints](https://wiki.python.org/moin/EuroPython2020/Sprints) (Jul 2020)
26+
- Online: [SciPy 2020](https://www.scipy2020.scipy.org/sprints-schedule) (Jul 2020)
27+
- Online: [Data Umbrella First Online Global Sprint](https://tinyurl.com/sklearn-online) (Jun 2020)
2528
- Paris, France: [Sprint of the Decade](https://github.com/scikit-learn/scikit-learn/wiki/Paris-scikit-learn-Sprint-of-the-Decade) (Jan 2020)
2629
- Berlin, Germany: [WiMLDS sprint](https://github.com/WiMLDS/berlin-2020-scikit-sprint) (Jan 2020)
2730

2831
- 2019
29-
- London: [ManAHL hackathon](https://www.man.com/hackathon2019) (Nov 2, 2019)
32+
- London, UK: [ManAHL hackathon](https://www.man.com/hackathon2019) (Nov 2, 2019)
3033
- San Francisco, CA: [WiMLDS sprint](http://wimlds.org/opensourcesprints-2/bay-area-scikit-sprint-2019/) (Nov 2019)
31-
- New York, NY: [WiMLDS sprint](http://wimlds.org/opensourcesprints-2/nyc-scikit-sprint-2019/) (Aug 24, 2019)
32-
- SciPy: [Austin, TX](https://www.scipy2019.scipy.org/sprints) (Jul 13-14, 2019)
34+
- New York, NY: [NYC WiMLDS](http://wimlds.org/opensourcesprints-2/nyc-scikit-sprint-2019/) (Aug 24, 2019)
35+
- Austin, TX: [SciPy](https://www.scipy2019.scipy.org/sprints) (Jul 13-14, 2019)
3336
- Nairobi, Kenya: [WiMLDS sprint](http://wimlds.org/nairobi-scikit-sprint-2019/) (Jun 22, 2019)
34-
- Paris: [core sprint, for advanced contributors](https://scikit-learn.fondation-inria.fr/en/scikit-learn-sprint-in-paris/) (Feb)
37+
- Paris, France: [core sprint, for advanced contributors](https://scikit-learn.fondation-inria.fr/en/scikit-learn-sprint-in-paris/) (Feb)
3538
- 2018
36-
- WiMLDS: [New York City](https://reshamas.github.io/highlights-from-the-2018-NYC-WiMLDS-scikit-sprint) (Sep)
37-
- SciPy: [Austin](http://gael-varoquaux.info/programming/sprint-on-scikit-learn-in-paris-and-austin.html) (open sprint, for new contributors) (Jul)
38-
- Paris: core sprint, for advanced contributors (Jul)
39-
- Two Sigma: [New York City](https://twitter.com/amuellerml/status/1007670849774784512) (Jun)
40-
- UC Berkeley: [Berkeley](https://github.com/scikit-image/scikit-image/wiki/UC-Berkeley-(BIDS)-sprint,-May-28-Jun-2-2018)(May 28 to Jun 2)
41-
- ManAHL: London (April 21-22, 2018)
39+
- New York, NY: [NYC WiMLDS](https://reshamas.github.io/highlights-from-the-2018-NYC-WiMLDS-scikit-sprint) (Sep 2018)
40+
- Austin, TX: [SciPy](http://gael-varoquaux.info/programming/sprint-on-scikit-learn-in-paris-and-austin.html) (open sprint, for new contributors) (Jul 2018)
41+
- Paris, France: core sprint, for advanced contributors (Jul 2018)
42+
- New York, NY: [Two Sigma](https://twitter.com/amuellerml/status/1007670849774784512) (Jun 2018)
43+
- Berkeley, CA: [UC Berkeley](https://github.com/scikit-image/scikit-image/wiki/UC-Berkeley-(BIDS)-sprint,-May-28-Jun-2-2018)(May 28 to Jun 2)
44+
- London, UK: ManAHL (April 21-22, 2018)
4245
- 2017
43-
- [Paris](http://gael-varoquaux.info/programming/scikit-learn-paris-sprint-2017.html) (Jun)
44-
- WiMLDS: [New York City](https://github.com/WiMLDS/scikit-sprint-nyc-2017/blob/master/README.md) (Mar)
46+
- [Paris, France](http://gael-varoquaux.info/programming/scikit-learn-paris-sprint-2017.html) (Jun)
47+
- New York, NY: [NYC WiMLDS](https://github.com/WiMLDS/scikit-sprint-nyc-2017/blob/master/README.md) (Mar)
4548
- 2016
4649
- SciPy: [Austin](https://scipy2016.scipy.org/ehome/146062/332969/) (Jul)
4750
- 2015
@@ -67,9 +70,9 @@ author_profile: false
6770
- Paris, Logilab (Apr)
6871
- Boston MIT
6972
- 2010
70-
- Paris (Sep)
71-
- Paris (Jun)
72-
- Paris (Mar)
73+
- Paris, France: (Sep 2010)
74+
- Paris, France: (Jun 2010)
75+
- Paris, France: (Mar 2010)
7376
- **first release of scikit-learn** (Feb)
7477

7578
## References

_posts/2022-07-13-sprints-value.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,7 @@ Sprints are **working sessions to contribute to an open source library**. The go
3030

3131
The [scikit-learn](https://scikit-learn.org/dev/index.html) project has a long and extraordinary legacy of open source sprints. Since 2010, when its [first public version](https://en.wikipedia.org/wiki/Scikit-learn) was released, there have been as many as [45 sprints organized](https://blog.scikit-learn.org/sprints/). The number 45 is a lower bound, since there are likely more sprints that have not been listed.
3232

33-
To date, more than 2400 people have contributed to [scikit-learn](https://github.com/scikit-learn/scikit-learn). The number of contributors to scikit-learn exceeds those of other related libraries such as numpy, scipy and matplotlib, with the exception of the [pandas](https://github.com/pandas-dev/pandas), which has a greater number of contributors (See Appendix A).
33+
To date, more than 2400 people have contributed to [scikit-learn](https://github.com/scikit-learn/scikit-learn). The number of contributors to scikit-learn exceeds those of other related libraries such as numpy, scipy and matplotlib, with the exception of [pandas](https://github.com/pandas-dev/pandas), which has a greater number of contributors (See Appendix A).
3434

3535
The public discourse on open source has expanded to explore topics of sustainability, funding models, and diversity and inclusion, to name a few. A *reasonable*, yet *difficult to answer* question that has been posed is:
3636
>*<span style="background-color: #CAE9F5;">
@@ -41,7 +41,7 @@ What is the effectiveness of sprint models and what is the long-term engagement
4141

4242
Due to technological limitations of GitHub and privacy concerns, we do not hold precise data on how many scikit-learn contributors connected to the project via a sprint. We have no formal data collection process which records statistics on how many sprint participants are recurring or information on their contributions to other open source projects or other long term positive ripple effects. A scientific look at the correlation between the number of sprints and contributors is beyond the scope of this article. What we *will examine* in this article are the **objectives, results and aspirations** of running the scikit-learn sprints.
4343

44-
<span style="background-color: #CAE9F5;">The queries from other open-source projects requesting guidance on sprints and diversity and inclusions have been increasing.</span> We share these experiences and lessons learned with the community, potential funders and open source project maintainers, particularly those projects which are nascent in their quest to build community, sustainability and diversity and inclusion.
44+
<span style="background-color: #CAE9F5;">The queries from other open-source projects requesting guidance on sprints and diversity and inclusion have been increasing.</span> We share these experiences and lessons learned with the community, potential funders and open source project maintainers, particularly those projects which are nascent in their quest to build community, sustainability and diversity and inclusion.
4545

4646
## Outline
4747

_posts/2022-09-29-salta-sprint.md

Lines changed: 54 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,54 @@
1+
---
2+
title: "scikit-learn Sprint in Salta, Argentina"
3+
date: September 29, 2022
4+
5+
categories:
6+
- Events
7+
tags:
8+
- Sprints
9+
- Community
10+
11+
postauthors:
12+
- name: Juan Martín Loyola
13+
website: https://jmloyola.github.io/
14+
image: juan-martin-loyola.jpeg
15+
---
16+
<div>
17+
{% include postauthor.html %}
18+
</div>
19+
20+
In September of 2022, the [SciPy Latin America](https://pythoncientifico.ar/) conference took place in Salta, Argentina.
21+
As part of the event, we organized a [scikit-learn sprint](https://pythoncientifico.ar/events/sprints/).
22+
The main idea was to introduce the participants to the open source world and help them make their first contribution.
23+
The sprint event was an in-person event.
24+
25+
<img src="https://pythoncientifico.ar/static/assets/images/scipy-la-2022_logo.png" alt="SciPy logo" width="50%" height="50%" >
26+
27+
## Schedule
28+
- September 27, 2022 - **Pre-sprint** - 10:00 to 12:00 hs (UTC -3)
29+
- September 28, 2022 - **Sprint** - 10:00 to 17:00 hs (UTC -3)
30+
31+
## Repository
32+
For more information in Spanish, [check this repository](https://github.com/jmloyola/sklearn-sprint-argentina-2022).
33+
You will find details about the event, instructions to set up the development environment, links with further information and tutorials, and an example git workflow to make a pull request for the project.
34+
35+
## Photos
36+
<figure>
37+
<img src="/assets/images/posts_images/sprint-salta-2022-1.jpg" alt="11 people standing behind some computers and 2 people projected in the screen" max-width="20%" max-height="20%" />
38+
<figcaption>
39+
Group photo of the SciPy Latin America sprint, Salta, Argentina, 2022. Sandra Meneses and Juan Martín Loyola are projected on the screen from a Zoom call. Photo credit: Lucía Torres.
40+
</figcaption>
41+
</figure>
42+
43+
<figure>
44+
<img src="/assets/images/posts_images/sprint-salta-2022-2.jpeg" alt="11 people coding in their computers" max-width="20%" max-height="20%" />
45+
<figcaption>
46+
Participants of the SciPy Latin America sprint working on their computers. Photo credit: Ariel Silvio Norberto Ramos.
47+
</figcaption>
48+
</figure>
49+
50+
## Acknowledgment
51+
These people made this sprint possible:
52+
- Ariel Silvio Norberto Ramos, one of the organizers of the SciPy Latin America,
53+
- [Data Umbrella](https://www.dataumbrella.org/), [one of the community partners of the event](https://twitter.com/ScipyLA/status/1573710649963724802), especially Sandra Meneses and Reshama Shaikh,
54+
- The mentors that helped run the sprint.
Lines changed: 77 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,77 @@
1+
---
2+
title: "scikit-learn and Hugging Face join forces"
3+
date: October 13, 2022
4+
5+
categories:
6+
- Updates
7+
- Community
8+
tags:
9+
- Open Source
10+
11+
featured-image: HFxsklearn.png
12+
13+
postauthors:
14+
- name: Lysandre Debut
15+
16+
website: https://github.com/LysandreJik
17+
image: "lysandre_debut.jpg"
18+
- name: François Goupil
19+
20+
website: https://github.com/francoisgoupil
21+
image: "francois_goupil.jpeg"
22+
---
23+
<div>
24+
<img src="/assets/images/posts_images/{{ page.featured-image }}" alt="">
25+
{% include postauthor.html %}
26+
</div>
27+
28+
29+
[Hugging Face](hf.co) is happy to announce that we're partnering with [scikit-learn](https://scikit-learn.org/stable/index.html) to further our support of the machine learning tools and ecosystem.
30+
31+
At Hugging Face, we've been putting a lot of effort into supporting deep learning, but we believe that machine learning as a whole can benefit from the tools we release. With statistical machine learning being essential in this field and scikit-learn dominating statistical ML, we're excited to partner and move forward together.
32+
33+
As of September 2022, the Hugging Face Hub already hosts nearly 4,000 tabular classification and tabular regression model checkpoints, and we strive for this trend to continue.
34+
35+
<div>
36+
<video preload="auto" autoplay loop muted="muted" volume="0">
37+
<source src="/assets/videos/HFxsklearn.mp4" type="video/mp4">
38+
</video>
39+
</div>
40+
41+
## Support to the scikit-learn consortium
42+
43+
Starting June 2022, Hugging Face is now an official sponsor of the scikit-learn consortium . Through this support, Hugging Face actively promotes the development and sustainability of sklearn. As a sponsor of the scikit-learn consortium hosted at the Inria foundation, we'll now participate in the scikit-learn consortium technical committee
44+
45+
## Development support
46+
To help  sustaining the development of the library , we're happy to welcome Adrin Jalali and Benjamin Bossan to the Hugging Face team. Adrin is a core developer of scikit-learn as well as [fairlearn](fairlearn.org), while Benjamin is the author of the [skorch](https://github.com/skorch-dev/skorch) library and is now a contributor to scikit-learn.
47+
48+
Hugging Face is happy to support the development of scikit-learn through code contributions, issues, pull requests, reviews, and discussions.
49+
50+
## Integration to and from the Hugging Face Hub
51+
52+
["Skops"](https://github.com/skops-dev/skops) is the name of the framework being actively developed as the link between the scikit-learn and the Hugging Face ecosystems. With Skops, we hope to facilitate essential workflows:
53+
54+
- The ability to push scikit-learn models on the Hugging Face Hub
55+
- The possibility to try out models directly in the browser
56+
- The automatic creation of model cards, to improve model documentation and understanding
57+
- The ability to collaborate with others on machine learning projects
58+
59+
### Snapshot of your work
60+
61+
Working at the intersection of scikit-learn and the Hub offers challenges linked to the two platforms. One of these challenges is secure persistence: the ability to serialize models in a secure, safe manner.
62+
63+
scikit-learn models (estimators, predictors, ...) are usually saved using pickle, which is notorious for not being a secure format. Sharing scikit-learn models in this format exposes receivers to potentially malicious data which could execute arbitrary code when run.
64+
65+
That's where secure persistence comes in: as the Hugging Face Hub aims to provide a platform for models, the ability to share safe, secure objects is essential. We've been working on adding secure persistence for scikit-learn models in [skops#128](https://github.com/skops-dev/skops/pull/128) and [skops#145](https://github.com/skops-dev/skops/pull/145)([doc preview](https://skops--145.org.readthedocs.build/en/145/persistence.html)). Instead of serializing using pickle, the object's contents are put into a zip file with an accompanying schema JSON file.
66+
67+
Read about the Skops library in the following blog post: [Introducing Skops](https://huggingface.co/blog/skops).
68+
69+
## Improving interoperability
70+
71+
Skops is an example of an integration of scikit-learn within our tools, but it is not the only example! We will strive to integrate with the rest of our ecosystem so that Hugging Face users may benefit from using scikit-learn tools and vice-versa.
72+
73+
An example is the `evaluate` library, dedicated to efficiently evaluating machine learning models and datasets. We aim for this tool to natively support [scikit-learn metrics](https://github.com/huggingface/evaluate/issues/297) in its API.
74+
75+
---
76+
77+
Through these efforts, we hope to kickstart a lasting relationship between the two ecosystems and provide simple, efficient bridges to lower the barrier of entry. We believe that educating and sharing models is the best way to foster inclusive machine learning from which all can benefit. We're excited to partner with scikit-learn for this endeavor.

assets/css/main.scss

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -27,3 +27,8 @@ html {
2727
position: relative;
2828
font-size: 22px;
2929
}
30+
31+
video {
32+
width: 100% !important;
33+
height: auto !important;
34+
}
56.8 KB
Loading
87.4 KB
Loading
2.77 MB
Loading
99.8 KB
Loading

assets/videos/HFxsklearn.mp4

481 KB
Binary file not shown.

0 commit comments

Comments
 (0)