scikit-learn
diff --git a/_pages/sprints.md b/_pages/sprints.md
Lines changed: 27 additions & 24 deletions
diff --git a/_posts/2022-07-13-sprints-value.md b/_posts/2022-07-13-sprints-value.md
Lines changed: 2 additions & 2 deletions
diff --git a/_posts/2022-09-29-salta-sprint.md b/_posts/2022-09-29-salta-sprint.md
Lines changed: 54 additions & 0 deletions
diff --git a/_posts/2022-10-13-joining-forces-hugging-face.md b/_posts/2022-10-13-joining-forces-hugging-face.md
Lines changed: 77 additions & 0 deletions
diff --git a/assets/css/main.scss b/assets/css/main.scss
Lines changed: 5 additions & 0 deletions
diff --git a/assets/images/author_images/lysandre_debut.jpg b/assets/images/author_images/lysandre_debut.jpg
56.8 KB
diff --git a/assets/images/posts_images/HFxsklearn.png b/assets/images/posts_images/HFxsklearn.png
87.4 KB
diff --git a/assets/images/posts_images/sprint-salta-2022-1.jpg b/assets/images/posts_images/sprint-salta-2022-1.jpg
2.77 MB
diff --git a/assets/images/posts_images/sprint-salta-2022-2.jpeg b/assets/images/posts_images/sprint-salta-2022-2.jpeg
99.8 KB
diff --git a/assets/videos/HFxsklearn.mp4 b/assets/videos/HFxsklearn.mp4
481 KB
@@ -9,39 +9,42 @@ author_profile: false
 
 
 - 2022
-  - [WiMLDS Paris](https://scikit-learn.fondation-inria.fr/wimlds-paris-sprint-and-contribution-workshop/) (March 2022)
+  - Salta, Argentina: [SciPy Latin America](https://pythoncientifico.ar) (Sep 2022)
+  - Basel, Switzerland: [EuroSciPy 2022](https://www.euroscipy.org/2022/index.html) (Sep 2022)
+  - Dublin, Ireland: [EuroPython 2022](https://ep2022.europython.eu/mentored-sprint) (Jul 2022)
+  - Paris, France: [WiMLDS Paris](https://scikit-learn.fondation-inria.fr/wimlds-paris-sprint-and-contribution-workshop/) (Mar 2022)
 
 - 2021
-  - [Fujitsu](https://www.fujitsu.com/global/about/research/article/202111-devsprint2021a.html) (Dec 2021)
-  - Data Umbrella [Africa & Middle East Sprint](https://blog.dataumbrella.org/data-umbrella-afme2-2021-scikit-learn-sprint-report) (Oct 2021)
-  - Data Umbrella [Latin America Sprint](https://blog.dataumbrella.org/data-umbrella-latam-2021-scikit-learn-sprint-report)  (Jun 2021)
-  - Data Umbrella [Africa & Middle East Sprint](https://blog.dataumbrella.org/data-umbrella-afme1-2021-scikit-learn-sprint-report) (Feb 2021)
-  - [Fujitsu Research](https://www.fujitsu.com/global/about/research/article/202104-devsprint.html) (May 2021)
+  - Japan: [Fujitsu Research](https://www.fujitsu.com/global/about/research/article/202111-devsprint2021a.html) (Dec 2021)
+  - Online: [Data Umbrella Africa & Middle East Sprint](https://blog.dataumbrella.org/data-umbrella-afme2-2021-scikit-learn-sprint-report) (Oct 2021)
+  - Online: [Data Umbrella Latin America Sprint](https://blog.dataumbrella.org/data-umbrella-latam-2021-scikit-learn-sprint-report)  (Jun 2021)
+  - Online: [Data Umbrella Africa & Middle East Sprint](https://blog.dataumbrella.org/data-umbrella-afme1-2021-scikit-learn-sprint-report) (Feb 2021)
+  - Japan: [Fujitsu Research](https://www.fujitsu.com/global/about/research/article/202104-devsprint.html) (May 2021)
 
 - 2020
-  - [EuroPython 2020 Sprints](https://wiki.python.org/moin/EuroPython2020/Sprints) (Jul 2020)
-  - [SciPy 2020](https://www.scipy2020.scipy.org/sprints-schedule) (Jul 2020)
-  - Data Umbrella + NYC PyLadies [First ONLINE sprint](https://tinyurl.com/sklearn-online) (Jun 2020)
+  - Online: [EuroPython 2020 Sprints](https://wiki.python.org/moin/EuroPython2020/Sprints) (Jul 2020)
+  - Online: [SciPy 2020](https://www.scipy2020.scipy.org/sprints-schedule) (Jul 2020)
+  - Online: [Data Umbrella First Online Global Sprint](https://tinyurl.com/sklearn-online) (Jun 2020)
   - Paris, France:  [Sprint of the Decade](https://github.com/scikit-learn/scikit-learn/wiki/Paris-scikit-learn-Sprint-of-the-Decade) (Jan 2020)
   - Berlin, Germany: [WiMLDS sprint](https://github.com/WiMLDS/berlin-2020-scikit-sprint) (Jan 2020)
 
 - 2019
-  - London:  [ManAHL hackathon](https://www.man.com/hackathon2019) (Nov 2, 2019)
+  - London, UK:  [ManAHL hackathon](https://www.man.com/hackathon2019) (Nov 2, 2019)
   - San Francisco, CA:  [WiMLDS sprint](http://wimlds.org/opensourcesprints-2/bay-area-scikit-sprint-2019/) (Nov 2019)
-  - New York, NY:  [WiMLDS sprint](http://wimlds.org/opensourcesprints-2/nyc-scikit-sprint-2019/) (Aug 24, 2019)
-  - SciPy:  [Austin, TX](https://www.scipy2019.scipy.org/sprints) (Jul 13-14, 2019)  
+  - New York, NY:  [NYC WiMLDS](http://wimlds.org/opensourcesprints-2/nyc-scikit-sprint-2019/) (Aug 24, 2019)
+  - Austin, TX: [SciPy](https://www.scipy2019.scipy.org/sprints) (Jul 13-14, 2019)  
   - Nairobi, Kenya:  [WiMLDS sprint](http://wimlds.org/nairobi-scikit-sprint-2019/) (Jun 22, 2019)
-  - Paris:  [core sprint, for advanced contributors](https://scikit-learn.fondation-inria.fr/en/scikit-learn-sprint-in-paris/) (Feb)
+  - Paris, France:  [core sprint, for advanced contributors](https://scikit-learn.fondation-inria.fr/en/scikit-learn-sprint-in-paris/) (Feb)
 - 2018
-  - WiMLDS:  [New York City](https://reshamas.github.io/highlights-from-the-2018-NYC-WiMLDS-scikit-sprint) (Sep)
-  - SciPy:  [Austin](http://gael-varoquaux.info/programming/sprint-on-scikit-learn-in-paris-and-austin.html) (open sprint, for new contributors) (Jul)
-  - Paris:  core sprint, for advanced contributors (Jul)
-  - Two Sigma:  [New York City](https://twitter.com/amuellerml/status/1007670849774784512) (Jun) 
-  - UC Berkeley:  [Berkeley](https://github.com/scikit-image/scikit-image/wiki/UC-Berkeley-(BIDS)-sprint,-May-28-Jun-2-2018)(May 28 to Jun 2)
-  - ManAHL:  London (April 21-22, 2018)
+  - New York, NY: [NYC WiMLDS](https://reshamas.github.io/highlights-from-the-2018-NYC-WiMLDS-scikit-sprint) (Sep 2018)
+  - Austin, TX: [SciPy](http://gael-varoquaux.info/programming/sprint-on-scikit-learn-in-paris-and-austin.html) (open sprint, for new contributors) (Jul 2018)
+  - Paris, France:  core sprint, for advanced contributors (Jul 2018)
+  - New York, NY:  [Two Sigma](https://twitter.com/amuellerml/status/1007670849774784512) (Jun 2018) 
+  - Berkeley, CA: [UC Berkeley](https://github.com/scikit-image/scikit-image/wiki/UC-Berkeley-(BIDS)-sprint,-May-28-Jun-2-2018) (May 28 to Jun 2, 2018)
+  - London, UK: ManAHL (April 21-22, 2018)
 - 2017
-  - [Paris](http://gael-varoquaux.info/programming/scikit-learn-paris-sprint-2017.html) (Jun)
-  - WiMLDS: [New York City](https://github.com/WiMLDS/scikit-sprint-nyc-2017/blob/master/README.md) (Mar)
+  - [Paris, France](http://gael-varoquaux.info/programming/scikit-learn-paris-sprint-2017.html) (Jun)
+  - New York, NY: [NYC WiMLDS](https://github.com/WiMLDS/scikit-sprint-nyc-2017/blob/master/README.md) (Mar)
 - 2016
   - SciPy:  [Austin](https://scipy2016.scipy.org/ehome/146062/332969/) (Jul)
 - 2015
@@ -67,9 +70,9 @@ author_profile: false
   - Paris, Logilab (Apr)
     - Boston MIT
 - 2010
-  - Paris (Sep)
-  - Paris (Jun)
-  - Paris (Mar)
+  - Paris, France (Sep 2010)
+  - Paris, France (Jun 2010)
+  - Paris, France (Mar 2010)
   - **first release of scikit-learn** (Feb)
 
 ## References
 
@@ -30,7 +30,7 @@ Sprints are **working sessions to contribute to an open source library**. The go
 
 The [scikit-learn](https://scikit-learn.org/dev/index.html) project has a long and extraordinary legacy of open source sprints.  Since 2010, when its [first public version](https://en.wikipedia.org/wiki/Scikit-learn) was released, there have been as many as [45 sprints organized](https://blog.scikit-learn.org/sprints/). The number 45 is a lower bound, since there are likely more sprints that have not been listed. 
 
-To date, more than 2400 people have contributed to [scikit-learn](https://github.com/scikit-learn/scikit-learn).  The number of contributors to scikit-learn exceeds those of other related libraries such as numpy, scipy and matplotlib, with the exception of the [pandas](https://github.com/pandas-dev/pandas), which has a greater number of contributors (See Appendix A).
+To date, more than 2400 people have contributed to [scikit-learn](https://github.com/scikit-learn/scikit-learn).  The number of contributors to scikit-learn exceeds those of other related libraries such as numpy, scipy and matplotlib, with the exception of [pandas](https://github.com/pandas-dev/pandas), which has a greater number of contributors (See Appendix A).
 
 The public discourse on open source has expanded to explore topics of sustainability, funding models, and diversity and inclusion, to name a few.  A *reasonable*, yet *difficult to answer* question that has been posed is: 
 >*<span style="background-color: #CAE9F5;">
@@ -41,7 +41,7 @@ What is the effectiveness of sprint models and what is the long-term engagement
 
 Due to technological limitations of GitHub and privacy concerns, we do not hold precise data on how many scikit-learn contributors connected to the project via a sprint. We have no formal data collection process which records statistics on how many sprint participants are recurring or information on their contributions to other open source projects or other long term positive ripple effects.  A scientific look at the correlation between the number of sprints and contributors is beyond the scope of this article.  What we *will examine* in this article are the **objectives, results and aspirations** of running the scikit-learn sprints.
 
-<span style="background-color: #CAE9F5;">The queries from other open-source projects requesting guidance on sprints and diversity and inclusions have been increasing.</span> We share these experiences and lessons learned with the community, potential funders and open source project maintainers, particularly those projects which are nascent in their quest to build community, sustainability and diversity and inclusion.  
+<span style="background-color: #CAE9F5;">The queries from other open-source projects requesting guidance on sprints and diversity and inclusion have been increasing.</span> We share these experiences and lessons learned with the community, potential funders and open source project maintainers, particularly those projects which are nascent in their quest to build community, sustainability and diversity and inclusion.  
 
 ## Outline
 
 
@@ -0,0 +1,54 @@
+---
+title: "scikit-learn Sprint in Salta, Argentina"
+date: September 29, 2022
+
+categories:
+  - Events
+tags:
+  - Sprints
+  - Community
+
+postauthors:
+  - name: Juan Martín Loyola
+    website: https://jmloyola.github.io/
+    image: juan-martin-loyola.jpeg
+---
+<div>
+  {% include postauthor.html %}
+</div>
+
+In September of 2022, the [SciPy Latin America](https://pythoncientifico.ar/) conference took place in Salta, Argentina.
+As part of the event, we organized a [scikit-learn sprint](https://pythoncientifico.ar/events/sprints/).
+The main idea was to introduce the participants to the open source world and help them make their first contribution.
+The sprint was held in person.
+
+<img src="https://pythoncientifico.ar/static/assets/images/scipy-la-2022_logo.png" alt="SciPy logo" width="50%" height="50%" >
+
+## Schedule
+- September 27, 2022 - **Pre-sprint** - 10:00 to 12:00 (UTC-3)
+- September 28, 2022 - **Sprint** - 10:00 to 17:00 (UTC-3)
+
+## Repository
+For more information in Spanish, [check this repository](https://github.com/jmloyola/sklearn-sprint-argentina-2022).
+You will find details about the event, instructions to set up the development environment, links with further information and tutorials, and an example git workflow to make a pull request for the project.
+
+## Photos
+<figure>
+ <img src="/assets/images/posts_images/sprint-salta-2022-1.jpg" alt="11 people standing behind some computers and 2 people projected on the screen" max-width="20%" max-height="20%" />
+ <figcaption>
+ Group photo of the SciPy Latin America sprint, Salta, Argentina, 2022. Sandra Meneses and Juan Martín Loyola are projected on the screen from a Zoom call. Photo credit: Lucía Torres.
+ </figcaption>
+</figure>
+
+<figure>
+ <img src="/assets/images/posts_images/sprint-salta-2022-2.jpeg" alt="11 people coding on their computers" max-width="20%" max-height="20%" />
+ <figcaption>
+ Participants of the SciPy Latin America sprint working on their computers. Photo credit: Ariel Silvio Norberto Ramos.
+ </figcaption>
+</figure>
+
+## Acknowledgment
+These people made this sprint possible:
+- Ariel Silvio Norberto Ramos, one of the organizers of SciPy Latin America,
+- [Data Umbrella](https://www.dataumbrella.org/), [one of the community partners of the event](https://twitter.com/ScipyLA/status/1573710649963724802), especially Sandra Meneses and Reshama Shaikh,
+- The mentors who helped run the sprint.
@@ -0,0 +1,77 @@
+---
+title: "scikit-learn and Hugging Face join forces"
+date: October 13, 2022
+
+categories:
+  - Updates
+  - Community
+tags:
+  - Open Source
+
+featured-image: HFxsklearn.png
+
+postauthors:
+  - name: Lysandre Debut
+    email: [email protected]
+    website: https://github.com/LysandreJik
+    image: "lysandre_debut.jpg"
+  - name: François Goupil
+    email: [email protected]
+    website: https://github.com/francoisgoupil
+    image: "francois_goupil.jpeg"
+---
+<div>
+  <img src="/assets/images/posts_images/{{ page.featured-image }}" alt="">
+  {% include postauthor.html %}
+</div>
+
+
+[Hugging Face](https://hf.co) is happy to announce that we're partnering with [scikit-learn](https://scikit-learn.org/stable/index.html) to further our support of the machine learning tools and ecosystem.
+
+At Hugging Face, we've been putting a lot of effort into supporting deep learning, but we believe that machine learning as a whole can benefit from the tools we release. With statistical machine learning being essential in this field and scikit-learn dominating statistical ML, we're excited to partner and move forward together.
+
+As of September 2022, the Hugging Face Hub already hosts nearly 4,000 tabular classification and tabular regression model checkpoints, and we strive for this trend to continue.
+
+<div>
+  <video   preload="auto" autoplay loop muted="muted" volume="0"> 
+  <source src="/assets/videos/HFxsklearn.mp4" type="video/mp4"> 
+  </video>
+</div>
+
+## Support to the scikit-learn consortium
+
+Since June 2022, Hugging Face has been an official sponsor of the scikit-learn consortium. Through this support, Hugging Face actively promotes the development and sustainability of scikit-learn. As a sponsor of the consortium, which is hosted at the Inria foundation, we now participate in its technical committee.
+
+## Development support 
+To help sustain the development of the library, we're happy to welcome Adrin Jalali and Benjamin Bossan to the Hugging Face team. Adrin is a core developer of scikit-learn as well as [fairlearn](https://fairlearn.org), while Benjamin is the author of the [skorch](https://github.com/skorch-dev/skorch) library and is now a contributor to scikit-learn.
+
+Hugging Face is happy to support the development of scikit-learn through code contributions, issues, pull requests, reviews, and discussions.
+
+## Integration to and from the Hugging Face Hub
+
+["Skops"](https://github.com/skops-dev/skops) is the name of the framework being actively developed as the link between the scikit-learn and the Hugging Face ecosystems. With Skops, we hope to facilitate essential workflows:
+
+- The ability to push scikit-learn models to the Hugging Face Hub
+- The possibility to try out models directly in the browser
+- The automatic creation of model cards, to improve model documentation and understanding
+- The ability to collaborate with others on machine learning projects
+
+### Snapshot of your work
+
+Working at the intersection of scikit-learn and the Hub offers challenges linked to the two platforms. One of these challenges is secure persistence: the ability to serialize models in a secure, safe manner.
+
+scikit-learn models (estimators, predictors, ...) are usually saved using pickle, which is notorious for not being a secure format. Sharing scikit-learn models in this format exposes recipients to potentially malicious data that could execute arbitrary code when loaded.
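
To make the pickle risk concrete, here is a minimal sketch using only the Python standard library. The `Payload` class is a hypothetical stand-in for a malicious file and is not part of scikit-learn or skops; it asks pickle to call `eval` on a harmless expression, where a real attack could run any code.

```python
import pickle


class Payload:
    """Hypothetical stand-in for a malicious object (illustration only)."""

    def __reduce__(self):
        # pickle stores "call eval('6 * 7')" instead of the object's state.
        # An attacker could put any callable and arguments here.
        return (eval, ("6 * 7",))


# Creating the bytes does not run the expression.
data = pickle.dumps(Payload())

# Loading the bytes runs it: the "restored object" is whatever the call returns.
result = pickle.loads(data)
print(type(result).__name__, result)
```

Here `result` is the integer `42` rather than a `Payload` instance, which shows that loading a pickle can execute code chosen by whoever produced the file.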
+
+That's where secure persistence comes in: as the Hugging Face Hub aims to provide a platform for models, the ability to share safe, secure objects is essential. We've been working on adding secure persistence for scikit-learn models in [skops#128](https://github.com/skops-dev/skops/pull/128) and [skops#145](https://github.com/skops-dev/skops/pull/145) ([doc preview](https://skops--145.org.readthedocs.build/en/145/persistence.html)). Instead of serializing using pickle, the object's contents are put into a zip file with an accompanying schema JSON file.
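
The zip-plus-schema idea can be sketched with the standard library alone. This is not the skops file format or API: the file names `schema.json` and `state.json`, the `TRUSTED_TYPES` allow-list, and the `dump_state` and `load_state` helpers are all assumptions made for illustration. The point is that the loader only parses data and rebuilds types it already trusts, so nothing in the file can trigger a call of the file author's choosing.

```python
import io
import json
import zipfile

# Hypothetical allow-list: only these type names are ever reconstructed.
TRUSTED_TYPES = {"dict": dict, "list": list}


def dump_state(state, fileobj):
    """Write plain-data state into a zip archive next to a JSON schema."""
    schema = {"type": type(state).__name__, "content": "state.json"}
    with zipfile.ZipFile(fileobj, "w") as archive:
        archive.writestr("schema.json", json.dumps(schema))
        archive.writestr("state.json", json.dumps(state))


def load_state(fileobj):
    """Rebuild the state, refusing any type that is not on the allow-list."""
    with zipfile.ZipFile(fileobj) as archive:
        schema = json.loads(archive.read("schema.json"))
        if schema["type"] not in TRUSTED_TYPES:
            raise ValueError(f"untrusted type: {schema['type']}")
        content = json.loads(archive.read(schema["content"]))
    return TRUSTED_TYPES[schema["type"]](content)


# Round trip through an in-memory buffer instead of a file on disk.
buffer = io.BytesIO()
dump_state({"coef": [0.5, -1.25], "intercept": 0.1}, buffer)
buffer.seek(0)
restored = load_state(buffer)
print(restored)
```

A real estimator holds more than JSON-friendly values (NumPy arrays, nested estimators), so an actual implementation needs a richer schema; the sketch only shows the separation between declared types and stored contents.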
+
+Read about the Skops library in the following blog post: [Introducing Skops](https://huggingface.co/blog/skops).
+
+## Improving interoperability
+
+Skops is an example of an integration of scikit-learn within our tools, but it is not the only example! We will strive to integrate with the rest of our ecosystem so that Hugging Face users may benefit from using scikit-learn tools and vice-versa.
+
+An example is the `evaluate` library, dedicated to efficiently evaluating machine learning models and datasets. We aim for this tool to natively support [scikit-learn metrics](https://github.com/huggingface/evaluate/issues/297) in its API.
+
+---
+
+Through these efforts, we hope to kickstart a lasting relationship between the two ecosystems and provide simple, efficient bridges to lower the barrier to entry. We believe that educating and sharing models is the best way to foster inclusive machine learning from which all can benefit. We're excited to partner with scikit-learn for this endeavor.
@@ -27,3 +27,8 @@ html {
     position: relative;
     font-size: 22px;
 }
+
+video {
+  width: 100% !important;
+  height: auto !important;
+}