🚨 Paid mentoring program at scikit-learn 🚨 We're excited to announce our next paid mentoring program for scikit-learn, which will be about implementing and improving callbacks in the project. If you're excited about working with a distributed team on a well-established project, we'd love to hear from you. This is a $3000/month program, as a hybrid position from Paris (preferred), or Berlin. The duration of the mentoring is ideally one year. You can find the description of the position here: https://lnkd.in/eFq7trKs Whether you're a student, a career switcher, or an experienced dev curious about contributing to open source if you're excited about working on a mature, community-driven project with real impact, we want to hear from you.
About us
scikit-learn is an Open Source library for machine learning in Python.
- Website
-
https://scikit-learn.org
External link for scikit-learn
- Industry
- Software Development
- Company size
- 2-10 employees
- Type
- Nonprofit
Employees at scikit-learn
Updates
-
Happy to see our community of certified engineers growing in #srilanka 🇱🇰
Happy to share that I have officially earned the scikit-learn Associate Practitioner Certification from :probabl.! 🎉 I first got to know about these certifications offered by probabl during a recent meeting between Boffin Institute of Data Science and the fantastic team at :probabl. A big thank you to Stephen Bauer and Penelope GITTOS from :probabl for facilitating this process. What impressed me most about the exam was its focus on testing core machine learning understanding. For anyone looking to deepen their knowledge and prepare for the certification, I highly recommend checking out the comprehensive MOOC developed by Inria: https://lnkd.in/gKJc-TNg This certification marks an exciting step in my data science journey, and I am eager to apply these skills in my current and future projects and teaching. https://lnkd.in/g3fa4cUU
-
📣 PyData Paris 2024 talk 🟠 An Update on the Latest scikit-learn Features 🎤Speakers: scikit-learn maintainers Stefanie Senger, PhD & Guillaume Lemaitre ⌛️30-minutes 🔵 In this talk, we provide an update on the latest `scikit-learn` features that have been implemented in versions 1.4 and 1.5. We will particularly discuss the following features: 🔸 the metadata routing API allowing to pass metadata around estimators; 🔸the `TunedThresholdClassifierCV` allowing to tuned operational decision through custom metric; 🔸better support for categorical features and missing values; 🔸interoperability of array and dataframe. #python #machinelearning #datascience #opensource PyData NumFOCUS https://lnkd.in/ecC-2YYQ
Lemaitre & Senger - An update on the latest scikit-learn features | PyData Paris 2024
https://www.youtube.com/
-
📣 PyData Global 2023 talk 🟠 Get the Best from Your scikit-learn Classifier 🎤Speaker: scikit-learn maintainer Guillaume Lemaitre ⌛️30-minutes 🔵 When operating a classifier in a production setting (i.e. predictive phase), practitioners are interested in potentially two different outputs: a "hard" decision used to leverage a business decision or/and a "soft" decision to get a confidence score linked to each potential decision (e.g. usually related to class probabilities). Scikit-learn does not provide any flexibility to go from "soft" to "hard" predictions: it uses a cut-off point at a confidence score of 0.5 (or 0 when using decision_function) to get class labels. However, optimizing a classifier to get a confidence score close to the true probabilities (i.e. a calibrated classifier) does not guarantee to obtain accurate "hard" predictions using this heuristic. Reversely, training a classifier for an optimum "hard" prediction accuracy (with the cut-off constraint at 0.5) does not guarantee obtaining a calibrated classifier. In this talk, we will present a new scikit-learn meta-estimator allowing us to get the best of the two worlds: a calibrated classifier providing optimum "hard" predictions. This meta-estimator will land in a future version of scikit-learn. We will provide some insights regarding the way to obtain accurate probabilities and predictions and also illustrate how to use in practice this model on different use cases: cost-sensitive problems and imbalanced classification problems. #python #machinelearning #datascience #opensource PyData NumFOCUS https://lnkd.in/ewHFfDNy
Guillaume Lemaitre - Get the best from your scikit-learn classifier | PyData Global 2023
https://www.youtube.com/
-
📣 PyData Global 2023 Tutorial 🎤Speaker: scikit-learn maintainer Olivier Grisel 🟠 Predictive survival analysis with scikit-learn, scikit-survival and lifelines 🔵This 90-minute tutorial will introduce how to train machine learning models for time-to-event prediction tasks (health care, predictive maintenance, marketing, insurance...) without introducing a bias from censored training (and evaluation) data. #python #machinelearning #datascience #opensource https://lnkd.in/dbh4FvTK
Olivier Grisel - Predictive survival analysis with scikit-learn, scikit-survival and lifelines
https://www.youtube.com/
-
🔵🟠Pushing Cython to its limits in scikit-learn Check out the PyData NYC 2024 presentation by maintainer Thomas J. Fan scikit-learn is a machine-learning library for Python that uses NumPy and SciPy for numerical operations. Scikit-learn has its own compiled code for performance-critical computation written in C, C++, and Cython. The library primarily focuses on Cython for compiled code because it is easy to use and approachable. In this talk, we dive into many techniques scikit-learn employs to utilize Cython fully. We will cover features like using the C++ standard library within Cython, fused types, code generation with the Tempita engine, and OpenMP for parallelization. #opensource #python #machinelearning #datascience https://lnkd.in/e27AhKUz
Thomas J. Fan - Pushing Cython to its Limits in Scikit-learn | PyData NYC 2024
https://www.youtube.com/
-
scikit-learn reposted this
📣 Calling all professional scikit-learn users! We’re developing professional support services and need your input. Imagine a central helpdesk to manage and discuss your challenges (confidentially!), direct input from core contributors, rapid technical guidance and best practices, early access to new features, and long-term support options—all in one place. Share your thoughts in our quick survey to help shape these services for our entire community. 📊 Complete the survey here 👉 https://lnkd.in/gx6_Macc 🎁 Plus, one lucky respondent will score our :probabl. #backpack, jam-packed with goodies—stickers, a sweatshirt, water bottle, candy, and more! 🍀 How to increase your chances for the giveaway: - Like this post and tag someone who might also benefit from these services. - Each tag = an extra entry, so spread the word! #DataScience #MachineLearning #ScikitLearn #OpenSource
-
-
🦋scikit-learn is now on Bluesky! …. and our account is verified 🛎️ Follow us here for updates on the #️⃣1️⃣, python, open source, machine learning library. https://lnkd.in/eR_j5CZT
-
scikit-learn is not only a library, it is an entire living data science ecosystem! There is a newbie in town — say hello to skore! From hyperparameter optimization to preprocessing tools and more, there are plenty of projects that work seamlessly alongside scikit-learn. Curious? Check out our Related Projects page (https://lnkd.in/evdj6rNR) and discover how to enhance your workflows! What is your favorite scikit-learn compatible project?
🚀 :probabl. is thrilled to launch skore (https://lnkd.in/eAdV6K3Y) the scikit-learn sidekick that makes recommended practices actionable. 🛠️ What is skore? skore is a Python open-source library designed to help data scientists during model development. While scikit-learn sets the stage for scientific and engineering excellence, skore helps you find your path through the maze of experimentation. ✨ Key Features: ✅ Diagnose – Catch methodological errors before they impact your models with smart alerts that analyze both code execution and data patterns in real-time. ✅ Evaluate – Uncover actionable insights through automated reports surfacing relevant metrics. Explore faster with our intelligent caching system. 🔍 Why skore? Scikit-learn offers powerful machine learning tools, but achieving great results takes more than great tools. Skore closes this gap, providing the guidance you need to achieve excellence with scikit-learn. ⚡️ Try skore now and join our growing community! We’re excited to shape the future of Data Science and pre-MLOps—and this is just the beginning! Be among the first to explore skore’s upcoming commercial features.👀 👉 GitHub: https://lnkd.in/eAdV6K3Y 📘 Docs: https://skore.probabl.ai 📢 Discord: https://discord.probabl.ai 🎥 Quick demo: https://lnkd.in/e7uw5sht 🔑 Sign up for beta access to our upcoming commercial offering: https://lnkd.in/e5guQt98 #MachineLearning #DataScience #OpenSource #AI #preMLOps #skore #scikitlearn
-
❄️ The Christmas release is out ❄️ 🎥 Release video below Discover scikit-learn 1.6 and its: 🟢 2 major features & 34 features 🔵 5 efficiency improvements & 21 enhancements 🟡 14 API changes 🔴 30 fixes 👥 160 contributors (thank you all!) More details in the changelog: https://lnkd.in/e5pui3ev You can upgrade with pip as usual: pip install -U scikit-learn Using conda-forge builds: conda install -c conda-forge scikit-learn Thanks to 👋 Vincent D. Warmerdam for this release highlights video. https://lnkd.in/eRM_5rte #scikitlearn #Python #release #sklearn #software #ML #machinelearning #datavisualization #dataanalytics #data #dataanalysis #deeplearning #opensource #opensourcesoftware #opensourcecommunity
scikit-learn Version 1.6.0 Release Highlights
https://www.youtube.com/