Kern AI refinery (abbr. _refinery_) is like the data-centric sibling of your favorite programming environment. It provides an easy-to-use interface for weak supervision as well as extensive data management, neural search and monitoring to ensure that the quality of your training data is as good as possible.
21
+
Kern AI refinery (abbr. _refinery_) is like the data-centric sibling of your favorite programming environment. It provides an easy-to-use interface for weak supervision as well as extensive data management, neural search, and monitoring to ensure that the quality of your training data is as good as possible.
22
22
23
23
_refinery_ doesn't get rid of manual labeling, but it makes sure that your valuable time is spent well.
24
24
@@ -27,7 +27,6 @@ _refinery_ doesn't get rid of manual labeling, but it makes sure that your valua
27
27
_refinery_ consists of multiple microservices to enable a scalable and optimized workload balance, so this is the central repository used to orchestrate the system. It builds on top of [🤗 Hugging Face](https://www.huggingface.co) and [spaCy](https://spacy.io/) to leverage pre-built language models for your NLP tasks, as well as [qdrant](https://github.com/qdrant/qdrant) for neural search. Our microservices natively support GPU acceleration.
28
28
29
29
## Table of contents
30
-
- [Table of contents](#table-of-contents)
31
30
- [🧑‍💻 Why _refinery_? Built for developers with collaboration in mind](#-why-refinery-built-for-developers-with-collaboration-in-mind)
32
31
  - [Open-source and developer-oriented](#open-source-and-developer-oriented)
33
32
  - [For automation or quality control](#for-automation-or-quality-control)
@@ -55,6 +54,7 @@ _refinery_ consists of multiple microservices to enable a scalable and optimized
55
54
- [🏠 Architecture](#-architecture)
56
55
- [🏫 Glossary](#-glossary)
57
56
- [👩‍💻👨‍💻 Team and contributors](#-team-and-contributors)
57
+
- [🌟 Star History](#-star-history)
58
58
- [📃 License](#-license)
59
59
60
60
@@ -72,7 +72,7 @@ The labeling workflow in _refinery_ is designed to integrate heuristics like lab
72
72
While doing so, we aim to improve the collaboration between engineers and subject matter experts (SMEs). In the past, we've seen how our application was being used in meetings to discuss label patterns in the form of labeling functions and distant supervisors. We believe that data-centric AI is the best way to leverage collaboration.
73
73
74
74
### Integrations
75
-
Lastly, _refinery_ supports [SDK actions](https://github.com/code-kern-ai/refinery-python) like pulling and pushing data. Data-centric AI redefines labeling to be more than a one-time job by giving it an iterative workflow, so we aim to give you more power every day by providing end-to-end capabilities, growing the large-scale availability of high-quality training data. Use our SDK to program integrations with your existing landscapes. By the way, tell us which integrations you'd like to have [here](https://discuss.kern.ai/c/enhancements/6).
75
+
Lastly, _refinery_ supports [SDK actions](https://github.com/code-kern-ai/refinery-python) like pulling and pushing data. Data-centric AI redefines labeling to be more than a one-time job by giving it an iterative workflow, so we aim to give you more power every day by providing end-to-end capabilities, growing the large-scale availability of high-quality training data. Use our SDK to program integrations with your existing landscapes.
76
76
77
77
78
78
### Your benefits
@@ -97,7 +97,7 @@ If you like what we're working on, please leave a ⭐!
97
97
- Overview of project metrics like label distributions and confusion matrix
98
98
- Data accessible and extendable via our [Python SDK](https://github.com/code-kern-ai/refinery-python)
99
99
100
-
### Team workspaces in the [managed version](https://kern.ai/pricing)
100
+
### Team workspaces in the [managed version](https://www.kern.ai/pricing/options)
101
101
- Allow multiple users to label your data
102
102
- Automated calculation of inter-annotator agreements
103
103
@@ -107,7 +107,7 @@ If you like what we're working on, please leave a ⭐!
107
107
```
108
108
pip install kern-refinery
109
109
```
110
-
Once the library is installed, go to the directory where you want to store the data and run `refinery start`. To stop the server, run `refinery stop`.
110
+
Once the library is installed, go to the directory where you want to store the data and run `refinery start`. This will automatically `git clone` this repository first if you haven't done so yet. To stop the server, run `refinery stop`.
111
111
112
112
### From repository
113
113
@@ -146,14 +146,10 @@ If you need help writing your first labeling functions, look into our [template
146
146
You can find our changelog [here](https://docs.kern.ai/changelog).
147
147
148
148
## 😵‍💫 Need help?
149
-
No worries, we've got you:
150
-
- If you run into issues during installation or have other technical questions, please [open a ticket](https://discuss.kern.ai/c/technical-help/9) in the "technical help" category of our forum.
151
-
- If you're not sure how to apply heuristics or have general questions about the workflow of _refinery_, please [open a ticket](https://discuss.kern.ai/c/questions/5) in our "_refinery_ help" category of our forum.
149
+
No worries, we've got you. If you have questions, please [open a ticket](https://github.com/code-kern-ai/refinery/discussions/categories/q-a) in the "q&a" category of our GitHub Discussions.
152
150
153
151
## 🪢 Community and contact
154
-
Feel free to join our community spaces, where we'll discuss about recent findings in data-centric AI:
155
-
- [Discord](https://discord.gg/qf4rGCEphW)
156
-
- [Our forum](https://discuss.kern.ai/)
152
+
Feel free to join our [Discord](https://discord.gg/qf4rGCEphW), where we discuss recent findings in data-centric AI.
157
153
158
154
We send out a (mostly) weekly newsletter about recent findings in data-centric AI, product highlights in development and more. You can subscribe to the newsletter [here](https://www.kern.ai/resources/newsletter).
159
155
@@ -162,7 +158,7 @@ Also, you can follow us on [Twitter](https://twitter.com/MeetKern) and [LinkedIn
162
158
To reach out to us, please use our [contact form](https://www.kern.ai/contact).
163
159
164
160
## 🙌 Contributing
165
-
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are **greatly appreciated**. You can do so by providing feedback about [desired features](https://discuss.kern.ai/c/enhancements/6) and [bugs](https://github.com/code-kern-ai/refinery/issues) you might detect.
161
+
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are **greatly appreciated**. You can do so by providing feedback about [desired features and bugs](https://github.com/code-kern-ai/refinery/issues) you might detect.
166
162
167
163
If you actively want to participate in extending the code base, [reach out to us](https://www.kern.ai/contact). We'll explain to you how the architecture is set up, so you can customize the application as you desire.
168
164
@@ -213,7 +209,7 @@ You can find our short- to midterm feature plans in the [public roadmap](https:/
213
209
214
210
<details>
215
211
<summary><b>I want to install a library for my labeling function</b></summary>
216
-
For this, we need to change the requirements.txt of the <a href="https://github.com/code-kern-ai/refinery-lf-exec-env">lf-exec-env</a>, the containerized execution environment for your labeling functions. Please just <a href="https://discuss.kern.ai/c/enhancements/6">open a ticket</a> in our forum, and we'll integrate your library as soon as possible.
212
+
For this, we need to change the requirements.txt of the <a href="https://github.com/code-kern-ai/refinery-lf-exec-env">lf-exec-env</a>, the containerized execution environment for your labeling functions. Please just <a href="https://github.com/code-kern-ai/refinery/issues">open an issue</a>, and we'll integrate your library as soon as possible.
217
213
</details>
218
214
219
215
<details>
@@ -478,6 +474,8 @@ Missing anything in the glossary? [Add the term](https://github.com/code-kern-ai
478
474
</tr>
479
475
</table>
480
476
477
+
## 🌟 Star History
478
+
[Star History Chart](https://star-history.com/#code-kern-ai/refinery&Date)
481
479
482
480
## 📃 License
483
481
_refinery_ is licensed under the Apache License, Version 2.0. View a copy of the [License file](LICENSE).