
Commit 71237fa

Deepgram Backend
1 parent fb0fae1 commit 71237fa

11 files changed, +568 -0 lines changed

Lines changed: 48 additions & 0 deletions
@@ -0,0 +1,48 @@

# syntax=docker/dockerfile:1
ARG PYTHON_VERSION=3.13

FROM python:${PYTHON_VERSION}-slim AS python-base
ARG TEST_ENV

WORKDIR /app

ENV PYTHONUNBUFFERED=1 \
    PYTHONDONTWRITEBYTECODE=1 \
    PORT=${PORT:-9090} \
    PIP_CACHE_DIR=/.cache \
    WORKERS=1 \
    THREADS=8

# Update the base OS
RUN --mount=type=cache,target="/var/cache/apt",sharing=locked \
    --mount=type=cache,target="/var/lib/apt/lists",sharing=locked \
    set -eux; \
    apt-get update; \
    apt-get upgrade -y; \
    apt-get install --no-install-recommends -y \
        git; \
    apt-get autoremove -y

# install base requirements
COPY requirements-base.txt .
RUN --mount=type=cache,target=${PIP_CACHE_DIR},sharing=locked \
    pip install -r requirements-base.txt

# install custom requirements
COPY requirements.txt .
RUN --mount=type=cache,target=${PIP_CACHE_DIR},sharing=locked \
    pip install -r requirements.txt

# install test requirements if needed
COPY requirements-test.txt .
# build only when TEST_ENV="true"
RUN --mount=type=cache,target=${PIP_CACHE_DIR},sharing=locked \
    if [ "$TEST_ENV" = "true" ]; then \
        pip install -r requirements-test.txt; \
    fi

COPY . .

EXPOSE 9090
CMD gunicorn --preload --bind :$PORT --workers $WORKERS --threads $THREADS --timeout 0 _wsgi:app
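
For reference, a minimal build-and-run sketch for the Dockerfile above; the image tag `deepgram-backend` and the use of the current directory as the build context are illustrative assumptions, not names taken from this commit:

```bash
# Build the image; TEST_ENV="true" additionally installs requirements-test.txt
docker build --build-arg TEST_ENV=true -t deepgram-backend .

# Run it; PORT, WORKERS and THREADS feed the gunicorn CMD and can be overridden at run time
docker run --rm -p 9090:9090 -e PORT=9090 -e WORKERS=2 -e THREADS=8 deepgram-backend
```

Note that `EXPOSE 9090` is documentation only, so whichever port you set in `PORT` still has to be published with `-p`.
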
Lines changed: 195 additions & 0 deletions
@@ -0,0 +1,195 @@

<!--
---
title: SAM2 with Images
type: guide
tier: all
order: 15
hide_menu: true
hide_frontmatter_title: true
meta_title: Using SAM2 with Label Studio for Image Annotation
categories:
- Computer Vision
- Image Annotation
- Object Detection
- Segment Anything Model
image: "/tutorials/sam2-images.png"
---
-->

# Using SAM2 with Label Studio for Image Annotation

Segment Anything 2, or SAM 2, is a model released by Meta in July 2024. An update to the original Segment Anything Model, SAM 2 provides even better object segmentation for both images and video. In this guide, we'll show you how to use SAM 2 for better image labeling with Label Studio.

Click on the image below to watch our ML Evangelist Micaela Kaplan explain how to link SAM 2 to your Label Studio project. You'll need to follow the instructions below to stand up an instance of SAM 2 before you can link your model!

[![Connecting SAM2 Model to Label Studio for Image Annotation](https://img.youtube.com/vi/FTg8P8z4RgY/0.jpg)](https://www.youtube.com/watch?v=FTg8P8z4RgY)

## Before you begin

Before you begin, you must install the [Label Studio ML backend](https://github.com/HumanSignal/label-studio-ml-backend?tab=readme-ov-file#quickstart).

This tutorial uses the [`segment_anything_2_image` example](https://github.com/HumanSignal/label-studio-ml-backend/tree/master/label_studio_ml/examples/segment_anything_2_image).

Note that as of 8/1/2024, SAM2 only runs on GPU.

## Labeling configuration

The current implementation of the Label Studio SAM2 ML backend works using Interactive mode. The user-guided inputs are:
- `KeyPointLabels`
- `RectangleLabels`

SAM2 then outputs `BrushLabels` as a result.

This means all three control tags should be represented in your labeling configuration:

```xml
<View>
  <Style>
    .main {
      font-family: Arial, sans-serif;
      background-color: #f5f5f5;
      margin: 0;
      padding: 20px;
    }
    .container {
      display: flex;
      justify-content: space-between;
      margin-bottom: 20px;
    }
    .column {
      flex: 1;
      padding: 10px;
      background-color: #fff;
      border-radius: 5px;
      box-shadow: 0 2px 5px rgba(0, 0, 0, 0.1);
      text-align: center;
    }
    .column .title {
      margin: 0;
      color: #333;
    }
    .column .label {
      margin-top: 10px;
      padding: 10px;
      background-color: #f9f9f9;
      border-radius: 3px;
    }
    .image-container {
      width: 100%;
      height: 300px;
      background-color: #ddd;
      border-radius: 5px;
    }
  </Style>
  <View className="main">
    <View className="container">
      <View className="column">
        <View className="title">Choose Label</View>
        <View className="label">
          <BrushLabels name="tag" toName="image">
            <Label value="defect" background="#FFA39E"/>
          </BrushLabels>
        </View>
      </View>
      <View className="column">
        <View className="title">Use Keypoint</View>
        <View className="label">
          <KeyPointLabels name="tag2" toName="image" smart="true">
            <Label value="defect" background="#250dd3"/>
          </KeyPointLabels>
        </View>
      </View>
      <View className="column">
        <View className="title">Use Rectangle</View>
        <View className="label">
          <RectangleLabels name="tag3" toName="image" smart="true">
            <Label value="defect" background="#FFC069"/>
          </RectangleLabels>
        </View>
      </View>
    </View>
    <View className="image-container">
      <Image name="image" value="$image" zoom="true" zoomControl="true"/>
    </View>
  </View>
</View>
```

## Running from source

1. To run the ML backend without Docker, you have to clone the repository and install all dependencies using pip:

   ```bash
   git clone https://github.com/HumanSignal/label-studio-ml-backend.git
   cd label-studio-ml-backend
   pip install -e .
   cd label_studio_ml/examples/segment_anything_2_image
   pip install -r requirements.txt
   ```

2. Download the [`segment-anything-2` repo](https://github.com/facebookresearch/sam2) into the root directory. Install the SegmentAnything model and download the checkpoints using [the official Meta documentation](https://github.com/facebookresearch/sam2?tab=readme-ov-file#installation) (a condensed sketch follows this list).
   You should now have the following folder structure:

   | root directory
     | label-studio-ml-backend
       | label_studio_ml
         | examples
           | segment_anything_2_image
     | sam2
       | sam2
       | checkpoints

3. Then you can start the ML backend on the default port `9090`:

   ```bash
   cd ~/sam2
   label-studio-ml start ../label-studio-ml-backend/label_studio_ml/examples/segment_anything_2_image
   ```

   Due to [breaking changes from Meta](https://github.com/facebookresearch/sam2/blob/c2ec8e14a185632b0a5d8b161928ceb50197eddc/sam2/build_sam.py#L20), it is crucial that you run this command from the `sam2` directory in your root directory.

4. Connect the running ML backend server to Label Studio: go to your project `Settings -> Machine Learning -> Add Model` and specify `http://localhost:9090` as a URL. Read more in the official [Label Studio documentation](https://labelstud.io/guide/ml#Connect-the-model-to-Label-Studio).
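
A condensed sketch of step 2 above; the commands mirror the linked Meta documentation as of this writing, so verify them against the official SAM2 README before relying on them:

```bash
# Run from your root directory (the parent of label-studio-ml-backend)
git clone https://github.com/facebookresearch/sam2.git sam2
cd sam2
pip install -e .                        # install the SAM2 package
cd checkpoints && ./download_ckpts.sh   # download the model checkpoints
cd ..
```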

## Running with Docker

1. Start the Machine Learning backend on `http://localhost:9090` with the prebuilt image:

   ```bash
   docker-compose up
   ```

2. Validate that the backend is running:

   ```bash
   $ curl http://localhost:9090/
   {"status":"UP"}
   ```

3. Connect to the backend from Label Studio running on the same host: go to your project `Settings -> Machine Learning -> Add Model` and specify `http://localhost:9090` as a URL.

## Configuration

Parameters can be set in `docker-compose.yml` before running the container.

The following common parameters are available (see the run-time sketch after this list):
- `DEVICE` - specify the device for the model server (currently only `cuda` is supported; `cpu` is coming soon)
- `MODEL_CONFIG` - the SAM2 model configuration file (`sam2_hiera_l.yaml` by default)
- `MODEL_CHECKPOINT` - the SAM2 model checkpoint file (`sam2_hiera_large.pt` by default)
- `BASIC_AUTH_USER` - specify the basic auth user for the model server
- `BASIC_AUTH_PASS` - specify the basic auth password for the model server
- `LOG_LEVEL` - set the log level for the model server
- `WORKERS` - specify the number of workers for the model server
- `THREADS` - specify the number of threads for the model server
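
If you prefer not to edit `docker-compose.yml`, the variables listed above can also be passed directly to the container; a minimal run-time sketch, assuming you have already built or pulled the backend image (the `sam2-backend:latest` tag is an illustrative placeholder):

```bash
# GPU access is required, since only DEVICE=cuda is currently supported
docker run --rm --gpus all -p 9090:9090 \
    -e LOG_LEVEL=DEBUG -e WORKERS=2 -e THREADS=8 \
    -e BASIC_AUTH_USER=admin -e BASIC_AUTH_PASS=change-me \
    sam2-backend:latest
```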

## Customization

The ML backend can be customized by adding your own models and logic inside the `./segment_anything_2` directory.
Lines changed: 122 additions & 0 deletions
@@ -0,0 +1,122 @@

import os
import argparse
import json
import logging
import logging.config

logging.config.dictConfig({
    "version": 1,
    "disable_existing_loggers": False,
    "formatters": {
        "standard": {
            "format": "[%(asctime)s] [%(levelname)s] [%(name)s::%(funcName)s::%(lineno)d] %(message)s"
        }
    },
    "handlers": {
        "console": {
            "class": "logging.StreamHandler",
            "level": os.getenv('LOG_LEVEL'),
            "stream": "ext://sys.stdout",
            "formatter": "standard"
        }
    },
    "root": {
        "level": os.getenv('LOG_LEVEL'),
        "handlers": [
            "console"
        ],
        "propagate": True
    }
})

from label_studio_ml.api import init_app
from model import DeepgramModel


_DEFAULT_CONFIG_PATH = os.path.join(os.path.dirname(__file__), 'config.json')


def get_kwargs_from_config(config_path=_DEFAULT_CONFIG_PATH):
    if not os.path.exists(config_path):
        return dict()
    with open(config_path) as f:
        config = json.load(f)
    assert isinstance(config, dict)
    return config


if __name__ == "__main__":
    parser = argparse.ArgumentParser(description='Label studio')
    parser.add_argument(
        '-p', '--port', dest='port', type=int, default=9090,
        help='Server port')
    parser.add_argument(
        '--host', dest='host', type=str, default='0.0.0.0',
        help='Server host')
    parser.add_argument(
        '--kwargs', '--with', dest='kwargs', metavar='KEY=VAL', nargs='+', type=lambda kv: kv.split('='),
        help='Additional LabelStudioMLBase model initialization kwargs')
    parser.add_argument(
        '-d', '--debug', dest='debug', action='store_true',
        help='Switch debug mode')
    parser.add_argument(
        '--log-level', dest='log_level', choices=['DEBUG', 'INFO', 'WARNING', 'ERROR'], default=None,
        help='Logging level')
    parser.add_argument(
        '--model-dir', dest='model_dir', default=os.path.dirname(__file__),
        help='Directory where models are stored (relative to the project directory)')
    parser.add_argument(
        '--check', dest='check', action='store_true',
        help='Validate model instance before launching server')
    parser.add_argument('--basic-auth-user',
                        default=os.environ.get('ML_SERVER_BASIC_AUTH_USER', None),
                        help='Basic auth user')

    parser.add_argument('--basic-auth-pass',
                        default=os.environ.get('ML_SERVER_BASIC_AUTH_PASS', None),
                        help='Basic auth pass')

    args = parser.parse_args()

    # setup logging level
    if args.log_level:
        logging.root.setLevel(args.log_level)

    def isfloat(value):
        try:
            float(value)
            return True
        except ValueError:
            return False

    def parse_kwargs():
        param = dict()
        for k, v in args.kwargs:
            if v.isdigit():
                param[k] = int(v)
            elif v == 'True' or v == 'true':
                param[k] = True
            elif v == 'False' or v == 'false':
                param[k] = False
            elif isfloat(v):
                param[k] = float(v)
            else:
                param[k] = v
        return param

    kwargs = get_kwargs_from_config()

    if args.kwargs:
        kwargs.update(parse_kwargs())

    if args.check:
        print('Check "' + DeepgramModel.__name__ + '" instance creation..')
        model = DeepgramModel(**kwargs)

    app = init_app(model_class=DeepgramModel, basic_auth_user=args.basic_auth_user, basic_auth_pass=args.basic_auth_pass)

    app.run(host=args.host, port=args.port, debug=args.debug)

else:
    # for uWSGI use
    app = init_app(model_class=DeepgramModel)
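
For orientation, a usage sketch of this launcher. The `config.json` key shown (`model_version`) and the extra kwargs are hypothetical placeholders; the parameters `DeepgramModel` actually accepts are defined in `model.py`, which is not shown in this excerpt:

```bash
# Optional: model init kwargs can be read from a config.json next to _wsgi.py
# (the key below is hypothetical -- use whatever DeepgramModel accepts)
cat > config.json <<'EOF'
{"model_version": "v1"}
EOF

# LOG_LEVEL feeds the logging dictConfig above; --kwargs entries override config.json
LOG_LEVEL=DEBUG python _wsgi.py --port 9090 --log-level DEBUG \
    --kwargs model_version=v2 use_cache=true
```

In the container, gunicorn instead imports `_wsgi:app`, which takes the `else` branch and builds the app without any of the CLI arguments.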
